OpenSearch

Commit Graph

Author	SHA1	Message	Date
Mayya Sharipova	ec32e66088	Deprecate reference to _type in lookup queries (#37016 ) Relates to #35190	2019-01-08 18:46:41 -08:00
Josh Soref	edb48321ba	[DOCS] Various spelling corrections (#37046 )	2019-01-07 14:44:12 +01:00
Jim Ferenczi	667c06dc83	Add link to script score query in the top level docs (#36416 ) * add link to script score query in the top level docs * Correct references in script-score-query.asciidoc	2018-12-19 10:18:53 -05:00
Alan Woodward	344917efab	Add script filter to intervals (#36776 ) This commit adds the ability to filter out intervals based on their start and end position, and internal gaps: ``` POST _search { "query": { "intervals" : { "my_text" : { "match" : { "query" : "hot porridge", "filter" : { "script" : { "source" : "interval.start > 10 && interval.end < 20 && interval.gaps == 0" } } } } } } } ```	2018-12-19 11:12:18 +00:00
Nick Knize	ec0dc2c0e9	[Geo] Integrate Lucene's LatLonShape (BKD Backed GeoShapes) as default `geo_shape` indexing approach (#36751) * [Geo] Expose BKDBackedGeoShapes as new VECTOR strategy This commit exposes lucene's LatLonShape field as a new strategy in GeoShapeFieldMapper. To use the new indexing approach, strategy should be set to "vector" in the geo_shape field mapper. If the tree parameter is set the mapper will throw an IAE. Note the following: When using vector strategy: * geo_shape query does not support querying by POINT, MULTIPOINT, or GEOMETRYCOLLECTION. * LINESTRING and MULTILINESTRING queries do not support WITHIN relation. * CONTAINS relation is not supported. * The tree, precision, tree_levels, distance_error_pct, and points_only parameters will not throw an exception but they have no effect and will be marked as deprecated. All other features are supported. * revert change to PercolatorFieldMapper * fix ExistsQuery for geo_shape vector strategy * add deprecation logging for tree, precision, tree_levels, distance_error_pct, and points_only * initial update to geoshape docs, including mapping migration updates * initial support for GeoCollection queries * fix docs and javadoc errors * clean up geocollection queries * set deprecated mapping tests to NOTCONSOLE * fix geo-shape mapper asciidoc mapping and test warnings * add support for point queries using LatLonShapeBoundingBoxQuery * update GeoShapeQueryBuilderTests to include POINT queries for VECTOR strategy. Other comment cleanups * add lucene geometry build testing to ShapeBuilder tests * remove deprecated prefix tree mapping from geo-shape.asciidoc * refactor GeoShapeFieldMapper into LegacyGeoShapeFieldMapper and GeoShapeFieldMapper Both classes derive from BaseGeoShapeFieldMapper that provides shared parameters: coerce, ignoreMalformed, ignore_z_value, orientation.
* update docs to remove vector strategy * fix GeometryCollectionBuilder#buildLucene to return the object created by the shape builder * fix LineLength failure in GeoJsonShapeParserTests * ShapeMapper refactor changes from PR feedback * fix typo in geo-shape.asciidoc * ignore circle test in docs * update indexing-approach ref to geoshape-indexing-approach * add warnings check for LegacyGeoShapeFieldMapper to AbstractBuilderTestCase * fix deprecatedParameters setup * update indexing approach * fixing unexpected warnings failures * move orientation back to field type * remove if in LegacyGeoShapeFieldMapper#doXContent. Fix GeoShapeFieldMapper to work with double array as a point * fix indexing-approach link in circle section of geoshape docs * add strategy to deprecation warnings check * fix test failures * fix typo in QueryStringQueryBuilderTests * fix total hits to totalHits().value * fix version number * add version check to BaseGeoShapeFieldMapper * fix line length! * revert version check in BaseGeoShapeFieldMapper * Fix serialization of mappings of legacy shapes.	2018-12-18 09:54:56 -06:00
Nicholas Knize	96d279ed83	Revert "[Geo] Integrate Lucene's LatLonShape (BKD Backed GeoShapes) as default `geo_shape` indexing approach (#35320 )" This reverts commit `5bc7822562`.	2018-12-17 20:09:46 -06:00
Nick Knize	5bc7822562	[Geo] Integrate Lucene's LatLonShape (BKD Backed GeoShapes) as default `geo_shape` indexing approach (#35320 ) This commit exposes lucene's LatLonShape field as the default type in GeoShapeFieldMapper. To use the new indexing approach, simply set "type" : "geo_shape" in the mappings without setting any of the strategy, precision, tree_levels, or distance_error_pct parameters. Note the following when using the new indexing approach: * geo_shape query does not support querying by MULTIPOINT. * LINESTRING and MULTILINESTRING queries do not yet support WITHIN relation. * CONTAINS relation is not yet supported. The tree, precision, tree_levels, distance_error_pct, and points_only parameters are deprecated.	2018-12-17 14:38:14 -06:00
Alan Woodward	09bf93dc2a	Add intervals query (#36135) * Add IntervalQueryBuilder with support for match and combine intervals * Add relative intervals * feedback * YAML test - broken * yaml test; begin to add block source * Add block; make disjunction its own source * WIP * Extract IntervalBuilder and add tests for it * Fix eq/hashcode in Disjunction * New yaml test * checkstyle * license headers * test fix * YAML format * YAML formatting again * yaml tests; javadoc * Add OR test -> requires fix from LUCENE-8586 * Add docs * Re-do API * Clint's API * Delete bash script * doc fixes * imports * docs * test fix * feedback * comma * docs fixes * Tidy up doc references to old rule	2018-12-14 15:14:00 +00:00
Christoph Büscher	a42502df8b	[Docs] Add description of simple query string flags (#36211 ) Closes #34944	2018-12-10 01:00:42 +01:00
Jim Ferenczi	18866c4c0b	Make hits.total an object in the search response (#35849) This commit changes the format of the `hits.total` in the search response to be an object with a `value` and a `relation`. The `value` indicates the number of hits that match the query and the `relation` indicates whether the number is accurate (in which case the relation is equal to `eq`) or a lower bound of the total (in which case it is equal to `gte`). This change also adds a parameter called `rest_total_hits_as_int` that can be used in the search APIs to opt out from this change (retrieve the total hits as a number in the rest response). Note that currently all search responses are accurate (`track_total_hits: true`) or they don't contain `hits.total` (`track_total_hits: false`). We'll add a way to get a lower bound of the total hits in a follow up (to allow numbers to be passed to `track_total_hits`). Relates #33028	2018-12-05 19:49:06 +01:00
Alan Woodward	73ceaad03a	Update to lucene-8.0.0-snapshot-c78429a554 (#36212 ) Includes: * A fix for a bug in Intervals.or() (https://issues.apache.org/jira/browse/LUCENE-8586) * The ability to disable offset mangling in WordDelimiterGraphFilter (https://issues.apache.org/jira/browse/LUCENE-8509) * BM25Similarity no longer multiplies scores by k1 + 1	2018-12-05 12:43:56 +00:00
Jim Ferenczi	74aca756b8	Remove the distinction between query and filter context in QueryBuilders (#35354) When building a query Lucene distinguishes two cases: queries that need to produce a score and queries that only need to match. We cloned this mechanism in the QueryBuilders in order to be able to produce different queries based on whether they need to produce a score or not. However the only case in ES that requires this distinction is the BoolQueryBuilder that sets a different minimum_should_match when a `bool` query is built in a filter context. This behavior doesn't seem right because it makes the matching of `should` clauses different when the score is not required. Closes #35293	2018-12-03 11:49:11 +01:00
Christophe Bismuth	acdf9666d5	Add `minimum_should_match` section to the query_string docs Closes #34142	2018-11-30 16:10:13 +01:00
Christophe Bismuth	b95a4db6e6	Throw a parsing exception when boost is set in span_or query (#28390 ) (#34112 )	2018-11-26 12:15:59 -05:00
Mayya Sharipova	b6014d971c	Forbid negative scores in function_score query (#35709) * Forbid negative scores in function_score query - Throw an exception when scores are negative in field_value_factor function - Throw an exception when scores are negative in script_score function Relates to #33309	2018-11-22 06:08:48 -05:00
Alan Woodward	be8097f9ce	Improve docs for index_prefixes option (#35778 ) This commit moves the documentation and examples for the `index_prefixes` option on text fields to its own file, to bring it in line with other mapping parameters, and expands a bit on both.	2018-11-22 09:20:46 +00:00
Mayya Sharipova	643bb20137	Add a new query type - ScriptScoreQuery (#34533) * Add a new query type - ScriptScoreQuery script_score query uses script to calculate document scores. Added as a substitute for function_score with an intention to deprecate function_score query. ```http GET /_search { "query": { "script_score" : { "query": { "match": { "message": "elasticsearch" } }, "script" : { "source": "Math.log(2 + doc['likes'].value)" }, "min_score" : 2 } } } ``` Add several functions to painless to be used inside script_score: double rational(double, double) double sigmoid(double, double, double) double randomNotReproducible() double randomReproducible(String, int) double decayGeoLinear(String, String, String, double, GeoPoint) double decayGeoExp(String, String, String, double, GeoPoint) double decayGeoGauss(String, String, String, double, GeoPoint) double decayNumericLinear(String, String, String, double, double) double decayNumericExp(String, String, String, double, double) double decayNumericGauss(String, String, String, double, double) double decayDateLinear(String, String, String, double, JodaCompatibleZonedDateTime) double decayDateExp(String, String, String, double, JodaCompatibleZonedDateTime) double decayDateGauss(String, String, String, double, JodaCompatibleZonedDateTime) Date functions only work on dates in the default format and default time zone	2018-11-20 16:10:06 -05:00
Itamar Syn-Hershko	156b3cae15	[Docs] Fix filter context in script-query.asciidoc (#35677 ) The docs say script queries are typically used in a filter context but the example uses a boolean must clause.	2018-11-19 16:30:33 +01:00
Peter Dyson	6d8af9731d	[Docs] Warn about searching across all fields wt. `query_string` (#35570 ) Warn about potential performance impact when a large number of fields is used with query string query and no default field.	2018-11-19 13:21:59 +01:00
Jun Ohtani	0cdfc4cd0a	[Doc] Add clarification to boolean query (#32575 ) It isn't very clear how boosting query works. Add explanation of positive/negative query.	2018-11-16 11:45:32 +09:00
Christoph Büscher	113af7996c	Make limit on number of expanded fields configurable (#35284 ) Currently we introduced a hard limit of 1024 to the number of fields a query can be expanded to in #26541. Instead of using a hard limit, we should make this configurable. This change removes the hard limit check and uses the existing `max_clause_count` setting instead. Closes #34778	2018-11-08 17:04:40 +01:00
Gytis Šk	3ee37b425b	Docs: Add section about range query for range type (#35222 ) This makes their interaction more discoverable.	2018-11-06 10:49:12 -05:00
Jeff Soloshy	14c8a483d5	[Docs] Minor formatting and wording fixes (#35278 )	2018-11-06 07:52:13 +01:00
Julie Tibshirani	fda173d7aa	Add a note around using separate indices for percolator queries and documents. (#35109 )	2018-11-01 12:41:07 -07:00
Nik Everett	2b2e208c4c	Docs: Remove range notation from random score docs (#35093) The `random_score` function produces values between 0 (inclusive) and 1 (exclusive) and documented it with fancy mathematical range notation. It is so fancy I thought it was a typo. This changes the documentation to use words. Relates to #35084	2018-10-30 14:12:59 -04:00
Julie Tibshirani	f854330e06	Make sure to use the type _doc in the REST documentation. (#34662 ) * Replace custom type names with _doc in REST examples. * Avoid using two mapping types in the percolator docs. * Rename doc -> _doc in the main repository README. * Also replace some custom type names in the HLRC docs.	2018-10-22 11:54:04 -07:00
Nikolay Vasiliev	f5641e61a2	Docs: improve formatting of Query String Query doc page (#34432 ) Merge two tables.	2018-10-15 15:30:48 -04:00
Jim Ferenczi	6e28c8f1c4	[DOCS] Remove experimental label from term_set query (#34328 )	2018-10-05 19:45:23 +02:00
amoreauCoveo	e95dc5474f	Minor corrections in geo-queries.asciidoc (#34314 )	2018-10-05 17:12:18 +02:00
Julie Tibshirani	704d3e4c24	Add a deprecation warning in the type query documentation. (#34017 )	2018-09-24 16:30:38 -07:00
Abdon Pijpelink	32ee6148d2	[DOCS] Clarify scoring for multi_match phrase type (#32672) The original statement "Runs a match_phrase query on each field and combines the _score from each field." for the phrase type is a bit misleading. The phrase type behaves like the best_fields type and does not combine the scores of each field.	2018-09-18 16:57:33 +02:00
Joel Green	0b567c0eeb	[Docs] Update match-query.asciidoc (#33610 )	2018-09-12 14:35:27 +02:00
lipsill	b7c0d2830a	[Docs] Remove repeating words (#33087 )	2018-08-28 13:16:43 +02:00
w-bonelli	072c0be8af	Update Fuzzy Query docs to clarify default behavior re max_expansions (#30819 ) Stating that the Fuzzy Query generates "all possible" matching terms is misleading, given that the query's default behavior is to generate a maximum of 50 matching terms. (cherry picked from commit 345a0071a2a41fd7f80ae9ef8a39a2cb4991aedd)	2018-07-30 13:19:26 -07:00
Piotr Prądzyński	4fc833b1de	Unify headers for full text queries Relates #31599	2018-06-27 10:11:14 +02:00
Piotr Prądzyński	f6c64a048d	Remove redundant 'minimum_should_match' Relates #31600	2018-06-27 10:11:07 +02:00
Sue Gallagher	b44e1c1978	[DOCS] Removed and params from MLT. Closes #28128 (#31370 )	2018-06-19 13:48:13 -07:00
Sue Gallagher	cdb486ae70	[DOCS] Added 'fail_on_unsupported_field' param to MLT. Closes #28008 (#31160)	2018-06-08 14:41:01 -07:00
Adrien Grand	458bca11bc	Add a `feature_vector` field. (#31102) This field is similar to the `feature` field but is better suited to index sparse feature vectors. A use-case for this field could be to record topics associated with every document alongside a metric that quantifies how well the topic is connected to this document, and then boost queries based on the topics that the logged-in user is interested in. Relates #27552	2018-06-07 10:05:37 +02:00
Nirmal Chidambaram	75a676c70b	Fail `span_multi` queries that exceed boolean max clause limit (#30913) By default span_multi query will limit term expansions = boolean max clause. This will limit high heap usage in case of high cardinality term expansions. This applies only if top_terms_N is not used in inner multi query.	2018-06-07 09:34:39 +02:00
Adrien Grand	1af6d20efe	Fix docs build.	2018-06-05 14:55:40 +02:00
Adrien Grand	984523dda9	Clarify docs about boolean operator precedence. (#30808 ) Unfortunately, the classic queryparser does not honor the usual precedence rules of boolean operators. See https://issues.apache.org/jira/browse/LUCENE-3674.	2018-06-05 08:59:17 +02:00
Jim Ferenczi	f94a75778c	Fix index prefixes to work with span_multi (#31066) * Fix index prefixes to work with span_multi Text fields that use `index_prefixes` can rewrite `prefix` queries into `term` queries internally. This commit fixes the handling of this rewriting in the `span_multi` query. This change also copies the index options of the text field into the prefix field in order to be able to run positional queries. This is mandatory for `span_multi` to work but this could also be useful to optimize `match_phrase_prefix` queries in a follow up. Note that this change can only be done on indices created after 6.3 since we set the index options to doc only in this version. Fixes #31056	2018-06-04 21:48:56 +02:00
Igor Motov	cf0e0606af	Use geohash cell instead of just a corner in geo_bounding_box (#30698 ) Treats geohashes as grid cells instead of just points when the geohashes are used to specify the edges in the geo_bounding_box query. For example, if a geohash is used to specify the top_left corner, the top left corner of the geohash cell will be used as the corner of the bounding box. Closes #25154	2018-05-24 14:46:15 -04:00
Christoph Büscher	3f78b3f5e1	[Docs] Explain incomplete dates in range queries (#30689) The current documentation isn't very clear about how incomplete dates are treated when specifying custom formats in a `range` query. This change adds a note explaining how missing month or year coordinates translate to dates that have the missing slots filled with the Unix time start date (1970-01-01). Closes #30634	2018-05-24 11:20:00 +02:00
Igor Motov	4b6915976c	Add support for indexed shape routing in geo_shape query (#30760 ) Adds ability to specify the routing value for the indexed shape in the geo_shape query. Closes #7663	2018-05-23 15:15:19 -04:00
Adrien Grand	886db84ad2	Expose Lucene's FeatureField. (#30618) Lucene has a new `FeatureField` which gives the ability to record numeric features as term frequencies. Its main benefit is that it allows boosting queries with the values of these features while efficiently skipping non-competitive documents using block-max WAND and indexed impacts.	2018-05-23 08:55:21 +02:00
Igor Motov	b30f2913cf	Docs: document precision limitations of geo_bounding_box (#30540 ) The geo_bounding_box query might produce false positives alongside the right and upper edges and false negatives alongside left and bottom edges. This commit documents the behavior and defines the maximum error. Closes #29196	2018-05-14 15:54:42 -04:00
Jason Tedor	4a4e3d70d5	Default to one shard (#30539 ) This commit changes the default out-of-the-box configuration for the number of shards from five to one. We think this will help address a common problem of oversharding. For users with time-based indices that need a different default, this can be managed with index templates. For users with non-time-based indices that find they need to re-shard with the split API in place they no longer need to resort only to reindexing. Since this has the impact of changing the default number of shards used in REST tests, we want to ensure that we still have coverage for issues that could arise from multiple shards. As such, we randomize (rarely) the default number of shards in REST tests to two. This is managed via a global index template. However, some tests check the templates that are in the cluster state during the test. Since this template is randomly there, we need a way for tests to skip adding the template used to set the number of shards to two. For this we add the default_shards feature skip. To avoid having to write our docs in a complicated way because sometimes they might be behind one shard, and sometimes they might be behind two shards we apply the default_shards feature skip to all docs tests. That is, these tests will always run with the default number of shards (one).	2018-05-14 12:22:35 -04:00
wmellouli	c8d8407012	[Docs] Add term query with normalizer example	2018-05-03 10:23:14 +02:00
Julie Tibshirani	6506edfd9c	Fix a reference to match_phrase_prefix in the match query docs. (#30282 )	2018-05-01 13:46:33 -07:00
Julie Tibshirani	b9e1a00213	Add support to match_phrase query for zero_terms_query. (#29598 )	2018-04-19 11:25:27 -07:00
Adrien Grand	4918924fae	Remove legacy mapping code. (#29224) Some features have been deprecated since `6.0` like the `_parent` field or the ability to have multiple types per index. This allows removing quite a lot of code, which in turn will hopefully make it easier to proceed with the removal of types.	2018-04-11 09:41:37 +02:00
Fabien Baligand	199d131385	Improve query string docs (#28882 ) fix query string syntax doc when OR operator is missed	2018-03-30 16:36:40 +02:00
Fabien Baligand	437ad06e40	fix query string example for boolean query (#28881 )	2018-03-30 15:10:14 +02:00
Jim Ferenczi	c93c7f3121	Remove deprecated options for query_string (#29203 ) This commit removes some parameters deprecated in 6.x (or 5.x): `use_dismax`, `split_on_whitespace`, `all_fields` and `lowercase_expanded_terms`. Closes #25551	2018-03-22 18:37:08 +01:00
tnsatish	70f67b17dd	Fix typo in percolate-query.asciidoc (#29155 )	2018-03-20 16:47:53 +00:00
Sue Gallagher	3530a676e0	[Docs]Corrected spelling errors. (#28976 )	2018-03-19 10:22:40 -07:00
Martijn van Groningen	34a264c375	added docs for `wrapper` query. Closes #11591	2018-03-14 11:51:22 +01:00
Jim Ferenczi	48a7425ae6	Clarifies how query_string splits textual part (#28798 ) * Clarifies how the query_string splits textual part to build a query Whitespaces are not considered as operators anymore in 6x but the documentation is not clear about it. This commit changes the example in the documentation and adds a note regarding whitespaces and operators. Closes #28719	2018-03-01 15:08:25 -08:00
FUJI Goro	2baa19ea64	[Docs] Specify function score logarithm modifiers (#28821 ) The logarithm with base 10 is called "Common Logarithm".	2018-02-27 10:29:43 -08:00
Ke Li	a77273fc01	Reject regex search if regex string is too long (#28542 ) * Reject regex search if regex string is too long (#28344) * Add docs * Introduce index level setting `index.max_regex_length` to control the maximum length of the regular expression Closes #28344	2018-02-23 10:41:24 -08:00
Paul Schwarz	81eda1834b	Improve wording "... as less as possible" -> "... as little as possible"	2018-02-15 15:31:00 +00:00
Andrew Anderson	54a9249992	Fixed typo in search for wrong type (#28645 )	2018-02-13 02:47:01 -05:00
Adrien Grand	f7c4740a76	Document that highlighting `terms` queries is best-effort. (#28371 ) The `terms` query is really designed for filtering and highlighting it might cause performance issues if it wraps many terms, so I am documenting highlighting these queries as a best-effort only. Closes #28099	2018-01-31 15:03:08 +01:00
Lukas Olson	7c5619a29a	Fix spelling error	2018-01-23 12:29:11 -07:00
David Kemp	531c58cf81	Documents applicability of term query to range type (#28166 ) Closes #27030	2018-01-18 17:19:01 -05:00
Nicholas Knize	5ed25f1e12	[GEO] Add WKT Support to GeoBoundingBoxQueryBuilder Add WKT BBOX parsing support to GeoBoundingBoxQueryBuilder.	2018-01-15 13:30:51 -06:00
Gytis Šk	86bffa870b	Update fuzzy-query.asciidoc (#28032 )	2018-01-01 08:44:04 +01:00
Mayya Sharipova	dcde895f49	Introduce limit to the number of terms in Terms Query (#27968 ) - Introduce index level settings to control the maximum number of terms that can be used in a Terms Query - Throw an error if a request exceeds this max number Closes #18829	2017-12-28 17:36:29 -05:00
Vlad Holubiev	7b14e4b8e0	[DOCS] Remove extra word (#27989 )	2017-12-26 16:24:29 +00:00
Adrien Grand	1b660821a2	Allow `_doc` as a type. (#27816 ) Allowing `_doc` as a type will enable users to make the transition to 7.0 smoother since the index APIs will be `PUT index/_doc/id` and `POST index/_doc`. This also moves most of the documentation to `_doc` as a type name. Closes #27750 Closes #27751	2017-12-14 17:47:53 +01:00
Martijn van Groningen	6cda5b292c	docs: add paragraph about using `percolate` query in a filter context	2017-12-01 10:55:01 +01:00
Simon Willnauer	fadbe0de08	Automatically prepare indices for splitting (#27451) Today we require users to prepare their indices for split operations. Yet, we can do this automatically when an index is created which would make the split feature a much more appealing option since it doesn't have any 3rd party prerequisites anymore. This change automatically sets the number of routing shards such that an index is guaranteed to be able to split once into twice as many shards. The number of routing shards is scaled towards the default shard limit per index such that indices with a smaller amount of shards can be split more often than larger ones. For instance an index with 1 or 2 shards can be split 10x (until it approaches 1024 shards) while an index created with 128 shards can only be split 3x by a factor of 2. Please note this is just a default value and users can still prepare their indices with `index.number_of_routing_shards` for custom splitting. NOTE: this change has an impact on the document distribution since we are changing the hash space. Documents are still uniformly distributed across all shards but since we are artificially changing the number of buckets in the consistent hashing space documents might be hashed into different shards compared to previous versions. This is a 7.0 only change.	2017-11-23 09:48:54 +01:00
Jim Ferenczi	53462f6499	Make fields optional in multi_match query and rely on index.query.default_field by default (#27380) * Make fields optional in multi_match query and rely on index.query.default_field by default This commit adds the ability to send `multi_match` query without providing any `fields`. When no fields are provided the `multi_match` query will use the fields defined in the index setting `index.query.default_field` (which in turn defaults to `*`). The same behavior is already implemented in `query_string` and `simple_query_string` so this change just applies the heuristic to `multi_match` queries. Relying on `index.query.default_field` rather than `*` is safer for big mappings that break the 1024 field expansion limit added in 7.0 for all text queries. For this kind of mapping the admin can change the `index.query.default_field` in order to make sure that exploratory queries using `multi_match`, `query_string` or `simple_query_string` do not throw an exception.	2017-11-17 10:25:21 +01:00
Martijn van Groningen	d805c41b28	Added new terms_set query This query returns documents that match one or more of the provided terms. The number of terms that must match varies per document and is either controlled by a minimum should match field or computed per document in a minimum should match script. Closes #26915	2017-11-01 10:55:18 +01:00
Jim Ferenczi	792641a6e3	[Docs] #26541 : add warning regarding the limit on the number of fields that can be queried at once in the multi_match query.	2017-10-30 18:03:56 +01:00
Clarkie	b1ce5cf836	[Docs] Fix indentation of examples (#27168 )	2017-10-30 11:56:38 +01:00
Jim Ferenczi	a4105c6b4a	[Docs] Clarify `span_not` query behavior for non-overlapping matches (#27150 ) Closes #27134	2017-10-30 11:29:40 +01:00
Martijn van Groningen	f1e944a675	docs: describe parent/child performances	2017-10-26 11:49:13 +02:00
Alexander Kazakov	592ab043dd	Change default value to true for transpositions parameter of fuzzy query (#26901 )	2017-10-11 15:31:48 +02:00
Alexander Kazakov	9c95e91471	Expose `fuzzy_transpositions` parameter in fuzzy queries (#26870 ) Add fuzzy_transpositions parameter to multi_match and query_string queries. Add fuzzy_transpositions, fuzzy_prefix_length and fuzzy_max_expansions parameters to simple_query_string query.	2017-10-05 09:01:09 +02:00
Jim Ferenczi	17b9baf5fd	Clarify pure wildcard matching with `query_string` (#26814) In 5.x pure wildcard queries `` in `query_string` are rewritten to `exists` query for efficiency. However, this changed which documents match such queries, because the `exists` query also returns documents with an empty value for the field. This change clarifies this behavior for 5.x and beyond. Closes #26801 review	2017-10-04 09:55:26 +02:00
javanna	dee2ae1023	[DOCS] Replace mention of string field type with text and keyword Closes #25713	2017-09-25 11:12:06 +02:00
Christoph Büscher	86b00b84bc	Remove parse field deprecations in query builders (#26711 ) The `fielddata` field and the use of the `_name` field in the short syntax of the range query have been deprecated in 5.0 and can be removed. The same goes for the deprecated `score_mode` field in HasParentQueryBuilder, the deprecated `like_text`, `ids` and `docs` parameter in the `more_like_this` query, the deprecated query name in the short version of the `regexp` query, and several deprecated alternative field names in other query builders.	2017-09-20 16:22:21 +02:00
Christoph Büscher	c7c6443b10	[Docs] "The the" is a great band, but ... (#26644 ) Removing several occurrences of this typo in the docs and javadocs, seems to be a common mistake. Corrections turn up once in a while in PRs, better to correct some of this in one sweep.	2017-09-14 15:08:20 +02:00
Lee Hinman	2702918780	Limit the number of expanded fields in query_string and simple_query_string (#26541) * Limit the number of expanded fields in query_string and simple_query_string This limits the number of automatically expanded fields for the "all fields" mode (`"default_field": "*"`) for the `query_string` and `simple_query_string` queries to 1024 fields. Resolves #25105 Add blurb about limit to the docs	2017-09-08 13:37:55 -06:00
Martijn van Groningen	b391425da1	Added support to the percolate query to percolate multiple documents The percolator will add a `_percolator_document_slot` field to all percolator hits to indicate with what document it has matched. This number matches with the order in which the documents have been specified in the percolate query. Also improved the support for multiple percolate queries in a search request.	2017-09-08 17:28:39 +02:00
Christoph Büscher	f8fc0f3ebe	[Tests] Check that quoteAnalyzer overrides analyzer in `query_string` query (#26473 ) Adding a check to QueryStringQueryBuilderTests that checks the override behaviour of `quote_analyzer`, also adding documentation explaining the use of this parameter in `query_string` query. Closes #25417	2017-09-02 11:53:02 +02:00
Jim Ferenczi	86d97971a4	Remove the _all metadata field (#26356 ) * Remove the _all metadata field This change removes the `_all` metadata field. This field is deprecated in 6 and cannot be activated for indices created in 6 so it can be safely removed in the next major version (e.g. 7).	2017-08-28 17:43:59 +02:00
shaulzorea	a827d545d8	[Docs] Fixing phrasing in has-parent-query.asciidoc (#26396 )	2017-08-28 10:26:59 +02:00
Jim Ferenczi	de1e4e0c15	Accept an array of field names and boosts in the index.query.default_field setting (#26320) * Accept an array of field names and boosts in the index.query.default_field setting This commit allows defining an array of field names and boosts for the index setting `index.query.default_field`. The format is equivalent to the `fields` options of the full text search queries (e.g. field_name^boost). This commit also makes this setting dynamically updatable. Fixes #25946	2017-08-23 15:39:54 +02:00
Jim Ferenczi	4bce727165	Refactor simple_query_string to handle text part like multi_match and query_string (#26145) This change is a continuation of #25726 that aligns field expansions for the simple_query_string with the query_string and multi_match query. The main changes are: * For exact field name, the new behavior is to rewrite to a matchnodocs query when the field name is not found in the mapping. * For partial field names (with * suffix), the expansion is done only on keyword, text, date, ip and number field types. Other field types are simply ignored. * For all fields (`*`), the expansion is done on accepted field types only (see above) and metadata fields are also filtered. The use_all_fields option is deprecated in this change and can be replaced by setting `*` in the fields parameter. This commit also changes how text fields are analyzed. Previously the default search analyzer (or the provided analyzer) was used to analyze every text part, ignoring the analyzer set on the field in the mapping. With this change, the field analyzer is used instead unless an analyzer has been forced in the parameter of the query. Finally now that all full text queries can handle the special "*" expansion (`all_fields` mode), the `index.query.default_field` is now set to `*` for indices created in 6.	2017-08-21 13:12:27 +02:00
Jim Ferenczi	a7e1610134	Add support for auto_generate_synonyms_phrase_query in match_query, multi_match_query, query_string and simple_query_string (#26097 ) * Add support for auto_generate_synonyms_phrase_query in match_query, multi_match_query, query_string and simple_query_string This change adds a new parameter called auto_generate_synonyms_phrase_query (defaults to true). This option can be used in conjunction with synonym_graph token filter to generate phrase queries when multi terms synonyms are encountered. For example, a synonym like "ny, new york" would produce the following boolean query when "ny city" is parsed: ((ny OR "new york") AND city) Note how the multi terms synonym "new york" produces a phrase query.	2017-08-09 12:15:09 +02:00
Ian Fisk	8cb1391f40	Docs: Use correct field name in Field Value factor docs. (#26104 )	2017-08-08 16:34:20 -04:00
Jim Ferenczi	7868373069	[Docs] remove reference to the deprecated in the docs	2017-07-25 09:41:53 +02:00
Jim Ferenczi	4a9995145c	[Docs]: Clarify query_string parser splits on operator	2017-07-24 18:36:16 +02:00
Jim Ferenczi	c3784326eb	Refactor field expansion for match, multi_match and query_string query (#25726 ) This commit changes the way we handle field expansion in `match`, `multi_match` and `query_string` query. The main changes are: - For exact field name, the new behavior is to rewrite to a matchnodocs query when the field name is not found in the mapping. - For partial field names (with `*` suffix), the expansion is done only on `keyword`, `text`, `date`, `ip` and `number` field types. Other field types are simply ignored. - For all fields (`*`), the expansion is done on accepted field types only (see above) and metadata fields are also filtered. - The `*` notation can also be used to set `default_field` option on `query_string` query. This should replace the needs for the extra option `use_all_fields` which is deprecated in this change. This commit also rewrites simple `*` query to matchalldocs query when all fields are requested (Fixes #25556). The same change should be done on `simple_query_string` for completeness. `use_all_fields` option in `query_string` is also deprecated in this change, `default_field` should be set to `*` instead. Relates #25551	2017-07-21 16:52:57 +02:00
Adrien Grand	f1ff7f2454	Require a field when a `seed` is provided to the `random_score` function. (#25594 ) We currently use fielddata on the `_id` field which is trappy, especially as we do it implicitly. This changes the `random_score` function to use doc ids when no seed is provided and to suggest a field when a seed is provided. For now the change only emits a deprecation warning when no field is supplied but this should be replaced by a strict check on 7.0. Closes #25240	2017-07-19 14:11:15 +02:00
Simon Willnauer	cb4eebcd6a	Make `index` in TermsLookup mandatory (#25753 ) This change removes the leniency of having a `null` index to fetch terms from in 6.0 onwards. This feature will be deprecated in the 5.x series and 6.0 nodes will require the index to be set. Closes #25750	2017-07-17 18:50:30 +02:00
Martijn van Groningen	c8777c4c2e	docs: Updated reference docs that `document_type` is deprecated	2017-07-14 11:07:46 +02:00
Jim Ferenczi	13da3eb53e	Refactor QueryStringQuery for 6.0 (#25646 ) This change refactors the query_string query to analyze the query text around logical operators of the query string the same way as a match_query/multi_match_query. It also adds a type parameter that can be used to change the way multi fields query are built the same way as a multi_match query does. Now that these queries share the same behavior regarding text analysis, some parameters are obsolete and have been deprecated: split_on_whitespace: This setting is now ignored with a deprecation notice if it is used explicitly. With this PR the query_string always splits on logical operator. It simplifies the understanding of the other parameters that can have different meanings depending on the value of split_on_whitespace. auto_generate_phrase_queries: This setting is now ignored with a deprecation notice if it is used explicitly. This setting only makes sense when the parser splits on whitespace. use_dismax: This setting is now ignored with a deprecation notice if it is used explicitly. The tie_breaker parameter is sufficient to handle best_fields/most_fields. Fixes #25574	2017-07-13 15:32:17 +02:00
Simon Willnauer	e81804cfa4	Add a shard filter search phase to pre-filter shards based on query rewriting (#25658 ) Today if we search across a large amount of shards we hit every shard. Yet, it's quite common to search across an index pattern for time based indices but filtering will exclude all results outside a certain time range ie. `now-3d`. While the search can potentially hit hundreds of shards the majority of the shards might yield 0 results since there is no document that is within this date range. Kibana for instance does this regularly but used `_field_stats` to optimize the indexes they need to query. Now with the deprecation of `_field_stats` and its upcoming removal a single dashboard in kibana can potentially turn into searches hitting hundreds or thousands of shards and that can easily cause search rejections even though most of the requests are very likely super cheap and only need a query rewriting to early terminate with 0 results. This change adds a pre-filter phase for searches that can, if the number of shards is higher than the `pre_filter_shard_size` threshold (defaults to 128 shards), fan out to the shards and check if the query can potentially match any documents at all. While false positives are possible, a negative response means that no matches are possible. These requests are not subject to rejection and can greatly reduce the number of shards a request needs to hit. The approach here is preferable to the kibana approach with field stats since it correctly handles aliases and uses the correct threadpools to execute these requests. Further it's completely transparent to the user and improves scalability of elasticsearch in general on large clusters.	2017-07-12 22:19:20 +02:00
olcbean	2ba9fd2aec	Remove deprecated created and found from index, delete and bulk (#25516 ) The created and found fields in index and delete responses became obsolete after the introduction of the result field in index, update and delete responses (#19566). After deprecating the created and found fields in 5.x (#19633), now they are removed. Fixes #19630	2017-07-07 13:58:46 -04:00
Clinton Gormley	0170e0e8d3	Remove usage of multi-types from the docs and added a page explaining type removal (#25543 ) Closes #25401	2017-07-05 12:30:19 +02:00
dkimdon	fdb3a97152	Update percolate-query.asciidoc (#25364 )	2017-06-23 10:39:57 +02:00
Martijn van Groningen	a977569085	percolator: Deprecate `document_type` parameter. The `document_type` parameter is no longer required to be specified, because by default from 6.0 only a single type is allowed. (`index.mapping.single_type` defaults to `true`)	2017-06-22 09:55:06 +02:00
Adrien Grand	0c117145f6	Upgrade to lucene-7.0.0-snapshot-92b1783. (#25222 ) This snapshot has faster range queries on range fields (LUCENE-7828), more accurate norms (LUCENE-7730) and the ability to use fake term frequencies (LUCENE-7854).	2017-06-15 09:52:07 +02:00
Ryan Ernst	a03b6c2fa5	Scripting: Change keys for inline/stored scripts to source/id (#25127 ) This commit adds back "id" as the key within a script to specify a stored script (which with file scripts now gone is no longer ambiguous). It also adds "source" as a replacement for "code". This is in an attempt to normalize how scripts are specified across both put stored scripts and script usages, including search template requests. This also deprecates the old inline/stored keys.	2017-06-09 08:29:25 -07:00
Andrey Groshev	e4fd8485ce	Made the same length of opening and closing lines (#23583 )	2017-06-09 00:50:43 -07:00
David Cho-Lerat	491dc1186a	Add missing word to terms-query.asciidoc (#24960 )	2017-05-30 09:42:07 -04:00
David Cho-Lerat	c939bcb7f5	Correct some spelling in match-phrase-prefix docs (#24956 )	2017-05-30 09:02:01 -04:00
Martijn van Groningen	840da4aebf	Removed deprecated template query. Relates to #19390	2017-05-11 14:56:45 +02:00
Adrien Grand	1be2800120	Only allow one type on 7.0 indices (#24317 ) This adds the `index.mapping.single_type` setting, which enforces that indices have at most one type when it is true. The default value is true for 6.0+ indices and false for old indices. Relates #15613	2017-04-27 08:43:20 +02:00
Nik Everett	416feeb7f9	Rewrite description of `bool`'s `should` (#24342 ) Docs: rewrite description of `bool`'s `should` Rewrites the description of the `bool` query's `should` clauses so it is (hopefully) more clear what the defaults for `minimum_should_match` are. There is still an `[IMPORTANT]` section about `minimum_should_match` in a filter context. I think it is worth keeping because it is, well, important. Closes #23831	2017-04-26 14:09:26 -04:00
Jason Tedor	4796557a30	Add primary term to doc write response This commit adds the primary term to the doc write response. Relates #24171	2017-04-19 14:44:22 -04:00
Adrien Grand	4632661bc7	Upgrade to a Lucene 7 snapshot (#24089 ) We want to upgrade to Lucene 7 ahead of time in order to be able to check whether it causes any trouble to Elasticsearch before Lucene 7.0 gets released. From a user perspective, the main benefit of this upgrade is the enhanced support for sparse fields, whose resource consumption is now function of the number of docs that have a value rather than the total number of docs in the index. Some notes about the change: - it includes the deprecation of the `disable_coord` parameter of the `bool` and `common_terms` queries: Lucene has removed support for coord factors - it includes the deprecation of the `index.similarity.base` expert setting, since it was only useful to configure coords and query norms, which have both been removed - two tests have been marked with `@AwaitsFix` because of #23966, which we intend to address after the merge	2017-04-18 15:17:21 +02:00
Nik Everett	048191ceb6	CONSOLEify highlighting a function_score docs Converts many of the partial examples into full search requests. Relates #18160	2017-04-06 08:13:56 -04:00
Nik Everett	653f50973a	CONSOLEify geo-shape docs `CONSOLE`ify geo-shape type and geo-shape query docs. Relates to #18160	2017-03-31 09:11:54 -04:00
Nik Everett	9abb125417	Fix exists query doc I managed to push the last one without testing it because I'd changed the way I run tests locally and hadn't picked it up. Ooops. This one works better.	2017-03-30 22:26:10 -04:00
Nik Everett	bc33753aee	Mark exists-query dsl doc properly All the docs for the `exists` query that aren't marked as `CONSOLE` aren't actually `CONSOLE`-worthy so this marks them as `NOTCONSOLE`. It also rewrites the text around `missing` query. Since it was removed in 5.0 we don't need to talk about it in the 6.0 docs. Relates to #18160	2017-03-30 22:01:07 -04:00
Pavel Chertorogov	ff1530592e	Docs: Fix indentation in has-child-query.asciidoc (#23565 )	2017-03-13 08:41:18 -07:00
Pavel Chertorogov	5da7cefbe2	Docs: Fix indentation in has-parent-query.asciidoc	2017-03-13 08:17:11 -07:00
AlexNodex	139eb69fe4	Typo (#23179 ) autoGeneratePhraseQueries should be auto_generate_phrase_queries	2017-02-15 10:10:06 +01:00
Catherine Snow	51bad4300c	Fix typo (#23171 )	2017-02-15 09:38:10 +01:00
Giuseppe	ecbeffcb1e	Add note about min_score filtering efficiency (#23109 ) * Add note about min_score filtering efficiency * Reword to mention 'HAVING' * Remove reference to HAVING	2017-02-13 12:15:01 +01:00
Nik Everett	0e98c9107a	Docs: CONSOLEify some more docs These need to be CONSOLEified now because we're starting to require Content-Type headers and they didn't have any. * cluster/reroute: Marked as CONSOLE but skipped because the docs build runs with a single node. * docs/bulk: Marked as NOTCONSOLE because the snippets describe either examples or `curl` commands. Fixed the `curl` command to include the `Content-Type` header. * query-dsl/terms-query: Marked as CONSOLE. * search/request/rescore: Marked as CONSOLE. Fixed deprecated syntax. Relates #23001 Relates #18160	2017-02-07 16:49:01 -05:00
Nicholas Knize	bc884c1e7b	[Docs] Remove ignore_malformed from Geo Query DSL docs This commit removes the ignore_malformed parameter from the Geo Query DSL documentation.	2017-02-06 14:27:15 -06:00
Nicholas Knize	b41d5747f0	Reduce GeoDistance insanity GeoDistance query, sort, and scripts make use of a crazy GeoDistance enum for handling 4 different ways of computing geo distance: SLOPPY_ARC, ARC, FACTOR, and PLANE. Only two of these are necessary: ARC, PLANE. This commit removes SLOPPY_ARC, and FACTOR and cleans up the way Geo distance is computed.	2017-02-02 12:39:42 -06:00
Nik Everett	f90051e6e0	Docs: Add a note about `<` and `>` in query_string `<` and `>` can't be escaped at all in `query_string`. If we're not going to fix that we should at least document it. Relates to #21703	2017-01-31 12:23:18 -05:00
William Webber	f1a902865f	Update span-multi-term-query.asciidoc (#22733 ) "term" is not actually a multi-term query (perhaps confusion with "term range")	2017-01-23 17:33:40 +01:00
William Webber	abaf728882	"from" => "gte", "to" => "lte" in bool example (#22735 )	2017-01-23 17:29:00 +01:00
Francesc Gil	17342c403f	Indentation error on example of dist_max (#22578 ) There was a problem with the indentation on the example of the `dist_max` query	2017-01-12 09:38:36 +01:00
Lee Hinman	66cf3d3220	Document simple_query_string negation with default_operator of OR This can be confusing when unexpected. Resolves #4707	2017-01-10 10:27:00 -07:00
Jake	d7cc6e28e7	Document `must_not` context and scoring (#22532 ) Document that `must_not` uses filter context and returns a score of `0`.	2017-01-10 17:26:48 +01:00
Nik Everett	75d5b3d9eb	Fix parent_id example in docs And fix some indentation I noticed while looking up the query.	2017-01-10 10:01:31 -05:00
Clinton Gormley	3999e5ba6b	Docs: Added link from bool and constant score query to filter context Closes #22353	2016-12-29 11:05:28 +01:00
Adrien Grand	b2e93d2870	Be explicit about the fact backslashes need to be escaped. (#22257 ) Relates #22255	2016-12-19 14:21:21 +01:00
Luca Cavanna	73cf002293	Un-deprecate fuzzy query (#22088 ) When we decided to deprecate and remove fuzzy query in #15760, we didn't realize we would take away the possibility for users to use a fuzzy query as part of a span query, which is not possible using match query. This means we have to go back and un-deprecate fuzzy query, which will not be removed. Closes #15760	2016-12-12 12:09:16 +01:00
Matias Anaya	beb794cb0f	Fix typo in percolated-query.asciidoc (#21991 )	2016-12-09 13:45:57 +01:00
Luca Cavanna	103984a4a1	Remove indices query (#21837 ) The indices query is deprecated since 5.0.0 (#17710). It can now be removed in master (future 6.0 version).	2016-11-30 19:37:01 +01:00
Adrien Grand	eed5de20e0	Remove docs for the removed `geo_distance_range` query.	2016-11-30 16:36:55 +01:00
Adrien Grand	90ab477f19	The `terms` query should always map to a Lucene `TermsQuery`. (#21786 ) Currently, the `terms` query is just syntactic sugar for a `bool` query when used in a query context. This change proposes to always generate the same query in query and filter contexts, which is less confusing.	2016-11-30 15:29:09 +01:00
Luca Cavanna	f253621feb	Remove deprecated query names: in, geo_bbox, mlt, fuzzy_match and match_fuzzy (#21852 ) These query names were all deprecated in 5.0.0: - in is removed in favour of terms - geo_bbox is removed in favour of geo_bounding_box - mlt is removed in favour of more_like_this - fuzzy_match and match_fuzzy are removed in favour of match	2016-11-29 19:07:01 +01:00
Jim Ferenczi	d791ddf704	Upgrade to lucene-6.4.0-snapshot-ec38570 (#21853 ) Set lucene version to 6.4.0-snapshot-ec38570 and update all the sha1s/license Fix invalid combo after upgrade in query_string query. split_on_whitespace=false is disallowed if auto_generate_phrase_queries=true Adapt the expectations of some tests to the new format of the Lucene explain output	2016-11-29 18:40:31 +01:00
Clinton Gormley	5ae6845d4d	Update percolate-query.asciidoc Add missing callout to percolate query	2016-11-26 12:35:33 +01:00
Trey Tacon	3ef7f0dec6	Fixing indentation in geospatial querying example. (#21682 ) Specifically the example which shows providing an array of an array of values.	2016-11-21 13:09:21 +01:00
David Pilato	475a7ca84f	Add documentation for lenient in multimatch `lenient` option is documented for `match` query but not for `multi_match` query.	2016-11-17 08:35:20 +01:00
David Pilato	c946094d5b	Add documentation for lenient in multimatch `lenient` option is documented for `match` query but not for `multi_match` query.	2016-11-16 16:15:28 +01:00
Jason Tedor	d06a8903fd	Merge branch 'master' into feature/seq_no * master: (22 commits) Add proper toString() method to UpdateTask (#21582) Fix `InternalEngine#isThrottled` to not always return `false`. (#21592) add `ignore_missing` option to SplitProcessor (#20982) fix trace_match behavior for when there is only one grok pattern (#21413) Remove dead code from GetResponse.java Fixes date range query using epoch with timezone (#21542) Do not cache term queries. (#21566) Updated dynamic mapper section Docs: Clarify date_histogram bucket sizes for DST time zones Handle release of 5.0.1 Fix skip reason for stats API parameters test Reduce skip version for stats API parameter tests Strict level parsing for indices stats Remove cluster update task when task times out (#21578) [DOCS] Mention "all-fields" mode doesn't search across nested documents InternalTestCluster: when restarting a node we should validate the cluster is formed via the node we just restarted Fixed bad asciidoc in boolean mapping docs Fixed bad asciidoc ID in node stats Be strict when parsing values searching for booleans (#21555) Fix time zone rounding edge case for DST overlaps ...	2016-11-16 09:10:35 -05:00
Lee Hinman	17a2fffc9b	[DOCS] Mention "all-fields" mode doesn't search across nested documents	2016-11-15 11:02:43 -07:00
Jason Tedor	33f7cd5a16	Remove shard ID from doc write response This commit removes the shard ID from doc write response; this was useful for debugging but its time has passed. Relates #21508	2016-11-11 15:18:25 -05:00
Jason Tedor	d3417fb022	Merge branch 'master' into feature/seq_no * master: (516 commits) Avoid angering Log4j in TransportNodesActionTests Add trace logging when aquiring and releasing operation locks for replication requests Fix handler name on message not fully read Remove accidental import. Improve log message in TransportNodesAction Clean up of Script. Update Joda Time to version 2.9.5 (#21468) Remove unused ClusterService dependency from SearchPhaseController (#21421) Remove max_local_storage_nodes from elasticsearch.yml (#21467) Wait for all reindex subtasks before rethrottling Correcting a typo-Maan to Man-in README.textile (#21466) Fix InternalSearchHit#hasSource to return the proper boolean value (#21441) Replace all index date-math examples with the URI encoded form Fix typos (#21456) Adapt ES_JVM_OPTIONS packaging test to ubuntu-1204 Add null check in InternalSearchHit#sourceRef to prevent NPE (#21431) Add VirtualBox version check (#21370) Export ES_JVM_OPTIONS for SysV init Skip reindex rethrottle tests with workers Make forbidden APIs be quieter about classpath warnings (#21443) ...	2016-11-10 23:40:33 -05:00
Lee Hinman	7420fd0be3	Add "all fields" execution mode to simple_query_string query This commit introduces a new execution mode for the `simple_query_string` query, which is intended down the road to be a replacement for the current _all field. It now does auto-field-expansion and auto-leniency when the following criteria are ALL met: The _all field is disabled No default_field has been set in the index settings No fields are specified in the request Additionally, a user can force the "all-like" execution by setting the all_fields parameter to true. When executing in all field mode, the `simple_query_string` query will look at all the fields in the mapping that are not metafields and can be searched, and automatically expand the list of fields that are going to be queried. Relates to #20925, which is the `query_string` version of this work. This is basically the same behavior, but for the `simple_query_string` query. Relates to #19784	2016-11-09 10:38:51 -07:00
Lee Hinman	6666fb1614	Add "all field" execution mode to query_string query This commit introduces a new execution mode for the query_string query, which is intended down the road to be a replacement for the current _all field. It now does auto-field-expansion and auto-leniency when the following criteria are ALL met: The _all field is disabled No default_field has been set in the index settings No default_field has been set in the request No fields are specified in the request Additionally, a user can force the "all-like" execution by setting the all_fields parameter to true. When executing in all field mode, the query_string query will look at all the fields in the mapping that are not metafields and can be searched, and automatically expand the list of fields that are going to be queried. Relates to #19784	2016-11-04 05:46:18 -06:00
Christoph Büscher	a0c094d0c1	Add deprecation logging message for 'fuzzy' query This query is deprecated from 5.0 on. Similar to IndicesQueryBuilder we should log a deprecation warning whenever this query is used. Relates to #15760	2016-11-02 15:45:33 +01:00
Adrien Grand	52de0645fb	Remove `lowercase_expanded_terms` and `locale` from query-parser options. (#20208 ) Lucene 6.2 introduces the new `Analyzer.normalize` API, which allows to apply only character-level normalization such as lowercasing or accent folding, which is exactly what is needed to process queries that operate on partial terms such as `prefix`, `wildcard` or `fuzzy` queries. As a consequence, the `lowercase_expanded_terms` option is not necessary anymore. Furthermore, the `locale` option was only needed in order to know how to perform the lowercasing, so this one can be removed as well. Closes #9978	2016-11-02 14:25:08 +01:00
Jim Ferenczi	9d6fac809c	Expose splitOnWhitespace in `Query String Query` (#20965 ) This change adds an option called `split_on_whitespace` which prevents the query parser to split free text part on whitespace prior to analysis. Instead the queryparser would parse around only real 'operators'. Default to true. For instance the query `"foo bar"` would let the analyzer of the targeted field decide how the tokens should be splitted. Some options are missing in this change but I'd like to add them in a follow up PR in order to be able to simplify the backport in 5.x. The missing options (changes) are: * A `type` option which similarly to the `multi_match` query defines how the free text should be parsed when multi fields are defined. * Simple range query with additional tokens like ">100 50" are broken when `split_on_whitespace` is set to false. It should be possible to preserve this syntax and make the parser aware of this special syntax even when `split_on_whitespace` is set to false. * Since all this options would make the `query_string_query` very similar to a match (multi_match) query we should be able to share the code that produce the final Lucene query.	2016-11-02 10:00:40 +01:00
Jack Conradson	185dff7346	Cleanup ScriptType (#21179 ) Refactored ScriptType to clean up some of the variable and method names. Added more documentation. Deprecated the 'in' ParseField in favor of 'stored' to match the indexed scripts being replaced by stored scripts.	2016-10-31 13:48:51 -07:00
Adrien Grand	9cbbddb6dc	Add support for `quote_field_suffix` to `simple_query_string`. (#21060 ) Closes #18641	2016-10-28 09:11:57 +02:00
Quinn Shanahan	1bef6c7fee	Update regexp-syntax.asciidoc (#20973 )	2016-10-17 16:32:17 +02:00
Pascal Borreli	fcb01deb34	Fixed typos (#20843 )	2016-10-10 14:51:47 -06:00
Anatolii Stepaniuk	f895abcf40	Fix grammar issues in some docs This commit fixes some grammar issues in various docs. Closes #20751 Closes #20752 Closes #20754 Closes #20755	2016-10-05 11:20:45 -04:00
Jason Tedor	8879360f66	Fix failing doc tests in feature/seq_no This commit fixes failing doc tests in feature/seq_no after merging master into this branch.	2016-09-29 03:58:02 +02:00
Nicholas Knize	1a60e1c3d2	Update docs for LatLonPoint cut over This commit removes documentation for: * geohash cell query * lat_lon parameter * geohash parameter * geohash_precision parameter * geohash_prefix parameter It also updates failing tests that reference these parameters for backcompat.	2016-09-13 12:18:21 -05:00
Yevhen Bobrov	786508be08	Documentation for field_masking_span query (#20395 ) * Documentation for field_masking_span query. Fixes #20293 * After review fixes	2016-09-13 12:27:33 +01:00
Nik Everett	bebdec570f	[docs] Mark percolator response snippets properly Now the docs tests will catch any errors in the responses. This would have caught the error fixed in https://github.com/elastic/elasticsearch/pull/20351	2016-09-07 09:45:50 -04:00
antonisppn	e77f4710e4	[docs] Percolator samples are not working. Mapping is wrong. Hi all, I was trying to run the percolate examples, but I figured that because of the "type":"keyword" , the code wasn't working. In the search query the "message" : "A new bonsai tree in the office" is a pure string. I changed it to "text".	2016-09-07 08:15:20 -04:00
Greg Ichneumon Brown	639b7278d9	Docs: clarify calculation of sigma and lambda in function_score (#20267 ) - Using log() to indicate natural log can add some confusion when trying to further adjust/tweak scores. Other parts of the API (field_value_factor on this same page) use 'ln' and 'log', so this change should be more consistent - Fixes #20027 - I generated the images using http://latex2png.com/ at a resolution of 150 which seemed to be about the same size as before	2016-09-02 14:41:07 +02:00
nrichers	e5bf02b155	Fix broken link reported by Twitter user (#20291 )	2016-09-01 17:41:44 -07:00
Greg Ichneumon Brown	92c54aa4a1	Docs: clarify scale is applied at origin+offset (#20242 ) - fixes #19832	2016-08-31 17:02:59 +02:00
Martijn van Groningen	7c9af98a3c	docs: add sort workaround	2016-08-26 10:55:42 +02:00
Nik Everett	5b34bec92a	Add deprecation warnings to docs for geohash Relates to #20126	2016-08-23 13:43:35 -04:00
Nicholas Knize	28ed0e7abf	Deprecate optimize_bbox on geodistance queries Deprecates the optimize_bbox parameter on geodistance queries. This has no longer been needed since version 2.2 because lucene geo distance queries (postings and LatLonPoint) already optimize by bounding box.	2016-08-23 09:14:54 -05:00
Munish Goyal	0ee3a479e9	Update wildcard-query.asciidoc (#20057 ) Update sentence grammar	2016-08-18 14:04:46 +02:00
Nik Everett	1e587406d8	Fail yaml tests and docs snippets that get unexpected warnings Adds `warnings` syntax to the yaml test that allows you to expect a `Warning` header that looks like: ``` - do: warnings: - '[index] is deprecated' - quotes are not required because yaml - but this argument is always a list, never a single string - no matter how many warnings you expect get: index: test type: test id: 1 ``` These are accessible from the docs with: ``` // TEST[warning:some warning] ``` This should help to force you to update the docs if you deprecate something. You must add the warnings marker to the docs or the build will fail. While you are there you should update the docs to add deprecation warnings visible in the rendered results.	2016-08-04 15:23:05 -04:00
Mary	fa3420c2a5	Update term-level-queries.asciidoc Typo fix	2016-08-03 10:18:13 -06:00
Adrien Grand	a4cb63b98c	Remove `_missing_` from the docs. It is removed in 5.0, see #15153.	2016-08-01 16:57:37 +02:00
Adrien Grand	37d5bcb264	Clarify `function_score` docs. Closes #18315	2016-07-19 10:25:48 +02:00
Britta Weber	57a734e641	[doc] explain avg in function_score better (#19154 ) * [doc] explain avg in function_score better	2016-06-30 11:52:53 +02:00
Robert Muir	6fc1a22977	cutover some docs to painless	2016-06-27 09:55:16 -04:00
Martijn van Groningen	2a196d4068	docs: update example for finding percolator where query terms couldn't be extracted successfully	2016-06-24 18:18:02 +02:00
Britta Weber	d55f719f8a	[TEST] wait for yellow after setup doc tests (#18726 ) * [TEST] wait for yellow after setup doc tests We have many places in the doc where we expect and index to be yellow before we execute a query. Therefore we have to always wait for yellow after setup.	2016-06-03 16:37:28 +02:00
Nik Everett	2a2730405e	Add wait for yellow to doc snippet so it runs cleanly Found by http://build-us-00.elastic.co/job/es_core_master_window-2008/3866/console	2016-05-24 12:15:52 -04:00
Isabel Drost-Fromm	4c02e97bcd	Add back doc execution to query dsl. Relates to #18211 This reverts commit `20aafb1196`.	2016-05-24 12:43:41 +02:00
Martijn van Groningen	e714a04c67	docs: fix typo	2016-05-22 22:50:31 +02:00
Martijn van Groningen	c1a0929123	percolator: Add support for MatchNoDocsQuery in query terms extract service Before the query extraction would have been aborted and the percolator query would be marked as unknown. This resulted in a situation that these queries always need to be evaluated by the memory index at search time. By adding support for this query many more percolator query candidate hits can skip the expensive memory index verification step. For example the `match` query parser returns a MatchNoDocsQuery if the query terms are removed by text analysis (lets query text only contained stop words).	2016-05-22 22:42:19 +02:00
Martijn van Groningen	80fee8666f	percolator: Removed percolator cache Before 5.0 for it was required that the percolator queries were cached in jvm heap as Lucene queries for two reasons: 1) Performance. The percolator evaluated all percolator queries all the time. There was no pre-selecting queries that are likely to match like we have today. 2) Updates made to percolator queries were visible in realtime, Today these changes are visible in near realtime. So updating no longer requires the percolator to have the queries in jvm heap. So having the percolator queries in jvm heap via the percolator cache is now less attractive. Especially when there are many percolator queries then these queries can consume many GBs of jvm heap. Removing the percolator cache does make the percolate query slower compared to how the execution time in 5.0.0-alpha1 and alpha2, but it is still faster compared to 2.x and before.	2016-05-20 14:52:16 +02:00
Nik Everett	ee4e470f60	Add a wait_for_stats=yellow to a docs snippet It was making unstable tests.	2016-05-18 15:11:49 -04:00
Isabel Drost-Fromm	4c627a00e5	Merge branch 'master' into docs/add_autosense_to_query_dsl	2016-05-17 21:12:06 +02:00
Isabel Drost-Fromm	20aafb1196	Revert "Add Autosense annotation for query dsl testing"	2016-05-17 20:55:56 +02:00
Isabel Drost-Fromm	9922931144	Fix occasional build error.	2016-05-17 15:40:53 +02:00
Isabel Drost-Fromm	2d402c732c	Merge branch 'master' into docs/add_autosense_to_query_dsl	2016-05-17 11:59:50 +02:00
Adrien Grand	864ed04059	Lessen leniency of the query dsl. #18276 This change does the following: - Queries that are currently unsupported such as prefix queries on numeric fields or term queries on geo fields now throw an error rather than returning a query that does not match anything. - Fuzzy queries on numeric, date and ip fields are now unsupported: they used to create range queries, we now expect users to use range queries directly. Fuzzy, regexp and prefix queries are now only supported on text/keyword fields (including `_all`). - The `_uid` and `_id` fields do not support prefix or range queries anymore as it would prevent us to store them more efficiently in the future, eg. by using a binary encoding. Note that it is still possible to ignore these errors by using the `lenient` option of the `match` or `query_string` queries.	2016-05-16 17:37:00 +02:00
Clinton Gormley	bfc826003b	Documented fuzzy_transpositions in match query Relates to #18320	2016-05-14 11:20:04 +02:00
Christoph Büscher	a40c397c67	Don't allow `fuzziness` for `multi_match` types cross_fields, phrase and phrase_prefix Currently `fuzziness` is not supported for the `cross_fields` type of the `multi_match` query since it complicates the logic that blends the term queries that cross_fields uses internally. At the moment using this combination is silently ignored, which can lead to confusions. Instead we should throw an exception in this case. The same is true for phrase and phrase_prefix type. Closes #7764	2016-05-13 17:32:14 +02:00
Isabel Drost-Fromm	36cd69c6ac	Fix build failure	2016-05-12 13:23:06 +02:00
Isabel Drost-Fromm	0ad87b25cf	Something messed with auto-indent. Fixed now.	2016-05-12 12:58:22 +02:00
Isabel Drost-Fromm	126ff91bf6	Fix indent	2016-05-12 12:30:33 +02:00
Isabel Drost-Fromm	6d5e24726f	Fix test failures.	2016-05-12 12:29:18 +02:00

648 Commits