OpenSearch

Commit Graph

Author	SHA1	Message	Date
w-bonelli	072c0be8af	Update Fuzzy Query docs to clarify default behavior re max_expansions (#30819 ) Stating that the Fuzzy Query generates "all possible" matching terms is misleading, given that the query's default behavior is to generate a maximum of 50 matching terms. (cherry picked from commit 345a0071a2a41fd7f80ae9ef8a39a2cb4991aedd)	2018-07-30 13:19:26 -07:00
Piotr Prądzyński	4fc833b1de	Unify headers for full text queries Relates #31599	2018-06-27 10:11:14 +02:00
Piotr Prądzyński	f6c64a048d	Remove redundant 'minimum_should_match' Relates #31600	2018-06-27 10:11:07 +02:00
Sue Gallagher	b44e1c1978	[DOCS] Removed and params from MLT. Closes #28128 (#31370 )	2018-06-19 13:48:13 -07:00
Sue Gallagher	cdb486ae70	[DOCS] Added 'fail_on_unsupported_field' param to MLT. Closes #28008 (#31160 )	2018-06-08 14:41:01 -07:00
Adrien Grand	458bca11bc	Add a `feature_vector` field. (#31102 ) This field is similar to the `feature` field but is better suited to index sparse feature vectors. A use-case for this field could be to record topics associated with every document alongside a metric that quantifies how well the topic is connected to this document, and then boost queries based on the topics that the logged user is interested in. Relates #27552	2018-06-07 10:05:37 +02:00
Nirmal Chidambaram	75a676c70b	Fail `span_multi` queries that exceed boolean max clause limit (#30913 ) By default span_multi query will limit term expansions = boolean max clause. This will limit high heap usage in case of high cardinality term expansions. This applies only if top_terms_N is not used in inner multi query.	2018-06-07 09:34:39 +02:00
Adrien Grand	1af6d20efe	Fix docs build.	2018-06-05 14:55:40 +02:00
Adrien Grand	984523dda9	Clarify docs about boolean operator precedence. (#30808 ) Unfortunately, the classic queryparser does not honor the usual precedence rules of boolean operators. See https://issues.apache.org/jira/browse/LUCENE-3674.	2018-06-05 08:59:17 +02:00
Jim Ferenczi	f94a75778c	Fix index prefixes to work with span_multi (#31066 ) * Fix index prefixes to work with span_multi Text fields that use `index_prefixes` can rewrite `prefix` queries into `term` queries internally. This commit fixes the handling of this rewriting in the `span_multi` query. This change also copies the index options of the text field into the prefix field in order to be able to run positional queries. This is mandatory for `span_multi` to work but this could also be useful to optimize `match_phrase_prefix` queries in a follow up. Note that this change can only be done on indices created after 6.3 since we set the index options to doc only in this version. Fixes #31056	2018-06-04 21:48:56 +02:00
Igor Motov	cf0e0606af	Use geohash cell instead of just a corner in geo_bounding_box (#30698 ) Treats geohashes as grid cells instead of just points when the geohashes are used to specify the edges in the geo_bounding_box query. For example, if a geohash is used to specify the top_left corner, the top left corner of the geohash cell will be used as the corner of the bounding box. Closes #25154	2018-05-24 14:46:15 -04:00
Christoph Büscher	3f78b3f5e1	[Docs] Explain incomplete dates in range queries (#30689 ) The current documentation isn't very clear about how incomplete dates are treated when specifying custom formats in a `range` query. This change adds a note explaining how missing month or year coordinates translate to dates that have the missing slots filled with unix time start date (1970-01-01) Closes #30634	2018-05-24 11:20:00 +02:00
Igor Motov	4b6915976c	Add support for indexed shape routing in geo_shape query (#30760 ) Adds ability to specify the routing value for the indexed shape in the geo_shape query. Closes #7663	2018-05-23 15:15:19 -04:00
Adrien Grand	886db84ad2	Expose Lucene's FeatureField. (#30618 ) Lucene has a new `FeatureField` which gives the ability to record numeric features as term frequencies. Its main benefit is that it allows to boost queries with the values of these features and efficiently skip non-competitive documents at the same time using block-max WAND and indexed impacts.	2018-05-23 08:55:21 +02:00
Igor Motov	b30f2913cf	Docs: document precision limitations of geo_bounding_box (#30540 ) The geo_bounding_box query might produce false positives alongside the right and upper edges and false negatives alongside left and bottom edges. This commit documents the behavior and defines the maximum error. Closes #29196	2018-05-14 15:54:42 -04:00
Jason Tedor	4a4e3d70d5	Default to one shard (#30539 ) This commit changes the default out-of-the-box configuration for the number of shards from five to one. We think this will help address a common problem of oversharding. For users with time-based indices that need a different default, this can be managed with index templates. For users with non-time-based indices that find they need to re-shard with the split API in place they no longer need to resort only to reindexing. Since this has the impact of changing the default number of shards used in REST tests, we want to ensure that we still have coverage for issues that could arise from multiple shards. As such, we randomize (rarely) the default number of shards in REST tests to two. This is managed via a global index template. However, some tests check the templates that are in the cluster state during the test. Since this template is randomly there, we need a way for tests to skip adding the template used to set the number of shards to two. For this we add the default_shards feature skip. To avoid having to write our docs in a complicated way because sometimes they might be behind one shard, and sometimes they might be behind two shards we apply the default_shards feature skip to all docs tests. That is, these tests will always run with the default number of shards (one).	2018-05-14 12:22:35 -04:00
wmellouli	c8d8407012	[Docs] Add term query with normalizer example	2018-05-03 10:23:14 +02:00
Julie Tibshirani	6506edfd9c	Fix a reference to match_phrase_prefix in the match query docs. (#30282 )	2018-05-01 13:46:33 -07:00
Julie Tibshirani	b9e1a00213	Add support to match_phrase query for zero_terms_query. (#29598 )	2018-04-19 11:25:27 -07:00
Adrien Grand	4918924fae	Remove legacy mapping code. (#29224 ) Some features have been deprecated since `6.0` like the `_parent` field or the ability to have multiple types per index. This allows to remove quite some code, which in-turn will hopefully make it easier to proceed with the removal of types.	2018-04-11 09:41:37 +02:00
Fabien Baligand	199d131385	Improve query string docs (#28882 ) fix query string syntax doc when OR operator is missed	2018-03-30 16:36:40 +02:00
Fabien Baligand	437ad06e40	fix query string example for boolean query (#28881 )	2018-03-30 15:10:14 +02:00
Jim Ferenczi	c93c7f3121	Remove deprecated options for query_string (#29203 ) This commit removes some parameters deprecated in 6.x (or 5.x): `use_dismax`, `split_on_whitespace`, `all_fields` and `lowercase_expanded_terms`. Closes #25551	2018-03-22 18:37:08 +01:00
tnsatish	70f67b17dd	Fix typo in percolate-query.asciidoc (#29155 )	2018-03-20 16:47:53 +00:00
Sue Gallagher	3530a676e0	[Docs]Corrected spelling errors. (#28976 )	2018-03-19 10:22:40 -07:00
Martijn van Groningen	34a264c375	added docs for `wrapper` query. Closes #11591	2018-03-14 11:51:22 +01:00
Jim Ferenczi	48a7425ae6	Clarifies how query_string splits textual part (#28798 ) * Clarifies how the query_string splits textual part to build a query Whitespaces are not considered as operators anymore in 6x but the documentation is not clear about it. This commit changes the example in the documentation and adds a note regarding whitespaces and operators. Closes #28719	2018-03-01 15:08:25 -08:00
FUJI Goro	2baa19ea64	[Docs] Specify function score logarithm modifiers (#28821 ) The logarithm with base 10 is called "Common Logarithm".	2018-02-27 10:29:43 -08:00
Ke Li	a77273fc01	Reject regex search if regex string is too long (#28542 ) * Reject regex search if regex string is too long (#28344) * Add docs * Introduce index level setting `index.max_regex_length` to control the maximum length of the regular expression Closes #28344	2018-02-23 10:41:24 -08:00
Paul Schwarz	81eda1834b	Improve wording "... as less as possible" -> "... as little as possible"	2018-02-15 15:31:00 +00:00
Andrew Anderson	54a9249992	Fixed typo in search for wrong type (#28645 )	2018-02-13 02:47:01 -05:00
Adrien Grand	f7c4740a76	Document that highlighting `terms` queries is best-effort. (#28371 ) The `terms` query is really designed for filtering and highlighting it might cause performance issues if it wraps many terms, so I am documenting highlighting these queries as a best-effort only. Closes #28099	2018-01-31 15:03:08 +01:00
Lukas Olson	7c5619a29a	Fix spelling error	2018-01-23 12:29:11 -07:00
David Kemp	531c58cf81	Documents applicability of term query to range type (#28166 ) Closes #27030	2018-01-18 17:19:01 -05:00
Nicholas Knize	5ed25f1e12	[GEO] Add WKT Support to GeoBoundingBoxQueryBuilder Add WKT BBOX parsing support to GeoBoundingBoxQueryBuilder.	2018-01-15 13:30:51 -06:00
Gytis Šk	86bffa870b	Update fuzzy-query.asciidoc (#28032 )	2018-01-01 08:44:04 +01:00
Mayya Sharipova	dcde895f49	Introduce limit to the number of terms in Terms Query (#27968 ) - Introduce index level settings to control the maximum number of terms that can be used in a Terms Query - Throw an error if a request exceeds this max number Closes #18829	2017-12-28 17:36:29 -05:00
Vlad Holubiev	7b14e4b8e0	[DOCS] Remove extra word (#27989 )	2017-12-26 16:24:29 +00:00
Adrien Grand	1b660821a2	Allow `_doc` as a type. (#27816 ) Allowing `_doc` as a type will enable users to make the transition to 7.0 smoother since the index APIs will be `PUT index/_doc/id` and `POST index/_doc`. This also moves most of the documentation to `_doc` as a type name. Closes #27750 Closes #27751	2017-12-14 17:47:53 +01:00
Martijn van Groningen	6cda5b292c	docs: add paragraph about using `percolate` query in a filter context	2017-12-01 10:55:01 +01:00
Simon Willnauer	fadbe0de08	Automatically prepare indices for splitting (#27451 ) Today we require users to prepare their indices for split operations. Yet, we can do this automatically when an index is created which would make the split feature a much more appealing option since it doesn't have any 3rd party prerequisites anymore. This change automatically sets the number of routing shards such that an index is guaranteed to be able to split once into twice as many shards. The number of routing shards is scaled towards the default shard limit per index such that indices with a smaller amount of shards can be split more often than larger ones. For instance an index with 1 or 2 shards can be split 10x (until it approaches 1024 shards) while an index created with 128 shards can only be split 3x by a factor of 2. Please note this is just a default value and users can still prepare their indices with `index.number_of_routing_shards` for custom splitting. NOTE: this change has an impact on the document distribution since we are changing the hash space. Documents are still uniformly distributed across all shards but since we are artificially changing the number of buckets in the consistent hashing space documents might be hashed into different shards compared to previous versions. This is a 7.0 only change.	2017-11-23 09:48:54 +01:00
Jim Ferenczi	53462f6499	Make fields optional in multi_match query and rely on index.query.default_field by default (#27380 ) * Make fields optional in multi_match query and rely on index.query.default_field by default This commit adds the ability to send `multi_match` query without providing any `fields`. When no fields are provided the `multi_match` query will use the fields defined in the index setting `index.query.default_field` (which in turn defaults to ``). The same behavior is already implemented in `query_string` and `simple_query_string` so this change just applies the heuristic to `multi_match` queries. Relying on `index.query.default_field` rather than `` is safer for big mappings that break the 1024 field expansion limit added in 7.0 for all text queries. For these kinds of mappings the admin can change the `index.query.default_field` in order to make sure that exploratory queries using `multi_match`, `query_string` or `simple_query_string` do not throw an exception.	2017-11-17 10:25:21 +01:00
Martijn van Groningen	d805c41b28	Added new terms_set query This query returns documents that match with at least one or more of the provided terms. The number of terms that must match varies per document and is either controlled by a minimum should match field or computed per document in a minimum should match script. Closes #26915	2017-11-01 10:55:18 +01:00
Jim Ferenczi	792641a6e3	[Docs] #26541 : add warning regarding the limit on the number of fields that can be queried at once in the multi_match query.	2017-10-30 18:03:56 +01:00
Clarkie	b1ce5cf836	[Docs] Fix indentation of examples (#27168 )	2017-10-30 11:56:38 +01:00
Jim Ferenczi	a4105c6b4a	[Docs] Clarify `span_not` query behavior for non-overlapping matches (#27150 ) Closes #27134	2017-10-30 11:29:40 +01:00
Martijn van Groningen	f1e944a675	docs: describe parent/child performances	2017-10-26 11:49:13 +02:00
Alexander Kazakov	592ab043dd	Change default value to true for transpositions parameter of fuzzy query (#26901 )	2017-10-11 15:31:48 +02:00
Alexander Kazakov	9c95e91471	Expose `fuzzy_transpositions` parameter in fuzzy queries (#26870 ) Add fuzzy_transpositions parameter to multi_match and query_string queries. Add fuzzy_transpositions, fuzzy_prefix_length and fuzzy_max_expansions parameters to simple_query_string query.	2017-10-05 09:01:09 +02:00
Jim Ferenczi	17b9baf5fd	Clarify pure wildcard matching with `query_string` (#26814 ) In 5.x pure wildcard queries `` in `query_string` are rewritten to `exists` query for efficiency. Though this introduced a change in the documents that match such queries because `exists` query also returns documents with an empty value for the field. This change clarifies this behavior for 5.x and beyond. Closes #26801 review	2017-10-04 09:55:26 +02:00

465 Commits