OpenSearch

mirror of https://github.com/honeymoose/OpenSearch.git synced 2025-03-25 01:19:02 +00:00

Author	SHA1	Message	Date
Jim Ferenczi	c3784326eb	Refactor field expansion for match, multi_match and query_string query (#25726 ) This commit changes the way we handle field expansion in `match`, `multi_match` and `query_string` query. The main changes are: - For exact field name, the new behavior is to rewrite to a matchnodocs query when the field name is not found in the mapping. - For partial field names (with `` suffix), the expansion is done only on `keyword`, `text`, `date`, `ip` and `number` field types. Other field types are simply ignored. - For all fields (``), the expansion is done on accepted field types only (see above) and metadata fields are also filtered. - The `` notation can also be used to set `default_field` option on`query_string` query. This should replace the needs for the extra option `use_all_fields` which is deprecated in this change. This commit also rewrites simple `` query to matchalldocs query when all fields are requested (Fixes #25556). The same change should be done on `simple_query_string` for completeness. `use_all_fields` option in `query_string` is also deprecated in this change, `default_field` should be set to `*` instead. Relates #25551	2017-07-21 16:52:57 +02:00
Adrien Grand	f1ff7f2454	Require a field when a `seed` is provided to the `random_score` function. (#25594 ) We currently use fielddata on the `_id` field which is trappy, especially as we do it implicitly. This changes the `random_score` function to use doc ids when no seed is provided and to suggest a field when a seed is provided. For now the change only emits a deprecation warning when no field is supplied but this should be replaced by a strict check on 7.0. Closes #25240	2017-07-19 14:11:15 +02:00
Simon Willnauer	cb4eebcd6a	Make `index` in TermsLookup mandatory (#25753 ) This change removes the leniency of having a `null` index to fetch terms from in 6.0 onwards. This feature will be deprecated in the 5.x series and 6.0 nodes will require the index to be set. Closes #25750	2017-07-17 18:50:30 +02:00
Martijn van Groningen	c8777c4c2e	docs: Updated reference docs that `document_type` is deprecated	2017-07-14 11:07:46 +02:00
Jim Ferenczi	13da3eb53e	Refactor QueryStringQuery for 6.0 (#25646 ) This change refactors the query_string query to analyze the query text around logical operators of the query string the same way than a match_query/multi_match_query. It also adds a type parameter that can be used to change the way multi fields query are built the same way than a multi_match query does. Now that these queries share the same behavior regarding text analysis, some parameters are obsolete and have been deprecated: split_on_whitespace: This setting is now ignored with a deprecation notice if it is used explicitely. With this PR The query_string always splits on logical operator. It simplifies the understanding of the other parameters that can have different meanings depending on the value of split_on_whitespace. auto_generate_phrase_queries: This setting is now ignored with a deprecation notice if it is used explicitely. This setting only makes sense when the parser splits on whitespace. use_dismax: This setting is now ignored with a deprecation notice if it is used explicitely. The tie_breaker parameter is sufficient to handle best_fields/most_fields. Fixes #25574	2017-07-13 15:32:17 +02:00
Simon Willnauer	e81804cfa4	Add a shard filter search phase to pre-filter shards based on query rewriting (#25658 ) Today if we search across a large amount of shards we hit every shard. Yet, it's quite common to search across an index pattern for time based indices but filtering will exclude all results outside a certain time range ie. `now-3d`. While the search can potentially hit hundreds of shards the majority of the shards might yield 0 results since there is not document that is within this date range. Kibana for instance does this regularly but used `_field_stats` to optimize the indexes they need to query. Now with the deprecation of `_field_stats` and it's upcoming removal a single dashboard in kibana can potentially turn into searches hitting hundreds or thousands of shards and that can easily cause search rejections even though the most of the requests are very likely super cheap and only need a query rewriting to early terminate with 0 results. This change adds a pre-filter phase for searches that can, if the number of shards are higher than a the `pre_filter_shard_size` threshold (defaults to 128 shards), fan out to the shards and check if the query can potentially match any documents at all. While false positives are possible, a negative response means that no matches are possible. These requests are not subject to rejection and can greatly reduce the number of shards a request needs to hit. The approach here is preferable to the kibana approach with field stats since it correctly handles aliases and uses the correct threadpools to execute these requests. Further it's completely transparent to the user and improves scalability of elasticsearch in general on large clusters.	2017-07-12 22:19:20 +02:00
olcbean	2ba9fd2aec	Remove deprecated created and found from index, delete and bulk (#25516 ) The created and found fields in index and delete responses became obsolete after the introduction of the result field in index, update and delete responses (#19566). After deprecating the created and found fields in 5.x (#19633), now they are removed. Fixes #19630	2017-07-07 13:58:46 -04:00
Clinton Gormley	0170e0e8d3	Remove usage of multi-types from the docs and added a page explaining type removal (#25543 ) Closes #25401	2017-07-05 12:30:19 +02:00
dkimdon	fdb3a97152	Update percolate-query.asciidoc (#25364 )	2017-06-23 10:39:57 +02:00
Martijn van Groningen	a977569085	percolator: Deprecate `document_type` parameter. The `document_type` parameter is no longer required to be specified, because by default from 6.0 only a single type is allowed. (`index.mapping.single_type` defaults to `true`)	2017-06-22 09:55:06 +02:00
Adrien Grand	0c117145f6	Upgrade to lucene-7.0.0-snapshot-92b1783. (#25222 ) This snapshot has faster range queries on range fields (LUCENE-7828), more accurate norms (LUCENE-7730) and the ability to use fake term frequencies (LUCENE-7854).	2017-06-15 09:52:07 +02:00
Ryan Ernst	a03b6c2fa5	Scripting: Change keys for inline/stored scripts to source/id (#25127 ) This commit adds back "id" as the key within a script to specify a stored script (which with file scripts now gone is no longer ambiguous). It also adds "source" as a replacement for "code". This is in an attempt to normalize how scripts are specified across both put stored scripts and script usages, including search template requests. This also deprecates the old inline/stored keys.	2017-06-09 08:29:25 -07:00
Andrey Groshev	e4fd8485ce	Made the same length of opening and closing lines (#23583 )	2017-06-09 00:50:43 -07:00
David Cho-Lerat	491dc1186a	Add missing word to terms-query.asciidoc (#24960 )	2017-05-30 09:42:07 -04:00
David Cho-Lerat	c939bcb7f5	Correct some spelling in match-phrase-prefix docs (#24956 )	2017-05-30 09:02:01 -04:00
Martijn van Groningen	840da4aebf	Removed deprecated template query. Relates to #19390	2017-05-11 14:56:45 +02:00
Adrien Grand	1be2800120	Only allow one type on 7.0 indices (#24317 ) This adds the `index.mapping.single_type` setting, which enforces that indices have at most one type when it is true. The default value is true for 6.0+ indices and false for old indices. Relates #15613	2017-04-27 08:43:20 +02:00
Nik Everett	416feeb7f9	Rewrite description of `bool`'s `should` (#24342 ) Docs: rewrite description of `bool`'s `should` Rewrites the description of the `bool` query's `should` clauses so it is (hopefully) more clear what the defaults for `minimum_should_match` are. There is still an `[IMPORTANT]` section about `minimum_should_match` in a filter context. I think it is worth keeping because it is, well, important. Closes #23831	2017-04-26 14:09:26 -04:00
Jason Tedor	4796557a30	Add primary term to doc write response This commit adds the primary term to the doc write response. Relates #24171	2017-04-19 14:44:22 -04:00
Adrien Grand	4632661bc7	Upgrade to a Lucene 7 snapshot (#24089 ) We want to upgrade to Lucene 7 ahead of time in order to be able to check whether it causes any trouble to Elasticsearch before Lucene 7.0 gets released. From a user perspective, the main benefit of this upgrade is the enhanced support for sparse fields, whose resource consumption is now function of the number of docs that have a value rather than the total number of docs in the index. Some notes about the change: - it includes the deprecation of the `disable_coord` parameter of the `bool` and `common_terms` queries: Lucene has removed support for coord factors - it includes the deprecation of the `index.similarity.base` expert setting, since it was only useful to configure coords and query norms, which have both been removed - two tests have been marked with `@AwaitsFix` because of #23966, which we intend to address after the merge	2017-04-18 15:17:21 +02:00
Nik Everett	048191ceb6	CONSOLEify highlighting a function_score docs Converts many of the partial examples into full search requests. Relates #18160	2017-04-06 08:13:56 -04:00
Nik Everett	653f50973a	CONSOLEify geo-shape docs `CONSOLE`ify geo-shape type and geo-shape query docs. Relates to #18160	2017-03-31 09:11:54 -04:00
Nik Everett	9abb125417	Fix exists query doc I managed to push the last one without testing it because I'd changed the way I run tests locally and hadn't picked it up. Ooops. This one works better.	2017-03-30 22:26:10 -04:00
Nik Everett	bc33753aee	Mark exists-query dsl doc properly All the docs for the `exists` query that aren't marked as `CONSOLE` aren't actually `CONSOLE`-worthy so this marks them as `NOTCONSOLE`. It also rewrites the text around `missing` query. Since it was removed in 5.0 we don't need to talk about it in the 6.0 docs. Relates to #18160	2017-03-30 22:01:07 -04:00
Pavel Chertorogov	ff1530592e	Docs: Fix indentation in has-child-query.asciidoc (#23565 )	2017-03-13 08:41:18 -07:00
Pavel Chertorogov	5da7cefbe2	Docs: Fix indentation in has-parent-query.asciidoc	2017-03-13 08:17:11 -07:00
AlexNodex	139eb69fe4	Typo (#23179 ) autoGeneratePhraseQueries should be auto_generate_phrase_queries	2017-02-15 10:10:06 +01:00
Catherine Snow	51bad4300c	Fix typo (#23171 )	2017-02-15 09:38:10 +01:00
Giuseppe	ecbeffcb1e	Add note about min_score filtering efficiency (#23109 ) * Add note about min_score filtering efficiency * Reword to mention 'HAVING' * Remove reference to HAVING	2017-02-13 12:15:01 +01:00
Nik Everett	0e98c9107a	Docs: CONSOLEify some more docs These need to be CONSOLEified now because we're starting to require Content-Type headers and they didn't have any. * cluster/reroute: Marked as CONSOLE but skipped because the docs build runs with a single node. * docs/bulk: Marked as NOTCONSOLE because the snippets describe either examples or `curl` commands. Fixed the `curl` command to include the `Content-Type` header. * query-dsl/terms-query: Marked as CONSOLE. * search/request/rescore: Marked as CONSOLE. Fixed deprecated syntax. Relates #23001 Relates #18160	2017-02-07 16:49:01 -05:00
Nicholas Knize	bc884c1e7b	[Docs] Remove ignore_malformed from Geo Query DSL docs This commit removes the ignore_malformed parameter from the Geo Query DSL documentation.	2017-02-06 14:27:15 -06:00
Nicholas Knize	b41d5747f0	Reduce GeoDistance insanity GeoDistance query, sort, and scripts make use of a crazy GeoDistance enum for handling 4 different ways of computing geo distance: SLOPPY_ARC, ARC, FACTOR, and PLANE. Only two of these are necessary: ARC, PLANE. This commit removes SLOPPY_ARC, and FACTOR and cleans up the way Geo distance is computed.	2017-02-02 12:39:42 -06:00
Nik Everett	f90051e6e0	Docs: Add a note about `<` and `>` in query_string `<` and `>` can't be escaped at all in `query_string`. If we're not going to fix that we should at least document it. Relates to #21703	2017-01-31 12:23:18 -05:00
William Webber	f1a902865f	Update span-multi-term-query.asciidoc (#22733 ) "term" is not actually a multi-term query (perhaps confusion with "term range")	2017-01-23 17:33:40 +01:00
William Webber	abaf728882	"from" => "gte", "to" => "lte" in bool example (#22735 )	2017-01-23 17:29:00 +01:00
Francesc Gil	17342c403f	Indentation error on example of dist_max (#22578 ) There was a problem with the indentation on the example of the `dist_max` query	2017-01-12 09:38:36 +01:00
Lee Hinman	66cf3d3220	Document simple_query_string negation with default_operator of OR This can be confusing when unexpected. Resolves #4707	2017-01-10 10:27:00 -07:00
Jake	d7cc6e28e7	Document `must_not` context and scoring (#22532 ) Document that `must_not` uses filter context and returns a score of `0`.	2017-01-10 17:26:48 +01:00
Nik Everett	75d5b3d9eb	Fix parent_id example in docs And fix some indentation I noticed while looking up the query.	2017-01-10 10:01:31 -05:00
Clinton Gormley	3999e5ba6b	Docs: Added link from bool and constant score query to filter context Closes #22353	2016-12-29 11:05:28 +01:00
Adrien Grand	b2e93d2870	Be explicit about the fact backslashes need to be escaped. (#22257 ) Relates #22255	2016-12-19 14:21:21 +01:00
Luca Cavanna	73cf002293	Un-deprecate fuzzy query (#22088 ) When we decided to deprecate and remove fuzzy query in #15760, we didn't realize we would take away the possibililty for uses to use a fuzzy query as part of a span query, which is not possible using match query. This means we have to go back and un-deprecate fuzzy query, which will not be removed. Closes #15760	2016-12-12 12:09:16 +01:00
Matias Anaya	beb794cb0f	Fix typo in percolated-query.asciidoc (#21991 )	2016-12-09 13:45:57 +01:00
Luca Cavanna	103984a4a1	Remove indices query (#21837 ) The indices query is deprecated since 5.0.0 (#17710). It can now be removed in master (future 6.0 version).	2016-11-30 19:37:01 +01:00
Adrien Grand	eed5de20e0	Remove docs for the removed `geo_distance_range` query.	2016-11-30 16:36:55 +01:00
Adrien Grand	90ab477f19	The `terms` query should always map to a Lucene `TermsQuery`. (#21786 ) Currently, the `terms` query is just syctactic sugar for a `bool` query when used in a query context. This change proposes to always generate the same query in query and filter contexts, which is less confusing.	2016-11-30 15:29:09 +01:00
Luca Cavanna	f253621feb	Remove deprecated query names: in, geo_bbox, mlt, fuzzy_match and match_fuzzy (#21852 ) These query names were all deprecated in 5.0.0: - in is removed in favour of terms - geo_bbox is removed in favour of geo_bounding_box - mlt is removed in favour of more_like_this - fuzzy_match and match_fuzzy are removed in favour of match	2016-11-29 19:07:01 +01:00
Jim Ferenczi	d791ddf704	Upgrade to lucene-6.4.0-snapshot-ec38570 (#21853 ) Set lucene version to 6.4.0-snapshot-ec38570 and update all the sha1s/license Fix invalid combo after upgrade in query_string query. split_on_whitespace=false is disallowed if auto_generate_phrase_queries=true Adapt the expectations of some tests to the new format of the Lucene explain output	2016-11-29 18:40:31 +01:00
Clinton Gormley	5ae6845d4d	Update percolate-query.asciidoc Add missing callout to percolate query	2016-11-26 12:35:33 +01:00
Trey Tacon	3ef7f0dec6	Fixing indentation in geospatial querying example. (#21682 ) Specifically the example which shows providing an array of an array of values.	2016-11-21 13:09:21 +01:00

1 2 3 4 5 ...

401 Commits