OpenSearch

Commit Graph

Author	SHA1	Message	Date
Jim Ferenczi	e38cf1d0dc	Add the ability to set the number of hits to track accurately (#36357 ) In Lucene 8 searches can skip non-competitive hits if the total hit count is not requested. It is also possible to track the number of hits up to a certain threshold. This is a trade off to speed up searches while still being able to know a lower bound of the total hit count. This change adds the ability to set this threshold directly in the track_total_hits search option. A boolean value (true, false) indicates whether the total hit count should be tracked in the response. When set as an integer this option allows to compute a lower bound of the total hits while preserving the ability to skip non-competitive hits when enough matches have been collected. Relates #33028	2019-01-04 20:36:49 +01:00
Jim Ferenczi	18866c4c0b	Make hits.total an object in the search response (#35849 ) This commit changes the format of the `hits.total` in the search response to be an object with a `value` and a `relation`. The `value` indicates the number of hits that match the query and the `relation` indicates whether the number is accurate (in which case the relation is equals to `eq`) or a lower bound of the total (in which case it is equals to `gte`). This change also adds a parameter called `rest_total_hits_as_int` that can be used in the search APIs to opt out from this change (retrieve the total hits as a number in the rest response). Note that currently all search responses are accurate (`track_total_hits: true`) or they don't contain `hits.total` (`track_total_hits: true`). We'll add a way to get a lower bound of the total hits in a follow up (to allow numbers to be passed to `track_total_hits`). Relates #33028	2018-12-05 19:49:06 +01:00
Christoph Büscher	79db16f9bb	[Docs] Add search timeout caveats (#33354 ) Global search timeouts and timeouts specified in the search request body use the same internal mechanism as search cancellation. Therefore the same caveats apply, mostly around the responsiveness of the timeout which gets only checked by a running search on segment boundaries by default. Closes #31263	2018-09-03 20:56:05 +02:00
Luca Cavanna	393eec1482	Set maxScore for empty TopDocs to Nan rather than 0 (#32938 ) We used to set `maxScore` to `0` within `TopDocs` in situations where there is really no score as the size was set to `0` and scores were not even tracked. In such scenarios, `Float.Nan` is more appropriate, which gets converted to `max_score: null` on the REST layer. That's also more consistent with lucene which set `maxScore` to `Float.Nan` when merging empty `TopDocs` (see `TopDocs#merge`).	2018-08-22 17:23:54 +02:00
markharwood	70d80a3d09	Docs enhancement: added reference to cluster-level setting `search.default_allow_partial_results` (#32810 ) Closes #32809	2018-08-16 10:21:37 +01:00
Jim Ferenczi	7dc00ef1f5	Search option terminate_after does not handle post_filters and aggregations correctly (#28459 ) * Search option terminate_after does not handle post_filters and aggregations correctly This change fixes the handling of the `terminate_after` option when post_filters (or min_score) are used. `post_filter` should be applied before `terminate_after` in order to terminate the query when enough document are accepted by the post_filters. This commit also changes the type of exception thrown by `terminate_after` in order to ensure that multi collectors (aggregations) do not try to continue the collection when enough documents have been collected. Closes #28411	2018-02-12 13:36:33 +01:00
markharwood	77d2dd203e	Search - add allow_partial_search_results flag with default setting false (#28440 ) Adds allow_partial_search_results flag to search requests with default setting = true. When false, will error if search either timeouts, has partial errors or has missing shards rather than returning partial search results. A cluster-level setting provides a default for search requests with no flag. Closes #27435	2018-01-31 15:51:29 +00:00
Adrien Grand	1b660821a2	Allow `_doc` as a type. (#27816 ) Allowing `_doc` as a type will enable users to make the transition to 7.0 smoother since the index APIs will be `PUT index/_doc/id` and `POST index/_doc`. This also moves most of the documentation to `_doc` as a type name. Closes #27750 Closes #27751	2017-12-14 17:47:53 +01:00
Alexander Reelsen	483086220f	Docs: Add search response took time explanation (#26202 )	2017-08-15 08:43:26 +02:00
Simon Willnauer	e81804cfa4	Add a shard filter search phase to pre-filter shards based on query rewriting (#25658 ) Today if we search across a large amount of shards we hit every shard. Yet, it's quite common to search across an index pattern for time based indices but filtering will exclude all results outside a certain time range ie. `now-3d`. While the search can potentially hit hundreds of shards the majority of the shards might yield 0 results since there is not document that is within this date range. Kibana for instance does this regularly but used `_field_stats` to optimize the indexes they need to query. Now with the deprecation of `_field_stats` and it's upcoming removal a single dashboard in kibana can potentially turn into searches hitting hundreds or thousands of shards and that can easily cause search rejections even though the most of the requests are very likely super cheap and only need a query rewriting to early terminate with 0 results. This change adds a pre-filter phase for searches that can, if the number of shards are higher than a the `pre_filter_shard_size` threshold (defaults to 128 shards), fan out to the shards and check if the query can potentially match any documents at all. While false positives are possible, a negative response means that no matches are possible. These requests are not subject to rejection and can greatly reduce the number of shards a request needs to hit. The approach here is preferable to the kibana approach with field stats since it correctly handles aliases and uses the correct threadpools to execute these requests. Further it's completely transparent to the user and improves scalability of elasticsearch in general on large clusters.	2017-07-12 22:19:20 +02:00
Clinton Gormley	582b3c06b6	Added docs for batched_reduce_size Relates to #23288	2017-05-02 14:25:03 +02:00
Clinton Gormley	938f5194ef	Include field-collapsing docs in request-body search	2017-01-30 11:47:12 +01:00
Nik Everett	2d568ece2d	CONSOLEify some search docs * search/search.asciidoc * search/request-body.asciidoc * search/explain.asciidoc * search/search-shards.asciidoc	2016-09-14 13:06:37 -04:00
Isabel Drost-Fromm	672ffb6e4d	Revert "Add console to docs for inner hits, explain, and friends"	2016-08-01 14:09:54 +02:00
Isabel Drost-Fromm	00a8516780	Merge branch 'master' into docs/add_console_to_search	2016-07-25 11:54:26 +02:00
Jim Ferenczi	afe99fcdcd	Restore reverted change now that alpha4 is out: Rename `fields` to `stored_fields` and add `docvalue_fields` `stored_fields` parameter will no longer try to retrieve fields from the _source but will only return stored fields. `fields` will throw an exception if the user uses it. Add `docvalue_fields` as an adjunct to `fielddata_fields` which is deprecated. `docvalue_fields` will try to load the value from the docvalue and fallback to fielddata cache if docvalues are not enabled on that field. Closes #18943	2016-07-04 10:39:49 +02:00
Isabel Drost-Fromm	9e155e48a5	Fix request-body search test.	2016-06-29 11:32:27 +02:00
Isabel Drost-Fromm	9f30ae3359	Merge branch 'master' into docs/add_console_to_search	2016-06-29 10:20:25 +02:00
Jim Ferenczi	eb1e231a63	Revert "Rename `fields` to `stored_fields` and add `docvalue_fields`" This reverts commit `2f46f53dc8`.	2016-06-27 17:20:32 +02:00
Jim Ferenczi	2f46f53dc8	Rename `fields` to `stored_fields` and add `docvalue_fields` `stored_fields` parameter will no longer try to retrieve fields from the _source but will only return stored fields. `fields` will throw an exception if the user uses it. Add `docvalue_fields` as an adjunct to `fielddata_fields` which is deprecated. `docvalue_fields` will try to load the value from the docvalue and fallback to fielddata cache if docvalues are not enabled on that field. Closes #18943	2016-06-22 17:38:30 +02:00
Vladimir Kovpak	d5f71f9e85	Updated from parameter description. (#18852 ) Not sure that my description better but origin description looks very weird, and i try to make emphasize to offset...	2016-06-14 14:33:15 +02:00
Isabel Drost-Fromm	fcd6a08366	Add CONSOLE annotations to request body docs	2016-05-19 15:11:43 +02:00
Jim Ferenczi	9a203dcb66	Fix wrong include in docs	2016-03-04 12:44:28 +01:00
Jim Ferenczi	927303e7a9	Change the field mapping index time boost into a query time boost. Index time boost will still be applied for indices created before 5.0.0.	2016-03-04 11:47:35 +01:00
Jim Ferenczi	aea7660e37	Add search_after parameter in the Search API. The search_after parameter provides a way to efficiently paginate from one page to the next. This parameter accepts an array of sort values, those values are then used by the searcher to sort the top hits from the first document that is greater to the sort values. This parameter must be used in conjunction with the sort parameter, it must contain exactly the same number of values than the number of fields to sort on. NOTE: A field with one unique value per document should be used as the last element of the sort specification. Otherwise the sort order for documents that have the same sort values would be undefined. The recommended way is to use the field `_uuid` which is certain to contain one unique value for each document. Fixes #8192	2016-01-27 09:42:58 +01:00
javanna	ca980b7a83	[DOCS] document replacement for search exists Relates to #13910 Closes #14393	2015-11-09 15:05:07 +01:00
javanna	c5152c7ecb	[DOCS] terminate_after is not experimental anymore we are relying on terminate_after more and more, replaced the limit filter with it and soon it will also replace the search_exists api. At that point we should make it a stable api rather than experimental. Closes #14183	2015-10-19 13:56:42 +02:00
Adrien Grand	bd44dbe5cd	Docs: Insist that setting size=0 will help performance.	2015-09-03 09:36:34 +02:00
Adrien Grand	6fa258b8fa	Deprecate the `scan` search type. This commit deprecates the `scan` search type in favour of regular scroll requests sorted by `_doc`. Related to #12983	2015-08-20 12:47:23 +02:00
Martijn van Groningen	a14913f7b6	Left over from the `query_cache` to `request_cache` rename.	2015-07-27 13:28:15 +02:00
Adrien Grand	38f5cc236a	Rename caches. In order to be more consistent with what they do, the query cache has been renamed to request cache and the filter cache has been renamed to query cache. A known issue is that package/logger names do no longer match settings names, please speak up if you think this is an issue. Here are the settings for which I kept backward compatibility. Note that they are a bit different from what was discussed on #11569 but putting `cache` before the name of what is cached has the benefit of making these settings consistent with the fielddata cache whose size is configured by `indices.fielddata.cache.size`: * index.cache.query.enable -> index.requests.cache.enable * indices.cache.query.size -> indices.requests.cache.size * indices.cache.filter.size -> indices.queries.cache.size Close #11569	2015-06-29 10:15:27 +02:00
Clinton Gormley	f123a53d72	Docs: Refactored modules and index modules sections	2015-06-22 23:49:45 +02:00
aleph-zero	2b483cc806	Removed reference to search type 'count' Removed reference to search type 'count' as this is now a deprecated search type.	2015-05-04 14:48:40 -07:00
aleph-zero	1d60f34944	Remove all doc references to (dfs_)query_and_fetch Removes references to (dfs_)query_and_fetch as possible ‘search_type’ parameters for the REST API.	2015-04-28 15:57:46 -07:00
Clinton Gormley	37ed61807f	Docs: Updated the experimental annotations in the docs as follows: * Removed the docs for `index.compound_format` and `index.compound_on_flush` - these are expert settings which should probably be removed (see https://github.com/elastic/elasticsearch/issues/10778) * Removed the docs for `index.index_concurrency` - another expert setting * Labelled the segments verbose output as experimental * Marked the `compression`, `precision_threshold` and `rehash` options as experimental in the cardinality and percentile aggs * Improved the experimental text on `significant_terms`, `execution_hint` in the terms agg, and `terminate_after` param on count and search * Removed the experimental flag on the `geobounds` agg * Marked the settings in the `merge` and `store` modules as experimental, rather than the modules themselves Closes #10782	2015-04-26 18:49:15 +02:00
Adrien Grand	a608db122d	Search: Remove the `count` search type. This commit brings the benefits of the `count` search type to search requests that have a `size` of 0: - a single round-trip to shards (no fetch phase) - ability to use the query cache Since `count` now provides no benefits over `query_then_fetch`, it has been deprecated. Close #7630	2015-03-31 11:31:49 +02:00
Adrien Grand	95f46f1212	Docs: Use the new experimental annotation. We now have a very useful annotation to mark features or parameters as experimental. Let's use it! This commit replaces some custom text warnings with this annotation and adds this annotation to some existing features/parameters: - inner_hits (unreleased yet) - terminate_after (released in 1.4) - per-bucket doc count errors in the terms agg (released in 1.4) I also tagged with this annotation settings which should either be not needed (like the ability to evict entries from the filter cache based on time) or that are too deep into the way that Elasticsearch works like the Directory implementation or merge settings. Close #9563	2015-02-05 15:29:45 +01:00
Martijn van Groningen	d7e224da04	Added `inner_hits` feature that allows to include nested hits. Inner hits allows to embed nested inner objects, children documents or the parent document that contributed to the matching of the returned search hit as inner hits, which would otherwise be hidden. Closes #8153 Closes #3022 Closes #3152	2014-12-02 12:01:01 +01:00
Clinton Gormley	cb00d4a542	Docs: Removed all the added/deprecated tags from 1.x	2014-09-26 21:04:42 +02:00
Clinton Gormley	e7f1aa4f4f	Documented the query cache module Related to #7161 and #7167	2014-08-06 11:55:11 +02:00
Clinton Gormley	be86556946	Update request-body.asciidoc Added link from `timeout` to time-units Closes #6361	2014-07-28 11:08:59 +02:00
Clinton Gormley	88c8754a3c	Docs: Removed search-shards from request-body	2014-07-26 14:52:50 +02:00
Areek Zillur	5487c56c70	Search & Count: Add option to early terminate doc collection Allow users to control document collection termination, if a specified terminate_after number is set. Upon setting the newly added parameter, the response will include a boolean terminated_early flag, indicating if the document collection for any shard terminated early. closes #6876	2014-07-23 15:10:15 -04:00
Lee Hinman	57bee03193	[DOCS] Add /_search_shards documentation	2014-04-22 08:54:32 -06:00
Lee Hinman	2c289fb538	Add the ability to retrieve fields from field data Adds a new FetchSubPhase, FieldDataFieldsFetchSubPhase, which loads the field data cache for a field and returns an array of values for the field. Also removes `doc['<field>']` and `_source.<field>` workaround no longer needed in field name resolving. Closes #4492	2014-01-21 09:13:32 -07:00
Simon Willnauer	fa16969360	Cleanup comments and class names s/ElasticSearch/Elasticsearch * Clean up s/ElasticSearch/Elasticsearch on docs/* * Clean up s/ElasticSearch/Elasticsearch on src/* bin/* & pom.xml * Clean up s/ElasticSearch/Elasticsearch on NOTICE.txt and README.textile Closes #4634	2014-01-07 11:21:51 +01:00
Martijn van Groningen	23d2b1ea7b	Renamed top level `filter` to `post_filter`. Closes #4119	2013-12-16 17:10:14 +01:00
Boaz Leskes	c63d8c4fb5	[Docs] Added _source filtering to documentation Relates to #3301	2013-11-26 19:16:24 +01:00
Martijn van Groningen	f6f4b5014f	Added docs for named queries. Relates to #3581	2013-09-16 11:17:01 +02:00
Clinton Gormley	822043347e	Migrated documentation into the main repo	2013-08-29 01:24:34 +02:00

50 Commits