OpenSearch

Commit Graph

Author	SHA1	Message	Date
Christoph Büscher	f7ea794312	[Test] Don't expect specific scores in docs tests (#54297 ) The failing suggester documentation test was expecting specific scores in the test response, which is fragile implementation details that e.g. can change with different lucene versions and generally shouldn't be done in documentation test. Instead we usually replace the float values in the output response by the ones in the actual response. Closes #54257	2020-03-27 10:27:47 +01:00
Luca Cavanna	ff269160af	Async search: rename REST parameters (#54198 ) This commit renames wait_for_completion to wait_for_completion_timeout in submit async search and get async search. Also it renames clean_on_completion to keep_on_completion and turns around its behaviour. Closes #54069	2020-03-26 09:40:50 +01:00
Luca Cavanna	6b457abbd3	Async search: prevent users from overriding pre_filter_shard_size (#54088 ) Submit async search forces pre_filter_shard_size for the underlying search that it creates. With this commit we also prevent users from overriding such default as part of request validation.	2020-03-24 17:06:04 +01:00
Jim Ferenczi	9e3f7f4575	Add heuristics to compute pre_filter_shard_size when unspecified (#53873 ) (#54007 ) This commit changes the pre_filter_shard_size default from 128 to unspecified. This allows to apply heuristics based on the request and the target indices when deciding whether the can match phase should run or not. When unspecified, this pr runs the can match phase automatically if one of these conditions is met: * The request targets more than 128 shards. * The request contains read-only indices. * The primary sort of the query targets an indexed field. Users can opt-out from this behavior by setting the `pre_filter_shard_size` to a static value. Closes #39835	2020-03-24 02:05:15 +01:00
Luca Cavanna	932a7e3112	Backport of async search changes (#53976 ) * Get Async Search: omit _clusters section when empty (#53907) The _clusters section is omitted by the search API whenever no remote clusters are searched. Async search should do the same, but Get Async Search returns a deserialized response, hence a weird `_clusters` section with all values set to `0` gets returned instead. In fact the recreated Clusters object is not the same object as the EMPTY constant, yet it has the same content. This commit addresses this by changing the comparison in the `toXContent` method to not print out the section if the number of total clusters is `0`. * Async search: remove version from response (#53960) The goal of the version field was to quickly show when you can expect to find something new in the search response, compared to when nothing has changed. This can also be done by looking at the `_shards` section and `num_reduce_phases` returned with the search response. In fact when there has been one or more additional reduction of the results, you can expect new results in the search response. Otherwise, the `_shards` section could notify of additional failures of shards that have completed the query, but that is not a guarantee that their results will be exposed (only when the following partial reduction is performed their results will be available). That said this commit clarifies this in the docs and removes the version field from the async search response * Async Search: replicas to auto expand from 0 to 1 (#53964) This way single node clusters that are green don't go yellow once async search is used, while all the others still have one replica. * [DOCS] address timing issue in async search docs tests (#53910) The docs snippets for submit async search have proven difficult to test as it is not possible to guarantee that you get a response that is not final, even when providing `wait_for_completion=0`. In the docs we want to show though a proper long-running query, and its first response should be partial rather than final. With this commit we adapt the docs snippets to show a partial response, and replace under the hood all that's needed to make the snippets tests succeed when we get a final response. Also, increased the timeout so we always get a final response. Closes #53887 Closes #53891	2020-03-23 19:13:31 +01:00
Mark Vieira	0cfe6d90cc	Mute async-search test	2020-03-20 11:35:24 -07:00
Luca Cavanna	d486bdefdd	[DOCS] correct async search note The sort optimization kicks in whenever results are sorted by field.	2020-03-20 15:58:19 +01:00
Luca Cavanna	03fca61fcb	[DOCS] add docs for async search (#53675 ) Relates to #49091 Co-Authored-By: James Rodewig <james.rodewig@elastic.co>	2020-03-20 14:46:38 +01:00
Julie Tibshirani	c33afea9fb	Small corrections to stored_fields docs. (#53247 ) * Fix a reference to the 'field' option. * Remove claim about detecting script fields. * Specify that object fields will just be ignored.	2020-03-09 10:59:17 -07:00
James Rodewig	2b59f8ac34	[DOCS] Correct `hits.total.relation` response parm def (#52847 ) Fixes a partially completed definition for the `hits.total.relation` response parameter in the search API docs.	2020-03-04 08:23:34 -05:00
Josh Devins	68ba571f70	Adds recall@k metric to rank eval API (#52889 ) This change adds the recall@k metric and refactors precision@k to match the new metric. Recall@k is an important metric to use for learning to rank (LTR) use-cases. Candidate generation or first ranking phase ranking functions are often optimized for high recall, in order to generate as many relevant candidates in the top-k as possible for a second phase of ranking. Adding this metric allows tuning that base query for LTR. See: https://github.com/elastic/elasticsearch/issues/51676 Backports: https://github.com/elastic/elasticsearch/pull/52577	2020-02-27 16:04:24 +01:00
James Rodewig	98bcf06bae	[DOCS] Correct multi search API docs (#52523 ) * Adds an example request to the top of the page. * Relocates several parameters erroneously listed under "Request body" to the appropriate "Query parameters" section. * Updates the "Request body" section to better document the NDJSON structure of msearch requests.	2020-02-24 07:43:10 -05:00
Marios Trivyzas	c03f51f68f	[Docs] Clarify default value for `allow_no_indices` (#52635 ) (#52697 ) Add default value to each one of the usages of `allow_no_indices` since it differs between different APIs. Relates to: #52534 (cherry picked from commit 2eb986488ac326d6da6ab8ad0203a94e08684a36)	2020-02-24 11:57:32 +01:00
debadair	2588022b81	[DOCS] Fixed typo. (#52071 )	2020-02-07 11:04:56 -08:00
Jess	4b31ad1c0c	[Docs] Small edits to Ranking Evaluation API docs (#51116 ) Small updates to grammar, syntax, and unclear wordings.	2020-01-20 10:30:23 +01:00
Adrien Grand	31158ab3d5	Add per-field metadata. (#50333 ) This PR adds per-field metadata that can be set in the mappings and is later returned by the field capabilities API. This metadata is completely opaque to Elasticsearch but may be used by tools that index data in Elasticsearch to communicate metadata about fields with tools that then search this data. A typical example that has been requested in the past is the ability to attach a unit to a numeric field. In order to not bloat the cluster state, Elasticsearch requires that this metadata be small: - keys can't be longer than 20 chars, - values can only be numbers or strings of no more than 50 chars - no inner arrays or objects, - the metadata can't have more than 5 keys in total. Given that metadata is opaque to Elasticsearch, field capabilities don't try to do anything smart when merging metadata about multiple indices, the union of all field metadatas is returned. Here is how the meta might look like in mappings: ```json { "properties": { "latency": { "type": "long", "meta": { "unit": "ms" } } } } ``` And then in the field capabilities response: ```json { "latency": { "long": { "searchable": true, "aggreggatable": true, "meta": { "unit": [ "ms" ] } } } } ``` When there are no conflicts, values are arrays of size 1, but when there are conflicts, Elasticsearch includes all unique values in this array, without giving ways to know which index has which metadata value: ```json { "latency": { "long": { "searchable": true, "aggreggatable": true, "meta": { "unit": [ "ms", "ns" ] } } } } ``` Closes #33267	2020-01-08 16:21:18 +01:00
James Rodewig	3f7f31b6b0	[DOCS] Fix search request body links (#50500 ) PR #44238 changed several links related to the Elasticsearch search request body API. This updates several places still using outdated links or anchors. This will ultimately let us remove some redirects related to those link changes.	2019-12-26 14:31:09 -05:00
Nik Everett	01293ebad5	Fix docs typos (#50365 ) (#50464 ) Fixes a few typos in the docs. Co-authored-by: Xiang Dai <764524258@qq.com>	2019-12-23 12:38:17 -05:00
James Rodewig	27ae9a1435	[DOCS] Remove outdated file scripts refererence (#50437 ) File scripts were removed in 6.0 with #24627. This removes an outdated file scripts reference from the conditional clauses section of the search templates docs.	2019-12-20 14:53:40 -05:00
Adrien Grand	87e72156ce	Upgrade to lucene 8.4.0-snapshot-662c455. (#50016 ) (#50039 ) Lucene 8.4 is about to be released so we should check it doesn't cause problems with Elasticsearch.	2019-12-10 18:04:58 +01:00
Mayya Sharipova	7cf170830c	Optimize sort on numeric long and date fields. (#49732 ) This rewrites long sort as a `DistanceFeatureQuery`, which can efficiently skip non-competitive blocks and segments of documents. Depending on the dataset, the speedups can be 2 - 10 times. The optimization can be disabled with setting the system property `es.search.rewrite_sort` to `false`. Optimization is skipped when an index has 50% or more data with the same value. Optimization is done through: 1. Rewriting sort as `DistanceFeatureQuery` which can efficiently skip non-competitive blocks and segments of documents. 2. Sorting segments according to the primary numeric sort field(#44021) This allows to skip non-competitive segments. 3. Using collector manager. When we optimize sort, we sort segments by their min/max value. As a collector expects to have segments in order, we can not use a single collector for sorted segments. We use collectorManager, where for every segment a dedicated collector will be created. 4. Using Lucene's shared TopFieldCollector manager This collector manager is able to exchange minimum competitive score between collectors, which allows us to efficiently skip the whole segments that don't contain competitive scores. 5. When index is force merged to a single segment, #48533 interleaving old and new segments allows for this optimization as well, as blocks with non-competitive docs can be skipped. Backport for #48804 Co-authored-by: Jim Ferenczi <jim.ferenczi@elastic.co>	2019-11-29 15:37:40 -05:00
James Rodewig	03600e4e12	[DOCS] Document `script_score` float precision limit (#49402 ) All document scores are positive 32-bit floating point numbers. However, this wasn't previously documented. This can result in surprising behavior, such as precision loss, for users when customizing scores using the function score query. This commit updates an existing admonition in the function score query docs to document the 32-bits precision limit. It also updates the search API reference docs to note that `_score` is a 32-bit float.	2019-11-21 08:54:49 -05:00
Orhan Toy	561351d2fc	[Docs] Fix _count HTTP method (#48979 )	2019-11-12 15:45:26 +01:00
Patrick Maynard	4b85498617	[DOCS] Fix typo in search type docs (#48868 )	2019-11-11 09:38:48 -05:00
Christoph Büscher	1de49d8a70	Remove Ranking Evaluation API experimental status (#48603 ) The API has been released long enough to remove the experimental status.	2019-10-29 20:57:39 +01:00
Ian Danforth	82e25c4ac7	[Docs] Fix typo in suggesters search API doc (#48477 )	2019-10-29 09:58:05 +01:00
James Rodewig	e9c8e4f6d1	[DOCS] Fix note format in index suggestion docs (#48536 )	2019-10-25 11:31:47 -04:00
Christoph Büscher	055a0800eb	[Docs] Mention reserved completion suggestion characters (#48445 ) We currently don't mention the three reserved characters anywhere. This change adds a short note mentioning them Closes #48341	2019-10-25 16:58:23 +02:00
James Rodewig	852622d970	[DOCS] Remove binary gendered language (#48362 )	2019-10-23 09:37:12 -05:00
Jim Ferenczi	dc39196ea4	Fix tag in the search request timeout option docs (#47776 ) and add missing parentheses `search_timeout` param	2019-10-10 10:35:44 +02:00
James Rodewig	c03cdb4b15	[DOCS] Correct callouts in search template docs (#47655 )	2019-10-07 09:25:32 -04:00
James Rodewig	fd421bd12d	[7.x] [DOCS] Add response body parms to search API docs (#47042 ) (#47303 )	2019-09-30 13:54:06 -04:00
István Zoltán Szabó	0ab7132c47	[DOCS] Reformats Profile API (#47168 ) * [DOCS] Reformats Profile API. * [DOCS] Fixes failing docs test.	2019-09-27 11:14:14 +02:00
István Zoltán Szabó	74fd21f0b0	[DOCS] Reformats ranking evaluation API (#46974 ) * [DOCS] Reformats ranking evaluation API. Co-Authored-By: James Rodewig <james.rodewig@elastic.co>	2019-09-25 15:01:10 +02:00
István Zoltán Szabó	83365e94ba	[DOCS] Reformat suggesters page. (#47010 )	2019-09-25 14:42:16 +02:00
István Zoltán Szabó	fcea154f2e	[DOCS] Reformats Field capabilities API (#46866 ) * [DOCS] Reformats Field capabilities API. Co-Authored-By: James Rodewig <james.rodewig@elastic.co>	2019-09-20 11:28:19 +02:00
István Zoltán Szabó	363075cf1d	[DOCS] Reformats explain API (#46857 ) * [DOCS] Reformats explain API. Co-Authored-By: James Rodewig <james.rodewig@elastic.co>	2019-09-20 11:00:33 +02:00
James Rodewig	251dbd8522	[DOCS] Remove `lowercase_terms` parm from term suggester docs (#46879 )	2019-09-19 15:56:47 -04:00
Takumasa Ochi	7a3054c5dc	Fix typos in `match` in profile API (#46723 ) * Replace `matches` with correct `match` * Use present tense consistently * Replace `metric` with correct `match`	2019-09-19 16:07:52 +02:00
István Zoltán Szabó	e59be0354a	[DOCS] Reformats validate API (#46389 ) * [DOCS] Reformats validate API. Co-Authored-By: James Rodewig <james.rodewig@elastic.co>	2019-09-18 14:31:17 +02:00
István Zoltán Szabó	595bf52927	[DOCS] Reformats count API (#46377 ) * [DOCS] Reformats count API. Co-Authored-By: James Rodewig <james.rodewig@elastic.co>	2019-09-17 09:54:19 +02:00
James Rodewig	e253ee6ba6	[DOCS] Change // CONSOLE comments to [source,console] (#46440 ) (#46494 )	2019-09-09 12:35:50 -04:00
James Rodewig	f04573f8e8	[DOCS] [5 of 5] Change // TESTRESPONSE comments to [source,console-results] (#46449 ) (#46459 )	2019-09-06 16:09:09 -04:00
James Rodewig	bb7bff5e30	[DOCS] Replace "// TESTRESPONSE" magic comments with "[source,console-result] (#46295 ) (#46418 )	2019-09-06 09:22:08 -04:00
James Rodewig	1f36c4e50c	[DOCS] Replace "// CONSOLE" comments with [source,console] (#46159 ) (#46332 )	2019-09-05 10:11:25 -04:00
István Zoltán Szabó	0f0b77b263	[DOCS] Reformats search template and multi search template APIs (#46236 ) * [DOCS] Reformats search template and multi search template APIs. Co-Authored-By: James Rodewig <james.rodewig@elastic.co>	2019-09-04 15:14:06 +02:00
István Zoltán Szabó	c71d959d61	[DOCS] Reformats search shards API (#46240 ) * [DOCS] Reformats search shards API Co-Authored-By: James Rodewig <james.rodewig@elastic.co>	2019-09-04 11:36:08 +02:00
István Zoltán Szabó	5c5af77565	[DOCS] Reformats request body search API (#46254 ) * [DOCS] Reformats request body search API. Co-Authored-By: James Rodewig <james.rodewig@elastic.co>	2019-09-04 10:53:16 +02:00
István Zoltán Szabó	f2bdd392e7	[DOCS] Reformats multi search API (#46256 ) * [DOCS] Reformats multi search API. Co-Authored-By: James Rodewig <james.rodewig@elastic.co>	2019-09-04 10:19:43 +02:00
István Zoltán Szabó	53f70ee996	[DOCS] Reformats URI search request (#45844 ) * [DOCS] Reformats URI search request. Co-Authored-By: James Rodewig <james.rodewig@elastic.co> Co-Authored-By: debadair <debadair@elastic.co>	2019-08-30 13:45:29 +02:00

1 2 3 4 5 ...

879 Commits