OpenSearch

Commit Graph

Author	SHA1	Message	Date
Jim Ferenczi	787acb14b9	Track total hits up to 10,000 by default (#37466 ) This commit changes the default for the `track_total_hits` option of the search request to `10,000`. This means that by default search requests will accurately track the total hit count up to `10,000` documents; requests that match more than this value will set the `"total.relation"` to `"gte"` (i.e. greater than or equal to) and the `"total.value"` to `10,000` in the search response. Scroll queries are not impacted; they will continue to count the total hits accurately. The default is set back to `true` (accurate hit count) if `rest_total_hits_as_int` is set in the search request. I chose `10,000` as the default because that's also the number we use to limit pagination. This means that users will be able to know how far they can jump (up to 10,000) even if the total number of hits is not accurate. Closes #33028	2019-01-25 13:45:39 +01:00
Christoph Büscher	95a6951f78	Use new bulk API endpoint in the docs (#37698 ) This change switches to using the typeless bulk API endpoint in the documentation snippets where possible	2019-01-23 09:46:28 +01:00
Boaz Leskes	52ba407931	Expose sequence number and primary terms in search responses (#37639 ) Users may require the sequence number and primary terms to perform optimistic concurrency control operations. Currently, you can get the sequence number via the `docvalues_fields` API but the primary term is not accessible because it is maintained by the `SeqNoFieldMapper` and the infrastructure can't find it. This commit adds a dedicated sub fetch phase to return both numbers that is connected to a new `seq_no_primary_term` parameter.	2019-01-23 09:01:58 +01:00
Christoph Büscher	34f2d2ec91	Remove remaining occurrences of "include_type_name=true" in docs (#37646 )	2019-01-22 15:13:52 +01:00
Christoph Büscher	3a96608b3f	Remove more include_type_name and types from docs (#37601 )	2019-01-18 14:11:18 +01:00
Christoph Büscher	25aac4f77f	Remove `include_type_name` in asciidoc where possible (#37568 ) The "include_type_name" parameter was temporarily introduced in #37285 to facilitate moving the default parameter setting to "false" in many places in the documentation code snippets. Most of the places can simply be reverted without causing errors. In this change I looked for asciidoc files that contained the "include_type_name=true" addition when creating new indices but didn't look like they made use of the "_doc" type for mappings. This is mostly the case e.g. in the analysis docs where index creation often only contains settings. I manually corrected the use of types in some places where the docs still used an explicit type name and not the dummy "_doc" type.	2019-01-18 09:34:11 +01:00
Julie Tibshirani	36a3b84fc9	Update the default for include_type_name to false. (#37285 ) * Default include_type_name to false for get and put mappings. * Default include_type_name to false for get field mappings. * Add a constant for the default include_type_name value. * Default include_type_name to false for get and put index templates. * Default include_type_name to false for create index. * Update create index calls in REST documentation to use include_type_name=true. * Some minor clean-ups around the get index API. * In REST tests, use include_type_name=true by default for index creation. * Make sure to use 'expression == false'. * Clarify the different IndexTemplateMetaData toXContent methods. * Fix FullClusterRestartIT#testSnapshotRestore. * Fix the ml_anomalies_default_mappings test. * Fix GetFieldMappingsResponseTests and GetIndexTemplateResponseTests. We make sure to specify include_type_name=true during xContent parsing, so we continue to test the legacy typed responses. XContent generation for the typeless responses is currently only covered by REST tests, but we will be adding unit test coverage for these as we implement each typeless API in the Java HLRC. This commit also refactors GetMappingsResponse to follow the same approach as the other mappings-related responses, where we read include_type_name out of the xContent params, instead of creating a second toXContent method. This gives better consistency in the response parsing code. * Fix more REST tests. * Improve some wording in the create index documentation. * Add a note about types removal in the create index docs. * Fix SmokeTestMonitoringWithSecurityIT#testHTTPExporterWithSSL. * Make sure to mention include_type_name in the REST docs for affected APIs. * Make sure to use 'expression == false' in FullClusterRestartIT. * Mention include_type_name in the REST templates docs.	2019-01-14 13:08:01 -08:00
Josh Soref	edb48321ba	[DOCS] Various spelling corrections (#37046 )	2019-01-07 14:44:12 +01:00
Jim Ferenczi	e38cf1d0dc	Add the ability to set the number of hits to track accurately (#36357 ) In Lucene 8 searches can skip non-competitive hits if the total hit count is not requested. It is also possible to track the number of hits up to a certain threshold. This is a trade off to speed up searches while still being able to know a lower bound of the total hit count. This change adds the ability to set this threshold directly in the track_total_hits search option. A boolean value (true, false) indicates whether the total hit count should be tracked in the response. When set as an integer this option allows to compute a lower bound of the total hits while preserving the ability to skip non-competitive hits when enough matches have been collected. Relates #33028	2019-01-04 20:36:49 +01:00
Abdullah DURSUN	8a02bacf76	Fix typo in multi-search.asciidoc (#37060 )	2019-01-02 10:32:42 +01:00
lcawl	504cfb2fb1	[DOCS] Adds missing anchors for profile API	2018-12-18 15:20:19 -08:00
Julie Tibshirani	87831051dc	Deprecate types in explain requests. (#35611 ) The following updates were made: - Add a new untyped endpoint `{index}/_explain/{id}`. - Add deprecation warnings to RestAction, plus tests in RestActionTests. - For each REST yml test, make sure there is one version without types, and another legacy version that retains types (called *_with_types.yml). - Deprecate relevant methods on the Java HLRC requests/ responses. - Update documentation (for both the REST API and Java HLRC).	2018-12-10 19:45:13 -08:00
Christoph Büscher	54f39d9852	[Docs] Add Profile API limitations (#36252 ) Adding some of the limitations mentioned in #29275. Closes #29275	2018-12-06 00:09:26 +01:00
Jim Ferenczi	18866c4c0b	Make hits.total an object in the search response (#35849 ) This commit changes the format of the `hits.total` in the search response to be an object with a `value` and a `relation`. The `value` indicates the number of hits that match the query and the `relation` indicates whether the number is accurate (in which case the relation is equal to `eq`) or a lower bound of the total (in which case it is equal to `gte`). This change also adds a parameter called `rest_total_hits_as_int` that can be used in the search APIs to opt out from this change (retrieve the total hits as a number in the rest response). Note that currently all search responses are accurate (`track_total_hits: true`) or they don't contain `hits.total` (`track_total_hits: false`). We'll add a way to get a lower bound of the total hits in a follow up (to allow numbers to be passed to `track_total_hits`). Relates #33028	2018-12-05 19:49:06 +01:00
Alan Woodward	73ceaad03a	Update to lucene-8.0.0-snapshot-c78429a554 (#36212 ) Includes: * A fix for a bug in Intervals.or() (https://issues.apache.org/jira/browse/LUCENE-8586) * The ability to disable offset mangling in WordDelimiterGraphFilter (https://issues.apache.org/jira/browse/LUCENE-8509) * BM25Similarity no longer multiplies scores by k1 + 1	2018-12-05 12:43:56 +00:00
João Barbosa	d27aa72b17	Added soft limit to open scroll contexts #25244 (#36009 ) This change adds a soft limit to open scroll contexts that can be controlled with the dynamic cluster setting `search.max_open_scroll_context` (defaults to 500).	2018-12-03 19:57:10 +01:00
patrykk21	bb2cf7e6be	[Docs] Clarify search_after behavior Closes #34232	2018-11-30 14:30:23 +01:00
Christoph Büscher	33865211db	[Docs] Emphasize suggest behaviour with missing query part (#35393 ) Add a short extra sentence that explains that a missing query part in a search request containing a "suggest" section will mean only suggestions are returned. Closes #31640	2018-11-28 12:01:27 +01:00
Julie Tibshirani	c6a0904e0e	Deprecate types in count and msearch. (#35421 ) * Deprecate types in count requests. * Move RestCountAction to the 'search' package. * Deprecate types in multi search requests. * Add tests for types deprecation in the _search endpoint.	2018-11-16 13:04:43 -08:00
Julie Tibshirani	40ba4de5e6	Deprecate types in validate query requests. (#35575 )	2018-11-16 08:59:04 -08:00
Jim Ferenczi	72504c2512	Do not recommend to use the _id field in search_after docs (#35370 ) The documentation of `search_after` recommends to use the `_id` field as a tiebreaker for the sort without warning against the additional memory required. This change updates the recommendation to use a copy of the `_id` field with doc_values enabled.	2018-11-14 10:50:31 +01:00
Jeff Hajewski	d00b23c8b1	Fixes fast vector highlighter docs per issue 24318. (#34190 ) The `fvh` highlighter does not support span queries. This fix updates the docs to add a warning stating the lack of span query support for `fvh`.	2018-11-08 11:09:03 +01:00
lipsill	6df1c9e818	Deprecate `_source_include` and `_source_exclude` url parameters (#33475 ) Deprecates `_source_include` and `_source_exclude` url parameters in favor of `_source_includes` and `_source_excludes` because those are consistent with the rest of Elasticsearch's APIs. Relates to #22792	2018-10-29 12:06:38 -04:00
Stéphane Campinas	27c4d63340	document the search context is freed if the scroll is not extended (#34739 ) The `fetchPhaseShouldFreeContext` returns true when there is a scroll context but the scroll parameter is null, thus freeing the search context. `183c32d4c3/server/src/main/java/org/elasticsearch/search/SearchService.java (L491)`	2018-10-25 16:49:08 -04:00
Julie Tibshirani	f854330e06	Make sure to use the type _doc in the REST documentation. (#34662 ) * Replace custom type names with _doc in REST examples. * Avoid using two mapping types in the percolator docs. * Rename doc -> _doc in the main repository README. * Also replace some custom type names in the HLRC docs.	2018-10-22 11:54:04 -07:00
Julie Tibshirani	67652b5355	Remove references to multiple types in the search documentation. (#34625 )	2018-10-19 09:47:34 -07:00
eray	daf88335d7	Add max_children limit to nested sort (#33587 ) Add an option to `nested` sort to limit the number of children to visit when picking the sort value of the root document. Closes #33592	2018-10-05 12:02:47 +02:00
Tim Heckel	3928921a1d	[DOCS] Update scroll.asciidoc (#32530 )	2018-09-18 17:00:22 +02:00
Dan Tennery-Spalding	3596512e6a	[DOCS] Corrected several grammar errors (#33781 )	2018-09-18 16:46:22 +02:00
Jim Ferenczi	4561c5ee83	Clarify context suggestions filtering and boosting (#33601 ) This change clarifies the documentation of the context completion suggester regarding filtering and boosting with contexts. Unlike the suggester v1, filtering on multiple contexts works as a disjunction, a suggestion matches if it contains at least one of the provided context values and boosting selects the maximum score among the matching contexts. This commit also adapts an old test that was written for the v1 suggester and commented out for version 2 because the behavior changed.	2018-09-12 08:47:32 +02:00
Jim Ferenczi	7ad71f906a	Upgrade to a Lucene 8 snapshot (#33310 ) The main benefit of the upgrade for users is the search optimization for top scored documents when the total hit count is not needed. However this optimization is not activated in this change, there is another issue opened to discuss how it should be integrated smoothly. Some comments about the change: * Tests that can produce negative scores have been adapted but we need to forbid them completely: #33309 Closes #32899	2018-09-06 14:42:06 +02:00
Christoph Büscher	79db16f9bb	[Docs] Add search timeout caveats (#33354 ) Global search timeouts and timeouts specified in the search request body use the same internal mechanism as search cancellation. Therefore the same caveats apply, mostly around the responsiveness of the timeout which gets only checked by a running search on segment boundaries by default. Closes #31263	2018-09-03 20:56:05 +02:00
Jim Ferenczi	713c07e14d	Add early termination support to BucketCollector (#33279 ) This commit adds the support to early terminate the collection of a leaf in the aggregation framework. This change introduces a MultiBucketCollector which handles CollectionTerminatedException exactly like the Lucene MultiCollector. Any aggregator can now throw a CollectionTerminatedException without stopping the collection of a sibling aggregator. This is useful for aggregators that can infer their result without visiting all documents (e.g.: a min/max aggregation on a match_all query).	2018-09-03 09:34:35 +02:00
lipsill	b7c0d2830a	[Docs] Remove repeating words (#33087 )	2018-08-28 13:16:43 +02:00
Ignacio Vera	d7219c05a2	Search: Support of wildcard on docvalue_fields (#32980 ) * Search: Support of wildcard on docvalue_fields For consistency with stored_fields, docvalue_fields should support the use of wildcards. Documentation of doc values fields is updated accordingly. See also: #26390 Closes #26299	2018-08-23 10:04:00 +02:00
Luca Cavanna	393eec1482	Set maxScore for empty TopDocs to NaN rather than 0 (#32938 ) We used to set `maxScore` to `0` within `TopDocs` in situations where there is really no score as the size was set to `0` and scores were not even tracked. In such scenarios, `Float.NaN` is more appropriate, which gets converted to `max_score: null` on the REST layer. That's also more consistent with Lucene, which sets `maxScore` to `Float.NaN` when merging empty `TopDocs` (see `TopDocs#merge`).	2018-08-22 17:23:54 +02:00
Simon Willnauer	ffb1a5d5b7	Expose `max_concurrent_shard_requests` in `_msearch` (#33016 ) Today `_msearch` doesn't allow modifying the `max_concurrent_shard_requests` per sub search request. This change adds support for setting this parameter on all sub-search requests in an `_msearch`. Relates to #31877	2018-08-22 08:45:08 +02:00
markharwood	70d80a3d09	Docs enhancement: added reference to cluster-level setting `search.default_allow_partial_results` (#32810 ) Closes #32809	2018-08-16 10:21:37 +01:00
Christoph Büscher	c1cc0cef61	Add ERR to ranking evaluation documentation (#32314 ) This change adds a section about the Expected Reciprocal Rank metric (ERR) to the Ranking Evaluation documentation.	2018-07-24 19:58:34 +02:00
Christoph Büscher	fe6bb75eb4	Rename ranking evaluation `quality_level` to `metric_score` (#32168 ) The notion of "quality" is an overloaded term in the search ranking evaluation context. It's usually used to describe certain levels of "good" vs. "bad" for a search result with respect to the user's information need. We currently report the result of the ranking evaluation as `quality_level`, which is a bit misleading. This changes the response parameter name to `metric_score`, which fits better.	2018-07-23 22:25:02 +02:00
Christoph Büscher	5cbd9ad177	Rename ranking evaluation response section (#32166 ) Currently the ranking evaluation response contains a 'unknown_docs' section for each search use case in the evaluation set. It contains document ids for results in the search hits that currently don't have a quality rating. This change renames it to `unrated_docs`, which better reflects its purpose.	2018-07-20 11:43:46 +02:00
David Turner	380b45b965	Improve docs for search preferences (#32159 ) Today it is unclear what guarantees are offered by the search preference feature, and we claim a guarantee that is stronger than what we really offer: > A custom value will be used to guarantee that the same shards will be used > for the same custom value. This commit clarifies this documentation. Forward-port of #32098 to `master`.	2018-07-18 12:58:17 +01:00
Mayya Sharipova	80492cacfc	Add second level of field collapsing (#31808 ) * Put second level collapse under inner_hits Closes #24855	2018-07-13 11:40:03 -04:00
Christoph Büscher	450a450b2c	[Docs] Clarify accepted sort case (#31605 ) Rescore only works with an explicit "sort" element if it is on descending "_score". Even using "order" : "asc" will throw an error.	2018-07-06 10:11:36 +02:00
Christoph Büscher	5f87a84bef	[Docs] Correct default window_size (#31582 )	2018-07-04 14:07:20 +02:00
Julie Tibshirani	26a927a120	Fix a formatting issue in the docvalue_fields documentation. (#31563 )	2018-06-26 10:15:56 -07:00
Igor Motov	7a9d9b0abf	Add support for ignore_unmapped to geo sort (#31153 ) Adds support for `ignore_unmapped` parameter in geo distance sorting, which is functionally equivalent to specifying an `unmapped_type` in the field sort. Closes #28152	2018-06-07 11:11:13 -04:00
Jim Ferenczi	0f5e570184	Deprecates indexing and querying a context completion field without context (#30712 ) This change deprecates completion queries and documents without context that target a context enabled completion field. Querying without context degrades the search performance considerably (even when the number of indexed contexts is low). This commit targets master but the deprecation will take place in 6.x and the functionality will be removed in 7 in a follow up. Closes #29222	2018-05-31 16:09:48 +02:00
Adrien Grand	a19df4ab3b	Add a `format` option to `docvalue_fields`. (#29639 ) This commit adds the ability to configure how a docvalue field should be formatted, so that it would be possible eg. to return a date field formatted as the number of milliseconds since Epoch. Closes #27740	2018-05-23 14:39:04 +02:00
Fernando Medina Corey	739bb4f0ec	Fix a grammatical error in the 'search types' documentation. Simple grammatical fix.	2018-05-22 22:09:04 -07:00
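
The `hits.total` change described in commits 18866c4c0b and 787acb14b9 above can be sketched from a client's point of view. This is a minimal illustration, not code from the repository: the helper names `describe_total_hits` and `format_total` are hypothetical, and the sample values are made up. It assumes only what the commit messages state: `hits.total` is an object with a `value` and a `relation` (`eq` or `gte`), it is a plain number when `rest_total_hits_as_int` is set, and it is absent when total hits are not tracked.

```python
from typing import Optional, Tuple, Union


def describe_total_hits(total: Union[int, dict, None]) -> Tuple[Optional[int], str]:
    """Normalize a `hits.total` value into a (count, relation) pair.

    Hypothetical helper for illustration only.
    """
    if total is None:
        # No `hits.total` in the response: total hits were not tracked.
        return None, "untracked"
    if isinstance(total, int):
        # Legacy integer form (`rest_total_hits_as_int`): always an accurate count.
        return total, "eq"
    # Object form: `value` plus a `relation` of "eq" or "gte".
    return total["value"], total["relation"]


def format_total(total: Union[int, dict, None]) -> str:
    """Render the total hit count as a short human-readable string."""
    count, relation = describe_total_hits(total)
    if count is None:
        return "total hits not tracked"
    if relation == "gte":
        # Lower bound only: the query matched at least this many documents.
        return f"at least {count} hits"
    return f"exactly {count} hits"


if __name__ == "__main__":
    print(format_total({"value": 10000, "relation": "gte"}))
    print(format_total({"value": 42, "relation": "eq"}))
    print(format_total(42))
    print(format_total(None))
```

Treating a missing `hits.total` as "untracked" rather than zero keeps a caller from mistaking an untracked count for an empty result set.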

782 Commits