OpenSearch

Commit Graph

Author	SHA1	Message	Date
James Rodewig	d8fe0f5c13	[DOCS] Rewrite `terms_set` query (#43060 )	2019-06-28 12:57:22 -04:00
Alan Woodward	81dbcfb268	Wildcard intervals (#43691 ) This commit adds a wildcard intervals source, similar to the prefix. It also changes the term parameter in prefix to read prefix, to bring it in to line with the pattern parameter in wildcard. Closes #43198	2019-06-28 14:04:03 +01:00
James Rodewig	74dd6e49fc	[DOCS] Rewrite boosting query (#43647 )	2019-06-28 08:35:55 -04:00
Alan Woodward	76d0edd1a4	Add prefix intervals source (#43635 ) This commit adds a prefix intervals source, allowing you to search for intervals that contain terms starting with a given prefix. The source can make use of the index_prefixes mapping option. Relates to #43198	2019-06-26 16:22:12 +01:00
James Rodewig	50eac875e4	[DOCS] Rewrite `range` query (#43282 )	2019-06-25 15:25:48 -04:00
rbayet	66693c2706	Fixing backquote in fail_on_unsupported_field (#43572 )	2019-06-25 16:34:38 +02:00
James Rodewig	014fd19abd	[DOCS] Rewrite `constant_score` query (#43374 )	2019-06-21 12:04:00 -04:00
James Rodewig	359b103f87	[DOCS] Rewrite term-level queries overview (#43337 )	2019-06-21 11:55:02 -04:00
David Kyle	d1280339a8	specifies which index to search in docs for various queries (#43307 ) (#43428 ) the geo-bounding-box and phrase-suggest docs were susceptible to failing due to other indices in the cluster. This change restricts the queries to the index that is set up for the test. relates to #43271.	2019-06-21 10:15:51 +01:00
Christoph Büscher	adab7eae71	[Docs] Remove boost parameter from intervals-query example (#43331 ) The boost factor doesn't seem to be needed and can be removed.	2019-06-20 10:34:14 +02:00
Mayya Sharipova	aa6248d4d7	Move dense_vector and sparse_vector to module (#43280 ) (#43333 )	2019-06-18 11:56:04 -04:00
markharwood	a75964d8fd	Docs change for exists query. (#43092 ) Now emphasises the test is for indexed values. Previous documentation only mentioned the state of the input JSON doc (null values) but this is only one of several reasons why an indexed value may not exist. Closes #24256	2019-06-12 09:28:18 +01:00
Shubham Vipul Majmudar	b2e7045b50	Update regexp-syntax.asciidoc (#43021 ) Corrects a typo.	2019-06-10 10:13:54 +01:00
James Rodewig	ed186b4485	[DOCS] Rewrite terms query (#42889 )	2019-06-06 08:33:52 -04:00
Christoph Büscher	99542e66a6	[Docs] Clarify caveats for phonetic filters replace option (#42807 ) The `replace` option in the phonetic token filter can have suprising side effects, e.g. such as described in #26921. This PR adds a note to be mindful about such scenarios and offers alternatives to using the `replace` option. Closes #26921	2019-06-05 22:03:54 +02:00
Marios Trivyzas	eab88354f2	[Docs] Fix reference to `boost` and `slop` params (#42803 ) For `multi_match` query: link `boost` param to the generic reference for query usage and `slop` to the `match_phrase` query where its usage is documented. Fixes: #40091 (cherry picked from commit 69993049a8bd9e7f042935729fe69a8266d95a0a)	2019-06-03 22:57:19 +02:00
Marios Trivyzas	ce30afcd01	Deprecate CommonTermsQuery and cutoff_frequency (#42619 ) (#42691 ) Since the max_score optimization landed in Elasticsearch 7, the CommonTermsQuery is redundant and slower. Moreover the cutoff_frequency parameter for MatchQuery and MultiMatchQuery is redundant. Relates to #27096 (cherry picked from commit 04b74497314eeec076753a33b3b6cc11549646e8)	2019-05-30 18:04:47 +02:00
James Rodewig	67326252d8	[DOCS] Rewrite 'wildcard' query (#42670 )	2019-05-30 08:31:27 -04:00
Mayya Sharipova	5e02dc6878	Add warning scores are floats (#42667 )	2019-05-29 16:49:04 -04:00
Mayya Sharipova	216c74d10a	Add experimental and warnings to vector functions (#42205 )	2019-05-21 06:39:05 -04:00
James Rodewig	58f2e91684	[DOCS] Rewrite 'rewrite' parameter docs (#42018 )	2019-05-13 08:43:12 -04:00
James Rodewig	ea5019665a	[DOCS] Replace table with def list for ids query (#41865 )	2019-05-09 09:52:20 -04:00
James Rodewig	77f634ba25	[DOCS] Rewrite `exists` query docs (#41868 )	2019-05-07 09:23:20 -04:00
James Rodewig	2306531815	[DOCS] Rewrite `term` query docs for new format (#41498 ) * [DOCS] Restructure `term` query docs.	2019-05-06 10:36:53 -04:00
Gordon Brown	9a4cab9f77	Recommend use of ln1p/log1p to avoid negative scores (#41610 ) As negative scores will now cause an error, and it is easy to accidentally produce negative scores with some of the built-in modifiers (especially `ln` and `log`), this adjusts the documentation to more strongly recommend the use of `ln1p` and `log1p` instead. Also corrects some awkward formatting on the note sections following the table.	2019-05-03 09:10:29 -06:00
James Rodewig	c86f797df8	[DOCS] Reword `type` query deprecation note for Asciidoctor migration (#41595 )	2019-05-01 09:29:05 -04:00
James Rodewig	53702efddd	[DOCS] Add anchors for Asciidoctor migration (#41648 )	2019-04-30 10:20:17 -04:00
Alan Woodward	d7ab86db9c	Document restrictions on fuzzy matching when using synonyms (#40783 ) Relates to #25518 #41592	2019-04-26 10:10:32 -07:00
Mayya Sharipova	4127d6889b	Remove experimental label froms script_score query (#41572 )	2019-04-26 08:51:29 -04:00
James Rodewig	a0b54a9a92	[DOCS] Restructure `ids` to new query docs format (#41077 )	2019-04-10 14:36:50 -04:00
Alan Woodward	8ca7325f65	Document restrictions on fuzzy matching when using synonyms (#40783 ) Relates to #25518	2019-04-04 09:10:44 +01:00
Andy Bristol	e88b862192	fix summary of phrase_prefix scoring (#40567 ) The language here implies that phrase_prefix scoring works like most_fields, but it actually works like best_fields	2019-04-01 12:04:59 -07:00
Christoph Büscher	4f5a83dba2	Clarify using time_zone and date math in range query (#40655 ) Currently, the docs correctly state that using `now` in range queries will not be affected by the `time_zone` parameter. However, using date math roundings like e.g. `now\d` will be affected by the `time_zone`. Adding this example because it seems to be a frequently asked question and source of confusion. Relates to #40581	2019-03-29 23:39:18 +01:00
avshiav	1edebfc834	[DOCS] Fixed missing space (#38305 )	2019-03-29 17:01:14 +01:00
Mayya Sharipova	24755209b4	Add randomScore function in script_score query (#40186 ) To make script_score query to have the same features as function_score query, we need to add randomScore function. This function produces different random scores on different index shards. It is also able to produce random scores based on the internal Lucene Document Ids.	2019-03-28 13:23:47 -04:00
Andy Bristol	23395a9b9f	search as you type fieldmapper (#35600 ) Adds the search_as_you_type field type that acts like a text field optimized for as-you-type search completion. It creates a couple subfields that analyze the indexed terms as shingles, against which full terms are queried, and a prefix subfield that analyze terms as the largest shingle size used and edge-ngrams, against which partial terms are queried Adds a match_bool_prefix query type that creates a boolean clause of a term query for each term except the last, for which a boolean clause with a prefix query is created. The match_bool_prefix query is the recommended way of querying a search as you type field, which will boil down to term queries for each shingle of the input text on the appropriate shingle field, and the final (possibly partial) term as a term query on the prefix field. This field type also supports phrase and phrase prefix queries however	2019-03-27 13:29:13 -07:00
Alan Woodward	83d2870308	Add `use_field` option to intervals query (#40157 ) This is the equivalent of the `field_masking_span` query, allowing users to merge intervals from multiple fields - for example, to search for stemmed tokens near unstemmed tokens.	2019-03-20 16:26:04 +00:00
Mayya Sharipova	49a7c6e0e8	Expose proximity boosting (#39385 ) (#40251 ) Expose DistanceFeatureQuery for geo, date and date_nanos types Closes #33382	2019-03-20 09:24:41 -04:00
Adrien Grand	9731ba4338	Make the `type` parameter optional when percolating existing documents. (#39987 ) (#39989 ) `document_type` is the type to use for parsing the document to percolate, which is already optional and deprecated. However `percotale` queries also have the ability to percolate existing documents, identified by an index, an id and a type. This change makes the latter optional and deprecated. Closes #39963	2019-03-13 15:04:41 +01:00
MK Swanson	7c7933b0aa	[DOCS] Edited intro sentence for clarity. Closes #39865 (#39868 )	2019-03-11 12:36:43 -04:00
Mayya Sharipova	e80284231d	Backport distance functions vectors (#39330 ) Distance functions for dense and sparse vectors Backport for #37947, #39313	2019-02-23 11:52:43 -05:00
Alan Woodward	ab4d5f404f	Add overlapping, before, after filters to intervals query (#38999 ) Lucene recently added `overlapping`, `before` and `after` filters to the intervals package. This commit exposes them in elasticsearch.	2019-02-18 15:06:24 +00:00
Alexander Reelsen	8e5e48319e	Add documentation about breaking java time changes (#38886 ) In addition remove joda time mentions across the docs, make sure links are updated to java time javadocs. Forward port of #38720	2019-02-14 10:18:12 +01:00
Jim Ferenczi	787acb14b9	Track total hits up to 10,000 by default (#37466 ) This commit changes the default for the `track_total_hits` option of the search request to `10,000`. This means that by default search requests will accurately track the total hit count up to `10,000` documents, requests that match more than this value will set the `"total.relation"` to `"gte"` (e.g. greater than or equals) and the `"total.value"` to `10,000` in the search response. Scroll queries are not impacted, they will continue to count the total hits accurately. The default is set back to `true` (accurate hit count) if `rest_total_hits_as_int` is set in the search request. I choose `10,000` as the default because that's also the number we use to limit pagination. This means that users will be able to know how far they can jump (up to 10,000) even if the total number of hits is not accurate. Closes #33028	2019-01-25 13:45:39 +01:00
Mayya Sharipova	a30ce6a00a	Rename feature, feature_vector and feature_query (#37794 ) Ranaming as follows: feature -> rank_feature feature_vector -> rank_features feature query -> rank_feature query Ranaming is done to distinguish from other vector types. Closes #36723	2019-01-24 19:18:48 -05:00
Mayya Sharipova	fdb66039d4	Change `rational` to `saturation` in script_score (#37766 ) This change of the function name is necessary for conformity with feature queries. Closes #37714	2019-01-23 14:28:20 -05:00
Mayya Sharipova	942fc13af5	Use plain text instead of latexmath As latexmath is not rendered, using plain text instead Closes #37718	2019-01-22 16:49:03 -05:00
Christoph Büscher	34f2d2ec91	Remove remaining occurances of "include_type_name=true" in docs (#37646 )	2019-01-22 15:13:52 +01:00
Christoph Büscher	25aac4f77f	Remove `include_type_name` in asciidoc where possible (#37568 ) The "include_type_name" parameter was temporarily introduced in #37285 to facilitate moving the default parameter setting to "false" in many places in the documentation code snippets. Most of the places can simply be reverted without causing errors. In this change I looked for asciidoc files that contained the "include_type_name=true" addition when creating new indices but didn't look likey they made use of the "_doc" type for mappings. This is mostly the case e.g. in the analysis docs where index creating often only contains settings. I manually corrected the use of types in some places where the docs still used an explicit type name and not the dummy "_doc" type.	2019-01-18 09:34:11 +01:00
Julie Tibshirani	36a3b84fc9	Update the default for include_type_name to false. (#37285 ) * Default include_type_name to false for get and put mappings. * Default include_type_name to false for get field mappings. * Add a constant for the default include_type_name value. * Default include_type_name to false for get and put index templates. * Default include_type_name to false for create index. * Update create index calls in REST documentation to use include_type_name=true. * Some minor clean-ups around the get index API. * In REST tests, use include_type_name=true by default for index creation. * Make sure to use 'expression == false'. * Clarify the different IndexTemplateMetaData toXContent methods. * Fix FullClusterRestartIT#testSnapshotRestore. * Fix the ml_anomalies_default_mappings test. * Fix GetFieldMappingsResponseTests and GetIndexTemplateResponseTests. We make sure to specify include_type_name=true during xContent parsing, so we continue to test the legacy typed responses. XContent generation for the typeless responses is currently only covered by REST tests, but we will be adding unit test coverage for these as we implement each typeless API in the Java HLRC. This commit also refactors GetMappingsResponse to follow the same appraoch as the other mappings-related responses, where we read include_type_name out of the xContent params, instead of creating a second toXContent method. This gives better consistency in the response parsing code. * Fix more REST tests. * Improve some wording in the create index documentation. * Add a note about types removal in the create index docs. * Fix SmokeTestMonitoringWithSecurityIT#testHTTPExporterWithSSL. * Make sure to mention include_type_name in the REST docs for affected APIs. * Make sure to use 'expression == false' in FullClusterRestartIT. * Mention include_type_name in the REST templates docs.	2019-01-14 13:08:01 -08:00
Mayya Sharipova	ec32e66088	Deprecate reference to _type in lookup queries (#37016 ) Relates to #35190	2019-01-08 18:46:41 -08:00
Josh Soref	edb48321ba	[DOCS] Various spelling corrections (#37046 )	2019-01-07 14:44:12 +01:00
Jim Ferenczi	667c06dc83	Add link to script score query in the top level docs (#36416 ) * add link to script score query in the top level docs * Correct references in script-score-query.asciidoc	2018-12-19 10:18:53 -05:00
Alan Woodward	344917efab	Add script filter to intervals (#36776 ) This commit adds the ability to filter out intervals based on their start and end position, and internal gaps: ``` POST _search { "query": { "intervals" : { "my_text" : { "match" : { "query" : "hot porridge", "filter" : { "script" : { "source" : "interval.start > 10 && interval.end < 20 && interval.gaps == 0" } } } } } } } ```	2018-12-19 11:12:18 +00:00
Nick Knize	ec0dc2c0e9	[Geo] Integrate Lucene's LatLonShape (BKD Backed GeoShapes) as default `geo_shape` indexing approach (#36751 ) * [Geo] Expose BKDBackedGeoShapes as new VECTOR strategy This commit exposes lucene's LatLonShape field as a new strategy in GeoShapeFieldMapper. To use the new indexing approach, strategy should be set to "vector" in the geo_shape field mapper. If the tree parameter is set the mapper will throw an IAE. Note the following: When using vector strategy: * geo_shape query does not support querying by POINT, MULTIPOINT, or GEOMETRYCOLLECTION. * LINESTRING and MULTILINESTRING queries do not support WITHIN relation. * CONTAINS relation is not supported. * The tree, precision, tree_levels, distance_error_pct, and points_only parameters will not throw an exception but they have no effect and will be marked as deprecated.. All other features are supported. * revert change to PercolatorFieldMapper * fix ExistsQuery for geo_shape vector strategy * add deprecation logging for tree, precision, tree_levels, distance_error_pct, and points_only * initial update to geoshape docs, including mapping migration updates * initial support for GeoCollection queries * fix docs and javadoc errors * clean up geocollection queries * set deprecated mapping tests to NOTCONSOLE * fix geo-shape mapper asciidoc mapping and test warnings * add support for point queries using LatLonShapeBoundingBoxQuery * update GeoShapeQueryBuilderTests to include POINT queries for VECTOR strategy. Other comment cleanups * add lucene geometry build testing to ShapeBuilder tests * remove deprecated prefix tree mapping from geo-shape.asciidoc * refactor GeoShapeFieldMapper into LegacyGeoShapeFieldMapper and GeoShapeFieldMapper Both classes derive from BaseGeoShapeFieldMapper that provides shared parameters: coerce, ignoreMalformed, ignore_z_value, orientation. * update docs to remove vector strategy * fix GeometryCollectionBuilder#buildLucene to return the object created by the shape builder * fix LineLength failure in GeoJsonShapeParserTests * ShapeMapper refactor changes from PR feedback * fix typo in geo-shape.asciidoc * ignore circle test in docs * update indexing-approach ref to geoshape-indexing-approach * add warnings check for LegacyGeoShapeFieldMapper to AbstractBuilderTestCase * fix deprecatedParameters setup * update indexing approach * fixing unexpected warnings failures * move orientation back to field type * remove if in LegacyGeoShapeFieldMapper#doXContent. Fix GeoShapeFieldMapper to work with double array as a point * fix indexing-approach link in circle section of geoshape docs * add strategy to deprecation warnings check * fix test failures * fix typo in QueryStringQueryBuilderTests * fix total hits to totalHits().value * fix version number * add version check to BaseGeoShapeFieldMapper * fix line length! * revert version check in BaseGeoShapeFieldMapper * Fix serialization of mappings of legacy shapes.	2018-12-18 09:54:56 -06:00
Nicholas Knize	96d279ed83	Revert "[Geo] Integrate Lucene's LatLonShape (BKD Backed GeoShapes) as default `geo_shape` indexing approach (#35320 )" This reverts commit `5bc7822562`.	2018-12-17 20:09:46 -06:00
Nick Knize	5bc7822562	[Geo] Integrate Lucene's LatLonShape (BKD Backed GeoShapes) as default `geo_shape` indexing approach (#35320 ) This commit exposes lucene's LatLonShape field as the default type in GeoShapeFieldMapper. To use the new indexing approach, simply set "type" : "geo_shape" in the mappings without setting any of the strategy, precision, tree_levels, or distance_error_pct parameters. Note the following when using the new indexing approach: * geo_shape query does not support querying by MULTIPOINT. * LINESTRING and MULTILINESTRING queries do not yet support WITHIN relation. * CONTAINS relation is not yet supported. The tree, precision, tree_levels, distance_error_pct, and points_only parameters are deprecated.	2018-12-17 14:38:14 -06:00
Alan Woodward	09bf93dc2a	Add intervals query (#36135 ) * Add IntervalQueryBuilder with support for match and combine intervals * Add relative intervals * feedback * YAML test - broekn * yaml test; begin to add block source * Add block; make disjunction its own source * WIP * Extract IntervalBuilder and add tests for it * Fix eq/hashcode in Disjunction * New yaml test * checkstyle * license headers * test fix * YAML format * YAML formatting again * yaml tests; javadoc * Add OR test -> requires fix from LUCENE-8586 * Add docs * Re-do API * Clint's API * Delete bash script * doc fixes * imports * docs * test fix * feedback * comma * docs fixes * Tidy up doc references to old rule	2018-12-14 15:14:00 +00:00
Christoph Büscher	a42502df8b	[Docs] Add description of simple query string flags (#36211 ) Closes #34944	2018-12-10 01:00:42 +01:00
Jim Ferenczi	18866c4c0b	Make hits.total an object in the search response (#35849 ) This commit changes the format of the `hits.total` in the search response to be an object with a `value` and a `relation`. The `value` indicates the number of hits that match the query and the `relation` indicates whether the number is accurate (in which case the relation is equals to `eq`) or a lower bound of the total (in which case it is equals to `gte`). This change also adds a parameter called `rest_total_hits_as_int` that can be used in the search APIs to opt out from this change (retrieve the total hits as a number in the rest response). Note that currently all search responses are accurate (`track_total_hits: true`) or they don't contain `hits.total` (`track_total_hits: true`). We'll add a way to get a lower bound of the total hits in a follow up (to allow numbers to be passed to `track_total_hits`). Relates #33028	2018-12-05 19:49:06 +01:00
Alan Woodward	73ceaad03a	Update to lucene-8.0.0-snapshot-c78429a554 (#36212 ) Includes: * A fix for a bug in Intervals.or() (https://issues.apache.org/jira/browse/LUCENE-8586) * The ability to disable offset mangling in WordDelimiterGraphFilter (https://issues.apache.org/jira/browse/LUCENE-8509) * BM25Similarity no longer multiplies scores by k1 + 1	2018-12-05 12:43:56 +00:00
Jim Ferenczi	74aca756b8	Remove the distinction between query and filter context in QueryBuilders (#35354 ) When building a query Lucene distinguishes two cases, queries that require to produce a score and queries that only need to match. We cloned this mechanism in the QueryBuilders in order to be able to produce different queries based on whether they need to produce a score or not. However the only case in es that require this distinction is the BoolQueryBuilder that sets a different minimum_should_match when a `bool` query is built in a filter context.. This behavior doesn't seem right because it makes the matching of `should` clauses different when the score is not required. Closes #35293	2018-12-03 11:49:11 +01:00
Christophe Bismuth	acdf9666d5	Add `minimum_should_match` section to the query_string docs Closes #34142	2018-11-30 16:10:13 +01:00
Christophe Bismuth	b95a4db6e6	Throw a parsing exception when boost is set in span_or query (#28390 ) (#34112 )	2018-11-26 12:15:59 -05:00
Mayya Sharipova	b6014d971c	Forbid negative scores in functon_score query (#35709 ) * Forbid negative scores in functon_score query - Throw an exception when scores are negative in field_value_factor function - Throw an exception when scores are negative in script_score function Relates to #33309	2018-11-22 06:08:48 -05:00
Alan Woodward	be8097f9ce	Improve docs for index_prefixes option (#35778 ) This commit moves the documentation and examples for the `index_prefixes` option on text fields to its own file, to bring it in line with other mapping parameters, and expands a bit on both.	2018-11-22 09:20:46 +00:00
Mayya Sharipova	643bb20137	Add a new query type - ScriptScoreQuery (#34533 ) * Add a new query type - ScriptScoreQuery script_score query uses script to calculate document scores. Added as a substitute for function_score with an intentation to deprecate function_scoreq query. ```http GET /_search { "query": { "script_score" : { "query": { "match": { "message": "elasticsearch" } }, "script" : { "source": "Math.log(2 + doc['likes'].value)" }, "min_score" : 2 } } } ``` Add several functions to painless to be used inside script_score: double rational(double, double) double sigmoid(double, double, double) double randomNotReproducible() double randomReproducible(String, int) double decayGeoLinear(String, String, String, double, GeoPoint) double decayGeoExp(String, String, String, double, GeoPoint) double decayGeoGauss(String, String, String, double, GeoPoint) double decayNumericLinear(String, String, String, double, double) double decayNumericExp(String, String, String, double, double) double decayNumericGauss(String, String, String, double, double) double decayDateLinear(String, String, String, double, JodaCompatibleZonedDateTime) double decayDateExp(String, String, String, double, JodaCompatibleZonedDateTime) double decayDateGauss(String, String, String, double, JodaCompatibleZonedDateTime) Date functions only works on dates in the default format and default time zone	2018-11-20 16:10:06 -05:00
Itamar Syn-Hershko	156b3cae15	[Docs] Fix filter context in script-query.asciidoc (#35677 ) The docs say script queries are typically used in a filter context but the example uses a boolean must clause.	2018-11-19 16:30:33 +01:00
Peter Dyson	6d8af9731d	[Docs] Warn about searching across all fields wt. `query_string` (#35570 ) Warn about potential performance impact when a large number of fields is used with query string query and no default field.	2018-11-19 13:21:59 +01:00
Jun Ohtani	0cdfc4cd0a	[Doc] Add clarification to boolean query (#32575 ) It isn't very clear how boosting query works. Add explanation of positive/negative query.	2018-11-16 11:45:32 +09:00
Christoph Büscher	113af7996c	Make limit on number of expanded fields configurable (#35284 ) Currently we introduced a hard limit of 1024 to the number of fields a query can be expanded to in #26541. Instead of using a hard limit, we should make this configurable. This change removes the hard limit check and uses the existing `max_clause_count` setting instead. Closes #34778	2018-11-08 17:04:40 +01:00
Gytis Šk	3ee37b425b	Docs: Add section about range query for range type (#35222 ) This makes their interaction more discoverable.	2018-11-06 10:49:12 -05:00
Jeff Soloshy	14c8a483d5	[Docs] Minor formatting and wording fixes (#35278 )	2018-11-06 07:52:13 +01:00
Julie Tibshirani	fda173d7aa	Add a note around using separate indices for percolator queries and documents. (#35109 )	2018-11-01 12:41:07 -07:00
Nik Everett	2b2e208c4c	Docs: Remove range notation from random score docs (#35093 ) The `random_score` function produces values between 0 (inclusive) and 1 (exclusive) and documented it with fancy methematical range notation. It is so fancy I thought it was a typo. This changes the documentation to use words. Relates to #35084	2018-10-30 14:12:59 -04:00
Julie Tibshirani	f854330e06	Make sure to use the type _doc in the REST documentation. (#34662 ) * Replace custom type names with _doc in REST examples. * Avoid using two mapping types in the percolator docs. * Rename doc -> _doc in the main repository README. * Also replace some custom type names in the HLRC docs.	2018-10-22 11:54:04 -07:00
Nikolay Vasiliev	f5641e61a2	Docs: improve formatting of Query String Query doc page (#34432 ) Merge two tables.	2018-10-15 15:30:48 -04:00
Jim Ferenczi	6e28c8f1c4	[DOCS] Remove experimental label from term_set query (#34328 )	2018-10-05 19:45:23 +02:00
amoreauCoveo	e95dc5474f	Minor corrections in geo-queries.asciidoc (#34314 )	2018-10-05 17:12:18 +02:00
Julie Tibshirani	704d3e4c24	Add a deprecation warning in the type query documentation. (#34017 )	2018-09-24 16:30:38 -07:00
Abdon Pijpelink	32ee6148d2	[DOCS] Clarify scoring for multi_match phrase type (#32672 ) The original statement "Runs a match_phrase query on each field and combines the _score from each field." for the phrase type is a but misleading. The phrase type behaves like the best_fields type and does not combine the scores of each fields.	2018-09-18 16:57:33 +02:00
Joel Green	0b567c0eeb	[Docs] Update match-query.asciidoc (#33610 )	2018-09-12 14:35:27 +02:00
lipsill	b7c0d2830a	[Docs] Remove repeating words (#33087 )	2018-08-28 13:16:43 +02:00
w-bonelli	072c0be8af	Update Fuzzy Query docs to clarify default behavior re max_expansions (#30819 ) Stating that the Fuzzy Query generates "all possible" matching terms is misleading, given that the query's default behavior is to generate a maximum of 50 matching terms. (cherry picked from commit 345a0071a2a41fd7f80ae9ef8a39a2cb4991aedd)	2018-07-30 13:19:26 -07:00
Piotr Prądzyński	4fc833b1de	Unify headers for full text queries Relates #31599	2018-06-27 10:11:14 +02:00
Piotr Prądzyński	f6c64a048d	Remove redundant 'minimum_should_match' Relates #31600	2018-06-27 10:11:07 +02:00
Sue Gallagher	b44e1c1978	[DOCS] Removed and params from MLT. Closes #28128 (#31370 )	2018-06-19 13:48:13 -07:00
Sue Gallagher	cdb486ae70	[DOCS] Added 'fail_on_unsupported_field' param to MLT. Closes #28008 (#31160 ) * [DOCS] Added 'fail_on_unsupported_field' param to MLT. Closes 28008 * [DOCS] Added 'fail_on_unsupported_field' param to MLT. Closes #28008 * [DOCS] Added 'fail_on_unsupported_field' param to MLT. Closes #28008 * [DOCS] Added 'fail_on_unsupported_field' param to MLT. Closes #28008	2018-06-08 14:41:01 -07:00
Adrien Grand	458bca11bc	Add a `feature_vector` field. (#31102 ) This field is similar to the `feature` field but is better suited to index sparse feature vectors. A use-case for this field could be to record topics associated with every documents alongside a metric that quantifies how well the topic is connected to this document, and then boost queries based on the topics that the logged user is interested in. Relates #27552	2018-06-07 10:05:37 +02:00
Nirmal Chidambaram	75a676c70b	Fail `span_multi` queries that exceeds boolean max clause limit (#30913 ) By default span_multi query will limit term expansions = boolean max clause. This will limit high heap usage in case of high cardinality term expansions. This applies only if top_terms_N is not used in inner multi query.	2018-06-07 09:34:39 +02:00
Adrien Grand	1af6d20efe	Fix docs build.	2018-06-05 14:55:40 +02:00
Adrien Grand	984523dda9	Clarify docs about boolean operator precedence. (#30808 ) Unfortunately, the classic queryparser does not honor the usual precedence rules of boolean operators. See https://issues.apache.org/jira/browse/LUCENE-3674.	2018-06-05 08:59:17 +02:00
Jim Ferenczi	f94a75778c	Fix index prefixes to work with span_multi (#31066 ) * Fix index prefixes to work with span_multi Text fields that use `index_prefixes` can rewrite `prefix` queries into `term` queries internally. This commit fix the handling of this rewriting in the `span_multi` query. This change also copies the index options of the text field into the prefix field in order to be able to run positional queries. This is mandatory for `span_multi` to work but this could also be useful to optimize `match_phrase_prefix` queries in a follow up. Note that this change can only be done on indices created after 6.3 since we set the index options to doc only in this version. Fixes #31056	2018-06-04 21:48:56 +02:00
Igor Motov	cf0e0606af	Use geohash cell instead of just a corner in geo_bounding_box (#30698 ) Treats geohashes as grid cells instead of just points when the geohashes are used to specify the edges in the geo_bounding_box query. For example, if a geohash is used to specify the top_left corner, the top left corner of the geohash cell will be used as the corner of the bounding box. Closes #25154	2018-05-24 14:46:15 -04:00
Christoph Büscher	3f78b3f5e1	[Docs] Explain incomplete dates in range queries (#30689 ) The current documentation isn't very clear about how incomplete dates are treated when specifying custom formats in a `range` query. This change adds a note explaining how missing month or year coordinates translate to dates that have the missings slots filled with unix time start date (1970-01-01) Closes #30634	2018-05-24 11:20:00 +02:00
Igor Motov	4b6915976c	Add support for indexed shape routing in geo_shape query (#30760 ) Adds ability to specify the routing value for the indexed shape in the geo_shape query. Closes #7663	2018-05-23 15:15:19 -04:00
Adrien Grand	886db84ad2	Expose Lucene's FeatureField. (#30618 ) Lucene has a new `FeatureField` which gives the ability to record numeric features as term frequencies. Its main benefit is that it allows to boost queries with the values of these features and efficiently skip non-competitive documents at the same time using block-max WAND and indexed impacts.	2018-05-23 08:55:21 +02:00
Igor Motov	b30f2913cf	Docs: document precision limitations of geo_bounding_box (#30540 ) The geo_bounding_box query might produce false positives alongside the right and upper edges and false negatives alongside left and bottom edges. This commit documents the behavior and defines the maximum error. Closes #29196	2018-05-14 15:54:42 -04:00
Jason Tedor	4a4e3d70d5	Default to one shard (#30539 ) This commit changes the default out-of-the-box configuration for the number of shards from five to one. We think this will help address a common problem of oversharding. For users with time-based indices that need a different default, this can be managed with index templates. For users with non-time-based indices that find they need to re-shard with the split API in place they no longer need to resort only to reindexing. Since this has the impact of changing the default number of shards used in REST tests, we want to ensure that we still have coverage for issues that could arise from multiple shards. As such, we randomize (rarely) the default number of shards in REST tests to two. This is managed via a global index template. However, some tests check the templates that are in the cluster state during the test. Since this template is randomly there, we need a way for tests to skip adding the template used to set the number of shards to two. For this we add the default_shards feature skip. To avoid having to write our docs in a complicated way because sometimes they might be behind one shard, and sometimes they might be behind two shards we apply the default_shards feature skip to all docs tests. That is, these tests will always run with the default number of shards (one).	2018-05-14 12:22:35 -04:00
wmellouli	c8d8407012	[Docs] Add term query with normalizer example	2018-05-03 10:23:14 +02:00

1 2 3 4 5 ...

598 Commits