OpenSearch

Commit Graph

Author	SHA1	Message	Date
Pascal Borreli	af6d890ad5	Docs: Fixed typos Closes #10973	2015-05-05 10:38:05 +02:00
Clément Salaün	c0659ce4d4	Docs: Update geo-distance-range-filter.asciidoc missing comma Closes #10957	2015-05-04 17:17:48 +02:00
Adrien Grand	b72f27a410	Core: Cut over to the Lucene filter cache. This removes Elasticsearch's filter cache and uses Lucene's instead. It has some implications: - custom cache keys (`_cache_key`) are unsupported - decisions are made internally and can't be overridden by users ('_cache`) - not only filters can be cached but also all queries that do not need scores - parent/child queries can now be cached, however cached entries are only valid for the current top-level reader so in practice it will likely only be used on read-only indices - the cache deduplicates filters, which plays nicer with large keys (eg. `terms`) - better stats: we already had ram usage and evictions, but now also hit count, miss count, lookup count, number of cached doc id sets and current number of doc id sets in the cache - dynamically changing the filter cache size is not supported anymore Internally, an important change is that it removes the NoCacheFilter infrastructure in favour of making Query.rewrite specializing the query for the current reader so that it will only be cached on this reader (look for IndexCacheableQuery). Note that consuming filters with the query API (createWeight/scorer) instead of the filter API (getDocIdSet) is important for parent/child queries because otherwise a QueryWrapperFilter(ParentQuery) would run the wrapped query per segment while relations might be cross segments.	2015-05-04 09:02:15 +02:00
Robert Muir	dfe1d1463c	fix doc typo	2015-04-30 23:46:37 -04:00
Robert Muir	aade6194b7	Add span within/containing queries. Expose new span queries from https://issues.apache.org/jira/browse/LUCENE-6083 Within returns matches from 'little' that are enclosed inside of a match from 'big'. Containing returns matches from 'big' that enclose matches from 'little'.	2015-04-30 23:31:31 -04:00
Nik Everett	cb89a14010	Add default to field_value_factor field_value_factor now takes a default that is used if the document doesn't have a value for that field. It looks like: "field_value_factor": { "field": "popularity", "missing": 1 } Closes #10841	2015-04-28 11:06:24 -04:00
navins	84636557e1	Docs: correct three mis-match of brackets Closes #10806	2015-04-26 19:43:14 +02:00
Christine	9e81e4c09b	Docs: Update bool-filter.asciidoc from, to deprecated in favour of gt, lt Closes #10682	2015-04-26 19:23:11 +02:00
Mark Mulder	690c16e81a	Docs: Fix minor spelling mistakes in Match Query doc Closes #10751	2015-04-26 16:29:41 +02:00
Oliver Eilhard	95e9b86505	Mustache tags syntax Hi there. I've been experimenting with the search templates recently and I'm a bit confused. Shouldn't the Mustache tags be written like `{{tagname}}` instead of `{tagname}`? Your using `{{...}}` [here](http://www.elastic.co/guide/en/elasticsearch/reference/current/search-template.html) BTW. Using the first example in that page seems to indicate that something's wrong, or am I missing something? ``` $ curl 'localhost:9200/test/_search' -d '{"query":{"template":{"query":{"match":{"text":"{keywords}"}},"params":{"keywords":"value1_foo"}}}}' {"took":1,"timed_out":false,"_shards":{"total":1,"successful":1,"failed":0},"hits":{"total":0,"max_score":null,"hits":[]}} $ curl 'localhost:9200/test/_search' -d '{"query":{"template":{"query":{"match":{"text":"{{keywords}}"}},"params":{"keywords":"value1_foo"}}}}' {"took":1,"timed_out":false,"_shards":{"total":1,"successful":1,"failed":0},"hits":{"total":1,"max_score":1.0,"hits":[{"_index":"test","_type":"testtype","_id":"1","_score":1.0,"_source":{"text":"value1_foo"}}]}} ```	2015-04-24 21:23:58 +02:00
Adrien Grand	d7abb12100	Replace deprecated filters with equivalent queries. In Lucene 5.1 lots of filters got deprecated in favour of equivalent queries. Additionally, random-access to filters is now replaced with approximations on scorers. This commit - replaces the deprecated NumericRangeFilter, PrefixFilter, TermFilter and TermsFilter with NumericRangeQuery, PrefixQuery, TermQuery and TermsQuery, wrapped in a QueryWrapperFilter - replaces XBooleanFilter, AndFilter and OrFilter with a BooleanQuery in a QueryWrapperFilter - removes DocIdSets.isBroken: the new two-phase iteration API will now help execute slow filters efficiently - replaces FilterCachingPolicy with QueryCachingPolicy Close #8960	2015-04-21 15:32:43 +02:00
Alex Ksikes	d339ee4005	Term Vectors: terms filtering This adds a new feature to the Term Vectors API which allows for filtering of terms based on their tf-idf scores. With `dfs` option on, this could be useful for finding out a good characteric vector of a document or a set of documents. The parameters are similar to the ones used in the MLT Query. Closes #9561	2015-04-14 19:11:09 +02:00
Benoit Delbosc	1b35854768	Docs: Fix simple_query_string example The "&" is not part of the simple_query_string DSL Closes #10563	2015-04-13 14:46:47 +02:00
Adrien Grand	ab8926bc6a	Docs: fix build.	2015-04-10 17:38:36 +02:00
Adrien Grand	5b3cc2f07c	Search: deprecate the limit filter. This is really a Collector instead of a filter. This commit deprecates the `limit` filter, makes it a no-op and recommends to use the `terminate_after` parameter instead that we introduced in the meantime.	2015-04-10 17:18:50 +02:00
Adrien Grand	919589b908	Queries: Remove fuzzy-like-this support. The fuzzy-like-this query builds very expensive queries and only serves esoteric use-cases.	2015-04-10 17:16:02 +02:00
Clinton Gormley	abc7de96ae	Docs: Updated version annotations in master	2015-04-09 14:50:11 +02:00
Isabel Drost-Fromm	60bb65c4d9	Docs: Note on shard vs. index level doc frequencies. Relates to #10154 and #10150 Adds link to additional information on how document frequencies are treated across shards to the cutoff_frequency parameter documentation. Closes #10451	2015-04-07 14:28:01 +02:00
joelbourbon	3c52bc1098	Docs: Missing 1 escape character in example Closes #10446	2015-04-07 14:10:17 +02:00
Patrick Peschlow	a9af488bb3	Update prefix-filter.asciidoc text said phrase instead of prefix, probably due to copy-paste	2015-03-31 09:25:15 +02:00
Lee Hinman	6aec68cd29	Revert "[QUERY] Remove lowercase_expanded_terms and locale options" This reverts commit `d1f7bd97cb`. Ryan pointed out that this needs to work with the multi term query, so additional analysis and tests should be added.	2015-03-13 13:51:44 -06:00
Lee Hinman	d1f7bd97cb	[QUERY] Remove lowercase_expanded_terms and locale options The analysis chain should be used instead of relying on this, as it is confusing when dealing with different per-field analysers. The `locale` option was only used for `lowercase_expanded_terms`, which, once removed, is no longer needed, so it was removed as well. Fixes #9978 Relates to #9973	2015-03-13 13:17:27 -06:00
Clinton Gormley	3f9d4f9635	Update query-string-syntax.asciidoc Closes #9965	2015-03-03 20:03:51 +01:00
cgp	b1e6df3b6c	Update span-multi-term-query.asciidoc Added comma - there is no "term range" query Closes #9855	2015-02-28 03:05:05 +01:00
Lee Hinman	2e9ea4abaf	Add support for `minimum_should_match` to `simple_query_string` This behaves similar to the way that `minimum_should_match` works for the `match` query (in fact it is implemented in the exact same way) Fixes #6449	2015-02-25 11:35:33 -07:00
Michael Sander	fd6c6058ce	Remove Triple Negative! Double negatives are confusing, but a triple negative (1 no, 2 non, 3 null)? It takes five minutes to understand this little sentence. Cleaned that up a bit. Closes #9789	2015-02-23 20:09:05 +01:00
Blake Niemyjski	8cba6c3abb	Fixed an invalid query Closes #9682	2015-02-13 21:11:42 +01:00
Clinton Gormley	856b0fa1a0	Docs: Fixed explanation of how the query string query is rewritten	2015-02-12 12:46:44 +01:00
Alfredo Serafini	e607e53591	Update span-multi-term-query.asciidoc added wildcard to the list of possible nested queries Closes #9586	2015-02-09 16:01:46 +01:00
Adam	928ea82188	Docs: Updated documentation for query-string-syntax to include '>' '<' and '=' as reserved characters Closes #9518	2015-02-04 17:55:15 +01:00
Clinton Gormley	8978aa5465	Docs: Improved the template query docs Added the `file` and `id` parameters. Closes #9458	2015-01-28 14:19:59 +01:00
Alex Ksikes	615513ee9b	Docs: clearer MLT documentation Closes #9351	2015-01-20 16:42:39 +01:00
Adrien Grand	bc86796592	Core: Remove terms filter cache. This is our only cache which is not 'exact' and might allow for stalled results. Additionally, a similar cache that we have and needs to perform lookups in other indices in order to run queries is the script index, and for this index we rely on the filesystem cache, so we should probably do the same with terms filters lookups. Close #9056	2015-01-06 17:21:20 +01:00
Clinton Gormley	f83909f7ae	Docs: The regexp query defaults to the `ALL` flag, and removed the `AUTOMATON` flag which is not used in Elasticsearch. Closes #6180	2014-12-30 19:53:31 +01:00
Adrien Grand	fb6c3b7c29	[Docs] Improve documentation of the new caching policy for filters.	2014-12-22 17:14:47 +01:00
Adrien Grand	ce11e0ee6d	Filter cache: add a `_cache: auto` option and make it the default. Up to now, all filters could be cached using the `_cache` flag that could be set to `true` or `false` and the default was set depending on the type of the `filter`. For instance, `script` filters are not cached by default while `terms` are. For some filters, the default is more complicated and eg. date range filters are cached unless they use `now` in a non-rounded fashion. This commit adds a 3rd option called `auto`, which becomes the default for all filters. So for all filters a cache wrapper will be returned, and the decision will be made at caching time, per-segment. Here is the default logic: - if there is already a cache entry for this filter in the current segment, then return the cache entry. - else if the doc id set cannot iterate (eg. script filter) then do not cache. - else if the doc id set is already cacheable and it has been used twice or more in the last 1000 filters then cache it. - else if the filter is costly (eg. multi-term) and has been used twice or more in the last 1000 filters then cache it. - else if the doc id set is not cacheable and it has been used 5 times or more in the last 1000 filters, then load it into a cacheable set and cache it. - else return the uncached set. So for instance geo-distance filters and script filters are going to use this new default and are not going to be cached because of their iterators. Similarly, date range filters are going to use this default all the time, but it is very unlikely that those that use `now` in a not rounded fashion will get reused so in practice they won't be cached. `terms`, `range`, ... filters produce cacheable doc id sets with good iterators so they will be cached as soon as they have been used twice. Filters that don't produce cacheable doc id sets such as the `term` filter will need to be used 5 times before being cached. This ensures that we don't spend CPU iterating over all documents matching such filters unless we have good evidence of reuse. One last interesting point about this change is that it also applies to compound filters. So if you keep on repeating the same `bool` filter with the same underlying clauses, it will be cached on its own while up to now it used to never be cached by default. `_cache: true` has been changed to only cache on large segments, in order to not pollute the cache since small segments should not be the bottleneck anyway. However `_cache: false` still has the same semantics. Close #8449	2014-12-18 15:51:36 +01:00
Timothy Perisho	ceafde41e9	Docs: typo on "frequent" I replaced "high frequent terms" with "high frequency terms" and "low frequent terms" with "low frequency terms". Alternatively, we could write, "highly frequent terms" and "minimally frequent terms" (or just "rare terms"). Closes #8962	2014-12-15 19:59:50 +01:00
John Michael Luy	01ef80a33d	Update range-filter.asciidoc Closes #8741	2014-12-02 18:00:38 +01:00
John Michael Luy	f20f6ffe22	Docs: Update range-query.asciidoc Closes #8740	2014-12-02 12:55:44 +01:00
Alex Ksikes	256712640f	MLT Query: Support for ignore docs Adds a `ignore_like` parameter to the MLT Query, which simply tells the algorithm to skip all the terms from the given documents. This could be useful in order to better guide nearest neighbor search by telling the algorithm to never explore the space spanned by the given `ignore_like` docs. In essence we are interested about the characteristic of a given item, but not of the ones provided by `ignore_like`, thereby forcing the algorithm to go deeper in its selection of terms. Note that this is different than simply performing a must not boolean query on the unliked items. The syntax is exactly the same as the `like` parameter. Closes #8674	2014-11-28 14:48:43 +01:00
pmamat	9e2eaeece4	Docs: Additional info about _score calculation Description taken from http://www.elasticsearch.org/guide/en/elasticsearch/guide/current/multi-query-strings.html / 110_Multi_Field_Search/05_Multiple_query_strings.asciidoc Closes #8635	2014-11-28 13:54:45 +01:00
Britta Weber	59507cf793	function_score: match only document with score above custom score threshold functon_score matched each document regardless of the computed score. This commit adds a query parameter `min_score` (-Float.MAX_VALUE default). Documents that have a score lower than this threshold will not be mached. closes #6952	2014-11-28 12:35:26 +01:00
Britta Weber	f00b431c18	[docs] explain default settings for parameters of decay functions relates to #8624	2014-11-27 19:18:55 +01:00
Martijn van Groningen	1d7cdd7d22	Applied PR, changed the way defaults are handled and updated the docs. Closes #4452	2014-11-24 13:32:41 +01:00
dw	bb81055c33	Docs: Remove reference to imaginary "no_docs_query" No reference to it in the source code except this file. Closes #8566	2014-11-23 13:56:33 +01:00
Michael McCandless	8aebb9656b	Core: add max_determinized_states to query_string and regexp query/filter This prevents too-difficult regular expressions from consuming excessive RAM/CPU; the default max_determinized_states is 10,000 (same as Lucene) but query_string and regepx query/filter can override per-request. The also upgrades to a new Lucene 5.0.0 snapshot. Closes #8386 Closes #8357	2014-11-10 13:43:48 -05:00
Clinton Gormley	cff544dcc2	Docs: Removed old coming/added tags	2014-11-10 14:41:24 +01:00
Britta Weber	c5a4c1d6b4	[docs] add 2d vis for decay functions and parameters closes #8420	2014-11-10 10:56:41 +01:00
Veres Lajos	4059e4ac86	typo fixes - https://github.com/vlajos/misspell_fixer Closes #8323	2014-11-08 18:55:57 +01:00
Clinton Gormley	b9149f836b	Docs: Improve the exists/missing filters documentation Closes #7274	2014-11-08 16:57:41 +01:00

1 2 3 4

185 Commits