OpenSearch

History

Adrien Grand ce11e0ee6d Filter cache: add a `_cache: auto` option and make it the default. Up to now, all filters could be cached using the `_cache` flag that could be set to `true` or `false` and the default was set depending on the type of the `filter`. For instance, `script` filters are not cached by default while `terms` are. For some filters, the default is more complicated and eg. date range filters are cached unless they use `now` in a non-rounded fashion. This commit adds a 3rd option called `auto`, which becomes the default for all filters. So for all filters a cache wrapper will be returned, and the decision will be made at caching time, per-segment. Here is the default logic: - if there is already a cache entry for this filter in the current segment, then return the cache entry. - else if the doc id set cannot iterate (eg. script filter) then do not cache. - else if the doc id set is already cacheable and it has been used twice or more in the last 1000 filters then cache it. - else if the filter is costly (eg. multi-term) and has been used twice or more in the last 1000 filters then cache it. - else if the doc id set is not cacheable and it has been used 5 times or more in the last 1000 filters, then load it into a cacheable set and cache it. - else return the uncached set. So for instance geo-distance filters and script filters are going to use this new default and are not going to be cached because of their iterators. Similarly, date range filters are going to use this default all the time, but it is very unlikely that those that use `now` in a not rounded fashion will get reused so in practice they won't be cached. `terms`, `range`, ... filters produce cacheable doc id sets with good iterators so they will be cached as soon as they have been used twice. Filters that don't produce cacheable doc id sets such as the `term` filter will need to be used 5 times before being cached. This ensures that we don't spend CPU iterating over all documents matching such filters unless we have good evidence of reuse. One last interesting point about this change is that it also applies to compound filters. So if you keep on repeating the same `bool` filter with the same underlying clauses, it will be cached on its own while up to now it used to never be cached by default. `_cache: true` has been changed to only cache on large segments, in order to not pollute the cache since small segments should not be the bottleneck anyway. However `_cache: false` still has the same semantics. Close #8449		2014-12-18 15:51:36 +01:00
..
analysis	Core: upgrade to current Lucene 5.0.0 snapshot	2014-11-24 05:08:42 -05:00
cat	Core: let Lucene kick off merges	2014-11-25 04:13:57 -05:00
cluster	Core: ignore known idle threads by default in /_nodes/hot_threads	2014-12-17 11:59:31 -05:00
docs	Term Vectors: More consistent naming for term vector[s]	2014-11-21 14:06:44 +01:00
images	[docs] add 2d vis for decay functions and parameters	2014-11-10 10:56:41 +01:00
index-modules	Fixing typo	2014-12-01 10:52:00 +01:00
indices	Docs: Adds documentation for indices.exists_template	2014-11-25 19:36:01 +01:00
mapping	Adding unit test for self intersecting polygons. Relevant to #7751 even/odd discussion	2014-12-16 10:54:39 -06:00
migration	java: QueryBuilders cleanup: remove deprecated	2014-12-03 16:07:34 +01:00
modules	[docs] pedantry	2014-12-17 13:46:39 +01:00
query-dsl	Filter cache: add a `_cache: auto` option and make it the default.	2014-12-18 15:51:36 +01:00
search	Update percolate.asciidoc	2014-12-17 14:05:27 +01:00
setup	Update repositories.asciidoc	2014-12-15 18:04:17 +01:00
testing	Docs: add randomizedtesting-runner to testing-framework.asciidoc	2014-12-07 01:30:58 +09:00
analysis.asciidoc	Add more anchor links to documentation	2013-09-30 13:13:16 -06:00
api-conventions.asciidoc	[docs] formatting and general pedantry	2014-12-02 19:23:48 +01:00
cat.asciidoc	[DOCS] reordered cat apis menu	2014-06-03 11:06:35 +02:00
cluster.asciidoc	[DOCS] Fix HTTP endpoints after stats API changes	2014-01-09 11:30:28 +01:00
docs.asciidoc	Bulk UDP: Removal.	2014-09-11 09:52:09 +02:00
getting-started.asciidoc	Update getting-started.asciidoc	2014-12-17 14:03:38 +01:00
glossary.asciidoc	Migrated documentation into the main repo	2013-08-29 01:24:34 +02:00
index-modules.asciidoc	[core] add best_compression option for Lucene 5.0	2014-12-10 22:13:09 -05:00
index.asciidoc	Updated docs to use v1.4.1 as current	2014-11-26 17:18:37 +01:00
indices.asciidoc	Add forgotten include for upgrade docs.	2014-10-10 10:55:45 -07:00
mapping.asciidoc	Facets: Removal from master.	2014-08-21 10:34:39 +02:00
modules.asciidoc	[DOCS] Fixed link to tribe.asciidoc	2014-01-13 22:01:12 +01:00
query-dsl.asciidoc	Facets: Removal from master.	2014-08-21 10:34:39 +02:00
search.asciidoc	Search Exists API: Checks if any matching documents exist for a given query	2014-07-31 15:42:30 -04:00
setup.asciidoc	Docs: Removed all the added/deprecated tags from 1.x	2014-09-26 21:04:42 +02:00
testing.asciidoc	[DOCS] Test framework documentation	2013-12-02 18:01:45 +01:00