OpenSearch/docs/reference
Zachary Tong ea1794832f Add RareTerms aggregation (#35718)
This adds a `rare_terms` aggregation.  It is an aggregation designed
to identify the long-tail of keywords, e.g. terms that are "rare" or
have low doc counts.

This aggregation is designed to be more memory efficient than the
alternative, which is setting a terms aggregation to size: LONG_MAX
(or worse, ordering a terms agg by count ascending, which has
unbounded error).

This aggregation works by maintaining a map of terms that have
been seen. A counter associated with each value is incremented
when we see the term again.  If the counter surpasses a predefined
threshold, the term is removed from the map and inserted into a cuckoo
filter.  If a future term is found in the cuckoo filter we assume it
was previously removed from the map and is "common".

The map keys are the "rare" terms after collection is done.
2019-07-01 10:30:02 -04:00
..
aggregations Add RareTerms aggregation (#35718) 2019-07-01 10:30:02 -04:00
analysis Allow reloading of search time analyzers (#43313) 2019-06-28 09:55:40 +02:00
cat Enable node roles to be pluggable (#43175) 2019-06-13 15:15:48 -04:00
ccr Add note to CCR docs about mapping/alias updates 2019-06-09 22:57:23 -04:00
cluster Add voting-only master node (#43410) 2019-06-26 08:07:56 +02:00
commands Fix docs typo in the certutil CSR mode (#42593) 2019-05-31 01:03:43 +03:00
data-frames/apis [DOCS] Adds data frame API response codes for allow_no_match (#43666) 2019-06-27 15:17:58 -07:00
docs Reindex max_docs parameter name (#42942) 2019-06-07 12:16:36 +02:00
graph Remove X-Pack centric graph endpoints (#36010) 2018-11-29 07:09:37 -05:00
how-to Docs enhancement for quote_field_suffix. (#43093) 2019-06-11 16:33:12 +01:00
ilm Add note about ILM action ordering (#41771) 2019-05-08 16:42:50 -06:00
images Add RareTerms aggregation (#35718) 2019-07-01 10:30:02 -04:00
index-modules Example of how to set slow logs dynamically per-index (#42384) 2019-06-21 09:30:53 -04:00
indices Update docs for Open/Close API (#43809) 2019-07-01 15:20:36 +02:00
ingest Fix GeoIP custom database directory in docs (#43383) 2019-06-19 13:26:07 -04:00
licensing [DOCS] Remind users to include @ symbol when applying license file (#40688) 2019-04-04 11:50:20 -04:00
mapping Add support for 'flattened object' fields. (#43762) 2019-07-01 12:08:50 +03:00
migration add 7.2.0 release notes 2019-06-26 08:50:11 -05:00
ml [DOCS] Adds data frame API response codes for allow_no_match (#43666) 2019-06-27 15:17:58 -07:00
modules Add voting-only master node (#43410) 2019-06-26 08:07:56 +02:00
monitoring [DOCS] Describe setup for monitoring logs (#42655) 2019-06-20 08:17:27 -07:00
query-dsl [DOCS] Rewrite `terms_set` query (#43060) 2019-06-28 12:57:22 -04:00
release-notes add 7.2.0 release notes 2019-06-26 08:50:11 -05:00
rest-api Add support for 'flattened object' fields. (#43762) 2019-07-01 12:08:50 +03:00
rollup [DOCS] Fix API Quick Reference rollup attribute for Asciidoctor (#42403) 2019-05-28 08:53:20 -04:00
scripting [DOCS] Move 'Scripting' section to top-level navigation. (#42939) 2019-06-06 10:46:02 -04:00
search specifies which index to search in docs for various queries (#43307) (#43428) 2019-06-21 10:15:51 +01:00
security Update TLS configuration in Docker docs (#43816) 2019-07-01 15:33:34 +03:00
settings [ML] Introduce a setting for the process connect timeout (#43234) 2019-06-26 09:22:04 +01:00
setup Avoid IP addresses for bootstrapping in setup docs (#43802) 2019-07-01 12:39:54 +01:00
sql SQL: Increase hard limit for sorting on aggregates (#43220) 2019-06-14 13:51:38 +02:00
testing Uppercasing some docs section title (#37781) 2019-01-24 22:54:55 +01:00
upgrade fix assumption that 6.7 is last 6.x release (#42255) 2019-05-20 14:35:28 -05:00
administering.asciidoc [DOCS] Adds administering section (#43493) 2019-06-24 10:15:25 -07:00
aggregations.asciidoc [Docs] Update aggregations.asciidoc (#29265) 2018-03-28 15:01:45 +02:00
analysis.asciidoc Remove more include_type_name and types from docs (#37601) 2019-01-18 14:11:18 +01:00
api-conventions.asciidoc [Docs] Add note for date patterns used for index search. (#42810) 2019-06-03 22:27:19 +02:00
cat.asciidoc [DOCS] Change `// TESTRESPONSE[_cat]` to `// TESTRESPONSE[non_json]` (#43006) 2019-06-10 09:53:05 -04:00
cluster.asciidoc Add voting-only master node (#43410) 2019-06-26 08:07:56 +02:00
docs.asciidoc Document Seq No powered optimistic concurrency control (#37284) 2019-01-11 07:59:15 -08:00
frozen-indices.asciidoc [DOCS] Change `// TESTRESPONSE[_cat]` to `// TESTRESPONSE[non_json]` (#43006) 2019-06-10 09:53:05 -04:00
getting-started.asciidoc [DOCS] Updated the URL for starting in the cloud. 2019-06-20 13:09:21 -07:00
glossary.asciidoc Update glossary.asciidoc (#41364) 2019-04-19 10:25:07 +02:00
gs-index.asciidoc [DOCS] Adding index file for GS "mini book". 2017-07-18 13:44:08 -07:00
how-to.asciidoc Correct grammar in list in how-to docs 2017-01-17 20:57:22 -05:00
index-modules.asciidoc Default number of shards is now 1 instead of 5 (#43573) 2019-06-25 14:51:07 +02:00
index.asciidoc [DOCS] Change 'X-Pack APIs' section to 'REST APIs' (#43451) 2019-06-26 13:46:12 -04:00
index.x.asciidoc [DOCS] Removes redundant index.asciidoc files (#30707) 2018-05-18 11:05:40 -07:00
indices.asciidoc add split index reference in indices.asciidoc 2017-11-06 12:55:41 +01:00
ingest.asciidoc show a full ingest example in the index page, to let user fast understand ingest node. (#43476) 2019-07-01 08:04:57 +02:00
intro.asciidoc [DOCS] Edited title/subtitle. (#43552) 2019-06-24 15:31:19 -07:00
mapping.asciidoc Clarify the settings around limiting nested mappings. (#42686) 2019-05-30 10:36:38 -07:00
modules.asciidoc [DOCS] Move 'Scripting' section to top-level navigation. (#42939) 2019-06-06 10:46:02 -04:00
query-dsl.asciidoc Update query-dsl.asciidoc (#27669) 2017-12-11 18:06:08 +01:00
redirects.asciidoc [DOCS] Change 'X-Pack APIs' section to 'REST APIs' (#43451) 2019-06-26 13:46:12 -04:00
release-notes.asciidoc add 7.2.0 release notes 2019-06-26 08:50:11 -05:00
scripting.asciidoc [DOCS] Move 'Scripting' section to top-level navigation. (#42939) 2019-06-06 10:46:02 -04:00
search.asciidoc Update max_concurrent_shard_request parameter docs (#42227) 2019-06-12 11:25:03 +02:00
setup.asciidoc Clean up docs regarding recommended JVM (#41356) 2019-04-18 14:17:48 -06:00
testing.asciidoc [Docs] Unify spelling of Elasticsearch (#27567) 2017-11-29 09:44:25 +01:00
upgrade.asciidoc fix assumption that 6.7 is last 6.x release (#42255) 2019-05-20 14:35:28 -05:00