OpenSearch

Commit Graph

Author	SHA1	Message	Date
Clinton Gormley	f1a0e2216a	Docs: Mentioned script_id and script_file parameters across all aggs Closes #10760	2015-04-26 17:30:38 +02:00
Adrien Grand	f4d5914511	Docs: Warn about the fact that min_doc_count=0 might return terms that only belong to different types.	2015-04-21 00:57:57 +02:00
Adrien Grand	aecd9ac515	Aggregations: Speed up include/exclude in terms aggregations with regexps. Today we check every regular expression eagerly against every possible term. This can be very slow if you have lots of unique terms, and even the bottleneck if your query is selective. This commit switches to Lucene regular expressions instead of Java (not exactly the same syntax yet most existing regular expressions should keep working) and uses the same logic as RegExpQuery to intersect the regular expression with the terms dictionary. I wrote a quick benchmark (in the PR) to make sure it made things faster and the same request that took 750ms on master now takes 74ms with this change. Close #7526	2015-04-09 12:12:56 +02:00
Colin Goodheart-Smithe	2520dc78ec	[DOCS] added a note for the default shard_size value	2015-02-25 11:00:55 +00:00
Adrien Grand	95f46f1212	Docs: Use the new experimental annotation. We now have a very useful annotation to mark features or parameters as experimental. Let's use it! This commit replaces some custom text warnings with this annotation and adds this annotation to some existing features/parameters: - inner_hits (unreleased yet) - terminate_after (released in 1.4) - per-bucket doc count errors in the terms agg (released in 1.4) I also tagged with this annotation settings which should either be not needed (like the ability to evict entries from the filter cache based on time) or that are too deep into the way that Elasticsearch works like the Directory implementation or merge settings. Close #9563	2015-02-05 15:29:45 +01:00
Oliver	e412dab63a	Docs: Fix sample query Closes #9472	2015-01-29 15:56:24 +01:00
David Pilato	43a1435d3b	[Docs] fix consistency between examples	2014-11-27 20:29:34 +01:00
Adrien Grand	7ea490dfd1	Aggregations: Return the sum of the doc counts of other buckets. This commit adds a new field to the response of the terms aggregation called `sum_other_doc_count` which is equal to the sum of the doc counts of the buckets that did not make it to the list of top buckets. It is typically useful to have a sector called eg. `other` when using terms aggregations to build pie charts. Example query and response: ```json GET test/_search?search_type=count { "aggs": { "colors": { "terms": { "field": "color", "size": 3 } } } } ``` ```json { [...], "aggregations": { "colors": { "doc_count_error_upper_bound": 0, "sum_other_doc_count": 4, "buckets": [ { "key": "blue", "doc_count": 65 }, { "key": "red", "doc_count": 14 }, { "key": "brown", "doc_count": 3 } ] } } } ``` Close #8213	2014-10-27 12:11:26 +01:00
Andrew O'Brien	33097d901b	Docs: Typo: s/by/be/ Closes #8114	2014-10-16 20:51:58 +02:00
Clinton Gormley	cb00d4a542	Docs: Removed all the added/deprecated tags from 1.x	2014-09-26 21:04:42 +02:00
Colin Goodheart-Smithe	d4e83df3b8	Aggregations: Adds ability to sort on multiple criteria The terms aggregation can now support sorting on multiple criteria by replacing the sort object with an array or sort object whose order signifies the priority of the sort. The existing syntax for sorting on a single criteria also still works. Contributes to #6917 Replaces #7588	2014-09-15 11:08:29 +01:00
markharwood	3c8f8cc090	Aggs enhancement - allow Include/Exclude clauses to use array of terms as alternative to a regex Closes #6782	2014-09-12 15:28:03 +01:00
David Pilato	7fdd3651fa	[docs] Fix typo: resonable - reasonable	2014-09-10 15:57:57 +02:00
Colin Goodheart-Smithe	b127b52fd3	Revert "Aggregations: Adds ability to sort on multiple criteria" This reverts commit `bfedd11ffa`.	2014-09-08 20:27:19 +01:00
Colin Goodheart-Smithe	bfedd11ffa	Aggregations: Adds ability to sort on multiple criteria The terms aggregation can now support sorting on multiple criteria by replacing the sort object with an array or sort object whose order signifies the priority of the sort. The existing syntax for sorting on a single criteria also still works. Contributes to #6917	2014-09-08 15:20:33 +01:00
Clinton Gormley	1bdf79e527	Docs: Added explanation of how to do multi-field terms agg Closes #5100	2014-09-07 11:09:52 +02:00
Colin Goodheart-Smithe	655157c83a	Aggregations: Added an option to show the upper bound of the error for the terms aggregation. This is only applicable when the order is set to _count. The upper bound of the error in the doc count is calculated by summing the doc count of the last term on each shard which did not return the term. The implementation calculates the error by summing the doc count for the last term on each shard for which the term IS returned and then subtracts this value from the sum of the doc counts for the last term from ALL shards. Closes #6696	2014-07-25 14:24:24 +01:00
Simon Willnauer	5bfea56457	[DOCS] move all coming tags to added in master	2014-07-23 16:37:19 +02:00
Adrien Grand	abeefbddea	Docs: Update documentation about execution hints for the terms aggregation.	2014-07-21 11:55:57 +02:00
markharwood	724129e6ce	Aggregations optimisation for memory usage. Added changes to core Aggregator class to support a new mode of deferred collection. A new "breadth_first" results collection mode allows upper branches of aggregation tree to be calculated and then pruned to a smaller selection before advancing into executing collection on child branches. Closes #6128	2014-06-06 15:59:51 +01:00
Simon Willnauer	9d5507047f	Update Documentation Feature Flags [1.2.0]	2014-05-22 15:06:42 +02:00
Britta Weber	08e57890f8	use shard_min_doc_count also in TermsAggregation This was discussed in issue #6041 and #5998 . closes #6143	2014-05-14 14:10:04 +02:00
Clinton Gormley	5b93255ec8	[DOCS] Added "Aggregation" to all aggs titles	2014-05-13 01:35:58 +02:00
Martijn van Groningen	ade1d0ef57	Added global ordinals (unique incremental numbering for terms) to fielddata. Added a terms aggregation implementations that work on global ordinals, which is also the default. Closes #5672	2014-04-07 11:06:41 +07:00
uboness	9d0fc76f54	Added support for sorting buckets based on sub aggregations Supports sorting on sub-aggs down the current hierarchy. This is supported as long as the aggregation in the specified order path are of a single-bucket type, where the last aggregation in the path points to either a single-bucket aggregation or a metrics one. If it's a single-bucket aggregation, the sort will be applied on the document count in the bucket (i.e. doc_count), and if it is a metrics type, the sort will be applied on the pointed out metric (in case of a single-metric aggregations, such as avg, the sort will be applied on the single metric value) NOTE: this commit adds a constraint on what should be considered a valid aggregation name. Aggregations names must be alpha-numeric and may contain '-' and '_'. Closes #5253	2014-03-06 00:05:27 +01:00
uboness	d335630e57	[docs] fixed errors in aggs docs - error in nested aggs example - error in terms aggs example	2014-02-13 20:36:02 +01:00
Boaz Leskes	9bf263c741	[DOCS] Fix terms agg value script example	2014-02-06 16:35:49 +01:00
uboness	dd389d1cc5	Made all multi-bucket aggs return consistent response format Closes #4926	2014-01-28 17:46:57 +01:00
Adrien Grand	9282ae4ffd	Terms aggregations: make size=0 return all terms. Terms aggregations return up to `size` terms, so up to now, the way to get all matching terms back was to set `size` to an arbitrary high number that would be larger than the number of unique terms. Terms aggregators already made sure to not allocate memory based on the `size` parameter so this commit mostly consists in making `0` an alias for the maximum integer value in the TermsParser. Close #4837	2014-01-22 11:05:10 +01:00
Dawid Weiss	ae71b25145	Documentation typo.	2014-01-20 11:51:08 +01:00
Adrien Grand	5c237fe834	Add new option `min_doc_count` to terms and histogram aggregations. `min_doc_count` is the minimum number of hits that a term or histogram key should match in order to appear in the response. `min_doc_count=0` replaces `compute_empty_buckets` for histograms and will behave exactly like facets' `all_terms=true` for terms aggregations. Close #4662	2014-01-13 10:09:38 +01:00
Adrien Grand	36bd9cc432	Aggregations: Ordinals-based string bucketing support. When the ValuesSource has ordinals, terms ordinals are used as a cache key to bucket ordinals. This can make terms aggregations on String terms significantly faster. Close #4350	2013-12-13 15:34:02 +01:00
uboness	0d6a35b9a7	- Added support for term filtering based on include/exclude regex on the terms agg - Added javadoc to the TermsBuilder Closes #4267	2013-11-29 13:46:48 +01:00
uboness	afb0d119e4	- Added docs for the value_count aggregation - Fixed typos in the terms facets docs - Fixed aggregation docs layout - Added docs for shard_size in term aggregation	2013-11-29 12:35:42 +01:00
uboness	c7f6c5266d	initial commit of the aggregations module Closes #3300	2013-11-24 03:13:08 -08:00

35 Commits