OpenSearch

Commit Graph

Author	SHA1	Message	Date
Tanguy Leroux	3d07bce504	[Docs] Fix tophits-aggregation.asciidoc	2017-08-30 13:06:44 +02:00
Tanguy Leroux	643eb286dc	[Docs] Convert remaining code snippets in docs (#26422 ) This commit converts the last remaining code snippets so that they are now testable.	2017-08-30 12:11:10 +02:00
Jim Ferenczi	977dcfe789	Deprecate global_ordinals_hash and global_ordinals_low_cardinality (#26173 ) * Deprecate global_ordinals_hash and global_ordinals_low_cardinality This change deprecates the `global_ordinals_hash` and `global_ordinals_low_cardinality` and makes the `global_ordinals` execution hint choose internally if global ords should be remapped or use the segment ord directly. These hints are too sensitive and expert to be exposed and we should be able to take the right decision internally based on the agg tree.	2017-08-21 19:12:27 +02:00
Christoph Büscher	5dae277bb2	Support distance units in GeoHashGrid aggregation precision (#26291 ) Currently the `precision` parameter must be a precision level in the range of [1,12]. In #5042 it was suggested also supporting distance units like "1km" to automatically approcimate the needed precision level. This change adds this support to the Rest API by making use of GeoUtils#geoHashLevelsForPrecision. Plain integer values without a unit are still treated as precision levels like before. Distance values that are too small to be represented by a precision level of 12 (values approx. less than 0.056m) are rejected. Closes #5042	2017-08-21 17:29:28 +02:00
Nik Everett	7e76b2a8c3	Docs: fold section into current chapter In #25602 we added a new chapter on aggregating by day of the week. We intended to add a new section but we were missing a single `=`.	2017-08-17 11:19:02 -04:00
Nik Everett	6d2c40e546	Enforce that responses in docs are valid json (#26249 ) All of the snippets in our docs marked with `// TESTRESPONSE` are checked against the response from Elasticsearch but, due to the way they are implemented they are actually parsed as YAML instead of JSON. Luckilly, all valid JSON is valid YAML! Unfurtunately that means that invalid JSON has snuck into the exmples! This adds a step during the build to parse them as JSON and fail the build if they don't parse. But no! It isn't quite that simple. The displayed text of some of these responses looks like: ``` { ... "aggregations": { "range": { "buckets": [ { "to": 1.4436576E12, "to_as_string": "10-2015", "doc_count": 7, "key": "-10-2015" }, { "from": 1.4436576E12, "from_as_string": "10-2015", "doc_count": 0, "key": "10-2015-" } ] } } } ``` Note the `...` which isn't valid json but we like it anyway and want it in the output. We use substitution rules to convert the `...` into the response we expect. That yields a response that looks like: ``` { "took": $body.took,"timed_out": false,"_shards": $body._shards,"hits": $body.hits, "aggregations": { "range": { "buckets": [ { "to": 1.4436576E12, "to_as_string": "10-2015", "doc_count": 7, "key": "-10-2015" }, { "from": 1.4436576E12, "from_as_string": "10-2015", "doc_count": 0, "key": "10-2015-" } ] } } } ``` That is what the tests consume but it isn't valid JSON! Oh no! We don't want to go update all the substitution rules because that'd be huge and, ultimately, wouldn't buy much. So we quote the `$body.took` bits before parsing the JSON. Note the responses that we use for the `_cat` APIs are all converted into regexes and there is no expectation that they are valid JSON. Closes #26233	2017-08-17 09:02:10 -04:00
Zachary Tong	829f7cb658	CONSOLEify ip-range bucket agg docs Related #18160	2017-08-03 17:19:54 -04:00
Zachary Tong	e7eda5e1be	CONSOLEify scripted-metric agg docs Related #18160	2017-08-03 17:19:54 -04:00
Zachary Tong	d8414ffa29	CONSOLEify percentile and percentile-ranks docs Related #18160	2017-08-02 17:47:27 -04:00
Zachary Tong	268923ebdc	CONSOLEify extended_stats docs Related #18160	2017-08-02 16:13:30 -04:00
Clinton Gormley	ff4a2519f2	Update experimental labels in the docs (#25727 ) Relates https://github.com/elastic/elasticsearch/issues/19798 Removed experimental label from: * Painless * Diversified Sampler Agg * Sampler Agg * Significant Terms Agg * Terms Agg document count error and execution_hint * Cardinality Agg precision_threshold * Pipeline Aggregations * index.shard.check_on_startup * index.store.type (added warning) * Preloading data into the file system cache * foreach ingest processor * Field caps API * Profile API Added experimental label to: * Moving Average Agg Prediction Changed experimental to beta for: * Adjacency matrix agg * Normalizers * Tasks API * Index sorting Labelled experimental in Lucene: * ICU plugin custom rules file * Flatten graph token filter * Synonym graph token filter * Word delimiter graph token filter * Simple pattern tokenizer * Simple pattern split tokenizer Replaced experimental label with warning that details may change in the future: * Analysis explain output format * Segments verbose output format * Percentile Agg compression and HDR Histogram * Percentile Rank Agg HDR Histogram	2017-07-18 14:06:22 +02:00
Simon Willnauer	e81804cfa4	Add a shard filter search phase to pre-filter shards based on query rewriting (#25658 ) Today if we search across a large amount of shards we hit every shard. Yet, it's quite common to search across an index pattern for time based indices but filtering will exclude all results outside a certain time range ie. `now-3d`. While the search can potentially hit hundreds of shards the majority of the shards might yield 0 results since there is not document that is within this date range. Kibana for instance does this regularly but used `_field_stats` to optimize the indexes they need to query. Now with the deprecation of `_field_stats` and it's upcoming removal a single dashboard in kibana can potentially turn into searches hitting hundreds or thousands of shards and that can easily cause search rejections even though the most of the requests are very likely super cheap and only need a query rewriting to early terminate with 0 results. This change adds a pre-filter phase for searches that can, if the number of shards are higher than a the `pre_filter_shard_size` threshold (defaults to 128 shards), fan out to the shards and check if the query can potentially match any documents at all. While false positives are possible, a negative response means that no matches are possible. These requests are not subject to rejection and can greatly reduce the number of shards a request needs to hit. The approach here is preferable to the kibana approach with field stats since it correctly handles aliases and uses the correct threadpools to execute these requests. Further it's completely transparent to the user and improves scalability of elasticsearch in general on large clusters.	2017-07-12 22:19:20 +02:00
matarrese	2eafbaf759	Document aggregating by day of the week (#25602 ) Add documentation for aggregating by day of the week. Closes #24660	2017-07-07 14:16:53 -04:00
Clinton Gormley	0170e0e8d3	Remove usage of multi-types from the docs and added a page explaining type removal (#25543 ) Closes #25401	2017-07-05 12:30:19 +02:00
Alexander Kazakov	64abc47ab0	[Docs] Fix documentation for percentiles bucket aggregation (#25229 )	2017-06-15 10:16:32 +02:00
Ryan Ernst	a03b6c2fa5	Scripting: Change keys for inline/stored scripts to source/id (#25127 ) This commit adds back "id" as the key within a script to specify a stored script (which with file scripts now gone is no longer ambiguous). It also adds "source" as a replacement for "code". This is in an attempt to normalize how scripts are specified across both put stored scripts and script usages, including search template requests. This also deprecates the old inline/stored keys.	2017-06-09 08:29:25 -07:00
Colin Goodheart-Smithe	5e7a79636d	[DOCS] Clarify behaviour of scripted-metric arg with empty parent buckets	2017-06-02 11:00:27 +01:00
Tanguy Leroux	528bd25fa7	Add superset size to Significant Term REST response (#24865 ) This commit adds a new bg_count field to the REST response of SignificantTerms aggregations. Similarly to the bg_count that already exists in significant terms buckets, this new bg_count field is set at the aggregation level and is populated with the superset size value.	2017-06-02 09:45:15 +02:00
Tanguy Leroux	28d97df67c	Add document count to Matrix Stats aggregation response (#24776 ) This commit adds a `doc_count` field to the response body of Matrix Stats aggregation. It exposes the number of documents involved in the computation of statistics, a value that can already be retrieved using the method MatrixStats.getDocCount() in the Java API.	2017-05-30 09:39:41 +02:00
markharwood	b7197f5e21	SignificantText aggregation - like significant_terms, but for text (#24432 ) * SignificantText aggregation - like significant_terms but doesn’t require fielddata=true, recommended used with `sampler` agg to limit expense of tokenizing docs and takes optional `filter_duplicate_text`:true setting to avoid stats skew from repeated sections of text in search results. Closes #23674	2017-05-24 13:46:43 +01:00
Ryan Ernst	463fe2f4d4	Scripting: Remove file scripts (#24627 ) This commit removes file scripts, which were deprecated in 5.5. closes #21798	2017-05-17 14:42:25 -07:00
Zachary Tong	a2845c86fe	CONSOLEify some more aggregation docs Related #18160	2017-05-16 17:25:24 -04:00
Vlad Holubiev	557390d7d1	Fix typo in example (grades_count -> types_count) (#24635 ) Looks like `doc.grade` was used for examples before. But not anymore - https://www.elastic.co/guide/en/elasticsearch/reference/2.4/search-aggregations-metrics-valuecount-aggregation.html	2017-05-15 14:08:46 -04:00
qwerty4030	e7d352b489	Compound order for histogram aggregations. (#22343 ) This commit adds support for histogram and date_histogram agg compound order by refactoring and reusing terms agg order code. The major change is that the Terms.Order and Histogram.Order classes have been replaced/refactored into a new class BucketOrder. This is a breaking change for the Java Transport API. For backward compatibility with previous ES versions the (date)histogram compound order will use the first order. Also the _term and _time aggregation order keys have been deprecated; replaced by _key. Relates to #20003: now that all these aggregations use the same order code, it should be easier to move validation to parse time (as a follow up PR). Relates to #14771: histogram and date_histogram aggregation order will now be validated at reduce time. Closes #23613: if a single BucketOrder that is not a tie-breaker is added with the Java Transport API, it will be converted into a CompoundOrder with a tie-breaker.	2017-05-11 18:06:26 +01:00
Suhas Karanth	09c5fbfd00	Docs: Correct description of example (#24541 ) Copy and paste error.	2017-05-09 15:18:43 -04:00
Zachary Tong	4e49c618f2	CONSOLEify Stats Aggregation docs (#24373 )	2017-05-01 13:33:24 -04:00
Zachary Tong	130f1a56f1	Re-enable doc testing for Pipeline Aggregations (#24374 ) * Re-enable doc testing for Pipeline Aggregations Also adds a response + test for movavg pipeline	2017-05-01 13:30:51 -04:00
Christoph Büscher	16a7cbe463	Add `count` value to rest output of `geo_centroid` (#24387 ) Currently we don't write the count value to the geo_centroid aggregation rest response, but it is provided via the java api and the count() method in the GeoCentroid interface. We should add this parameter to the rest output and also provide it via the getProperty() method.	2017-04-28 16:25:22 +02:00
Adrien Grand	1be2800120	Only allow one type on 7.0 indices (#24317 ) This adds the `index.mapping.single_type` setting, which enforces that indices have at most one type when it is true. The default value is true for 6.0+ indices and false for old indices. Relates #15613	2017-04-27 08:43:20 +02:00
Suhas Karanth	cee76295ca	Update aggs reference documentation for 'keyed' options (#23758 ) Add 'keyed' parameter documentation for following: - Date Histogram Aggregation - Date Range Aggregation - Geo Distance Aggregation - Histogram Aggregation - IP range aggregation - Percentiles Aggregation - Percentile Ranks Aggregation	2017-04-18 15:57:50 +02:00
Adrien Grand	4632661bc7	Upgrade to a Lucene 7 snapshot (#24089 ) We want to upgrade to Lucene 7 ahead of time in order to be able to check whether it causes any trouble to Elasticsearch before Lucene 7.0 gets released. From a user perspective, the main benefit of this upgrade is the enhanced support for sparse fields, whose resource consumption is now function of the number of docs that have a value rather than the total number of docs in the index. Some notes about the change: - it includes the deprecation of the `disable_coord` parameter of the `bool` and `common_terms` queries: Lucene has removed support for coord factors - it includes the deprecation of the `index.similarity.base` expert setting, since it was only useful to configure coords and query norms, which have both been removed - two tests have been marked with `@AwaitsFix` because of #23966, which we intend to address after the merge	2017-04-18 15:17:21 +02:00
Andrew Selden	f8b15abe9a	Update reference docs for geocentroid aggregation. (#24141 ) This includes a link to the Wikipedia page explaining what a centroid is. Closes #24140	2017-04-17 21:27:43 -04:00
Ulugbek Baymuradov	9cb477d387	Update filter-aggregation.asciidoc (#24138 ) Fix a discrepancy between the example and the prose.	2017-04-17 18:46:13 -04:00
Suhas Karanth	777b5a3c16	Correct documentation for Min Bucket Aggregation (#23867 )	2017-04-05 12:39:37 +02:00
Nik Everett	5f91241f57	CONSOLEify geo aggregation docs Turns the top example in each of the geo aggregation docs into a working example that can be opened in CONSOLE. Subsequent examples can all also be opened in console and will work after you've run the first example. All examples are tested as part of the build.	2017-03-30 21:28:52 -04:00
Christoph Büscher	413bf05956	Docs: Add comma to reverse nested agg snippet	2017-03-17 14:07:18 +01:00
msancho	a37c759ba2	Fixed typo in documentation (#23406 ) * Fixed typo in documentation The option in "gap_policy" "insert_zeros" was missing a trailing "s" * Update movavg-aggregation.asciidoc	2017-03-01 15:22:26 +01:00
Randall Britten	c54fa177ef	Docs: Fixed Parameters tables to use defaults col (#23396 ) Occurred in a few places for pipeline aggregates.	2017-03-01 14:47:21 +01:00
Randall Britten	05fd2eca6f	Docs: corrected "and" --> "an" (#23376 )	2017-02-27 14:38:29 -05:00
Randall Britten	98e19cced4	Docs: Corrected definition of type param of children agg (#23377 )	2017-02-27 14:38:28 -05:00
Tanguy Leroux	e2e5937455	Use `typed_keys` parameter to prefix suggester names by type in search responses (#23080 ) This pull request reuses the typed_keys parameter added in #22965, but this time it applies it to suggesters. When set to true, the suggester names in the search response will be prefixed with a prefix that reflects their type.	2017-02-10 10:53:38 +01:00
Tanguy Leroux	3553522328	Add parameter to prefix aggs name with type in search responses (#22965 ) This pull request adds a new parameter to the REST Search API named `typed_keys`. When set to true, the aggregation names in the search response will be prefixed with a prefix that reflects the internal type of the aggregation. Here is a simple example: ``` GET /_search?typed_keys { "aggs": { "tweets_per_user": { "terms": { "field": "user" } } }, "size": 0 } ``` And the response: ``` { "aggs": { "sterms:tweets_per_user": { ... } } } ``` This parameter is intended to make life easier for REST clients that could parse back the prefix and could detect the type of the aggregation to parse. It could also be implemented for suggesters.	2017-02-09 11:19:04 +01:00
Nik Everett	0c011cb290	Docs: CONSOLEify histogram aggregation docs This adds the `COPY AS CURL` and `VIEW IN CONSOLE` links to the docs and causes the snippets to be tested during Elasticsearch's build. Relates to #18160	2017-02-07 16:09:32 -05:00
Nik Everett	245aa0404a	Docs: CONSOLEify sum aggregation docs This adds the `COPY AS CURL` and `VIEW IN CONSOLE` buttons to the docs and makes the build execute the snippets as part of `docs:check`. Relates to #18160	2017-02-07 14:18:54 -05:00
Nik Everett	274ee30d34	Docs: CONSOLEify the avg aggregation docs This creates the `COPY AS CURL` and `VIEW IN CONSOLE` buttons and makes the build test the examples. Relates to #18160	2017-02-07 13:48:27 -05:00
Jun Ohtani	7ea457955d	Merge pull request #22879 from johtani/fix_documentation_error_in_date_histogram [Doc]Not support "M" time unit in offset param	2017-02-03 16:40:08 +09:00
Nicholas Knize	b41d5747f0	Reduce GeoDistance insanity GeoDistance query, sort, and scripts make use of a crazy GeoDistance enum for handling 4 different ways of computing geo distance: SLOPPY_ARC, ARC, FACTOR, and PLANE. Only two of these are necessary: ARC, PLANE. This commit removes SLOPPY_ARC, and FACTOR and cleans up the way Geo distance is computed.	2017-02-02 12:39:42 -06:00
markharwood	9e8e556b08	Build fix for broken docs build	2017-01-31 10:27:06 +00:00
markharwood	c0d525b108	[DOCS] [TEST] enhancement - added CONSOLE scripts for sampler aggs (#22869 ) Added missing CONSOLE scripts to documentation for sampler and diversified_sampler aggs. Includes new StackOverflow index setup in build.gradle Closes #22746 * Formatting tweaks	2017-01-31 09:45:25 +00:00
Jun Ohtani	94933f9d19	[Doc]Not support "M" time unit in offset param	2017-01-31 18:23:38 +09:00
Mathieu Berube	e0b8e45cc5	Fix typo - mergins to margins (#22839 )	2017-01-30 13:52:32 +01:00
Nik Everett	d704a880e7	Add tests for top_hits aggregation (#22754 ) Add unit tests for `TopHitsAggregator` and convert some snippets in docs for `top_hits` aggregation to `// CONSOLE`. Relates to #22278 Relates to #18160	2017-01-25 16:15:50 -05:00
Nik Everett	da8740128b	Docs: CONSOLE-ify value_count aggregation docs Adds the `VIEW IN CONSOLE` and `COPY AS CURL` links to the snippets in the `value_count` docs and causes the build to execute the snippets for testing. Release #18160	2017-01-23 10:07:29 -05:00
markharwood	87495750ff	Docs fix - Added missing link to new Adjacency-matrix agg	2017-01-23 10:18:30 +00:00
Nik Everett	a99bddcc7e	CONSOLE-ify filter aggregation docs This adds the `VIEW IN CONSOLE` and `COPY AS CURL` links to the snippet and causes the build to execute the snippet as a test. Relates to #18160	2017-01-23 01:32:56 -05:00
Nik Everett	40e2645177	CONSOLE-ify date_range aggregation docs This adds the `VIEW IN CONSOLE` and `COPY AS CURL` links to the snippets in the docs for the `date_range` aggregation and tests those snippets as part of the build. Relates to #18160	2017-01-22 23:38:45 -05:00
Nik Everett	f7524fbdef	CONSOLE-ify date histogram docs This adds the `VIEW IN SENSE` and `COPY AS CURL` links and has the build automatically execute the snippets and verify that they work. Relates to #18160	2017-01-20 16:23:28 -05:00
Nik Everett	c2a580304b	CONSOLE-ify min and max aggregation docs Adds the `VIEW IN CONSOLE` and `COPY AS CURL` links to the docs and makes the build automatically test them. Relates to #18160	2017-01-20 15:33:00 -05:00
Nik Everett	8c856eaa9f	CONSOLE-ify global-aggregation.asciidoc Adds the `VIEW IN CONSOLE` and `COPY AS CURL` links to the example `global` aggregation. Also improves the example by adding a non-`global` aggregation to compare it to. Relates to #18160	2017-01-20 14:36:51 -05:00
markharwood	f01784205f	New AdjacencyMatrix aggregation Similar to the Filters aggregation but only supports "keyed" filter buckets and automatically "ANDs" pairs of filters to produce a form of adjacency matrix. The intersection of buckets "A" and "B" is named "A&B" (the choice of separator is configurable). Empty intersection buckets are removed from the final results. Closes #22169	2017-01-20 15:49:31 +00:00
Jim Ferenczi	433c822d4f	Promote longs to doubles when a terms agg mixes decimal and non-decimal numbers (#22449 ) * Promote longs to doubles when a terms agg mixes decimal and non-decimal number This change makes the terms aggregation work when the buckets coming from different indices are a mix of decimal numbers and non-decimal numbers. In this case non-decimal number (longs) are promoted to decimal (double) which can result in a loss of precision for big numbers. Fixes #22232	2017-01-10 11:50:56 +01:00
Johannes Kanavin	27c57aeebe	Fixed id's of 'worked example' in scripted metric aggs docs (#22430 )	2017-01-05 14:37:27 -05:00
Florian Hopf	0e18782d11	Update bucket-script-aggregation.asciidoc (#22219 ) Example is missing "params." for painless	2016-12-16 12:39:22 +01:00
Adrien Grand	787519ee4c	Fix `other_bucket` on the `filters` agg to be enabled if a key is set. (#21994 ) Closes #21951	2016-12-09 09:48:48 +01:00
Colin Goodheart-Smithe	8006b105f3	Update order examples to use max instead of avg (#22032 ) The use of the avg aggregation for sorting the terms aggregation is not encouraged since it has unbounded error. This changes the examples to use the max aggregation which does not suffer the same issues	2016-12-07 16:00:24 +00:00
Adrin Jalali	235e6acd73	typo fix (and -> any) (#21860 )	2016-11-30 12:56:00 +01:00
Carney Wu	2c0db3909f	include not work in 5.x anymore (#21815 ) include not work in 5.x anymore use includes instead	2016-11-28 11:02:59 +01:00
Adrien Grand	4c46ffcecf	Document that min/max operate on the double representation of the data. Relates #9545	2016-11-28 10:34:43 +01:00
markharwood	aa60e5cc07	Aggregations - support for partitioning set of terms used in aggregations so that multiple requests can be done without trying to compute everything in one request. Closes #21487	2016-11-24 15:10:46 +00:00
Chris Fritz	546fa92d61	Fix typo in filters aggregation docs (#21690 )	2016-11-21 12:52:45 +01:00
Christoph Büscher	4ccd8e79c1	Docs: Clarify date_histogram bucket sizes for DST time zones Added a warning note that clarifies bucket sizes diverging from the intended `interval` size when using a time zone that has DST changes. Closes #18805	2016-11-16 09:40:07 +01:00
Nik Everett	7dcff27aea	Update docs for scripted metric agg Now that the default language is painless the examples didn't work at all. This fixes them. Closes #21536	2016-11-15 11:47:17 -05:00
Sumit Gupta	e53405f4f3	Update geohashgrid-aggregation.asciidoc (#21530 )	2016-11-15 10:49:02 +01:00
Clinton Gormley	30d342c87c	Update significantterms-aggregation.asciidoc Fix scripted significant terms example to use `params.` prefix for painless	2016-11-14 09:40:04 +01:00
Adrien Grand	263af27d76	Fix docs example after #21218 .	2016-11-07 14:57:20 +01:00
markharwood	dd21aa41be	Docs fix - Diversified sampler agg had incorrect title and example Closes #21347	2016-11-07 10:46:22 +00:00
Clinton Gormley	5ec2ba3166	Update scripted-metric-aggregation.asciidoc Removed docs for `reduce_params` Closes #20917	2016-10-17 19:31:30 +02:00
Robin Clarke	bbe6555b7a	Docs: your -> you're (#20883 )	2016-10-12 11:09:34 -04:00
Pascal Borreli	fcb01deb34	Fixed typos (#20843 )	2016-10-10 14:51:47 -06:00
Nik Everett	9271c0302f	CONSOLEify some aggs docs Cleans up the example result in `children-aggregation` so that it matches the example data. Relates to #18160	2016-10-03 09:22:56 -04:00
Nik Everett	5cff2a046d	Remove most of the need for `// NOTCONSOLE` and be much more stingy about what we consider a console candidate. * Add `// CONSOLE` to check-running * Fix version in some snippets * Mark groovy snippets as groovy * Fix versions in plugins * Fix language marker errors * Fix language parsing in snippets This adds support for snippets who's language is written like `[source, txt]` and `["source","js",subs="attributes,callouts"]`. This also makes language required for snippets which is nice because then we can be sure we can grep for snippets in a particular language.	2016-09-06 10:32:54 -04:00
Jim Ferenczi	4682fc34ae	Add the ability to disable the retrieval of the stored fields entirely This change adds a special field named _none_ that allows to disable the retrieval of the stored fields in a search request or in a TopHitsAggregation. To completely disable stored fields retrieval (including disabling metadata fields retrieval such as _id or _type) use _none_ like this: ```` POST _search { "stored_fields": "_none_" } ````	2016-08-24 16:40:08 +02:00
Jack Conradson	131e370a16	Make Painless the default scripting language. Closes #20017	2016-08-22 17:38:02 -07:00
Clinton Gormley	de208cf78c	Fied bad asciidoc	2016-08-18 14:08:58 +02:00
Clinton Gormley	31e5e0b17f	Document that pipeline aggs cannot be used for sorting Closes #20037	2016-08-18 13:52:45 +02:00
Nik Everett	c66db9a81e	Add `// CONSOLE` to much of pipeline agg docs Most of the examples in the pipeline aggregation docs use a small "sales" test data set and I converted all of the examples that use it to `// CONSOLE`. There are still a bunch of snippets in the pipeline aggregation docs that aren't `// CONSOLE` so they aren't tested. Most of them are "this is the most basic form of this aggregation" so they are more immune to errors and bit rot then the examples that I converted. I'd like to do something with them as well but I'm not sure what. Also, the moving average docs and serial diff docs didn't get a lot of love from this pass because they don't use the test data set or follow the same general layout. Relates to #18160	2016-08-17 09:26:41 -04:00
Thomas Decaux	bf2e5cb988	[docs] Remove extra "s" at buckets_path snippet Closes #19907	2016-08-10 08:56:00 -04:00
Deb Adair	c522568d1b	Docs: Fixed typos in example buckets_paths > buckets_path.	2016-08-09 14:37:37 -07:00
Ryan Biesemeyer	9f1525255a	Update link to mapper-murmur3 plugin in card docs (#19788 )	2016-08-04 15:56:59 +02:00
Adrien Grand	a0818d3b87	Split regular histograms from date histograms. #19551 Currently both aggregations really share the same implementation. This commit splits the implementations so that regular histograms can support decimal intervals/offsets and compute correct buckets for negative decimal values. However the response API is still the same. So for intance both regular histograms and date histograms will produce an `org.elasticsearch.search.aggregations.bucket.histogram.Histogram` aggregation. The optimization to compute an identifier of the rounded value and the rounded value itself has been removed since it was only used by regular histograms, which now do the rounding themselves instead of relying on the Rounding abstraction. Closes #8082 Closes #4847	2016-08-03 08:39:48 +02:00
Adrien Grand	dcc598c414	Make the heuristic to compute the default shard size less aggressive. The current heuristic to compute a default shard size is pretty aggressive, it returns `max(10, number_of_shards * size)` as a value for the shard size. I think making it less aggressive has the benefit that it would reduce the likelyness of running into OOME when there are many shards (yearly aggregations with time-based indices can make numbers of shards in the thousands) and make the use of breadth-first more likely/efficient. This commit replaces the heuristic with `size * 1.5 + 10`, which is enough to have good accuracy on zipfian distributions.	2016-07-29 09:59:29 +02:00
Jared McQueen	d97b3fd817	[docs] missing a comma in the terms aggregation example	2016-07-27 12:59:38 -04:00
Colin Goodheart-Smithe	3f344d3154	[DOCS] fix documentation for selecting algorithm for percentiles agg	2016-07-27 08:48:51 +01:00
Colin Goodheart-Smithe	7ed64af639	[DOCS] fix callout in buckets path docs	2016-07-26 11:33:54 +01:00
Colin Goodheart-Smithe	2c12c3e628	Add _bucket_count option to buckets_path This change adds a new special path to the buckets_path syntax `_bucket_count`. This new option will return the number of buckets for a multi-bucket aggregation, which can then be used in pipeline aggregations. Closes #19553	2016-07-26 09:28:21 +01:00
Adrien Grand	1ed6c5d110	Docs: Add more points to the chart that gives accuracy for the cardinality aggregation. This also adds instructions how to regenerate the chart.	2016-07-20 10:37:12 +02:00
Adrien Grand	bde99bad2e	Use a static default precision for the cardinality aggregation. #19215 Today the default precision for the cardinality aggregation depends on how many parent bucket aggregations it had. The reasoning was that the more parent bucket aggregations, the more buckets the cardinality had to be computed on. And this number could be huge depending on what the parent aggregations actually are. However now that we run terms aggregations in breadth-first mode by default when there are sub aggregations, it is less likely that we have to run the cardinality aggregation on kagilions of buckets. So we could use a static default, which will be less confusing to users.	2016-07-18 11:30:41 +02:00
Jim Ferenczi	afe99fcdcd	Restore reverted change now that alpha4 is out: Rename `fields` to `stored_fields` and add `docvalue_fields` `stored_fields` parameter will no longer try to retrieve fields from the _source but will only return stored fields. `fields` will throw an exception if the user uses it. Add `docvalue_fields` as an adjunct to `fielddata_fields` which is deprecated. `docvalue_fields` will try to load the value from the docvalue and fallback to fielddata cache if docvalues are not enabled on that field. Closes #18943	2016-07-04 10:39:49 +02:00
Leon Weidauer	1297a707da	non-binary gender option in term aggr. example (#19188 ) * non-binary gender option in term aggr. example * replace gender with music genre for term aggregation docs	2016-07-01 14:59:03 +02:00
Jason Tedor	00356edd33	Clarify time units usage in docs This commit clarifies the distinction between supported time units for durations and supported time units for durations in the docs. Relates #19159	2016-06-29 17:02:15 -04:00
Robert Muir	6d52cec2a0	Merge pull request #19092 from rmuir/more_painless_docs cutover some docs to painless	2016-06-28 13:40:25 -04:00
Jim Ferenczi	eb1e231a63	Revert "Rename `fields` to `stored_fields` and add `docvalue_fields`" This reverts commit `2f46f53dc8`.	2016-06-27 17:20:32 +02:00
Robert Muir	6fc1a22977	cutover some docs to painless	2016-06-27 09:55:16 -04:00
Jerry Liu	1863ab95f8	fixed typo 'if' -> 'is' (#19051 )	2016-06-27 14:20:23 +02:00
Nik Everett	ee2a77143b	Docs: Convert aggs/misc to CONSOLE They should be more readable and tested during the build.	2016-06-22 14:52:06 -04:00
Jim Ferenczi	2f46f53dc8	Rename `fields` to `stored_fields` and add `docvalue_fields` `stored_fields` parameter will no longer try to retrieve fields from the _source but will only return stored fields. `fields` will throw an exception if the user uses it. Add `docvalue_fields` as an adjunct to `fielddata_fields` which is deprecated. `docvalue_fields` will try to load the value from the docvalue and fallback to fielddata cache if docvalues are not enabled on that field. Closes #18943	2016-06-22 17:38:30 +02:00
Jim Ferenczi	fb2a48d0f0	Revert "Remove support for sorting terms aggregation by ascending count" This is delayed after alpha4 since Kibana relies on it.	2016-06-17 17:14:01 +02:00
Jim Ferenczi	755721953b	Remove support for sorting terms aggregation by ascending count closes #17614	2016-06-17 15:06:49 +02:00
Glen Smith	5284c5094d	grammar	2016-06-17 10:09:21 +02:00
Jim Ferenczi	ad232aebbe	Set collection mode to breadth_first in the terms aggregation when the cardinality of the field is unknown or smaller than the requested size. closes #9825	2016-06-16 11:33:40 +02:00
Colin Goodheart-Smithe	cfd3356ee3	Remove size 0 options in aggregations This removes the ability to set `size: 0` in the `terms`, `significant_terms` and `geohash_grid` aggregations for the reasons described in https://github.com/elastic/elasticsearch/issues/18838 Closes #18838	2016-06-14 13:07:02 +01:00
Nicholas Knize	371c73e140	refactor matrix agg documentation from modules to main agg section	2016-06-06 07:39:00 -05:00
Adrien Grand	638da06c1d	Add back support for `ip` range aggregations. #17859 This commit adds support for range aggregations on `ip` fields. However it will only work on 5.x indices. Closes #17700	2016-05-13 17:22:01 +02:00
Robert Muir	c5532d3df0	add a rest test for this that seems to work, fix the documentation. thanks @s1monw	2016-05-11 16:07:08 -04:00
Jim Ferenczi	052191f2a2	Add the ability to use the breadth_first mode with nested aggregations (such as `top_hits`) which require access to score information. The score is recomputed lazily for each document belonging to a top bucket. Relates to #9825	2016-05-04 15:35:45 +02:00
Sergii Golubev	2f6405ee27	serial-diff-aggregation.asciidoc: fix a mistake (#17950 )	2016-04-25 07:45:54 -04:00
ericamick	069eb72604	Update bucket.asciidoc	2016-04-22 10:54:25 -06:00
Martijn van Groningen	8e63ce00f0	docs: removed confusing statement.	2016-04-19 11:49:51 +02:00
Sergii Golubev	5ce3eb96b0	tophits-aggregation.asciidoc: fix a typo	2016-04-18 09:23:39 +02:00
Sergii Golubev	434a563fe0	terms-aggregation.asciidoc tiny edit	2016-04-13 16:51:47 -06:00
Sergii Golubev	39b914bd77	histogram-aggregation.asciidoc: tiny edit (#17706 )	2016-04-13 14:19:05 +02:00
Adrien Grand	1d0239c125	Add a warning about the impact of sorting terms aggregations on the accuracy of doc counts.	2016-04-07 16:57:44 +02:00
Dmitrii Izgurskii	272f3eb140	Add missing comma Added missing comma	2016-04-06 15:03:37 -06:00
Adrien Grand	b42f66c8ac	Document 5.0 mapping changes.	2016-03-22 16:22:58 +01:00
Clinton Gormley	0ed0fea558	Updated link to Joda time zones	2016-03-14 12:24:58 +01:00
Christoph Büscher	ff46303f15	Simplify mock scripts	2016-03-07 15:39:35 +01:00
Christoph Büscher	6b0f63e1a6	Adding `time_zone` parameter to daterange-aggregation docs	2016-03-07 15:38:24 +01:00
Clinton Gormley	9674cbbe62	Documented [] syntax for buckets_path Closes #15707	2016-03-01 09:55:01 +01:00
Clinton Gormley	300554841e	Merge pull request #16738 from robertlyson/patch-1 Update to serial differencing aggregation doc	2016-02-28 20:09:14 +01:00
evanfreed	7ed30a9c00	Spelling Corrected spelling.	2016-02-26 13:39:25 -05:00
Robert	7844804874	Update to serial differencing aggregation doc Hi, `thirtieth_difference` should use `the_sum` metric as the `buckets_path`.	2016-02-20 12:13:02 +01:00
Colin Goodheart-Smithe	e546db0753	[DOCS] fix to sampler agg documentation	2016-02-15 13:17:19 +00:00
Colin Goodheart-Smithe	5f489b99bf	fixed docs link error	2016-02-15 12:12:16 +00:00
Colin Goodheart-Smithe	1f760bd1bd	Merge branch 'master' into feature/aggs-refactoring	2016-02-10 12:16:26 +00:00
Dongjoon Hyun	21ea552070	Fix typos in docs.	2016-02-09 02:07:32 -08:00
Colin Goodheart-Smithe	5d9d91b761	Merge branch 'master' into feature/aggs-refactoring	2016-02-03 14:45:16 +00:00
Clinton Gormley	53662b0be9	Merge pull request #16345 from lbrito1/patch-1 Changes "that is" to "for example".	2016-02-02 15:13:29 +01:00
Colin Goodheart-Smithe	3b35754f59	Merge branch 'master' into feature/aggs-refactoring # Conflicts: # core/src/test/java/org/elasticsearch/percolator/PercolateDocumentParserTests.java	2016-01-26 13:17:53 +00:00
Clinton Gormley	7cde0d47bc	Merge pull request #16215 from eemp/patch-1 Update filters-aggregation.asciidoc	2016-01-26 12:56:43 +01:00
Colin Goodheart-Smithe	cd8320b171	Merge branch 'master' into feature/aggs-refactoring # Conflicts: # core/src/main/java/org/elasticsearch/search/aggregations/bucket/filter/FilterAggregator.java # core/src/main/java/org/elasticsearch/search/aggregations/bucket/filters/FiltersAggregator.java # core/src/main/java/org/elasticsearch/search/SearchModule.java	2016-01-25 10:42:20 +00:00
Kevin Adams	768d171f77	Timezone: use forward slash Using a backslash causes errors when querying elasticsearch, but changing the back slash to forward slash on the timezone fixes it. Closes #16148	2016-01-22 14:26:49 +01:00
Colin Goodheart-Smithe	2c33f78192	Merge branch 'master' into feature/aggs-refactoring # Conflicts: # core/src/main/java/org/elasticsearch/search/aggregations/bucket/children/ChildrenParser.java # core/src/main/java/org/elasticsearch/search/aggregations/support/ValuesSourceParser.java # test/framework/src/main/java/org/elasticsearch/test/TestSearchContext.java	2016-01-06 09:35:53 +00:00
Eugene Pirogov	d48af9a155	Fix indent in example Previously it would look like if `warnings` key is nested under `errors`.	2016-01-05 14:41:09 +01:00
omiend	0c878f3bf6	add double quotation	2016-01-04 11:55:24 +09:00
Colin Goodheart-Smithe	1aea0faa86	Aggregations Refactor: Refactor Sampler Aggregation	2015-12-21 09:35:46 +00:00
KangYongKyun	b5d49641fb	colon is added "predict" 10 => "predict" : 10	2015-11-05 11:32:20 +09:00
Nicholas Knize	b31d3ddd3e	Adds geo_centroid metric aggregator This commit adds a new metric aggregator for computing the geo_centroid over a set of geo_point fields. This can be combined with other aggregators (e.g., geohash_grid, significant_terms) for computing the geospatial centroid based on the document sets from other aggregation results.	2015-10-14 16:19:09 -05:00
Clinton Gormley	3e7201ef63	Merge pull request #14096 from speedplane/patch-2 Fixed a typo ("when when")	2015-10-13 21:17:09 +02:00
Clinton Gormley	dc018cf622	Updated docs for 3.0.0-beta	2015-10-07 13:27:46 +02:00
Alex	4077a322c5	Docs: Fix typo - datehistogram date_histogram in place of datehistogram Closes #13886	2015-10-06 19:22:21 +02:00
Taehee Kim	45e0ccd274	Fix typo	2015-09-25 06:42:21 +09:00
Adrien Grand	86f1b07df0	Docs: Remove docs for the `filtered`, `and`, `or` and `(f)query` queries.	2015-09-11 11:00:54 +02:00
Clinton Gormley	8aba6ce93a	Docs: Improved the date histogram docs for time_zone and offset	2015-09-07 19:54:00 +02:00
Zachary Tong	397d5beae1	Aggregations: Add stats_bucket / extended_stats_bucket pipeline aggregations These are the complements to the stats/extended_stats metric aggregations, and can be used to calculate a variety of statistics over buckets	2015-09-04 15:23:48 -04:00
Zachary Tong	c5b39ce85e	[DOCS] Fix broken inter-page link	2015-09-03 23:17:01 -04:00
Zachary Tong	1016734b4c	Aggregations: Add percentiles_bucket pipeline aggregations This pipeline will calculate percentiles over a set of sibling buckets. This is an exact implementation, meaning it needs to cache a copy of the series in memory and sort it to determine the percentiles. This comes with a few limitations: to prevent serializing data around, only the requested percentiles are calculated (unlike the TDigest version, which allows the java API to ask for any percentile). It also needs to store the data in-memory, resulting in some overhead if the requested series is very large.	2015-09-03 22:24:14 -04:00
Lee Hinman	118eab5462	Merge pull request #13257 from elastic/docsfix Fixed non-valid JSON (though ES would accept it)	2015-09-02 07:51:13 -06:00
Colin Goodheart-Smithe	1d9905a798	[DOCS] Added note about valid return types for scripts in the scripted_metric aggregation	2015-09-02 12:13:15 +01:00
Shane Connelly	5e385d5bf2	Fixed non-valid JSON (though ES would accept it)	2015-09-01 13:17:07 -07:00
Clinton Gormley	aa52c4f712	Docs: Fixed variations of spelling of buckets_path Closes #13201	2015-08-31 13:47:40 +02:00
Colin Goodheart-Smithe	9112217869	Merge pull request #13024 from iantruslove/patch-1 [DOCS] Couple of typos - various misspellings of `buckets-path`	2015-08-24 15:37:05 +02:00
Murilo Pereira	a960b3cac4	Here too.	2015-08-20 18:07:51 -03:00
Murilo Pereira	13f961a3d3	s/bucket_paths/buckets_path/ Using "bucket_paths" makes the server return a 400 with "Unknown key for a VALUE_STRING in [aggregation-name]: [buckets_paths]."	2015-08-20 18:05:02 -03:00
Ian Truslove	ae0a74eb1c	Couple of typos - various misspellings of `buckets-path`	2015-08-20 14:57:09 -06:00
Adrien Grand	a91b3fcbb9	Move the `murmur3` field to a plugin and fix defaults. This move the `murmur3` field to the `mapper-murmur3` plugin and fixes its defaults so that values will not be indexed by default, as the only purpose of this field is to speed up `cardinality` aggregations on high-cardinality string fields, which only requires doc values. I also removed the `rehash` option from the `cardinality` aggregation as it doesn't bring much value (rehashing is cheap) and allowed to remove the coupling between the `cardinality` aggregation and the `murmur3` field. Close #12874	2015-08-18 11:41:52 +02:00
Clinton Gormley	c6c3a40cb6	Docs: Updated annotations for 2.0.0-beta1	2015-08-14 10:51:09 +02:00
Asimov4	60f3ea0131	Fixing typo	2015-08-08 14:14:59 -07:00
Sylvain Zimmer	c2f774ac57	Warning in the docs for negative histogram values As requested in https://github.com/elastic/elasticsearch/issues/8082#issuecomment-127962374	2015-08-07 13:10:03 +02:00
Clinton Gormley	ac2b8951c6	Docs: Mapping docs completely rewritten for 2.0	2015-08-06 17:24:51 +02:00
Sylvain Zimmer	12a2db5417	Fix typo in docs	2015-07-31 19:11:04 -04:00
Colin Goodheart-Smithe	3e0532a0c5	Aggregations: Add HDRHistogram as an option in percentiles and percentile_ranks aggregations HDRHistogram has been added as an option in the percentiles and percentile_ranks aggregation. It has one option `number_significant_digits` which controls the accuracy and memory size for the algorithm Closes #8324	2015-07-24 17:55:36 +01:00
Ryan Ernst	dba42a83e2	Docs: Update time_zone specification closes #12317	2015-07-21 00:22:53 -07:00
Zachary Tong	8790989a47	[DOCS] Fix link to serial_diff docs	2015-07-10 19:01:18 -04:00
Zachary Tong	bb9c160855	Merge pull request #11196 from polyfractal/feature/aggs_2_0_diff Aggregations: add serial differencing pipeline aggregation	2015-07-10 18:26:19 -04:00
Zachary Tong	e3f9d561e4	Aggregations: add serial differencing pipeline aggregation	2015-07-10 18:22:01 -04:00
Zachary Tong	0f76e656dd	Aggregations: add cost minimizer to moving_avg aggregation	2015-07-08 16:20:34 -04:00
Zachary Tong	c898dd252b	[DOCS] Update section about gap_policy	2015-07-07 15:40:15 -04:00
Colin Goodheart-Smithe	1d7fc6b4f2	Aggregations: Pipeline Aggregation to filter buckets based on a script This pipeline aggregation runs a script on each bucket in the parent aggregation to determine whether the bucket is kept in the final aggregation tree. If the script returns true the bucket is retained, if it returns false the bucket is dropped	2015-07-07 09:51:16 +01:00
Colin Goodheart-Smithe	e366d0380d	Aggregations: Adds other bucket to filters aggregation The filters aggregation now has an option to add an 'other' bucket which will, when turned on, contain all documents which do not match any of the defined filters. There is also an option to change the name of the 'other' bucket from the default of '_other_' Closes #11289	2015-07-01 10:44:04 +01:00
William Li	2be3fe31a4	Docs: Update filter-aggregation.asciidoc Closes #11782	2015-07-01 10:17:45 +02:00
Colin Goodheart-Smithe	62cbeecadf	[DOCS] marked pipeline aggregator documentation as Experimental	2015-06-30 10:30:50 +01:00
Adrien Grand	38f5cc236a	Rename caches. In order to be more consistent with what they do, the query cache has been renamed to request cache and the filter cache has been renamed to query cache. A known issue is that package/logger names do no longer match settings names, please speak up if you think this is an issue. Here are the settings for which I kept backward compatibility. Note that they are a bit different from what was discussed on #11569 but putting `cache` before the name of what is cached has the benefit of making these settings consistent with the fielddata cache whose size is configured by `indices.fielddata.cache.size`: * index.cache.query.enable -> index.requests.cache.enable * indices.cache.query.size -> indices.requests.cache.size * indices.cache.filter.size -> indices.queries.cache.size Close #11569	2015-06-29 10:15:27 +02:00
Christoph Büscher	f5f73259e4	Docs: Update Joda URLs in documentation.	2015-06-26 10:23:02 +02:00
Colin Goodheart-Smithe	f21924ae0d	Aggregations: Adds cumulative sum aggregation This adds a new pipeline aggregation, the cumulative sum aggregation. This is a parent aggregation which must be specified as a sub-aggregation to a histogram or date_histogram aggregation. It will add a new aggregation to each bucket containing the sum of a specified metrics over this and all previous buckets.	2015-06-25 14:27:57 +01:00
Clinton Gormley	37eae789a0	Merge pull request #11801 from golubev/patch-6 fix json syntax in filters-aggregation.asciidoc	2015-06-23 20:02:04 +02:00
Colin Goodheart-Smithe	f26311e88b	Aggregations: Rename `series_arithmetic` agg to `bucket_script`	2015-06-23 14:00:17 +01:00
Clinton Gormley	f123a53d72	Docs: Refactored modules and index modules sections	2015-06-22 23:49:45 +02:00
caldwecr	1ac728d22b	Docs: Update filter-aggregation.asciidoc Replace the previous example which leveraged a range filter, which causes unnecessary confusion about when to use a range filter to create a single bucket or a range aggregation with exactly one member in ranges. Closes #11704	2015-06-19 12:24:42 +02:00
Clinton Gormley	64ec18afa0	Merge pull request #11661 from pjcard/patch-1 Make explicit the requirement for intervals to be integers Conflicts: docs/reference/search/aggregations/bucket/histogram-aggregation.asciidoc	2015-06-15 11:42:12 +02:00
Colin Goodheart-Smithe	a216062d88	Aggregations: allow users to perform simple arithmetic operations on histogram aggregations Closes #11029	2015-06-12 09:25:52 +01:00
Colin Goodheart-Smithe	35a58d874e	Scripting: Unify script and template requests across codebase This change unifies the way scripts and templates are specified for all instances in the codebase. It builds on the Script class added previously and adds request building and parsing support as well as the ability to transfer script objects between nodes. It also adds a Template class which aims to provide the same functionality for template APIs Closes #11091	2015-05-29 16:52:04 +01:00
Zachary Tong	d32a80f37b	Docs: Fix misplaced images in moving_avg docs	2015-05-27 16:13:36 -04:00
Zachary Tong	491afbe01c	Aggregations: Add Holt-Winters model to `moving_avg` pipeline aggregation Closes #11043	2015-05-27 14:45:45 -04:00
Colin Goodheart-Smithe	35deb7efea	Aggregations: Renaming reducers to Pipeline Aggregators	2015-05-21 14:57:23 +01:00
Adrien Grand	32e23b9100	Aggs: Make it possible to configure missing values. Most aggregations (terms, histogram, stats, percentiles, geohash-grid) now support a new `missing` option which defines the value to consider when a field does not have a value. This can be handy if you eg. want a terms aggregation to handle the same way documents that have "N/A" or no value for a `tag` field. This works in a very similar way to the `missing` option on the `sort` element. One known issue is that this option sometimes cannot make the right decision in the unmapped case: it needs to replace all values with the `missing` value but might not know what kind of values source should be produced (numerics, strings, geo points?). For this reason, we might want to add an `unmapped_type` option in the future like we did for sorting. Related to #5324	2015-05-15 16:26:58 +02:00
Adrien Grand	a0af88e996	Query DSL: Remove filter parsers. This commit makes queries and filters parsed the same way using the QueryParser abstraction. This allowed to remove duplicate code that we had for similar queries/filters such as `range`, `prefix` or `term`.	2015-05-07 20:14:34 +02:00
Colin Goodheart-Smithe	cf1251796f	Aggregations: Adding Sum Bucket Aggregation Closes #11007	2015-05-06 14:44:56 +01:00
Zachary Tong	e70a8d4ee9	Merge pull request #10964 from polyfractal/feature/aggs_movavg_rename Rename Moving Average models to their "common" names	2015-05-06 09:07:23 -04:00
Zachary Tong	3eb9cb913d	Rename Moving Average models to their "common" names Previously, we were using the "statistical", technically accurate name. Instead, we should probably use the name that people are familiar with, e.g. "Holt Winters" instead of "triple exponential". To that end: - `single_exp` becomes `ewma` (exponentially weighted moving average) - `double_exp` becomes `holt` When the `triple_exp` is added, it will be called `holt_winters`.	2015-05-06 09:04:44 -04:00
Colin Goodheart-Smithe	72d99773dc	Aggregations: Adding Average Bucket Aggregation Also includes changes to the other bucket metric aggregations to share code Closes #11006	2015-05-06 13:53:57 +01:00

... 2 3 4 5 6 ...

354 Commits