OpenSearch

Commit Graph

Author	SHA1	Message	Date
David Kyle	b87cef6fe7	Include the ml inference aggregation doc (#59219) (#59226) Add to the list of pipeline aggregations	2020-07-08 14:35:08 +01:00
James Rodewig	e492c23944	[DOCS] Sort metric and pipeline agg docs (#56613) (#56846) Co-authored-by: Gil Raphaelli <gil@elastic.co>	2020-05-15 17:15:53 -04:00
Tal Levy	5e90ff32f7	Add Normalize Pipeline Aggregation (#56399) (#56792) This aggregation will perform normalizations of metrics for a given series of data in the form of bucket values. The aggregation supports the following normalizations: - rescale 0-1 - rescale 0-100 - percentage of sum - mean normalization - z-score normalization - softmax normalization The normalization to use is specified in the normalize agg's `normalizer` field. For example: ``` { "normalize": { "buckets_path": <>, "normalizer": "percent" } } ```	2020-05-14 17:40:15 -07:00
Ignacio Vera	222ee721ec	Add moving percentiles pipeline aggregation (#55441) (#56575) Similar to what the moving function aggregation does, except merging windows of percentiles sketches together instead of cumulatively merging final metrics	2020-05-12 11:35:23 +02:00
James Rodewig	1f36c4e50c	[DOCS] Replace "// CONSOLE" comments with [source,console] (#46159) (#46332)	2019-09-05 10:11:25 -04:00
Zachary Tong	85e2e41de7	Add CumulativeCard pipeline agg to pipeline index (#46279) The Cumulative Cardinality docs weren't linked from the pipeline index page	2019-09-03 12:11:04 -04:00
Zachary Tong	3df1c76f9b	Allow pipeline aggs to select specific buckets from multi-bucket aggs (#44179) This adjusts the `buckets_path` parser so that pipeline aggs can select specific buckets (via their bucket keys) instead of fetching the entire set of buckets. This is useful for bucket_script in particular, which might want specific buckets for calculations. It's possible to work around this with `filter` aggs, but the workaround is hacky and probably less performant. - Adjusts documentation - Adds a barebones AggregatorTestCase for bucket_script - Tweaks AggTestCase to use getMockScriptService() for reductions and pipelines. Previously pipelines could just pass in a script service for testing, but this didn't work for regular aggs. The new getMockScriptService() method fixes that issue, but needs to be used for pipelines too. This had a knock-on effect of touching MovFn, AvgBucket and ScriptedMetric	2019-08-05 12:18:40 -04:00
Zachary Tong	6ae6f57d39	[7.x Backport] Force selection of calendar or fixed intervals (#41906) The date_histogram accepts an interval which can be either a calendar interval (DST-aware, leap seconds, arbitrary length of months, etc) or fixed interval (strict multiples of SI units). Unfortunately this is inferred by first trying to parse as a calendar interval, then falling back to fixed if that fails. This leads to a confusing arrangement where `1d` == calendar, but `2d` == fixed. And if you want a day of fixed time, you have to specify `24h` (i.e. the next smallest unit). This arrangement is very error-prone for users. This PR adds `calendar_interval` and `fixed_interval` parameters to any code that uses intervals (date_histogram, rollup, composite, datafeed, etc). Calendar only accepts calendar intervals, fixed accepts any combination of units (meaning `1d` can be used to specify `24h` in fixed time), and both are mutually exclusive. The old interval behavior is deprecated and will throw a deprecation warning. It is also mutually exclusive with the two new parameters. In the future the old dual-purpose interval will be removed. The change applies to both REST and Java clients.	2019-05-20 12:07:29 -04:00
Zachary Tong	df853c49c0	Add a MovingFunction pipeline aggregation, deprecate MovingAvg agg (#29594) This pipeline aggregation gives the user the ability to script functions that "move" across a window of data, instead of single data points. It is the scripted version of MovingAvg pipeline agg. Through custom script contexts, we expose a number of convenience methods: - MovingFunctions.max() - MovingFunctions.min() - MovingFunctions.sum() - MovingFunctions.unweightedAvg() - MovingFunctions.linearWeightedAvg() - MovingFunctions.ewma() - MovingFunctions.holt() - MovingFunctions.holtWinters() - MovingFunctions.stdDev() The user can also define any arbitrary logic via their own scripting, or combine with the above methods.	2018-05-16 10:57:00 -04:00
D Pinto	8d6a368402	[Docs] Correct typo in pipeline.asciidoc (#29431)	2018-04-10 10:42:07 +02:00
Dimitris Athanasiou	66bef26495	Aggregations: bucket_sort pipeline aggregation (#27152) This commit adds a parent pipeline aggregation that allows sorting the buckets of a parent multi-bucket aggregation. The aggregation also offers [from] and [size] parameters in order to truncate the result as desired. Closes #14928	2017-11-09 17:59:57 +00:00
Clinton Gormley	ff4a2519f2	Update experimental labels in the docs (#25727 ) Relates https://github.com/elastic/elasticsearch/issues/19798 Removed experimental label from: * Painless * Diversified Sampler Agg * Sampler Agg * Significant Terms Agg * Terms Agg document count error and execution_hint * Cardinality Agg precision_threshold * Pipeline Aggregations * index.shard.check_on_startup * index.store.type (added warning) * Preloading data into the file system cache * foreach ingest processor * Field caps API * Profile API Added experimental label to: * Moving Average Agg Prediction Changed experimental to beta for: * Adjacency matrix agg * Normalizers * Tasks API * Index sorting Labelled experimental in Lucene: * ICU plugin custom rules file * Flatten graph token filter * Synonym graph token filter * Word delimiter graph token filter * Simple pattern tokenizer * Simple pattern split tokenizer Replaced experimental label with warning that details may change in the future: * Analysis explain output format * Segments verbose output format * Percentile Agg compression and HDR Histogram * Percentile Rank Agg HDR Histogram	2017-07-18 14:06:22 +02:00
Ryan Ernst	a03b6c2fa5	Scripting: Change keys for inline/stored scripts to source/id (#25127) This commit adds back "id" as the key within a script to specify a stored script (which with file scripts now gone is no longer ambiguous). It also adds "source" as a replacement for "code". This is in an attempt to normalize how scripts are specified across both put stored scripts and script usages, including search template requests. This also deprecates the old inline/stored keys.	2017-06-09 08:29:25 -07:00
Zachary Tong	130f1a56f1	Re-enable doc testing for Pipeline Aggregations (#24374) * Re-enable doc testing for Pipeline Aggregations Also adds a response + test for movavg pipeline	2017-05-01 13:30:51 -04:00
Nik Everett	5cff2a046d	Remove most of the need for `// NOTCONSOLE` and be much more stingy about what we consider a console candidate. * Add `// CONSOLE` to check-running * Fix version in some snippets * Mark groovy snippets as groovy * Fix versions in plugins * Fix language marker errors * Fix language parsing in snippets This adds support for snippets whose language is written like `[source, txt]` and `["source","js",subs="attributes,callouts"]`. This also makes language required for snippets which is nice because then we can be sure we can grep for snippets in a particular language.	2016-09-06 10:32:54 -04:00
Jack Conradson	131e370a16	Make Painless the default scripting language. Closes #20017	2016-08-22 17:38:02 -07:00
Nik Everett	c66db9a81e	Add `// CONSOLE` to much of pipeline agg docs Most of the examples in the pipeline aggregation docs use a small "sales" test data set and I converted all of the examples that use it to `// CONSOLE`. There are still a bunch of snippets in the pipeline aggregation docs that aren't `// CONSOLE` so they aren't tested. Most of them are "this is the most basic form of this aggregation" so they are more immune to errors and bit rot than the examples that I converted. I'd like to do something with them as well but I'm not sure what. Also, the moving average docs and serial diff docs didn't get a lot of love from this pass because they don't use the test data set or follow the same general layout. Relates to #18160	2016-08-17 09:26:41 -04:00
Colin Goodheart-Smithe	7ed64af639	[DOCS] fix callout in buckets path docs	2016-07-26 11:33:54 +01:00
Colin Goodheart-Smithe	2c12c3e628	Add _bucket_count option to buckets_path This change adds a new special path to the buckets_path syntax `_bucket_count`. This new option will return the number of buckets for a multi-bucket aggregation, which can then be used in pipeline aggregations. Closes #19553	2016-07-26 09:28:21 +01:00
Clinton Gormley	9674cbbe62	Documented [] syntax for buckets_path Closes #15707	2016-03-01 09:55:01 +01:00
Clinton Gormley	53662b0be9	Merge pull request #16345 from lbrito1/patch-1 Changes "that is" to "for example".	2016-02-02 15:13:29 +01:00
Clinton Gormley	dc018cf622	Updated docs for 3.0.0-beta	2015-10-07 13:27:46 +02:00
Zachary Tong	397d5beae1	Aggregations: Add stats_bucket / extended_stats_bucket pipeline aggregations These are the complements to the stats/extended_stats metric aggregations, and can be used to calculate a variety of statistics over buckets	2015-09-04 15:23:48 -04:00
Zachary Tong	1016734b4c	Aggregations: Add percentiles_bucket pipeline aggregations This pipeline will calculate percentiles over a set of sibling buckets. This is an exact implementation, meaning it needs to cache a copy of the series in memory and sort it to determine the percentiles. This comes with a few limitations: to prevent serializing data around, only the requested percentiles are calculated (unlike the TDigest version, which allows the java API to ask for any percentile). It also needs to store the data in-memory, resulting in some overhead if the requested series is very large.	2015-09-03 22:24:14 -04:00
Clinton Gormley	aa52c4f712	Docs: Fixed variations of spelling of buckets_path Closes #13201	2015-08-31 13:47:40 +02:00
Clinton Gormley	c6c3a40cb6	Docs: Updated annotations for 2.0.0-beta1	2015-08-14 10:51:09 +02:00
Asimov4	60f3ea0131	Fixing typo	2015-08-08 14:14:59 -07:00
Zachary Tong	8790989a47	[DOCS] Fix link to serial_diff docs	2015-07-10 19:01:18 -04:00
Zachary Tong	bb9c160855	Merge pull request #11196 from polyfractal/feature/aggs_2_0_diff Aggregations: add serial differencing pipeline aggregation	2015-07-10 18:26:19 -04:00
Zachary Tong	e3f9d561e4	Aggregations: add serial differencing pipeline aggregation	2015-07-10 18:22:01 -04:00
Zachary Tong	c898dd252b	[DOCS] Update section about gap_policy	2015-07-07 15:40:15 -04:00
Colin Goodheart-Smithe	1d7fc6b4f2	Aggregations: Pipeline Aggregation to filter buckets based on a script This pipeline aggregation runs a script on each bucket in the parent aggregation to determine whether the bucket is kept in the final aggregation tree. If the script returns true the bucket is retained, if it returns false the bucket is dropped	2015-07-07 09:51:16 +01:00
Colin Goodheart-Smithe	f21924ae0d	Aggregations: Adds cumulative sum aggregation This adds a new pipeline aggregation, the cumulative sum aggregation. This is a parent aggregation which must be specified as a sub-aggregation to a histogram or date_histogram aggregation. It will add a new aggregation to each bucket containing the sum of a specified metric over this and all previous buckets.	2015-06-25 14:27:57 +01:00
Colin Goodheart-Smithe	f26311e88b	Aggregations: Rename `series_arithmetic` agg to `bucket_script`	2015-06-23 14:00:17 +01:00
Colin Goodheart-Smithe	a216062d88	Aggregations: allow users to perform simple arithmetic operations on histogram aggregations Closes #11029	2015-06-12 09:25:52 +01:00
Colin Goodheart-Smithe	35deb7efea	Aggregations: Renaming reducers to Pipeline Aggregators	2015-05-21 14:57:23 +01:00

36 Commits