[[search-aggregations-pipeline-bucket-script-aggregation]]
=== Bucket Script Aggregation

A parent pipeline aggregation that executes a script to perform per-bucket computations on specified metrics
in the parent multi-bucket aggregation. The specified metrics must be numeric, and the script must return a numeric value.

[[bucket-script-agg-syntax]]
==== Syntax

A `bucket_script` aggregation looks like this in isolation:

[source,js]
--------------------------------------------------
{
    "bucket_script": {
        "buckets_path": {
            "my_var1": "the_sum", <1>
            "my_var2": "the_value_count"
        },
        "script": "params.my_var1 / params.my_var2"
    }
}
--------------------------------------------------
// NOTCONSOLE
<1> Here, `my_var1` is the name of the script variable bound to this buckets path, and `the_sum` is the path to
the metrics to use for that variable.
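
As a rough mental model of the binding described above, the following Python sketch resolves each entry of `buckets_path` to a metric value and passes the result to the script as `params`. It is an illustration only, not how Elasticsearch implements the aggregation: the helper names `resolve_path` and `apply_bucket_script`, the dict layout of the bucket, and the sample values are assumptions, and only simple `agg>sub_agg` paths are handled.

```python
# Illustrative model only: the real script is evaluated by Elasticsearch,
# not by this code. All names and values here are hypothetical.

def resolve_path(bucket, path):
    """Follow a simple `agg>sub_agg` path and return the metric's `value`."""
    node = bucket
    for name in path.split(">"):
        node = node[name]
    return node["value"]


def apply_bucket_script(bucket, buckets_path, script):
    """Bind each variable in `buckets_path` to its metric, then run `script`.

    `script` is a Python callable standing in for the script source; it
    receives the bound variables as a `params` dict.
    """
    params = {var: resolve_path(bucket, path) for var, path in buckets_path.items()}
    return script(params)


# One bucket of the parent aggregation with two sibling metrics (made-up values).
bucket = {"the_sum": {"value": 90.0}, "the_value_count": {"value": 3}}

average = apply_bucket_script(
    bucket,
    {"my_var1": "the_sum", "my_var2": "the_value_count"},
    lambda params: params["my_var1"] / params["my_var2"],
)
print(average)  # 30.0
```

The lambda plays the role of `params.my_var1 / params.my_var2`: the script only ever sees the variable names, never the aggregation names they were bound from.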

[[bucket-script-params]]
.`bucket_script` Parameters
[options="header"]
|===
|Parameter Name |Description |Required |Default Value
|`script` |The script to run for this aggregation. The script can be inline, file, or indexed (see <<modules-scripting>>
for more details) |Required |
|`buckets_path` |A map of script variables and the paths to the bucket metrics to bind to each variable
(see <<buckets-path-syntax>> for more details) |Required |
|`gap_policy` |The policy to apply when gaps are found in the data (see <<gap-policy>> for more
details) |Optional |`skip`
|`format` |Format to apply to the output value of this aggregation |Optional |`null`
|===
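
The effect of `gap_policy` on a single bucket can be pictured with the Python sketch below. It assumes the behavior described in <<gap-policy>>: with `skip`, a bucket whose input metric is missing gets no script result, while `insert_zeros` substitutes zero for the missing value. The function name `run_with_gap_policy` and the use of `None` to mark a missing metric are assumptions made for illustration, not part of the Elasticsearch API.

```python
# Hypothetical model of gap handling for one bucket; not Elasticsearch code.

def run_with_gap_policy(params, script, gap_policy="skip"):
    """Return the script result, or None when the bucket is skipped."""
    if gap_policy not in ("skip", "insert_zeros"):
        raise ValueError(f"unsupported gap_policy in this sketch: {gap_policy}")
    bound = {}
    for name, value in params.items():
        if value is None:  # a gap: the metric has no value in this bucket
            if gap_policy == "skip":
                return None  # no script result is produced for this bucket
            value = 0.0  # insert_zeros: treat the gap as zero
        bound[name] = value
    return script(bound)


def ratio(p):
    # Stand-in for a script such as: params.tShirtSales / params.totalSales * 100
    return p["tShirtSales"] / p["totalSales"] * 100


gap = {"tShirtSales": None, "totalSales": 60.0}

skipped = run_with_gap_policy(gap, ratio)                  # None
zeroed = run_with_gap_policy(gap, ratio, "insert_zeros")   # 0.0
```

Note that `insert_zeros` can turn a gap in a divisor into a division by zero, so the choice of policy depends on what the script does with each variable.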

The following snippet calculates t-shirt sales as a percentage of total sales for each month:

[source,console]
--------------------------------------------------
POST /sales/_search
{
    "size": 0,
    "aggs": {
        "sales_per_month": {
            "date_histogram": {
                "field": "date",
                "calendar_interval": "month"
            },
            "aggs": {
                "total_sales": {
                    "sum": {
                        "field": "price"
                    }
                },
                "t-shirts": {
                    "filter": {
                        "term": {
                            "type": "t-shirt"
                        }
                    },
                    "aggs": {
                        "sales": {
                            "sum": {
                                "field": "price"
                            }
                        }
                    }
                },
                "t-shirt-percentage": {
                    "bucket_script": {
                        "buckets_path": {
                            "tShirtSales": "t-shirts>sales",
                            "totalSales": "total_sales"
                        },
                        "script": "params.tShirtSales / params.totalSales * 100"
                    }
                }
            }
        }
    }
}
--------------------------------------------------
// TEST[setup:sales]

The response might look like this:

[source,console-result]
--------------------------------------------------
{
    "took": 11,
    "timed_out": false,
    "_shards": ...,
    "hits": ...,
    "aggregations": {
        "sales_per_month": {
            "buckets": [
                {
                    "key_as_string": "2015/01/01 00:00:00",
                    "key": 1420070400000,
                    "doc_count": 3,
                    "total_sales": {
                        "value": 550.0
                    },
                    "t-shirts": {
                        "doc_count": 1,
                        "sales": {
                            "value": 200.0
                        }
                    },
                    "t-shirt-percentage": {
                        "value": 36.36363636363637
                    }
                },
                {
                    "key_as_string": "2015/02/01 00:00:00",
                    "key": 1422748800000,
                    "doc_count": 2,
                    "total_sales": {
                        "value": 60.0
                    },
                    "t-shirts": {
                        "doc_count": 1,
                        "sales": {
                            "value": 10.0
                        }
                    },
                    "t-shirt-percentage": {
                        "value": 16.666666666666664
                    }
                },
                {
                    "key_as_string": "2015/03/01 00:00:00",
                    "key": 1425168000000,
                    "doc_count": 2,
                    "total_sales": {
                        "value": 375.0
                    },
                    "t-shirts": {
                        "doc_count": 1,
                        "sales": {
                            "value": 175.0
                        }
                    },
                    "t-shirt-percentage": {
                        "value": 46.666666666666664
                    }
                }
            ]
        }
    }
}
--------------------------------------------------
// TESTRESPONSE[s/"took": 11/"took": $body.took/]
// TESTRESPONSE[s/"_shards": \.\.\./"_shards": $body._shards/]
// TESTRESPONSE[s/"hits": \.\.\./"hits": $body.hits/]
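
The `t-shirt-percentage` values in the response can be checked by hand. The Python sketch below repeats the script's arithmetic on the `sales` and `total_sales` values copied from the three buckets above; the function name `t_shirt_percentage` and the tuple layout are assumptions made for illustration.

```python
import math

# (tShirtSales, totalSales, reported t-shirt-percentage) for each monthly
# bucket, copied from the response above.
buckets = [
    (200.0, 550.0, 36.36363636363637),
    (10.0, 60.0, 16.666666666666664),
    (175.0, 375.0, 46.666666666666664),
]


def t_shirt_percentage(t_shirt_sales, total_sales):
    # Same arithmetic as the script: params.tShirtSales / params.totalSales * 100
    return t_shirt_sales / total_sales * 100


for t_shirt_sales, total_sales, reported in buckets:
    computed = t_shirt_percentage(t_shirt_sales, total_sales)
    # Compare with a tolerance rather than exact equality of doubles.
    assert math.isclose(computed, reported, rel_tol=1e-12)
    print(f"{computed:.2f}%")  # prints 36.36%, 16.67%, 46.67% in turn
```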