[DOCS] Percentile aggs are non-deterministic (#50468)

Percentile aggregations are non-deterministic. A percentile aggregation
can produce different results even when using the same data.

According to [this Discuss post][0], the non-determinism stems from
processes in Lucene that can change the order in which documents are
provided to the aggregation.
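
To illustrate why document order can matter, here is a minimal sketch of
an order-dependent summary. It is a toy, not the t-digest implementation
Elasticsearch uses: `MAX_CENTROIDS`, `add_value`, and `estimate_median`
are hypothetical names, and the capacity of 2 is deliberately tiny so the
effect is exaggerated. Exact fractions are used so that any difference
comes from merge order, not from floating-point rounding.

```python
from fractions import Fraction

MAX_CENTROIDS = 2  # hypothetical capacity, tiny on purpose


def add_value(centroids, value):
    """Insert a value, then merge the closest adjacent pair if over capacity."""
    centroids.append((Fraction(value), 1))  # (mean, weight)
    centroids.sort()
    if len(centroids) > MAX_CENTROIDS:
        # Index of the adjacent pair with the smallest gap (leftmost on ties).
        i = min(range(len(centroids) - 1),
                key=lambda k: centroids[k + 1][0] - centroids[k][0])
        (m1, w1), (m2, w2) = centroids[i], centroids[i + 1]
        merged = ((m1 * w1 + m2 * w2) / (w1 + w2), w1 + w2)
        centroids[i:i + 2] = [merged]


def estimate_median(values):
    """Feed values in the given order and return the mean of the centroid
    that covers the middle of the total weight."""
    centroids = []
    for v in values:
        add_value(centroids, v)
    total = sum(w for _, w in centroids)
    seen = 0
    for mean, weight in centroids:
        seen += weight
        if 2 * seen >= total:
            return mean


# Same four values, two arrival orders.
print(estimate_median([0, 4, 5, 9]))  # 3
print(estimate_median([0, 9, 5, 4]))  # 6
```

With a realistic number of centroids the summaries would diverge far
less, which is consistent with the "slightly different results" wording
in the warning.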

This commit adds a warning stating that the aggregation is
non-deterministic and explaining what that means.

[0]: https://discuss.elastic.co/t/different-results-for-same-query/111757
James Rodewig 2019-12-23 13:11:31 -05:00
parent 01293ebad5
commit 694b119f0a
1 changed file with 7 additions and 0 deletions

@@ -251,6 +251,13 @@ for large number of values is that the law of large numbers makes the distributi
values more and more uniform and the t-digest tree can do a better job at summarizing
it. It would not be the case on more skewed distributions.
[WARNING]
====
Percentile aggregations are also
https://en.wikipedia.org/wiki/Nondeterministic_algorithm[non-deterministic].
This means you can get slightly different results using the same data.
====
[[search-aggregations-metrics-percentile-aggregation-compression]]
==== Compression