druid/docs/content/development/extensions-core/datasketches-extension.md
Gian Merlino 0851f2206c
Expanded documentation for DataSketches aggregators. (#5513)
Originally written by @AlexanderSaydakov in druid-io/druid-io.github.io#448.
I also added redirects and updated links to point to the new
datasketches-extension.html landing page for the extension, rather than to
the old page about theta sketches.
2018-03-21 18:19:27 -07:00

858 B

layout
doc_page

DataSketches extension

Druid aggregators based on datasketches library. Sketches are data structures implementing approximate streaming mergeable algorithms. Sketches can be ingested from the outside of Druid or built from raw data at ingestion time. Sketches can be stored in Druid segments as additive metrics.

To use the datasketch aggregators, make sure you include the extension in your config file:

druid.extensions.loadList=["druid-datasketches"]

The following aggregators are available:

  1. Theta sketch, useful for approximate set counting, and supporting union, intersection, and difference operations.
  2. Quantiles sketch.
  3. Tuple sketch.