druid/extensions-core/datasketches
Gian Merlino 274ccbfd85
Reset buffer aggregators when resetting Groupers. (#16296)
Buffer aggregators can contain some cached objects within them, such as
Memory references or HLL Unions. Prior to this patch, various Grouper
implementations were not releasing this state when resetting their own
internal state, which could lead to excessive memory use.

This patch renames AggregatorAdapater#close to "reset", and updates
Grouper implementations to call this reset method whenever they reset
their internal state.

The base method on BufferAggregator and VectorAggregator remains named
"close", for compatibility with existing extensions, but the contract
is adjusted to say that the aggregator may be reused after the method
is called. All existing implementations in core already adhere to this
new contract, except for the ArrayOfDoubles build flavors, which are
updated in this patch to adhere.

Additionally, this patch harmonizes buffer sketch helpers to call their
clear method "clear" rather than a mix of "clear" and "close". (Others
were already using "clear".)
2024-04-24 05:39:24 -04:00
..
src Reset buffer aggregators when resetting Groupers. (#16296) 2024-04-24 05:39:24 -04:00
README.md update links datasketches.github.io to datasketches.apache.org (#10107) 2020-07-01 14:56:17 -07:00
pom.xml Update Calcite*Test to use junit5 (#16106) 2024-03-19 04:05:12 -07:00

README.md

This module provides Druid aggregators based on https://datasketches.apache.org/.

Credits: This module is a result of feedback and work done by following people.

https://github.com/cheddar https://github.com/himanshug https://github.com/leerho https://github.com/will-lauer