druid/docs
Gian Merlino 875e0696e0
GroupBy: Cap dictionary-building selector memory usage. (#12309)
* GroupBy: Cap dictionary-building selector memory usage.

New context parameter "maxSelectorDictionarySize" controls when the
per-segment processing code should return early and trigger a trip
to the merge buffer.

Includes:

- Vectorized and nonvectorized implementations.
- Adjustments to GroupByQueryRunnerTest to exercise this code in
  the v2SmallDictionary suite. (Both the selector dictionary and
  the merging dictionary will be small in that suite.)
- Tests for the new config parameter.

* Fix issues from tests.

* Add "pre-existing" to dictionary.

* Simplify GroupByColumnSelectorStrategy interface by removing one of the writeToKeyBuffer methods.

* Adjustments from review comments.
2022-03-08 13:13:11 -08:00
..
_bin De-incubation cleanup in code, docs, packaging (#9108) 2020-01-03 12:33:19 -05:00
assets Design/architecture doc touchups (#11762) 2021-10-04 11:09:35 -07:00
comparisons Update druid-vs-kudu.md (#11470) 2021-07-21 22:58:14 +08:00
configuration GroupBy: Cap dictionary-building selector memory usage. (#12309) 2022-03-08 13:13:11 -08:00
dependencies Doc updates for metadata cleanup and storage (#12190) 2022-01-27 11:40:54 -08:00
design Allow coordinator run auto compaction duty period to be configured separately from other indexing duties (#12263) 2022-02-18 23:02:57 -08:00
development Adding Shared Access resource support for azure (#12266) 2022-02-22 18:27:43 +05:30
ingestion correct errors on compaction doc (#12308) 2022-03-04 15:33:35 -08:00
misc Refactor SQL docs (#12239) 2022-02-11 14:43:30 -08:00
operations add a new query laning metrics to visualize lane assignment (#12111) 2022-03-04 15:21:17 +08:00
querying GroupBy: Cap dictionary-building selector memory usage. (#12309) 2022-03-08 13:13:11 -08:00
tutorials Officially support Java 11. (#12232) 2022-03-04 14:15:45 -08:00