druid/docs/querying
Gian Merlino 97207cdcc7
Automatic sizing for GroupBy dictionaries. (#12763)
* Automatic sizing for GroupBy dictionary sizes.

Merging and selector dictionary sizes currently both default to 100MB.
This is not optimal, because it can lead to OOM on small servers and
insufficient resource utilization on larger servers. It also invites
end users to try to tune it when queries run out of dictionary space,
which can make things worse if the end user sets it to too high.

So, this patch:

- Adds automatic tuning for selector and merge dictionaries. Selectors
  use up to 15% of the heap and merge buffers use up to 30% of the heap
  (aggregate across all queries).

- Updates out-of-memory error messages to emphasize enabling disk
  spilling vs. increasing memory parameters. With the memory parameters
  automatically sized, it is more likely that an end user will get
  benefit from enabling disk spilling.

- Removes the query context parameters that allow lowering of configured
  dictionary sizes. These complicate the calculation, and I don't see a
  reasonable use case for them.

* Adjust tests.

* Review adjustments.

* Additional comment.

* Remove unused import.
2022-07-11 08:20:50 -07:00
..
aggregations.md stringFirst and stringLast supported in ingestion (#12466) 2022-04-22 10:28:49 +08:00
caching.md Docs - query caching (#11584) 2022-04-18 17:00:21 +08:00
datasource.md Sql docs items (#12530) 2022-05-17 16:56:31 -07:00
datasourcemetadataquery.md Refresh query docs. (#9704) 2020-04-15 16:12:20 -07:00
dimensionspecs.md Refactor SQL docs (#12239) 2022-02-11 14:43:30 -08:00
filters.md Refactor SQL docs (#12239) 2022-02-11 14:43:30 -08:00
granularities.md Refactor SQL docs (#12239) 2022-02-11 14:43:30 -08:00
groupbyquery.md Automatic sizing for GroupBy dictionaries. (#12763) 2022-07-11 08:20:50 -07:00
having.md Refactor SQL docs (#12239) 2022-02-11 14:43:30 -08:00
hll-old.md De-incubation cleanup in code, docs, packaging (#9108) 2020-01-03 12:33:19 -05:00
joins.md Sql docs items (#12530) 2022-05-17 16:56:31 -07:00
limitspec.md Sql docs items (#12530) 2022-05-17 16:56:31 -07:00
lookups.md Clarify the use of the Lookup API (#12088) 2022-05-16 07:50:24 -07:00
multi-value-dimensions.md Adding new config for disabling group by on multiValue column (#12253) 2022-02-16 20:53:26 +05:30
multitenancy.md Addition to Multitenancy considerations doc (#12567) 2022-06-02 10:32:14 -07:00
post-aggregations.md Document expression post-aggregators (#11896) 2022-04-19 10:36:19 +08:00
query-context.md Enable vectorized virtual column processing by default. (#12520) 2022-05-16 15:43:53 +05:30
query-execution.md Refactor SQL docs (#12239) 2022-02-11 14:43:30 -08:00
querying.md Refactor SQL docs (#12239) 2022-02-11 14:43:30 -08:00
scan-query.md Refactor SQL docs (#12239) 2022-02-11 14:43:30 -08:00
searchquery.md fix desc of 'required' for granularity property (#10616) 2020-12-01 18:29:51 -08:00
segmentmetadataquery.md Refactor SQL docs (#12239) 2022-02-11 14:43:30 -08:00
select-query.md Add "offset" parameter to the Scan query. (#10233) 2020-08-13 14:56:24 -07:00
sorting-orders.md Refactor SQL docs (#12239) 2022-02-11 14:43:30 -08:00
sql-aggregations.md Sql docs items (#12530) 2022-05-17 16:56:31 -07:00
sql-api.md Sql docs items (#12530) 2022-05-17 16:56:31 -07:00
sql-data-types.md Segments doc update (#12344) 2022-06-16 13:25:17 -07:00
sql-jdbc.md Sql docs items (#12530) 2022-05-17 16:56:31 -07:00
sql-metadata-tables.md SQL: Add is_active to sys.segments, update examples and docs. (#11550) 2022-05-19 14:23:28 -07:00
sql-multivalue-string-functions.md Sql docs items (#12530) 2022-05-17 16:56:31 -07:00
sql-operators.md Sql docs items (#12530) 2022-05-17 16:56:31 -07:00
sql-query-context.md Sql docs items (#12530) 2022-05-17 16:56:31 -07:00
sql-scalar.md Add TIME_IN_INTERVAL SQL operator. (#12662) 2022-06-21 13:05:37 -07:00
sql-translation.md Sql docs items (#12530) 2022-05-17 16:56:31 -07:00
sql.md IMPLY-12348: Update description of UNION ALL in SQL syntax doc (#12710) 2022-07-05 13:08:01 -07:00
timeboundaryquery.md Refresh query docs. (#9704) 2020-04-15 16:12:20 -07:00
timeseriesquery.md Refactor SQL docs (#12239) 2022-02-11 14:43:30 -08:00
topnmetricspec.md Sql docs items (#12530) 2022-05-17 16:56:31 -07:00
topnquery.md Refactor SQL docs (#12239) 2022-02-11 14:43:30 -08:00
troubleshooting.md Refactor SQL docs (#12239) 2022-02-11 14:43:30 -08:00
using-caching.md Docs - query caching (#11584) 2022-04-18 17:00:21 +08:00
virtual-columns.md Refactor SQL docs (#12239) 2022-02-11 14:43:30 -08:00