druid/extensions-core
Gian Merlino 93aeaf4801
Improve on-heap aggregator footprint estimates. (#11950)
Add a "guessAggregatorHeapFootprint" method to AggregatorFactory that
mitigates #6743 by enabling heap footprint estimates based on a specific
number of rows. The idea is that at ingestion time, the number of rows
that go into an aggregator will be 1 (if rollup is off) or will likely
be a small number (if rollup is on).

It's a heuristic, because of course nothing guarantees that the rollup
ratio is a small number. But it's a common case, and I expect this logic
to go wrong much less often than the current logic. Also, when it does
go wrong, users can fix it by lowering maxRowsInMemory or
maxBytesInMemory. The current situation is unintuitive: when the
estimation goes wrong, users get an OOME, but actually they need to
*raise* these limits to fix it.
2021-11-28 13:21:24 +05:30
..
avro-extensions Support for hadoop 3 via maven profiles (#11794) 2021-10-30 22:46:24 +05:30
azure-extensions Migrate File.mkdirs to FileUtils.mkdirp. (#11879) 2021-11-09 11:10:49 -08:00
datasketches Improve on-heap aggregator footprint estimates. (#11950) 2021-11-28 13:21:24 +05:30
druid-aws-rds-extensions bump version to 0.23.0-SNAPSHOT (#11670) 2021-09-08 15:56:04 -07:00
druid-basic-security Migrate File.mkdirs to FileUtils.mkdirp. (#11879) 2021-11-09 11:10:49 -08:00
druid-bloom-filter restore and deprecate AggregatorFactory methods (#11917) 2021-11-19 15:59:35 -08:00
druid-kerberos Support for hadoop 3 via maven profiles (#11794) 2021-10-30 22:46:24 +05:30
druid-pac4j bump version to 0.23.0-SNAPSHOT (#11670) 2021-09-08 15:56:04 -07:00
druid-ranger-security Support for hadoop 3 via maven profiles (#11794) 2021-10-30 22:46:24 +05:30
ec2-extensions bump version to 0.23.0-SNAPSHOT (#11670) 2021-09-08 15:56:04 -07:00
google-extensions Migrate File.mkdirs to FileUtils.mkdirp. (#11879) 2021-11-09 11:10:49 -08:00
hdfs-storage Migrate File.mkdirs to FileUtils.mkdirp. (#11879) 2021-11-09 11:10:49 -08:00
histogram restore and deprecate AggregatorFactory methods (#11917) 2021-11-19 15:59:35 -08:00
kafka-extraction-namespace TestKafkaExtractionCluster: Shut down Kafka, ZK in @After. (#11963) 2021-11-20 15:17:05 -08:00
kafka-indexing-service Kafka Input Format for headers, key and payload parsing (#11630) 2021-10-07 08:56:27 -07:00
kinesis-indexing-service bump version to 0.23.0-SNAPSHOT (#11670) 2021-09-08 15:56:04 -07:00
kubernetes-extensions Remove CloseQuietly and migrate its usages to other methods. (#10247) 2021-10-23 17:03:21 -07:00
lookups-cached-global Warn if cache size of lookup is beyond max size (#11863) 2021-11-03 21:32:22 -04:00
lookups-cached-single bump version to 0.23.0-SNAPSHOT (#11670) 2021-09-08 15:56:04 -07:00
mysql-metadata-storage bump version to 0.23.0-SNAPSHOT (#11670) 2021-09-08 15:56:04 -07:00
orc-extensions Upgrade ORC to 1.7.1 (#11919) 2021-11-15 09:13:03 -08:00
parquet-extensions Support for hadoop 3 via maven profiles (#11794) 2021-10-30 22:46:24 +05:30
postgresql-metadata-storage bump version to 0.23.0-SNAPSHOT (#11670) 2021-09-08 15:56:04 -07:00
protobuf-extensions bump version to 0.23.0-SNAPSHOT (#11670) 2021-09-08 15:56:04 -07:00
s3-extensions AWS "Data read has a different length than the expected" error should reset stream and try again (#11941) 2021-11-26 12:45:34 -07:00
simple-client-sslcontext bump version to 0.23.0-SNAPSHOT (#11670) 2021-09-08 15:56:04 -07:00
stats restore and deprecate AggregatorFactory methods (#11917) 2021-11-19 15:59:35 -08:00
testing-tools complex typed expressions (#11853) 2021-11-08 00:33:06 -08:00