druid/extensions-core
imply-cheddar f684df4c22
Use an HllSketchHolder object to enable optimized merge (#13737)

HllSketchAggregatorFactory.combine had been implemented using a
pure pair-wise, "make a union -> add 2 things to union -> get sketch"
algorithm.  This algorithm does 2 things that waste CPU:

1) The Union object always builds an HLL_8 sketch regardless of the
  target type.  This means that when the target type is not HLL_8, we
  spend CPU cycles converting to HLL_8 and back over and over again.
2) By throwing away the Union object and converting back to the
  HllSketch only to build another Union object, we do lots and lots
  of copy+conversions of the HllSketch.

This change introduces an HllSketchHolder object which can hold onto
a Union object and delay conversion back into an HllSketch until
it is actually needed.  This follows the same pattern as the
SketchHolder object for theta sketches.
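
The holder idea described above can be illustrated with a small, self-contained sketch. Everything in it is an assumption made for illustration: ToySketch, ToyUnion, and ToyHolder are hypothetical stand-ins backed by plain sets, not the DataSketches HllSketch/Union classes and not the HllSketchHolder added by this commit. The conversions counter marks where the costly "union -> sketch" conversion would happen, so the pairwise approach and the holder approach can be compared.

```java
import java.util.Arrays;
import java.util.Collections;
import java.util.HashSet;
import java.util.List;
import java.util.Set;

class HolderSketchDemo {
    // Counts how often a union is converted back into a sketch.
    static int conversions = 0;

    /** Hypothetical immutable sketch; a plain set stands in for HLL state. */
    static final class ToySketch {
        final Set<String> items;
        ToySketch(Set<String> items) {
            this.items = Collections.unmodifiableSet(new HashSet<>(items));
        }
    }

    /** Hypothetical mutable union; getResult() models the costly conversion. */
    static final class ToyUnion {
        private final Set<String> acc = new HashSet<>();
        void update(ToySketch s) { acc.addAll(s.items); }
        ToySketch getResult() { conversions++; return new ToySketch(acc); }
    }

    /** Hypothetical holder: keeps the union alive and converts only on demand. */
    static final class ToyHolder {
        private ToySketch sketch; // cached sketch, null once it is stale
        private ToyUnion union;   // created lazily on the first merge

        ToyHolder(ToySketch sketch) { this.sketch = sketch; }

        ToyHolder merge(ToyHolder other) {
            if (union == null) {
                union = new ToyUnion();
                union.update(sketch);
            }
            union.update(other.getSketch());
            sketch = null; // any cached result no longer reflects the union
            return this;
        }

        ToySketch getSketch() {
            if (sketch == null) {
                sketch = union.getResult(); // conversion happens only here
            }
            return sketch;
        }
    }

    /** Pairwise style: a new union and a conversion for every combine step. */
    static ToySketch pairwiseMerge(List<ToySketch> inputs) {
        ToySketch acc = inputs.get(0);
        for (ToySketch s : inputs.subList(1, inputs.size())) {
            ToyUnion u = new ToyUnion();
            u.update(acc);
            u.update(s);
            acc = u.getResult();
        }
        return acc;
    }

    /** Holder style: one union reused across merges, one conversion at the end. */
    static ToySketch holderMerge(List<ToySketch> inputs) {
        ToyHolder holder = new ToyHolder(inputs.get(0));
        for (ToySketch s : inputs.subList(1, inputs.size())) {
            holder.merge(new ToyHolder(s));
        }
        return holder.getSketch();
    }

    static ToySketch sketchOf(String... values) {
        return new ToySketch(new HashSet<>(Arrays.asList(values)));
    }

    static List<ToySketch> sampleInputs() {
        return Arrays.asList(
            sketchOf("a", "b"), sketchOf("b", "c"),
            sketchOf("c", "d"), sketchOf("d", "e"));
    }

    public static void main(String[] args) {
        List<ToySketch> inputs = sampleInputs();
        conversions = 0;
        pairwiseMerge(inputs);
        System.out.println("pairwise conversions: " + conversions); // 3
        conversions = 0;
        holderMerge(inputs);
        System.out.println("holder conversions: " + conversions);   // 1
    }
}
```

In this toy model, merging N inputs pairwise performs N-1 conversions, while the holder performs one. The real saving depends on the cost of HLL type conversion, which the toy classes do not model.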
2023-02-07 13:57:48 -08:00
..
avro-extensions Fixing incorrect filtering of nulls in an array when ingesting for JSON and Avro (#13712) 2023-02-01 04:15:08 -08:00
azure-extensions Revert "Add filter in cloud object input source for backward compatibility (#13437)" (#13450) 2022-11-30 16:33:05 +05:30
datasketches Use an HllSketchHolder object to enable optimized merge (#13737) 2023-02-07 13:57:48 -08:00
druid-aws-rds-extensions Prepare master branch for next release, 26.0.0 (#13401) 2022-11-22 15:31:01 +05:30
druid-basic-security Validate response headers and fix exception logging (#13609) 2023-01-05 14:15:15 -08:00
druid-bloom-filter various nested column (and other) fixes (#13732) 2023-02-06 19:48:02 -08:00
druid-catalog Much improved table functions (#13627) 2023-01-17 08:41:57 -08:00
druid-kerberos Prepare master branch for next release, 26.0.0 (#13401) 2022-11-22 15:31:01 +05:30
druid-pac4j Prepare master branch for next release, 26.0.0 (#13401) 2022-11-22 15:31:01 +05:30
druid-ranger-security Prepare master branch for next release, 26.0.0 (#13401) 2022-11-22 15:31:01 +05:30
ec2-extensions Prepare master branch for next release, 26.0.0 (#13401) 2022-11-22 15:31:01 +05:30
google-extensions Add InputStats to track bytes processed by a task (#13520) 2022-12-13 18:54:42 +05:30
hdfs-storage Add InputStats to track bytes processed by a task (#13520) 2022-12-13 18:54:42 +05:30
histogram SQL test framework extensions (#13426) 2022-12-02 09:11:59 -08:00
kafka-extraction-namespace Prepare master branch for next release, 26.0.0 (#13401) 2022-11-22 15:31:01 +05:30
kafka-indexing-service discover nested columns when using nested column indexer for schemaless ingestion (#13672) 2023-01-18 12:57:28 -08:00
kinesis-indexing-service Kinesis: More robust default fetch settings. (#13539) 2023-01-13 11:03:54 +05:30
kubernetes-extensions update org.bouncycastle:bcprov-jdk15on 1.68 to 1.69 (#13440) 2022-11-30 21:57:38 +05:30
lookups-cached-global Quote and escape literals in JDBC lookup to allow reserved identifiers. (#13632) 2023-01-10 12:11:54 +05:30
lookups-cached-single Prepare master branch for next release, 26.0.0 (#13401) 2022-11-22 15:31:01 +05:30
multi-stage-query Fix NPE in postCleanupStage if stage doesn't exist (#13742) 2023-02-06 19:13:39 +05:30
mysql-metadata-storage Prepare master branch for next release, 26.0.0 (#13401) 2022-11-22 15:31:01 +05:30
orc-extensions discover nested columns when using nested column indexer for schemaless ingestion (#13672) 2023-01-18 12:57:28 -08:00
parquet-extensions discover nested columns when using nested column indexer for schemaless ingestion (#13672) 2023-01-18 12:57:28 -08:00
postgresql-metadata-storage Prepare master branch for next release, 26.0.0 (#13401) 2022-11-22 15:31:01 +05:30
protobuf-extensions discover nested columns when using nested column indexer for schemaless ingestion (#13672) 2023-01-18 12:57:28 -08:00
s3-extensions Robust handling and management of S3 streams for MSQ shuffle storage (#13741) 2023-02-07 14:17:37 +05:30
simple-client-sslcontext Prepare master branch for next release, 26.0.0 (#13401) 2022-11-22 15:31:01 +05:30
stats SQL test framework extensions (#13426) 2022-12-02 09:11:59 -08:00
testing-tools SQL test framework extensions (#13426) 2022-12-02 09:11:59 -08:00