druid/extensions-core
Vishesh Garg e28424ea25
Enable rollup on multi-value dimensions for compaction with MSQ engine (#16937)
Currently compaction with MSQ engine doesn't work for rollup on multi-value dimensions (MVDs), the reason being the default behaviour of grouping on MVD dimensions to unnest the dimension values; for instance grouping on `[s1,s2]` with aggregate `a` will result in two rows: `<s1,a>` and `<s2,a>`. 

This change enables rollup on MVDs (without unnest) by converting MVDs to Arrays before rollup using virtual columns, and then converting them back to MVDs using post aggregators. If segment schema is available to the compaction task (when it ends up downloading segments to get existing dimensions/metrics/granularity), it selectively does the MVD-Array conversion only for known multi-valued columns; else it conservatively performs this conversion for all `string` columns.
2024-09-04 16:28:04 +05:30
..
avro-extensions Better error handling when retrieving Avro schemas from registry (#16684) 2024-07-02 16:48:34 -07:00
azure-extensions Allow druid.azure.account to be nullable (#16960) 2024-09-02 12:05:51 +05:30
datasketches generic block compressed complex columns (#16863) 2024-08-27 00:34:41 -07:00
druid-aws-rds-extensions Prepare master branch for 31.0.0 release (#16333) 2024-04-26 09:22:43 +05:30
druid-basic-security Fix excessive logging from druid-basic-security (#16767) 2024-07-22 08:33:00 +05:30
druid-bloom-filter generic block compressed complex columns (#16863) 2024-08-27 00:34:41 -07:00
druid-catalog Quidem record (#16624) 2024-08-05 14:58:32 +02:00
druid-kerberos Prepare master branch for 31.0.0 release (#16333) 2024-04-26 09:22:43 +05:30
druid-pac4j Improve exception handling in extension druid-pac4j (#16979) 2024-08-30 12:32:49 +05:30
druid-ranger-security Prepare master branch for 31.0.0 release (#16333) 2024-04-26 09:22:43 +05:30
ec2-extensions Prepare master branch for 31.0.0 release (#16333) 2024-04-26 09:22:43 +05:30
google-extensions Prepare master branch for 31.0.0 release (#16333) 2024-04-26 09:22:43 +05:30
hdfs-storage Prepare master branch for 31.0.0 release (#16333) 2024-04-26 09:22:43 +05:30
histogram generic block compressed complex columns (#16863) 2024-08-27 00:34:41 -07:00
kafka-extraction-namespace Prepare master branch for 31.0.0 release (#16333) 2024-04-26 09:22:43 +05:30
kafka-indexing-service Track IngestionState more accurately in realtime tasks. (#16934) 2024-08-22 11:43:46 +05:30
kinesis-indexing-service Track IngestionState more accurately in realtime tasks. (#16934) 2024-08-22 11:43:46 +05:30
kubernetes-extensions #16717 defer provider instantiation in Kubernetes Module (#16726) 2024-07-16 13:05:28 -07:00
lookups-cached-global upgrade mysql:mysql-connector-java to 8.2.0 (#16024) 2024-05-06 21:58:37 +08:00
lookups-cached-single upgrade mysql:mysql-connector-java to 8.2.0 (#16024) 2024-05-06 21:58:37 +08:00
multi-stage-query Enable rollup on multi-value dimensions for compaction with MSQ engine (#16937) 2024-09-04 16:28:04 +05:30
mysql-metadata-storage remove Firehose and FirehoseFactory (#16758) 2024-07-19 14:37:21 -07:00
orc-extensions Prepare master branch for 31.0.0 release (#16333) 2024-04-26 09:22:43 +05:30
parquet-extensions Refactor: Clean up `DecimalParquetInputTest` using Assume (#16436) 2024-05-14 21:13:07 +05:30
postgresql-metadata-storage remove Firehose and FirehoseFactory (#16758) 2024-07-19 14:37:21 -07:00
protobuf-extensions Prepare master branch for 31.0.0 release (#16333) 2024-04-26 09:22:43 +05:30
s3-extensions Reduce logging in RetryableS3OutputStream (#16853) 2024-08-08 10:42:40 +05:30
simple-client-sslcontext Prepare master branch for 31.0.0 release (#16333) 2024-04-26 09:22:43 +05:30
stats generic block compressed complex columns (#16863) 2024-08-27 00:34:41 -07:00
testing-tools Quidem record (#16624) 2024-08-05 14:58:32 +02:00