druid/extensions-contrib
Didip Kerabat 6ddb828c7a
Able to filter Cloud objects with glob notation. (#12659)
In a heterogeneous environment, sometimes you don't have control over the input folder. Upstream can put any folder they want. In this situation the S3InputSource.java is unusable.

Most people like me solved it by using Airflow to fetch the full list of parquet files and pass it over to Druid. But doing this explodes the JSON spec. We had a situation where 1 of the JSON spec is 16MB and that's simply too much for Overlord.

This patch allows users to pass {"filter": "*.parquet"} and let Druid performs the filtering of the input files.

I am using the glob notation to be consistent with the LocalFirehose syntax.
2022-06-24 11:40:08 +05:30
..
aliyun-oss-extensions Able to filter Cloud objects with glob notation. (#12659) 2022-06-24 11:40:08 +05:30
ambari-metrics-emitter Bump up the versions (#12480) 2022-04-27 14:28:20 +05:30
cassandra-storage Bump up the versions (#12480) 2022-04-27 14:28:20 +05:30
cloudfiles-extensions Bump up the versions (#12480) 2022-04-27 14:28:20 +05:30
distinctcount Free ByteBuffers in tests and fix some bugs. (#12521) 2022-05-19 07:42:29 -07:00
dropwizard-emitter Bump up the versions (#12480) 2022-04-27 14:28:20 +05:30
gce-extensions Bump up the versions (#12480) 2022-04-27 14:28:20 +05:30
graphite-emitter Bump up the versions (#12480) 2022-04-27 14:28:20 +05:30
influx-extensions Bump up the versions (#12480) 2022-04-27 14:28:20 +05:30
influxdb-emitter Bump up the versions (#12480) 2022-04-27 14:28:20 +05:30
kafka-emitter Bump up the versions (#12480) 2022-04-27 14:28:20 +05:30
materialized-view-maintenance Bump up the versions (#12480) 2022-04-27 14:28:20 +05:30
materialized-view-selection Add remedial information in error message when type is unknown (#12612) 2022-06-07 20:22:45 +05:30
momentsketch Bump up the versions (#12480) 2022-04-27 14:28:20 +05:30
moving-average-query Bump up the versions (#12480) 2022-04-27 14:28:20 +05:30
opentelemetry-emitter Bump opentelemetry-instrumentation-bom-alpha (#12531) 2022-06-01 13:51:39 -07:00
opentsdb-emitter Bump up the versions (#12480) 2022-04-27 14:28:20 +05:30
prometheus-emitter Bump up the versions (#12480) 2022-04-27 14:28:20 +05:30
redis-cache Bump up the versions (#12480) 2022-04-27 14:28:20 +05:30
sqlserver-metadata-storage Optimize overlord GET /tasks memory usage (#12404) 2022-06-16 22:30:37 +05:30
statsd-emitter Bump up the versions (#12480) 2022-04-27 14:28:20 +05:30
tdigestsketch Bump up the versions (#12480) 2022-04-27 14:28:20 +05:30
thrift-extensions Bump up the versions (#12480) 2022-04-27 14:28:20 +05:30
time-min-max Bump up the versions (#12480) 2022-04-27 14:28:20 +05:30
virtual-columns Bump up the versions (#12480) 2022-04-27 14:28:20 +05:30
README.md fix broken links (#9537) 2020-03-22 17:41:18 -07:00

README.md

Community Extensions

Please contribute all community extensions in this directory and include a doc of how your extension can be used under docs/development/extensions-contrib/.

Please note that community extensions are maintained by their original contributors and are not packaged with the core Druid distribution. If you'd like to take on maintenance for a community extension, please post on dev@druid.apache.org to let us know!