druid/extensions-core
Xavier Léauté 118b50195e
Introduce KafkaRecordEntity to support Kafka headers in InputFormats (#10730)
Today Kafka message support in streaming indexing tasks is limited to
message values, and does not provide a way to expose Kafka headers,
timestamps, or keys, which may be of interest to more specialized
Druid input formats. For instance, Kafka headers may be used to indicate
payload format/encoding or additional metadata, and timestamps are often
omitted from values in Kafka streams applications, since they are
included in the record.

This change proposes to introduce KafkaRecordEntity as InputEntity,
which would give input formats full access to the underlying Kafka record,
including headers, key, timestamps. It would also open access to low-level
information such as topic, partition, offset if needed.

KafkaEntity is a subclass of ByteEntity for backwards compatibility with
existing input formats, and to avoid introducing unnecessary complexity
for Kinesis indexing tasks.
2021-01-08 16:04:37 -08:00
..
avro-extensions Fix ingestion failure of pretty-formatted JSON message (#10383) 2020-11-13 13:59:23 -08:00
azure-extensions Update version to 0.21.0-SNAPSHOT (#10450) 2020-10-03 16:08:34 -07:00
datasketches Update version to 0.21.0-SNAPSHOT (#10450) 2020-10-03 16:08:34 -07:00
druid-aws-rds-extensions AWS RDS token based password provider (#9518) 2021-01-06 21:15:29 -08:00
druid-basic-security integration test for coordinator and overlord leadership client (#10680) 2020-12-17 22:50:12 -08:00
druid-bloom-filter support for vectorizing expressions with non-existent inputs, more consistent type handling for non-vectorized expressions (#10499) 2020-10-26 19:55:24 -07:00
druid-kerberos Update version to 0.21.0-SNAPSHOT (#10450) 2020-10-03 16:08:34 -07:00
druid-pac4j Update version to 0.21.0-SNAPSHOT (#10450) 2020-10-03 16:08:34 -07:00
druid-ranger-security Update version to 0.21.0-SNAPSHOT (#10450) 2020-10-03 16:08:34 -07:00
ec2-extensions Update version to 0.21.0-SNAPSHOT (#10450) 2020-10-03 16:08:34 -07:00
google-extensions Update version to 0.21.0-SNAPSHOT (#10450) 2020-10-03 16:08:34 -07:00
hdfs-storage Update version to 0.21.0-SNAPSHOT (#10450) 2020-10-03 16:08:34 -07:00
histogram Update version to 0.21.0-SNAPSHOT (#10450) 2020-10-03 16:08:34 -07:00
kafka-extraction-namespace Update Apache Kafka to 2.7.0 (#10701) 2020-12-22 13:56:00 -08:00
kafka-indexing-service Introduce KafkaRecordEntity to support Kafka headers in InputFormats (#10730) 2021-01-08 16:04:37 -08:00
kinesis-indexing-service Introduce KafkaRecordEntity to support Kafka headers in InputFormats (#10730) 2021-01-08 16:04:37 -08:00
kubernetes-extensions Update deps for CVE-2020-28168 and CVE-2020-28052 (#10733) 2021-01-07 20:31:44 -08:00
lookups-cached-global Update version to 0.21.0-SNAPSHOT (#10450) 2020-10-03 16:08:34 -07:00
lookups-cached-single Two fixes related to encoding of % symbols. (#10645) 2020-12-06 22:35:11 -08:00
mysql-metadata-storage modify access to protected SQLMetadataConnector methods to allow extensions to create SQL metadata tables using implementation specific constructs (payload type, serial type, etc) (#10573) 2020-11-12 23:20:01 +05:30
orc-extensions Fix ingestion failure of pretty-formatted JSON message (#10383) 2020-11-13 13:59:23 -08:00
parquet-extensions Fix ingestion failure of pretty-formatted JSON message (#10383) 2020-11-13 13:59:23 -08:00
postgresql-metadata-storage modify access to protected SQLMetadataConnector methods to allow extensions to create SQL metadata tables using implementation specific constructs (payload type, serial type, etc) (#10573) 2020-11-12 23:20:01 +05:30
protobuf-extensions Update version to 0.21.0-SNAPSHOT (#10450) 2020-10-03 16:08:34 -07:00
s3-extensions Update version to 0.21.0-SNAPSHOT (#10450) 2020-10-03 16:08:34 -07:00
simple-client-sslcontext Update version to 0.21.0-SNAPSHOT (#10450) 2020-10-03 16:08:34 -07:00
stats Update version to 0.21.0-SNAPSHOT (#10450) 2020-10-03 16:08:34 -07:00