druid

History

Xavier Léauté 118b50195e Introduce KafkaRecordEntity to support Kafka headers in InputFormats (#10730 ) Today Kafka message support in streaming indexing tasks is limited to message values, and does not provide a way to expose Kafka headers, timestamps, or keys, which may be of interest to more specialized Druid input formats. For instance, Kafka headers may be used to indicate payload format/encoding or additional metadata, and timestamps are often omitted from values in Kafka streams applications, since they are included in the record. This change proposes to introduce KafkaRecordEntity as InputEntity, which would give input formats full access to the underlying Kafka record, including headers, key, timestamps. It would also open access to low-level information such as topic, partition, offset if needed. KafkaEntity is a subclass of ByteEntity for backwards compatibility with existing input formats, and to avoid introducing unnecessary complexity for Kinesis indexing tasks.	2021-01-08 16:04:37 -08:00
..
src	Introduce KafkaRecordEntity to support Kafka headers in InputFormats (#10730 )	2021-01-08 16:04:37 -08:00
pom.xml	Update Apache Kafka to 2.7.0 (#10701 )	2020-12-22 13:56:00 -08:00

Introduce KafkaRecordEntity to support Kafka headers in InputFormats (#10730 )

Today Kafka message support in streaming indexing tasks is limited to
message values, and does not provide a way to expose Kafka headers,
timestamps, or keys, which may be of interest to more specialized
Druid input formats. For instance, Kafka headers may be used to indicate
payload format/encoding or additional metadata, and timestamps are often
omitted from values in Kafka streams applications, since they are
included in the record.

This change proposes to introduce KafkaRecordEntity as InputEntity,
which would give input formats full access to the underlying Kafka record,
including headers, key, timestamps. It would also open access to low-level
information such as topic, partition, offset if needed.

KafkaEntity is a subclass of ByteEntity for backwards compatibility with
existing input formats, and to avoid introducing unnecessary complexity
for Kinesis indexing tasks.

2021-01-08 16:04:37 -08:00

src

Introduce KafkaRecordEntity to support Kafka headers in InputFormats (#10730 )

2021-01-08 16:04:37 -08:00

pom.xml

Update Apache Kafka to 2.7.0 (#10701 )

2020-12-22 13:56:00 -08:00