druid/docs/ingestion
Atul Mohan 03d6d395a0
Extension to read and ingest iceberg data files (#14329)
This adds a new contrib extension: druid-iceberg-extensions which can be used to ingest data stored in Apache Iceberg format. It adds a new input source of type iceberg that connects to a catalog and retrieves the data files associated with an iceberg table and provides these data file paths to either an S3 or HDFS input source depending on the warehouse location.

Two important dependencies associated with Apache Iceberg tables are:

Catalog : This extension supports reading from either a Hive Metastore catalog or a Local file-based catalog. Support for AWS Glue is not available yet.
Warehouse : This extension supports reading data files from either HDFS or S3. Adapters for other cloud object locations should be easy to add by extending the AbstractInputSourceAdapter.
2023-07-18 08:59:57 +05:30
..
data-formats.md Update Ingestion section (#14023) 2023-05-19 09:42:27 -07:00
faq.md API reference refactor (#14372) 2023-06-26 15:48:54 -07:00
hadoop.md API reference refactor (#14372) 2023-06-26 15:48:54 -07:00
index.md API reference refactor (#14372) 2023-06-26 15:48:54 -07:00
ingestion-spec.md doc: escape tags in markdown in prepration for docusaurus2 (#14379) 2023-06-08 11:26:18 -07:00
input-sources.md Extension to read and ingest iceberg data files (#14329) 2023-07-18 08:59:57 +05:30
native-batch-firehose.md Update Ingestion section (#14023) 2023-05-19 09:42:27 -07:00
native-batch-simple-task.md Update Ingestion section (#14023) 2023-05-19 09:42:27 -07:00
native-batch.md API reference refactor (#14372) 2023-06-26 15:48:54 -07:00
partitioning.md Add missing MSQ error code fields to docs (#13308) 2022-11-10 21:03:04 +05:30
rollup.md Update Ingestion section (#14023) 2023-05-19 09:42:27 -07:00
schema-design.md Update Ingestion section (#14023) 2023-05-19 09:42:27 -07:00
schema-model.md Update Ingestion section (#14023) 2023-05-19 09:42:27 -07:00
standalone-realtime.md Reduce visibility of Tranquility documentation (#11134) 2021-05-03 16:48:24 -07:00
tasks.md API reference refactor (#14372) 2023-06-26 15:48:54 -07:00
tranquility.md Reduce visibility of Tranquility documentation (#11134) 2021-05-03 16:48:24 -07:00