druid/docs/content/development/extensions-core/hdfs.md

1.1 KiB

layout
doc_page

HDFS

Make sure to include druid-hdfs-storage as an extension.

Deep Storage

Configuration

Property Possible Values Description Default
druid.storage.type hdfs Must be set.
druid.storage.storageDirectory Directory for storing segments. Must be set.

If you are using the Hadoop indexer, set your output directory to be a location on Hadoop and it will work

Google Cloud Storage

The HDFS extension can also be used for GCS as deep storage.

Configuration

Property Possible Values Description Default
druid.storage.type hdfs Must be set.
druid.storage.storageDirectory gs://bucket/example/directory Must be set.

All services that need to access GCS need to have the GCS connector jar in their class path. One option is to place this jar in /lib/ and /extensions/druid-hdfs-storage/

Tested with Druid 0.9.0, Hadoop 2.7.2 and gcs-connector jar 1.4.4-hadoop2.