layout
doc_page

Rackspace Cloud Files

Deep Storage

Rackspace Cloud Files is another option for deep storage. It requires some additional Druid configuration.

|Property|Possible Values|Description|Default|
|--------|---------------|-----------|-------|
|`druid.storage.type`|cloudfiles||Must be set.|
|`druid.storage.region`||Rackspace Cloud Files region.|Must be set.|
|`druid.storage.container`||Rackspace Cloud Files container name.|Must be set.|
|`druid.storage.basePath`||Rackspace Cloud Files base path to use in the container.|Must be set.|
|`druid.storage.operationMaxRetries`||Number of tries before cancelling a Rackspace operation.|10|
|`druid.cloudfiles.userName`||Rackspace Cloud username.|Must be set.|
|`druid.cloudfiles.apiKey`||Rackspace Cloud API key.|Must be set.|
|`druid.cloudfiles.provider`|rackspace-cloudfiles-us, rackspace-cloudfiles-uk|Name of the provider depending on the region.|Must be set.|
|`druid.cloudfiles.useServiceNet`|true, false|Whether to use the internal service net.|true|
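
As a rough illustration, the properties above might be combined in a runtime properties file as sketched below. The region, container name, base path, username, and API key are hypothetical placeholders, not values taken from a real deployment; substitute your own.

```properties
# Deep storage on Rackspace Cloud Files.
# All values below are hypothetical placeholders for illustration.
druid.storage.type=cloudfiles
druid.storage.region=DFW
druid.storage.container=druid-deep-storage
druid.storage.basePath=druid/segments
# Optional; 10 is the documented default.
druid.storage.operationMaxRetries=10

# Rackspace account credentials and provider.
druid.cloudfiles.userName=my-rackspace-user
druid.cloudfiles.apiKey=my-rackspace-api-key
druid.cloudfiles.provider=rackspace-cloudfiles-us
# Optional; true is the documented default.
druid.cloudfiles.useServiceNet=true
```

The two properties with documented defaults are listed only for completeness and could be left out.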

Firehose

StaticCloudFilesFirehose

This firehose ingests events, similar to the StaticAzureBlobStoreFirehose, but from Rackspace's Cloud Files.

Data is newline delimited, with one JSON object per line, and is parsed according to the InputRowParser configuration.

The storage account is shared with the one used for Rackspace's Cloud Files deep storage functionality, but blobs can be in a different region and container.

As with the Azure blobstore, a blob is assumed to be gzipped if its name ends in .gz.

Sample spec:

"firehose" : {
    "type" : "static-cloudfiles",
    "blobs": [
        {
          "region": "DFW",
          "container": "container",
          "path": "/path/to/your/file.json"
        },
        {
          "region": "ORD",
          "container": "anothercontainer",
          "path": "/another/path.json"
        }
    ]
}

|property|description|default|required?|
|--------|-----------|-------|---------|
|type|This should be "static-cloudfiles".|N/A|yes|
|blobs|JSON array of Cloud Files blobs.|N/A|yes|

Cloud Files Blobs:

|property|description|default|required?|
|--------|-----------|-------|---------|
|container|Name of the Cloud Files container.|N/A|yes|
|path|The path where data is located.|N/A|yes|