Apache Druid: a high performance real-time analytics database.
Go to file
Chi Cao Minh 4ae6466ae2 HDFS input source (#8899)
* HDFS input source

Add support for using HDFS as an input source. In this version, commas
or globs are not supported in HDFS paths.

* Fix forbidden api

* Address review comments
2019-11-19 22:19:39 -08:00
.github add checkbox for licenses.yaml in PR template, mention it in CONTRIBUTING.md (#8367) 2019-08-22 14:14:24 -07:00
.idea Implementing dropwizard emitter for druid (#7363) 2019-10-01 14:59:30 -07:00
benchmarks optimize numeric column null value checking for low filter selectivity (more rows) (#8822) 2019-11-13 10:53:46 -08:00
cloud Add credentials for ECS (#8651) 2019-10-12 09:12:14 -07:00
codestyle Fix dependency analyze warnings (#8230) 2019-09-09 14:37:21 -07:00
core HDFS input source (#8899) 2019-11-19 22:19:39 -08:00
dev Add an item to concurrency checklist about assertions in parall… (#8701) 2019-10-29 11:38:04 +03:00
distribution Address security vulnerabilities (#8878) 2019-11-19 09:14:33 -08:00
docs add google cloud storage InputSource for native batch (#8907) 2019-11-19 19:49:43 -08:00
examples Tidy up lifecycle, query, and ingestion logging. (#8889) 2019-11-19 13:57:58 -08:00
extendedset bump master version to 0.17.0-incubating-SNAPSHOT (#8421) 2019-08-28 01:58:36 -07:00
extensions-contrib Tidy up lifecycle, query, and ingestion logging. (#8889) 2019-11-19 13:57:58 -08:00
extensions-core HDFS input source (#8899) 2019-11-19 22:19:39 -08:00
hll Fix dependency analyze warnings (#8230) 2019-09-09 14:37:21 -07:00
indexing-hadoop Add InputSource and InputFormat interfaces (#8823) 2019-11-15 09:22:09 -08:00
indexing-service Retrying with a backward compatible task type on unknown task type error in parallel indexing (#8905) 2019-11-19 19:29:25 -08:00
integration-tests Tidy up lifecycle, query, and ingestion logging. (#8889) 2019-11-19 13:57:58 -08:00
licenses Address security vulnerabilities (#8878) 2019-11-19 09:14:33 -08:00
processing Tidy up lifecycle, query, and ingestion logging. (#8889) 2019-11-19 13:57:58 -08:00
publications [ImgBot] Optimize images (#7873) 2019-06-24 21:27:48 -07:00
server Tidy up lifecycle, query, and ingestion logging. (#8889) 2019-11-19 13:57:58 -08:00
services Tidy up lifecycle, query, and ingestion logging. (#8889) 2019-11-19 13:57:58 -08:00
sql Tidy up lifecycle, query, and ingestion logging. (#8889) 2019-11-19 13:57:58 -08:00
web-console bump typescript (#8890) 2019-11-17 16:23:47 -08:00
website add google cloud storage InputSource for native batch (#8907) 2019-11-19 19:49:43 -08:00
.codecov.yml Use Codecov (#8388) 2019-08-28 08:49:30 -07:00
.dockerignore Add docker container for druid (#6896) 2019-02-08 12:12:28 +00:00
.gitignore autogenerate NOTICE.BINARY from NOTICE and licenses.yaml (#8306) 2019-08-21 12:46:27 -07:00
.lgtm.yml Add license header for LGTM yaml config file (#8902) 2019-11-18 18:26:45 -08:00
.travis.yml Spellcheck docs (#8548) 2019-09-17 12:47:30 -07:00
CONTRIBUTING.md Fix incorrect build from source path in README.md and druid repo url. (#8531) 2019-09-12 19:48:01 -07:00
DISCLAIMER add missing license headers, in particular to MD files; clean up RAT … (#6563) 2018-11-13 09:38:37 -08:00
LABELS Add plain text README.txt, use relative link from README.md to build.md (#7611) 2019-05-09 21:29:26 -07:00
LICENSE Add missing license pointer for Porter Stemmer (#7941) 2019-06-24 12:21:40 -07:00
NOTICE Address security vulnerabilities (#8878) 2019-11-19 09:14:33 -08:00
README.md Update README.md (#8829) 2019-11-06 08:59:00 -08:00
README.template switch links from druid.io to druid.apache.org (#7914) 2019-06-18 09:06:27 -07:00
licenses.yaml Address security vulnerabilities (#8878) 2019-11-19 09:14:33 -08:00
pom.xml Address security vulnerabilities (#8878) 2019-11-19 09:14:33 -08:00
upload.sh Adding licenses and enable apache-rat-plugin. (#6215) 2018-09-18 08:39:26 -07:00

README.md

Slack Build Status Language grade: Java Coverage Status Docker

Apache Druid (incubating)

Apache Druid (incubating) is a high performance real-time analytics database.

Druid is a next-gen open source alternative to analytical databases such as Vertica, Greenplum, and Exadata, and data warehouses such as Snowflake, BigQuery, and Redshift.

Getting started

You can get started with Druid with our quickstart.

Druid provides a rich set of APIs (via HTTP and JDBC) for loading, managing, and querying your data. You can also interact with Druid via the built-in console (shown below).

Load data

data loader Kafka

Load streaming and batch data using a point-and-click wizard to guide you through ingestion setup. Monitor one off tasks and ingestion supervisors.

Manage the cluster

management

Manage your cluster with ease. Get a view of your datasources, segments, ingestion tasks, and servers from one convenient location. All powered by SQL systems tables allowing you to see the underlying query for each view.

Issue queries

query view combo

Use the built-in query workbench to prototype DruidSQL and native queries or connect one of the many tools that help you make the most out of Druid.

Documentation

You can find the documentation for the latest Druid release on the project website.

If you would like to contribute documentation, please do so under /docs in this repository and submit a pull request.

Community

Community support is available on the druid-user mailing list, which is hosted at Google Groups.

Development discussions occur on dev@druid.apache.org, which you can subscribe to by emailing dev-subscribe@druid.apache.org.

Chat with Druid committers and users in real-time on the #druid channel in the Apache Slack team. Please use this invitation link to join the ASF Slack, and once joined, go into the #druid channel.

Building from source

Please note that JDK 8 is required to build Druid.

For instructions on building Druid from source, see docs/development/build.md

Contributing

Please follow the community guidelines for contributing.

License

Apache License, Version 2.0

Disclaimer: Apache Druid is an effort undergoing incubation at The Apache Software Foundation (ASF), sponsored by the Apache Incubator. Incubation is required of all newly accepted projects until a further review indicates that the infrastructure, communications, and decision making process have stabilized in a manner consistent with other successful ASF projects. While incubation status is not necessarily a reflection of the completeness or stability of the code, it does indicate that the project has yet to be fully endorsed by the ASF.