7cd45a6e1f
- For now uses a hardcoded ratio of aggregator to timeanddim buffer sizes - canAppendRow is a workaround for realtime index since the Firehose currently does not have a way of rolling back the last event in case of error - canAppendRow needs a fudge factor; there is a race between checking if we can add a row and actually adding a row, because of the way MapDB reports its size. |
||
---|---|---|
common | ||
docs | ||
examples | ||
extensions | ||
indexing-hadoop | ||
indexing-service | ||
processing | ||
publications | ||
server | ||
services | ||
.gitignore | ||
DruidCorporateCLA.pdf | ||
DruidIndividualCLA.pdf | ||
LICENSE | ||
README.md | ||
build.sh | ||
eclipse_formatting.xml | ||
intellij_formatting.jar | ||
pom.xml | ||
upload.sh |
README.md
Druid
Druid is a distributed, column-oriented, real-time analytics data store that is commonly used to power exploratory dashboards in multi-tenant environments. Druid excels as a data warehousing solution for fast aggregate queries on petabyte sized data sets. Druid supports a variety of flexible filters, exact calculations, approximate algorithms, and other useful calculations. Druid can load both streaming and batch data and integrates with Storm and Hadoop.
More Information
Much more information about Druid can be found on our website.
Documentation
We host documentation on our website. If you want to contribute documentation changes, please submit a pull request to this repository.
Tutorials
We have a series of tutorials to get started with Druid, starting with this one.
Support
Report any bugs using GitHub issues.
Contact us through our forum or on IRC in #druid-dev
on irc.freenode.net
.