501dcb43fa
This is done by killing and respawning the JVMs rather than reconnecting to existing JVMs, for a couple of reasons. One is that it lets you restore tasks after server reboots too, and another is that it lets you upgrade all the software on a box at once by just restarting everything.

The main changes are:

1) Add "canRestore" and "stopGracefully" methods to Tasks that say if a task can stop gracefully, and actually do a graceful stop. RealtimeIndexTask is the only one that currently implements this.

2) Add "stop" method to TaskRunners that attempts to do an orderly shutdown.
   ThreadPoolTaskRunner: call stopGracefully on restorable tasks, wait for exit
   ForkingTaskRunner: close output stream to restorable tasks, wait for exit
   RemoteTaskRunner: do nothing special, we actually don't want to shut down

3) Add "restore" method to TaskRunners that attempts to bootstrap tasks from last run. Only ForkingTaskRunner does anything here. It maintains a "restore.json" file with a list of restorable tasks.

4) Have the CliPeon's ExecutorLifecycle lock the task base directory to prevent a restored task and a zombie old task from stomping on each other.
aws-common
benchmarks
common
distribution
docs
examples
extensions
indexing-hadoop
indexing-service
integration-tests
processing
publications
server
services
.gitignore
.travis.yml
CONTRIBUTING.md
DruidCorporateCLA.pdf
DruidIndividualCLA.pdf
LICENSE
NOTICE
README.md
eclipse_formatting.xml
intellij_formatting.jar
pom.xml
upload.sh
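The commit message above describes a graceful-stop contract: tasks report whether they can be restored, and a thread-pool-style runner asks restorable tasks to stop and then waits for them to exit. The following is a minimal sketch of that flow, not Druid's actual code. The names `GracefulStopSketch`, `Task`, `RestorableTask`, and `SketchRunner`, and all method signatures, are hypothetical stand-ins chosen for illustration. Only the method names `canRestore`, `stopGracefully`, and `stop` come from the commit message.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.concurrent.CountDownLatch;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.TimeUnit;

public class GracefulStopSketch {
    /** Hypothetical stand-in for the task contract; the real Druid interface differs. */
    public interface Task {
        String getId();
        boolean canRestore();      // can this task be stopped and restored later?
        void stopGracefully();     // request an orderly stop
        void run() throws InterruptedException;
    }

    /** A restorable task that simply runs until it is asked to stop. */
    public static class RestorableTask implements Task {
        private final String id;
        private final CountDownLatch stopRequested = new CountDownLatch(1);

        public RestorableTask(String id) { this.id = id; }
        public String getId() { return id; }
        public boolean canRestore() { return true; }
        public void stopGracefully() { stopRequested.countDown(); }
        public void run() throws InterruptedException { stopRequested.await(); }
    }

    /** Thread-pool-style runner. Assumed to be driven from a single thread. */
    public static class SketchRunner {
        private final ExecutorService exec = Executors.newCachedThreadPool();
        private final List<Task> running = new ArrayList<>();

        public void submit(Task task) {
            running.add(task);
            // Block lambda returning a value is a Callable, so run() may throw.
            exec.submit(() -> { task.run(); return null; });
        }

        /** Asks restorable tasks to stop, waits for exit, returns their ids. */
        public List<String> stop(long timeoutMillis) {
            List<String> restorable = new ArrayList<>();
            for (Task task : running) {
                if (task.canRestore()) {
                    task.stopGracefully();
                    restorable.add(task.getId());
                }
            }
            exec.shutdown();
            try {
                if (!exec.awaitTermination(timeoutMillis, TimeUnit.MILLISECONDS)) {
                    exec.shutdownNow(); // interrupt tasks that did not exit in time
                }
            } catch (InterruptedException e) {
                exec.shutdownNow();
                Thread.currentThread().interrupt();
            }
            return restorable;
        }

        public boolean isTerminated() { return exec.isTerminated(); }
    }

    public static void main(String[] args) {
        SketchRunner runner = new SketchRunner();
        runner.submit(new RestorableTask("realtime_example"));
        System.out.println("Asked to stop: " + runner.stop(5_000));
    }
}
```

In this sketch the list returned by `stop` plays the role of the restorable-task list that the commit message says is kept in "restore.json". Persisting that list and respawning the tasks on startup is left out.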
README.md
Druid
Druid is a distributed, column-oriented, real-time analytics data store that is commonly used to power exploratory dashboards in multi-tenant environments.
Druid excels as a data warehousing solution for fast aggregate queries on petabyte-sized data sets. Druid supports a variety of flexible filters, exact calculations, approximate algorithms, and other useful calculations.
Druid can load both streaming and batch data and integrates with Samza, Kafka, Storm, and Hadoop.
License
More Information
More information about Druid can be found on http://www.druid.io.
Documentation
You can find the latest Druid Documentation on the project website.
If you would like to contribute documentation, please do so under /docs/content in this repository and submit a pull request.
Tutorials
We have a series of tutorials to get started with Druid. If you are just getting started, we suggest going over the first Druid tutorial.
Reporting Issues
If you find any bugs, please file a GitHub issue.
Community
Community support is available on the druid-user mailing list (druid-user@googlegroups.com).
Development discussions occur on the druid-development list (druid-development@googlegroups.com).
We also have a couple of people hanging out on IRC in #druid-dev on irc.freenode.net.