Apache Druid: a high-performance real-time analytics database.
Jihoon Son ddd8c9ef97 Add filter selectivity estimation for auto search strategy (#3848)
* Add filter selectivity estimation for auto search strategy

* Addressed comments

* Lazy bitmap materialization for bitmap sampling and java docs

* Addressed comments.

- Fix wrong non-overlap ratio computation and added unit tests.
- Change Iterable<Integer> to IntIterable
- Remove unnecessary Iterable<Integer>

* Addressed comments

- Split a long ternary operation into if-else blocks
- Add IntListUtils.fromTo()

* Fix test failure and add a test for RangeIntList

* fix code style

* Disabled selectivity estimation for multi-valued dimensions

* Address comment
2017-02-06 11:15:03 -08:00
api flattenSpec: Document that "expr" is ignored for type "root". (#3884) 2017-01-31 10:27:20 -08:00
aws-common Bump versions to 0.9.3-SNAPSHOT (#3524) 2016-09-29 13:53:32 -07:00
benchmarks Add filter selectivity estimation for auto search strategy (#3848) 2017-02-06 11:15:03 -08:00
bytebuffer-collections Migrating extendedset from Metamarkets. (#3694) 2017-01-17 10:10:27 -08:00
codestyle Migrating extendedset from Metamarkets. (#3694) 2017-01-17 10:10:27 -08:00
common Add virtual column types, holder serde, and safety features. (#3823) 2017-01-26 18:15:51 -08:00
distribution Druid Extension to enable Authentication using Kerberos. (#3853) 2017-02-02 14:55:21 -06:00
docs Simple doc fix (#3907) 2017-02-06 15:52:17 +05:30
examples Improve startup script - create PID and LOG dir if they do not exist (#3808) 2017-01-02 09:20:22 -08:00
extendedset Migrating extendedset from Metamarkets. (#3694) 2017-01-17 10:10:27 -08:00
extensions-contrib Remove deprecated Aggregator/AggregatorFactory methods (#3894) 2017-02-01 14:43:18 -08:00
extensions-core auto reset option for Kafka Indexing service (#3842) 2017-02-02 14:57:45 -06:00
hll Extract HLL related code to separate module (#3900) 2017-02-03 09:45:11 -08:00
indexing-hadoop Extract HLL related code to separate module (#3900) 2017-02-03 09:45:11 -08:00
indexing-service Extract HLL related code to separate module (#3900) 2017-02-03 09:45:11 -08:00
integration-tests Druid Extension to enable Authentication using Kerberos. (#3853) 2017-02-02 14:55:21 -06:00
java-util flattenSpec: Document that "expr" is ignored for type "root". (#3884) 2017-01-31 10:27:20 -08:00
processing Add filter selectivity estimation for auto search strategy (#3848) 2017-02-06 11:15:03 -08:00
publications Changes to lambda architecture paper required for HICSS (#3382) 2016-09-06 21:32:21 -07:00
server Introduce SegmentizerFactory (#3901) 2017-02-06 10:05:12 -08:00
services auto reset option for Kafka Indexing service (#3842) 2017-02-02 14:57:45 -06:00
sql Extract HLL related code to separate module (#3900) 2017-02-03 09:45:11 -08:00
.gitignore move distribution artifacts to distribution/target 2015-10-30 12:40:05 -05:00
.travis.yml Enable parallel test (#3774) 2016-12-14 21:05:56 -08:00
CONTRIBUTING.md Add doc link to eclipse formatting settings as well (#3131) 2016-06-24 15:27:50 -07:00
DruidCorporateCLA.pdf fix CLA email / mailing address 2014-04-17 15:26:28 -07:00
DruidIndividualCLA.pdf fix CLA email / mailing address 2014-04-17 15:26:28 -07:00
LICENSE Clean up README and license 2015-02-18 23:09:28 -08:00
NOTICE Migrating extendedset from Metamarkets. (#3694) 2017-01-17 10:10:27 -08:00
README.md update readme (#2830) 2016-04-13 11:33:31 -07:00
druid_intellij_formatting.xml Make formatting IntelliJ 2016 friendly (#2978) 2016-05-18 12:42:21 -07:00
eclipse.importorder Merge pull request #2905 from javasoze/eclipse_formatting 2016-04-29 18:42:03 -07:00
eclipse_formatting.xml Merge pull request #2905 from javasoze/eclipse_formatting 2016-04-29 18:42:03 -07:00
pom.xml Extract HLL related code to separate module (#3900) 2017-02-03 09:45:11 -08:00
upload.sh upload.sh: Use awscli if s3cmd is not available. (#3114) 2016-06-08 17:01:46 -07:00

README.md


Druid

Druid is a distributed, column-oriented, real-time analytics data store that is commonly used to power exploratory dashboards in multi-tenant environments.

Druid excels as a data warehousing solution for fast aggregate queries on petabyte-sized data sets. Druid supports a variety of flexible filters, exact calculations, approximate algorithms, and other useful calculations.
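As a rough illustration of a filtered aggregate query, the sketch below builds a native JSON timeseries query in Python. It combines a selector filter, an exact `longSum` aggregation, and an approximate `hyperUnique` aggregation. The datasource name (`wikipedia`), the dimension (`page`), and the metric names (`count`, `user_unique`) are hypothetical placeholders, and the helper `build_timeseries_query` is not part of Druid. It only assembles the query body and does not send it anywhere.

```python
import json


def build_timeseries_query(datasource, interval, page):
    """Assemble a native Druid timeseries query as a plain dict.

    All column names used here are assumptions for illustration;
    substitute the dimensions and metrics of your own datasource.
    """
    return {
        "queryType": "timeseries",
        "dataSource": datasource,
        "granularity": "hour",
        # ISO-8601 interval, start/end
        "intervals": [interval],
        # Flexible filter: keep only rows whose "page" dimension matches.
        "filter": {"type": "selector", "dimension": "page", "value": page},
        "aggregations": [
            # Exact calculation: sum of a long-typed metric.
            {"type": "longSum", "name": "edits", "fieldName": "count"},
            # Approximate algorithm: HyperLogLog-based distinct count.
            # Assumes "user_unique" was ingested as a hyperUnique metric.
            {"type": "hyperUnique", "name": "unique_users", "fieldName": "user_unique"},
        ],
    }


query = build_timeseries_query("wikipedia", "2017-01-01/2017-01-02", "Druid")
payload = json.dumps(query, indent=2)
print(payload)
```

A query body like this is typically sent as an HTTP POST with content type `application/json` to a broker node's `/druid/v2/` endpoint. The host and port depend on your deployment.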

Druid can load both streaming and batch data and integrates with Samza, Kafka, Storm, Spark, and Hadoop.

License

Apache License, Version 2.0

More Information

More information about Druid can be found at http://www.druid.io.

Documentation

You can find the documentation for the latest Druid release on the project website.

If you would like to contribute documentation, please do so under /docs/content in this repository and submit a pull request.

Getting Started

To get started with Druid, see our quickstart.

Reporting Issues

If you find any bugs, please file a GitHub issue.

Community

Community support is available on the druid-user mailing list (druid-user@googlegroups.com).

Development discussions occur on the druid-development list (druid-development@googlegroups.com).

We also have a couple of people hanging out on IRC in #druid-dev on irc.freenode.net.

Contributing

Please follow the guidelines listed here.