druid

Apache Druid: a high performance real-time analytics database.

druid

Go to file

Gian Merlino d4cace385f SQL: Allow Scans to be used as outer queries. (#11831 ) * SQL: Allow Scans to be used as outer queries. This has been possible in the native query system for a while, but the capability hasn't yet propagated into the SQL layer. One example of where this is useful is a query like: SELECT * FROM (... LIMIT X) WHERE <filter> Because this expands the kinds of subquery structures the SQL layer will consider, it was also necessary to improve the cost calculations. These changes appear in PartialDruidQuery and DruidOuterQueryRel. The ideas are: - Attach per-column penalties to the output signature of each query, instead of to the initial projection that starts a query. This encourages moving projections into subqueries instead of leaving them on outer queries. - Only attach penalties to projections if there are actually expressions happening. So, now, projections that simply reorder or remove fields are free. - Attach a constant penalty to every outer query. This discourages creating them when they are not needed. The changes are generally beneficial to the test cases we have in CalciteQueryTest. Most plans are unchanged, or are changed in purely cosmetic ways. Two have changed for the better: - testUsingSubqueryWithLimit now returns a constant from the subquery, instead of returning every column. - testJoinOuterGroupByAndSubqueryHasLimit returns a minimal set of columns from the innermost subquery; two unnecessary columns are no longer there. * Fix various DS operator conversions. These were all implemented as direct conversions, which isn't appropriate because they do not actually map onto native functions. These are only usable as post-aggregations. * Test case adjustment.		2021-10-23 17:18:43 -07:00
.github	Lock hadoop dependencies to 2.8.5 (#11583 )	2021-08-12 15:16:47 +05:30
.idea	Use ExecutorService variables to assign ExecutorService Instances (#11373 )	2021-06-25 16:56:34 -07:00
benchmarks	latest datasketches-java and datasketches-memory (#11773 )	2021-10-19 23:42:30 -07:00
cloud	bump version to 0.23.0-SNAPSHOT (#11670 )	2021-09-08 15:56:04 -07:00
codestyle	handle timestamps of complex types when parsing protobuf messages (#11293 )	2021-06-07 15:19:39 +05:30
core	Remove CloseQuietly and migrate its usages to other methods. (#10247 )	2021-10-23 17:03:21 -07:00
dev	chore: fix case of GitHub (#10928 )	2021-05-07 01:15:43 -07:00
distribution	Revert "Missing Loader parameter in generate-binary-license and generate-binary-notice py scripts (#11815 )" (#11832 )	2021-10-23 08:34:26 -07:00
docs	Docs - add description on time origin (#11826 )	2021-10-22 14:57:13 -07:00
examples	Allow spaces in java home. (#11407 )	2021-07-05 18:50:36 +05:30
extendedset	bump version to 0.23.0-SNAPSHOT (#11670 )	2021-09-08 15:56:04 -07:00
extensions-contrib	add output type information to ExpressionPostAggregator (#11818 )	2021-10-22 13:52:51 -07:00
extensions-core	SQL: Allow Scans to be used as outer queries. (#11831 )	2021-10-23 17:18:43 -07:00
helm/druid	remove DEPRECATION part (#11326 )	2021-06-09 15:52:43 +08:00
hll	bump version to 0.23.0-SNAPSHOT (#11670 )	2021-09-08 15:56:04 -07:00
hooks	Add git pre-commit hook to source control (#9554 )	2020-06-05 11:19:42 -10:00
indexing-hadoop	better types (#11713 )	2021-10-19 01:47:25 -07:00
indexing-service	Remove CloseQuietly and migrate its usages to other methods. (#10247 )	2021-10-23 17:03:21 -07:00
integration-tests	Simplify ITHttpInputSourceTest to mitigate flakiness (#11751 )	2021-10-12 11:51:27 -05:00
licenses	Web console: Better hotkeys and library upgrades (#11365 )	2021-06-17 18:24:29 -07:00
processing	Remove CloseQuietly and migrate its usages to other methods. (#10247 )	2021-10-23 17:03:21 -07:00
publications	De-incubation cleanup in code, docs, packaging (#9108 )	2020-01-03 12:33:19 -05:00
server	Remove CloseQuietly and migrate its usages to other methods. (#10247 )	2021-10-23 17:03:21 -07:00
services	Implement configurable internally generated query context (#11429 )	2021-10-06 09:02:41 -07:00
sql	SQL: Allow Scans to be used as outer queries. (#11831 )	2021-10-23 17:18:43 -07:00
web-console	Fix CVE-2021-3749 reported in security vulnerabilities job (#11786 )	2021-10-08 23:02:58 -07:00
website	Docs - update dynamic config provider topic (#11795 )	2021-10-14 17:51:32 -07:00
.asf.yaml	Add .asf.yaml. (#9083 )	2019-12-20 16:45:38 -08:00
.backportrc.json	Add 0.18.0 to .backportrc.json to facilitate backport. (#9661 )	2020-04-11 13:49:04 -07:00
.codecov.yml	Use Codecov (#8388 )	2019-08-28 08:49:30 -07:00
.dockerignore	Add docker container for druid (#6896 )	2019-02-08 12:12:28 +00:00
.gitignore	Web console basic end-to-end-test (#9595 )	2020-04-09 12:38:09 -07:00
.lgtm.yml	Suppress LGTM warnings about stack trace exposure (#9631 )	2020-04-09 17:31:03 -07:00
.travis.yml	Fix the travis build (#11799 )	2021-10-14 16:31:51 +05:30
CONTRIBUTING.md	Fix numbered list formatting in markdown. (#9664 )	2020-04-21 20:18:12 -07:00
LABELS	Add plain text README.txt, use relative link from README.md to build.md (#7611 )	2019-05-09 21:29:26 -07:00
LICENSE	support Aliyun OSS service as deep storage (#9898 )	2020-07-01 22:20:53 -07:00
NOTICE	license.yaml fixes for code introduced related to AWS RDS token based password provider in PR #9518 (#10885 )	2021-03-10 12:59:25 -08:00
README.md	Update link to helm chart quickstart guide (#11801 )	2021-10-19 14:10:40 +05:30
README.template	De-incubation cleanup in code, docs, packaging (#9108 )	2020-01-03 12:33:19 -05:00
check_test_suite.py	suppress false positive cve (#11699 )	2021-09-13 20:45:38 -07:00
check_test_suite_test.py	suppress false positive cve (#11699 )	2021-09-13 20:45:38 -07:00
licenses.yaml	latest datasketches-java and datasketches-memory (#11773 )	2021-10-19 23:42:30 -07:00
owasp-dependency-check-suppressions.xml	suppress hive-storage-api thrift security vulnerability (#11753 )	2021-09-28 23:54:13 -07:00
pom.xml	latest datasketches-java and datasketches-memory (#11773 )	2021-10-19 23:42:30 -07:00
setup-hooks.sh	Add git pre-commit hook to source control (#9554 )	2020-06-05 11:19:42 -10:00
upload.sh	Adding licenses and enable apache-rat-plugin. (#6215 )	2018-09-18 08:39:26 -07:00

README.md

Apache Druid

Druid is a high performance real-time analytics database. Druid's main value add is to reduce time to insight and action.

Druid is designed for workflows where fast queries and ingest really matter. Druid excels at powering UIs, running operational (ad-hoc) queries, or handling high concurrency. Consider Druid as an open source alternative to data warehouses for a variety of use cases. The design documentation explains the key concepts.

Getting started

You can get started with Druid with our local or Docker quickstart.

Druid provides a rich set of APIs (via HTTP and JDBC) for loading, managing, and querying your data. You can also interact with Druid via the built-in console (shown below).

Load data

Load streaming and batch data using a point-and-click wizard to guide you through ingestion setup. Monitor one off tasks and ingestion supervisors.

Manage the cluster

Manage your cluster with ease. Get a view of your datasources, segments, ingestion tasks, and services from one convenient location. All powered by SQL systems tables, allowing you to see the underlying query for each view.

Issue queries

Use the built-in query workbench to prototype DruidSQL and native queries or connect one of the many tools that help you make the most out of Druid.

Documentation

You can find the documentation for the latest Druid release on the project website.

If you would like to contribute documentation, please do so under /docs in this repository and submit a pull request.

Community

Community support is available on the druid-user mailing list, which is hosted at Google Groups.

Development discussions occur on dev@druid.apache.org, which you can subscribe to by emailing dev-subscribe@druid.apache.org.

Chat with Druid committers and users in real-time on the #druid channel in the Apache Slack team. Please use this invitation link to join the ASF Slack, and once joined, go into the #druid channel.

Building from source

Please note that JDK 8 is required to build Druid.

For instructions on building Druid from source, see docs/development/build.md

Contributing

Please follow the community guidelines for contributing.

For instructions on setting up IntelliJ dev/intellij-setup.md

License

Apache License, Version 2.0