druid

mirror of https://github.com/apache/druid.git synced 2025-02-12 04:55:12 +00:00

somu-imply 9177419628

Unnest functionality for Druid (#13268)

* Moving all unnest cursor code atop refactored code for unnest

* Updating unnest cursor

* Removing dedup and fixing up some null checks

* AllowList changes

* Fixing some NPEs

* Using bitset for allowlist

* Updating the initialization only when cursor is in non-done state

* Updating code to skip rows not in allow list

* Adding a flag for cases when first element is not in allowed list

* Updating for a null in allowList

* Splitting unnest cursor into 2 subclasses

* Intercepting some apis with columnName for new unnested column

* Adding test cases and renaming some stuff

* checkstyle fixes

* Moving to an interface for Unnest

* handling null rows in a dimension

* Updating cursors after comments part-1

* Addressing comments and adding some more tests

* Reverting a change to ScanQueryRunner and improving a comment

* removing an unused function

* Updating cursors after comments part 2

* One last fix for review comments

* Making some functions private, deleting some comments, adding a test for unnest of unnest with allowList

* Adding an exception for a case

* Closure for unnest data source

* Adding some javadocs

* One minor change in makeDimSelector of columnarCursor

* Updating an error message

* Update processing/src/main/java/org/apache/druid/segment/DimensionUnnestCursor.java

Co-authored-by: Abhishek Agarwal <1477457+abhishekagarwal87@users.noreply.github.com>

* Unnesting on virtual columns was missing an object array, adding that to support virtual columns unnesting

* Updating exceptions to use UOE

* Renamed files, added column capability test on adapter, return statement and made unnest datasource not cacheable for the time being

* Handling for null values in dim selector

* Fixing a NPE for null row

* Updating capabilities

* Updating capabilities

Co-authored-by: Abhishek Agarwal <1477457+abhishekagarwal87@users.noreply.github.com>

2022-12-02 18:48:25 -08:00
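
For readers unfamiliar with the term, the following is a minimal sketch of what "unnest with an allow list" means in general: each multi-value row is expanded into one output value per element, and an optional allow list filters which values are kept. It is not Druid's cursor code. The class and method names are hypothetical, and skipping null rows entirely is an assumption of this sketch, not a statement about how the commit above treats them.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Set;

// Illustrative only: a plain-collections sketch, not Druid's unnest cursor.
// All names here are hypothetical.
final class UnnestSketch {

    /**
     * Expands each multi-value row into one output value per element.
     * A null allowList keeps every value; otherwise only listed values are kept.
     * Null rows produce no output (an assumption made for this sketch).
     */
    static List<String> unnest(List<List<String>> rows, Set<String> allowList) {
        List<String> out = new ArrayList<>();
        for (List<String> row : rows) {
            if (row == null) {
                continue; // nothing to expand for a missing row
            }
            for (String value : row) {
                if (allowList == null || allowList.contains(value)) {
                    out.add(value);
                }
            }
        }
        return out;
    }
}
```

With rows `[a, b]`, `[c]`, `[b, d]` and an allow list of `{b, c}`, this sketch returns `b, c, b`: values outside the allow list are skipped rather than emitted.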

.github

Update gha & travis checks (#13412)

2022-12-02 15:06:31 +05:30

.idea

Revert Accidental Change to Druid.xml (#13190)

2022-10-06 14:42:35 -07:00

benchmarks

Prepare master branch for next release, 26.0.0 (#13401)

2022-11-22 15:31:01 +05:30

cloud

Prepare master branch for next release, 26.0.0 (#13401)

2022-11-22 15:31:01 +05:30

codestyle

Always return sketches from DS_HLL, DS_THETA, DS_QUANTILES_SKETCH. (#13247)

2022-11-03 09:43:00 -07:00

core

SQL test framework extensions (#13426)

2022-12-02 09:11:59 -08:00

dev

Make http options the default configurations (#13092)

2022-10-05 05:35:17 +05:30

distribution

Prepare master branch for next release, 26.0.0 (#13401)

2022-11-22 15:31:01 +05:30

docs

Remove limit from timeseries (#13457)

2022-12-02 12:19:59 -08:00

examples

Quieter streaming supervisors. (#13392)

2022-11-20 23:53:17 -08:00

extendedset

Prepare master branch for next release, 26.0.0 (#13401)

2022-11-22 15:31:01 +05:30

extensions-contrib

SQL test framework extensions (#13426)

2022-12-02 09:11:59 -08:00

extensions-core

SQL test framework extensions (#13426)

2022-12-02 09:11:59 -08:00

helm/druid

helm: add Kubernetes discovery support (#13262)

2022-10-28 15:09:48 +05:30

hll

Prepare master branch for next release, 26.0.0 (#13401)

2022-11-22 15:31:01 +05:30

hooks

Git hooks should fail on errors; pass args to git hooks (#12322)

2022-03-10 09:07:50 +09:00

indexing-hadoop

Prepare master branch for next release, 26.0.0 (#13401)

2022-11-22 15:31:01 +05:30

indexing-service

Fix needless task shutdown on leader switch (#13411)

2022-12-01 18:31:08 +05:30

integration-tests

MSQ Reindex IT (#13433)

2022-12-01 12:13:23 +05:30

integration-tests-ex

MSQ Reindex IT (#13433)

2022-12-01 12:13:23 +05:30

licenses

Web console: making the cell filter menu more functional, removing the old query view, and updating d3 (#13169)

2022-10-07 12:44:40 -07:00

processing

Unnest functionality for Druid (#13268)

2022-12-02 18:48:25 -08:00

publications

De-incubation cleanup in code, docs, packaging (#9108)

2020-01-03 12:33:19 -05:00

server

SQL test framework extensions (#13426)

2022-12-02 09:11:59 -08:00

services

Prepare master branch for next release, 26.0.0 (#13401)

2022-11-22 15:31:01 +05:30

sql

SQL test framework extensions (#13426)

2022-12-02 09:11:59 -08:00

web-console

don't render duration if aggregated (#13455)

2022-11-30 19:21:07 -08:00

website

doc: add a basic JDBC tutorial (#13343)

2022-11-30 16:25:35 -08:00

.asf.yaml

Add .asf.yaml. (#9083)

2019-12-20 16:45:38 -08:00

.backportrc.json

Add 0.18.0 to .backportrc.json to facilitate backport. (#9661)

2020-04-11 13:49:04 -07:00

.codecov.yml

Use Codecov (#8388)

2019-08-28 08:49:30 -07:00

.dockerignore

Add docker container for druid (#6896)

2019-02-08 12:12:28 +00:00

.gitignore

Add CTA and fix typo (#13009)

2022-09-06 11:16:50 -07:00

.lgtm.yml

be consistent about referring to the web console by its name (#13118)

2022-09-19 15:02:17 -07:00

.travis.yml

Update gha & travis checks (#13412)

2022-12-02 15:06:31 +05:30

check_test_suite_test.py

ignore licenses changes in check test script (#12964)

2022-08-25 12:31:28 -07:00

check_test_suite.py

Adds license and security vulnerabilities checks for Hadoop3 build (#13270)

2022-11-09 14:50:31 +05:30

CONTRIBUTING.md

Add missing MSQ error code fields to docs (#13308)

2022-11-10 21:03:04 +05:30

it.sh

Building druid-it-tools and running for travis in it.sh (#12957)

2022-08-30 12:48:07 +05:30

LABELS

Add plain text README.txt, use relative link from README.md to build.md (#7611)

2019-05-09 21:29:26 -07:00

LICENSE

support Aliyun OSS service as deep storage (#9898)

2020-07-01 22:20:53 -07:00

licenses.yaml

treat user cancelation seriously (#13376)

2022-11-18 14:04:16 -08:00

NOTICE

Update year in the notice file and the release process instructions (#12622)

2022-08-23 18:17:18 +05:30

owasp-dependency-check-suppressions.xml

Port CVE suppressions from 24.0.1 (#13415)

2022-11-23 11:35:33 +05:30

pom.xml

Prepare master branch for next release, 26.0.0 (#13401)

2022-11-22 15:31:01 +05:30

README.md

be consistent about referring to the web console by its name (#13118)

2022-09-19 15:02:17 -07:00

README.template

De-incubation cleanup in code, docs, packaging (#9108)

2020-01-03 12:33:19 -05:00

upload.sh

Adding licenses and enable apache-rat-plugin. (#6215)

2018-09-18 08:39:26 -07:00

README.md

Apache Druid

Druid is a high-performance, real-time analytics database. Druid's main value add is to reduce time to insight and action.

Druid is designed for workflows where fast queries and ingest really matter. Druid excels at powering UIs, running operational (ad-hoc) queries, or handling high concurrency. Consider Druid as an open source alternative to data warehouses for a variety of use cases. The design documentation explains the key concepts.

Getting started

You can get started with Druid with our local or Docker quickstart.

Druid provides a rich set of APIs (via HTTP and JDBC) for loading, managing, and querying your data. You can also interact with Druid via the built-in web console (shown below).
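
As a rough illustration of the HTTP route, the sketch below builds a Druid SQL request with the JDK 11 `java.net.http` client. It assumes a quickstart-style deployment whose Router listens on `http://localhost:8888` and posts a JSON body of the form `{"query": "..."}` to `/druid/v2/sql`. The class and helper names are hypothetical, and the hand-rolled JSON escaping is only there to keep the example dependency-free.

```java
import java.net.URI;
import java.net.http.HttpClient;
import java.net.http.HttpRequest;
import java.net.http.HttpResponse;

// Hypothetical helper class; requires Java 11+ for java.net.http.
final class DruidSqlHttpSketch {

    // Wraps a SQL string in the JSON body expected by the SQL endpoint.
    static String jsonBody(String sql) {
        return "{\"query\":\"" + escapeJson(sql) + "\"}";
    }

    // Minimal JSON string escaping so the sketch needs no JSON library.
    static String escapeJson(String s) {
        StringBuilder sb = new StringBuilder();
        for (char c : s.toCharArray()) {
            switch (c) {
                case '"':  sb.append("\\\""); break;
                case '\\': sb.append("\\\\"); break;
                case '\n': sb.append("\\n"); break;
                case '\r': sb.append("\\r"); break;
                case '\t': sb.append("\\t"); break;
                default:
                    if (c < 0x20) {
                        sb.append(String.format("\\u%04x", (int) c));
                    } else {
                        sb.append(c);
                    }
            }
        }
        return sb.toString();
    }

    // Builds (but does not send) a POST request to <baseUrl>/druid/v2/sql.
    static HttpRequest buildRequest(String baseUrl, String sql) {
        return HttpRequest.newBuilder(URI.create(baseUrl + "/druid/v2/sql"))
                .header("Content-Type", "application/json")
                .POST(HttpRequest.BodyPublishers.ofString(jsonBody(sql)))
                .build();
    }

    public static void main(String[] args) throws Exception {
        // Assumed address of a locally running Router; adjust for your cluster.
        HttpRequest request = buildRequest("http://localhost:8888", "SELECT 1");
        HttpResponse<String> response = HttpClient.newHttpClient()
                .send(request, HttpResponse.BodyHandlers.ofString());
        System.out.println(response.statusCode() + " " + response.body());
    }
}
```

Running `main` needs a reachable Druid cluster; `buildRequest` on its own has no network side effects, which makes it easy to inspect the URI and body before sending anything.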

Load data

Load streaming and batch data using a point-and-click wizard to guide you through ingestion setup. Monitor one-off tasks and ingestion supervisors.

Manage the cluster

Manage your cluster with ease. Get a view of your datasources, segments, ingestion tasks, and services from one convenient location. Each view is powered by SQL system tables, so you can see the underlying query behind it.
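
To give a sense of what a system-table query looks like, here is a small sketch that summarizes `sys.segments` per datasource over JDBC. It is an example of this kind of query, not the query the web console itself issues. It assumes the Avatica JDBC driver is on the classpath and a Router at `http://localhost:8888`; the class name and helper are hypothetical.

```java
import java.sql.Connection;
import java.sql.DriverManager;
import java.sql.ResultSet;
import java.sql.Statement;

// Hypothetical example class; needs the Avatica JDBC driver at runtime.
final class SystemTablesJdbcSketch {

    // Example query: segment count and total size per datasource.
    static final String SEGMENTS_BY_DATASOURCE =
            "SELECT \"datasource\", COUNT(*) AS num_segments, SUM(\"size\") AS total_bytes "
            + "FROM sys.segments GROUP BY 1 ORDER BY 2 DESC";

    // Builds the Avatica connection URL for a given Router base URL.
    static String avaticaUrl(String routerBaseUrl) {
        return "jdbc:avatica:remote:url=" + routerBaseUrl + "/druid/v2/sql/avatica/";
    }

    public static void main(String[] args) throws Exception {
        // Assumed local Router address; adjust for your cluster.
        String url = avaticaUrl("http://localhost:8888");
        try (Connection connection = DriverManager.getConnection(url);
             Statement statement = connection.createStatement();
             ResultSet rs = statement.executeQuery(SEGMENTS_BY_DATASOURCE)) {
            while (rs.next()) {
                System.out.println(
                        rs.getString(1) + "\t" + rs.getLong(2) + "\t" + rs.getLong(3));
            }
        }
    }
}
```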

Issue queries

Use the built-in query workbench to prototype Druid SQL and native queries, or connect one of the many tools that help you make the most of Druid.

Documentation

See the latest documentation for the current official release. If you need information on a previous release, you can browse the documentation for previous releases.

Make documentation and tutorial updates in /docs using Markdown and contribute them using a pull request.

Community

Visit the official project community page to read about getting involved in contributing to Apache Druid, and how we help one another use and operate Druid.

Druid users can find help in the druid-user mailing list on Google Groups, and have more technical conversations in #troubleshooting on Slack.
Druid development discussions take place in the druid-dev mailing list (dev@druid.apache.org). Subscribe by emailing dev-subscribe@druid.apache.org. For live conversations, join the #dev channel on Slack.

Check out the official community page for details of how to join the community Slack channels.

Find articles written by community members and a calendar of upcoming events on the project site. Contribute your own events and articles by submitting a PR in the apache/druid-website-src repository.

Building from source

Please note that JDK 8 or JDK 11 is required to build Druid.

See the latest build guide for instructions on building Apache Druid from source.

Contributing

Please follow the community guidelines for contributing.

For instructions on setting up IntelliJ, see dev/intellij-setup.md.

License

Apache License, Version 2.0

Languages

Java 62.4%

ReScript 30.7%

TypeScript 3.1%

Euphoria 0.9%

Csound 0.8%

Other 1.9%