druid

Commit Graph

Author	SHA1	Message	Date
Elliott Freis	d93fdb2632	Bump CycloneDX module to address POM errors (#13878 ) * Bump CycloneDX module to address POM errors * Including web-console in the PR --------- Co-authored-by: Elliott Freis <elliottfreis@Elliott-Freis.earth.dynamic.blacklight.net>	2023-03-03 15:39:15 +05:30
Clint Wylie	08b5951cc5	merge druid-core, extendedset, and druid-hll into druid-processing to simplify everything (#13698 ) * merge druid-core, extendedset, and druid-hll into druid-processing to simplify everything * fix poms and license stuff * mockito is evil * allow reset of JvmUtils RuntimeInfo if tests used static injection to override	2023-02-17 14:27:41 -08:00
Adarsh Sanjeev	e8330e95f5	Update Apache Kafka dependencies to 3.4.0 (#13802 ) Release notes: - https://downloads.apache.org/kafka/3.4.0/RELEASE_NOTES.html	2023-02-15 15:15:13 +05:30
Xavier Léauté	698670c88e	update core Apache Kafka dependencies to 3.3.2 (#13717 ) Release notes: - https://downloads.apache.org/kafka/3.3.2/RELEASE_NOTES.html	2023-01-27 21:00:01 -08:00
Dongjoon Hyun	2503095296	Publish SBOM artifacts (#13648 )	2023-01-10 16:08:10 +05:30
abhagraw	74a76c74b1	Updating dependency check version (#13649 )	2023-01-10 14:43:19 +05:30
Kashif Faraz	78ae0b7533	Upgrade to netty 4.1.86.Final to address CVEs (#13604 ) This commit addresses the following CVEs: - CVE-2021-43797 - CVE-2022-41881	2022-12-23 01:44:01 +05:30
Peter Stöckli	df55768535	Add CodeQL workflow (#13477 ) * workflower: Add CodeQL workflow * add modified CodeQL build config	2022-12-21 09:24:39 +05:30
Jason Koch	6c44dd8175	perf: core/TextReader for faster json ingestion (#13545 ) * perf: provide a custom utf8 specific buffered line iterator (benchmark) Benchmark Mode Cnt Score Error Units JsonLineReaderBenchmark.baseline avgt 15 3459.871 ± 106.175 us/op * perf: provide a custom utf8 specific buffered line iterator Benchmark Mode Cnt Score Error Units JsonLineReaderBenchmark.baseline avgt 15 3022.053 ± 51.286 us/op * perf: provide a custom utf8 specific buffered line iterator (more tests) * perf: provide a custom utf8 specific buffered line iterator (pr feedback) Ensure field visibility is as limited as possible Null check for buffer in constructor * perf: provide a custom utf8 specific buffered line iterator (pr feedback) Remove additional 'finished' variable. * perf: provide a custom utf8 specific buffered line iterator (more tests and bugfix)	2022-12-19 23:12:37 -08:00
Kashif Faraz	7cf761cee4	Prepare master branch for next release, 26.0.0 (#13401 ) * Prepare master branch for next release, 26.0.0 * Use docker image for druid 24.0.1 * Fix version in druid-it-cases pom.xml	2022-11-22 15:31:01 +05:30
Paul Rogers	81d005f267	Druid Catalog basics (#13165 ) Druid catalog basics Catalog object model for tables, columns Druid metadata DB storage (as an extension) REST API to update the catalog (as an extension) Integration tests Model only: no planner integration yet	2022-11-12 15:30:22 -08:00
Didip Kerabat	c875f4bd04	Upgrade curator to 5.4.0 (#13302 )	2022-11-03 11:26:19 -07:00
Dr. Sizzles	e5ad24ff9f	Support for middle manager less druid, tasks launch as k8s jobs (#13156 ) * Support for middle manager less druid, tasks launch as k8s jobs * Fixing forking task runner test * Test cleanup, dependency cleanup, intellij inspections cleanup * Changes per PR review Add configuration option to disable http/https proxy for the k8s client Update the docs to provide more detail about sidecar support * Removing un-needed log lines * Small changes per PR review * Upon task completion we callback to the overlord to update the status / location, for slower k8s clusters, this reduces locking time significantly * Merge conflict fix * Fixing tests and docs * update tiny-cluster.yaml changed `enableTaskLevelLogPush` to `encapsulatedTask` * Apply suggestions from code review Co-authored-by: Abhishek Agarwal <1477457+abhishekagarwal87@users.noreply.github.com> * Minor changes per PR request * Cleanup, adding test to AbstractTask * Add comment in peon.sh * Bumping code coverage * More tests to make code coverage happy * Doh a duplicate dependency * Integration test setup is weird for k8s, will do this in a different PR * Reverting back all integration test changes, will do in another PR * use StringUtils.base64 instead of Base64 * Jdk is nasty, if i compress in jdk 11 in jdk 17 the decompressed result is different Co-authored-by: Rahul Gidwani <r_gidwani@apple.com> Co-authored-by: Abhishek Agarwal <1477457+abhishekagarwal87@users.noreply.github.com>	2022-11-02 19:44:47 -07:00
chi-chi weng	72c16097ac	Fix Apache Commons Text CVE-2022-42889 (#13226 ) * Fix Apache Commons Text CVE-2022-42889 Fix Apache Commons Text CVE-2022-42889 https://nvd.nist.gov/vuln/detail/CVE-2022-42889 * Update license Co-authored-by: Frank Chen <frank.chen021@outlook.com>	2022-10-26 10:04:32 +08:00
Frank Chen	d30cf8c308	Dependency cleanup (#13194 ) * Clean up dependency in extensions * Bump protobuf/aws.sdk * Bump aws-sdk to 1.12.317 * Fix CI * Fix CI * Update license * Update license	2022-10-10 20:34:38 +08:00
Xavier Léauté	eff7edb603	update core Apache Kafka dependencies to 3.3.1 (#13176 ) Announcement: - https://blogs.apache.org/kafka/entry/what-rsquo-s-new-in Release notes: - https://archive.apache.org/dist/kafka/3.3.0/RELEASE_NOTES.html - https://downloads.apache.org/kafka/3.3.1/RELEASE_NOTES.html	2022-10-04 12:52:16 -07:00
AmatyaAvadhanula	acafd0d1e0	Upgrade kafka version to 3.2.3 to fix CVE (#13142 ) Upgrade to 3.2.3 to fix CVE: https://nvd.nist.gov/vuln/detail/CVE-2022-34917	2022-09-28 10:47:09 +05:30
Gian Merlino	5733360dfd	Update Snappy to 1.1.8.4. (#13081 ) * Update Snappy to 1.1.8.4. Prior to this, because snappy-java wasn't included in dependencyManagement, we actually shipped multiple different versions for different extensions, ranging from 1.1.7.1 to 1.1.8.4. Now, we standardize on 1.1.8.4. Among other things, this enables the tests to pass on M1 Macs. * Update snappy-java versions in licenses.yaml.	2022-09-14 15:13:47 -07:00
Adam Peck	ee22663dd3	Add interpolation to JsonConfigurator (#13023 ) * Add interpolation to JsonConfigurator * Fix checkstyle * Fix tests by removing common-text override * Add back commons-text without version * Remove unused hadoopDir configs * Move some stuff to hopefully pass coverage	2022-09-07 12:48:01 +05:30
senthilkv	3d9aef225d	compressed big decimal - module (#10705 ) Compressed Big Decimal is an extension which provides support for Mutable big decimal value that can be used to accumulate values without losing precision or reallocating memory. This type helps in absolute precision arithmetic on large numbers in applications, where greater level of accuracy is required, such as financial applications, currency based transactions. This helps avoid rounding issues where in potentially large amount of money can be lost. Accumulation requires that the two numbers have the same scale, but does not require that they are of the same size. If the value being accumulated has a larger underlying array than this value (the result), then the higher order bits are dropped, similar to what happens when adding a long to an int and storing the result in an int. A compressed big decimal that holds its data with an embedded array. Compressed big decimal is an absolute number based complex type based on big decimal in Java. This supports all the functionalities supported by Java Big Decimal. Java Big Decimal is not mutable in order to avoid big garbage collection issues. Compressed big decimal is needed to mutate the value in the accumulator.	2022-09-06 00:06:57 -07:00
Gian Merlino	48ceab2153	Add Java 17 information to documentation. (#12990 ) The docs say Java 17 support is experimental, and give tips on running successfully with Java 17. This patch also removes java.base/jdk.internal.perf and jdk.management/com.sun.management.internal from the list of required exports and opens, because they were formerly needed for JvmMonitor, which was rewritten in #12481 to use MXBeans instead.	2022-08-30 12:32:49 -07:00
Gian Merlino	9eb20e5e7c	Remove dependency on jvm-attach. (#12989 ) This dependency was no longer needed after #12481, but remained because it was used for a (now useless) test. This patch removes the test and the dependency.	2022-08-29 14:18:33 -07:00
Abhishek Agarwal	618757352b	Bump up the version to 25.0.0 (#12975 ) * Bump up the version to 25.0.0 * Fix the version in console	2022-08-29 11:27:38 +05:30
Adam Peck	21b73bde20	Update Curator to 5.3.0 (#12939 ) * Update Curator to 5.3.0 * Update licenses.yaml * Fix inspections + add tests. * Fix checkstyle * Another intellij inspection fix * Update curator exclusions * Cleanup new exhibitor references * Remove unused dep and checkstyle fix	2022-08-26 18:23:40 -07:00
Paul Rogers	cfed036091	Add the new integration test framework (#12368 ) This commit is a first draft of the revised integration test framework which provides: - A new directory, integration-tests-ex that holds the new integration test structure. (For now, the existing integration-tests is left unchanged.) - Maven module druid-it-tools to hold code placed into the Docker image. - Maven module druid-it-image to build the Druid-only test image from the tarball produced in distribution. (Dependencies live in their "official" image.) - Maven module druid-it-cases that holds the revised tests and the framework itself. The framework includes file-based test configuration, test-specific clients, test initialization and updated versions of some of the common test support classes. The integration test setup is primarily a huge mass of details. This approach refactors many of those details: from how the image is built and configured to how the Docker Compose scripts are structured to test configuration. An extensive set of "readme" files explains those details. Rather than repeat that material here, please consult those files for explanations.	2022-08-24 17:03:23 +05:30
Gian Merlino	d7d15ba51f	Add druid-multi-stage-query extension. (#12918 ) * Add druid-multi-stage-query extension. * Adjustments from CI. * Task ID validation. * Various changes from code review. * Remove unnecessary code. * LGTM-related.	2022-08-23 18:44:01 -07:00
Xavier Léauté	752e42a312	fix running integration tests on macos aarch64 (#12913 ) * add osx-aarch_64 netty-transport-native-kqueue native dependency * align docker-java dependency versions using bom and update to 3.2.13	2022-08-17 18:03:24 +02:00
dependabot[bot]	f70f7b4b89	Bump postgresql from 42.3.3 to 42.4.1 (#12871 ) * Bump postgresql from 42.3.3 to 42.4.1 Bumps [postgresql](https://github.com/pgjdbc/pgjdbc) from 42.3.3 to 42.4.1. - [Release notes](https://github.com/pgjdbc/pgjdbc/releases) - [Changelog](https://github.com/pgjdbc/pgjdbc/blob/master/CHANGELOG.md) - [Commits](https://github.com/pgjdbc/pgjdbc/compare/REL42.3.3...REL42.4.1) --- updated-dependencies: - dependency-name: org.postgresql:postgresql dependency-type: direct:production ... Signed-off-by: dependabot[bot] <support@github.com> * update licenses.yaml Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com> Co-authored-by: Xavier Léauté <xvrl@apache.org>	2022-08-16 23:25:39 +02:00
Paul Rogers	4706a4c572	Docker build for the revised ITs (#12707 ) * Docker build for the revised ITs * Fix POM versions * Update comments from review suggestions	2022-08-10 14:17:33 +05:30
Gian Merlino	ef6811ef88	Improved Java 17 support and Java runtime docs. (#12839 ) * Improved Java 17 support and Java runtime docs. 1) Add a "Java runtime" doc page with information about supported Java versions, garbage collection, and strong encapsulation. 2) Update asm and equalsverifier to versions that support Java 17. 3) Add additional "--add-opens" lines to surefire configuration, so tests can pass successfully under Java 17. 4) Switch openjdk15 tests to openjdk17. 5) Update FrameFile to specifically mention Java runtime incompatibility as the cause of not being able to use Memory.map. 6) Update SegmentLoadDropHandler to log an error for Errors too, not just Exceptions. This is important because an IllegalAccessError is encountered when the correct "--add-opens" line is not provided, which would otherwise be silently ignored. 7) Update example configs to use druid.indexer.runner.javaOptsArray instead of druid.indexer.runner.javaOpts. (The latter is deprecated.) * Adjustments. * Use run-java in more places. * Add run-java. * Update .gitignore. * Exclude hadoop-client-api. Brought in when building on Java 17. * Swap one more usage of java. * Fix the run-java script. * Fix flag. * Include link to Temurin. * Spelling. * Update examples/bin/run-java Co-authored-by: Xavier Léauté <xl+github@xvrl.net> Co-authored-by: Xavier Léauté <xl+github@xvrl.net>	2022-08-03 23:16:05 -07:00
Karan Kumar	3290b49754	Log4j bump to 2.18 due to [LOG4J2-3419] (#12847 ) * Log4j bump to 2.18 due to [LOG4J2-3419] * Fixing license issues	2022-08-02 23:25:40 -07:00
PJ Fanning	188b5b0027	Upgrade to jetty 9.4.48.v20220622 due to CVEs (#12801 ) * Upgrade to jetty 9.4.48.v20220622 due to CVEs * Update licenses.yaml	2022-07-26 10:11:48 +08:00
Kashif Faraz	9e5f0109fd	Fix CVE-2022-2048 (jetty) and CVE-2022-31159 (aws-java-sdk-s3) (#12807 ) Changes: - Upgrade aws sdk version from `1.12.37` to `1.12.264` - Upgrade jetty version from `9.4.41.v20210516` to `9.4.47.v20220610`	2022-07-21 13:08:18 +05:30
Paul Rogers	ee15c238cc	Clone Calcite planner to access validator (#12708 ) Done in preparation for the "single-pass" planner.	2022-07-14 18:10:33 -07:00
Gian Merlino	9c925b4f09	Frame format for data transfer and short-term storage. (#12745 ) * Frame format for data transfer and short-term storage. As we move towards query execution plans that involve more transfer of data between servers, it's important to have a data format that provides for doing this more efficiently than the options available to us today. This patch adds: - Columnar frames, which support fast querying. - Row-based frames, which support fast sorting via memory comparison and fast whole-row copies via memory copying. - Frame files, a container format that can be stored on disk or transferred between servers. The idea is we should use row-based frames when data is expected to be sorted, and columnar frames when data is expected to be queried. The code in this patch is not used in production yet. Therefore, the patch involves minimal changes outside of the org.apache.druid.frame package. The main ones are adjustments to SqlBenchmark to add benchmarks for queries on frames, and the addition of a "forEach" method to Sequence. * Fixes based on tests, static analysis. * Additional fixes. * Skip DS mapping tests on JDK 14+ * Better JDK checking in tests. * Fix imports. * Additional comment. * Adjustments from code review. * Update test case.	2022-07-08 20:42:06 -07:00
Jianhuan Liu	4574dea5e9	Use MXBeans to get GC metrics #12476 (#12481 ) * jvm gc to mxbeans * add zgc and shenandoah #12476 * remove tryCreateGcCounter * separate the space collector * blend GcGenerationCollector into GcCollector * add jdk surefire argLine	2022-07-08 14:32:06 +08:00
PJ Fanning	059aba781a	issue-12628: upgrade jetty to 9.4.41.v20210516 due to CVE (#12629 ) * upgrade jetty to 9.4.41.v20210516 due to cve * Update licenses.yaml	2022-07-07 00:20:01 +08:00
imply-cheddar	e3128e3fa3	Poison stupid pool (#12646 ) * Poison StupidPool and fix resource leaks There are various resource leaks from test setup as well as some corners in query processing. We poison the StupidPool to start failing tests when the leaks come and fix any issues uncovered from that so that we can start from a clean baseline. Unfortunately, because of how poisoning works, we can only fail future checkouts from the same pool, which means that there is a natural race between a leak happening -> GC occurs -> leak detected -> pool poisoned. This race means that, depending on interleaving of tests, if the very last time that an object is checked out from the pool leaks, then it won't get caught. At some point in the future, something will catch it, however and from that point on it will be deterministic. * Remove various things left over from iterations * Clean up FilterAnalysis and add javadoc on StupidPool * Revert changes to .idea/misc.xml that accidentally got pushed * Style and test branches * Stylistic woes	2022-07-03 14:36:22 -07:00
Rohan Garg	c09b5a2294	Fix skipTests build flag (#12716 ) * fix skipTests * Skip console UTs with skipTests * Use skipTests in skip-tests profile	2022-06-29 21:59:26 -07:00
Rui Chen	068bea6334	deps: upgrade mysql-connector-java to v5.1.49 (#12704 )	2022-06-29 23:15:46 +08:00
Paul Rogers	f83fab699e	Add IT-related changes pulled out of PR #12368 (#12673 ) This commit contains changes made to the existing ITs to support the new ITs. Changes: - Make the "custom node role" code usable by the new ITs. - Use flag `-DskipITs` to skip the integration tests but run unit tests. - Use flag `-DskipUTs` to skip unit tests but run the "new" integration tests. - Expand the existing Druid profile, `-P skip-tests` to skip both ITs and UTs.	2022-06-26 02:13:59 +05:30
Dr. Sizzles	7291c92f4f	Adding zstandard compression library (#12408 ) * Adding zstandard compression library * 1. Took @clintropolis's advice to have ZStandard decompressor use the byte array when the buffers are not direct. 2. Cleaned up checkstyle issues. * Fixing zstandard version to latest stable version in pom's and updating license files * Removing zstd from benchmarks and adding to processing (poms) * fix the intellij inspection issue * Removing the prefix v for the version in the license check for zstd * Fixing license checks Co-authored-by: Rahul Gidwani <r_gidwani@apple.com>	2022-05-28 17:01:44 -07:00
Abhishek Agarwal	32fe4d1324	Use a different repository to download sigar artifacts. (#12561 )	2022-05-24 14:42:51 +05:30
Clint Wylie	2d8dbb53e0	update to latest lz4 1.8.0 (#12557 )	2022-05-21 16:02:20 +08:00
Xavier Léauté	ec41dfb535	upgrade core Apache Kafka dependencies to 3.2.0 (#12538 ) Announcement: https://blogs.apache.org/kafka/entry/what-s-new-in-apache8 Release notes: https://downloads.apache.org/kafka/3.2.0/RELEASE_NOTES.html	2022-05-19 09:04:52 -07:00
Gian Merlino	4631cff2a9	Free ByteBuffers in tests and fix some bugs. (#12521 ) * Ensure ByteBuffers allocated in tests get freed. Many tests had problems where a direct ByteBuffer would be allocated and then not freed. This is bad because it causes flaky tests. To fix this: 1) Add ByteBufferUtils.allocateDirect(size), which returns a ResourceHolder. This makes it easy to free the direct buffer. Currently, it's only used in tests, because production code seems OK. 2) Update all usages of ByteBuffer.allocateDirect (off-heap) in tests either to ByteBuffer.allocate (on-heap, which are garbage collected), or to ByteBufferUtils.allocateDirect (wherever it seemed like there was a good reason for the buffer to be off-heap). Make sure to close all direct holders when done. * Changes based on CI results. * A different approach. * Roll back BitmapOperationTest stuff. * Try additional surefire memory. * Revert "Roll back BitmapOperationTest stuff." This reverts commit `49f846d9e3`. * Add TestBufferPool. * Revert Xmx change in tests. * Better behaved NestedQueryPushDownTest. Exit tests on OOME. * Fix TestBufferPool. * Remove T1C from ARM tests. * Somewhat safer. * Fix tests. * Fix style stuff. * Additional debugging. * Reset null / expr configs better. * ExpressionLambdaAggregatorFactory thread-safety. * Alter forkNode to try to get better info when a JVM crashes. * Fix buffer retention in ExpressionLambdaAggregatorFactory. * Remove unused import.	2022-05-19 07:42:29 -07:00
Kashif Faraz	7ab2170802	Use datasketches version 3.2.0 (#12509 ) Changes: - Use apache datasketches version 3.2.0. - Remove unsafe reflection-based usage of datasketch internals added in #12022	2022-05-13 11:28:15 +05:30
Abhishek Radhakrishnan	9177515be2	Add IPAddress java library as dependency and migrate IPv4 functions to use the new library. (#11634 ) * Add ipaddress library as dependency. * IPv4 functions to use the inet.ipaddr package. * Remove unused imports. * Add new function. * Minor rename. * Add more unit tests. * IPv4 address expr utils unit tests and address options. * Adjust the IPv4Util functions. * Move the UTs a bit around. * Javadoc comments. * Add license info for IPAddress. * Fix groupId, artifact and version in license.yaml. * Remove redundant subnet in messages - fixes UT. * Remove unused commons-net dependency for /processing project. * Make class and methods public so it can be accessed. * Add initial version of benchmark * Add subnetutils package for benchmarks. * Auto generate ip addresses. * Add more v4 address representations in setup to avoid bias. * Use ThreadLocalRandom to avoid forbidden API usage. * Adjust IPv4AddressBenchmark to adhere to codestyle rules. * Update ipaddress library to latest 5.3.4 * Add ipaddress package dependency to benchmarks project.	2022-05-11 22:06:20 -07:00
aggarwalakshay	dd8781f5b0	Upgrade dependency-check-maven to 7.0.4 (#12441 )	2022-05-01 22:45:58 +08:00
Gian Merlino	72d15ab321	JvmMonitor: Handle more generation and collector scenarios. (#12469 ) * JvmMonitor: Handle more generation and collector scenarios. ZGC on Java 11 only has a generation 1 (there is no 0). This causes a NullPointerException when trying to extract the spacesCount for generation 0. In addition, ZGC on Java 15 has a collector number 2 but no spaces in generation 2, which breaks the assumption that collectors always have same-numbered spaces. This patch adjusts things to be more robust, enabling the JvmMonitor to work properly for ZGC on both Java 11 and 15. * Test adjustments. * Improve surefire arglines. * Need a placeholder	2022-04-27 11:18:40 -07:00


1577 Commits