druid

Commit Graph

Author	SHA1	Message	Date
Abhishek Radhakrishnan	c077daaade	GHA steps to collect and upload heap dumps to debug UT OOM errors (#17029 ) * Add GHA steps to tar and upload any heap dumps on failure to debug UT OOM issues. * Add jvm options to heap dump OnOutOfMemoryError Co-authored-by: Elliott Freis <108356317+imply-elliott@users.noreply.github.com> --------- Co-authored-by: Elliott Freis <108356317+imply-elliott@users.noreply.github.com>	2024-09-12 09:06:35 -04:00
Rishabh Singh	99313e9996	Revised IT to detect backward incompatible change (#16779 ) Added a new revised IT group BackwardCompatibilityMain. The idea is to catch potential backward compatibility issues that may arise during rolling upgrade. This test group runs a docker-compose cluster with Overlord & Coordinator service on the previous druid version. Following env vars are required in the GHA file .github/workflows/unit-and-integration-tests-unified.yml to run this test DRUID_PREVIOUS_VERSION -> Previous druid version to test backward incompatibility. DRUID_PREVIOUS_VERSION_DOWNLOAD_URL -> URL to fetch the tar.	2024-08-07 11:13:35 +05:30
Clint Wylie	71725b41b5	ignore dependencies for github stale action (#16797 )	2024-07-25 10:32:43 -07:00
Clint Wylie	37a50e6803	Remove index_realtime and index_realtime_appenderator tasks (#16602 ) index_realtime tasks were removed from the documentation in #13107. Even at that time, they weren't really documented per se— just mentioned. They existed solely to support Tranquility, which is an obsolete ingestion method that predates migration of Druid to ASF and is no longer being maintained. Tranquility docs were also de-linked from the sidebars and the other doc pages in #11134. Only a stub remains, so people with links to the page can see that it's no longer recommended. index_realtime_appenderator tasks existed in the code base, but were never documented, nor as far as I am aware were they used for any purpose. This patch removes both task types completely, as well as removes all supporting code that was otherwise unused. It also updates the stub doc for Tranquility to be firmer that it is not compatible. (Previously, the stub doc said it wasn't recommended, and pointed out that it is built against an ancient 0.9.2 version of Druid.) ITUnionQueryTest has been migrated to the new integration tests framework and updated to use Kafka ingestion. Co-authored-by: Gian Merlino <gianmerlino@gmail.com>	2024-06-24 20:13:33 -07:00
Rishabh Singh	a63c12bf34	Upload tasklogs along with service logs on Standard IT failure (#16631 ) * Fix build * Push tasklogs along with service logs * temp changes to run standard its without unit test results * test * minor change * test * test * Update datasource name for ITSystemTableBatchIndexTaskTest * Publish task logs * Revert other changes * update standard-it yaml	2024-06-22 11:45:54 +05:30
Zoltan Haindrich	44ea4e1c51	Fix cds-coordinator-metadata-query-disabled (#16488 ) fixes the issue with the newly enabled `cds-coordinator-metadata-query-disabled` [split](https://github.com/apache/druid/pull/16468) * configures to use `prepopulated-data` environment things to configure `S3` for access * this is needed because these tests use a [dataset which is loaded from s3](https://github.com/apache/druid/blob/master/integration-tests/docker/test-data/cds-coordinator-metadata-query-disabled-sample-data.sql) * also undoes the previous [fix](https://github.com/apache/druid/pull/16469) of setting the aws region explicitly as this is a more complete solution - and configuring `prepopulated-data` also sets the region; so that's not needed anymore	2024-05-22 20:42:11 +02:00
Rishabh Singh	28473e7c4d	Use correct IT group name for the group `cds-coordinator-metadata-query-disabled` in GHA (#16468 ) * Fix build * Use the correct IT test group name in gha * update	2024-05-21 11:30:23 +05:30
Rishabh Singh	e30790e013	Introduce Segment Schema Publishing and Polling for Efficient Datasource Schema Building (#15817 ) Issue: #14989 The initial step in optimizing segment metadata was to centralize the construction of datasource schema in the Coordinator (#14985). Thereafter, we addressed the problem of publishing schema for realtime segments (#15475). Subsequently, our goal is to eliminate the requirement for regularly executing queries to obtain segment schema information. This is the final change which involves publishing segment schema for finalized segments from task and periodically polling them in the Coordinator.	2024-04-24 22:22:53 +05:30
Laksh Singla	cce2d0f127	Upload openrewrite patch via GHA (#16270 ) This patch adds a step to the openrewrite action, such that it uploads the correcting patch, in case it fails.	2024-04-12 15:31:07 +05:30
Zoltan Haindrich	0a42342cef	Update CalciteTest to use junit5 (#16106 ) Update CalciteTest to use junit5 change the way temp dirs are handled * add openrewrite workflow to safeguard upgrade * replace junitparamrunner with standard junit5 parametered tests * update a few rules to junit5 api * lots of boring changes * cleanup QueryLogHook * cleanup * fix compile error: ARRAYS_DATASOURCE * fix test * remove enclosed * empty +TEST:TDigestSketchSqlAggregatorTest,HllSketchSqlAggregatorTest,DoublesSketchSqlAggregatorTest,ThetaSketchSqlAggregatorTest,ArrayOfDoublesSketchSqlAggregatorTest,BloomFilterSqlAggregatorTest,BloomDimFilterSqlTest,CatalogIngestionTest,CatalogQueryTest,FixedBucketsHistogramQuantileSqlAggregatorTest,QuantileSqlAggregatorTest,MSQArraysTest,MSQDataSketchesTest,MSQExportTest,MSQFaultsTest,MSQInsertTest,MSQLoadedSegmentTests,MSQParseExceptionsTest,MSQReplaceTest,MSQSelectTest,InsertLockPreemptedFaultTest,MSQWarningsTest,SqlMSQStatementResourcePostTest,SqlStatementResourceTest,CalciteSelectJoinQueryMSQTest,CalciteSelectQueryMSQTest,CalciteUnionQueryMSQTest,MSQTestBase,VarianceSqlAggregatorTest,SleepSqlTest,SqlRowTransformerTest,DruidAvaticaHandlerTest,DruidStatementTest,BaseCalciteQueryTest,CalciteArraysQueryTest,CalciteCorrelatedQueryTest,CalciteExplainQueryTest,CalciteExportTest,CalciteIngestionDmlTest,CalciteInsertDmlTest,CalciteJoinQueryTest,CalciteLookupFunctionQueryTest,CalciteMultiValueStringQueryTest,CalciteNestedDataQueryTest,CalciteParameterQueryTest,CalciteQueryTest,CalciteReplaceDmlTest,CalciteScanSignatureTest,CalciteSelectQueryTest,CalciteSimpleQueryTest,CalciteSubqueryTest,CalciteSysQueryTest,CalciteTableAppendTest,CalciteTimeBoundaryQueryTest,CalciteUnionQueryTest,CalciteWindowQueryTest,DecoupledPlanningCalciteJoinQueryTest,DecoupledPlanningCalciteQueryTest,DecoupledPlanningCalciteUnionQueryTest,DrillWindowQueryTest,DruidPlannerResourceAnalyzeTest,IngestTableFunctionTest,QueryTestRunner,SqlTestFrameworkConfig,SqlAggregationModuleTest,ExpressionsTest,GreatestExpressionTest,IPv4AddressMatchExpressionTest,IPv4AddressParseExpressionTest,IPv4AddressStringifyExpressionTest,LeastExpressionTest,TimeFormatOperatorConversionTest,CombineAndSimplifyBoundsTest,FiltrationTest,SqlQueryTest,CalcitePlannerModuleTest,CalcitesTest,DruidCalciteSchemaModuleTest,DruidSchemaNoDataInitTest,InformationSchemaTest,NamedDruidSchemaTest,NamedLookupSchemaTest,NamedSystemSchemaTest,RootSchemaProviderTest,SystemSchemaTest,CalciteTestBase,SqlResourceTest * use @Nested * add rule to remove enclosed; upgrade surefire * remove enclosed * cleanup * add comment about surefire exclude	2024-03-19 04:05:12 -07:00
Adarsh Sanjeev	86a24012a6	Add security ITs for sending tasks to overlord (#16131 ) * Add security ITs for sending tasks to overlord * Add security ITs for sending tasks to overlord * Resolve test flakiness	2024-03-18 09:33:40 +05:30
Zoltan Haindrich	60766495aa	Use dorny/paths-filter@v3.0.0 (#16082 )	2024-03-08 13:35:26 +05:30
Abhishek Radhakrishnan	daf03939a9	Upgrade GHA dependencies (#15954 ) * Upgrade actions/checkout from v3 to v4. * Upgrade actions/setup-java from v3 to v4. * Upgrade dorny/paths-filter, actions/cache/restore, actions/stale to v3, v4 and v9 respectively. * Add a GHA label for .github/** and skip UT/IT on .github files. * remove skipping UT/IT on .github/** changes.	2024-03-08 07:54:02 +05:30
Sensor	3acfc95453	Remove helm paths from CodeQL config (#16006 )	2024-02-29 20:02:27 +05:30
Abhishek Agarwal	ddfc31d7ed	Reduce the size of distribution docker image (#15968 ) This PR creates symlinks when there are duplicate jars present in the extension. Docker image includes contrib extensions, too, and the size of the image has bloated up quite a lot of late. This change also fixes "ITNestedQueryPushDownTest integration test"	2024-02-26 21:18:55 +05:30
Zoltan Haindrich	170d37f188	add check to build docker image (#15894 )	2024-02-21 10:53:35 -05:00
Vishesh Garg	6e9eee4c5f	Add failure check (#15873 )	2024-02-09 08:27:10 -08:00
Vishesh Garg	2a250a4e6e	Fix GHA logs dir and make tar and upload conditional on web console test failures (#15810 ) The PR makes 2 changes: Correct the current logs directory tarred in GHA static checks to log Make the steps of logs tar-ing and uploading conditional on web console test failures, which currently happens on any step failure in static checks workflow Sample logs before this change for failed static checks: https://github.com/apache/druid/actions/runs/7719743853/job/21043502498	2024-01-31 15:39:56 +05:30
Zoltan Haindrich	2eba20d724	Fix minor build issues and stabilize intellij-inspections runs (#15747 ) * Possibly stabilize intellij-inspections * remove `integration-tests-ex/cases` from excluded projects from initial build * enable ErrorProne's `CheckedExceptionNotThrown` to get earlier errors than intellij-inspections * fix ddsketch pom.xml * fix spellcheck	2024-01-24 15:17:33 +05:30
Victoria Lim	52313c51ac	docs: Anchor link checker (#15624 ) Co-authored-by: 317brian <53799971+317brian@users.noreply.github.com>	2024-01-08 15:19:05 -08:00
sensor	62964e99b1	optimize CI workflow for doc updates (#15617 ) * optimize CI workflow for doc updates * Update .github/workflows/codeql.yml Co-authored-by: Abhishek Radhakrishnan <abhishek.rb19@gmail.com> * Update .github/workflows/codeql.yml Co-authored-by: Abhishek Radhakrishnan <abhishek.rb19@gmail.com> --------- Co-authored-by: Benedict Jin <asdf2014@apache.org> Co-authored-by: Abhishek Radhakrishnan <abhishek.rb19@gmail.com>	2024-01-05 17:18:38 -08:00
Abhishek Radhakrishnan	050b515355	Upgrade CodeQL from v2 to latest v3. (#15619 )	2024-01-03 11:31:53 -08:00
Abhishek Radhakrishnan	7fa987dae9	Update labeler to v5 that includes fix where bot doesn't remove labels added by maintainers. (#15558 )	2023-12-14 12:10:26 -05:00
Zoltan Haindrich	8bc7a5f3ac	Move codeql-config.yml out of the workflows folder (#15553 ) Move codeql config file out of the workflows folder so github doesn't try to run it and fail the github workflow run every time a branch is updated.	2023-12-13 08:37:01 -08:00
Vishesh Garg	801967b75f	Add test logs zipping and archival steps for failures in Static Checks Github Actions (#15506 ) Add test logs zipping and archival steps for failures in Static Checks Github Actions	2023-12-07 15:34:23 +05:30
Rishabh Singh	d968bb3f43	Rename config for enabling CentralizedDatasourceSchema feature (#15476 ) * Rename property to druid.centralizedDatasourceSchema.enabled * Update config name in docker-compose	2023-12-05 16:57:25 +05:30
Rishabh Singh	8c802e4c9b	Relocating Table Schema Building: Shifting from Brokers to Coordinator for Improved Efficiency (#14985 ) In the current design, brokers query both data nodes and tasks to fetch the schema of the segments they serve. The table schema is then constructed by combining the schemas of all segments within a datasource. However, this approach leads to a high number of segment metadata queries during broker startup, resulting in slow startup times and various issues outlined in the design proposal. To address these challenges, we propose centralizing the table schema management process within the coordinator. This change is the first step in that direction. In the new arrangement, the coordinator will take on the responsibility of querying both data nodes and tasks to fetch segment schema and subsequently building the table schema. Brokers will now simply query the Coordinator to fetch table schema. Importantly, brokers will still retain the capability to build table schemas if the need arises, ensuring both flexibility and resilience.	2023-11-04 19:33:25 +05:30
Xavier Léauté	352702bb25	run some integration tests with Java 21 (#15104 ) * use setup-java everywhere for consistency * add Java 21 to integration test matrix * simplify docker build containers script + add Java 21 * fix for Java versions reporting 21-ea	2023-10-20 11:18:13 +08:00
Tejaswini Bandlamudi	1f39c054a7	Fix GHA workflow bugs (#15209 )	2023-10-19 17:11:36 +05:30
Tejaswini Bandlamudi	0a6f78c0bb	Fix GHA workflow bugs (#15138 )	2023-10-12 21:25:57 +05:30
Xavier Léauté	f9439970c9	run build and unit tests using Java 21 (#15088 ) * run build and unit test using Java 21 * run static checks with Java 21 * use setup-java for unit tests, since Java 21 is not built-in * skip maven cache from setup-java * add comments to explain cache behavior	2023-10-06 12:45:07 -07:00
Tejaswini Bandlamudi	c888ac5d61	fix path of druid service IT logs (#15082 )	2023-10-04 15:38:38 +05:30
Zoltan Haindrich	5f3b310115	Build reliability fixes (#15048 ) * disable parallel builds; enable batch mode to get rid of transfer progress * restore .m2 from setup-java if not found * some change to sql * add ws * fix quote * fix quote * undo querytest change * nullhandling in mvtest * init more * skip commitid plugin * add-back 1.0C to build ; remove redundant skip-s from copy-resources; add comment	2023-09-28 12:27:52 -07:00
Tejaswini Bandlamudi	fa61e654e4	fix uploading IT docker logs to GHA artifacts (#15046 )	2023-09-28 15:25:52 +05:30
Zoltan Haindrich	08cf290da2	Configure caching for static-check actions (#15010 ) * some stuff * some stuff * dont change it.sh * some stuff * updates * add missing * add 1 more * setup-java	2023-09-20 14:11:39 -07:00
Tejaswini Bandlamudi	b7bb5ee1db	Upload docker and druid service logs as artifacts on GitHub Actions IT run failure (#14967 ) With this PR, docker and druid service logs are uploaded as artifacts onto GitHub when an IT job fails so that we can later download them for investigation.	2023-09-13 11:32:04 +05:30
317brian	6b4dda964d	Docusaurus2 upgrade for master (#14411 ) Co-authored-by: Victoria Lim <vtlim@users.noreply.github.com> Co-authored-by: Charles Smith <techdocsmith@gmail.com>	2023-08-16 19:01:21 -07:00
Tejaswini Bandlamudi	a45b25fa1d	Removes support for Hadoop 2 (#14763 ) Removing Hadoop 2 support as discussed in https://lists.apache.org/list?dev@druid.apache.org:lte=1M:hadoop	2023-08-09 17:47:52 +05:30
Abhishek Agarwal	955734ba8d	Fix exempt labels in stale.yml (#14733 )	2023-08-02 17:12:18 +05:30
Sam Rash	0dcb19f7e3	Add Continuous Profiling to Unit Tests (#14506 ) Uses a custom continuous jfr profiler. Modifies the github actions for tests to do profiling only in the case of jdk17, as the profiler requires jdk17+ to use the JFR streaming API plus a few other language features in the code. Continuous Profiling service is provided to the Apache Druid project free of charge by Imply and any committer can request free access to the UI.	2023-07-12 17:50:38 -07:00
Tejaswini Bandlamudi	c3f84f9ea0	Suppress CVEs (#14291 ) Address various CVEs by upgrading dependencies or adding suppression with a justification	2023-07-10 15:19:26 +05:30
Gian Merlino	63ee69b4e8	Claim full support for Java 17. (#14384 ) * Claim full support for Java 17. No production code has changed, except the startup scripts. Changes: 1) Allow Java 17 without DRUID_SKIP_JAVA_CHECK. 2) Include the full list of opens and exports on both Java 11 and 17. 3) Document that Java 17 is both supported and preferred. 4) Switch some tests from Java 11 to 17 to get better coverage on the preferred version. * Doc update. * Update errorprone. * Update docker_build_containers.sh. * Update errorprone in licenses.yaml. * Add some more run-javas. * Additional run-javas. * Update errorprone. * Suppress new errorprone error. * Add exports and opens in ForkingTaskRunner for Java 11+. Test, doc changes. * Additional errorprone updates. * Update for errorprone. * Restore old formatting in LdapCredentialsValidator. * Copy bin/ too. * Fix Java 15, 17 build line in docker_build_containers.sh. * Update busybox image. * One more java command. * Fix interpolation. * IT commandline refinements. * Switch to busybox 1.34.1-glibc. * POM adjustments, build and test one IT on 17. * Additional debugging. * Fix silly thing. * Adjust command line. * Add exports and opens one more place. * Additional harmonization of strong encapsulation parameters.	2023-07-07 12:52:35 -07:00
Tejaswini Bandlamudi	c04a36d15b	Run IntelliJ-inspections in parallel to static-checks & web-checks in GHA (#14515 ) Currently, IntelliJ-inspections are run sequentially w.r.t static-checks, thereby increasing build time. Moving IntelliJ-inspections to a separate job to improve builds time and get a quick insight into such issues early on.	2023-07-03 17:10:19 +05:30
Abhishek Agarwal	f8f2fe8b7b	Skip tests based on files changed in the PR (#14445 ) Our CI system has a lot of tests. And much of this testing is really unnecessary for most of the PRs. This PR adds some checks so we can skip these expensive tests when we know they are not necessary.	2023-06-22 12:27:23 +05:30
Tejaswini Bandlamudi	8e4f003f02	Fix flaky Revised ITs failures on GHA runners (#14348 ) * Fix read timed out failures and remove containers before test * remove containers before loading images * add labels to IT docker containers, download stable minio docker image release instead of latest	2023-06-05 18:58:54 +05:30
Abhishek Agarwal	b482fda503	Ignore misc.xml (#14362 )	2023-06-02 12:00:52 +05:30
Abhishek Radhakrishnan	5fd3e01ef0	More specific exclusions in the `examples` folder. (#14347 ) This PR changes how we skip java UT and ITs with changes in the examples folder. After this change, any Markdown files within the examples folder and jupyter-notebooks directory will be excluded. The rationale behind these more specific exclusions is that some ITs use json files checked in examples, so we want to trigger the full workflow for all other changes.	2023-05-30 12:01:45 +05:30
Tejaswini Bandlamudi	0e51c2702a	update operations per run (#14325 )	2023-05-29 14:05:11 +05:30
Tejaswini Bandlamudi	36a084e021	Fix GHA workflows naming & Run ITs if UTs fail on coverage (#14158 ) Currently, there is no way to run ITs if unit-tests fail on coverage. This PR allows Revised, Standard ITs to run even when unit-tests fail on coverage errors, still failing the workflow. This PR also fixes existing GHA workflow naming.	2023-05-22 11:44:34 +05:30
Abhishek Radhakrishnan	c546df3866	Add `examples/` to CI UT/IT ignore (#14306 ) * Skip UT/IT on examples only changes.	2023-05-17 17:46:25 -07:00

97 Commits