druid

Commit Graph

Author	SHA1	Message	Date
Abhishek Radhakrishnan	c077daaade	GHA steps to collect and upload heap dumps to debug UT OOM errors (#17029 ) * Add GHA steps to tar and upload any heap dumps on failure to debug UT OOM issues. * Add jvm options to heap dump OnOutOfMemoryError Co-authored-by: Elliott Freis <108356317+imply-elliott@users.noreply.github.com> --------- Co-authored-by: Elliott Freis <108356317+imply-elliott@users.noreply.github.com>	2024-09-12 09:06:35 -04:00
Rishabh Singh	99313e9996	Revised IT to detect backward incompatible change (#16779 ) Added a new revised IT group BackwardCompatibilityMain. The idea is to catch potential backward compatibility issues that may arise during rolling upgrade. This test group runs a docker-compose cluster with Overlord & Coordinator service on the previous druid version. Following env vars are required in the GHA file .github/workflows/unit-and-integration-tests-unified.yml to run this test DRUID_PREVIOUS_VERSION -> Previous druid version to test backward incompatibility. DRUID_PREVIOUS_VERSION_DOWNLOAD_URL -> URL to fetch the tar.	2024-08-07 11:13:35 +05:30
Clint Wylie	71725b41b5	ignore dependencies for github stale action (#16797 )	2024-07-25 10:32:43 -07:00
Clint Wylie	37a50e6803	Remove index_realtime and index_realtime_appenderator tasks (#16602 ) index_realtime tasks were removed from the documentation in #13107. Even at that time, they weren't really documented per se— just mentioned. They existed solely to support Tranquility, which is an obsolete ingestion method that predates migration of Druid to ASF and is no longer being maintained. Tranquility docs were also de-linked from the sidebars and the other doc pages in #11134. Only a stub remains, so people with links to the page can see that it's no longer recommended. index_realtime_appenderator tasks existed in the code base, but were never documented, nor as far as I am aware were they used for any purpose. This patch removes both task types completely, as well as removes all supporting code that was otherwise unused. It also updates the stub doc for Tranquility to be firmer that it is not compatible. (Previously, the stub doc said it wasn't recommended, and pointed out that it is built against an ancient 0.9.2 version of Druid.) ITUnionQueryTest has been migrated to the new integration tests framework and updated to use Kafka ingestion. Co-authored-by: Gian Merlino <gianmerlino@gmail.com>	2024-06-24 20:13:33 -07:00
Rishabh Singh	a63c12bf34	Upload tasklogs along with service logs on Standard IT failure (#16631 ) * Fix build * Push tasklogs alongwith service logs * temp changes to run standard its without unit test results * test * minor change * test * test * Update datasource name for ITSystemTableBatchIndexTaskTest * Publish task logs * Revert other changes * update standard-it yaml	2024-06-22 11:45:54 +05:30
Zoltan Haindrich	44ea4e1c51	Fix cds-coordinator-metadata-query-disabled (#16488 ) fixes the issue with the newly enabled `cds-coordiantor-metadata-query-disabled` [split](https://github.com/apache/druid/pull/16468) * configures to use `prepopulated-data` environment things to configure `S3` for access * this is needed because these tests use a [dataset which is loaded from s3](https://github.com/apache/druid/blob/master/integration-tests/docker/test-data/cds-coordinator-metadata-query-disabled-sample-data.sql) * also undoes the previous [fix](https://github.com/apache/druid/pull/16469) of setting the aws region explicitly as this is a more complete solution - and configuring `prepopulated-data` also sets the region; so that's not needed anymore	2024-05-22 20:42:11 +02:00
Rishabh Singh	28473e7c4d	Use correct IT group name for the group `cds-coordinator-metadata-query-disabled` in GHA (#16468 ) * Fix build * Use the correct IT test group name in gha * update	2024-05-21 11:30:23 +05:30
Kashif Faraz	89ec0da5c5	Disable upload of coverage report to codecov.io (#16347 )	2024-04-29 21:04:55 +05:30
Rishabh Singh	e30790e013	Introduce Segment Schema Publishing and Polling for Efficient Datasource Schema Building (#15817 ) Issue: #14989 The initial step in optimizing segment metadata was to centralize the construction of datasource schema in the Coordinator (#14985). Thereafter, we addressed the problem of publishing schema for realtime segments (#15475). Subsequently, our goal is to eliminate the requirement for regularly executing queries to obtain segment schema information. This is the final change which involves publishing segment schema for finalized segments from task and periodically polling them in the Coordinator.	2024-04-24 22:22:53 +05:30
Laksh Singla	cce2d0f127	Upload openrewrite patch via GHA (#16270 ) This patch adds a step to the openrewrite action, such that it uploads the correcting patch, in case it fails.	2024-04-12 15:31:07 +05:30
Zoltan Haindrich	0a42342cef	Update CalciteTest to use junit5 (#16106 ) Update CalciteTest to use junit5 change the way temp dirs are handled * add openrewrite workflow to safeguard upgrade * replace junitparamrunner with standard junit5 parametered tests * update a few rules to junit5 api * lots of boring changes * cleanup QueryLogHook * cleanup * fix compile error: ARRAYS_DATASOURCE * fix test * remove enclosed * empty +TEST:TDigestSketchSqlAggregatorTest,HllSketchSqlAggregatorTest,DoublesSketchSqlAggregatorTest,ThetaSketchSqlAggregatorTest,ArrayOfDoublesSketchSqlAggregatorTest,BloomFilterSqlAggregatorTest,BloomDimFilterSqlTest,CatalogIngestionTest,CatalogQueryTest,FixedBucketsHistogramQuantileSqlAggregatorTest,QuantileSqlAggregatorTest,MSQArraysTest,MSQDataSketchesTest,MSQExportTest,MSQFaultsTest,MSQInsertTest,MSQLoadedSegmentTests,MSQParseExceptionsTest,MSQReplaceTest,MSQSelectTest,InsertLockPreemptedFaultTest,MSQWarningsTest,SqlMSQStatementResourcePostTest,SqlStatementResourceTest,CalciteSelectJoinQueryMSQTest,CalciteSelectQueryMSQTest,CalciteUnionQueryMSQTest,MSQTestBase,VarianceSqlAggregatorTest,SleepSqlTest,SqlRowTransformerTest,DruidAvaticaHandlerTest,DruidStatementTest,BaseCalciteQueryTest,CalciteArraysQueryTest,CalciteCorrelatedQueryTest,CalciteExplainQueryTest,CalciteExportTest,CalciteIngestionDmlTest,CalciteInsertDmlTest,CalciteJoinQueryTest,CalciteLookupFunctionQueryTest,CalciteMultiValueStringQueryTest,CalciteNestedDataQueryTest,CalciteParameterQueryTest,CalciteQueryTest,CalciteReplaceDmlTest,CalciteScanSignatureTest,CalciteSelectQueryTest,CalciteSimpleQueryTest,CalciteSubqueryTest,CalciteSysQueryTest,CalciteTableAppendTest,CalciteTimeBoundaryQueryTest,CalciteUnionQueryTest,CalciteWindowQueryTest,DecoupledPlanningCalciteJoinQueryTest,DecoupledPlanningCalciteQueryTest,DecoupledPlanningCalciteUnionQueryTest,DrillWindowQueryTest,DruidPlannerResourceAnalyzeTest,IngestTableFunctionTest,QueryTestRunner,SqlTestFrameworkConfig,SqlAggregationModuleTest,ExpressionsTest,GreatestExpressionTest,IPv4AddressMatchExpressionTest,IPv4AddressParseExpressionTest,IPv4AddressStringifyExpressionTest,LeastExpressionTest,TimeFormatOperatorConversionTest,CombineAndSimplifyBoundsTest,FiltrationTest,SqlQueryTest,CalcitePlannerModuleTest,CalcitesTest,DruidCalciteSchemaModuleTest,DruidSchemaNoDataInitTest,InformationSchemaTest,NamedDruidSchemaTest,NamedLookupSchemaTest,NamedSystemSchemaTest,RootSchemaProviderTest,SystemSchemaTest,CalciteTestBase,SqlResourceTest * use @Nested * add rule to remove enclosed; upgrade surefire * remove enclosed * cleanup * add comment about surefire exclude	2024-03-19 04:05:12 -07:00
Adarsh Sanjeev	86a24012a6	Add security ITs for sending tasks to overlord (#16131 ) * Add security ITs for sending tasks to overlord * Add security ITs for sending tasks to overlord * Resolve test flakiness	2024-03-18 09:33:40 +05:30
Zoltan Haindrich	60766495aa	Use dorny/paths-filter@v3.0.0 (#16082 )	2024-03-08 13:35:26 +05:30
Abhishek Radhakrishnan	daf03939a9	Upgrade GHA dependencies (#15954 ) * Upgrade actions/checkout from v3 to v4. * Upgrade actions/setup-java from v3 to v4. * Upgrade dorny/paths-filter, actions/cdache/restore, actions/stale to v3, v4 and v9 respectively. * Add a GHA label for .github/** and skip UT/IT on .github files. * remove skipping UT/IT on .github/** changes.	2024-03-08 07:54:02 +05:30
Sensor	3acfc95453	Remove helm paths from CodeQL config (#16006 )	2024-02-29 20:02:27 +05:30
AlbericByte	f07d402f48	pin Testng dependencies to 7.3.0 (#15924 )	2024-02-28 09:48:13 +08:00
Abhishek Agarwal	ddfc31d7ed	Reduce the size of distribution docker image (#15968 ) This PR creates symlinks when there are duplicate jars present in the extension. Docker image includes contrib extensions, too, and the size of the image has bloated up quite a lot of late. This change also fixes "ITNestedQueryPushDownTest integration test"	2024-02-26 21:18:55 +05:30
Zoltan Haindrich	170d37f188	add check to build docker image (#15894 )	2024-02-21 10:53:35 -05:00
Sam Rash	be0ee2ee33	update version check for profiling to >= 17 (#15686 )	2024-02-14 21:44:20 +05:30
Vishesh Garg	6e9eee4c5f	Add failure check (#15873 )	2024-02-09 08:27:10 -08:00
Vishesh Garg	2a250a4e6e	Fix GHA logs dir and make tar and upload conditional on web console test failures (#15810 ) The PR makes 2 change: Correct the current logs directory tarred in GHA static checks to log Make the steps of logs tar-ing and uploading conditional on web console test failures, which currently happens on any step failure in static checks workflow Sample logs before this change for failed static checks: https://github.com/apache/druid/actions/runs/7719743853/job/21043502498	2024-01-31 15:39:56 +05:30
Zoltan Haindrich	2eba20d724	Fix minor build issues and stabilize intellij-inspections runs (#15747 ) * Possibly stabilize intellij-inspections * remove `integration-tests-ex/cases` from excluded projects from initial build * enable ErrorProne's `CheckedExceptionNotThrown` to get earlier errors than intellij-inspections * fix ddsketch pom.xml * fix spellcheck	2024-01-24 15:17:33 +05:30
Victoria Lim	52313c51ac	docs: Anchor link checker (#15624 ) Co-authored-by: 317brian <53799971+317brian@users.noreply.github.com>	2024-01-08 15:19:05 -08:00
sensor	62964e99b1	optimize CI workflow for doc updates (#15617 ) * optimize CI workflow for doc updates * Update .github/workflows/codeql.yml Co-authored-by: Abhishek Radhakrishnan <abhishek.rb19@gmail.com> * Update .github/workflows/codeql.yml Co-authored-by: Abhishek Radhakrishnan <abhishek.rb19@gmail.com> --------- Co-authored-by: Benedict Jin <asdf2014@apache.org> Co-authored-by: Abhishek Radhakrishnan <abhishek.rb19@gmail.com>	2024-01-05 17:18:38 -08:00
Abhishek Radhakrishnan	050b515355	Upgrade CodeQL from v2 to latest v3. (#15619 )	2024-01-03 11:31:53 -08:00
Abhishek Radhakrishnan	9deeb288c5	Update labeler config per v5 spec. (#15564 )	2023-12-14 14:00:21 -05:00
Abhishek Radhakrishnan	7fa987dae9	Update labeler to v5 that includes fix where bot doesn't remove labels added by maintainers. (#15558 )	2023-12-14 12:10:26 -05:00
Zoltan Haindrich	8bc7a5f3ac	Move codeql-config.yml out of the workflows folder (#15553 ) Move codeql config file out of the workflows folder so github doesn't try to run it and fail the github workflow run every time a branch is updated.	2023-12-13 08:37:01 -08:00
Vishesh Garg	801967b75f	Add test logs zipping and archival steps for failures in Static Checks Github Actions (#15506 ) Add test logs zipping and archival steps for failures in Static Checks Github Actions	2023-12-07 15:34:23 +05:30
Xavier Léauté	ae6893edc3	unpin guava related dependabot dependencies (#15494 ) Several dependabot ignore directives are no longer relevant. Unpin them to ensure we get again get timely updates via dependabot. * support for Hadoop 2 was dropped as part of #14763 * Guava was upgraded to 31 as part of #14767 * Calcite was upgraded to 1.35 as part of #14510	2023-12-05 16:04:39 -08:00
Rishabh Singh	d968bb3f43	Rename config for enabling CentralizedDatasourceSchema feature (#15476 ) * Rename property to druid.centralizedDatasourceSchema.enabled * Update config name in docker-compose	2023-12-05 16:57:25 +05:30
Rishabh Singh	8c802e4c9b	Relocating Table Schema Building: Shifting from Brokers to Coordinator for Improved Efficiency (#14985 ) In the current design, brokers query both data nodes and tasks to fetch the schema of the segments they serve. The table schema is then constructed by combining the schemas of all segments within a datasource. However, this approach leads to a high number of segment metadata queries during broker startup, resulting in slow startup times and various issues outlined in the design proposal. To address these challenges, we propose centralizing the table schema management process within the coordinator. This change is the first step in that direction. In the new arrangement, the coordinator will take on the responsibility of querying both data nodes and tasks to fetch segment schema and subsequently building the table schema. Brokers will now simply query the Coordinator to fetch table schema. Importantly, brokers will still retain the capability to build table schemas if the need arises, ensuring both flexibility and resilience.	2023-11-04 19:33:25 +05:30
Xavier Léauté	352702bb25	run some integration tests with Java 21 (#15104 ) * use setup-java everywhere for consistency * add Java 21 to integration test matrix * simplify docker build containers script + add Java 21 * fix for Java versions reporting 21-ea	2023-10-20 11:18:13 +08:00
Tejaswini Bandlamudi	1f39c054a7	Fix GHA workflow bugs (#15209 )	2023-10-19 17:11:36 +05:30
Tejaswini Bandlamudi	0a6f78c0bb	Fix GHA workflow bugs (#15138 )	2023-10-12 21:25:57 +05:30
Xavier Léauté	f9439970c9	run build and unit tests using Java 21 (#15088 ) * run build and unit test using Java 21 * run static checks with Java 21 * use setup-java for unit tests, since Java 21 is not built-in * skip maven cache from setup-java * add comments to explain cache behavior	2023-10-06 12:45:07 -07:00
Tejaswini Bandlamudi	c888ac5d61	fix path of druid service IT logs (#15082 )	2023-10-04 15:38:38 +05:30
Zoltan Haindrich	5f3b310115	Build reliablity fixes (#15048 ) * disable parallel builds; enable batch mode to get rid of transfer progress * restore .m2 from setup-java if not found * some change to sql * add ws * fix quote * fix quote * undo querytest change * nullhandling in mvtest * init more * skip commitid plugin * add-back 1.0C to build ; remove redundant skip-s from copy-resources; add comment	2023-09-28 12:27:52 -07:00
Tejaswini Bandlamudi	fa61e654e4	fix uploading IT docker logs to GHA artifacts (#15046 )	2023-09-28 15:25:52 +05:30
Zoltan Haindrich	08cf290da2	Configure caching for static-check actions (#15010 ) * some stuff * some stuff * dont change it.sh * some stuff * updates * add missing * add 1 more * setup-java	2023-09-20 14:11:39 -07:00
Laksh Singla	0fc5d5405a	Tweak GHA runner label for MSQ (#14992 )	2023-09-15 05:44:21 +00:00
Tejaswini Bandlamudi	b7bb5ee1db	Upload docker and druid service logs as artifacts on GitHub Actions IT run failure (#14967 ) With this PR, docker and druid service logs are uploaded as artifacts onto GitHub when an IT job fails so that we can later download them for investigation.	2023-09-13 11:32:04 +05:30
Abhishek Radhakrishnan	0f38a37b9d	Tweak GHA runner label. (#14963 ) - processing/** can be ingestion, querying or neither. Removing it for now. - Also, add msq extension for the querying label.	2023-09-11 20:09:26 -07:00
Abhishek Radhakrishnan	f9cf500a69	Extend GHA autolabeler to other areas (#14903 ) * Automate adding labels. * Add metrics/event emitting label * ingestion and segment format	2023-09-07 20:25:37 -07:00
317brian	263ac36e8d	docs: fix autolabeler for jupyter notebooks (#14862 )	2023-08-18 12:42:36 -07:00
317brian	6b4dda964d	Docusaurus2 upgrade for master (#14411 ) Co-authored-by: Victoria Lim <vtlim@users.noreply.github.com> Co-authored-by: Charles Smith <techdocsmith@gmail.com>	2023-08-16 19:01:21 -07:00
Xavier Léauté	50b3d96df5	increase dependabot PR limit for Java dependencies (#14804 ) Many dependabot PRs are currently stuck due to API changes or incompatibilities. Temporarily Increasing the limit so we can get updates for other dependencies.	2023-08-14 19:51:59 -07:00
Xavier Léauté	37ed0f4a17	Bump jclouds.version from 1.9.1 to 2.0.3 (#14746 ) * Updates `org.apache.jclouds:` from 1.9.1 to 2.0.3 Pin jclouds to 2.0.x since 2.1.x requires Guava 18+ * replace easymock with mockito Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2023-08-10 06:24:01 -07:00
Tejaswini Bandlamudi	a45b25fa1d	Removes support for Hadoop 2 (#14763 ) Removing Hadoop 2 support as discussed in https://lists.apache.org/list?dev@druid.apache.org:lte=1M:hadoop	2023-08-09 17:47:52 +05:30
Abhishek Agarwal	955734ba8d	Fix exempt labels in stale.yml (#14733 )	2023-08-02 17:12:18 +05:30

1 2 3

138 Commits