druid

mirror of https://github.com/apache/druid.git synced 2025-03-02 15:29:10 +00:00

Author	SHA1	Message	Date
Jonathan Wei	49a9c3ffb7	Revert "Adjust HadoopIndexTask temp segment renaming to avoid potential race conditions (#11075 )" (#11151 ) This reverts commit a2892d9c40793027ba8a8977c85b3de4a949e11c.	2021-04-22 15:33:27 -07:00
zachjsh	a2892d9c40	Adjust HadoopIndexTask temp segment renaming to avoid potential race conditions (#11075 ) * Do stuff * Do more stuff * * Do more stuff * * Do more stuff * * working * * cleanup * * more cleanup * * more cleanup * * add license header * * Add unit tests * * add java docs * * add more unit tests * * Cleanup test * * Move removing of workingPath to index task rather than in hadoop job. * * Address review comments * * remove unused import * * Address review comments * Do not overwrite segment descriptor for segment if it already exists. * * add comments to FileSystemHelper class * * fix local hadoop integration test	2021-04-21 12:24:31 -07:00
Yi Yuan	d0a94a8c14	add avro stream input format (#11040 ) * add avro stream input format * bug fixed * add document * doc fix * change doc * add integretion test * bug fixed * bug fixed * add string as binary getter Co-authored-by: yuanyi <yuanyi@freewheel.tv>	2021-04-12 21:53:41 -07:00
Jihoon Son	a6a2758095	More unit tests for JsonParserIterator; Integration tests for query errors (#11091 ) * unit tests for timeout exception in init * integration tests * run integraion test on travis * fix inspection	2021-04-12 15:08:50 -07:00
Jonathan Wei	e7b2ecd0fd	Add retry around query loop in ITWikipediaQueryTest.testQueryLaningLaneIsLimited (#11077 )	2021-04-09 10:54:34 -07:00
Maytas Monsereenusorn	4576152e4a	Make dropExisting flag for Compaction configurable and add warning documentations (#11070 ) * Make dropExisting flag for Compaction configurable * fix checkstyle * fix checkstyle * fix test * add tests * fix spelling * fix docs * add IT * fix test * fix doc * fix doc	2021-04-09 00:12:28 -07:00
Lucas Capistrant	8264203cee	Allow client to configure batch ingestion task to wait to complete until segments are confirmed to be available by other (#10676 ) * Add ability to wait for segment availability for batch jobs * IT updates * fix queries in legacy hadoop IT * Fix broken indexing integration tests * address an lgtm flag * spell checker still flagging for hadoop doc. adding under that file header too * fix compaction IT * Updates to wait for availability method * improve unit testing for patch * fix bad indentation * refactor waitForSegmentAvailability * Fixes based off of review comments * cleanup to get compile after merging with master * fix failing test after previous logic update * add back code that must have gotten deleted during conflict resolution * update some logging code * fixes to get compilation working after merge with master * reset interrupt flag in catch block after code review pointed it out * small changes following self-review * fixup some issues brought on by merge with master * small changes after review * cleanup a little bit after merge with master * Fix potential resource leak in AbstractBatchIndexTask * syntax fix * Add a Compcation TuningConfig type * add docs stipulating the lack of support by Compaction tasks for the new config * Fixup compilation errors after merge with master * Remove erreneous newline	2021-04-08 21:03:00 -07:00
zhangyue19921010	de691808ce	[Bug]Kinesis-data-format IT can not work (#11071 ) * start schema-resgity and replace json template * add docs Co-authored-by: yuezhang <yuezhang@freewheel.tv>	2021-04-08 15:50:04 -07:00
Xavier Léauté	15bdd6bc2f	Fix unit tests and GC settings for Java 15 (#11074 ) * JavaScript script engine support was removed in JDK 15: skip those tests for JDKs without it * Fix flaky HTTP client tests with Java 15 * Switch from CMS to G1GC in integration tests, since CMS is no longer available in JDK 15	2021-04-08 10:33:37 -07:00
Yi Yuan	053af6815d	bug fixed (#11066 ) Co-authored-by: yuanyi <yuanyi@freewheel.tv>	2021-04-06 10:39:06 +08:00
Maytas Monsereenusorn	d7f5293364	Add an option for ingestion task to drop (mark unused) all existing segments that are contained by interval in the ingestionSpec (#11025 ) * Auto-Compaction can run indefinitely when segmentGranularity is changed from coarser to finer. * Add option to drop segments after ingestion * fix checkstyle * add tests * add tests * add tests * fix test * add tests * fix checkstyle * fix checkstyle * add docs * fix docs * address comments * address comments * fix spelling	2021-04-01 12:29:36 -07:00
Lasse Krogh Mammen	782a1d4e6c	Add Calcite Avatica protobuf handler (#10543 )	2021-03-31 12:46:25 -07:00
Himadri Singh	74ae2eb71a	Fix Integration Tests (#11046 )	2021-03-30 01:03:49 +05:30
Clint Wylie	f160548231	maybe fix leadership integration test flakes (#11031 )	2021-03-26 03:43:06 -07:00
Gian Merlino	bf20f9e979	DruidInputSource: Fix issues in column projection, timestamp handling. (#10267 ) * DruidInputSource: Fix issues in column projection, timestamp handling. DruidInputSource, DruidSegmentReader changes: 1) Remove "dimensions" and "metrics". They are not necessary, because we can compute which columns we need to read based on what is going to be used by the timestamp, transform, dimensions, and metrics. 2) Start using ColumnsFilter (see below) to decide which columns we need to read. 3) Actually respect the "timestampSpec". Previously, it was ignored, and the timestamp of the returned InputRows was set to the `__time` column of the input datasource. (1) and (2) together fix a bug in which the DruidInputSource would not properly read columns that are used as inputs to a transformSpec. (3) fixes a bug where the timestampSpec would be ignored if you attempted to set the column to something other than `__time`. (1) and (3) are breaking changes. Web console changes: 1) Remove "Dimensions" and "Metrics" from the Druid input source. 2) Set timestampSpec to `{"column": "__time", "format": "millis"}` for compatibility with the new behavior. Other changes: 1) Add ColumnsFilter, a new class that allows input readers to determine which columns they need to read. Currently, it's only used by the DruidInputSource, but it could be used by other columnar input sources in the future. 2) Add a ColumnsFilter to InputRowSchema. 3) Remove the metric names from InputRowSchema (they were unused). 4) Add InputRowSchemas.fromDataSchema method that computes the proper ColumnsFilter for given timestamp, dimensions, transform, and metrics. 5) Add "getRequiredColumns" method to TransformSpec to support the above. * Various fixups. * Uncomment incorrectly commented lines. * Move TransformSpecTest to the proper module. * Add druid.indexer.task.ignoreTimestampSpecForDruidInputSource setting. * Fix. * Fix build. * Checkstyle. * Misc fixes. * Fix test. * Move config. * Fix imports. * Fixup. * Fix ShuffleResourceTest. * Add import. * Smarter exclusions. * Fixes based on tests. Also, add TIME_COLUMN constant in the web console. * Adjustments for tests. * Reorder test data. * Update docs. * Update docs to say Druid 0.22.0 instead of 0.21.0. * Fix test. * Fix ITAutoCompactionTest. * Changes from review & from merging.	2021-03-25 10:32:21 -07:00
Maytas Monsereenusorn	c87ac0823f	Auto-compaction with segment granularity retrieve incomplete segments from timeline when interval overlap (#11019 ) * Fix Auto-compaction with segment granularity retrieve incomplete segments from timeline when interval overlap * Fix Auto-compaction with segment granularity retrieve incomplete segments from timeline when interval overlap * Fix Auto-compaction with segment granularity retrieve incomplete segments from timeline when interval overlap * Fix Auto-compaction with segment granularity retrieve incomplete segments from timeline when interval overlap * address comments	2021-03-24 11:37:29 -07:00
Jihoon Son	6aec8f0c1b	allow multiple ldap bootstrap files for integration tests (#11023 )	2021-03-23 13:18:36 -07:00
Maytas Monsereenusorn	51d2c61f1c	Auto-compaction with segment granularity should skip segments that already have the configured segmentGranularity (#11009 ) * Auto-compaction with segment granularity should skip segments that already have the configured segmentGranularity * Auto-compaction with segment granularity should skip segments that already have the configured segmentGranularity * Auto-compaction with segment granularity should skip segments that already have the configured segmentGranularity * address comments * address comments * address comments * address comments * address comments	2021-03-19 17:38:28 -07:00
Maytas Monsereenusorn	f19c2e9ce4	If ingested data has sparse columns, the ingested data with forceGuaranteedRollup=true can result in imperfect rollup and final dimension ordering can be different from dimensionSpec ordering in the ingestionSpec (#10948 ) * add IT * add IT * add the fix * fix checkstyle * fix compile * fix compile * fix test * fix test * address comments	2021-03-18 17:04:28 -07:00
Maytas Monsereenusorn	f37713dc6d	Fix auto compaction with mixed versions in the same time chunk based on new segment granularity (#11000 )	2021-03-16 12:48:19 -07:00
Xavier Léauté	68781a0d20	update testing frameworks for Java 15 support (#10984 ) * update jacoco to 0.8.6 * update easymock to 4.2 * update equalsverifier to 3.5.5 * update mockito to 3.8.0 * update powermock to 2.0.9 * update assertj-core to 3.19.0 * update testng to 7.3.0 - fix DTD url security for testng 7.x - fix backwards incompatibility in testng 7.x	2021-03-12 20:18:13 -08:00
Maytas Monsereenusorn	ed91a2bb38	Fix Kinesis should not increment throwAway count on EOS record (#10976 ) * fix Kinesis increament throwAway on EOS record * fix checkstyle * fix IT * fix test * fix IT * fix IT * fix IT * fix IT	2021-03-11 22:04:58 -08:00
Vyatcheslav Mogilevsky	b0432be07a	Apache archive mirror (#10979 ) * Ability to use mirror of archive.apache.org * Ability to use mirror of archive.apache.org: documentation * Ability to use mirror of archive.apache.org: fix int test Dockerfile: missing COPY instruction	2021-03-11 09:07:51 -08:00
frank chen	b79b7e6dfb	Improve exception handling in IT to reduce excessive stack trace messages (#10955 ) * Suppress logging for some exceptions to reduce excessive stack trace messages Signed-off-by: frank chen <frank.chen021@outlook.com> * log message for channel disconnected exception Signed-off-by: frank chen <frank.chen021@outlook.com>	2021-03-10 21:27:55 -08:00
Clint Wylie	96889cdebc	add avro + kafka + schema registry integration test (#10929 ) * add avro + schema registry integration test * style * retry init * maybe this * oops heh * this will fix it * review stuffs * fix comment	2021-03-08 08:12:12 -08:00
zhangyue19921010	bddacbb1c3	Dynamic auto scale Kafka-Stream ingest tasks (#10524 ) * druid task auto scale based on kafka lag * fix kafkaSupervisorIOConfig and KinesisSupervisorIOConfig * druid task auto scale based on kafka lag * fix kafkaSupervisorIOConfig and KinesisSupervisorIOConfig * test dynamic auto scale done * auto scale tasks tested on prd cluster * auto scale tasks tested on prd cluster * modify code style to solve 29055.10 29055.9 29055.17 29055.18 29055.19 29055.20 * rename test fiel function * change codes and add docs based on capistrant reviewed * midify test docs * modify docs * modify docs * modify docs * merge from master * Extract the autoScale logic out of SeekableStreamSupervisor to minimize putting more stuff inside there && Make autoscaling algorithm configurable and scalable. * fix ci failed * revert msic.xml * add uts to test autoscaler create && scale out/in and kafka ingest with scale enable * add more uts * fix inner class check * add IT for kafka ingestion with autoscaler * add new IT in groups=kafka-index named testKafkaIndexDataWithWithAutoscaler * review change * code review * remove unused imports * fix NLP * fix docs and UTs * revert misc.xml * use jackson to build autoScaleConfig with default values * add uts * use jackson to init AutoScalerConfig in IOConfig instead of Map<> * autoscalerConfig interface and provide a defaultAutoScalerConfig * modify uts * modify docs * fix checkstyle * revert misc.xml * modify uts * reviewed code change * reviewed code change * code reviewed * code review * log changed * do StringUtils.encodeForFormat when create allocationExec * code review && limit taskCountMax to partitionNumbers * modify docs * code review Co-authored-by: yuezhang <yuezhang@freewheel.tv>	2021-03-06 14:36:52 +05:30
Maytas Monsereenusorn	23333914c7	add javadoc and test (#10938 )	2021-03-03 11:34:00 +08:00
Maytas Monsereenusorn	b7b0ee8362	Add query granularity to compaction task (#10900 ) * add query granularity to compaction task * fix checkstyle * fix checkstyle * fix test * fix test * add tests * fix test * fix test * cleanup * rename class * fix test * fix test * add test * fix test	2021-03-02 11:23:52 -08:00
zachjsh	553f5c8570	Ldap integration tests (#10901 ) * Add integration tests for ldap extension * * refactor * * add ldap-security integration test to travis * * fix license error * * Fix failing other integration test * * break up large tests * refactor * address review comments * * fix intellij inspections failure * * remove dead code	2021-02-23 13:29:57 -08:00
Agustin Gonzalez	eabad0fb35	Keep query granularity of compacted segments after compaction (#10856 ) * Keep query granularity of compacted segments after compaction * Protect against null isRollup * Fix bugspot check RC_REF_COMPARISON_BAD_PRACTICE_BOOLEAN & edit an existing comment * Make sure that NONE is also included when comparing for the finer granularity * Update integration test check for segment size due to query granularity propagation affecting size * Minor code cleanup * Added functional test to verify queryGranlarity after compaction * Minor style fix * Update unit tests	2021-02-18 01:35:10 -08:00
Maytas Monsereenusorn	6541178c21	Support segmentGranularity for auto-compaction (#10843 ) * Support segmentGranularity for auto-compaction * Support segmentGranularity for auto-compaction * Support segmentGranularity for auto-compaction * Support segmentGranularity for auto-compaction * resolve conflict * Support segmentGranularity for auto-compaction * Support segmentGranularity for auto-compaction * fix tests * fix more tests * fix checkstyle * add unit tests * fix checkstyle * fix checkstyle * fix checkstyle * add unit tests * add integration tests * fix checkstyle * fix checkstyle * fix failing tests * address comments * address comments * fix tests * fix tests * fix test * fix test * fix test * fix test * fix test * fix test * fix test * fix test	2021-02-12 03:03:20 -08:00
zachjsh	64774037c1	Add config option to specify zk version in integration tests (#10870 ) * Update integration-tests README Updated the integration-tests README file to include instructions for setting the `ZK_VERSION` property which is now required to be set prior to executing any integration test. Also added a note about the importance of setting the test group parameter when running integration tests, even when running single tests. * * revert change made to DOCKER_IP doc * * Add default value for zk version * * update travis config to use new zk.version property when running integration tests * Remove doc about needing to set ZK_VERSION variable when running integration tests	2021-02-11 10:31:49 -08:00
Abhishek Agarwal	9526fd38db	Exclude redundant jars from integration-tests build (#10878 ) * Exclude redundant jars from integration-tests build * changes	2021-02-10 23:53:34 -08:00
Jihoon Son	397e7455ba	Increase heap to 64m for custom node (#10846 )	2021-02-03 16:23:19 -08:00
zhangyue19921010	77946f9264	K8s IT Test enhance (#10785 ) * do build and stop action in IT * change base dir from druidHome to druidHome/integration-tests * add env DRUID_HOME * bug fix * modify stop_sh * ready to test * bug fix * modify dir * tested on dev * modify dir * move DRUID_HOME env * done Co-authored-by: yuezhang <yuezhang@freewheel.tv>	2021-02-01 15:48:42 -08:00
Xavier Léauté	c346ce64b1	move integration tests from ZooKeeper 3.4.x to 3.5.x (#10786 ) * move integration tests from ZooKeeper 3.4.x to 3.5.x * run a subset of our integration tests with ZK 3.4 for backwards compatibility testing. * remove need to build separate docker-base image - use multi-stage build for the base image - use openjdk base image instead of building our own JDK base - workaround Debian not including MySQL by using MariaDB - download mysql connector directly instead of using distro version * fix incorrect openssl command failing on Debian * keep mysql connector version in sync with pom version	2021-01-31 08:35:39 -08:00
Maytas Monsereenusorn	a46d561bd7	Fix byte calculation for maxBytesInMemory to take into account of Sink/Hydrant Object overhead (#10740 ) * Fix byte calculation for maxBytesInMemory to take into account of Sink/Hydrant Object overhead * Fix byte calculation for maxBytesInMemory to take into account of Sink/Hydrant Object overhead * Fix byte calculation for maxBytesInMemory to take into account of Sink/Hydrant Object overhead * Fix byte calculation for maxBytesInMemory to take into account of Sink/Hydrant Object overhead * fix checkstyle * Fix byte calculation for maxBytesInMemory to take into account of Sink/Hydrant Object overhead * Fix byte calculation for maxBytesInMemory to take into account of Sink/Hydrant Object overhead * fix test * fix test * add log * Fix byte calculation for maxBytesInMemory to take into account of Sink/Hydrant Object overhead * address comments * fix checkstyle * fix checkstyle * add config to skip overhead memory calculation * add test for the skipBytesInMemoryOverheadCheck config * add docs * fix checkstyle * fix checkstyle * fix spelling * address comments * fix travis * address comments	2021-01-27 00:34:56 -08:00
Jihoon Son	95065bdf1a	Bump dev version to 0.22.0-SNAPSHOT (#10759 )	2021-01-15 13:16:23 -08:00
Jihoon Son	149306c9db	Tidy up HTTP status codes for query errors (#10746 ) * Tidy up query error codes * fix tests * Restore query exception type in JsonParserIterator * address review comments; add a comment explaining the ugly switch * fix test	2021-01-13 17:20:00 -08:00
zhangyue19921010	d5192640cb	remove extra comma (#10670 ) Co-authored-by: yuezhang <yuezhang@freewheel.tv>	2021-01-08 15:15:08 -08:00
Jonathan Wei	68bb038b31	Multiphase segment merge for IndexMergerV9 (#10689 ) * Multiphase merge for IndexMergerV9 * JSON fix * Cleanup temp files * Docs * Address logging and add IT * Fix spelling and test unloader datasource name	2021-01-05 22:19:09 -08:00
Himanshu	d2e6240cac	k8s-int-test-build: zk-less druid cluster and http based segment/task managment (#10686 ) * zk-less druid cluster in k8s build * attempt to fix build and use http based remote task management * mm/router logs for debugging * add default account k8s role and binding for pod, configMap access * fix issue * change router port to 8088 for common readinessProbe * break build_run_k8s_cluster.sh into separate scripts * revert changes to K8sDruidNodeAnnouncer.java * k8s extension doc update * add license to new file * address review comments * do not try to load lookups at startup to improve cluster startup time	2021-01-05 18:51:47 -08:00
Lucas Capistrant	26b911a384	Make some additions to IT suite to make Hadoop related testing more understandable (#10667 ) * Make some additions to IT suite to make Hadoop related testing more understandable * add start.hadoop.docker to mvn arg tips in doc * fix issues preventing ITIndexHadoopTest from running in local mode	2020-12-28 12:25:47 -06:00
Clint Wylie	74fbdd322d	refactor NodeRole so extensions can participate in disco and announcement (#10700 ) * refactor NodeRole so extensions can participate in disco and announcement * fixes, maybe * retries * javadoc * fix * spelling	2020-12-24 15:29:32 -08:00
Xavier Léauté	b7a16d08a6	Update Apache Kafka to 2.7.0 (#10701 ) - align scala versions to match Kafka	2020-12-22 13:56:00 -08:00
Maytas Monsereenusorn	5bd7924296	Fix kinesis integration test (#10696 ) * fix kinesis IT * fix checkstyle	2020-12-21 12:57:40 -08:00
Clint Wylie	92e5700e1e	fix integration test override config which requires environment variables before calling compose (#10694 )	2020-12-18 17:57:07 -08:00
Maytas Monsereenusorn	6f2ce8f0a5	fix Kinesis It (#10692 )	2020-12-18 13:47:00 -08:00
Clint Wylie	da0eabaa01	integration test for coordinator and overlord leadership client (#10680 ) * integration test for coordinator and overlord leadership, added sys.servers is_leader column * docs * remove not needed * fix comments * fix compile heh * oof * revert unintended * fix tests, split out docker-compose file selection from starting cluster, use docker-compose down to stop cluster * fixes * style * dang * heh * scripts are hard * fix spelling * fix thing that must not matter since was already wrong ip, log when test fails * needs more heap * fix merge * less aggro	2020-12-17 22:50:12 -08:00
zhangyue19921010	1884c35698	Do Integrate test for Druid base on K8s cluster (#10669 ) * add a travls job to do integrate test on K8s * revert build_run_cluster.sh * revert msic * run IT test * ready to test * modify before/after script * done * change mod for script * done * add env DRUID_OPERATOR_VERSION=0.0.3 * change version Co-authored-by: yuezhang <yuezhang@freewheel.tv>	2020-12-16 16:00:42 -08:00

1 2 3 4 5 ...

396 Commits