druid

Commit Graph

Author	SHA1	Message	Date
agricenko	e72f490be0	Integration Tests. Small fixes for CI. (#9988 ) Co-authored-by: agritsenko <agritsenko@provectus.com>	2020-06-04 17:10:56 -07:00
agricenko	56a9cad532	Integration Tests. (#9854 ) * Integration Tests. Added docker-compose with druid-cluster configuration. Refactored shell scripts. split code in a few files * Integration Tests. Added environment variable: DRUID_INTEGRATION_TEST_GROUP * Integration Tests. Removed nit * Integration Tests. Updated if block in docker_run_cluster.sh. * Integration Tests. Readme. Added Docker-compose section. * Integration Tests. removed yml files for s3, gcs, azure. Renamed variables for skip start/stop/build docker. Updated readme. Rollback maven profile: int-tests-config-file * Integration Tests. Removed docker-compose.test-env.yml file. Added DRUID_INTEGRATION_TEST_GROUP variable to docker-compose.yml * Integration Tests. Readme. Added details about docker-compose * Integration Tests. cleanup shell scripts Co-authored-by: agritsenko <agritsenko@provectus.com>	2020-06-02 09:38:53 -07:00
Xavier Léauté	65280a6953	update kafka client version to 2.5.0 (#9902 ) - remove dependency on deprecated internal Kafka classes - keep LZ4 version in line with the version shipped with Kafka	2020-05-27 13:20:32 -07:00
Maytas Monsereenusorn	9db29b93bf	Fix Hadoop IT Legacy test query json was not parameterized (#9901 )	2020-05-20 21:09:17 -07:00
Jihoon Son	c06d3f14b1	Add javadoc for stream ingestion integration tests (#9795 )	2020-05-12 08:56:43 -07:00
Jonathan Wei	61295bd002	More Hadoop integration tests (#9714 ) * More Hadoop integration tests * Add missing s3 instructions * Address PR comments * Address PR comments * PR comments * Fix typo	2020-04-30 14:33:01 -07:00
Jihoon Son	39722bd064	Integration tests for stream ingestion with various data formats (#9783 ) * Integration tests for stream ingestion with various data formats * fix npe * better logging; fix tsv * fix tsv * exclude kinesis from travis * some readme	2020-04-29 13:18:01 -07:00
Maytas Monsereenusorn	6bc64b731f	Improve "waiting for tasks complete" logic in integration tests (#9759 ) * improve waiting for tasks complete logic in integration tests * improve waiting for tasks complete logic in integration tests * fix forbidden check	2020-04-29 08:53:45 -07:00
Maytas Monsereenusorn	a107ee3ed2	Fix problem when running single integration test using -Dit.test= (#9778 ) * fix running single it * fix checksyle	2020-04-29 08:53:25 -07:00
Maytas Monsereenusorn	16f5ae4405	Add integration tests for kafka ingestion (#9724 ) * add kafka admin and kafka writer * refactor kinesis IT * fix typo refactor * parallel * parallel * parallel * parallel works now * add kafka it * add doc to readme * fix tests * fix failing test * test * test * test * test * address comments * addressed comments	2020-04-22 10:43:34 -07:00
Maytas Monsereenusorn	cff39892ba	Fixes intermittent failure in ITAutoCompactionTest (#9739 ) * fix intermittent failure in ITAutoCompactionTest * fix typo * update javadoc	2020-04-21 20:56:17 -07:00
Maytas Monsereenusorn	8328d91b30	Add missing integration tests for the compaction by the coordinator (#9644 ) * Add API to trigger a compaction by the coordinator for integration tests * Add missing integration tests for the compaction by the coordinator * address comments	2020-04-15 14:27:33 -07:00
Maytas Monsereenusorn	d930f04e6a	Test file format extensions for inputSource (orc, parquet) (#9632 ) * Test file format extensions for inputSource (orc, parquet) * Test file format extensions for inputSource (orc, parquet) * fix path * resolve merge conflict * fix typo	2020-04-13 13:03:56 -07:00
Suneet Saldanha	1ced3b33fb	IntelliJ inspections cleanup (#9339 ) * IntelliJ inspections cleanup * Standard Charset object can be used * Redundant Collection.addAll() call * String literal concatenation missing whitespace * Statement with empty body * Redundant Collection operation * StringBuilder can be replaced with String * Type parameter hides visible type * fix warnings in test code * more test fixes * remove string concatenation inspection error * fix extra curly brace * cleanup AzureTestUtils * fix charsets for RangerAdminClient * review comments	2020-04-10 10:04:40 -07:00
Maytas Monsereenusorn	73a6baaeb6	change hadoop inputSource IT to use parallel batch ingestion (#9616 )	2020-04-07 11:37:37 -07:00
Clint Wylie	d267b1c414	check paths used for shuffle intermediary data manager get and delete (#9630 ) * check paths used for shuffle intermediary data manager get and delete * add test * newline * meh	2020-04-07 09:47:18 -07:00
Aleksei Chumagin	79522f3e25	Integration-tests: typo (#9624 ) * QA-57: change $ to # as comment * QA-57: fix haddop to hadoop	2020-04-06 17:40:05 -07:00
Clint Wylie	4d277dbf99	Fix double count ssl connection metrics (#9594 ) * fix double counted jetty/numOpenConnections metric for ssl connections * tests * more better * style	2020-04-03 23:29:23 -07:00
Maytas Monsereenusorn	1852bf33ea	Add Integration Test for functionality of kinesis ingestion (#9576 ) * kinesis IT * Kinesis IT * Kinesis IT * Kinesis IT * Kinesis IT * Kinesis IT * Kinesis IT * Kinesis IT * Kinesis IT * Kinesis IT * Kinesis IT * Kinesis IT * Kinesis IT * Kinesis IT * Kinesis IT * fix kinesis timeout * Kinesis IT * Kinesis IT * fix checkstyle * Kinesis IT * address comments * fix checkstyle	2020-04-03 09:45:22 -07:00
Suneet Saldanha	af3337dac8	DruidInputSource can add new dimensions during re-ingestion (#9590 ) * WIP integration tests * Add integration test for ingestion with transformSpec * WIP almost working tests * Add ignored tests * checkstyle stuff * remove newPage from index task ingestion spec * more test cleanup * still not quite working * Actually disable the tests * working tests * fix codestyle * dont use junit in integration tests * actually fix the bug * fix checkstyle * bring index tests closer to reindex tests	2020-04-02 17:32:31 -07:00
Jihoon Son	0da8ffc3ff	Bump up development version to 0.19.0-SNAPSHOT (#9586 )	2020-03-30 16:24:04 -07:00
Suneet Saldanha	e6e2836b0e	Instructions to run integration tests against quickstart (#9560 ) * Instructions to run integration tests against quickstart * Address review comments * actually exclude the test group * Revert "actually exclude the test group" This reverts commit `66f366409e`. * update comment	2020-03-26 13:22:53 -07:00
Suneet Saldanha	55c08e0746	DruidSegmentReader should work if timestamp is specified as a dimension (#9530 ) * DruidSegmentReader should work if timestamp is specified as a dimension * Add integration tests Tests for compaction and re-indexing a datasource with the timestamp column * Instructions to run integration tests against quickstart * address pr	2020-03-25 13:47:34 -07:00
Maytas Monsereenusorn	3f521943fc	S3 ingestion spec should not uses the default credentials provider chain when environment value password provider is misconfigured. (#9552 ) * fix s3 optional cred * S3 ingestion spec uses the default credentials provider chain when environment value password provider is misconfigured. * fix failing test	2020-03-24 15:09:02 -07:00
Clint Wylie	2bc29543e5	modify QueryCapacityExceededException to provide better messaging (#9547 ) * modify QueryCapacityExceededException to provide better messaging * style	2020-03-23 20:05:11 -07:00
Maytas Monsereenusorn	5f127a1829	Add integration tests for HDFS (#9542 ) * HDFS IT * HDFS IT * HDFS IT * fix checkstyle	2020-03-20 15:46:08 -07:00
Gian Merlino	1ef25a438f	Broker: Add ability to inline subqueries. (#9533 ) * Broker: Add ability to inline subqueries. The main changes: - ClientQuerySegmentWalker: Add ability to inline queries. - Query: Add "getSubQueryId" and "withSubQueryId" methods. - QueryMetrics: Add "subQueryId" dimension. - ServerConfig: Add new "maxSubqueryRows" parameter, which is used by ClientQuerySegmentWalker to limit how many rows can be inlined per query. - IndexedTableJoinMatcher: Allow creating keys on top of unknown types, by assuming they are strings. This is useful because not all types are known for fields in query results. - InlineDataSource: Store RowSignature rather than component parts. Add more zealous "equals" and "hashCode" methods to ease testing. - Moved QuerySegmentWalker test code from CalciteTests and SpecificSegmentsQueryWalker in druid-sql to QueryStackTests in druid-server. Use this to spin up a new ClientQuerySegmentWalkerTest. * Adjustments from CI. * Fix integration test.	2020-03-18 15:06:45 -07:00
Maytas Monsereenusorn	4c620b8f1c	Adding s3, gcs, azure integration tests (#9501 ) * exclude pulling s3 segments for tests that doesnt need it * fix script * fix script * fix script * add s3 test * refactor sample data script * add tests * add tests * add license header * fix failing tests * change bucket and path to config * update integration test readme * fix typo	2020-03-17 03:08:44 -07:00
Maytas Monsereenusorn	09600db8f2	Add the option to start Hadoop docker container when running integration tests (#9513 ) * hadoop docker it * hadoop docker container it * fix hadoop container	2020-03-16 12:04:05 -07:00
Clint Wylie	69af760a19	add manual laning strategy, integration test (#9492 ) * add manual laning strategy, integration test, json config test * share percent conversion method * wrong assert * review stuffs * doc adjustments * more tests * test adjustment * adjust docs * Update index.md	2020-03-13 20:06:55 -07:00
Gian Merlino	2ef5c17441	Link up row-based datasources to serving layer. (#9503 ) * Link up row-based datasources to serving layer. - Add SegmentWrangler interface that allows linking of DataSources to Segments. - Add LocalQuerySegmentWalker that uses SegmentWranglers to compute queries on data that is available locally. - Modify ClientQuerySegmentWalker to use LocalQuerySegmentWalker when the base datasource is concrete and not a table. - Add SegmentWranglerModule to the Broker so it has them available and can properly instantiate . LocalQuerySegmentWalkers. - Set InlineDataSource and LookupDataSource to concrete, since they can be directly queried now. * Fix tests.	2020-03-11 11:32:27 -07:00
Jihoon Son	7401bb3f93	Improve OvershadowableManager performance (#9441 ) * Use the iterator instead of higherKey(); use the iterator API instead of stream * Fix tests; fix a concurrency bug in timeline * fix test * add tests for findNonOvershadowedObjectsInInterval * fix test * add missing tests; fix a bug in QueueEntry * equals tests * fix test	2020-03-10 13:22:19 -07:00
Maytas Monsereenusorn	2db20afbb7	Integration test cluster supports override config (#9473 ) * integration test refactor * integration test refactor * refactor integration test * refactor integration test * refactor integration test * refactor integration test * refactor integration test * refactor integration test * refactor integration test * refactor integration test * address comments	2020-03-09 21:17:49 -07:00
Francesco Nidito	14accb50ad	Improves on the fix for 8918 (#9387 ) * Improves on the fix for 8918 * factorize constants for ITRetryUtil.retryUntil call * increasing retries and sleep in HttpUtil to cope with 401s in testing * adding retries in EventReceiverFirehoseTestClient * adding missing space	2020-02-25 15:50:27 -08:00
Jihoon Son	3bc7ae782c	Create splits of multiple files for parallel indexing (#9360 ) * Create splits of multiple files for parallel indexing * fix wrong import and npe in test * use the single file split in tests * rename * import order * Remove specific local input source * Update docs/ingestion/native-batch.md Co-Authored-By: sthetland <steve.hetland@imply.io> * Update docs/ingestion/native-batch.md Co-Authored-By: sthetland <steve.hetland@imply.io> * doc and error msg * fix build * fix a test and address comments Co-authored-by: sthetland <steve.hetland@imply.io>	2020-02-24 17:34:39 -08:00
sthetland	6d52edddab	Remove references to Docker Machine (#9366 ) * Remove references to Docker Machine Removing a broken link to an obsolete repo. While at it, removing references to Docker Machine, which was obsolete as of Docker v1.12 (avail. 2016). This version introduced Docker as native MacOS and Windows apps. * Update README.md Wording nit.	2020-02-15 03:08:43 -08:00
Maytas Monsereenusorn	31528bcdaf	Integration tests for JDK 11 (#9249 ) * Integration tests for JDK 11 * fix vm option * fix superviosrd * fix pom * add integration tests for java 11 * add logs * update docs * Update dockerfile to ack AdoptOpenJdk for Java 11 install commands	2020-02-12 16:36:31 -08:00
Lucas Capistrant	2e1dbe598c	Create new dynamic config to pause coordinator helpers when needed (#9224 ) * Create new dynamic config to pause coordinator helpers when needed * Fix spelling mistakes flagged in Travis build * Add an integration test for coordinator pause dynamic config * Improve documentation for new dynamic coordinator config and remove un-needed info logs in favor of debug * address naming convention of 'deep store' vs 'deep storage' in new configs doc line * Fix newline at end of configuration index.md * Last try to resolve newline issue in configuration readme * fix spell checks from travis build * Fix another flagges spelling error from Travis	2020-02-05 15:33:42 -08:00
Gian Merlino	204ba9966f	Add LookupJoinableFactory. (#9281 ) * Add LookupJoinableFactory. Enables joins where the right-hand side is a lookup. Includes an integration test. Also, includes changes to LookupExtractorFactoryContainerProvider: 1) Add "getAllLookupNames", which will be needed to eventually connect lookups to Druid's SQL catalog. 2) Convert "get" from nullable to Optional return. 3) Swap out most usages of LookupReferencesManager in favor of the simpler LookupExtractorFactoryContainerProvider interface. * Fixes for tests. * Fix another test. * Java 11 message fix. * Fixups. * Fixup benchmark class.	2020-01-30 14:46:21 -08:00
Maytas Monsereenusorn	b856853f09	Add Datasketch aggregator integration test (#9277 ) * add datasketch integration test * added datasketch integration tests	2020-01-30 13:50:33 -08:00
Roman Leventov	b9186f8f9f	Reconcile terminology and method naming to 'used/unused segments'; Rename MetadataSegmentManager to MetadataSegmentsManager (#7306 ) * Reconcile terminology and method naming to 'used/unused segments'; Don't use terms 'enable/disable data source'; Rename MetadataSegmentManager to MetadataSegments; Make REST API methods which mark segments as used/unused to return server error instead of an empty response in case of error * Fix brace * Import order * Rename withKillDataSourceWhitelist to withSpecificDataSourcesToKill * Fix tests * Fix tests by adding proper methods without interval parameters to IndexerMetadataStorageCoordinator instead of hacking with Intervals.ETERNITY * More aligned names of DruidCoordinatorHelpers, rename several CoordinatorDynamicConfig parameters * Rename ClientCompactTaskQuery to ClientCompactionTaskQuery for consistency with CompactionTask; ClientCompactQueryTuningConfig to ClientCompactionTaskQueryTuningConfig * More variable and method renames * Rename MetadataSegments to SegmentsMetadata * Javadoc update * Simplify SegmentsMetadata.getUnusedSegmentIntervals(), more javadocs * Update Javadoc of VersionedIntervalTimeline.iterateAllObjects() * Reorder imports * Rename SegmentsMetadata.tryMark... methods to mark... and make them to return boolean and the numbers of segments changed and relay exceptions to callers * Complete merge * Add CollectionUtils.newTreeSet(); Refactor DruidCoordinatorRuntimeParams creation in tests * Remove MetadataSegmentManager * Rename millisLagSinceCoordinatorBecomesLeaderBeforeCanMarkAsUnusedOvershadowedSegments to leadingTimeMillisBeforeCanMarkAsUnusedOvershadowedSegments * Fix tests, refactor DruidCluster creation in tests into DruidClusterBuilder * Fix inspections * Fix SQLMetadataSegmentManagerEmptyTest and rename it to SqlSegmentsMetadataEmptyTest * Rename SegmentsAndMetadata to SegmentsAndCommitMetadata to reduce the similarity with SegmentsMetadata; Rename some methods * Rename DruidCoordinatorHelper to CoordinatorDuty, refactor DruidCoordinator * Unused import * Optimize imports * Rename IndexerSQLMetadataStorageCoordinator.getDataSourceMetadata() to retrieveDataSourceMetadata() * Unused import * Update terminology in datasource-view.tsx * Fix label in datasource-view.spec.tsx.snap * Fix lint errors in datasource-view.tsx * Doc improvements * Another attempt to please TSLint * Another attempt to please TSLint * Style fixes * Fix IndexerSQLMetadataStorageCoordinator.createUsedSegmentsSqlQueryForIntervals() (wrong merge) * Try to fix docs build issue * Javadoc and spelling fixes * Rename SegmentsMetadata to SegmentsMetadataManager, address other comments * Address more comments	2020-01-27 11:24:29 -08:00
Gian Merlino	19b427e8f3	Add JoinableFactory interface and use it in the query stack. (#9247 ) * Add JoinableFactory interface and use it in the query stack. Also includes InlineJoinableFactory, which enables joining against inline datasources. This is the first patch where a basic join query actually works. It includes integration tests. * Fix test issues. * Adjustments from code review.	2020-01-24 13:10:01 -08:00
Gian Merlino	f511af1306	Fix DOCKER_HOST_IP handling for multihomed machines. (#9225 ) By picking one. Otherwise, when a machine has multiple IP addresses, DOCKER_HOST_IP would have a newline in the middle, causing havoc in configuration files.	2020-01-21 09:01:19 -08:00
Jonathan Wei	aa539177ec	De-incubation cleanup in code, docs, packaging (#9108 ) * De-incubation cleanup in code, docs, packaging * remove unused docs script	2020-01-03 12:33:19 -05:00
Jonathan Wei	4e8368a5d9	Set version to 0.18.0-SNAPSHOT (#9109 )	2020-01-02 17:55:10 -05:00
Suneet Saldanha	3c13444167	Fix flaky ITBasicAuthConfigurationTest (#9072 ) This test was failing to authenticate using the admin credentials. These should be available by default in the metadata store. This indicates that the credentials are not successfully being syncd before the test is run. This change increases the number of retries to 20 so that the services are syncd before the test runs	2019-12-19 17:38:55 -08:00
Suneet Saldanha	176bc8fd97	Remove resolve-ip dependency for integration-tests (#9065 ) * Remove resolve-ip dependency for integration-tests * use host hostname and fallback to dscacheutil * better shell script comparisons	2019-12-19 14:53:36 -08:00
Jihoon Son	94a23fb17e	Fix flaky realtime index task tests (#8999 ) * Fix flaky realtime index task tests * fix ITAppenderatorDriverRealtimeIndexTaskTest * fix comment * address comments	2019-12-18 13:25:00 -08:00
Jonathan Wei	8af41d7cd0	Update version to 0.18.0-incubating-SNAPSHOT (#9009 )	2019-12-11 14:04:03 -08:00
Chi Cao Minh	bab78fc80e	Parallel indexing single dim partitions (#8925 ) * Parallel indexing single dim partitions Implements single dimension range partitioning for native parallel batch indexing as described in #8769. This initial version requires the druid-datasketches extension to be loaded. The algorithm has 5 phases that are orchestrated by the supervisor in `ParallelIndexSupervisorTask#runRangePartitionMultiPhaseParallel()`. These phases and the main classes involved are described below: 1) In parallel, determine the distribution of dimension values for each input source split. `PartialDimensionDistributionTask` uses `StringSketch` to generate the approximate distribution of dimension values for each input source split. If the rows are ungrouped, `PartialDimensionDistributionTask.UngroupedRowDimensionValueFilter` uses a Bloom filter to skip rows that would be grouped. The final distribution is sent back to the supervisor via `DimensionDistributionReport`. 2) The range partitions are determined. In `ParallelIndexSupervisorTask#determineAllRangePartitions()`, the supervisor uses `StringSketchMerger` to merge the individual `StringSketch`es created in the preceding phase. The merged sketch is then used to create the range partitions. 3) In parallel, generate partial range-partitioned segments. `PartialRangeSegmentGenerateTask` uses the range partitions determined in the preceding phase and `RangePartitionCachingLocalSegmentAllocator` to generate `SingleDimensionShardSpec`s. The partition information is sent back to the supervisor via `GeneratedGenericPartitionsReport`. 4) The partial range segments are grouped. In `ParallelIndexSupervisorTask#groupGenericPartitionLocationsPerPartition()`, the supervisor creates the `PartialGenericSegmentMergeIOConfig`s necessary for the next phase. 5) In parallel, merge partial range-partitioned segments. `PartialGenericSegmentMergeTask` uses `GenericPartitionLocation` to retrieve the partial range-partitioned segments generated earlier and then merges and publishes them. * Fix dependencies & forbidden apis * Fixes for integration test * Address review comments * Fix docs, strict compile, sketch check, rollup check * Fix first shard spec, partition serde, single subtask * Fix first partition check in test * Misc rewording/refactoring to address code review * Fix doc link * Split batch index integration test * Do not run parallel-batch-index twice * Adjust last partition * Split ITParallelIndexTest to reduce runtime * Rename test class * Allow null values in range partitions * Indicate which phase failed * Improve asserts in tests	2019-12-09 23:05:49 -08:00

1 2 3 4 5 ...

299 Commits