druid

Commit Graph

Author	SHA1	Message	Date
Maytas Monsereenusorn	8328d91b30	Add missing integration tests for the compaction by the coordinator (#9644 ) * Add API to trigger a compaction by the coordinator for integration tests * Add missing integration tests for the compaction by the coordinator * address comments	2020-04-15 14:27:33 -07:00
Jihoon Son	b8f7128b2d	Revert "remove ServerDiscoverySelector from DruidLeaderClient (#9481 )" (#9702 ) * Revert "remove ServerDiscoverySelector from DruidLeaderClient (#9481)" This reverts commit `072bbe210f`. * fix build	2020-04-14 20:42:56 -07:00
Will Salisbury	cda9f41e69	s/S3/GCS/g (#9700 ) fix typo [ at least I hope this was a typo… ]	2020-04-14 18:39:54 -07:00
Chi Cao Minh	2262e33316	Fix flaky web console E2E test (#9685 ) web-console/e2e-tests/tutorial-batch.spec.ts would occasionally timeout between the transition from the data loader "configure schema" and "partition" steps due to missing waits when toggling the rollup setting. Also, fix shellcheck warnings for script/druid.	2020-04-14 15:27:16 -07:00
Maytas Monsereenusorn	d930f04e6a	Test file format extensions for inputSource (orc, parquet) (#9632 ) * Test file format extensions for inputSource (orc, parquet) * Test file format extensions for inputSource (orc, parquet) * fix path * resolve merge conflict * fix typo	2020-04-13 13:03:56 -07:00
Jihoon Son	6a52bdc605	Skip license check for dependency reduced pom files (#9687 )	2020-04-11 18:11:53 -07:00
Chi Cao Minh	e6dd6a4119	Skip node dev dependency vulnerability scan (#9684 ) Since they are not production dependencies, security vulnerabilities in the dev dependencies can be ignored.	2020-04-11 14:24:25 -07:00
Abhishek Radhakrishnan	cbbfd63bed	Add 0.18.0 to .backportrc.json to facilitate backport. (#9661 )	2020-04-11 13:49:04 -07:00
Clint Wylie	0ff926b1a1	fix issue with group by limit pushdown for extractionFn, expressions, joins, etc (#9662 ) * fix issue with group by limit pushdown for extractionFn, expressions, joins, etc * remove unused * fix test * revert unintended change * more tests * consider capabilities for StringGroupByColumnSelectorStrategy * fix test * fix and more test * revert because im scared	2020-04-11 01:18:11 -07:00
Jihoon Son	1b60148ec6	Missing license changes for sources in licenses.yaml (#9678 )	2020-04-10 23:06:33 -07:00
Gian Merlino	5249155284	Fix off-by-one in IndexedTableJoinMatcher.getCardinality. (#9674 ) * Fix off-by-one in IndexedTableJoinMatcher.getCardinality. It would report a cardinality that is one lower than the actual cardinality. The missing value is the phantom null that can be generated by outer joins. * Fix tests.	2020-04-10 18:11:05 -07:00
Himanshu	ca369e5768	druid-pac4j: add ability to use custom ssl trust store while talking to auth server (#9637 ) * druid-pac4j: add ability for custom ssl trust store for talking to auth server * fix nimbusds DefaultResourceRetriever name in comment	2020-04-10 18:01:59 -07:00
Suneet Saldanha	332ca19621	Fix potential integer overflow issues (#9609 ) ApproximateHistogram - seems unlikely SegmentAnalyzer - unclear if this is an actual issue GenericIndexedWriter - unclear if this is an actual issue IncrementalIndexRow and OnheapIncrementalIndex are non-issues becaus it's very unlikely for the number of dims to be large enough to hit the overflow condition	2020-04-10 11:47:08 -07:00
Suneet Saldanha	22d3eed80c	Do not use external input in format strings (#9665 ) https://lgtm.com/rules/7900080/	2020-04-10 10:46:04 -07:00
Suneet Saldanha	bd1cff24a2	Remove no-op assert statement in ClientQuerySegmentWalker (#9607 ) * Remove no-op assert statement The assert statement in ClientQuerySegmentWalker will always be true because of the preceeding while loop which has the same condition. This change removes dead code to fix an error reported by LGTM * Suppress lgtm * cleanup whitespace	2020-04-10 10:41:29 -07:00
Suneet Saldanha	642fe83897	Indexing Service validates externally received taskId (#9666 ) Addresses issues flagged by https://lgtm.com/rules/5970070/	2020-04-10 10:36:26 -07:00
Suneet Saldanha	1ced3b33fb	IntelliJ inspections cleanup (#9339 ) * IntelliJ inspections cleanup * Standard Charset object can be used * Redundant Collection.addAll() call * String literal concatenation missing whitespace * Statement with empty body * Redundant Collection operation * StringBuilder can be replaced with String * Type parameter hides visible type * fix warnings in test code * more test fixes * remove string concatenation inspection error * fix extra curly brace * cleanup AzureTestUtils * fix charsets for RangerAdminClient * review comments	2020-04-10 10:04:40 -07:00
Jihoon Son	e157fb089a	Fix wrong cardinality computation in BufferArrayGrouper (#9655 ) * Fix wrong cardinality computation in BufferArrayGrouper * fix javadoc	2020-04-10 09:05:38 -07:00
yuanli	8ccc0b241a	Fix some flaws of KafkaEmitter (#9573 ) * fix flaws of KafkaEmitter * fix flaws of KafkaEmitter * fix flaws of KafkaEmitter * Update extensions-contrib/kafka-emitter/src/main/java/org/apache/druid/emitter/kafka/KafkaEmitter.java Co-Authored-By: Himanshu <g.himanshu@gmail.com> * Update extensions-contrib/kafka-emitter/src/main/java/org/apache/druid/emitter/kafka/KafkaEmitter.java Co-Authored-By: Himanshu <g.himanshu@gmail.com> Co-authored-by: Himanshu <g.himanshu@gmail.com>	2020-04-09 23:31:32 -07:00
Suneet Saldanha	65de636893	Fix potential integer overflow in BufferArrayGrouper (#9605 ) This change fixes a potential integer overflow in BufferArrayGrouper that was flagged by LGTM. It also adds a check that the vectorized arrays are initialized before aggregateVector is called. The changes in HashTableUtils should not have any effect since the numbers being multiplied are small, but the change will remove the warnings from being flagged in LGTM.	2020-04-09 17:46:15 -07:00
Suneet Saldanha	9888268000	Suppress LGTM warnings about stack trace exposure (#9631 ) Since Druid is an open source project, these warnings are not concerning as the information it may potentially leak is already available in the open.	2020-04-09 17:31:03 -07:00
Gian Merlino	75c543b50f	SQL: More straightforward handling of join planning. (#9648 ) * SQL: More straightforward handling of join planning. Two changes that simplify how joins are planned: 1) Stop using JoinProjectTransposeRule as a way of guiding subquery removal. Instead, add logic to DruidJoinRule that identifies removable subqueries and removes them at the point of creating a DruidJoinQueryRel. This approach reduces the size of the planning space and allows the planner to complete quickly. 2) Remove rules that reorder joins. Not because of an impact on the planning time (it seems minimal), but because the decisions that the planner was making in the new tests were sometimes worse than the user-provided order. I think we'll need to go with the user-provided order for now, and revisit reordering when we can add more smarts to the cost estimator. A third change updates numeric ExprEval classes to store their value as a boxed type that corresponds to what it is supposed to be. This is useful because it affects the behavior of "asString", and is included in this patch because it is needed for the new test "testInnerJoinTwoLookupsToTableUsingNumericColumnInReverse". This test relies on CAST('6', 'DOUBLE') stringifying to "6.0" like an actual double would. Fixes #9646. * Fix comments. * Fix tests.	2020-04-09 16:21:43 -07:00
Chi Cao Minh	eb45981b60	Upgrade netty 4 to fix CVE-2020-11612 (#9651 )	2020-04-09 13:26:14 -07:00
Chi Cao Minh	84c1c2505d	Web console basic end-to-end-test (#9595 ) Load data and query (i.e., automate https://druid.apache.org/docs/latest/tutorials/tutorial-batch.html) to have some basic checks ensuring the web console is wired up to druid correctly. The new end-to-end tests (tutorial-batch.spec.ts) are added to `web-console/e2e-tests`. Within that directory: - `components` represent the various tabs of the web console. Currently, abstractions for `load data`, `ingestion`, `datasources`, and `query` are implemented. - `components/load-data/data-connector` contains abstractions for the different data source options available to the data loader's `Connect` step. Currently, only the `Local file` data source connector is implemented. - `components/load-data/config` contains abstractions for the different configuration options available for each step of the data loader flow. Currently, the `Configure Schema`, `Partition`, and `Publish` steps have initial implementation of their configuration options. - `util` contains various helper methods for the tests and does not contain abstractions of the web console. Changes to add the new tests to CI: - `.travis.yml`: New "web console end-to-end tests" job - `web-console/jest.*.js`: Refactor jest configurations to have different flavors for unit tests and for end-to-end tests. In particular, the latter adds a jest setup configuration to wait for the web console to be ready (`web-console/e2e-tests/util/setup.ts`). - `web-console/package.json`: Refactor run scripts to add new script for running end-to-end tests. - `web-console/script/druid`: Utility scripts for building, starting, and stopping druid. Other changes: - `pom.xml`: Refactor various settings disable java static checks and to disable java tests into two new maven profiles. Since the same settings are used in several places (e.g., .travis.yml, Dockerfiles, etc.), having them in maven profiles makes it more maintainable. - `web-console/src/console-application.tsx`: Fix typo ("the the").	2020-04-09 12:38:09 -07:00
bolkedebruin	ab5ac7f890	Document possible vulnerabilities for the druid-ranger-security (#9649 ) * Document possible vulnerabilities for the druid-ranger-security In certain configurations the ranger plugin can expose vulnerabilities due to some of its dependencies having CVEs. * Spelling checker is a bit tight	2020-04-09 10:43:11 -07:00
Jihoon Son	a6790ff22a	More optimize CNF conversion of filters (#9634 ) * More optimize CNF conversion of filters * update license * fix build * checkstyle * remove unnecessary code * split helper * license * checkstyle * add comments on cnf conversion	2020-04-08 21:31:17 -07:00
Abhishek Radhakrishnan	08851c0198	Preserve the null values for numeric type dimensions post-compaction. (#9622 ) * Add selector null check to preserve null values as-is. * Fix typo. * add wrapping dimension selector test. * Address review comments. * nit: replace exception type. * uh, float is indeed NOT a special case.	2020-04-08 18:56:06 -07:00
mcbrewster	6f3d403491	Use auto-form for add an edit lookups (#9587 ) * use auto form * jest -u * fix unreachable statment * complete the owl * jest -u * remove changes to query-view * fix permissions * add test, fix info * add cool highlights * fix formatting * fix capitalization * add optional placeholder * add space	2020-04-08 16:34:59 -07:00
mcbrewster	2b2b9efcd7	add new text to lookup action dialog (#9643 )	2020-04-08 11:30:47 -07:00
Clint Wylie	7bf2dfa3b1	fix flaky jetty test (#9633 )	2020-04-08 10:07:06 -07:00
Maytas Monsereenusorn	b95a1b9878	Fix NPE in RemoteTaskRunner event handler causes JVM shutdown (#9610 ) * Fix NPE in RemoteTaskRunner event handler causes JVM shutdown * address comments * fix compile * fix checkstyle * fix lgtm * fix merge * fix test * fix tests * change scope * address comments * address comments	2020-04-07 14:53:51 -07:00
mcbrewster	6e50d29b4e	fix global filter input (#9567 ) * fix global filter input * remove clear * close global filters after clicking apply * add restFilter	2020-04-07 13:31:19 -07:00
Maytas Monsereenusorn	73a6baaeb6	change hadoop inputSource IT to use parallel batch ingestion (#9616 )	2020-04-07 11:37:37 -07:00
Clint Wylie	d267b1c414	check paths used for shuffle intermediary data manager get and delete (#9630 ) * check paths used for shuffle intermediary data manager get and delete * add test * newline * meh	2020-04-07 09:47:18 -07:00
Aleksei Chumagin	79522f3e25	Integration-tests: typo (#9624 ) * QA-57: change $ to # as comment * QA-57: fix haddop to hadoop	2020-04-06 17:40:05 -07:00
Jihoon Son	82ce60b5c1	Reuse transformer in stream indexing (#9625 ) * Reuse transformer in stream indexing * remove unused method * memoize complied pattern	2020-04-06 16:36:08 -07:00
Suneet Saldanha	7bf1ebb0b8	Add tests for valid and invalid datasource names (#9614 ) * Add tests for valid and invalid datasource names * code review * clean up dependencies	2020-04-06 16:02:50 -07:00
Himanshu	fc2897da1d	pac4j: be noop if a previous authenticator in chain has successfully authenticated (#9620 )	2020-04-06 11:55:55 -07:00
Jihoon Son	40e84a171b	Eliminate common subfilters when converting it to a CNF (#9608 )	2020-04-05 22:29:41 -07:00
bolkedebruin	2d99966933	Add Apache Ranger Authorization (#9579 )	2020-04-04 18:02:24 +02:00
Clint Wylie	4d277dbf99	Fix double count ssl connection metrics (#9594 ) * fix double counted jetty/numOpenConnections metric for ssl connections * tests * more better * style	2020-04-03 23:29:23 -07:00
Chi Cao Minh	b5419962f0	Suppress CVEs for jackson-mapper-asl:1.9.13 (#9604 ) The jackson-mapper-asl:1.9.13 CVEs via curator-x-discovery are all suppressed for now as fixing them requires updating the curator version.	2020-04-03 10:33:52 -07:00
Maytas Monsereenusorn	1852bf33ea	Add Integration Test for functionality of kinesis ingestion (#9576 ) * kinesis IT * Kinesis IT * Kinesis IT * Kinesis IT * Kinesis IT * Kinesis IT * Kinesis IT * Kinesis IT * Kinesis IT * Kinesis IT * Kinesis IT * Kinesis IT * Kinesis IT * Kinesis IT * Kinesis IT * fix kinesis timeout * Kinesis IT * Kinesis IT * fix checkstyle * Kinesis IT * address comments * fix checkstyle	2020-04-03 09:45:22 -07:00
Suneet Saldanha	af3337dac8	DruidInputSource can add new dimensions during re-ingestion (#9590 ) * WIP integration tests * Add integration test for ingestion with transformSpec * WIP almost working tests * Add ignored tests * checkstyle stuff * remove newPage from index task ingestion spec * more test cleanup * still not quite working * Actually disable the tests * working tests * fix codestyle * dont use junit in integration tests * actually fix the bug * fix checkstyle * bring index tests closer to reindex tests	2020-04-02 17:32:31 -07:00
Jonathan Wei	dbaabdd247	Fix for [CVE-2020-1958]: Apache Druid LDAP injection vulnerability (#9600 )	2020-04-01 14:52:01 -07:00
zachjsh	e855c7fe1b	Allow Cloud Deep Storage configs without segment bucket or path specified (#9588 ) * Allow Cloud SegmentKillers to be instantiated without segment bucket or path This change fixes a bug that was introduced that causes ingestion to fail if data is ingested from one of the supported cloud storages (Azure, Google, S3), and the user is using another type of storage for deep storage. In this case the all segment killer implementations are instantiated. A change recently made forced a dependency between the supported cloud storage type SegmentKiller classes and the deep storage configuration for that storage type being set, which forced the deep storage bucket and prefix to be non-null. This caused a NullPointerException to be thrown when instantiating the SegmentKiller classes during ingestion. To fix this issue, the respective deep storage segment configs for the cloud storage types supported in druid are now allowed to have nullable bucket and prefix configurations * * Allow google deep storage bucket to be null	2020-04-01 11:57:32 -07:00
Jihoon Son	0da8ffc3ff	Bump up development version to 0.19.0-SNAPSHOT (#9586 )	2020-03-30 16:24:04 -07:00
Himanshu	839379246a	remove commons-lang3 usage from DoubleMeanAggregatorFactoryTest (#9578 )	2020-03-30 14:31:50 -07:00
Clint Wylie	fa5da6693c	add lane enforcement for joinish queries (#9563 ) * add lane enforcement for joinish queries * oops * style * review stuffs	2020-03-30 11:58:16 -07:00
Chi Cao Minh	c0195a19e4	Fix HDFS input source split (#9574 ) Fixes an issue where splitting an HDFS input source for use in native parallel batch ingestion would cause the subtasks to get a split with an invalid HDFS path.	2020-03-28 15:45:57 -07:00

1 2 3 4 5 ...

10271 Commits All Branches Search

10271 Commits

All Branches