druid

mirror of https://github.com/apache/druid.git synced 2025-02-19 16:37:45 +00:00

Author	SHA1	Message	Date
Gian Merlino	1ef25a438f	Broker: Add ability to inline subqueries. (#9533 ) * Broker: Add ability to inline subqueries. The main changes: - ClientQuerySegmentWalker: Add ability to inline queries. - Query: Add "getSubQueryId" and "withSubQueryId" methods. - QueryMetrics: Add "subQueryId" dimension. - ServerConfig: Add new "maxSubqueryRows" parameter, which is used by ClientQuerySegmentWalker to limit how many rows can be inlined per query. - IndexedTableJoinMatcher: Allow creating keys on top of unknown types, by assuming they are strings. This is useful because not all types are known for fields in query results. - InlineDataSource: Store RowSignature rather than component parts. Add more zealous "equals" and "hashCode" methods to ease testing. - Moved QuerySegmentWalker test code from CalciteTests and SpecificSegmentsQueryWalker in druid-sql to QueryStackTests in druid-server. Use this to spin up a new ClientQuerySegmentWalkerTest. * Adjustments from CI. * Fix integration test.	2020-03-18 15:06:45 -07:00
Maytas Monsereenusorn	4c620b8f1c	Adding s3, gcs, azure integration tests (#9501 ) * exclude pulling s3 segments for tests that doesnt need it * fix script * fix script * fix script * add s3 test * refactor sample data script * add tests * add tests * add license header * fix failing tests * change bucket and path to config * update integration test readme * fix typo	2020-03-17 03:08:44 -07:00
Jonathan Wei	b1847364b0	More efficient join filter rewrites (#9516 ) * More efficient join filter rewrites * Rebase * Remove unused functions * PR comments, fix compile * Adjust comment * Allow filter rewrite when join condition has LHS expression * Fix inspections * Fix tests	2020-03-16 22:16:14 -07:00
Clint Wylie	142742f291	add kinesis lag metric (#9509 ) * add kinesis lag metric * fixes * heh * do it right this time * more test * split out supervisor report lags into lagMillis, remove latest offsets from kinesis supervisor report since always null, review stuffs	2020-03-16 21:39:53 -07:00
Vadim Ogievetsky	7626be26ca	Web console: add config control for the query context (#9499 ) * add default and mandetory query contexts * added config docs	2020-03-16 14:34:19 -07:00
Maytas Monsereenusorn	09600db8f2	Add the option to start Hadoop docker container when running integration tests (#9513 ) * hadoop docker it * hadoop docker container it * fix hadoop container	2020-03-16 12:04:05 -07:00
Chi Cao Minh	e7b3dd9cd1	Update to mysql connector 5.1.48 (#9514 )	2020-03-16 10:38:31 -07:00
Chi Cao Minh	100d587583	Suppress CWE-400 for node-sass:4.13.1 (#9517 ) The vulnerability is fixed in 4.13.1: https://github.com/sass/node-sass/issues/2816#issuecomment-575136455 But the dependency check plugin thinks its still broken as the affected/fixed versions has not been updated yet on Sonatype OSS Index: https://ossindex.sonatype.org/vuln/c97f4ae7-be1f-4f71-b238-7c095b126e74	2020-03-16 09:42:33 -07:00
Clint Wylie	69af760a19	add manual laning strategy, integration test (#9492 ) * add manual laning strategy, integration test, json config test * share percent conversion method * wrong assert * review stuffs * doc adjustments * more tests * test adjustment * adjust docs * Update index.md	2020-03-13 20:06:55 -07:00
mcbrewster	bcb9a632c7	Web console: update druid-query-toolkit to version 0.4.x (#9500 ) * add support for new version of DQT * update druid-query-toolkit * fix direction css * fix remove * update package * remove useless conditional * bump package * jest -u Co-authored-by: Maggie Brewster <maggiebrewster@implydata20sMBP.attlocal.net>	2020-03-13 18:09:47 -07:00
Clint Wylie	6afd55c8f4	threshold based automatic query prioritization (#9493 ) * threshold based automatic query prioritization * fixes * spelling and fixes * fix docs * spelling * checkstyle * adjustments * doc fix	2020-03-13 01:41:54 -07:00
Chi Cao Minh	6b02991464	Match GREATEST/LEAST function behavior to other DBs (#9488 ) * Match GREATEST/LEAST function behavior Change the behavior of the GREATEST / LEAST functions to be similar to how it is implemented in other databases (as functions instead of aggregators). The GREATEST/LEAST functions are not in the SQL standard, but users will expect behavior similar to what other databases provide. * Match postgres behavior & handle more SQL types * Fix imports	2020-03-12 15:10:11 -07:00
Vadim Ogievetsky	ddc6f87920	Web console: standardize the spec format (#9477 ) * standerdize the spec format * fix spec upgrade	2020-03-12 14:21:23 -07:00
Himanshu	1ba1a3c523	fix worker category on Indexer node (#9510 )	2020-03-12 14:11:02 -07:00
Gian Merlino	ff59d2e78b	Move RowSignature from druid-sql to druid-processing and make use of it. (#9508 ) * Move RowSignature from druid-sql to druid-processing and make use of it. 1) Moved (most of) RowSignature from sql to processing. Left behind the SQL-specific stuff in a RowSignatures utility class. It also picked up some new convenience methods along the way. 2) There were a lot of places in the code where Map<String, ValueType> was used to associate columns with type info. These are now all replaced with RowSignature. 3) QueryToolChest's resultArrayFields method is replaced with resultArraySignature, and it now provides type info. * Fix up extensions. * Various fixes	2020-03-12 11:06:44 -07:00
Jonathan Wei	3082b9289a	Fix NPE when using IndexedTable and all left rows are filtered out (#9490 ) * Fix NPE when using IndexedTable and all left rows are filtered out * Fix compile * Add constant for uninitialized current row * Fix checkstyle	2020-03-11 19:23:05 -07:00
Gian Merlino	2ef5c17441	Link up row-based datasources to serving layer. (#9503 ) * Link up row-based datasources to serving layer. - Add SegmentWrangler interface that allows linking of DataSources to Segments. - Add LocalQuerySegmentWalker that uses SegmentWranglers to compute queries on data that is available locally. - Modify ClientQuerySegmentWalker to use LocalQuerySegmentWalker when the base datasource is concrete and not a table. - Add SegmentWranglerModule to the Broker so it has them available and can properly instantiate . LocalQuerySegmentWalkers. - Set InlineDataSource and LookupDataSource to concrete, since they can be directly queried now. * Fix tests.	2020-03-11 11:32:27 -07:00
Maytas Monsereenusorn	e9888f41cb	Modify check java version script to indicate experimental support for Java 11 (#9455 ) * Modify check java version script to indicate experimental support for Java 11 * update docs	2020-03-11 09:22:39 -07:00
Maytas Monsereenusorn	9231f2acb3	Integration test compile with Java 8 and run with Java 8 and 11 (#9491 ) * test integration compile with 8 and run with 11 * Integration test compile with Java 8 and run with Java 8 and 11	2020-03-11 09:22:27 -07:00
Gian Merlino	4f085896c6	Ability to directly query row-based datasources. (#9502 ) * Ability to directly query row-based datasources. Includes: - Foundational classes RowBasedSegment, RowBasedStorageAdapter, RowBasedCursor provide a queryable interface on top of a RowBasedColumnSelectorFactory. - Add LookupSegment: A RowBasedSegment that is built on lookup data. - Improve capability reporting in RowBasedColumnSelectorFactory. * Fix import. * Remove unthrown IOException.	2020-03-10 20:39:01 -07:00
Samarth Jain	c74749f0f4	Don't exclude null dimension values from the map based query response (#9438 )	2020-03-10 15:06:03 -07:00
Jihoon Son	7401bb3f93	Improve OvershadowableManager performance (#9441 ) * Use the iterator instead of higherKey(); use the iterator API instead of stream * Fix tests; fix a concurrency bug in timeline * fix test * add tests for findNonOvershadowedObjectsInInterval * fix test * add missing tests; fix a bug in QueueEntry * equals tests * fix test	2020-03-10 13:22:19 -07:00
zachjsh	7e0e767cc2	Ability to Delete task logs and segments from S3 (#9459 ) * Ability to Delete task logs and segments from S3 * implement ability to delete all tasks logs or all task logs written before a particular date when written to S3 * implement ability to delete all segments from S3 deep storage * upgrade version of aws SDK in use * * update licenses for updated AWS SDK version * * fix bug in iterating through results from S3 * revert back to original version of AWS SDK * * Address review comments * * Fix failing dependency check	2020-03-10 13:13:46 -07:00
Himanshu	75a5591448	remove old unused zookeeper dependent lookups code (#9480 ) * remove old unused zookeeper dependent lookups code * make intellij inspector happy	2020-03-10 12:12:48 -07:00
Chi Cao Minh	559c7b64cc	Suppress CVEs for htrace-core4 and openstack-swift (#9489 ) CVE-2013-7109 can be ignored for openstack-swift as it is for the python SDK and druid uses the java SDK. The jackson-databind:2.4.0 CVEs via htrace-core4 are all suppressed for now as fixing them requires updating the hadoop version.	2020-03-10 10:55:41 -07:00
Gian Merlino	c6c2282b59	Harmonization and bug-fixing for selector and filter behavior on unknown types. (#9484 ) * Harmonization and bug-fixing for selector and filter behavior on unknown types. - Migrate ValueMatcherColumnSelectorStrategy to newer ColumnProcessorFactory system, and set defaultType COMPLEX so unknown types can be dynamically matched. - Remove ValueGetters in favor of ColumnComparisonFilter doing its own thing. - Switch various methods to use convertObjectToX when casting to numbers, rather than ad-hoc and inconsistent logic. - Fix bug in RowBasedExpressionColumnValueSelector: isBindingArray should return true even for 0- or 1- element arrays. - Adjust various javadocs. * Add throwParseExceptions option to Rows.objectToNumber, switch back to that. * Update tests. * Adjust moment sketch tests.	2020-03-10 07:15:57 -07:00
Clint Wylie	8b9fe6f584	query laning and load shedding (#9407 ) * prototype * merge QueryScheduler and QueryManager * everything in its right place * adjustments * docs * fixes * doc fixes * use resilience4j instead of semaphore * more tests * simplify * checkstyle * spelling * oops heh * remove unused * simplify * concurrency tests * add SqlResource tests, refactor error response * add json config tests * use LongAdder instead of AtomicLong * remove test only stuffs from scheduler * javadocs, etc * style * partial review stuffs * adjust * review stuffs * more javadoc * error response documentation * spelling * preserve user specified lane for NoSchedulingStrategy * more test, why not * doc adjustment * style * missed review for make a thing a constant * fixes and tests * fix test * Update docs/configuration/index.md Co-Authored-By: sthetland <steve.hetland@imply.io> * doc update Co-authored-by: sthetland <steve.hetland@imply.io>	2020-03-10 02:57:16 -07:00
Jihoon Son	75e2051195	Convert array_contains() and array_overlaps() into native filters if possible (#9487 ) * Convert array_contains() and array_overlaps() into native filters if possible * make spotbugs happy and fix null results when null compatible	2020-03-09 22:50:38 -07:00
Maytas Monsereenusorn	2db20afbb7	Integration test cluster supports override config (#9473 ) * integration test refactor * integration test refactor * refactor integration test * refactor integration test * refactor integration test * refactor integration test * refactor integration test * refactor integration test * refactor integration test * refactor integration test * address comments	2020-03-09 21:17:49 -07:00
mcbrewster	95406ca20a	[IMPLY-2285] fix maxRowsPerSegment tool tip (#9468 )	2020-03-09 20:12:05 -07:00
mcbrewster	96ed7210d3	Fix history dialog overflow (#9471 ) * [IMPLY-1661] fix history dialog overflow * jest -u	2020-03-09 19:09:59 -07:00
Maytas Monsereenusorn	814f5a9717	add password provider reference to s3 optional cred docs (#9439 )	2020-03-09 17:56:42 -07:00
Clint Wylie	f8b1f2f7f3	fix issue when distinct grouping dimensions are optimized into the same virtual column expression (#9429 ) * fix issue when distinct grouping dimensions are optimized into the same virtual column expression * fix tests * more better * fixes	2020-03-09 17:48:29 -07:00
Jonathan Wei	0136dba95d	Add option to control join filter rewrites (#9472 ) * Add option to control join filter rewrites * Fix inspections	2020-03-09 17:36:07 -07:00
mcbrewster	a676d16226	[IMPLY-1767] fix popover direction (#9470 )	2020-03-09 17:35:02 -07:00
mcbrewster	da0ea627d0	Add disabled run button during loading state (#9474 ) * [IMPLY-1782] add disabled run button during loading state * jest -u	2020-03-09 17:10:35 -07:00
Himanshu	072bbe210f	remove ServerDiscoverySelector from DruidLeaderClient (#9481 )	2020-03-09 12:13:59 -07:00
Jihoon Son	f456d2fcf8	Resource leak in DruidSegmentReader (#9476 ) * Close the Yielder in DruidSegmentReader * forbidden api	2020-03-09 10:05:25 -07:00
Clint Wylie	a677664811	allow optimization of single multi-value column input expr with repeated identifier (#9425 ) * allow optimization of single multi-value column input expr with repeated identifier * add test	2020-03-06 12:53:32 -08:00
Julian Jaffe	eda03630d0	Add OnHeapMemorySegmentWriteOutMediumFactory (#9454 ) * Add OnHeapMemorySegmentWriteOutMediumFactory Add a factory for OnHeapMemorySegmentWriteOutMedium to support direct writing via Spark. * Register OnHeapMemorySegmentWriteOutMediumFactory. Register OnHeapMemorySegmentWriteOutMediumFactory with SegmentWriteOutMediumFactory. * Remove unnecessary throws The base `makeSegmentWriteOutMedium` throws an IOException, but the particular implementation of OnHeapMemorySegmentWriteOutMediumFactory does not throw a checked exception. * Update SegmentWriteOutMedium docs to include onHeapMemory Update the SegmentWriteOutMedium section of the indexing docs to include a description of the new OnHeapSegmentMediumWriteOut option.	2020-03-05 22:34:08 -08:00
Jihoon Son	64afc05080	Open the licenses.yaml with an explicit encoding (#9462 )	2020-03-05 17:13:44 -08:00
Clint Wylie	32cd47bc8e	Fix home view styling (#9444 )	2020-03-04 19:39:36 -08:00
Jihoon Son	3016057178	Make Transform an ExtensionPoint (#9319 ) * Make Transform an ExtensionPoint * Add transform to the list of documented extensions * Add example transform implementation	2020-03-04 12:13:14 -08:00
Chi Cao Minh	4ed83f6af6	Fix superbatch merge last partition boundaries (#9448 ) * Fix superbatch merge last partition boundaries A bug in the computation for the last parallel merge partition could cause an IndexOutOfBoundsException or precondition failure due to an empty partition. * Improve comments and tests	2020-03-04 10:35:21 -08:00
Jihoon Son	9466ac7c9b	Skip empty files for local, hdfs, and cloud input sources (#9450 ) * Skip empty files for local, hdfs, and cloud input sources * split hint spec doc * doc for skipping empty files * fix typo; adjust tests * unnecessary fluent iterable * address comments * fix test * use the right lists * fix test * fix test	2020-03-03 20:51:06 -08:00
mcbrewster	99095c4ac5	Add Azure ingestion flow to web console (#9437 ) * add support for azure * change bucket to container * add azure to input menu * remove static-azure	2020-03-03 11:06:00 -08:00
Gian Merlino	1fd865b7c1	BufferArrayGrouper: Fix potential overflow in requiredBufferCapacity. (#9435 ) * BufferArrayGrouper: Fix potential overflow in requiredBufferCapacity. If cardinality was high, the computation could overflow an int. There were tests for this, but the tests were wrong. * Nicer.	2020-02-28 14:27:52 -08:00
Gian Merlino	81d8be6e39	CacheStrategy: Improve Javadocs. (#9280 ) * CacheStrategy: Improve Javadocs. * Update processing/src/main/java/org/apache/druid/query/CacheStrategy.java Co-Authored-By: Suneet Saldanha <44787917+suneet-s@users.noreply.github.com> Co-authored-by: Suneet Saldanha <44787917+suneet-s@users.noreply.github.com>	2020-02-28 11:30:58 -08:00
Vadim Ogievetsky	c294e0b7c6	Web console: Column counter (#9334 ) * Column counter * more general test	2020-02-27 12:04:27 -08:00
Gian Merlino	ef3d24e886	Add javadocs for enableFilterPushDown. (#9423 )	2020-02-26 22:07:33 -08:00


10248 Commits