druid

Commit Graph

Author	SHA1	Message	Date
Jonathan Wei	a013350018	Adjust required permissions for system schema (#7579 ) * Adjust required permissions for system schema * PR comments, fix current_size handling * Checkstyle * Set curr_size instead of current_size * Adjust information schema docs * Fix merge conflict * Update tests	2019-05-02 07:18:02 -07:00
Surekha	15d19f3059	Add is_overshadowed column to sys.segments table (#7425 ) * Add is_overshadowed column to sys.segments table * update docs * Rename class and variables * PR comments * PR comments * remove unused variables in MetadataResource * move constants together * add getFullyOvershadowedSegments method to ImmutableDruidDataSource * Fix compareTo of SegmentWithOvershadowedStatus * PR comment * PR comments * PR comments * PR comments * PR comments * fix issue with already consumed stream * minor refactoring * PR comments	2019-05-01 18:00:57 +02:00
Gian Merlino	c648775b5b	SQL: Remove "useFallback" feature. (#7567 ) This feature allows Calcite's Bindable interpreter to be bolted on top of Druid queries and table scans. I think it should be removed for a few reasons: 1. It is not recommended for production anyway, because it generates unscalable query plans (e.g. it will plan a join into two table scans and then try to do the entire join in memory on the broker). 2. It doesn't work with Druid-specific SQL functions, like TIME_FLOOR, REGEXP_EXTRACT, APPROX_COUNT_DISTINCT, etc. 3. It makes the SQL planning code needlessly complicated. With SQL coming out of experimental status soon, it's a good opportunity to remove this feature.	2019-04-28 18:26:44 -07:00
Xue Yu	2c8a71f883	Support LPAD and RPAD sql function (#7388 ) * lpad and rpad sql function * feedback address * feedback address * add doc and format * update docs	2019-04-22 14:51:32 -07:00
Clint Wylie	be65cca248	refactor druid-bloom-filter aggregators (#7496 ) * now with 100% more buffer * there can be only 1 * simplify * javadoc * clean up unused test method * fix exception message * style * why does style hate javadocs * review stuff * style :(	2019-04-18 11:54:06 -07:00
Kazuhito Takeuchi	7c19c92a81	Add ROUND function in druid-sql. (#7224 ) * Implement round function in druid-sql * Return value according to the type of argument * Fix codes for abnoraml inputs, updated math-expr.md * Fix assert text * Fix error messages and refactor codes * Fix compile error, update sql.md, refactor codes and format tests	2019-04-16 11:15:39 -07:00
Gian Merlino	721191635a	SQL: Include virtual columns used for filtering in ScanQuery. (#7472 ) PR #6902 introduced the ability to use virtual columns for filters, but they were being omitted from "scan" queries, so filters would refer to a null column instead of the intended virtual column.	2019-04-14 15:03:36 -07:00
Surekha	3e5dae9b96	Rename SegmentMetadataHolder to AvailableSegmentMetadata (#7372 )	2019-04-14 10:19:48 -07:00
Justin Borromeo	408e3e1b2a	Remove select execution code from SQL planner (#7416 ) * Removed select execution code from SQL planner * Update doc	2019-04-10 22:32:57 -07:00
Benedict Jin	2f64414ade	Add "REVERSE" / "REPEAT" / "RIGHT" / "LEFT" functions (#7334 ) * Add "REVERSE" / "REPEAT" / "RIGHT" / "LEFT" functions * Fix ImportOrder * Use RuntimeException instead of OutOfMemoryError according to "Effective Java" * Simplify * Patch suggestions	2019-04-10 11:46:29 +08:00
Justin Borromeo	799c66d9ac	Allow max rows and max segments for time-ordered scans to be overridden using the scan query JSON spec (#7413 ) * Initial changes * Fixed NPEs * Fixed failing spec test * Fixed failing Calcite test * Move configs to context * Validated and added docs * fixed weird indentation * Update default context vals in doc * Fixed allowable values	2019-04-07 20:12:52 -07:00
Clint Wylie	76b4a5c62e	refactor lookups to be more chill to router (#7222 ) * refactor lookups to be more chill to router * remove accidental change * fix and combine LookupIntrospectionResourceTest * fix inspection * rename RouterLookupModule to LookupSerdeModule and RouterLookupExtractorFactoryContainerProvider to NoopLookupExtractorFactoryContainerProvider * make comment generic * use ConfigResourceFilter instead of StateResourceFilter * fix indentation * unused import * another unused import * refactor some stuff into processing module, split up LookupModule.java classes into their own files	2019-04-05 14:49:41 -07:00
Gian Merlino	8c104a115c	SQL: Add STRING_FORMAT function. (#7327 )	2019-04-03 17:09:54 -04:00
Atul Mohan	c883c52cb1	Fix tests (#7401 )	2019-04-02 16:49:21 -07:00
Justin Borromeo	4584b5e139	SQL support for time-ordered scan (#7373 ) * Squashed commit of the following: commit `287a367f41` Author: Justin Borromeo <jborrome@edu.uwaterloo.ca> Date: Wed Mar 27 20:03:41 2019 -0700 Implemented Clint's recommendations commit `07503ea5c0` Author: Justin Borromeo <jborrome@edu.uwaterloo.ca> Date: Wed Mar 27 17:49:09 2019 -0700 doc fix commit `231a72e7d9` Author: Justin Borromeo <jborrome@edu.uwaterloo.ca> Date: Wed Mar 27 17:38:20 2019 -0700 Modified sequence limit to accept longs and added test for long limits commit `1df50de321` Merge: `480e932fd` `c7fea6ac8` Author: Justin Borromeo <jborrome@edu.uwaterloo.ca> Date: Tue Mar 26 15:23:01 2019 -0700 Merge branch 'master' into 6088-Time-Ordering-On-Scans-N-Way-Merge commit `480e932fdf` Author: Justin Borromeo <jborrome@edu.uwaterloo.ca> Date: Tue Mar 26 14:58:04 2019 -0700 Checkstyle and doc update commit `487f31fcf6` Author: Justin Borromeo <jborrome@edu.uwaterloo.ca> Date: Tue Mar 26 14:39:25 2019 -0700 Refixed regression commit `fb858efbb7` Author: Justin Borromeo <jborrome@edu.uwaterloo.ca> Date: Tue Mar 26 13:14:48 2019 -0700 Added test for n-way merge commit `376e8bf906` Author: Justin Borromeo <jborrome@edu.uwaterloo.ca> Date: Tue Mar 26 11:42:54 2019 -0700 Refactor n-way merge commit `8a6bb1127c` Author: Justin Borromeo <jborrome@edu.uwaterloo.ca> Date: Mon Mar 25 17:17:41 2019 -0700 Fix docs and flipped boolean in ScanQueryLimitRowIterator commit `35692680fc` Author: Justin Borromeo <jborrome@edu.uwaterloo.ca> Date: Mon Mar 25 16:15:49 2019 -0700 Fix bug messing up count of rows commit `219af478c8` Author: Justin Borromeo <jborrome@edu.uwaterloo.ca> Date: Mon Mar 25 15:57:55 2019 -0700 Fix bug in numRowsScanned commit `da4fc66403` Author: Justin Borromeo <jborrome@edu.uwaterloo.ca> Date: Mon Mar 25 15:19:45 2019 -0700 Check type of segment spec before using for time ordering commit `b822fc73df` Author: Justin Borromeo <jborrome@edu.uwaterloo.ca> Date: Mon Mar 25 13:19:02 2019 -0700 Revert "Merge branch '6088-Time-Ordering-On-Scans-N-Way-Merge' of github.com:justinborromeo/incubator-druid into 6088-Time-Ordering-On-Scans-N-Way-Merge" This reverts commit `57033f36df`, reversing changes made to `8f01d8dd16`. commit `57033f36df` Merge: `8f01d8dd1` `86d9730fc` Author: Justin Borromeo <jborrome@edu.uwaterloo.ca> Date: Mon Mar 25 13:13:52 2019 -0700 Merge branch '6088-Time-Ordering-On-Scans-N-Way-Merge' of github.com:justinborromeo/incubator-druid into 6088-Time-Ordering-On-Scans-N-Way-Merge commit `8f01d8dd16` Author: Justin Borromeo <jborrome@edu.uwaterloo.ca> Date: Mon Mar 25 13:13:32 2019 -0700 Revert "Fixed failing tests -> allow usage of all types of segment spec" This reverts commit `ec470288c7`. commit `ec470288c7` Author: Justin Borromeo <jborrome@edu.uwaterloo.ca> Date: Mon Mar 25 11:01:35 2019 -0700 Fixed failing tests -> allow usage of all types of segment spec commit `86d9730fc9` Author: Justin Borromeo <jborrome@edu.uwaterloo.ca> Date: Mon Mar 25 11:01:35 2019 -0700 Fixed failing tests -> allow usage of all types of segment spec commit `8b3b6b51ed` Author: Justin Borromeo <jborrome@edu.uwaterloo.ca> Date: Fri Mar 22 16:01:56 2019 -0700 Nit comment commit `a87d02127c` Author: Justin Borromeo <jborrome@edu.uwaterloo.ca> Date: Fri Mar 22 15:54:42 2019 -0700 Fix checkstyle and test commit `62dcedacde` Author: Justin Borromeo <jborrome@edu.uwaterloo.ca> Date: Fri Mar 22 15:30:41 2019 -0700 More comments commit `1b46b58aec` Author: Justin Borromeo <jborrome@edu.uwaterloo.ca> Date: Fri Mar 22 15:19:52 2019 -0700 Added a bit of docs commit `49472162b7` Author: Justin Borromeo <jborrome@edu.uwaterloo.ca> Date: Fri Mar 22 10:27:41 2019 -0700 Rename segment limit -> segment partitions limit commit `43d490cc3a` Author: Justin Borromeo <jborrome@edu.uwaterloo.ca> Date: Thu Mar 21 13:16:58 2019 -0700 Optimized n-way merge strategy commit `42f5246b8d` Author: Justin Borromeo <jborrome@edu.uwaterloo.ca> Date: Wed Mar 20 17:40:19 2019 -0700 Smarter limiting for pQueue method commit `4823dab895` Author: Justin Borromeo <jborrome@edu.uwaterloo.ca> Date: Wed Mar 20 16:05:53 2019 -0700 Finish rename commit `2528a56142` Author: Justin Borromeo <jborrome@edu.uwaterloo.ca> Date: Mon Mar 18 14:00:50 2019 -0700 Renaming commit `7bfa77d3c1` Merge: `a032c46ee` `7e49d4739` Author: Justin Borromeo <jborrome@edu.uwaterloo.ca> Date: Tue Mar 12 16:57:45 2019 -0700 Merge branch 'Update-Query-Interrupted-Exception' into 6088-Time-Ordering-On-Scans-N-Way-Merge commit `7e49d47391` Author: Justin Borromeo <jborrome@edu.uwaterloo.ca> Date: Tue Mar 12 16:51:25 2019 -0700 Added error message for UOE commit `a032c46ee0` Author: Justin Borromeo <jborrome@edu.uwaterloo.ca> Date: Tue Mar 12 16:47:17 2019 -0700 Updated error message commit `57b5682654` Author: Justin Borromeo <jborrome@edu.uwaterloo.ca> Date: Tue Mar 12 12:44:02 2019 -0700 Fixed tests commit `45e95bb1f4` Author: Justin Borromeo <jborrome@edu.uwaterloo.ca> Date: Tue Mar 12 11:09:08 2019 -0700 Optimization commit `cce917ab84` Author: Justin Borromeo <jborrome@edu.uwaterloo.ca> Date: Fri Mar 8 14:11:07 2019 -0800 Checkstyle fix commit `73f4038068` Author: Justin Borromeo <jborrome@edu.uwaterloo.ca> Date: Thu Mar 7 18:40:00 2019 -0800 Applied Jon's recommended changes commit `fb966def83` Author: Justin Borromeo <jborrome@edu.uwaterloo.ca> Date: Thu Mar 7 11:03:01 2019 -0800 Sorry, checkstyle commit `6dc53b311c` Author: Justin Borromeo <jborrome@edu.uwaterloo.ca> Date: Wed Mar 6 10:34:13 2019 -0800 Improved test and appeased TeamCity commit `35c96d3557` Author: Justin Borromeo <jborrome@edu.uwaterloo.ca> Date: Mon Mar 4 16:00:44 2019 -0800 Checkstyle fix commit `2d1978d571` Merge: `83ec3fe1f` `3398d3982` Author: Justin Borromeo <jborrome@edu.uwaterloo.ca> Date: Mon Mar 4 15:24:49 2019 -0800 Merge branch 'master' into 6088-Time-Ordering-On-Scans-N-Way-Merge commit `83ec3fe1f1` Author: Justin Borromeo <jborrome@edu.uwaterloo.ca> Date: Fri Mar 1 13:40:22 2019 -0800 Nit-change on javadoc commit `47c970b5f4` Author: Justin Borromeo <jborrome@edu.uwaterloo.ca> Date: Fri Mar 1 13:38:29 2019 -0800 Wrote tests and added Javadoc commit `5ff59f5ca6` Author: Justin Borromeo <jborrome@edu.uwaterloo.ca> Date: Thu Feb 28 15:58:20 2019 -0800 Reset config commit `806166f977` Author: Justin Borromeo <jborrome@edu.uwaterloo.ca> Date: Thu Feb 28 15:49:07 2019 -0800 Fixed failing tests commit `de83b11a1b` Author: Justin Borromeo <jborrome@edu.uwaterloo.ca> Date: Tue Feb 26 16:40:48 2019 -0800 Fixed mistakes in merge commit `5bd0e1a32c` Merge: `18cce9a64` `9fa649b3b` Author: Justin Borromeo <jborrome@edu.uwaterloo.ca> Date: Tue Feb 26 16:39:16 2019 -0800 Merge branch 'master' into 6088-Time-Ordering-On-Scans-N-Way-Merge commit `18cce9a646` Author: Justin Borromeo <jborrome@edu.uwaterloo.ca> Date: Tue Feb 26 13:16:44 2019 -0800 Change so batching only occurs on broker for time-ordered scans Restricted batching to broker for time-ordered queries and adjusted tests Formatting Cleanup commit `451e2b4365` Author: Justin Borromeo <jborrome@edu.uwaterloo.ca> Date: Tue Feb 26 11:14:27 2019 -0800 WIP commit `69b24bd851` Merge: `763c43df7` `417b9f2fe` Author: Justin Borromeo <jborrome@edu.uwaterloo.ca> Date: Fri Feb 22 18:13:26 2019 -0800 Merge branch 'master' into 6088-Time-Ordering-On-Scans-N-Way-Merge commit `763c43df7e` Author: Justin Borromeo <jborrome@edu.uwaterloo.ca> Date: Fri Feb 22 18:07:06 2019 -0800 Multi-historical setup works commit `06a5218917` Author: Justin Borromeo <jborrome@edu.uwaterloo.ca> Date: Fri Feb 22 16:59:57 2019 -0800 Wrote docs commit `3b923dac9c` Author: Justin Borromeo <jborrome@edu.uwaterloo.ca> Date: Fri Feb 22 14:03:22 2019 -0800 Fixed bug introduced by replacing deque with list commit `023538d831` Author: Justin Borromeo <jborrome@edu.uwaterloo.ca> Date: Fri Feb 22 13:30:08 2019 -0800 Sequence stuff is so dirty :( commit `e1fc2955d3` Author: Justin Borromeo <jborrome@edu.uwaterloo.ca> Date: Fri Feb 22 10:39:59 2019 -0800 WIP commit `f57ff253fa` Author: Justin Borromeo <jborrome@edu.uwaterloo.ca> Date: Thu Feb 21 18:22:06 2019 -0800 Ordering is correct on n-way merge -> still need to batch events into ScanResultValues commit `1813a5472c` Author: Justin Borromeo <jborrome@edu.uwaterloo.ca> Date: Thu Feb 21 17:06:18 2019 -0800 Cleanup commit `f83e99655d` Author: Justin Borromeo <jborrome@edu.uwaterloo.ca> Date: Thu Feb 21 16:56:36 2019 -0800 Refactor and pQueue works commit `b13ff624a9` Author: Justin Borromeo <jborrome@edu.uwaterloo.ca> Date: Thu Feb 21 15:13:33 2019 -0800 Set up time ordering strategy decision tree commit `fba6b022f0` Author: Justin Borromeo <jborrome@edu.uwaterloo.ca> Date: Thu Feb 21 15:08:27 2019 -0800 Added config and get # of segments commit `c9142e721c` Merge: `cd489a020` `554b0142c` Author: Justin Borromeo <jborrome@edu.uwaterloo.ca> Date: Wed Feb 20 10:12:50 2019 -0800 Merge branch 'master' into 6088-Time-Ordering-On-Scans-V2 commit `cd489a0208` Author: Justin Borromeo <jborrome@edu.uwaterloo.ca> Date: Wed Feb 20 00:16:48 2019 -0800 Fixed failing test due to null resultFormat commit `7baeade832` Author: Justin Borromeo <jborrome@edu.uwaterloo.ca> Date: Tue Feb 19 17:52:06 2019 -0800 Changes based on Gian's comments commit `35150fe1a6` Author: Justin Borromeo <jborrome@edu.uwaterloo.ca> Date: Fri Feb 15 15:57:53 2019 -0800 Small changes commit `4e69276d57` Author: Justin Borromeo <jborrome@edu.uwaterloo.ca> Date: Mon Feb 11 12:09:54 2019 -0800 Removed unused import to satisfy PMD check commit `ecb0f483a9` Author: Justin Borromeo <jborrome@edu.uwaterloo.ca> Date: Mon Feb 11 10:37:11 2019 -0800 improved doc commit `f0eddee665` Author: Justin Borromeo <jborrome@edu.uwaterloo.ca> Date: Mon Feb 11 10:18:45 2019 -0800 Added more javadoc commit `5f92dd7325` Author: Justin Borromeo <jborrome@edu.uwaterloo.ca> Date: Mon Feb 11 10:05:58 2019 -0800 Unused import commit `93e1636287` Author: Justin Borromeo <jborrome@edu.uwaterloo.ca> Date: Mon Feb 11 10:03:14 2019 -0800 Added javadoc on ScanResultValueTimestampComparator commit `134041c479` Author: Justin Borromeo <jborrome@edu.uwaterloo.ca> Date: Fri Feb 8 13:13:54 2019 -0800 Renamed sort function commit `2e3577cd3d` Author: Justin Borromeo <jborrome@edu.uwaterloo.ca> Date: Thu Feb 7 13:01:25 2019 -0800 Fixed benchmark queries commit `d3b335af42` Author: Justin Borromeo <jborrome@edu.uwaterloo.ca> Date: Thu Feb 7 11:08:07 2019 -0800 added all query types to scan benchmark commit `ab00eade9f` Author: Justin Borromeo <jborrome@edu.uwaterloo.ca> Date: Thu Feb 7 09:42:48 2019 -0800 Kicking travis with change to benchmark param commit `b432beaf84` Author: Justin Borromeo <jborrome@edu.uwaterloo.ca> Date: Wed Feb 6 17:45:59 2019 -0800 Fixed failing calcite tests commit `b2c8c77ad4` Author: Justin Borromeo <jborrome@edu.uwaterloo.ca> Date: Wed Feb 6 17:39:48 2019 -0800 Fixing tests WIP commit `85e72a614e` Author: Justin Borromeo <jborrome@edu.uwaterloo.ca> Date: Wed Feb 6 15:42:02 2019 -0800 Set to spaces over tabs commit `7e872a8ebc` Author: Justin Borromeo <jborrome@edu.uwaterloo.ca> Date: Wed Feb 6 15:36:24 2019 -0800 Created an error message for when someone tries to time order a result set > threshold limit commit `e8a4b49044` Merge: `305876a43` `8e3a58f72` Author: Justin Borromeo <jborrome@edu.uwaterloo.ca> Date: Wed Feb 6 15:05:11 2019 -0800 Merge branch 'master' into 6088-Time-Ordering-On-Scans-V2 commit `305876a434` Author: Justin Borromeo <jborrome@edu.uwaterloo.ca> Date: Wed Feb 6 15:02:02 2019 -0800 nit commit `8212a21caf` Author: Justin Borromeo <jborrome@edu.uwaterloo.ca> Date: Wed Feb 6 14:40:35 2019 -0800 Improved conciseness commit `10b5e0ca93` Author: Justin Borromeo <jborrome@edu.uwaterloo.ca> Date: Wed Feb 6 13:42:12 2019 -0800 . commit `dfe4aa9681` Author: Justin Borromeo <jborrome@edu.uwaterloo.ca> Date: Wed Feb 6 13:41:18 2019 -0800 Fixed codestyle and forbidden API errors commit `148939e88b` Merge: `4f51024b3` `5edbe2ae1` Author: Justin Borromeo <jborrome@edu.uwaterloo.ca> Date: Wed Feb 6 13:26:17 2019 -0800 Merge branch '6088-Create-Scan-Benchmark' into 6088-Time-Ordering-On-Scans-V2 commit `5edbe2ae12` Merge: `60b7684db` `315ccb76b` Author: Justin Borromeo <jborrome@edu.uwaterloo.ca> Date: Wed Feb 6 13:18:55 2019 -0800 Merge github.com:apache/incubator-druid into 6088-Create-Scan-Benchmark commit `60b7684db7` Author: Justin Borromeo <jborrome@edu.uwaterloo.ca> Date: Wed Feb 6 13:02:13 2019 -0800 Committing a param change to kick teamcity commit `4f51024b31` Author: Justin Borromeo <jborrome@edu.uwaterloo.ca> Date: Wed Feb 6 12:08:12 2019 -0800 Wrote more tests for scan result value sort commit `8b7d5f5081` Author: Justin Borromeo <jborrome@edu.uwaterloo.ca> Date: Wed Feb 6 11:55:09 2019 -0800 Wrote tests for heapsort scan result values and fixed bug where iterator wasn't returning elements in correct order commit `b6d4df3864` Author: Justin Borromeo <jborrome@edu.uwaterloo.ca> Date: Tue Feb 5 16:45:20 2019 -0800 Decrease segment size for less memory usage commit `d1a1793f36` Author: Justin Borromeo <jborrome@edu.uwaterloo.ca> Date: Tue Feb 5 12:40:26 2019 -0800 nit commit `7deb06f6df` Merge: `b7d3a4900` `86c5eee13` Author: Justin Borromeo <jborrome@edu.uwaterloo.ca> Date: Tue Feb 5 10:53:38 2019 -0800 Merge branch '6088-Create-Scan-Benchmark' into 6088-Time-Ordering-On-Scans-V2 commit `86c5eee13b` Author: Justin Borromeo <jborrome@edu.uwaterloo.ca> Date: Tue Feb 5 10:31:27 2019 -0800 Broke some long lines into two lines commit `b7d3a4900a` Merge: `796083f2b` `8bc5eaa90` Author: Justin Borromeo <jborrome@edu.uwaterloo.ca> Date: Tue Feb 5 10:23:32 2019 -0800 Merge branch 'master' into 6088-Time-Ordering-On-Scans-V2 commit `737a83321d` Author: Justin Borromeo <jborrome@edu.uwaterloo.ca> Date: Tue Feb 5 10:15:32 2019 -0800 Made Jon's changes and removed TODOs commit `796083f2bb` Author: Justin Borromeo <jborrome@edu.uwaterloo.ca> Date: Mon Feb 4 15:37:42 2019 -0800 Benchmark param change commit `20c36644db` Author: Justin Borromeo <jborrome@edu.uwaterloo.ca> Date: Mon Feb 4 15:36:35 2019 -0800 More param changes commit `9e6e71616b` Author: Justin Borromeo <jborrome@edu.uwaterloo.ca> Date: Mon Feb 4 15:31:21 2019 -0800 Changed benchmark params commit `01b25ed112` Author: Justin Borromeo <jborrome@edu.uwaterloo.ca> Date: Mon Feb 4 14:36:18 2019 -0800 Added time ordering to the scan benchmark commit `432acaf085` Author: Justin Borromeo <jborrome@edu.uwaterloo.ca> Date: Mon Feb 4 12:03:14 2019 -0800 Change number of benchmark iterations commit `12e51a2721` Author: Justin Borromeo <jborrome@edu.uwaterloo.ca> Date: Mon Feb 4 12:02:13 2019 -0800 Added TimestampComparator tests commit `e66339cd76` Author: Justin Borromeo <jborrome@edu.uwaterloo.ca> Date: Mon Feb 4 10:56:41 2019 -0800 Remove todos commit `ad731a362b` Author: Justin Borromeo <jborrome@edu.uwaterloo.ca> Date: Mon Feb 4 10:55:56 2019 -0800 Change benchmark commit `989bd2d50e` Merge: `7b5847139` `26930f8d2` Author: Justin Borromeo <jborrome@edu.uwaterloo.ca> Date: Mon Feb 4 10:46:38 2019 -0800 Merge branch '6088-Create-Scan-Benchmark' into 6088-Time-Ordering-On-Scans-V2 commit `7b58471394` Author: Justin Borromeo <jborrome@edu.uwaterloo.ca> Date: Sat Feb 2 03:48:18 2019 -0800 Licensing stuff commit `79e8319383` Author: Justin Borromeo <jborrome@edu.uwaterloo.ca> Date: Fri Feb 1 18:22:58 2019 -0800 Move ScanResultValue timestamp comparator to a separate class for testing commit `7a6080f636` Author: Justin Borromeo <jborrome@edu.uwaterloo.ca> Date: Fri Feb 1 18:00:58 2019 -0800 Stuff for time-ordered scan query commit `26930f8d20` Author: Justin Borromeo <jborrome@edu.uwaterloo.ca> Date: Fri Feb 1 16:38:49 2019 -0800 It runs. commit `dd4ec1ac9c` Author: Justin Borromeo <jborrome@edu.uwaterloo.ca> Date: Fri Feb 1 15:12:17 2019 -0800 Need to form queries commit `dba6e492a0` Merge: `10e57d5f9` `7d4cc2873` Author: Justin Borromeo <jborrome@edu.uwaterloo.ca> Date: Fri Feb 1 14:13:39 2019 -0800 Merge branch 'master' into 6088-Create-Scan-Benchmark commit `10e57d5f9e` Author: Justin Borromeo <jborrome@edu.uwaterloo.ca> Date: Fri Feb 1 14:04:13 2019 -0800 Moved Scan Builder to Druids class and started on Scan Benchmark setup * Changed SQL planning to use scan over select * Fixed some bugs * Removed unused imports * Updated calcite query test and test segment walker * Fixed formatting recommendations	2019-04-02 15:46:01 -07:00
Xue Yu	78fd5aff21	support radians and degrees in sql (#7336 ) * support radians and degrees in sql * update test case	2019-04-02 12:47:49 -07:00
Justin Borromeo	ad7862c58a	Time Ordering On Scans (#7133 ) * Moved Scan Builder to Druids class and started on Scan Benchmark setup * Need to form queries * It runs. * Stuff for time-ordered scan query * Move ScanResultValue timestamp comparator to a separate class for testing * Licensing stuff * Change benchmark * Remove todos * Added TimestampComparator tests * Change number of benchmark iterations * Added time ordering to the scan benchmark * Changed benchmark params * More param changes * Benchmark param change * Made Jon's changes and removed TODOs * Broke some long lines into two lines * nit * Decrease segment size for less memory usage * Wrote tests for heapsort scan result values and fixed bug where iterator wasn't returning elements in correct order * Wrote more tests for scan result value sort * Committing a param change to kick teamcity * Fixed codestyle and forbidden API errors * . * Improved conciseness * nit * Created an error message for when someone tries to time order a result set > threshold limit * Set to spaces over tabs * Fixing tests WIP * Fixed failing calcite tests * Kicking travis with change to benchmark param * added all query types to scan benchmark * Fixed benchmark queries * Renamed sort function * Added javadoc on ScanResultValueTimestampComparator * Unused import * Added more javadoc * improved doc * Removed unused import to satisfy PMD check * Small changes * Changes based on Gian's comments * Fixed failing test due to null resultFormat * Added config and get # of segments * Set up time ordering strategy decision tree * Refactor and pQueue works * Cleanup * Ordering is correct on n-way merge -> still need to batch events into ScanResultValues * WIP * Sequence stuff is so dirty :( * Fixed bug introduced by replacing deque with list * Wrote docs * Multi-historical setup works * WIP * Change so batching only occurs on broker for time-ordered scans Restricted batching to broker for time-ordered queries and adjusted tests Formatting Cleanup * Fixed mistakes in merge * Fixed failing tests * Reset config * Wrote tests and added Javadoc * Nit-change on javadoc * Checkstyle fix * Improved test and appeased TeamCity * Sorry, checkstyle * Applied Jon's recommended changes * Checkstyle fix * Optimization * Fixed tests * Updated error message * Added error message for UOE * Renaming * Finish rename * Smarter limiting for pQueue method * Optimized n-way merge strategy * Rename segment limit -> segment partitions limit * Added a bit of docs * More comments * Fix checkstyle and test * Nit comment * Fixed failing tests -> allow usage of all types of segment spec * Fixed failing tests -> allow usage of all types of segment spec * Revert "Fixed failing tests -> allow usage of all types of segment spec" This reverts commit `ec470288c7`. * Revert "Merge branch '6088-Time-Ordering-On-Scans-N-Way-Merge' of github.com:justinborromeo/incubator-druid into 6088-Time-Ordering-On-Scans-N-Way-Merge" This reverts commit `57033f36df`, reversing changes made to `8f01d8dd16`. * Check type of segment spec before using for time ordering * Fix bug in numRowsScanned * Fix bug messing up count of rows * Fix docs and flipped boolean in ScanQueryLimitRowIterator * Refactor n-way merge * Added test for n-way merge * Refixed regression * Checkstyle and doc update * Modified sequence limit to accept longs and added test for long limits * doc fix * Implemented Clint's recommendations	2019-03-28 14:37:09 -07:00
Roman Leventov	bca40dcdaf	Fix some IntelliJ inspections (#7273 ) Prepare TeamCity for IntelliJ 2018.3.1 upgrade. Mostly removed redundant exceptions declarations in `throws` clauses.	2019-03-25 21:11:01 -03:00
Gian Merlino	4ca5fe0f60	SQL: Add PARSE_LONG function. (#7326 ) * SQL: Add PARSE_LONG function. * Fix test.	2019-03-22 15:40:10 -07:00
Roman Leventov	dfd27e00c0	Avoid many unnecessary materializations of collections of 'all segments in cluster' cardinality (#7185 ) * Avoid many unnecessary materializations of collections of 'all segments in cluster' cardinality * Fix DruidCoordinatorTest; Renamed DruidCoordinator.getReplicationStatus() to computeUnderReplicationCountsPerDataSourcePerTier() * More Javadocs, typos, refactor DruidCoordinatorRuntimeParams.createAvailableSegmentsSet() * Style * typo * Disable StaticPseudoFunctionalStyleMethod inspection because of too much false positives * Fixes	2019-03-19 18:22:56 -03:00
Furkan KAMACI	7ada1c49f9	Prohibit Throwables.propagate() (#7121 ) * Throw caught exception. * Throw caught exceptions. * Related checkstyle rule is added to prevent further bugs. * RuntimeException() is used instead of Throwables.propagate(). * Missing import is added. * Throwables are propogated if possible. * Throwables are propogated if possible. * Throwables are propogated if possible. * Throwables are propogated if possible. * * Checkstyle definition is improved. * Throwables.propagate() usages are removed. * Checkstyle pattern is changed for only scanning "Throwables.propagate(" instead of checking lookbehind. * Throwable is kept before firing a Runtime Exception. * Fix unused assignments.	2019-03-14 18:28:33 -03:00
Clint Wylie	d7ba19d477	sql, filters, and virtual columns (#6902 ) * refactor sql planning to re-use expression virtual columns when possible when constructing a DruidQuery, allowing virtual columns to be defined in filter expressions, and making resulting native druid queries more concise. also minor refactor of built-in sql aggregators to maximize code re-use * fix it * fix it in the right place * fixup for base64 stuff * fixup tests * fix merge conflict on import order * fixup * fix imports * fix tests * review comments * refactor * re-arrange * better javadoc * fixup merge * fixup tests * fix accidental changes	2019-03-11 11:37:58 -07:00
Xue Yu	65118277a3	support sin cos etc trigonometric function in sql (#7182 ) * support triangle function in sql * feedback address	2019-03-04 19:18:22 -08:00
Himanshu Pandey	8b803cbc22	Added checkstyle for "Methods starting with Capital Letters" (#7118 ) * Added checkstyle for "Methods starting with Capital Letters" and changed the method names violating this. * Un-abbreviate the method names in the calcite tests * Fixed checkstyle errors * Changed asserts position in the code	2019-02-23 20:10:31 -08:00
Surekha	02ef14f262	Fix num_rows in sys.segments (#6888 ) * Fix the bug with num_rows in sys.segments * Fix segmentMetadataInfo update in DruidSchema * Add numRows to SegmentMetadataHolder builder's constructor, so it's not overwritten * Rename SegSegmentSignature to setSegmentMetadataHolder and fix it so nested map is appended instead of recreated * Replace Map<String, Set<String>> segmentServerMap with Set<String> for num_replica * Remove unnecessary code and update test * Add unit test for num_rows * PR comments * change access modifier to default package level * minor changes to comments * PR comments	2019-02-11 16:21:19 -08:00
Jonathan Wei	fafbc4a80e	Set version to 0.15.0-incubating-SNAPSHOT (#7014 )	2019-02-07 14:02:52 -08:00
Justin Borromeo	6723243ed2	Create Scan Benchmark (#6986 ) * Moved Scan Builder to Druids class and started on Scan Benchmark setup * Need to form queries * It runs. * Remove todos * Change number of benchmark iterations * Changed benchmark params * More param changes * Made Jon's changes and removed TODOs * Broke some long lines into two lines * Decrease segment size for less memory usage * Committing a param change to kick teamcity	2019-02-06 14:45:01 -08:00
Surekha	ef451d3603	Add null checks in DruidSchema (#6830 ) * Add null checks in DruidSchema * Add unit tests * Add VisibleForTesting annotation * PR comments * unused import	2019-02-05 13:42:20 -08:00
Jonathan Wei	8bc5eaa908	Set version to 0.14.0-incubating-SNAPSHOT (#7003 )	2019-02-04 19:36:20 -08:00
Roman Leventov	0e926e8652	Prohibit assigning concurrent maps into Map-typed variables and fields and fix a race condition in CoordinatorRuleManager (#6898 ) * Prohibit assigning concurrent maps into Map-types variables and fields; Fix a race condition in CoordinatorRuleManager; improve logic in DirectDruidClient and ResourcePool * Enforce that if compute(), computeIfAbsent(), computeIfPresent() or merge() is called on a ConcurrentHashMap, it's stored in a ConcurrentHashMap-typed variable, not ConcurrentMap; add comments explaining get()-before-computeIfAbsent() optimization; refactor Counters; fix a race condition in Intialization.java * Remove unnecessary comment * Checkstyle * Fix getFromExtensions() * Add a reference to the comment about guarded computeIfAbsent() optimization; IdentityHashMap optimization * Fix UriCacheGeneratorTest * Workaround issue with MaterializedViewQueryQueryToolChest * Strengthen Appenderator's contract regarding concurrency	2019-02-04 09:18:12 -08:00
Surekha	7baa33049c	Introduce published segment cache in broker (#6901 ) * Add published segment cache in broker * Change the DataSegment interner so it's not based on DataSEgment's equals only and size is preserved if set * Added a trueEquals to DataSegment class * Use separate interner for realtime and historical segments * Remove trueEquals as it's not used anymore, change log message * PR comments * PR comments * Fix tests * PR comments * Few more modification to * change the coordinator api * removeall segments at once from MetadataSegmentView in order to serve a more consistent view of published segments * Change the poll behaviour to avoid multiple poll execution at same time * minor changes * PR comments * PR comments * Make the segment cache in broker off by default * Added a config to PlannerConfig * Moved MetadataSegmentView to sql module * Add doc for new planner config * Update documentation * PR comments * some more changes * PR comments * fix test * remove unintentional change, whether to synchronize on lifecycleLock is still in discussion in PR * minor changes * some changes to initialization * use pollPeriodInMS * Add boolean cachePopulated to check if first poll succeeds * Remove poll from start() * take the log message out of condition in stop()	2019-02-02 22:27:13 -08:00
Clint Wylie	7a5827e12e	bloom filter sql aggregator (#6950 ) * adds sql aggregator for bloom filter, adds complex value serde for sql results * fix tests * checkstyle * fix copy-paste	2019-02-01 13:54:46 -08:00
Clint Wylie	af3cbc3687	add bloom filter druid expression (#6904 ) * add "bloom_filter_test" druid expression to support bloom filters in ExpressionVirtualColumn and ExpressionDimFilter and sql expressions * more docs * use java.util.Base64, doc fixes	2019-01-28 08:41:45 -05:00
Benedict Jin	2b73644340	* Use `@SuppressWarnings("GuardedBy")` instead of `noinspection FieldAccessNotGuarded` comment (#6903 ) * Remove `@GuardedBy("connectionLock")` from `connectionLock` itself * Add FieldAccessNotGuarded into inspection profile and set the level to ERROR	2019-01-27 12:42:45 -08:00
Clint Wylie	66f64cd8bd	fix long/float/double dimension filtering for columns with nulls (#6906 ) * fix long,float, double dimension filtering when sql compatible null handling is enabled and the column has null values * revert unintended change * fix tests	2019-01-23 22:36:52 -08:00
Roman Leventov	8eae26fd4e	Introduce SegmentId class (#6370 ) * Introduce SegmentId class * tmp * Fix SelectQueryRunnerTest * Fix indentation * Fixes * Remove Comparators.inverse() tests * Refinements * Fix tests * Fix more tests * Remove duplicate DataSegmentTest, fixes #6064 * SegmentDescriptor doc * Fix SQLMetadataStorageUpdaterJobHandler * Fix DataSegment deserialization for ignoring id * Add comments * More comments * Address more comments * Fix compilation * Restore segment2 in SystemSchemaTest according to a comment * Fix style * fix testServerSegmentsTable * Fix compilation * Add comments about why SegmentId and SegmentIdWithShardSpec are separate classes * Fix SystemSchemaTest * Fix style * Compare SegmentDescriptor with SegmentId in Javadoc and comments rather than with DataSegment * Remove a link, see https://youtrack.jetbrains.com/issue/IDEA-205164 * Fix compilation	2019-01-21 11:11:10 -08:00
zhaojiandong	9f0fdcfef6	Fix deadlock in DruidStatement & DruidConnection (#6868 ) * Fix deadlock in DruidStatement & DruidConnection * change statements type to ConcurrentMap	2019-01-17 10:16:35 -08:00
Dayue Gao	5b8a221713	Add SQL id, request logs, and metrics (#6302 ) * use SqlLifecyle to manage sql execution, add sqlId * add sql request logger * fix UT * rename sqlId to sqlQueryId, sql/time to sqlQuery/time, etc * add docs and more sql request logger impls * add UT for http and jdbc * fix forbidden use of com.google.common.base.Charsets * fix UT in QuantileSqlAggregatorTest, supressed unused warning of getSqlQueryId * do not use default method in QueryMetrics interface * capitalize 'sql' everywhere in the non-property parts of the docs * use RequestLogger interface to log sql query * minor bugfixes and add switching request logger * add filePattern configs for FileRequestLogger * address review comments, adjust sql request log format * fix inspection error * try SuppressWarnings("RedundantThrows") to fix inspection error on ComposingRequestLoggerProvider	2019-01-15 23:12:59 -08:00
Surekha	f72f33f84a	Fix num_replicas count in sys.segments table (#6804 ) * Fix num_replicas count from sys.segments * Adjust unit test for num_replica > 1 * Pass named arguments instead of passing boolean constants * Address PR comments * PR comments	2019-01-15 08:31:29 -08:00
Charles Allen	5d2947cd52	Use Guava Compatible immediate executor service (#6815 ) * Use multi-guava version friendly direct executor implementation * Don't use a singleton * Fix strict compliation complaints * Copy Guava's DirectExecutor * Fix javadoc * Imports are the devil	2019-01-11 10:42:19 -08:00
Gian Merlino	bc671ac436	SQL: Fix ordering of sort, sortProject in DruidSemiJoin. (#6769 ) They were added in the wrong order, leading to this error message when evaluating rules: "Cannot move from stage[AGGREGATE] to stage[SORT_PROJECT]"	2019-01-03 10:36:28 -08:00
Surekha	5e5aad49e6	Set is_available to false by default for published segment (#6757 ) * Set is_available to false by default for published segment * Address comments Fix the is_published value for segments not in metadata store * Remove unused import * Use non-null sharSpec for a segment in test * Fix checkstyle * Modify comment	2018-12-20 13:29:00 -08:00
Gian Merlino	f0b7c272b9	Broker: Start up DruidSchema immediately if there are no segments. (#6765 ) Fixes a bug introduced in #6742, where the broker would delay startup indefinitely if there were no segments at all being served by any data servers.	2018-12-20 11:07:35 -07:00
Gian Merlino	7a09cde4de	Broker: Await initialization before finishing startup. (#6742 ) * Broker: Await initialization before finishing startup. In particular, hold off on announcing the service and starting the HTTP server until the server view and SQL metadata cache are finished initializing. This closes a window of time where a Broker could return partial results shortly after startup. As part of this, some simplification of server-lifecycle service announcements. This helps ensure that the two different kinds of announcements we do (legacy and new-style) stay in sync. * Remove unused imports. * Fix NPE in ServerRunnable.	2018-12-18 20:32:31 -08:00
Gian Merlino	f12a1aa993	SQL: Add support for queries with project-after-semijoin. (#6756 ) * SQL: Add support for queries with project-after-semijoin. These didn't work before, since the top Project rel wasn't getting merged into the DruidSemiJoin rel. This patch allows that to happen. * Null handling * Null handling * Null handling	2018-12-18 17:53:14 -08:00
Roman Leventov	ec38df7575	Simplify DruidNodeDiscoveryProvider; add DruidNodeDiscovery.Listener.nodeViewInitialized() (#6606 ) * Simplify DruidNodeDiscoveryProvider; add DruidNodeDiscovery.Listener.nodeViewInitialized() method; prohibit and eliminate some suboptimal Java 8 patterns * Fix style * Fix HttpEmitterTest.timeoutEmptyQueue() * Add DruidNodeDiscovery.Listener.nodeViewInitialized() calls in tests * Clarify code	2018-12-01 01:12:56 +01:00
Clint Wylie	efdec50847	bloom filter sql (#6502 ) * bloom filter sql support * docs * style fix * style fixes after rebase * use copied/patched bloomkfilter * remove context literal lookup function, changes from review * fix build * rename LookupOperatorConversion to QueryLookupOperatorConversion * remove doc * revert unintended change * add internal exception to bloom filter deserialization exception	2018-11-27 14:11:18 +08:00
Roman Leventov	87b96fb1fd	Add checkstyle rules about imports and empty lines between members (#6543 ) * Add checkstyle rules about imports and empty lines between members * Add suppressions * Update Eclipse import order * Add empty line * Fix StatsDEmitter	2018-11-20 12:42:15 +01:00
Gian Merlino	e9c3d3e651	SystemSchema: Fix data types for various fields. (#6642 ) * SystemSchema: Fix data types for various fields. - segments: start, end, partition_num - servers: plaintext_port, tls_port - tasks: plaintext_port, tls_port The declared and actual types did not match, but they must or else queries may generate ClassCastExceptions. Also adjusted some of the code for generating values to be more robust in the face of nulls or malformed strings. * Fix style.	2018-11-19 09:24:19 +08:00
Roman Leventov	8f3fe9cd02	Prohibit String.replace() and String.replaceAll(), fix and prohibit some toString()-related redundancies (#6607 ) * Prohibit String.replace() and String.replaceAll(), fix and prohibit some toString()-related redundancies * Fix bug * Replace checkstyle regexp with IntelliJ inspection	2018-11-15 13:21:34 -08:00

1 2 3 4 5

223 Commits