druid

mirror of https://github.com/apache/druid.git synced 2025-02-20 00:47:40 +00:00

Author	SHA1	Message	Date
Surekha	ea752ef562	Optimize overshadowed segments computation (#7595 ) * Move the overshadowed segment computation to SQLMetadataSegmentManager's poll * rename method in MetadataSegmentManager * Fix tests * PR comments * PR comments * PR comments * fix indentation * fix tests * fix test * add test for SegmentWithOvershadowedStatus serde format * PR comments * PR comments * fix test * remove snapshot updates outside poll * PR comments * PR comments * PR comments * removed unused import	2019-06-07 19:15:54 +02:00
Clint Wylie	12a1ecfc2b	allow sql lookup function to take advantage of injective lookups (#7655 )	2019-06-06 14:36:10 -07:00
Xue Yu	d482da6e9b	fix timestamp ceil lower bound bug (#7823 )	2019-06-04 01:16:31 -07:00
Eyal Yurman	69e9b8a464	Enables SQL by default. (#7808 )	2019-05-31 20:53:42 -07:00
Jihoon Son	7abfbb066a	Bump up snapshot version to 0.16.0 (#7802 )	2019-05-30 17:17:33 -07:00
Gian Merlino	58a571ccda	SQL: Use SegmentId instead of DataSegment as set/map keys. (#7796 ) Recently we've been talking about using SegmentIds as map keys rather than DataSegments, because its sense of equality is more well-defined. This is a refactor that does this in the druid-sql module, which mostly involves DruidSchema and some related classes. It should have no user-visible effects.	2019-05-30 12:58:36 -07:00
Gian Merlino	8649b8ab4c	SQL: Allow select-sort-project query shapes. (#7769 ) * SQL: Allow select-sort-project query shapes. Fixes #7768. Design changes: - In PartialDruidQuery, allow projection after select + sort by removing the SELECT_SORT query stage and instead allowing the SORT and SORT_PROJECT stages to apply either after aggregation or after a plain non-aggregating select. This is different from prior behavior, where SORT and SORT_PROJECT were only considered valid after aggregation stages. This logic change is in the "canAccept" method. - In DruidQuery, represent either kind of sorting with a single "Sorting" class (instead of DefaultLimitSpec). The Sorting class is still convertible into a DefaultLimitSpec, but is also convertible into the sorting parameters accepted by a Scan query. - In DruidQuery, represent post-select and post-sorting projections with a single "Projection" class. This obsoletes the SortProject and SelectProjection classes, and simplifies the DruidQuery by allowing us to move virtual-column and post-aggregator-creation logic into the new Projection class. - Split "DruidQuerySignature" into RowSignature and VirtualColumnRegistry. This effectively means that instead of having mutable and immutable versions of DruidQuerySignature, we instead of RowSignature (always immutable) and VirtualColumnRegistry (always mutable, but sometimes null). This change wasn't required, but IMO it this makes the logic involving them easier to follow, and makes it more clear when the virtual column registry is active and when it's not. Other changes: - ConvertBoundsToSelectors now just accepts a RowSignature, but we use the VirtualColumnRegistry.getFullRowSignature() method to get a signature that includes all columns, and therefore allows us to simplify the logic (no need to special-case virtual columns). - Add `__time` to the Scan column list if the query is ordering by time. * Remove unused import.	2019-05-30 12:56:29 -07:00
Roman Leventov	782863ed0f	Fix some problems reported by PVS-Studio (#7738 ) * Fix some problems reported by PVS-Studio * Address comments	2019-05-29 11:20:45 -07:00
Surekha	1fe0de1c96	Fix currSize attribute of historical server type (#7706 )	2019-05-21 11:55:58 -07:00
Gian Merlino	cbbce955de	SQL: Allow NULLs in place of optional arguments in many functions. (#7709 ) * SQL: Allow NULLs in place of optional arguments in many functions. Also adjust SQL docs to describe how to make time literals using TIME_PARSE (which is now possible in a nicer way). * Be less forbidden.	2019-05-21 11:54:34 -07:00
Gian Merlino	43c54385f6	SQL: Respect default timezone for TIME_PARSE and TIME_SHIFT. (#7704 ) * SQL: Respect default timezone for TIME_PARSE and TIME_SHIFT. They were inadvertently using UTC rather than the default timezone. Also, harmonize how time functions handle their parameters. * Fix tests * Add another TIME_SHIFT test.	2019-05-21 11:40:44 -07:00
Gian Merlino	69b2ea3ddc	SQL: TIME_EXTRACT should have 2 required operands. (#7710 ) * SQL: TIME_EXTRACT should have 2 required operands. Timestamp and time unit are both required. * Add regression test.	2019-05-21 11:32:36 -07:00
Gian Merlino	bcea05e4e8	SQL: Fix exception with OR of impossible filters. (#7707 ) Fixes #7671.	2019-05-21 11:32:09 -07:00
Xue Yu	dd7dace70a	Add TIMESTAMPDIFF sql support (#7695 ) * add timestampdiff sql support * feedback address	2019-05-21 08:05:38 -07:00
Gian Merlino	cb6ec2cab8	SqlOperatorConversion Javadoc fix. (#7713 ) Appears to be a copypasta error; the toDruidFilter method was referred to aggregations, but it's not handling aggregations.	2019-05-20 21:21:21 -07:00
Surekha	d3545f5086	Show all server types in sys.servers table (#7654 ) * update sys.servers table to show all servers * update docs * Fix integration test * modify test query for batch integration test * fix case in test queries * make the server_type lowercase * Apply suggestions from code review Co-Authored-By: Himanshu <g.himanshu@gmail.com> * Fix compilation from git suggestion * fix unit test	2019-05-15 16:54:02 -07:00
Xue Yu	35a1fbefea	upgrade avatica to 1.12.0 (#7644 )	2019-05-12 14:38:06 -07:00
Xue Yu	f7b8b57c3b	simpilfy DruidConvertletTable.java, remove STANDARD_CONVERTLET declare (#7632 )	2019-05-10 14:08:32 -07:00
Jonathan Wei	a013350018	Adjust required permissions for system schema (#7579 ) * Adjust required permissions for system schema * PR comments, fix current_size handling * Checkstyle * Set curr_size instead of current_size * Adjust information schema docs * Fix merge conflict * Update tests	2019-05-02 07:18:02 -07:00
Surekha	15d19f3059	Add is_overshadowed column to sys.segments table (#7425 ) * Add is_overshadowed column to sys.segments table * update docs * Rename class and variables * PR comments * PR comments * remove unused variables in MetadataResource * move constants together * add getFullyOvershadowedSegments method to ImmutableDruidDataSource * Fix compareTo of SegmentWithOvershadowedStatus * PR comment * PR comments * PR comments * PR comments * PR comments * fix issue with already consumed stream * minor refactoring * PR comments	2019-05-01 18:00:57 +02:00
Gian Merlino	c648775b5b	SQL: Remove "useFallback" feature. (#7567 ) This feature allows Calcite's Bindable interpreter to be bolted on top of Druid queries and table scans. I think it should be removed for a few reasons: 1. It is not recommended for production anyway, because it generates unscalable query plans (e.g. it will plan a join into two table scans and then try to do the entire join in memory on the broker). 2. It doesn't work with Druid-specific SQL functions, like TIME_FLOOR, REGEXP_EXTRACT, APPROX_COUNT_DISTINCT, etc. 3. It makes the SQL planning code needlessly complicated. With SQL coming out of experimental status soon, it's a good opportunity to remove this feature.	2019-04-28 18:26:44 -07:00
Xue Yu	2c8a71f883	Support LPAD and RPAD sql function (#7388 ) * lpad and rpad sql function * feedback address * feedback address * add doc and format * update docs	2019-04-22 14:51:32 -07:00
Clint Wylie	be65cca248	refactor druid-bloom-filter aggregators (#7496 ) * now with 100% more buffer * there can be only 1 * simplify * javadoc * clean up unused test method * fix exception message * style * why does style hate javadocs * review stuff * style :(	2019-04-18 11:54:06 -07:00
Kazuhito Takeuchi	7c19c92a81	Add ROUND function in druid-sql. (#7224 ) * Implement round function in druid-sql * Return value according to the type of argument * Fix codes for abnoraml inputs, updated math-expr.md * Fix assert text * Fix error messages and refactor codes * Fix compile error, update sql.md, refactor codes and format tests	2019-04-16 11:15:39 -07:00
Gian Merlino	721191635a	SQL: Include virtual columns used for filtering in ScanQuery. (#7472 ) PR #6902 introduced the ability to use virtual columns for filters, but they were being omitted from "scan" queries, so filters would refer to a null column instead of the intended virtual column.	2019-04-14 15:03:36 -07:00
Surekha	3e5dae9b96	Rename SegmentMetadataHolder to AvailableSegmentMetadata (#7372 )	2019-04-14 10:19:48 -07:00
Justin Borromeo	408e3e1b2a	Remove select execution code from SQL planner (#7416 ) * Removed select execution code from SQL planner * Update doc	2019-04-10 22:32:57 -07:00
Benedict Jin	2f64414ade	Add "REVERSE" / "REPEAT" / "RIGHT" / "LEFT" functions (#7334 ) * Add "REVERSE" / "REPEAT" / "RIGHT" / "LEFT" functions * Fix ImportOrder * Use RuntimeException instead of OutOfMemoryError according to "Effective Java" * Simplify * Patch suggestions	2019-04-10 11:46:29 +08:00
Justin Borromeo	799c66d9ac	Allow max rows and max segments for time-ordered scans to be overridden using the scan query JSON spec (#7413 ) * Initial changes * Fixed NPEs * Fixed failing spec test * Fixed failing Calcite test * Move configs to context * Validated and added docs * fixed weird indentation * Update default context vals in doc * Fixed allowable values	2019-04-07 20:12:52 -07:00
Clint Wylie	76b4a5c62e	refactor lookups to be more chill to router (#7222 ) * refactor lookups to be more chill to router * remove accidental change * fix and combine LookupIntrospectionResourceTest * fix inspection * rename RouterLookupModule to LookupSerdeModule and RouterLookupExtractorFactoryContainerProvider to NoopLookupExtractorFactoryContainerProvider * make comment generic * use ConfigResourceFilter instead of StateResourceFilter * fix indentation * unused import * another unused import * refactor some stuff into processing module, split up LookupModule.java classes into their own files	2019-04-05 14:49:41 -07:00
Gian Merlino	8c104a115c	SQL: Add STRING_FORMAT function. (#7327 )	2019-04-03 17:09:54 -04:00
Atul Mohan	c883c52cb1	Fix tests (#7401 )	2019-04-02 16:49:21 -07:00
Justin Borromeo	4584b5e139	SQL support for time-ordered scan (#7373 ) * Squashed commit of the following: commit 287a367f4170e7d0b3010d57788ea993688b9335 Author: Justin Borromeo <jborrome@edu.uwaterloo.ca> Date: Wed Mar 27 20:03:41 2019 -0700 Implemented Clint's recommendations commit 07503ea5c00892bf904c0e16e7062fadabcb7830 Author: Justin Borromeo <jborrome@edu.uwaterloo.ca> Date: Wed Mar 27 17:49:09 2019 -0700 doc fix commit 231a72e7d9c0f4bb2b3272134cf53fc8db8f0e73 Author: Justin Borromeo <jborrome@edu.uwaterloo.ca> Date: Wed Mar 27 17:38:20 2019 -0700 Modified sequence limit to accept longs and added test for long limits commit 1df50de32137961d949c1aaa4e4791f6edfb3d77 Merge: 480e932fd c7fea6ac8 Author: Justin Borromeo <jborrome@edu.uwaterloo.ca> Date: Tue Mar 26 15:23:01 2019 -0700 Merge branch 'master' into 6088-Time-Ordering-On-Scans-N-Way-Merge commit 480e932fdf02ef85ba81181deb865d9977dfed24 Author: Justin Borromeo <jborrome@edu.uwaterloo.ca> Date: Tue Mar 26 14:58:04 2019 -0700 Checkstyle and doc update commit 487f31fcf63a5e1fa9e802212b62206aec47fe25 Author: Justin Borromeo <jborrome@edu.uwaterloo.ca> Date: Tue Mar 26 14:39:25 2019 -0700 Refixed regression commit fb858efbb75218bb80b8c77effb2456554aa57b2 Author: Justin Borromeo <jborrome@edu.uwaterloo.ca> Date: Tue Mar 26 13:14:48 2019 -0700 Added test for n-way merge commit 376e8bf90610d43d2c7b278bf64525cab80267c5 Author: Justin Borromeo <jborrome@edu.uwaterloo.ca> Date: Tue Mar 26 11:42:54 2019 -0700 Refactor n-way merge commit 8a6bb1127c1814470424da2e9d6bfdd55e726199 Author: Justin Borromeo <jborrome@edu.uwaterloo.ca> Date: Mon Mar 25 17:17:41 2019 -0700 Fix docs and flipped boolean in ScanQueryLimitRowIterator commit 35692680fc7aba21c498a92307ef082a581cb23a Author: Justin Borromeo <jborrome@edu.uwaterloo.ca> Date: Mon Mar 25 16:15:49 2019 -0700 Fix bug messing up count of rows commit 219af478c8ec243700973e616c5be556a83422e2 Author: Justin Borromeo <jborrome@edu.uwaterloo.ca> Date: Mon Mar 25 15:57:55 2019 -0700 Fix bug in numRowsScanned commit da4fc664031debae1dc3b4a0190125e979564aac Author: Justin Borromeo <jborrome@edu.uwaterloo.ca> Date: Mon Mar 25 15:19:45 2019 -0700 Check type of segment spec before using for time ordering commit b822fc73dfba7f69c7e960bb95b31cab8d27ef25 Author: Justin Borromeo <jborrome@edu.uwaterloo.ca> Date: Mon Mar 25 13:19:02 2019 -0700 Revert "Merge branch '6088-Time-Ordering-On-Scans-N-Way-Merge' of github.com:justinborromeo/incubator-druid into 6088-Time-Ordering-On-Scans-N-Way-Merge" This reverts commit 57033f36df6e3ef887e5f0399ad74bb091306de8, reversing changes made to 8f01d8dd16f40d10c60519ca0ec0d2e6b2dde941. commit 57033f36df6e3ef887e5f0399ad74bb091306de8 Merge: 8f01d8dd1 86d9730fc Author: Justin Borromeo <jborrome@edu.uwaterloo.ca> Date: Mon Mar 25 13:13:52 2019 -0700 Merge branch '6088-Time-Ordering-On-Scans-N-Way-Merge' of github.com:justinborromeo/incubator-druid into 6088-Time-Ordering-On-Scans-N-Way-Merge commit 8f01d8dd16f40d10c60519ca0ec0d2e6b2dde941 Author: Justin Borromeo <jborrome@edu.uwaterloo.ca> Date: Mon Mar 25 13:13:32 2019 -0700 Revert "Fixed failing tests -> allow usage of all types of segment spec" This reverts commit ec470288c7b725f5310bcf69d1db9f85ff509c8d. commit ec470288c7b725f5310bcf69d1db9f85ff509c8d Author: Justin Borromeo <jborrome@edu.uwaterloo.ca> Date: Mon Mar 25 11:01:35 2019 -0700 Fixed failing tests -> allow usage of all types of segment spec commit 86d9730fc9f241b3010b123a45b1fc38a206a9af Author: Justin Borromeo <jborrome@edu.uwaterloo.ca> Date: Mon Mar 25 11:01:35 2019 -0700 Fixed failing tests -> allow usage of all types of segment spec commit 8b3b6b51ed0d3bc3c937620d5b92096998e32080 Author: Justin Borromeo <jborrome@edu.uwaterloo.ca> Date: Fri Mar 22 16:01:56 2019 -0700 Nit comment commit a87d02127c72aa5e307af94b12b6be25150349be Author: Justin Borromeo <jborrome@edu.uwaterloo.ca> Date: Fri Mar 22 15:54:42 2019 -0700 Fix checkstyle and test commit 62dcedacdeeed570134e8b5185633b207e91a547 Author: Justin Borromeo <jborrome@edu.uwaterloo.ca> Date: Fri Mar 22 15:30:41 2019 -0700 More comments commit 1b46b58aeccf13adc516a1d94054a98efc32184c Author: Justin Borromeo <jborrome@edu.uwaterloo.ca> Date: Fri Mar 22 15:19:52 2019 -0700 Added a bit of docs commit 49472162b7fc0879159866c3736e192fc88837a4 Author: Justin Borromeo <jborrome@edu.uwaterloo.ca> Date: Fri Mar 22 10:27:41 2019 -0700 Rename segment limit -> segment partitions limit commit 43d490cc3ae697d0a61159ed6ae06906006cdf31 Author: Justin Borromeo <jborrome@edu.uwaterloo.ca> Date: Thu Mar 21 13:16:58 2019 -0700 Optimized n-way merge strategy commit 42f5246b8d0c1879c2dc45334966bf52f543ea74 Author: Justin Borromeo <jborrome@edu.uwaterloo.ca> Date: Wed Mar 20 17:40:19 2019 -0700 Smarter limiting for pQueue method commit 4823dab895770a87356fe2ae4e9858bb4ba03fc3 Author: Justin Borromeo <jborrome@edu.uwaterloo.ca> Date: Wed Mar 20 16:05:53 2019 -0700 Finish rename commit 2528a5614267c48714abdb30fd7a2ccdb61b802d Author: Justin Borromeo <jborrome@edu.uwaterloo.ca> Date: Mon Mar 18 14:00:50 2019 -0700 Renaming commit 7bfa77d3c177be42d0db0b0bc3c19f9ef536ffeb Merge: a032c46ee 7e49d4739 Author: Justin Borromeo <jborrome@edu.uwaterloo.ca> Date: Tue Mar 12 16:57:45 2019 -0700 Merge branch 'Update-Query-Interrupted-Exception' into 6088-Time-Ordering-On-Scans-N-Way-Merge commit 7e49d47391d17b411b0620794e503592d8f37481 Author: Justin Borromeo <jborrome@edu.uwaterloo.ca> Date: Tue Mar 12 16:51:25 2019 -0700 Added error message for UOE commit a032c46ee09cd80b78f25d0da51f5179774aa75f Author: Justin Borromeo <jborrome@edu.uwaterloo.ca> Date: Tue Mar 12 16:47:17 2019 -0700 Updated error message commit 57b568265488066c046f225c1982dba85e8a64ba Author: Justin Borromeo <jborrome@edu.uwaterloo.ca> Date: Tue Mar 12 12:44:02 2019 -0700 Fixed tests commit 45e95bb1f40d50ab3a0a745d2b5fca34c3f53a82 Author: Justin Borromeo <jborrome@edu.uwaterloo.ca> Date: Tue Mar 12 11:09:08 2019 -0700 Optimization commit cce917ab846198706ec8177a91869f9aa43e0525 Author: Justin Borromeo <jborrome@edu.uwaterloo.ca> Date: Fri Mar 8 14:11:07 2019 -0800 Checkstyle fix commit 73f4038068f2e30cb3487cc175730f3b97c5c8d2 Author: Justin Borromeo <jborrome@edu.uwaterloo.ca> Date: Thu Mar 7 18:40:00 2019 -0800 Applied Jon's recommended changes commit fb966def8335e6808f0fe5d2d6a122dcd28f2355 Author: Justin Borromeo <jborrome@edu.uwaterloo.ca> Date: Thu Mar 7 11:03:01 2019 -0800 Sorry, checkstyle commit 6dc53b311c568e29a6937fdd5f17a5623d14533f Author: Justin Borromeo <jborrome@edu.uwaterloo.ca> Date: Wed Mar 6 10:34:13 2019 -0800 Improved test and appeased TeamCity commit 35c96d355726cf5d238435655fecbfe19ea8ddb6 Author: Justin Borromeo <jborrome@edu.uwaterloo.ca> Date: Mon Mar 4 16:00:44 2019 -0800 Checkstyle fix commit 2d1978d5713187561a534c08eba51e383df66ce7 Merge: 83ec3fe1f 3398d3982 Author: Justin Borromeo <jborrome@edu.uwaterloo.ca> Date: Mon Mar 4 15:24:49 2019 -0800 Merge branch 'master' into 6088-Time-Ordering-On-Scans-N-Way-Merge commit 83ec3fe1f13c384aca52ceef0ba03b300b03d8d9 Author: Justin Borromeo <jborrome@edu.uwaterloo.ca> Date: Fri Mar 1 13:40:22 2019 -0800 Nit-change on javadoc commit 47c970b5f476e5bfe5e03aa798f314f59aeb67db Author: Justin Borromeo <jborrome@edu.uwaterloo.ca> Date: Fri Mar 1 13:38:29 2019 -0800 Wrote tests and added Javadoc commit 5ff59f5ca6c8058c04e500662b3691a4910aa842 Author: Justin Borromeo <jborrome@edu.uwaterloo.ca> Date: Thu Feb 28 15:58:20 2019 -0800 Reset config commit 806166f9777cccae5e10eabbb256c5e33b0e13f7 Author: Justin Borromeo <jborrome@edu.uwaterloo.ca> Date: Thu Feb 28 15:49:07 2019 -0800 Fixed failing tests commit de83b11a1bb24a0ae964240d9cb1ed17ea4a6c26 Author: Justin Borromeo <jborrome@edu.uwaterloo.ca> Date: Tue Feb 26 16:40:48 2019 -0800 Fixed mistakes in merge commit 5bd0e1a32cec1a0e4dadd74dd530d13341ab7349 Merge: 18cce9a64 9fa649b3b Author: Justin Borromeo <jborrome@edu.uwaterloo.ca> Date: Tue Feb 26 16:39:16 2019 -0800 Merge branch 'master' into 6088-Time-Ordering-On-Scans-N-Way-Merge commit 18cce9a646139a57004ef4eccef8077c9775e992 Author: Justin Borromeo <jborrome@edu.uwaterloo.ca> Date: Tue Feb 26 13:16:44 2019 -0800 Change so batching only occurs on broker for time-ordered scans Restricted batching to broker for time-ordered queries and adjusted tests Formatting Cleanup commit 451e2b43652020d6acb8b8db113fb34db0f50517 Author: Justin Borromeo <jborrome@edu.uwaterloo.ca> Date: Tue Feb 26 11:14:27 2019 -0800 WIP commit 69b24bd851d721592324bcbeec1c4229ad9ff462 Merge: 763c43df7 417b9f2fe Author: Justin Borromeo <jborrome@edu.uwaterloo.ca> Date: Fri Feb 22 18:13:26 2019 -0800 Merge branch 'master' into 6088-Time-Ordering-On-Scans-N-Way-Merge commit 763c43df7e99d4ab000f038a7c1b9ef98b479138 Author: Justin Borromeo <jborrome@edu.uwaterloo.ca> Date: Fri Feb 22 18:07:06 2019 -0800 Multi-historical setup works commit 06a5218917bca0716b98c32c07415a7271711431 Author: Justin Borromeo <jborrome@edu.uwaterloo.ca> Date: Fri Feb 22 16:59:57 2019 -0800 Wrote docs commit 3b923dac9cc82475795ee2f7691e6f96249560aa Author: Justin Borromeo <jborrome@edu.uwaterloo.ca> Date: Fri Feb 22 14:03:22 2019 -0800 Fixed bug introduced by replacing deque with list commit 023538d83117086647c69d5030f2e8cb3e039558 Author: Justin Borromeo <jborrome@edu.uwaterloo.ca> Date: Fri Feb 22 13:30:08 2019 -0800 Sequence stuff is so dirty :( commit e1fc2955d361676eb6721ad31defd96d47fab999 Author: Justin Borromeo <jborrome@edu.uwaterloo.ca> Date: Fri Feb 22 10:39:59 2019 -0800 WIP commit f57ff253fa659cbb5aa09b7c9bf03d8e7670b865 Author: Justin Borromeo <jborrome@edu.uwaterloo.ca> Date: Thu Feb 21 18:22:06 2019 -0800 Ordering is correct on n-way merge -> still need to batch events into ScanResultValues commit 1813a5472c791509ba903f734b40b6102079876a Author: Justin Borromeo <jborrome@edu.uwaterloo.ca> Date: Thu Feb 21 17:06:18 2019 -0800 Cleanup commit f83e99655d11247f44018e0e5d36bd6eac1fb2a6 Author: Justin Borromeo <jborrome@edu.uwaterloo.ca> Date: Thu Feb 21 16:56:36 2019 -0800 Refactor and pQueue works commit b13ff624a92a7e740eb1f74aa40c7a72165b9708 Author: Justin Borromeo <jborrome@edu.uwaterloo.ca> Date: Thu Feb 21 15:13:33 2019 -0800 Set up time ordering strategy decision tree commit fba6b022f0395cc297e3b3726f817c986f97010b Author: Justin Borromeo <jborrome@edu.uwaterloo.ca> Date: Thu Feb 21 15:08:27 2019 -0800 Added config and get # of segments commit c9142e721c7ed824a54de7a160230cd959bb906d Merge: cd489a020 554b0142c Author: Justin Borromeo <jborrome@edu.uwaterloo.ca> Date: Wed Feb 20 10:12:50 2019 -0800 Merge branch 'master' into 6088-Time-Ordering-On-Scans-V2 commit cd489a0208b0cfc475a34440caa1c1e99d22a281 Author: Justin Borromeo <jborrome@edu.uwaterloo.ca> Date: Wed Feb 20 00:16:48 2019 -0800 Fixed failing test due to null resultFormat commit 7baeade8328776244e72a3cb5f2efb59111cf58b Author: Justin Borromeo <jborrome@edu.uwaterloo.ca> Date: Tue Feb 19 17:52:06 2019 -0800 Changes based on Gian's comments commit 35150fe1a63c5143f564c4435461929e619a0de2 Author: Justin Borromeo <jborrome@edu.uwaterloo.ca> Date: Fri Feb 15 15:57:53 2019 -0800 Small changes commit 4e69276d57de4a9042b927efa5a864411aedacb4 Author: Justin Borromeo <jborrome@edu.uwaterloo.ca> Date: Mon Feb 11 12:09:54 2019 -0800 Removed unused import to satisfy PMD check commit ecb0f483a9525ffc2844cb01a0daafe6bc4d2161 Author: Justin Borromeo <jborrome@edu.uwaterloo.ca> Date: Mon Feb 11 10:37:11 2019 -0800 improved doc commit f0eddee66598095a767a1570516c5af59e58e2f6 Author: Justin Borromeo <jborrome@edu.uwaterloo.ca> Date: Mon Feb 11 10:18:45 2019 -0800 Added more javadoc commit 5f92dd7325aeff0b2e3f87003263e083ba2b427d Author: Justin Borromeo <jborrome@edu.uwaterloo.ca> Date: Mon Feb 11 10:05:58 2019 -0800 Unused import commit 93e1636287f45d38c80f275e4644c0b3222c65e7 Author: Justin Borromeo <jborrome@edu.uwaterloo.ca> Date: Mon Feb 11 10:03:14 2019 -0800 Added javadoc on ScanResultValueTimestampComparator commit 134041c47965a8a199862ca33ef2119e29f67287 Author: Justin Borromeo <jborrome@edu.uwaterloo.ca> Date: Fri Feb 8 13:13:54 2019 -0800 Renamed sort function commit 2e3577cd3d7b43e140d36aad944536f49287fbfa Author: Justin Borromeo <jborrome@edu.uwaterloo.ca> Date: Thu Feb 7 13:01:25 2019 -0800 Fixed benchmark queries commit d3b335af42602a771063bd8a63c89acf5c715938 Author: Justin Borromeo <jborrome@edu.uwaterloo.ca> Date: Thu Feb 7 11:08:07 2019 -0800 added all query types to scan benchmark commit ab00eade9f0b8e8642da40905214653c04cba4d4 Author: Justin Borromeo <jborrome@edu.uwaterloo.ca> Date: Thu Feb 7 09:42:48 2019 -0800 Kicking travis with change to benchmark param commit b432beaf84de5b363454fd8058ff653a097c713d Author: Justin Borromeo <jborrome@edu.uwaterloo.ca> Date: Wed Feb 6 17:45:59 2019 -0800 Fixed failing calcite tests commit b2c8c77ad4ee5a9a273ee7a7870fb2d9b0ec9dd4 Author: Justin Borromeo <jborrome@edu.uwaterloo.ca> Date: Wed Feb 6 17:39:48 2019 -0800 Fixing tests WIP commit 85e72a614ef49736d1142ce82d26b533b609c911 Author: Justin Borromeo <jborrome@edu.uwaterloo.ca> Date: Wed Feb 6 15:42:02 2019 -0800 Set to spaces over tabs commit 7e872a8ebcea0d3a141addd122dd9f8b6629ead6 Author: Justin Borromeo <jborrome@edu.uwaterloo.ca> Date: Wed Feb 6 15:36:24 2019 -0800 Created an error message for when someone tries to time order a result set > threshold limit commit e8a4b490443b1efe6c70f964b1757bf17a64e9f6 Merge: 305876a43 8e3a58f72 Author: Justin Borromeo <jborrome@edu.uwaterloo.ca> Date: Wed Feb 6 15:05:11 2019 -0800 Merge branch 'master' into 6088-Time-Ordering-On-Scans-V2 commit 305876a4346c292296db623c1fcea688a29c0bb8 Author: Justin Borromeo <jborrome@edu.uwaterloo.ca> Date: Wed Feb 6 15:02:02 2019 -0800 nit commit 8212a21cafc2ed4002607362f0661f4b5f6bef9d Author: Justin Borromeo <jborrome@edu.uwaterloo.ca> Date: Wed Feb 6 14:40:35 2019 -0800 Improved conciseness commit 10b5e0ca93a529d1b0e018c11fafc9c63071b8cd Author: Justin Borromeo <jborrome@edu.uwaterloo.ca> Date: Wed Feb 6 13:42:12 2019 -0800 . commit dfe4aa9681d04b8a31dcc1486e0447f29f6eb7bd Author: Justin Borromeo <jborrome@edu.uwaterloo.ca> Date: Wed Feb 6 13:41:18 2019 -0800 Fixed codestyle and forbidden API errors commit 148939e88bfff021356bb532e4246e4c8e8ac333 Merge: 4f51024b3 5edbe2ae1 Author: Justin Borromeo <jborrome@edu.uwaterloo.ca> Date: Wed Feb 6 13:26:17 2019 -0800 Merge branch '6088-Create-Scan-Benchmark' into 6088-Time-Ordering-On-Scans-V2 commit 5edbe2ae12648b527e3e97b516127bf4b65196a3 Merge: 60b7684db 315ccb76b Author: Justin Borromeo <jborrome@edu.uwaterloo.ca> Date: Wed Feb 6 13:18:55 2019 -0800 Merge github.com:apache/incubator-druid into 6088-Create-Scan-Benchmark commit 60b7684db725387b4d843385d9c61d50f2ed6744 Author: Justin Borromeo <jborrome@edu.uwaterloo.ca> Date: Wed Feb 6 13:02:13 2019 -0800 Committing a param change to kick teamcity commit 4f51024b318bf744eddb9d2f9638f7590872cf14 Author: Justin Borromeo <jborrome@edu.uwaterloo.ca> Date: Wed Feb 6 12:08:12 2019 -0800 Wrote more tests for scan result value sort commit 8b7d5f50818b00730965a55b1bf8ed27860bd6a4 Author: Justin Borromeo <jborrome@edu.uwaterloo.ca> Date: Wed Feb 6 11:55:09 2019 -0800 Wrote tests for heapsort scan result values and fixed bug where iterator wasn't returning elements in correct order commit b6d4df3864e3910fa406dcc83f6644f45f496c5f Author: Justin Borromeo <jborrome@edu.uwaterloo.ca> Date: Tue Feb 5 16:45:20 2019 -0800 Decrease segment size for less memory usage commit d1a1793f36d4c9c910f84318f2bbbd355533c977 Author: Justin Borromeo <jborrome@edu.uwaterloo.ca> Date: Tue Feb 5 12:40:26 2019 -0800 nit commit 7deb06f6df47c55469a77a92622509ce88150ad5 Merge: b7d3a4900 86c5eee13 Author: Justin Borromeo <jborrome@edu.uwaterloo.ca> Date: Tue Feb 5 10:53:38 2019 -0800 Merge branch '6088-Create-Scan-Benchmark' into 6088-Time-Ordering-On-Scans-V2 commit 86c5eee13b6ce18b33c723cd0c4e464eaf41f010 Author: Justin Borromeo <jborrome@edu.uwaterloo.ca> Date: Tue Feb 5 10:31:27 2019 -0800 Broke some long lines into two lines commit b7d3a4900afb2b56b5e2667c2d37fa4872c67219 Merge: 796083f2b 8bc5eaa90 Author: Justin Borromeo <jborrome@edu.uwaterloo.ca> Date: Tue Feb 5 10:23:32 2019 -0800 Merge branch 'master' into 6088-Time-Ordering-On-Scans-V2 commit 737a83321d74cd0b1f7b4ca800509c36056d08ff Author: Justin Borromeo <jborrome@edu.uwaterloo.ca> Date: Tue Feb 5 10:15:32 2019 -0800 Made Jon's changes and removed TODOs commit 796083f2bb188421f68858111bb39c988cb2f71c Author: Justin Borromeo <jborrome@edu.uwaterloo.ca> Date: Mon Feb 4 15:37:42 2019 -0800 Benchmark param change commit 20c36644dbbf46df1a9209a635e661c01aeec627 Author: Justin Borromeo <jborrome@edu.uwaterloo.ca> Date: Mon Feb 4 15:36:35 2019 -0800 More param changes commit 9e6e71616bdcd9a7eea56e4bc1ef869c08bcf83c Author: Justin Borromeo <jborrome@edu.uwaterloo.ca> Date: Mon Feb 4 15:31:21 2019 -0800 Changed benchmark params commit 01b25ed11293f472dac78d4f793f2941c3b22a18 Author: Justin Borromeo <jborrome@edu.uwaterloo.ca> Date: Mon Feb 4 14:36:18 2019 -0800 Added time ordering to the scan benchmark commit 432acaf08575c451ea02e8ec8d6318678dcf20cb Author: Justin Borromeo <jborrome@edu.uwaterloo.ca> Date: Mon Feb 4 12:03:14 2019 -0800 Change number of benchmark iterations commit 12e51a272124c7a75628fe5b2f65ddc00e34ba27 Author: Justin Borromeo <jborrome@edu.uwaterloo.ca> Date: Mon Feb 4 12:02:13 2019 -0800 Added TimestampComparator tests commit e66339cd76cdb7f08a291e8488e3415518f3df63 Author: Justin Borromeo <jborrome@edu.uwaterloo.ca> Date: Mon Feb 4 10:56:41 2019 -0800 Remove todos commit ad731a362b465e9b4ca0c9ad7050fc6555606d52 Author: Justin Borromeo <jborrome@edu.uwaterloo.ca> Date: Mon Feb 4 10:55:56 2019 -0800 Change benchmark commit 989bd2d50e2419a715426b8fe903398d20429ff5 Merge: 7b5847139 26930f8d2 Author: Justin Borromeo <jborrome@edu.uwaterloo.ca> Date: Mon Feb 4 10:46:38 2019 -0800 Merge branch '6088-Create-Scan-Benchmark' into 6088-Time-Ordering-On-Scans-V2 commit 7b584713946b538d15da591e306ca4c0a7a378e3 Author: Justin Borromeo <jborrome@edu.uwaterloo.ca> Date: Sat Feb 2 03:48:18 2019 -0800 Licensing stuff commit 79e8319383eddbb49ecb4c1785dcd3eed14a0634 Author: Justin Borromeo <jborrome@edu.uwaterloo.ca> Date: Fri Feb 1 18:22:58 2019 -0800 Move ScanResultValue timestamp comparator to a separate class for testing commit 7a6080f636ab2ead5ff85a29ea6b9cc04d93b353 Author: Justin Borromeo <jborrome@edu.uwaterloo.ca> Date: Fri Feb 1 18:00:58 2019 -0800 Stuff for time-ordered scan query commit 26930f8d2021d1d62322c54e0ec35e260137ab1d Author: Justin Borromeo <jborrome@edu.uwaterloo.ca> Date: Fri Feb 1 16:38:49 2019 -0800 It runs. commit dd4ec1ac9c1194144e3ec98b811adc59598c8d8c Author: Justin Borromeo <jborrome@edu.uwaterloo.ca> Date: Fri Feb 1 15:12:17 2019 -0800 Need to form queries commit dba6e492a067b9bb4f77f3db4b19c340f85ef54f Merge: 10e57d5f9 7d4cc2873 Author: Justin Borromeo <jborrome@edu.uwaterloo.ca> Date: Fri Feb 1 14:13:39 2019 -0800 Merge branch 'master' into 6088-Create-Scan-Benchmark commit 10e57d5f9ed003e032c82240045125002903a5bb Author: Justin Borromeo <jborrome@edu.uwaterloo.ca> Date: Fri Feb 1 14:04:13 2019 -0800 Moved Scan Builder to Druids class and started on Scan Benchmark setup * Changed SQL planning to use scan over select * Fixed some bugs * Removed unused imports * Updated calcite query test and test segment walker * Fixed formatting recommendations	2019-04-02 15:46:01 -07:00
Xue Yu	78fd5aff21	support radians and degrees in sql (#7336 ) * support radians and degrees in sql * update test case	2019-04-02 12:47:49 -07:00
Justin Borromeo	ad7862c58a	Time Ordering On Scans (#7133 ) * Moved Scan Builder to Druids class and started on Scan Benchmark setup * Need to form queries * It runs. * Stuff for time-ordered scan query * Move ScanResultValue timestamp comparator to a separate class for testing * Licensing stuff * Change benchmark * Remove todos * Added TimestampComparator tests * Change number of benchmark iterations * Added time ordering to the scan benchmark * Changed benchmark params * More param changes * Benchmark param change * Made Jon's changes and removed TODOs * Broke some long lines into two lines * nit * Decrease segment size for less memory usage * Wrote tests for heapsort scan result values and fixed bug where iterator wasn't returning elements in correct order * Wrote more tests for scan result value sort * Committing a param change to kick teamcity * Fixed codestyle and forbidden API errors * . * Improved conciseness * nit * Created an error message for when someone tries to time order a result set > threshold limit * Set to spaces over tabs * Fixing tests WIP * Fixed failing calcite tests * Kicking travis with change to benchmark param * added all query types to scan benchmark * Fixed benchmark queries * Renamed sort function * Added javadoc on ScanResultValueTimestampComparator * Unused import * Added more javadoc * improved doc * Removed unused import to satisfy PMD check * Small changes * Changes based on Gian's comments * Fixed failing test due to null resultFormat * Added config and get # of segments * Set up time ordering strategy decision tree * Refactor and pQueue works * Cleanup * Ordering is correct on n-way merge -> still need to batch events into ScanResultValues * WIP * Sequence stuff is so dirty :( * Fixed bug introduced by replacing deque with list * Wrote docs * Multi-historical setup works * WIP * Change so batching only occurs on broker for time-ordered scans Restricted batching to broker for time-ordered queries and adjusted tests Formatting Cleanup * Fixed mistakes in merge * Fixed failing tests * Reset config * Wrote tests and added Javadoc * Nit-change on javadoc * Checkstyle fix * Improved test and appeased TeamCity * Sorry, checkstyle * Applied Jon's recommended changes * Checkstyle fix * Optimization * Fixed tests * Updated error message * Added error message for UOE * Renaming * Finish rename * Smarter limiting for pQueue method * Optimized n-way merge strategy * Rename segment limit -> segment partitions limit * Added a bit of docs * More comments * Fix checkstyle and test * Nit comment * Fixed failing tests -> allow usage of all types of segment spec * Fixed failing tests -> allow usage of all types of segment spec * Revert "Fixed failing tests -> allow usage of all types of segment spec" This reverts commit ec470288c7b725f5310bcf69d1db9f85ff509c8d. * Revert "Merge branch '6088-Time-Ordering-On-Scans-N-Way-Merge' of github.com:justinborromeo/incubator-druid into 6088-Time-Ordering-On-Scans-N-Way-Merge" This reverts commit 57033f36df6e3ef887e5f0399ad74bb091306de8, reversing changes made to 8f01d8dd16f40d10c60519ca0ec0d2e6b2dde941. * Check type of segment spec before using for time ordering * Fix bug in numRowsScanned * Fix bug messing up count of rows * Fix docs and flipped boolean in ScanQueryLimitRowIterator * Refactor n-way merge * Added test for n-way merge * Refixed regression * Checkstyle and doc update * Modified sequence limit to accept longs and added test for long limits * doc fix * Implemented Clint's recommendations	2019-03-28 14:37:09 -07:00
Roman Leventov	bca40dcdaf	Fix some IntelliJ inspections (#7273 ) Prepare TeamCity for IntelliJ 2018.3.1 upgrade. Mostly removed redundant exceptions declarations in `throws` clauses.	2019-03-25 21:11:01 -03:00
Gian Merlino	4ca5fe0f60	SQL: Add PARSE_LONG function. (#7326 ) * SQL: Add PARSE_LONG function. * Fix test.	2019-03-22 15:40:10 -07:00
Roman Leventov	dfd27e00c0	Avoid many unnecessary materializations of collections of 'all segments in cluster' cardinality (#7185 ) * Avoid many unnecessary materializations of collections of 'all segments in cluster' cardinality * Fix DruidCoordinatorTest; Renamed DruidCoordinator.getReplicationStatus() to computeUnderReplicationCountsPerDataSourcePerTier() * More Javadocs, typos, refactor DruidCoordinatorRuntimeParams.createAvailableSegmentsSet() * Style * typo * Disable StaticPseudoFunctionalStyleMethod inspection because of too much false positives * Fixes	2019-03-19 18:22:56 -03:00
Furkan KAMACI	7ada1c49f9	Prohibit Throwables.propagate() (#7121 ) * Throw caught exception. * Throw caught exceptions. * Related checkstyle rule is added to prevent further bugs. * RuntimeException() is used instead of Throwables.propagate(). * Missing import is added. * Throwables are propogated if possible. * Throwables are propogated if possible. * Throwables are propogated if possible. * Throwables are propogated if possible. * * Checkstyle definition is improved. * Throwables.propagate() usages are removed. * Checkstyle pattern is changed for only scanning "Throwables.propagate(" instead of checking lookbehind. * Throwable is kept before firing a Runtime Exception. * Fix unused assignments.	2019-03-14 18:28:33 -03:00
Clint Wylie	d7ba19d477	sql, filters, and virtual columns (#6902 ) * refactor sql planning to re-use expression virtual columns when possible when constructing a DruidQuery, allowing virtual columns to be defined in filter expressions, and making resulting native druid queries more concise. also minor refactor of built-in sql aggregators to maximize code re-use * fix it * fix it in the right place * fixup for base64 stuff * fixup tests * fix merge conflict on import order * fixup * fix imports * fix tests * review comments * refactor * re-arrange * better javadoc * fixup merge * fixup tests * fix accidental changes	2019-03-11 11:37:58 -07:00
Xue Yu	65118277a3	support sin cos etc trigonometric function in sql (#7182 ) * support triangle function in sql * feedback address	2019-03-04 19:18:22 -08:00
Himanshu Pandey	8b803cbc22	Added checkstyle for "Methods starting with Capital Letters" (#7118 ) * Added checkstyle for "Methods starting with Capital Letters" and changed the method names violating this. * Un-abbreviate the method names in the calcite tests * Fixed checkstyle errors * Changed asserts position in the code	2019-02-23 20:10:31 -08:00
Surekha	02ef14f262	Fix num_rows in sys.segments (#6888 ) * Fix the bug with num_rows in sys.segments * Fix segmentMetadataInfo update in DruidSchema * Add numRows to SegmentMetadataHolder builder's constructor, so it's not overwritten * Rename SegSegmentSignature to setSegmentMetadataHolder and fix it so nested map is appended instead of recreated * Replace Map<String, Set<String>> segmentServerMap with Set<String> for num_replica * Remove unnecessary code and update test * Add unit test for num_rows * PR comments * change access modifier to default package level * minor changes to comments * PR comments	2019-02-11 16:21:19 -08:00
Jonathan Wei	fafbc4a80e	Set version to 0.15.0-incubating-SNAPSHOT (#7014 )	2019-02-07 14:02:52 -08:00
Justin Borromeo	6723243ed2	Create Scan Benchmark (#6986 ) * Moved Scan Builder to Druids class and started on Scan Benchmark setup * Need to form queries * It runs. * Remove todos * Change number of benchmark iterations * Changed benchmark params * More param changes * Made Jon's changes and removed TODOs * Broke some long lines into two lines * Decrease segment size for less memory usage * Committing a param change to kick teamcity	2019-02-06 14:45:01 -08:00
Surekha	ef451d3603	Add null checks in DruidSchema (#6830 ) * Add null checks in DruidSchema * Add unit tests * Add VisibleForTesting annotation * PR comments * unused import	2019-02-05 13:42:20 -08:00
Jonathan Wei	8bc5eaa908	Set version to 0.14.0-incubating-SNAPSHOT (#7003 )	2019-02-04 19:36:20 -08:00
Roman Leventov	0e926e8652	Prohibit assigning concurrent maps into Map-typed variables and fields and fix a race condition in CoordinatorRuleManager (#6898 ) * Prohibit assigning concurrent maps into Map-types variables and fields; Fix a race condition in CoordinatorRuleManager; improve logic in DirectDruidClient and ResourcePool * Enforce that if compute(), computeIfAbsent(), computeIfPresent() or merge() is called on a ConcurrentHashMap, it's stored in a ConcurrentHashMap-typed variable, not ConcurrentMap; add comments explaining get()-before-computeIfAbsent() optimization; refactor Counters; fix a race condition in Intialization.java * Remove unnecessary comment * Checkstyle * Fix getFromExtensions() * Add a reference to the comment about guarded computeIfAbsent() optimization; IdentityHashMap optimization * Fix UriCacheGeneratorTest * Workaround issue with MaterializedViewQueryQueryToolChest * Strengthen Appenderator's contract regarding concurrency	2019-02-04 09:18:12 -08:00
Surekha	7baa33049c	Introduce published segment cache in broker (#6901 ) * Add published segment cache in broker * Change the DataSegment interner so it's not based on DataSEgment's equals only and size is preserved if set * Added a trueEquals to DataSegment class * Use separate interner for realtime and historical segments * Remove trueEquals as it's not used anymore, change log message * PR comments * PR comments * Fix tests * PR comments * Few more modification to * change the coordinator api * removeall segments at once from MetadataSegmentView in order to serve a more consistent view of published segments * Change the poll behaviour to avoid multiple poll execution at same time * minor changes * PR comments * PR comments * Make the segment cache in broker off by default * Added a config to PlannerConfig * Moved MetadataSegmentView to sql module * Add doc for new planner config * Update documentation * PR comments * some more changes * PR comments * fix test * remove unintentional change, whether to synchronize on lifecycleLock is still in discussion in PR * minor changes * some changes to initialization * use pollPeriodInMS * Add boolean cachePopulated to check if first poll succeeds * Remove poll from start() * take the log message out of condition in stop()	2019-02-02 22:27:13 -08:00
Clint Wylie	7a5827e12e	bloom filter sql aggregator (#6950 ) * adds sql aggregator for bloom filter, adds complex value serde for sql results * fix tests * checkstyle * fix copy-paste	2019-02-01 13:54:46 -08:00

1 2 3 4 5

241 Commits