druid

Commit Graph

Author	SHA1	Message	Date
Gian Merlino	bc671ac436	SQL: Fix ordering of sort, sortProject in DruidSemiJoin. (#6769 ) They were added in the wrong order, leading to this error message when evaluating rules: "Cannot move from stage[AGGREGATE] to stage[SORT_PROJECT]"	2019-01-03 10:36:28 -08:00
Surekha	5e5aad49e6	Set is_available to false by default for published segment (#6757 ) * Set is_available to false by default for published segment * Address comments Fix the is_published value for segments not in metadata store * Remove unused import * Use non-null sharSpec for a segment in test * Fix checkstyle * Modify comment	2018-12-20 13:29:00 -08:00
Gian Merlino	f0b7c272b9	Broker: Start up DruidSchema immediately if there are no segments. (#6765 ) Fixes a bug introduced in #6742, where the broker would delay startup indefinitely if there were no segments at all being served by any data servers.	2018-12-20 11:07:35 -07:00
Gian Merlino	7a09cde4de	Broker: Await initialization before finishing startup. (#6742 ) * Broker: Await initialization before finishing startup. In particular, hold off on announcing the service and starting the HTTP server until the server view and SQL metadata cache are finished initializing. This closes a window of time where a Broker could return partial results shortly after startup. As part of this, some simplification of server-lifecycle service announcements. This helps ensure that the two different kinds of announcements we do (legacy and new-style) stay in sync. * Remove unused imports. * Fix NPE in ServerRunnable.	2018-12-18 20:32:31 -08:00
Gian Merlino	f12a1aa993	SQL: Add support for queries with project-after-semijoin. (#6756 ) * SQL: Add support for queries with project-after-semijoin. These didn't work before, since the top Project rel wasn't getting merged into the DruidSemiJoin rel. This patch allows that to happen. * Null handling * Null handling * Null handling	2018-12-18 17:53:14 -08:00
Roman Leventov	ec38df7575	Simplify DruidNodeDiscoveryProvider; add DruidNodeDiscovery.Listener.nodeViewInitialized() (#6606 ) * Simplify DruidNodeDiscoveryProvider; add DruidNodeDiscovery.Listener.nodeViewInitialized() method; prohibit and eliminate some suboptimal Java 8 patterns * Fix style * Fix HttpEmitterTest.timeoutEmptyQueue() * Add DruidNodeDiscovery.Listener.nodeViewInitialized() calls in tests * Clarify code	2018-12-01 01:12:56 +01:00
Clint Wylie	efdec50847	bloom filter sql (#6502 ) * bloom filter sql support * docs * style fix * style fixes after rebase * use copied/patched bloomkfilter * remove context literal lookup function, changes from review * fix build * rename LookupOperatorConversion to QueryLookupOperatorConversion * remove doc * revert unintended change * add internal exception to bloom filter deserialization exception	2018-11-27 14:11:18 +08:00
Roman Leventov	87b96fb1fd	Add checkstyle rules about imports and empty lines between members (#6543 ) * Add checkstyle rules about imports and empty lines between members * Add suppressions * Update Eclipse import order * Add empty line * Fix StatsDEmitter	2018-11-20 12:42:15 +01:00
Gian Merlino	e9c3d3e651	SystemSchema: Fix data types for various fields. (#6642 ) * SystemSchema: Fix data types for various fields. - segments: start, end, partition_num - servers: plaintext_port, tls_port - tasks: plaintext_port, tls_port The declared and actual types did not match, but they must or else queries may generate ClassCastExceptions. Also adjusted some of the code for generating values to be more robust in the face of nulls or malformed strings. * Fix style.	2018-11-19 09:24:19 +08:00
Roman Leventov	8f3fe9cd02	Prohibit String.replace() and String.replaceAll(), fix and prohibit some toString()-related redundancies (#6607 ) * Prohibit String.replace() and String.replaceAll(), fix and prohibit some toString()-related redundancies * Fix bug * Replace checkstyle regexp with IntelliJ inspection	2018-11-15 13:21:34 -08:00
Gian Merlino	80173b5d29	SQL: Set INFORMATION_SCHEMA catalog name to "druid". (#6595 ) * SQL: Set INFORMATION_SCHEMA catalog name to "druid". Some third party tools ignore catalogs with empty names. So using the name "druid" for the catalog makes integration easier. * Update tests.	2018-11-14 06:32:40 +08:00
Gian Merlino	ab518781bb	SQL: Support AVG on system tables. (#6601 )	2018-11-14 06:31:33 +08:00
Gian Merlino	154b6fbcef	SQL: Add "POSITION" function. (#6596 ) Also add a "fromIndex" argument to the strpos expression function. There are some -1 and +1 adjustment terms due to the fact that the strpos expression behaves like Java indexOf (0-indexed), but the POSITION SQL function is 1-indexed.	2018-11-13 13:39:00 -08:00
Roman Leventov	54351a5c75	Fix various bugs; Enable more IntelliJ inspections and update error-prone (#6490 ) * Fix various bugs; Enable more IntelliJ inspections and update error-prone * Fix NPE * Fix inspections * Remove unused imports	2018-11-06 14:38:08 -08:00
Surekha	bcb754d066	Use current coordinator leader instead of cached one (#6551 ) (#6552 ) * Use current coordinator leader instead of cached one (#6551) Check the response status and throw exception if not OK * Modify tests * PR comment * Add the correct check for status of BytesAccumulatingResponseHandler * Move the status check into JsonParserIterator so sql query outputs meaningful message on failure * Fix tests	2018-11-06 13:09:51 -08:00
QiuMM	676f5e6d7f	Prohibit some guava collection APIs and use JDK collection APIs directly (#6511 ) * Prohibit some guava collection APIs and use JDK APIs directly * reset files that changed by accident * sort codestyle/druid-forbidden-apis.txt alphabetically	2018-10-29 13:02:43 +01:00
Roman Leventov	84ac18dc1b	Catch some incorrect method parameter or call argument formatting patterns with checkstyle (#6461 ) * Catch some incorrect method parameter or call argument formatting patterns with checkstyle * Fix DiscoveryModule * Inline parameters_and_arguments.txt * Fix a bug in PolyBind * Fix formatting	2018-10-23 07:17:38 -03:00
QiuMM	85a89e2703	make druid node bind address configurable (#6464 ) * make druid node bind address configurable * fix tests * fix travis-ci	2018-10-15 14:19:40 -07:00
Gian Merlino	f537c0069a	SQL: Support for selecting multi-value dimensions. (#6462 ) * SQL: Support for selecting multi-value dimensions. Fixes #4637. Doesn't completely address everything mentioned in #4638, but at least fixes one issue on the way there. * Fix null cases in tests.	2018-10-15 14:01:21 -07:00
QiuMM	6c71ee5ed5	fix type mismatch caused by #6377 (#6466 )	2018-10-15 17:34:18 +09:00
Clint Wylie	84598fba3b	combine druid-api, druid-common, java-util into druid-core (#6443 ) * combine druid-api, druid-common, java-util * spacing	2018-10-14 20:37:37 -07:00
Roman Leventov	e3397ba00f	Enforce Druid's exception class use (#6456 )	2018-10-13 16:35:14 -07:00
Surekha	e908fd6db7	Add check for nullable numRows (#6460 ) * Add check for nullable numRows * Make numRows long instead of Long type * Add check for numRows in unit test * small refactoring * Modify test PR comment from https://github.com/apache/incubator-druid/pull/6094#pullrequestreview-163937783 * Add a test for serverSegments table * update tests	2018-10-13 15:08:42 -07:00
Surekha	3be4a97150	Fix inconsistent segment size(#6448 ) (#6451 ) * Fix inconsistent segment size(#6448) * Fix the segment size for published segments * Changes to get numReplicas * Make coordinator segments API truly streaming * Changes to store partial segment data * Simplify SegmentMetadataHolder * Store partial the columns from available segments * Address comments	2018-10-12 12:55:20 -07:00
David Lim	20ab213ba6	change project versions to 0.13.0-incubating-SNAPSHOT (#6453 )	2018-10-11 19:28:01 -07:00
Surekha	3a0a667fe0	Introduce SystemSchema tables (#5989 ) (#6094 ) * Added SystemSchema with following tables (#5989) * SEGMENTS table provides details on served and published segments * SERVERS table provides details on data servers * SERVERSEGMETS table is the JOIN of SEGMENTS and SERVERS * TASKS table provides details on tasks * Add documentation for system schema * Fix static-analysis warnings * Address PR comments Add unit tests Fix a test * Try to fix a test * Fix a bug around replica count * rename io.druid to org.apache.druid * Major change is to make tasks and segment queries streaming * Made tasks/segments stream to calcite instead of storing it in memory * Add num_rows to segments table * Refactor JsonParserIterator * Replace with closeable iterator * Fix docs, make num_rows column nullable, some unit test changes * make num_rows column type long, allow it to be null fix a compile error after merge, add TrafficCop param to InputStreamResponseHandler * Filter null rows for segments table from Linq4j enumerable * change num_replicas datatype to long in segments table * Fix some tests and address comments * Doc updates, other PR comments * Update tests * Address comments * Add auth check * Update docs * Refactoring * Fix teamcity warning, change the getQueryableServer in TimelineServerView * Fix compilation after rebase * Use the stream API from AuthorizationUtils * Added LeaderClient interface and NoopDruidLeaderClient class * Revert "Added LeaderClient interface and NoopDruidLeaderClient class" This reverts commit `100fa46e39`. * Make the naming consistent to server_segments for the join table * Add ForbiddenException on auth check failure * Remove static block from SystemSchema * Try to fix a test in CalciteQueryTest due to rename of server_segments * Fix the json output format in the coordinator API * Add auth check in the segments API * Add null check to avoid NPE * Use annonymous class object instead of mock for DruidLeaderClient in SqlBenchmark * Fix test failures, type long/BIGINT can be nullable * Revert long nullability to fix tests * Fix style for tests * PR comments * Address PR comments * Add the missing BytesAccumulatingResponseHandler class * Use Sequences.withBaggage in DruidPlanner * Fix docs, add comments * Close the iterator if hasNext returns false	2018-10-10 17:17:29 -07:00
Roman Leventov	3ae563263a	Renamed 'Generic Column' -> 'Numeric Column'; Fixed a few resource leaks in processing; misc refinements (#5957 ) This PR accumulates many refactorings and small improvements that I did while preparing the next change set of https://github.com/druid-io/druid/projects/2. I finally decided to make them a separate PR to minimize the volume of the main PR. Some of the changes: - Renamed confusing "Generic Column" term to "Numeric Column" (what it actually implies) in many class names. - Generified `ComplexMetricExtractor`	2018-10-02 14:50:22 -03:00
Gian Merlino	244046fda5	SQL: Fix too-long headers in http responses. (#6411 ) Fixes #6409 by moving column name info from HTTP headers into the result body.	2018-10-01 18:13:08 -07:00
Gian Merlino	3548396a45	SQL: Update to Calcite 1.17.0. (#6404 ) * SQL: Update to Calcite 1.17.0. Other than keeping things fresh, another motivation is that this fixes CALCITE-1436 (AggregateNode NPE for aggregators other than SUM/COUNT), which affects aggregate functions on our system tables. Also sets shouldConvertRaggedUnionTypesToVarying = true, a new type system parameter that prefers VARCHAR over CHAR. This is better for Druid, because we don't really have support for a true CHAR type. * Remove unused import.	2018-09-29 18:33:29 -07:00
Gian Merlino	3922582d8c	SQL: Fix too-strict check in SortProject. (#6403 ) The "Duplicate field name" check on inputRowSignature is too strict: it is actually fine for a row signature to have the same field name twice. It happens when the same expression is selected twice, and both selections map to the same Druid object (dimension, aggregator, etc). I did not succeed in writing a test that triggers this, but I did see it occur in production for a complex query with hundreds of aggregators.	2018-09-29 13:54:34 -07:00
Gian Merlino	0da042cdd9	SQL: Unwrap IS_TRUE, IS_FALSE and friends when building a filter. (#6374 ) * SQL: Unwrap IS_TRUE, IS_FALSE and friends when building a filter. * Fix test.	2018-09-25 10:37:02 -07:00
Dayue Gao	edf0c13807	add a sql option to force user to specify time condition (#6246 ) * add a sql option to force user to specify time condition * rename forceTimeCondition to requireTimeCondition, refine error message	2018-09-17 13:52:24 -07:00
Roman Leventov	0c4bd2b57b	Prohibit some Random usage patterns (#6226 ) * Prohibit Random usage patterns * Fix FlattenJSONBenchmarkUtil	2018-09-14 13:35:51 -07:00
Gian Merlino	4669f0878f	SQL: UNION ALL operator. (#6314 ) * SQL: UNION ALL operator. * Remove unused import.	2018-09-09 22:32:56 -07:00
Dayue Gao	743547fc3b	Unauthorized sql request should return 403 (#6279 )	2018-09-01 09:17:18 -07:00
Gian Merlino	431d3d8497	Rename io.druid to org.apache.druid. (#6266 ) * Rename io.druid to org.apache.druid. * Fix META-INF files and remove some benchmark results. * MonitorsConfig update for metrics package migration. * Reorder some dimensions in inner queries for some reason. * Fix protobuf tests.	2018-08-30 09:56:26 -07:00
Himanshu	1fae6513e1	add "subtotalsSpec" attribute to groupBy query (#5280 ) * add subtotalsSpec attribute to groupBy query * dont sent subtotalsSpec to downstream nodes from broker and other updates * address review comment * fix checkstyle issues after merge to master * add docs for subtotalsSpec feature * address doc review comments	2018-08-28 17:46:38 -07:00
Gian Merlino	80224df36a	SQL: Fix post-aggregator naming logic for sort-project. (#6250 ) The old code assumes that post-aggregator prefixes are one character long followed by numbers. This isn't always true (we may pad with underscores to avoid conflicts). Instead, the new code uses a different base prefix for sort-project postaggregators ("s" instead of "p") and uses the usual Calcites.findUnusedPrefix function to avoid conflicts.	2018-08-28 10:59:32 -07:00
Dayue Gao	a879022bc8	fix AssertionError of semi join query (#6244 )	2018-08-27 17:49:51 -07:00
Dayue Gao	2325844a38	fix incorrect check of maxSemiJoinRowsInMemory (#6242 )	2018-08-27 16:28:29 -07:00
Gian Merlino	0172326c62	SQL: Support more result formats, add columns header. (#6191 ) * SQL: Support more result formats, add columns header. - Add result formats for line-based JSON and CSV. - Add X-Druid-Sql-Columns header with a list of all columns that the response will contain. - Add more comprehensive documentation on what callers should expect when making Druid SQL queries. * Fix some tests. * Adjust tests. * Adjust trailer, add types header. * Fix trailers.	2018-08-26 23:00:14 -06:00
Gian Merlino	28e6ae3664	SQL: Finalize aggregations for inner queries when necessary. (#6221 ) * SQL: Finalize aggregations for inner queries when necessary. Fixes #5779. * Fixed test method name.	2018-08-25 13:56:23 -07:00
Jihoon Son	ecee3e0a24	Further optimize memory for Travis jobs (#6150 ) * Further optimize memory for Travis jobs * fix build * sudo false	2018-08-10 22:03:36 -07:00
Nishant Bangarwa	75c8a87ce1	Part 2 of changes for SQL Compatible Null Handling (#5958 ) * Part 2 of changes for SQL Compatible Null Handling * Review comments - break lines longer than 120 characters * review comments * review comments * fix license * fix test failure * fix CalciteQueryTest failure * Null Handling - Review comments * review comments * review comments * fix checkstyle * fix checkstyle * remove unrelated change * fix test failure * fix failing test * fix travis failures * Make StringLast and StringFirst aggregators nullable and fix travis failures	2018-08-02 08:20:25 -07:00
Roman Leventov	0754d78a2e	Prohibit Lists.newArrayList() with a single argument (#6068 ) * Prohibit Lists.newArrayList() with a single argument * Test fixes * Add Javadoc to Node constructor	2018-07-31 20:09:10 -07:00
Benedict Jin	331a0afb98	Remove redundant type parameters and enforce some other style and inspection rules (#5980 ) * Various changes about druid-services module * Patch improvements from reviewer * Add ToArrayCallWithZeroLengthArrayArgument & ArraysAsListWithZeroOrOneArgument into inspection profile * Fix ArraysAsListWithZeroOrOneArgument * Fix conflict * Fix ToArrayCallWithZeroLengthArrayArgument * Fix AliEqualsAvoidNull * Remove blank line * Remove unused import clauses * Fix code style in TopNQueryRunnerTest * Fix conflict * Don't use Collections.singletonList when converting the type of array type * Add argLine into maven-surefire-plugin in druid-process module & increase the timeout value for testMoveSegment testcase * Roll back the latest commit * Add java.io.File#toURL() into druid-forbidden-apis * Using Boolean.parseBoolean instead of Boolean.valueOf for CliCoordinator#isOverlord * Add a new regexp element into stylecode xml file * Fix style error for new regexp * Set the level of ArraysAsListWithZeroOrOneArgument as WARNING * Fix style error for new regexp * Add option BY_LEVEL for ToArrayCallWithZeroLengthArrayArgument in inspection profile * Roll back the level as ToArrayCallWithZeroLengthArrayArgument as ERROR * Add toArray(new Object[0]) regexp into checkstyle config file & fix them * Set the level of ArraysAsListWithZeroOrOneArgument as ERROR & Roll back the level of ToArrayCallWithZeroLengthArrayArgument as WARNING until Youtrack fix it * Add a comment for string equals regexp in checkstyle config * Fix code format * Add RedundantTypeArguments as ERROR level inspection * Fix cannot resolve symbol datasource	2018-07-27 16:56:49 -05:00
Jonathan Wei	efab3b0160	Add concat and textcat SQL functions (#6005 )	2018-07-20 11:21:04 -07:00
Gian Merlino	cd8ea3da8d	SQL: Add server-wide default time zone config. (#5993 ) * SQL: Add server-wide default time zone config. * Switch API.	2018-07-18 13:12:40 -07:00
Gian Merlino	04ea3c9f8c	Update license headers. (#5976 ) * Update license headers. For compliance with http://www.apache.org/legal/src-headers.html. * More license adjustments. * Fix mistakenly edited package line.	2018-07-11 09:55:18 -07:00
Gian Merlino	948e73da77	Extend various test timeouts. (#5978 ) False failures on Travis due to spurious timeout (in turn due to noisy neighbors) is a bigger problem than legitimate failures taking too long to time out. So it makes sense to extend timeouts.	2018-07-10 13:02:14 -07:00

1 2 3 4

183 Commits