druid

Commit Graph

Author	SHA1	Message	Date
Gian Merlino	8211379de6	MSQ: Change default clusterStatisticsMergeMode to SEQUENTIAL. (#14310 ) * MSQ: Change default clusterStatisticsMergeMode to SEQUENTIAL. This is an undocumented parameter that controls how cluster-by statistics are merged. In PARALLEL mode, statistics are gathered from workers all at once. In SEQUENTIAL mode, statistics are gathered time chunk by time chunk. This improves accuracy for jobs with many time chunks, and reduces memory usage. The main downside of SEQUENTIAL is that it can take longer, but in most situations I've seen, PARALLEL is only really usable in cases where the sketches are small enough that SEQUENTIAL would also run relatively quickly. So it seems like SEQUENTIAL is a better default. * Switch off-test from SEQUENTIAL to PARALLEL. * Fix sequential merge for situations where there are no time chunks at all. * Add a couple more tests.	2023-06-26 10:54:28 -07:00
YongGang	b7434be99e	Add ServiceStatusMonitor to monitor service health (#14443 ) * Add OverlordStatusMonitor and CoordinatorStatusMonitor to monitor service leader status * make the monitor more general * resolve conflict * use Supplier pattern to provide metrics * reformat code and doc * move service specific tag to dimension * minor refine * update doc * reformat code * address comments * remove declared exception * bind HeartbeatSupplier conditionally in Coordinator	2023-06-26 10:26:37 -07:00
Laksh Singla	114380749d	MSQ: Improve the parse exception errors and the handling of null UTF characters in Strings in Frames (#14398 )	2023-06-26 18:14:29 +05:30
Laksh Singla	1647d5f4a0	Limit the subquery results by memory usage (#13952 ) Users can now add a guardrail to prevent subquery’s results from exceeding the set number of bytes by setting druid.server.http.maxSubqueryBytes in Broker's config or maxSubqueryBytes in the query context. This feature is experimental for now and would default back to row-based limiting in case it fails to get the accurate size of the results consumed by the query.	2023-06-26 18:12:28 +05:30
Gian Merlino	d7c9c2f367	SqlResults: Coerce arrays to lists for VARCHAR. (#14260 ) * SqlResults: Coerce arrays to lists for VARCHAR. Useful for STRING_TO_MV, which returns VARCHAR at the SQL layer and an ExprEval with String[] at the native layer. * Fix style. * Improve test coverage. * Remove unnecessary throws.	2023-06-25 09:35:18 -07:00
Tejaswini Bandlamudi	72cf91fbc0	Upgrade Avro to latest version (#14440 ) Upgraded Avro to 1.11.1	2023-06-24 14:51:30 +05:30
Gian Merlino	970288067a	Fix flaky HttpEmitterConfigTest and ParametrizedUriEmitterConfigTest. (#14481 ) Recently, we have seen flakiness in these two tests, apparently due to computations based on Runtime.getRuntime().maxMemory() differing during static initialization and in the actual tests. I can't think of a reason why this would be happening, but anyway, this patch switches the tests to use the statics instead of recomputing Runtime.getRuntime().maxMemory().	2023-06-23 16:27:11 -07:00
Gian Merlino	3d19b748fb	SQL OperatorConversions: Introduce aggregatorBuilder, allow CAST-as-literal. (#14249 ) * SQL OperatorConversions: Introduce aggregatorBuilder, allow CAST-as-literal. Four main changes: 1) Provide aggregatorBuilder, a more consistent way of defining the SqlAggFunction we need for all of our SQL aggregators. The mechanism is analogous to the one we already use for SQL functions (OperatorConversions.operatorBuilder). 2) Allow CASTs of constants to be considered as "literalOperands". This fixes an issue where various of our operators are defined with OperandTypes.LITERAL as part of their checkers, which doesn't allow casts. However, in these cases we generally _do_ want to allow casts. The important piece is that the value must be reducible to a constant, not that the SQL text is literally a literal. 3) Update DataSketches SQL aggregators to use the new aggregatorBuilder functionality. The main user-visible effect here is [2]: the aggregators would now accept, for example, "CAST(0.99 AS DOUBLE)" as a literal argument. Other aggregators could be updated in a future patch. 4) Rename "requiredOperands" to "requiredOperandCount", because the old name was confusing. (It rhymes with "literalOperands" but the arguments mean different things.) * Adjust method calls.	2023-06-23 16:25:04 -07:00
Gian Merlino	1d6c9657ec	Clarify compaction docs. (#14225 ) * Clarify compaction docs. The prior wording made it sound like segmentGranularity, queryGranularity, and rollup are always required for granularitySpec. They are not required, but they are strongly recommended. The adjusted wording hopefully does a better job of making that clear. * Fix link. * Wording adjustments.	2023-06-23 15:24:15 -07:00
Gian Merlino	ddd0fc1b85	S3: Attach SSE key to doesObjectExist calls. (#14290 ) * S3: Attach SSE key to doesObjectExist calls. We did not previously attach the SSE key to the doesObjectExist request, leading to an inconsistency that may cause problems on "S3-compatible" implementations. This patch implements doesObjectExist using similar logic to the S3 client itself, but calls our implementation of getObjectMetadata rather than the S3 client's, ensuring the request is decorated with the SSE key. * Fix tests.	2023-06-23 15:23:59 -07:00
Rishabh Singh	155fde33ff	Add metrics to SegmentMetadataCache refresh (#14453 ) New metrics: - `segment/metadatacache/refresh/time`: time taken to refresh segments per datasource - `segment/metadatacache/refresh/count`: number of segments being refreshed per datasource	2023-06-23 16:51:08 +05:30
Peter Marshall	b6d6e3b827	Update start-druid-main.py (#14471 ) Quick typo correction.	2023-06-23 14:07:24 +05:30
imply-cheddar	7e2cf35d7b	Fix compatibility issue with SqlTaskResource (#14466 ) * Fix compatibility issue with SqlTaskResource The DruidException changes broke the response format for errors coming back from the SqlTaskResource, so fix those	2023-06-23 01:15:32 -07:00
Clint Wylie	9b1779734b	fix website mvn build (#14458 ) changes: * fix website mvn build * remove the i18n/en.json file add to gitignore * add spellcheck to mvn test phase	2023-06-22 12:14:23 -07:00
Clint Wylie	31b9d5695d	Extend InitializedNullHandlingTest instead of NullHandlingTest (#14467 ) NullHandlingTest is an actual test, it shouldn't be used as a base class	2023-06-22 15:01:50 +05:30
Adarsh Sanjeev	90b8f850a5	Allow empty tiered replicants map for load rules (#14432 ) Changes: - Add property `useDefaultTierForNull` for all load rules. This property determines the default value of `tieredReplicants` if it is not specified. When true, the default is `_default_tier => 2 replicas`. When false, the default is empty, i.e. no replicas on any tier. - Fix validation to allow empty replicants map, so that the segment is used but not loaded anywhere.	2023-06-22 14:44:06 +05:30
Abhishek Agarwal	f8f2fe8b7b	Skip tests based on files changed in the PR (#14445 ) Our CI system has a lot of tests. And much of this testing is really unnecessary for most of the PRs. This PR adds some checks so we can skip these expensive tests when we know they are not necessary.	2023-06-22 12:27:23 +05:30
Sergio Ferragut	1a9aefbb0f	Move from Jupyter notebook to Jupyter Lab and introduce a notebook folder structure (#14419 )	2023-06-21 09:11:00 -07:00
Rishabh Singh	92a7febacb	Revert "Add method to authorize native query using authentication result (#14376 )" (#14452 ) This reverts commit `8b212e73d7`.	2023-06-21 10:42:26 +05:30
Hardik Bajaj	1ea9158a50	Added new SysMonitorOshi v0 using Oshi library (#14359 ) Added a new monitor SysMonitorOshi to replace SysMonitor. The new monitor has wider support for different machine architectures, including ARM instances. Please switch to SysMonitorOshi as SysMonitor is now deprecated and will be removed in future releases.	2023-06-20 20:57:58 +05:30
Adarsh Sanjeev	f5cc823d0f	Handle nulls in DruidCoordinator.getReplicationFactor (#14447 )	2023-06-20 15:25:57 +05:30
Rohan Garg	09d6c5a45e	Decouple logical planning and native query generation in SQL planning (#14232 ) Add a new planning strategy that explicitly decouples the DAG from building the native query. With this mode, it is Calcite's job to generate a "logical DAG" which is all of the various DruidProject, DruidFilter, etc. nodes. We then take those nodes and use them to build a native query. The current commit doesn't pass all tests, but it does work for some things and is a decent starting baseline.	2023-06-19 16:00:40 -07:00
Kashif Faraz	50461c3bd5	Enable smartSegmentLoading on the Coordinator (#13197 ) This commit does a complete revamp of the coordinator to address problem areas: - Stability: Fix several bugs, add capabilities to prioritize and cancel load queue items - Visibility: Add new metrics, improve logs, revamp `CoordinatorRunStats` - Configuration: Add dynamic config `smartSegmentLoading` to automatically set optimal values for all segment loading configs such as `maxSegmentsToMove`, `replicationThrottleLimit` and `maxSegmentsInNodeLoadingQueue`. Changed classes: - Add `StrategicSegmentAssigner` to make assignment decisions for load, replicate and move - Add `SegmentAction` to distinguish between load, replicate, drop and move operations - Add `SegmentReplicationStatus` to capture current state of replication of all used segments - Add `SegmentLoadingConfig` to contain recomputed dynamic config values - Simplify classes `LoadRule`, `BroadcastRule` - Simplify the `BalancerStrategy` and `CostBalancerStrategy` - Add several new methods to `ServerHolder` to track loaded and queued segments - Refactor `DruidCoordinator` Impact: - Enable `smartSegmentLoading` by default. With this enabled, none of the following dynamic configs need to be set: `maxSegmentsToMove`, `replicationThrottleLimit`, `maxSegmentsInNodeLoadingQueue`, `useRoundRobinSegmentAssignment`, `emitBalancingStats` and `replicantLifetime`. - Coordinator reports richer metrics and produces cleaner and more informative logs - Coordinator uses an unlimited load queue for all servers, and makes better assignment decisions	2023-06-19 14:27:35 +05:30
imply-cheddar	cfd07a95b7	Errors take 3 (#14004 ) Introduce DruidException, an exception whose goal in life is to be delivered to a user. DruidException itself has javadoc on it to describe how it should be used. This commit both introduces the Exception and adjusts some of the places that are generating exceptions to generate DruidException objects instead, as a way to show how the Exception should be used. This work was a 3rd iteration on top of work that was started by Paul Rogers. I don't know if his name will survive the squash-and-merge, so I'm calling it out here and thanking him for starting on this.	2023-06-19 01:11:13 -07:00
Gian Merlino	2b676ac7f8	Quieter KafkaSupervisors in all bundled log4j2.xml. (#14444 ) Follow-up to #13392, which added this to a single log4j2.xml.	2023-06-19 12:04:11 +05:30
Adarsh Sanjeev	128133fadc	Add replication_factor column to sys.segments table (#14403 ) Description: Druid allows a configuration of load rules that may cause a used segment to not be loaded on any historical. This status is not tracked in the sys.segments table on the broker, which makes it difficult to determine if the unavailability of a segment is expected and if we should not wait for it to be loaded on a server after ingestion has finished. Changes: - Track replication factor in `SegmentReplicantLookup` during evaluation of load rules - Update API `/druid/coordinator/v1/metadata/segments` to return replication factor - Add column `replication_factor` to the sys.segments virtual table and populate it in `MetadataSegmentView` - If this column is 0, the segment is not assigned to any historical and will not be loaded.	2023-06-18 10:02:21 +05:30
George Shiqi Wu	bd07c3dd43	Don't need to double synchronize on simple map operations (#14435 ) * Don't need to double synchronize on simple map operations * remove lock	2023-06-17 17:30:37 -07:00
Abhishek Radhakrishnan	04fb75719e	Fail query planning if a `CLUSTERED BY` column contains descending order (#14436 ) * Throw ValidationException if CLUSTERED BY column descending order is specified. - Fails query planning * Some more tests. * fixup existing comment * Update comment * checkstyle fix: remove unused imports * Remove InsertCannotOrderByDescendingFault and deprecate the fault in readme. * move deprecated field to the bottom	2023-06-16 18:10:12 -04:00
George Shiqi Wu	64af9bfe5b	Add groupId to metrics (#14402 ) * Add group id as a dimension * Revert changes * Add to forking task runner * Add missing metrics * Fix indenting * revert metrics * Fix indentation	2023-06-16 09:28:16 -07:00
Clint Wylie	359bd63cc9	allow expression "best effort" type determination to better handle mixed type arrays (#14438 )	2023-06-16 00:02:43 -07:00
Gian Merlino	85656a467c	MSQ: Load broadcast tables on workers. (#14437 ) They were not previously loaded because supportsQueries was false. This patch sets supportsQueries to true, and clarifies in Task javadocs that supportsQueries can be true for tasks that aren't directly queryable over HTTP.	2023-06-16 12:02:20 +05:30
Maytas Monsereenusorn	5d76d0ea74	Fix segment/deleted/count metric not being emitted (#14433 ) * Fix segment/deleted/count metric * Fix segment/deleted/count metric * Fix segment/deleted/count metric	2023-06-15 14:08:19 -07:00
Laksh Singla	4935f2470a	Limit results generated by SELECT queries in MSQ (#14370 ) * Limit select results in MSQ * reduce number of files in test * add truncated flag * avoid materializing select results to list, use iterable instead * javadocs	2023-06-15 13:13:11 +05:30
Clint Wylie	ff5ae4db6c	fix kafka input format reader schema discovery and partial schema discovery (#14421 ) * fix kafka input format reader schema discovery and partial schema discovery to actually work right, by re-using dimension filtering logic of MapInputRowParser	2023-06-15 00:11:04 -07:00
Clint Wylie	ca116cf886	adjust broker parallel merge to help managed blocking be more well behaved (#14427 )	2023-06-15 00:10:31 -07:00
Pranav	5314db9f85	Adding the file mapper to handle v2 buffer deserialization (#14429 )	2023-06-14 19:41:44 -07:00
Pranav	e426d370ea	Start with solo accumulator and empty partition (#14426 ) * Starting parallel merge with solo accumulator and empty partitions * shut down pool in test	2023-06-14 16:20:48 -07:00
Alexander Saydakov	f6169d437b	use the latest datasketches-java-4.1.0 (#14430 ) Co-authored-by: AlexanderSaydakov <AlexanderSaydakov@users.noreply.github.com>	2023-06-14 16:03:56 -07:00
George Shiqi Wu	76e70654ac	Fix issues when startup timeout is hit (#14425 )	2023-06-14 11:49:55 -07:00
Vadim Ogievetsky	6fd28fc185	Web console: split the Ingestion view into two views: Supervisors and Tasks (#14395 ) * init split * don't crash if unable to get running tasks * update snapshots * push down state into call * googies * simplify * update e2e tests * feedback fixes * update e2e tests * better icons * fix test * adjust colors	2023-06-14 10:42:30 -07:00
Clint Wylie	8454cc619a	auto columns fixes (#14422 ) changes: * auto columns no longer participate in generic 'null column' handling, this was a mistake to try to support and caused ingestion failures due to mismatched ColumnFormat, and will be replaced in the future with nested common format constant column functionality (not in this PR) * fix bugs with auto columns which contain empty objects, empty arrays, or primitive types mixed with either of these empty constructs * fix bug with bound filter when upper is null equivalent but is strict	2023-06-14 08:57:06 -07:00
Abhishek Radhakrishnan	be5a6593a9	Reset `RuntimeInfo` to fix flaky test `ParametrizedUriEmitterConfigTest`. (#14405 ) * Add injector so JVM settings are correctly set up and bound for the test. * Add VisibleForTesting IDE annotation. * spacing	2023-06-13 18:07:51 -07:00
Abhishek Radhakrishnan	b8495d45a1	Expose Druid functions in `INFORMATION_SCHEMA.ROUTINES` table. (#14378 ) * Add INFORMATION_SCHEMA.ROUTINES to expose Druid operators and functions. * checkstyle * remove IS_DETERMINISTIC. * test * cleanup test * remove logs and simplify * fixup unit test * Add docs for INFORMATION_SCHEMA.ROUTINES table. * Update test and add another SQL query. * add stuff to .spelling and checkstyle fix. * Add more tests for custom operators. * checkstyle and comment. * Some naming cleanup. * Add FUNCTION_ID * The different Calcite function syntax enums get translated to FUNCTION * Update docs. * Cleanup markdown table. * fixup test. * fixup intellij inspection * Review comment: nullable column; add a function to determine function syntax. * More tests; add non-function syntax operators. * More unit tests. Also add a separate test for DruidOperatorTable. * actually just validate non-zero count. * switch up the order * checkstyle fixes.	2023-06-13 15:44:04 -04:00
Clint Wylie	61120dc49a	fix Kafka input format to throw ParseException if timestamp is missing (#14413 )	2023-06-13 09:00:11 -07:00
Rishabh Singh	66c3cc1391	Handle unparseable SupervisorSpec in metadata store (#14382 ) Changes: - Skip a supervisor spec entry which cannot be deserialised into a `SupervisorSpec` object. - Log an error for the unparseable spec	2023-06-13 08:02:01 +05:30
Abhishek Radhakrishnan	1c76ebad3b	Minor doc updates. (#14409 ) Co-authored-by: Victoria Lim <vtlim@users.noreply.github.com>	2023-06-12 15:24:48 -07:00
Abhishek Radhakrishnan	326f2c5020	Add more statement attributes to explain plan result. (#14391 ) This PR adds the following to the ATTRIBUTES column in the explain plan output: - partitionedBy - clusteredBy - replaceTimeChunks This PR leverages the work done in #14074, which added a new column ATTRIBUTES to encapsulate all the statement-related attributes.	2023-06-12 19:18:02 +05:30
Rishabh Singh	8b212e73d7	Add method to authorize native query using authentication result (#14376 )	2023-06-12 11:06:00 +05:30
Clint Wylie	b5f45832b1	Add 'Flaky test' issue template (#14394 ) * Add 'Flaky test' issue template * Update flaky_test.md	2023-06-11 19:02:38 -07:00
Adarsh Sanjeev	267cbac6ff	Add logs for deleting files using storage connector (#14350 ) * Add logs for deleting files using storage connector * Address review comments * Update log message format	2023-06-11 21:24:30 +05:30
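
The load-rule defaulting described in commit 90b8f850a5 (`useDefaultTierForNull`) and the zero-replica case mentioned in commit 128133fadc can be illustrated with a minimal Python sketch. This is not Druid's implementation: the function names are hypothetical, and treating the replication factor as the total replica count across tiers is a simplifying assumption made only for illustration.

```python
from typing import Dict, Optional

# Values taken from the commit message: "_default_tier => 2 replicas".
DEFAULT_TIER = "_default_tier"
DEFAULT_REPLICAS = 2


def resolve_tiered_replicants(
    tiered_replicants: Optional[Dict[str, int]],
    use_default_tier_for_null: bool,
) -> Dict[str, int]:
    """Hypothetical helper: pick the effective tier -> replica-count map."""
    if tiered_replicants is not None:
        # An explicitly specified map, even an empty one, is used as-is.
        return dict(tiered_replicants)
    if use_default_tier_for_null:
        return {DEFAULT_TIER: DEFAULT_REPLICAS}
    # No replicas on any tier: the segment is used but not loaded anywhere.
    return {}


def replication_factor(tiered_replicants: Dict[str, int]) -> int:
    """Assumption: replication factor is the sum of replicas over all tiers."""
    return sum(tiered_replicants.values())


if __name__ == "__main__":
    for spec, flag in [(None, True), (None, False), ({}, True), ({"hot": 2, "cold": 1}, False)]:
        resolved = resolve_tiered_replicants(spec, flag)
        print(spec, flag, "->", resolved, "replication factor:", replication_factor(resolved))
```

Under these assumptions, a rule with no `tieredReplicants` and `useDefaultTierForNull` set to false resolves to an empty map, which corresponds to a replication factor of 0 in the sense of the `replication_factor` column described above.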

12952 Commits
