Currently, durable storage and export both require a temporary directory to be explicitly configured, via druid.export.storage.<connectorType>.tempLocalDir and druid.msq.intermediate.storage.tempDir respectively.
Tasks on the MiddleManager already have a configured temporary directory. This PR uses the task directory as the default when no temporary directory is explicitly configured, reducing the number of configs a user has to set.
Note that on tasks, preference is given to the user-configured druid.*.storage.temp*Dir; only if that is not set do we fall back to the task's configured temporary directory.
The Overlord and Brokers also require storage connector configuration (for the durableStorageCleanerOverlordDuty and for fetching async query results, respectively), but they have no default temporary task directory, so the configuration is still required for these services.
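As a hedged illustration of this precedence (the property names come from the description above; the paths and the `s3` connector type are example values):

```properties
# If set explicitly, these properties take priority, on tasks as well as elsewhere.
druid.msq.intermediate.storage.tempDir=/mnt/fast-disk/msq-tmp
druid.export.storage.s3.tempLocalDir=/mnt/fast-disk/export-tmp

# If they are omitted on tasks running on the MiddleManager, the task's own
# temporary directory is used as the default. The Overlord and Brokers have no
# task directory to fall back on, so the properties must still be set there.
```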
* DruidOverlord: Move becomeLeader/stopBeingLeader earlier.
On becoming leader, it is helpful for the TaskRunner and TaskQueue to be
available when the SupervisorManager starts up, to aid the supervisors
in discovering their tasks.
On stopping leadership, it is helpful for the TaskRunner and TaskQueue
to be available until the SupervisorManager has finished shutting down.
They are only available when the TaskMaster is in "leader" mode, so to
achieve the above, this patch moves these calls earlier in the sequence.
* Adjust leadership into two phases.
* Update test.
* Adjustments for coverage.
* Stop mirrors start better.
* Update errorprone, mockito, jacoco, checkerframework.
This patch updates various build and test dependencies, to see whether they
make unit tests on JDK 21 behave more reliably.
* Update licenses, tests.
* Remove assertEquals.
* Repair two tests.
* Update some more tests.
Following #17394, workerExec can get deadlocked with itself, because it
waits for task futures and is also used as the connectExec for the task
client. To fix this, we need to never await task futures in the workerExec.
There are two specific changes: in "verifyAndMergeCheckpoints" and
"checkpointTaskGroup", two "coalesceAndAwait" calls that formerly occurred
in workerExec are replaced with Futures.transform (using a callback in
workerExec).
Because this adjustment removes a source of blocking, it may also improve
supervisor responsiveness for high task counts. This is not the primary
goal, however. The primary goal is to fix the bug introduced by #17394.
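As a hedged, simplified sketch of the pattern (not the actual supervisor code): instead of blocking a workerExec thread until the task futures resolve, the continuation is attached as a callback that workerExec runs once the futures complete, so workerExec never waits on work that it must also drive.

```java
import com.google.common.util.concurrent.Futures;
import com.google.common.util.concurrent.ListenableFuture;
import com.google.common.util.concurrent.ListeningExecutorService;
import com.google.common.util.concurrent.MoreExecutors;
import java.util.List;
import java.util.concurrent.Executors;

public class NonBlockingAwaitSketch
{
  public static void main(String[] args) throws Exception
  {
    // Stand-in for the per-supervisor workerExec.
    ListeningExecutorService workerExec =
        MoreExecutors.listeningDecorator(Executors.newSingleThreadExecutor());

    // Stand-ins for the per-task futures the supervisor waits on.
    List<ListenableFuture<Boolean>> taskFutures =
        List.of(Futures.immediateFuture(true), Futures.immediateFuture(false));

    // Before (conceptually): a coalesceAndAwait-style call blocked inside workerExec.
    // After: transform the combined future and let workerExec run the callback
    // only once all task futures have completed.
    ListenableFuture<Void> done = Futures.transform(
        Futures.allAsList(taskFutures),
        results -> {
          System.out.println("Task results: " + results);
          return null;
        },
        workerExec
    );

    done.get();
    workerExec.shutdown();
  }
}
```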
Calling toString on newConfig is unnecessary, because the logger does it
automatically when the message is actually emitted. Skipping the explicit call
saves some effort when the log level is higher than DEBUG.
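A hedged illustration of the idea (shown here with SLF4J rather than the logger used in the code; the parameterized form is the point):

```java
import org.slf4j.Logger;
import org.slf4j.LoggerFactory;

public class DeferredLoggingSketch
{
  private static final Logger log = LoggerFactory.getLogger(DeferredLoggingSketch.class);

  public static void main(String[] args)
  {
    Object newConfig = new Object();  // stand-in for the config object

    // Eager: the message string, including newConfig.toString(), is built even
    // when DEBUG logging is disabled.
    log.debug("Applying new config: " + newConfig.toString());

    // Deferred: newConfig is rendered only if the DEBUG message is actually logged.
    log.debug("Applying new config: {}", newConfig);
  }
}
```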
* SeekableStreamSupervisor: Use workerExec as the client connectExec.
This patch uses the already-existing per-supervisor workerExec as the
connectExec for task clients, rather than using the process-wide default
ServiceClientFactory pool.
This helps prevent callbacks from backlogging on the process-wide pool.
It's especially useful for retries, where callbacks may need to establish
new TCP connections or perform TLS handshakes.
* Fix compilation, tests.
* Fix style.
Changes:
* adds `SqlBenchmarkDatasets` which contains commonly used benchmark data generator schemas
* adds `SqlBaseBenchmark` which contains common benchmark segment generation methods for any benchmark using `SqlBenchmarkDatasets`
* adds `SqlBaseQueryBenchmark` and `SqlBasePlanBenchmark` for benchmarks measuring queries and planning respectively
* migrate all existing SQL jmh benchmarks to extend `SqlBaseQueryBenchmark`, dramatically reducing the boilerplate needed to create benchmarks and allowing the use of multiple datasources within a benchmark file
* adjustments to the data generator code to allow passing in an ObjectMapper, so that the same mapper can be used for both benchmark queries and segment generation, avoiding the need to register everything with both mappers for benchmarks
* adds `SqlProjectionsBenchmark` and `SqlComplexMetricsColumnsBenchmark` for measuring projections and measuring complex metric compression respectively
This patch is extracted from PR 17353.
Changes:
- Added BrokerClient and BrokerClientImpl to the sql package, leveraging the ServiceClient functionality, similar to the OverlordClient and CoordinatorClient implementations in the server module.
- For now, only two broker API stubs are added: submitSqlTask() and fetchExplainPlan(). A rough sketch of this surface follows the list below.
- Added a new POJO class ExplainPlan that encapsulates explain plan info.
- Deprecated org.apache.druid.discovery.BrokerClient in favor of the new BrokerClient in this patch.
- Cleaned up ExplainAttributesTest a bit and added serde verification.
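A rough sketch of what the new asynchronous client surface might look like; only the method names and the SqlTaskStatus / ExplainPlan classes come from the change description, while the ListenableFuture return types are assumptions (in the style of OverlordClient) and the request parameter is simplified to a String:

```java
import com.google.common.util.concurrent.ListenableFuture;
import java.util.List;

public interface BrokerClientSketch
{
  /** Submits a SQL query to the Broker as an async task and returns its status. */
  ListenableFuture<SqlTaskStatus> submitSqlTask(String sqlQuery);

  /** Fetches the EXPLAIN PLAN output for the given SQL query. */
  ListenableFuture<List<ExplainPlan>> fetchExplainPlan(String sqlQuery);
}
```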
MSQ currently supports only single-valued string dimensions as partition keys.
This patch adds a check to ensure that partition keys are single-valued, in cases
where this information is available from the segments downloaded for schema inference.
During compaction, if MSQ finds multi-valued dimensions (MVDs) declared as part
of the `range` partitionsSpec, it switches the partitioning type to dynamic, resulting in
repeated compactions of the same interval. To avoid this scenario, the segment
download logic is also updated to always download segments when info on multi-valued
dimensions is required.
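A hedged sketch of the validation idea; all names here are hypothetical stand-ins rather than the actual MSQ code. The point is simply to fail fast when a declared range partition key is known, from the downloaded segment schemas, to be multi-valued, instead of silently switching to dynamic partitioning:

```java
import java.util.List;
import java.util.Map;

public class RangePartitionKeyCheckSketch
{
  /**
   * @param rangePartitionColumns   columns declared in the range partitionsSpec
   * @param knownMultiValuedColumns column name -> whether it was observed to be
   *                                multi-valued; columns absent from the map have
   *                                unknown status and are not rejected
   */
  static void validate(List<String> rangePartitionColumns, Map<String, Boolean> knownMultiValuedColumns)
  {
    for (String column : rangePartitionColumns) {
      if (Boolean.TRUE.equals(knownMultiValuedColumns.get(column))) {
        throw new IllegalArgumentException(
            "Column[" + column + "] is multi-valued and cannot be used as a range partition key"
        );
      }
    }
  }
}
```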
- This is a non-functional change that moves SqlTaskStatus and its unit test SqlTaskStatusTest from the msq module to the sql module, so the class can be reused in other places.
- This refactor is extracted from this PR to facilitate review.
- Fix a minor spacing issue in the TaskStartTimeoutFault error message.
The Delta Decimal type wasn't handled correctly in the Druid Delta connector, resulting in the error: Unsupported fieldType[Decimal(4, 2)] for fieldName[price].
There were no tests or existing tables with the Decimal type, so I've updated the existing table, complex-types-table, to include this data type.
Note that Druid can handle the Decimal type as a double at best. A big decimal that cannot fit inside a double should be ingested as a string.
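A hedged sketch of the coercion idea (not the connector's actual code): Delta Decimal values surface as java.math.BigDecimal, which Druid can ingest as a double at best; a value too large for a double falls back to its string form:

```java
import java.math.BigDecimal;

public class DeltaDecimalCoercionSketch
{
  static Object coerceDecimal(BigDecimal value)
  {
    double asDouble = value.doubleValue();
    // BigDecimal.doubleValue() overflows to +/- infinity when the magnitude is
    // too large for a double; in that case, fall back to the string representation.
    if (Double.isInfinite(asDouble)) {
      return value.toPlainString();
    }
    return asDouble;
  }
}
```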
* GlueingPartitioningOperator: It continuously receives data and outputs batches of partitioned RACs. It keeps track of the last partitioning boundary of the most recently pushed RAC and attempts to glue it with the next RAC it receives, ensuring that partitions are handled correctly even across multiple RACs (a toy illustration follows this list). See GlueingPartitioningOperatorTest for some good examples of the "glueing" behavior.
* PartitionSortOperator: It sorts rows inside partitioned RACs, on the sort columns. The input RACs it receives are expected to be "complete / separate" partitions of data.
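A toy illustration of the "glueing" idea using plain lists rather than the real operator or RAC APIs: the trailing partition of the previous batch is held back and merged with the leading rows of the next batch when they share the same partition key:

```java
import java.util.ArrayList;
import java.util.List;

public class GlueingExample
{
  public static void main(String[] args)
  {
    // Two incoming batches, partitioned on a single "country" key.
    List<List<String>> batches = List.of(
        List.of("US", "US", "FR"),
        List.of("FR", "FR", "DE")
    );

    List<String> held = new ArrayList<>();  // trailing partition carried across batches
    for (List<String> batch : batches) {
      for (String key : batch) {
        if (!held.isEmpty() && !held.get(held.size() - 1).equals(key)) {
          System.out.println("partition: " + held);  // a complete partition can be emitted
          held.clear();
        }
        held.add(key);
      }
      // The trailing run is retained here, in case the next batch continues it.
    }
    if (!held.isEmpty()) {
      System.out.println("partition: " + held);  // flush the final partition
    }
    // Prints: [US, US], then [FR, FR, FR] (glued across the two batches), then [DE].
  }
}
```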
We introduce the option to iterate over the data fetched from the dataFetcher for loadingLookups in the lookups-cached-single extension. We also add handling for the case where a key exists in Druid but not in the underlying data fetcher (in our use case, the JDBC data fetcher), in which case the returned value is null.
Signed-off-by: TessaIO <ahmedgrati1999@gmail.com>
The timeout handler should fire if the response has not been handled yet
(i.e. if responseResolved was previously false). However, it erroneously
fires only if the response *was* handled. This causes HTTP 500 errors if
the timeout actually does fire. The timeout is 30 seconds, which can be
hit during pipelined queries, if an earlier stage of the query hasn't
produced its first frame within 30 seconds.
This fixes a regression introduced in #17140.
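A hedged sketch of the inverted check; the field and method names here are illustrative, not the actual handler code:

```java
import java.util.concurrent.atomic.AtomicBoolean;

public class TimeoutCheckSketch
{
  private final AtomicBoolean responseResolved = new AtomicBoolean(false);

  // Buggy: fired only when the response had *already* been handled, which
  // turned a genuine timeout into a spurious HTTP 500.
  void onTimeoutBuggy()
  {
    if (responseResolved.getAndSet(true)) {
      resolveWithTimeout();
    }
  }

  // Fixed: fires only when this handler is the first to resolve the response,
  // i.e. when responseResolved was previously false.
  void onTimeoutFixed()
  {
    if (!responseResolved.getAndSet(true)) {
      resolveWithTimeout();
    }
  }

  void resolveWithTimeout()
  {
    // Resolve the pending HTTP response as a timeout.
  }
}
```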
Follow-up to #17214: adds implementations of substituteCombiningFactory so that more
datasketches aggregators can match projections, along with some projection tests for datasketches.
The duty group is a low-cardinality dimension and can provide insight
into whether a particular duty group is running too slowly on the Coordinator.