druid

Commit Graph

Author	SHA1	Message	Date
Gian Merlino	46cbb33428	FrameChannelMerger: Fix incorrect behavior of finished(). (#17088 ) Previously, the processor used "remainingChannels" to track the number of non-null entries of currentFrame. Now, "remainingChannels" tracks the number of channels that are unfinished. The difference is subtle. In the previous code, when an input channel was blocked upon exiting nextFrame(), the "currentFrames" entry would be null, and therefore the "remainingChannels" variable would be decremented. After the next await and call to populateCurrentFramesAndTournamentTree(), "remainingChannels" would be incremented if the channel had become unblocked after awaiting. This means that finished(), which returned true if remainingChannels was zero, would not be reliable if called between nextFrame() and the next await + populateCurrentFramesAndTournamentTree(). This patch changes things such that finished() is always reliable. This fixes a regression introduced in PR #16911, which added a call to finished() that was, at that time, unsafe.	2024-09-17 08:35:54 -07:00
Gian Merlino	50503fe0ef	MSQ: Properly report errors that occur when starting up RunWorkOrder. (#17069 ) * MSQ: Properly report errors that occur when starting up RunWorkOrder. In #17046, an exception thrown by RunWorkOrder#startAsync would be ignored and replaced with a generic CanceledFault. This patch fixes it by retaining the original error.	2024-09-17 20:32:02 +05:30
Lasse Mammen	307b8e3357	feat: json_merge expression and sql function (#17081 )	2024-09-17 18:27:34 +05:30
Gian Merlino	8af9b4729f	TableInputSpecSlicer changes to support running on Brokers. (#17074 ) * TableInputSpecSlicer changes to support running on Brokers. Changes: 1) Rename TableInputSpecSlicer to IndexerTableInputSpecSlicer, in anticipation of a new implementation being added for controllers running on Brokers. 2) Allow the context to use the WorkerManager to build the TableInputSpecSlicer, in anticipation of Brokers wanting to use this to assign segments to servers that are already serving those segments. 3) Remove unused DataSegmentTimelineView interface. 4) Add additional javadoc to DataSegmentProvider. * Style.	2024-09-17 03:51:18 -07:00
Gian Merlino	c56e23ec37	Remove workerId parameter from postWorkerError. (#17072 ) * Remove workerId parameter from postWorkerError. It was redundant to MSQErrorReport#getTaskId. * Fix javadoc.	2024-09-17 01:37:46 -07:00
Gian Merlino	2e4d596d82	MSQ: Include worker context maps in WorkOrders. (#17076 ) * MSQ: Include worker context maps in WorkOrders. This provides a mechanism to send contexts to workers in long-lived, shared JVMs that are not part of the task system. * Style, coverage.	2024-09-17 01:37:21 -07:00
Sree Charan Manamala	bb1c3c1749	Add serde for ColumnBasedRowsAndColumns to fix window queries without group by (#16658 ) Register a Ser-De for RowsAndColumns so that the window operator query running on leaf operators would be transferred properly on the wire. Would fix the empty response given by window queries without group by on the native engine.	2024-09-17 06:44:40 +02:00
Laksh Singla	bb487a4193	Support maxSubqueryBytes for window functions (#16800 ) Window queries now acknowledge maxSubqueryBytes.	2024-09-17 10:06:24 +05:30
Victoria Lim	2e2f3cf66a	docs: Refresh docs for SQL input source (#17031 ) Co-authored-by: Charles Smith <techdocsmith@gmail.com>	2024-09-16 15:52:37 -07:00
Gian Merlino	9696f0b37c	Remove close method on MSQWarningReportPublisher. (#17071 ) It didn't do anything and also wasn't called.	2024-09-16 18:38:36 +05:30
Gian Merlino	4d8015578d	Remove unused WorkerManagerClient interface. (#17073 )	2024-09-16 18:00:47 +05:30
Gian Merlino	8630974157	MSQ: Wake up the main controller thread on workerError. (#17075 ) This isn't necessary when using MSQWorkerTaskLauncher as the WorkerManager implementation, because in that case, task failure also wakes up the main thread. However, when using workers that are not task-based, we don't want to rely on the WorkerManager for this.	2024-09-16 18:00:09 +05:30
Misha	6aad9b08dd	Fix low sonatype findings (#17017 ) Fixed vulnerabilities CVE-2021-26291 : Apache Maven is vulnerable to Man-in-the-Middle (MitM) attacks. Various functions across several files, mentioned below, allow for custom repositories to use the insecure HTTP protocol. An attacker can exploit this as part of a Man-in-the-Middle (MitM) attack, taking over or impersonating a repository using the insecure HTTP protocol. Unsuspecting users may then have the compromised repository defined as a dependency in their Project Object Model (pom) file and download potentially malicious files from it. Was fixed by removing outdated tesla-aether library containing vulnerable maven-settings (v3.1.1) package, pull-deps utility updated to use maven resolver instead. sonatype-2020-0244 : The joni package is vulnerable to Man-in-the-Middle (MitM) attacks. This project downloads dependencies over HTTP due to an insecure repository configuration within the .pom file. Consequently, a MitM could intercept requests to the specified repository and replace the requested dependencies with malicious versions, which can execute arbitrary code from the application that was built with them. Was fixed by upgrading joni package to recommended 2.1.34 version	2024-09-16 16:10:25 +05:30
Gian Merlino	a8d15182a3	Additional tests for ChannelStageOutputReader. (#17050 ) The existing tests are moved into a "WithMaximalBuffering" subclass, and a new "WithMinimalBuffering" subclass is added to test cases where only a single frame is buffered.	2024-09-16 12:22:32 +05:30
Gian Merlino	33cb563ff9	Move TerminalStageSpecFactory packages. (#17049 ) * Move TerminalStageSpecFactory packages. These packages are moved from the "guice" package to the "indexing.destination" package. They make more sense here, since "guice" is reserved for Guice modules, annotations, and providers. * Rearrange imports.	2024-09-16 11:33:26 +05:30
Gian Merlino	d789723331	Remove some unnecessary JoinableFactoryWrappers. (#17051 ) * Remove some unnecessary JoinableFactoryWrappers. * Remove unused import.	2024-09-16 11:32:12 +05:30
Gian Merlino	5b7fb5fbca	Speed up FrameFileTest, SuperSorterTest. (#17068 ) * Speed up FrameFileTest, SuperSorterTest. These are two heavily parameterized tests that, together, account for about 60% of runtime in the test suite. FrameFileTest changes: 1) Cache frame files in a static, rather than building the frame file for each parameterization of the test. 2) Adjust TestArrayCursorFactory to cache the signature, rather than re-creating it on each call to getColumnCapabilities. SuperSorterTest changes: 1) Dramatically reduce the number of tests that run with "maxRowsPerFrame" = 1. These are particularly slow due to writing so many small files. Some still run, since it's useful to test edge cases, but much fewer than before. 2) Reduce the "maxActiveProcessors" axis of the test from [1, 2, 4] to [1, 3]. The aim is to reduce the number of cases while still getting good coverage of the feature. 3) Reduce the "maxChannelsPerProcessor" axis of the test from [2, 3, 8] to [2, 7]. The aim is to reduce the number of cases while still getting good coverage of the feature. 4) Use in-memory input channels rather than file channels. 5) Defer formatting of assertion failure messages until they are needed. 6) Cache the cursor factory and its signature in a static. 7) Cache sorted test rows (used for verification) in a static. * It helps to include the file. * Style.	2024-09-15 17:03:18 -07:00
Clint Wylie	73a644258d	abstract `IncrementalIndex` cursor stuff to prepare for using different "views" of the data based on the cursor build spec (#17064 ) * abstract `IncrementalIndex` cursor stuff to prepare to allow for possibility of using different "views" of the data based on the cursor build spec changes: * introduce `IncrementalIndexRowSelector` interface to capture how `IncrementalIndexCursor` and `IncrementalIndexColumnSelectorFactory` read data * `IncrementalIndex` implements `IncrementalIndexRowSelector` * move `FactsHolder` interface to separate file * other minor refactorings	2024-09-15 16:45:51 -07:00
Clint Wylie	aa6336c5cf	add DataSchema.Builder to tidy stuff up a bit (#17065 ) * add DataSchema.Builder to tidy stuff up a bit * fixes * fixes * more style fixes * review stuff	2024-09-15 11:18:34 -07:00
Akshat Jain	6ed8632420	Handle memory leaks from Mockito inline mocks (#17070 )	2024-09-15 11:17:25 -07:00
Gian Merlino	4dc5942dab	BaseWorkerClientImpl: Don't attempt to recover from a closed channel. (#17052 ) * BaseWorkerClientImpl: Don't attempt to recover from a closed channel. This patch introduces an exception type "ChannelClosedForWritesException", which allows the BaseWorkerClientImpl to avoid retrying when the local channel has been closed. This can happen in cases of cancellation. * Add some test coverage. * wip * Add test coverage. * Style.	2024-09-15 02:10:58 -07:00
Gian Merlino	6fac267f17	MSQ: Improved worker cancellation. (#17046 ) * MSQ: Improved worker cancellation. Four changes: 1) FrameProcessorExecutor now requires that cancellationIds be registered with "registerCancellationId" prior to being used in "runFully" or "runAllFully". 2) FrameProcessorExecutor gains an "asExecutor" method, which allows that executor to be used as an executor for future callbacks in such a way that respects cancellationId. 3) RunWorkOrder gains a "stop" method, which cancels the current cancellationId and closes the current FrameContext. It blocks until both operations are complete. 4) Fixes a bug in RunAllFullyWidget where "processorManager.result()" was called outside "runAllFullyLock", which could cause it to be called out-of-order with "cleanup()" in case of cancellation or other error. Together, these changes help ensure cancellation does not have races. Once "cancel" is called for a given cancellationId, all existing processors and running callbacks are canceled and exit in an orderly manner. Future processors and callbacks with the same cancellationId are rejected before being executed. * Fix test. * Use execute, which doesn't return, to avoid errorprone complaints. * Fix some style stuff. * Further enhancements. * Fix style.	2024-09-15 01:22:28 -07:00
Gian Merlino	a276871dd0	Fix call to MemoryIntrospector in IndexerControllerContext. (#17066 ) This was a logical conflict between #17057 and #17048.	2024-09-14 18:10:56 -07:00
Gian Merlino	fd6706cd6a	MSQ: Rework memory management. (#17057 ) * MSQ: Rework memory management. This patch reworks memory management to better support multi-threaded workers running in shared JVMs. There are two main changes. First, processing buffers and threads are moved from a per-JVM model to a per-worker model. This enables queries to hold processing buffers without blocking other concurrently-running queries. Changes: - Introduce ProcessingBuffersSet and ProcessingBuffers to hold the per-worker and per-work-order processing buffers (respectively). On Peons, this is the JVM-wide processing pool. On Indexers, this is a per-worker pool of on-heap buffers. (This change fixes a bug on Indexers where excessive processing buffers could be used if MSQ tasks ran concurrently with realtime tasks.) - Add "bufferPool" argument to GroupingEngine#process so a per-worker pool can be passed in. - Add "druid.msq.task.memory.maxThreads" property, which controls the maximum number of processing threads to use per task. This allows usage of multiple processing buffers per task if admins desire. - IndexerWorkerContext acquires processingBuffers when creating the FrameContext for a work order, and releases them when closing the FrameContext. - Add "usesProcessingBuffers()" to FrameProcessorFactory so workers know how many sets of processing buffers are needed to run a given query. Second, adjustments to how WorkerMemoryParameters slices up bundles, to favor more memory for sorting and segment generation. Changes: - Instead of using same-sized bundles for processing and for sorting, workers now use minimally-sized processing bundles (just enough to read inputs plus a little overhead). The rest is devoted to broadcast data buffering, sorting, and segment-building. - Segment-building is now limited to 1 concurrent segment per work order. This allows each segment-building action to use more memory. Note that segment-building is internally multi-threaded to a degree. (Build and persist can run concurrently.) - Simplify frame size calculations by removing the distinction between "standard" and "large" frames. The new default frame size is the same as the old "standard" frames, 1 MB. The original goal of the large frames was to reduce the number of temporary files during sorting, but I think we can achieve the same thing by simply merging a larger number of standard frames at once. - Remove the small worker adjustment that was added in #14117 to account for an extra frame involved in writing to durable storage. Instead, account for the extra frame whenever we are actually using durable storage. - Cap super-sorter parallelism using the number of output partitions, rather than using a hard coded cap at 4. Note that in practice, so far, this cap has not been relevant for tasks because they have only been using a single processing thread anyway. * Remove unused import. * Fix errorprone annotation. * Fixes for javadocs and inspections. * Additional test coverage. * Fix test.	2024-09-14 15:35:21 -07:00
Gian Merlino	d7be12067f	QueryResource: Don't close JSON content on error. (#17034 ) * QueryResource: Don't close JSON content on error. Following similar issues fixed in #11685 and #15880, this patch fixes a bug where QueryResource would write a closing array marker if it encountered an exception after starting to push results. This makes it difficult for callers to detect errors. The prior patches didn't catch this problem because QueryResource uses the ObjectMapper in a unique way, through writeValuesAsArray, which doesn't respect the global AUTO_CLOSE_JSON_CONTENT setting. * Fix usage of customized ObjectMappers.	2024-09-14 15:32:49 -07:00
Gian Merlino	27443a0600	Fix formatting of error message from validateNoIllegalRightyJoins. (#17061 ) The prior formatting was inconsistent in terms of punctuation and capitalization.	2024-09-14 15:20:48 -07:00
Gian Merlino	d3f86baff9	Add "targetPartitionsPerWorker" setting for MSQ. (#17048 ) As we move towards multi-threaded MSQ workers, it helps for parallelism to generate more than one partition per worker. That way, we can fully utilize all worker threads throughout all stages. The default value is the number of processing threads. Currently, this is hard-coded to 1 for peons, but that is expected to change in the future.	2024-09-13 16:01:18 -07:00
Gian Merlino	654e0b444b	MSQ: Fix two issues with phase transitions. (#17053 ) 1) ControllerQueryKernel: Update readyToReadResults to acknowledge that sorting stages can go directly from READING_INPUT to RESULTS_READY. 2) WorkerStageKernel: Ignore RESULTS_COMPLETE if work is already finished, which can happen if the transition to FINISHED comes early due to a downstream LIMIT.	2024-09-13 15:59:41 -07:00
Gian Merlino	99e8f664a9	Add "includeAllCounters()" to WorkerContext. (#17047 ) This removes the need to read it from the query context.	2024-09-13 15:47:51 -07:00
Clint Wylie	28ec962a06	add CursorHolder.isPreAggregated method to allow cursors on pre-aggregated data (#17058 ) changes: * CursorHolder.isPreAggregated method indicates that a cursor has pre-aggregated data for all AggregatorFactory specified in a CursorBuildSpec. If true, engines should rewrite the query to use AggregatorFactory.getCombiningAggreggator, and column selector factories will provide selectors with the aggregator intermediate type for the aggregator factory name * Added groupby, timeseries, and topN support for CursorHolder.isPreAggregated * Added synthetic test since no CursorHolder implementations support isPreAggregated at this point in time	2024-09-13 12:52:35 -07:00
Abhishek Radhakrishnan	7a0d7d1897	Bump up -Xmx2500m from 2GB and keep MaxDirectMemorySize as 2500m as well. (#17056 )	2024-09-13 14:54:07 +05:30
Rishabh Singh	a8c06e93aa	Skip tombstone segment refresh in metadata cache (#17025 ) This PR #16890 introduced a change to skip adding tombstone segments to the cache. It turns out that as a side effect tombstone segments appear unavailable in the console. This happens because availability of a segment in Broker is determined from the metadata cache. The fix is to keep the segment in the metadata cache but skip them from refresh. This doesn't affect any functionality as metadata query for tombstone returns empty causing continuous refresh of those segments.	2024-09-13 11:47:11 +05:30
Akshat Jain	fff3e81dcc	Add window function drill tests for array_concat_agg for empty over scenarios (#17026 ) * Add window function drill tests for array_concat_agg for empty over scenarios * Cleanup sqlNativeIncompatible() as it's not needed now * Address review comment	2024-09-13 11:35:45 +05:30
Abhishek Radhakrishnan	668169d9a9	Provide `chmod` command for `-XX:OnOutOfMemoryError` from shell script (#17054 ) A command line arg -XX:OnOutOfMemoryError='chmod 644 ${project.parent.basedir}/target/.hprof' was added to collect heap dumps: #17029 This arg is causing problems when running tests from Intellij. Intellij doesn't seem to like chmod 644, but this command works as expected in mvn. So as a workaround, add the chmod 644 ${BASE_DIR/target/.hprof' command in a shell script that can then be executed when OnOutOfMemoryError happens to make Intellij happy.	2024-09-13 00:17:28 -04:00
Abhishek Radhakrishnan	5ef94c9dee	Add support for selective loading of broadcast datasources in the task layer (#17027 ) Tasks control the loading of broadcast datasources via BroadcastDatasourceLoadingSpec getBroadcastDatasourceLoadingSpec(). By default, tasks download all broadcast datasources, unless there's an override as with kill and MSQ controller task. The CLIPeon command line option --loadBroadcastSegments is deprecated in favor of --loadBroadcastDatasourceMode. Broadcast datasources can be specified in SQL queries through JOIN and FROM clauses, or obtained from other sources such as lookups. To this effect, we have introduced a BroadcastDatasourceLoadingSpec. Finding the set of broadcast datasources during SQL planning will be done in a follow-up, which will apply only to MSQ tasks, so they load only required broadcast datasources. This PR primarily focuses on the skeletal changes around BroadcastDatasourceLoadingSpec and integrating it from the Task interface via CliPeon to SegmentBootstrapper. Currently, only kill tasks and MSQ controller tasks skip loading broadcast datasources.	2024-09-12 13:30:28 -04:00
Adithya Chakilam	6ef8d5d8e1	OshiSysMonitor: Add ability to skip emitting metrics (#16972 ) * OshiSysMonitor: Add ability to skip emitting metrics * comments * static checks * remove oshi	2024-09-12 11:32:31 -04:00
Abhishek Radhakrishnan	c077daaade	GHA steps to collect and upload heap dumps to debug UT OOM errors (#17029 ) * Add GHA steps to tar and upload any heap dumps on failure to debug UT OOM issues. * Add jvm options to heap dump OnOutOfMemoryError Co-authored-by: Elliott Freis <108356317+imply-elliott@users.noreply.github.com> --------- Co-authored-by: Elliott Freis <108356317+imply-elliott@users.noreply.github.com>	2024-09-12 09:06:35 -04:00
Laksh Singla	d3392a23ce	Cancel the group by processing tasks if the merging runner gets scheduled post the query timeout (#17037 ) If the GroupByMergingQueryRunner gets scheduled after the query timeout, it fails to clean up the processing tasks that have been scheduled. This can lead to unnecessary processing being done for the tasks whose results won't get consumed.	2024-09-12 15:10:27 +05:30
Pranav	a95397e712	Allow request headers in HttpInputSource in native and MSQ Ingestion (#16974 ) Support for adding the request headers in http input source. We can now pass the additional headers as JSON in both native and MSQ.	2024-09-12 11:18:44 +05:30
Rishabh Singh	a18f582ef0	Skip refresh for unused segments in metadata cache (#16990 ) * Skip refresh for unused segments in metadata cache * Cover the condition where a used segment missing schema is marked for refresh * Fix test	2024-09-12 10:39:59 +05:30
George Shiqi Wu	428f58cf15	Support maxColumnsToMerge in supervisor tuningConfig (#17030 ) * support maxColumnsToMerge in supervisor specs * remove log line * fix style * add docs * fix unit tests	2024-09-11 18:00:13 -04:00
Vadim Ogievetsky	9e1544e9c4	Fix maxRowsInMemory default for streaming (#17028 ) * fix maxRowsInMemory * fix button css	2024-09-11 08:43:00 -07:00
Sébastien	5de84253d8	Web console query view improvements (#16991 ) * Made maxNumTaskOptions configurable in the Query view * Updated the copy for taskAssignment options * Reordered options in engine menu for msq engine * fixed snapshot * maxNumTaskOptions -> maxTasksOptions * added back select destination item * fixed duplicate menu item * snapshot * Added the ability to hide certain engine menu options * Added the ability to hide/show more menu items * -> fn * -> fn	2024-09-10 11:34:49 -07:00
aho135	2427972c10	Implement segment range threshold for automatic query prioritization (#17009 ) Implements threshold based automatic query prioritization using the time period of the actual segments scanned. This differs from the current implementation of durationThreshold which uses the duration in the user supplied query. There are some usability constraints with using durationThreshold from the user supplied query, especially when using SQL. For example, if a client does not explicitly specify both start and end timestamps then the duration is extremely large and will always exceed the configured durationThreshold. This is one example interval from a query that specifies no end timestamp: "interval":["2024-08-30T08:05:41.944Z/146140482-04-24T15:36:27.903Z"]. This interval is generated from a query like SELECT * FROM table WHERE __time > CURRENT_TIMESTAMP - INTERVAL '15' HOUR. Using the time period of the actual segments scanned allows proper prioritization without explicitly having to specify start and end timestamps. This PR adds onto #9493	2024-09-10 15:01:52 +05:30
Sree Charan Manamala	c7c3307e61	Fix String Frame Readers to read String Arrays correctly (#16885 ) While writing to a frame, String arrays are written by setting the multivalue byte. But while reading, it was hardcoded to false.	2024-09-10 14:20:54 +05:30
Laksh Singla	72fbaf2e56	Non querying tasks shouldn't use processing buffers / merge buffers (#16887 ) Tasks that do not support querying or query processing i.e. supportsQueries = false do not require processing threads, processing buffers, and merge buffers.	2024-09-10 11:36:36 +05:30
Abhishek Agarwal	78775ad398	Prepare master for 32.0.0 release (#17022 )	2024-09-10 11:01:20 +05:30
Clint Wylie	f57cd6f7af	transition away from StorageAdapter (#16985 ) * transition away from StorageAdapter changes: * CursorHolderFactory has been renamed to CursorFactory and moved off of StorageAdapter, instead fetched directly from the segment via 'asCursorFactory'. The previous deprecated CursorFactory interface has been merged into StorageAdapter * StorageAdapter is no longer used by any engines or tests and has been marked as deprecated with default implementations of all methods that throw exceptions indicating the new methods to call instead * StorageAdapter methods not covered by CursorFactory (CursorHolderFactory prior to this change) have been moved into interfaces which are retrieved by Segment.as, the primary classes are the previously existing Metadata, as well as new interfaces PhysicalSegmentInspector and TopNOptimizationInspector * added UnnestSegment and FilteredSegment that extend WrappedSegmentReference since their StorageAdapter implementations were previously provided by WrappedSegmentReference * added PhysicalSegmentInspector which covers some of the previous StorageAdapter functionality which was primarily used for segment metadata queries and other metadata uses, and is implemented for QueryableIndexSegment and IncrementalIndexSegment * added TopNOptimizationInspector to cover the oddly specific StorageAdapter.hasBuiltInFilters implementation, which is implemented for HashJoinSegment, UnnestSegment, and FilteredSegment * Updated all engines and tests to no longer use StorageAdapter	2024-09-09 14:55:29 -07:00
Abhishek Radhakrishnan	f4261c0e4d	Add Delta snapshot version to the web-console (#17023 ) * Web-console change to add Delta snapshot version. Web-console change for https://github.com/apache/druid/pull/17004. * Update web-console/src/druid-models/input-source/input-source.tsx * Update web-console/src/druid-models/ingestion-spec/ingestion-spec.tsx	2024-09-09 11:49:53 -07:00
Abhishek Radhakrishnan	aa833a711c	Support for reading Delta Lake table snapshots (#17004 ) Problem Currently, the delta input source only supports reading from the latest snapshot of the given Delta Lake table. This is a known documented limitation. Description Add support for reading Delta snapshot. By default, the Druid-Delta connector reads the latest snapshot of the Delta table in order to preserve compatibility. Users can specify a snapshotVersion to ingest change data events from Delta tables into Druid. In the future, we can also add support for time-based snapshot reads. The Delta API to read time-based snapshots is not clear currently.	2024-09-09 14:12:48 +05:30

14434 Commits