druid

Commit Graph

Author	SHA1	Message	Date
Adarsh Sanjeev	0335aaa279	Add query results directory and prevent the auto cleaner from cleaning it (#14446 ) Adds support for automatic cleaning of a "query-results" directory in durable storage. This directory will be cleaned up only if the task id is not known to the overlord. This will allow the storage of query results after the task has finished running.	2023-06-28 10:14:04 +05:30
Abhishek Radhakrishnan	2cfb00b1de	Add missing `isNull()` implementation to `FilteredAggregator` (#14465 )	2023-06-27 16:35:15 -07:00
Gian Merlino	c78d885b80	Cache parsed expressions and binding analysis in more places. (#14124 ) * Cache parsed expressions and binding analysis in more places. Main changes: 1) Cache parsed and analyzed expressions within PlannerContext for a single SQL query. 2) Cache parsed expressions together with input binding analysis using a new class AnalyzeExpr. This speeds up SQL planning, because SQL planning involves parsing analyzing the same expression strings over and over again. * Fixes. * Fix style. * Fix test. * Simplify: get rid of AnalyzedExpr, focus on caching. * Rename parse -> parseExpression.	2023-06-27 13:40:35 -07:00
imply-cheddar	2f0a43790c	Make GuavaUtilsTest use less CPU (#14487 )	2023-06-26 21:45:29 -07:00
Clint Wylie	6ba10c8b6c	fix bug with json_value expression array extraction (#14461 )	2023-06-26 21:02:44 -07:00
Laksh Singla	f546cd64a9	MSQ: Ensure that the allocated segment aligns with the requested granularity (#14475 ) Changes: - Throw an `InsertCannotAllocateSegmentFault` if the allocated segment is not aligned with the requested granularity. - Tests to verify new behaviour	2023-06-27 09:25:32 +05:30
Laksh Singla	114380749d	MSQ: Improve the parse exception errors and the handling of null UTF characters in Strings in Frames (#14398 )	2023-06-26 18:14:29 +05:30
Laksh Singla	1647d5f4a0	Limit the subquery results by memory usage (#13952 ) Users can now add a guardrail to prevent subquery’s results from exceeding the set number of bytes by setting druid.server.http.maxSubqueryRows in Broker's config or maxSubqueryRows in the query context. This feature is experimental for now and would default back to row-based limiting in case it fails to get the accurate size of the results consumed by the query.	2023-06-26 18:12:28 +05:30
Gian Merlino	970288067a	Fix flaky HttpEmitterConfigTest and ParametrizedUriEmitterConfigTest. (#14481 ) Recently, we have seen flakiness in these two tests, apparently due to computations based on Runtime.getRuntime().maxMemory() differing during static initialization and in the actual tests. I can't think of a reason why this would be happening, but anyway, this patch switches the tests to use the statics instead of recomputing Runtime.getRuntime().maxMemory().	2023-06-23 16:27:11 -07:00
imply-cheddar	7e2cf35d7b	Fix compatibility issue with SqlTaskResource (#14466 ) * Fix compatibility issue with SqlTaskResource The DruidException changes broke the response format for errors coming back from the SqlTaskResource, so fix those	2023-06-23 01:15:32 -07:00
Clint Wylie	31b9d5695d	Extend InitializedNullHandlingTest instead of NullHandlingTest (#14467 ) NullHandlingTest is an actual test, it shouldn't be used as a base class	2023-06-22 15:01:50 +05:30
Hardik Bajaj	1ea9158a50	Added new SysMonitorOshi v0 using Oshi library (#14359 ) Added a new monitor SysMonitorOshi to replace SysMonitor. The new monitor has a wider support for different machine architectures including ARM instances. Please switch to SysMonitorOshi as SysMonitor is now deprecated and will be removed in future releases.	2023-06-20 20:57:58 +05:30
Kashif Faraz	50461c3bd5	Enable smartSegmentLoading on the Coordinator (#13197 ) This commit does a complete revamp of the coordinator to address problem areas: - Stability: Fix several bugs, add capabilities to prioritize and cancel load queue items - Visibility: Add new metrics, improve logs, revamp `CoordinatorRunStats` - Configuration: Add dynamic config `smartSegmentLoading` to automatically set optimal values for all segment loading configs such as `maxSegmentsToMove`, `replicationThrottleLimit` and `maxSegmentsInNodeLoadingQueue`. Changed classes: - Add `StrategicSegmentAssigner` to make assignment decisions for load, replicate and move - Add `SegmentAction` to distinguish between load, replicate, drop and move operations - Add `SegmentReplicationStatus` to capture current state of replication of all used segments - Add `SegmentLoadingConfig` to contain recomputed dynamic config values - Simplify classes `LoadRule`, `BroadcastRule` - Simplify the `BalancerStrategy` and `CostBalancerStrategy` - Add several new methods to `ServerHolder` to track loaded and queued segments - Refactor `DruidCoordinator` Impact: - Enable `smartSegmentLoading` by default. With this enabled, none of the following dynamic configs need to be set: `maxSegmentsToMove`, `replicationThrottleLimit`, `maxSegmentsInNodeLoadingQueue`, `useRoundRobinSegmentAssignment`, `emitBalancingStats` and `replicantLifetime`. - Coordinator reports richer metrics and produces cleaner and more informative logs - Coordinator uses an unlimited load queue for all serves, and makes better assignment decisions	2023-06-19 14:27:35 +05:30
imply-cheddar	cfd07a95b7	Errors take 3 (#14004 ) Introduce DruidException, an exception whose goal in life is to be delivered to a user. DruidException itself has javadoc on it to describe how it should be used. This commit both introduces the Exception and adjusts some of the places that are generating exceptions to generate DruidException objects instead, as a way to show how the Exception should be used. This work was a 3rd iteration on top of work that was started by Paul Rogers. I don't know if his name will survive the squash-and-merge, so I'm calling it out here and thanking him for starting on this.	2023-06-19 01:11:13 -07:00
Adarsh Sanjeev	128133fadc	Add column replication_factor column to sys.segments table (#14403 ) Description: Druid allows a configuration of load rules that may cause a used segment to not be loaded on any historical. This status is not tracked in the sys.segments table on the broker, which makes it difficult to determine if the unavailability of a segment is expected and if we should not wait for it to be loaded on a server after ingestion has finished. Changes: - Track replication factor in `SegmentReplicantLookup` during evaluation of load rules - Update API `/druid/coordinator/v1metadata/segments` to return replication factor - Add column `replication_factor` to the sys.segments virtual table and populate it in `MetadataSegmentView` - If this column is 0, the segment is not assigned to any historical and will not be loaded.	2023-06-18 10:02:21 +05:30
George Shiqi Wu	64af9bfe5b	Add groupId to metrics (#14402 ) * Add group id as a dimension * Revert changes * Add to forking task runner * Add missing metrics * Fix indenting * revert metrics * Fix indentation	2023-06-16 09:28:16 -07:00
Clint Wylie	359bd63cc9	allow expression "best effort" type determination to better handle mixed type arrays (#14438 )	2023-06-16 00:02:43 -07:00
Clint Wylie	ff5ae4db6c	fix kafka input format reader schema discovery and partial schema discovery (#14421 ) * fix kafka input format reader schema discovery and partial schema discovery to actually work right, by re-using dimension filtering logic of MapInputRowParser	2023-06-15 00:11:04 -07:00
Clint Wylie	ca116cf886	adjust broker parallel merge to help managed blocking be more well behaved (#14427 )	2023-06-15 00:10:31 -07:00
Pranav	e426d370ea	Start with solo accumulator and empty partition (#14426 ) * Starting parallel merge with solo accumulator and empty partitions * shutshown pool in test	2023-06-14 16:20:48 -07:00
Clint Wylie	8454cc619a	auto columns fixes (#14422 ) changes: * auto columns no longer participate in generic 'null column' handling, this was a mistake to try to support and caused ingestion failures due to mismatched ColumnFormat, and will be replaced in the future with nested common format constant column functionality (not in this PR) * fix bugs with auto columns which contain empty objects, empty arrays, or primitive types mixed with either of these empty constructs * fix bug with bound filter when upper is null equivalent but is strict	2023-06-14 08:57:06 -07:00
Abhishek Radhakrishnan	be5a6593a9	Reset `RuntimeInfo` to fix flaky test `ParametrizedUriEmitterConfigTest`. (#14405 ) * Add injector so JVM settings are correctly set up and bound for the test. * Add VisibleForTesting IDE annotation. * spacing	2023-06-13 18:07:51 -07:00
Clint Wylie	61120dc49a	fix Kafka input format to throw ParseException if timestamp is missing (#14413 )	2023-06-13 09:00:11 -07:00
Adarsh Sanjeev	267cbac6ff	Add logs for deleting files using storage connector (#14350 ) * Add logs for deleting files using storage connector * Address review comments * Update log message format	2023-06-11 21:24:30 +05:30
Kashif Faraz	6e158704cb	Do not retry INSERT task into metadata if max_allowed_packet limit is violated (#14271 ) Changes - Add a `DruidException` which contains a user-facing error message, HTTP response code - Make `EntryExistsException` extend `DruidException` - If metadata store max_allowed_packet limit is violated while inserting a new task, throw `DruidException` with response code 400 (bad request) to prevent retries - Add `SQLMetadataConnector.isRootCausePacketTooBigException` with impl for MySQL	2023-06-10 12:15:44 +05:30
imply-cheddar	87149d5975	Remove AbstractIndex (#14388 ) The class apparently only exists to add a toString() method to Indexes, which basically just crashes any debugger on any meaningfully sized index. It's a pointless abstract class that basically only causes pain.	2023-06-08 19:52:16 -07:00
Harini Rajendran	4ff6026d30	Adding SegmentMetadataEvent and publishing them via KafkaEmitter (#14281 ) In this PR, we are enhancing KafkaEmitter, to emit metadata about published segments (SegmentMetadataEvent) into a Kafka topic. This segment metadata information that gets published into Kafka, can be used by any other downstream services to query Druid intelligently based on the segments published. The segment metadata gets published into kafka topic in json string format similar to other events.	2023-06-02 21:28:26 +05:30
zachjsh	e75fb8e8e3	Account for data format and compression in MSQ auto taskAssignment (#14307 ) ### Description This change allows for consideration of the input format and compression when computing how to split the input files among available tasks, in MSQ ingestion, when considering the value of the `maxInputBytesPerWorker` query context parameter. This query parameter allows users to control the maximum number of bytes, with granularity of input file / object, that ingestion tasks will be assigned to ingest. With this change, this context parameter now denotes the estimated weighted size in bytes of the input to split on, with consideration for input format and compression format, rather than the actual file size, reported by the file system. We assume uncompressed newline delimited json as a baseline, with scaling factor of `1`. This means that when computing the byte weight that a file has towards the input splitting, we take the file size as is, if uncompressed json, 1:1. It was found during testing that gzip compressed json, and parquet, has scale factors of `4` and `8` respectively, meaning that each byte of data is weighted 4x and 8x respectively, when computing input splits. This weighted byte scaling is only considered for MSQ ingestion that uses either LocalInputSource or CloudObjectInputSource at the moment. The default value of the `maxInputBytesPerWorker` query context parameter has been updated from 10 GiB, to 512 MiB	2023-06-01 12:53:49 -07:00
Clint Wylie	4096f51f0b	add configurable ColumnTypeMergePolicy to SegmentMetadataCache (#14319 ) This PR adds a new interface to control how SegmentMetadataCache chooses ColumnType when faced with differences between segments for SQL schemas which are computed, exposed as druid.sql.planner.metadataColumnTypeMergePolicy and adds a new 'least restrictive type' mode to allow choosing the type that data across all segments can best be coerced into and sets this as the default behavior. This is a behavior change around when segment driven schema migrations take effect for the SQL schema. With latestInterval, the SQL schema will be updated as soon as the first job with the new schema has published segments, while using leastRestrictive, the schema will only be updated once all segments are reindexed to the new type. The benefit of leastRestrictive is that it eliminates a bunch of type coercion errors that can happen in SQL when types are varied across segments with latestInterval because the newest type is not able to correctly represent older data, such as if the segments have a mix of ARRAY and number types, or any other combinations that lead to odd query plans.	2023-05-24 20:32:51 +05:30
Soumyava	22ba457d29	Expr getCacheKey now delegates to children (#14287 ) * Expr getCacheKey now delegates to children * Removed the LOOKUP_EXPR_CACHE_KEY as we do not need it * Adding an unit test * Update processing/src/main/java/org/apache/druid/math/expr/Expr.java Co-authored-by: Clint Wylie <cjwylie@gmail.com> --------- Co-authored-by: Clint Wylie <cjwylie@gmail.com>	2023-05-23 14:49:38 -07:00
Abhishek Radhakrishnan	a5e04d95a4	Add `TYPE_NAME` to the complex serde classes and replace the hardcoded names. (#14317 ) * Add TYPE_NAME to the serde classes and reuse them instead of hardcoded strings. * Static check fixes.	2023-05-23 00:54:47 -05:00
Clint Wylie	d92b9fbfac	more resilient segment metadata, dont parallel merge internal segment metadata queries (#14296 )	2023-05-17 04:12:55 -07:00
Clint Wylie	b038a11280	fix issues with handling arrays with all null elements and arrays of booleans in strict mode (#14297 )	2023-05-17 01:33:44 -07:00
Soumyava	96a3c00754	Fixing an issue with filtering on a single dimension by converting In… (#14277 ) * Fixing an issue with filtering on a single dimension by converting In filter to a selector filter as needed with Filters.toFilter * Adding a test so that any future refactoring does not break this behavior * Made comment a bit more meaningful	2023-05-15 20:10:36 -07:00
imply-cheddar	f9861808bc	Be able to load segments on Peons (#14239 ) * Be able to load segments on Peons This change introduces a new config on WorkerConfig that indicates how many bytes of each storage location to use for storage of a task. Said config is divided up amongst the locations and slots and then used to set TaskConfig.tmpStorageBytesPerTask The Peons use their local task dir and tmpStorageBytesPerTask as their StorageLocations for the SegmentManager such that they can accept broadcast segments.	2023-05-12 16:51:00 -07:00
Kashif Faraz	ba11b3d462	Refactor: Add OverlordDuty to replace OverlordHelper and align with CoordinatorDuty (#14235 ) Changes: - Replace `OverlordHelper` with `OverlordDuty` to align with `CoordinatorDuty` - Each duty has a `run()` method and defines a `Schedule` with an initial delay and period. - Update existing duties `TaskLogAutoCleaner` and `DurableStorageCleaner` - Add utility class `Configs` - Update log, error messages and javadocs - Other minor style improvements	2023-05-12 22:39:56 +05:30
Clint Wylie	9875090bee	fix segment metadata queries for auto ingested columns that had all null values (#14262 )	2023-05-11 20:58:06 -07:00
Soumyava	f128b9b666	Updates to filter processing for inner query in Joins (#14237 )	2023-05-11 17:21:41 +05:30
Clint Wylie	a58cebe491	add array_to_mv function to convert arrays into mvds to assist with migration from mvds to arrays (#14236 )	2023-05-11 04:43:28 -07:00
Kashif Faraz	64e6283eca	Do not allow retention rules to be null (#14223 ) Changes: - Do not allow retention rules for any datasource or cluster to be null - Allow empty rules at the datasource level but not at the cluster level - Add validation to ensure that `druid.manager.rules.defaultRule` is always set correctly - Minor style refactors	2023-05-11 14:33:56 +05:30
Clint Wylie	aaaff74740	fix npe regression in json_value when filtering non-existent paths (#14250 ) * fix npe regression in json_value when filtering non-existent paths * more coverage	2023-05-10 22:39:22 -07:00
Clint Wylie	6db11bfc60	suppress some cves and fix javadoc build when using java 17 (#14241 )	2023-05-10 15:47:10 -07:00
Clint Wylie	8805d8d7db	fix issues with filtering nulls on values coerced to numeric types (#14139 ) * fix issues with filtering nulls on values coerced to numeric types * fix issues with 'auto' type numeric columns in default value mode * optimize variant typed columns without nested data * more tests for 'auto' type column ingestion	2023-05-08 13:19:02 -07:00
Clint Wylie	a7a4bfd331	modify QueryScheduler to lazily acquire lanes when executing queries to avoid leaks (#14184 ) This PR fixes an issue that could occur if druid.query.scheduler.numThreads is configured and any exception occurs after QueryScheduler.run has been called to create a Sequence. This would result in total and/or lane specific locks being acquired, but because the sequence was not actually being evaluated, the "baggage" which typically releases these locks was not being executed. An example of how this can happen is if a group-by having filter, which wraps and transforms this sequence happens to explode while wrapping the sequence. The end result is that the locks are acquired, but never released, eventually halting the ability to execute any queries.	2023-05-08 11:42:05 +05:30
Clint Wylie	90ea192d9c	fix bugs with auto encoded long vector deserializers (#14186 ) This PR fixes an issue when using 'auto' encoded LONG typed columns and the 'vectorized' query engine. These columns use a delta based bit-packing mechanism, and errors in the vectorized reader would cause it to incorrectly read column values for some bit sizes (1 through 32 bits). This is a regression caused by #11004, which added the optimized readers to improve performance, so impacts Druid versions 0.22.0+. While writing the test I finally got sad enough about IndexSpec not having a "builder", so I made one, and switched all the things to use it. Apologies for the noise in this bug fix PR, the only real changes are in VSizeLongSerde, and the tests that have been modified to cover the buggy behavior, VSizeLongSerdeTest and ExpressionVectorSelectorsTest. Everything else is just cleanup of IndexSpec usage.	2023-05-01 11:49:27 +05:30
Suneet Saldanha	84c11df980	Make LoggingEmitter more useful by using Markers (#14121 ) * Make LoggingEmitter more useful * Skip code coverage for facade classes * fix spellcheck * code review * fix dependency * logging.md * fix checkstyle * Add back jacoco version to main pom	2023-04-27 15:06:06 -07:00
Adarsh Sanjeev	5aa119dfda	Add retry to opening retrying stream (#14126 ) * Add retry to opening retrying stream * Add retry to S3Entity for network issues * Fix tests and clean up code	2023-04-27 16:52:22 +05:30
Gian Merlino	42c8c84eb6	TimeBoundary: Use cursor when datasource is not a regular table. (#14151 ) * TimeBoundary: Use cursor when datasource is not a regular table. Fixes a bug where TimeBoundary could return incorrect results with INNER Join or inline data. * Addl Javadocs.	2023-04-26 17:00:13 -07:00
Gian Merlino	752475b799	Fix two concurrency issues with segment fetching. (#14042 ) * Fix two concurrency issues with segment fetching. 1) SegmentLocalCacheManager: Fix a concurrency issue where certain directory cleanup happened outside of directoryWriteRemoveLock. This created the possibility that segments would be deleted by one thread, while being actively downloaded by another thread. 2) TaskDataSegmentProcessor (MSQ): Fix a concurrency issue when two stages in the same process both use the same segment. For example: a self-join using distributed sort-merge. Prior to this change, the two stages could delete each others' segments. 3) ReferenceCountingResourceHolder: increment() returns a new ResourceHolder, rather than a Releaser. This allows it to be passed to callers without them having to hold on to both the original ResourceHolder and a Releaser. 4) Simplify various interfaces and implementations by using ResourceHolder instead of Pair and instead of split-up fields. * Add test. * Fix style. * Remove Releaser. * Updates from master. * Add some GuardedBys. * Use the correct GuardedBy. * Adjustments.	2023-04-25 20:49:27 -07:00
Gian Merlino	2dfb693d4c	Improved handling for zero-length intervals. (#14136 ) * Improved handling for zero-length intervals. 1) Return an empty list from VersionedIntervalTimeline.lookup when provided with an empty interval. (The logic doesn't quite work when intervals are empty, which led to #14129.) 2) Don't return zero-length intervals from JodaUtils.condenseIntervals. 3) Detect "incorrect" comparator in JodaUtils.condenseIntervals, and recreate the SortedSet if needed. (Not strictly related to the theme of this patch. Just another thing in the same file.) 4) Remove unused method JodaUtils.containOverlappingIntervals. Fixes #14129. * Fix TimewarpOperatorTest.	2023-04-25 17:12:56 -07:00

1 2 3 4 5 ...

2816 Commits