druid

Commit Graph

Author	SHA1	Message	Date
Laksh Singla	8f102f9031	Introduce StorageConnector for Azure (#14660 ) The Azure connector is introduced and MSQ's fault tolerance and durable storage can now be used with Microsoft Azure's blob storage. Also, the results of newly introduced queries from deep storage can now store and fetch the results from Azure's blob storage.	2023-08-09 12:25:27 +00:00
Tejaswini Bandlamudi	a45b25fa1d	Removes support for Hadoop 2 (#14763 ) Removing Hadoop 2 support as discussed in https://lists.apache.org/list?dev@druid.apache.org:lte=1M:hadoop	2023-08-09 17:47:52 +05:30
Karan Kumar	cd817fc469	Fixing typo in `resultsTruncated` (#14779 )	2023-08-08 20:51:44 -07:00
Suneet Saldanha	b624a4ec4a	Rolling Supervisor restarts at taskDuration (#14396 ) * Rolling supervisor task publishing * add an option for number of task groups to roll over * better * remove docs * oops * checkstyle * wip test * undo partial test change * remove incomplete test	2023-08-07 16:24:32 -07:00
George Shiqi Wu	14940dc3ed	Add pod name to TaskLocation for easier observability and debugging. (#14758 ) * Add pod name to location * Add log * fix style * Update extensions-contrib/kubernetes-overlord-extensions/src/main/java/org/apache/druid/k8s/overlord/KubernetesPeonLifecycle.java Co-authored-by: Suneet Saldanha <suneet@apache.org> * Fix unit tests --------- Co-authored-by: Suneet Saldanha <suneet@apache.org>	2023-08-07 12:33:35 -07:00
Adarsh Sanjeev	56ab81f381	Add support for different result formats to MSQ SqlStatementResource (#14571 ) * Add support for different result format * Add tests * Add tests * Fix checkstyle * Remove changes to destination * Removed some unwanted code * Address review comments * Rename parameter * Fix tests	2023-08-07 20:48:59 +05:30
imply-cheddar	748874405c	Minimize PostAggregator computations (#14708 ) * Minimize PostAggregator computations Since a change back in 2014, the topN query has been computing all PostAggregators on all intermediate responses from leaf nodes to brokers. This generates significant slowdowns for queries with relatively expensive PostAggregators. This change rewrites the query that is pushed down to only have the minimal set of PostAggregators such that it is impossible for downstream processing to do too much work. The final PostAggregators are applied at the very end.	2023-08-04 00:04:31 +05:30
YongGang	20c48b6a3d	Retry S3 task log fetch in case of transient S3 exceptions (#14714 )	2023-08-03 19:46:10 +05:30
Adarsh Sanjeev	6837a7be19	Add logging for downsampling sketches in MSQ (#14580 ) * Add more logs for downsampling sketches * Fix builds * Lower log level * Add new log message	2023-08-02 20:07:54 +05:30
Gian Merlino	72c151a192	MSQ WorkerImpl: Ignore ServiceClosedException on postCounters. (#14707 ) * MSQ WorkerImpl: Ignore ServiceClosedException on postCounters. A race can happen where postCounters is in flight while the controller goes offline. When this happens, we should ignore the ServiceClosedException and continue without posting counters. * Fix style and logic.	2023-08-02 07:00:10 +05:30
Pranav	8a10b46dd8	Adding the PropertyNamingStrategies from jackson for fixing hadoop ingestion (#14671 )	2023-08-01 20:02:43 +05:30
Gian Merlino	5387f1bac0	Remove chatAsync parameter, so chat is always async. (#14692 ) * Remove chatAsync parameter, so chat is always async. chatAsync has been made default in Druid 26. I have seen good battle-testing of it in production, and am comfortable removing the older sync client. This was the last remaining usage of IndexTaskClient, so this patch deletes all that stuff too. * Remove unthrown exception. * Remove unthrown exception. * No more TimeoutException.	2023-07-31 19:42:51 -07:00
Adarsh Sanjeev	339b8d959f	Change the default format from OBJECT to OBJECTLINES (#14700 )	2023-07-31 18:39:58 +00:00
Adarsh Sanjeev	21d023b62b	Handle taskIds which are not found in the overlord correctly (#14706 ) This PR fixes a bug in the SqlStatementAPI where if the task is not found on the overlord, the response status is 500. This changes the response to invalid input since the queryID passed is not valid.	2023-07-31 21:38:14 +05:30
Laksh Singla	8232c03667	[MSQ] Handle dimensionless group by queries with partitioning, and multiple workers (#14678 ) * fixup * add ut * review	2023-07-29 07:15:17 +05:30
Gian Merlino	6517fc2796	Save a metadata call when reading files from CloudObjectInputSource. (#14677 ) * Save a metadata call when reading files from CloudObjectInputSource. The call to createSplits(inputFormat, null) in formattableReader would use the default split hint spec, MaxSizeSplitHintSpec, which makes getObjectMetadata calls in order to compute its splits. This isn't necessary; we're just trying to unpack the files inside the input source. To fix this, use FilePerSplitHintSpec to extract files without any funny business. * Adjust call. * Fix constant. * Test coverage.	2023-07-28 13:31:03 -07:00
Gian Merlino	46ecc6b900	Frames support for string arrays that are null. (#14653 ) * Frames support for string arrays that are null. The row format represents null arrays as 0x0001, which older readers would interpret as an empty array. This provides compatibility with older readers, which is useful during updates. The column format represents null arrays by writing -(actual length) - 1 instead of the length, and using FrameColumnWriters.TYPE_STRING_ARRAY for the type code for string arrays generally. Older readers will report this as an unrecognized type code. Column format is only used by the operator query, which is currently experimental, so the impact isn't too severe. * Remove unused import. * Return Object[] instead of List from frame array selectors. Update MSQSelectTest and MSQInsertTest to reflect the fact that null arrays are possible. Add a bunch of javadocs to object selectors describing expected behavior, including the requirement that array selectors return Object[]. * update test case. * Update test cases.	2023-07-28 10:23:39 -07:00
TSFenwick	9a9038c7ae	Speed up kill tasks by deleting segments in batch (#14131 ) * allow for batched delete of segments instead of deleting segment data one by one create new batchdelete method in datasegment killer that has default functionality of iterating through all segments and calling delete on them. This will enable a slow rollout of other deepstorage implementations to move to a batched delete on their own time * cleanup batchdelete segments * batch delete with the omni data deleter cleaned up code just need to add tests and docs for this functionality * update java doc to explain how it will try to use batch if function is overwritten * rename killBatch to kill add unit tests * add omniDataSegmentKillerTest for deleting multiple segments at a time. fix checkstyle * explain test peculiarity better * clean up batch kill in s3. * remove unused return value. cleanup comments and fix checkstyle * default to batch delete. more specific java docs. list segments that couldn't be deleted if there was a client error or server error * simplify error handling * add tests where an exception is thrown when killing multiple s3 segments * add test for failing to delete two calls with the s3 client * fix javadoc for kill(List<DataSegment> segments) clean up tests remove feature flag * fix typo in javadocs * fix test failure * fix checkstyle and improve tests * fix intellij inspections issues * address comments, make delete multiple segments not assume same bucket * fix test errors * better grammar and punctuation. fix test. and better logging for exception * remove unused code * avoid extra arraylist instantiation * fix broken test * fix broken test * fix tests to use assert.throws	2023-07-27 15:34:44 -07:00
Gian Merlino	986a271a7d	Merge core CoordinatorClient with MSQ CoordinatorServiceClient. (#14652 ) * Merge core CoordinatorClient with MSQ CoordinatorServiceClient. Continuing the work from #12696, this patch merges the MSQ CoordinatorServiceClient into the core CoordinatorClient, yielding a single interface that serves both needs and is based on the ServiceClient RPC system rather than DruidLeaderClient. Also removes the backwards-compatibility code for the handoff API in CoordinatorBasedSegmentHandoffNotifier, because the new API was added in 0.14.0. That's long enough ago that we don't need backwards compatibility for rolling updates. * Fixups. * Trigger GHA. * Remove unnecessary retrying in DruidInputSource. Add "about an hour" retry policy and h * EasyMock	2023-07-27 13:23:37 -07:00
Gian Merlino	2f9619a96f	Use OverlordClient for all Overlord RPCs. (#14581 ) * Use OverlordClient for all Overlord RPCs. Continuing the work from #12696, this patch removes HttpIndexingServiceClient and the IndexingService flavor of DruidLeaderClient completely. All remaining usages are migrated to OverlordClient. Supporting changes include: 1) Add a variety of methods to OverlordClient. 2) Update MetadataTaskStorage to skip the complete-task lookup when the caller requests zero completed tasks. This helps performance of the "get active tasks" APIs, which don't want to see complete ones. * Use less forbidden APIs. * Fixes from CI. * Add test coverage. * Two more tests. * Fix test. * Updates from CR. * Remove unthrown exceptions. * Refactor to improve testability and test coverage. * Add isNil tests. * Remove unnecessary "deserialize" methods.	2023-07-24 21:14:27 -07:00
Abhishek Agarwal	efb32810c4	Clean up the core API required for Iceberg extension (#14614 ) Changes: - Replace `AbstractInputSourceBuilder` with `InputSourceFactory` - Move iceberg specific logic to `IcebergInputSource`	2023-07-21 13:01:33 +05:30
Karan Kumar	77e0c16bce	Sql statement api error messaging fixes. (#14629 ) * Error messaging fixes. * Static check fix * Review comments	2023-07-20 22:48:44 +05:30
Gian Merlino	bac5ef347c	Add ingest/input/bytes metric and Kafka consumer metrics. (#14582 ) * Add ingest/input/bytes metric and Kafka consumer metrics. New metrics: 1) ingest/input/bytes. Equivalent to processedBytes in the task reports. 2) kafka/consumer/bytesConsumed: Equivalent to the Kafka consumer metric "bytes-consumed-total". Only emitted for Kafka tasks. 3) kafka/consumer/recordsConsumed: Equivalent to the Kafka consumer metric "records-consumed-total". Only emitted for Kafka tasks. * Fix anchor. * Fix KafkaConsumerMonitor. * Interface updates. * Doc changes. * Update indexing-service/src/main/java/org/apache/druid/indexing/seekablestream/SeekableStreamIndexTask.java Co-authored-by: Benedict Jin <asdf2014@apache.org> --------- Co-authored-by: Benedict Jin <asdf2014@apache.org>	2023-07-20 10:56:22 +08:00
Karan Kumar	ae168c4559	Adding null fix for rows and col stats information. (#14617 ) * Adding null fix for rows and col stats information. * Null handling test case fix	2023-07-19 21:16:05 +05:30
Clint Wylie	913416c669	add equality, null, and range filter (#14542 ) changes: * new filters that preserve match value typing to better handle filtering different column types * sql planner uses new filters by default in sql compatible null handling mode * remove isFilterable from column capabilities * proper handling of array filtering, add array processor to column processors * javadoc for sql test filter functions * range filter support for arrays, tons more tests, fixes * add dimension selector tests for mixed type roots * support json equality * rename semantic index maker thingys to mostly have plural names since they typically make many indexes, e.g. StringValueSetIndex -> StringValueSetIndexes * add cooler equality index maker, ValueIndexes * fix missing string utf8 index supplier * expression array comparator stuff	2023-07-18 12:15:22 -07:00
AmatyaAvadhanula	0412f40d36	Prepare master branch for next release, 28.0.0 (#14595 ) * Prepare master branch for next release, 28.0.0	2023-07-18 09:22:30 +05:30
Atul Mohan	03d6d395a0	Extension to read and ingest iceberg data files (#14329 ) This adds a new contrib extension: druid-iceberg-extensions which can be used to ingest data stored in Apache Iceberg format. It adds a new input source of type iceberg that connects to a catalog and retrieves the data files associated with an iceberg table and provides these data file paths to either an S3 or HDFS input source depending on the warehouse location. Two important dependencies associated with Apache Iceberg tables are: Catalog : This extension supports reading from either a Hive Metastore catalog or a Local file-based catalog. Support for AWS Glue is not available yet. Warehouse : This extension supports reading data files from either HDFS or S3. Adapters for other cloud object locations should be easy to add by extending the AbstractInputSourceAdapter.	2023-07-18 08:59:57 +05:30
Gian Merlino	95ca43034f	Change default handoffConditionTimeout to 15 minutes. (#14539 ) * Change default handoffConditionTimeout to 15 minutes. Most of the time, when handoff is taking this long, it's because something is preventing Historicals from loading new data. In this case, we have two choices: 1) Stop making progress on ingestion, wait for Historicals to load stuff, and keep the waiting-for-handoff segments available on realtime tasks. (handoffConditionTimeout = 0, the current default) 2) Continue making progress on ingestion, by exiting the realtime tasks that were waiting for handoff. Once the Historicals get their act together, the segments will be loaded, as they are still there on deep storage. They will just not be continuously available. (handoffConditionTimeout > 0) I believe most users would prefer [2], because [1] risks ingestion falling behind the stream, which causes many other problems. It can cause data loss if the stream ages-out data before we have a chance to ingest it. Due to the way tuningConfigs are serialized -- defaults are baked into the serialized form that is written to the database -- this default change will not change anyone's existing supervisors. It will take effect for newly created supervisors. * Fix tests. * Update docs/development/extensions-core/kafka-supervisor-reference.md Co-authored-by: Katya Macedo <38017980+ektravel@users.noreply.github.com> * Update docs/development/extensions-core/kinesis-ingestion.md Co-authored-by: Katya Macedo <38017980+ektravel@users.noreply.github.com> --------- Co-authored-by: Katya Macedo <38017980+ektravel@users.noreply.github.com>	2023-07-13 13:17:14 -07:00
Laksh Singla	c1c7dff2ad	Using DruidExceptions in MSQ (changes related to the Broker) (#14534 ) MSQ engine returns correct error codes for invalid user inputs in the query context. Also, using DruidExceptions for MSQ related errors happening in the Broker with improved error messages.	2023-07-13 19:08:49 +00:00
zachjsh	589aac8b31	Make errorCode of InsertTimeOutOfBoundsFault consistent with others (#14495 ) The errorCode of this fault when serialized over the wire was being set to the name of the class `InsertTimeOutOfBoundsFault` instead of the CODE `InsertTimeOutOfBounds`. All other faults' errorCodes are serialized as the respective Fault's code, so making consistent here as well.	2023-07-13 14:34:21 -04:00
Karan Kumar	89aee6caaa	Fixing an issue in sequential merge (#14574 ) * Fixing an issue in sequential merge where workers without any partial key statistics would get stuck because controller did not change the worker state. * Removing empty check * Adding IT for MSQ sequential bug fix.	2023-07-12 22:05:30 +05:30
Gian Merlino	3ff51487b7	Add ZooKeeper connection state alerts and metrics. (#14333 ) * Add ZooKeeper connection state alerts and metrics. - New metric "zk/connected" is an indicator showing 1 when connected, 0 when disconnected. - New metric "zk/disconnected/time" measures time spent disconnected. - New alert when Curator connection state enters LOST or SUSPENDED. * Use right GuardedBy. * Test fixes, coverage. * Adjustment. * Fix tests. * Fix ITs. * Improved injection. * Adjust metric name, add tests.	2023-07-12 09:34:28 -07:00
Laksh Singla	5ce536355e	Fix planning bug while using sort merge frame processor (#14450 ) sqlJoinAlgorithm is now a hint to the planner to execute the join in the specified manner. The planner can decide to ignore the hint if it deduces that the specified algorithm can be detrimental to the performance of the join beforehand.	2023-07-11 09:58:44 +00:00
Pranav	8087aa2b80	Adding the null check in combine and fold in doublesSketch (#14568 )	2023-07-11 14:28:34 +05:30
Adarsh Sanjeev	30a91be15a	Add log statements for tmpStorageBytes in MSQ (#14449 ) * Add log statements for tmpStorageBytes in MSQ * Add log * Update log message	2023-07-11 11:02:12 +05:30
imply-cheddar	66cac08a52	Refactor HllSketchBuildAggregatorFactory (#14544 ) * Refactor HllSketchBuildAggregatorFactory The usage of ColumnProcessors and HllSketchBuildColumnProcessorFactory made it very difficult to figure out what was going on from just looking at the AggregatorFactory or Aggregator code. It also didn't properly double check that you could use UTF8 ahead of time, even though it's entirely possible to validate it before trying to use it. This refactor keeps the general indirection that had been implemented by the Consumer<Supplier<HllSketch>> but centralizes the decision logic and makes it easier to understand the code. * Test fixes * Add test that validates the types are maintained * Add back indirection to avoid buffer calls * Cover floats and doubles are the same thing * Static checks	2023-07-10 09:57:09 -07:00
Gian Merlino	63ee69b4e8	Claim full support for Java 17. (#14384 ) * Claim full support for Java 17. No production code has changed, except the startup scripts. Changes: 1) Allow Java 17 without DRUID_SKIP_JAVA_CHECK. 2) Include the full list of opens and exports on both Java 11 and 17. 3) Document that Java 17 is both supported and preferred. 4) Switch some tests from Java 11 to 17 to get better coverage on the preferred version. * Doc update. * Update errorprone. * Update docker_build_containers.sh. * Update errorprone in licenses.yaml. * Add some more run-javas. * Additional run-javas. * Update errorprone. * Suppress new errorprone error. * Add exports and opens in ForkingTaskRunner for Java 11+. Test, doc changes. * Additional errorprone updates. * Update for errorprone. * Restore old formatting in LdapCredentialsValidator. * Copy bin/ too. * Fix Java 15, 17 build line in docker_build_containers.sh. * Update busybox image. * One more java command. * Fix interpolation. * IT commandline refinements. * Switch to busybox 1.34.1-glibc. * POM adjustments, build and test one IT on 17. * Additional debugging. * Fix silly thing. * Adjust command line. * Add exports and opens one more place. * Additional harmonization of strong encapsulation parameters.	2023-07-07 12:52:35 -07:00
Laksh Singla	9e617373a0	Handle dimensionless group by queries with partitioning	2023-07-07 21:51:47 +05:30
Karan Kumar	afa8c7b8ab	Adding Ability for MSQ to write select results to durable storage. (#14527 ) One of the most requested features in druid is to have an ability to download big result sets. As part of #14416 , we added an ability for MSQ to be queried via a query friendly endpoint. This PR builds upon that work and adds the ability for MSQ to write select results to durable storage. We write the results to the durable storage location <prefix>/results/<queryId> in the druid frame format. This is exposed to users by /v2/sql/statements/:queryId/results.	2023-07-07 20:49:48 +05:30
Jan Werner	95115d722a	CVE fixes - update of multiple dependencies. (#14519 ) Apache Druid brings multiple direct and transitive dependencies that are affected by a plethora of CVEs. This PR attempts to update all the dependencies that did not require code refactoring. This PR modifies pom files, license file and OWASP Dependency Check suppression file.	2023-07-07 20:27:30 +05:30
imply-cheddar	5fc122a144	Add window-focused tests from Drill (#13773 ) This commit borrows some test definitions from Drill's test suite and tries to use them to flesh out the full validation of window function capabilities. In order to be able to run these tests, we also add the ability to run a Scan operation against segments, which also meant an implementation of RowsAndColumns for frames.	2023-07-06 09:20:32 -07:00
Adarsh Sanjeev	27a70d569d	Add page information to SqlStatementResource API (#14512 ) * Changes the get results API in SqlStatementResource to take a page number instead of row/offset. * Adds "pages" containing information on each page to the results status. * Update the "numRows" and "sizeInBytes" to "numTotalRows" and "totalSizeInBytes" respectively, which are totalled across all pages.	2023-07-03 15:20:14 +05:30
Pranav	2d5b27358e	Logging the fieldName in the coerce exceptions (#14483 ) Logging the fieldName in the coerce exceptions	2023-07-03 14:13:27 +05:30
Clint Wylie	277aaa5c57	remove druid.processing.columnCache.sizeBytes and CachingIndexed, combine string column implementations (#14500 ) * combine string column implementations changes: * generic indexed, front-coded, and auto string columns now all share the same column and index supplier implementations * remove CachingIndexed implementation, which I think is largely no longer needed by the switch of many things to directly using ByteBuffer, avoiding the cost of creating Strings * remove ColumnConfig.columnCacheSizeBytes since CachingIndexed was the only user	2023-07-02 19:37:15 -07:00
Gian Merlino	58f3faf299	SortMergeJoinFrameProcessor: Fix two bugs with buffering. (#14196 ) 1) Fix a problem where the fault wasn't reported when the left-hand side had too many buffered frames. (Instead, frames continued to be buffered, eventually running the server out of memory.) 2) Always update the mark when rewinding isn't necessary. It fixes a problem where frames would be needlessly buffered when there isn't a key match across the two sides. 3) Memory reserved for building the trackers now changes based on the heap size	2023-07-02 19:52:52 +05:30
Gian Merlino	048dbcee88	MSQ: Improve InsertTimeOutOfBounds error message. (#14511 ) Nicer and actionable error message for `InsertTimeOutOfBounds` fault	2023-07-02 01:44:19 +05:30
Gian Merlino	67fbd8e7fc	Add "stringEncoding" parameter to DataSketches HLL. (#11201 ) * Add "stringEncoding" parameter to DataSketches HLL. Builds on the concept from #11172 and adds a way to feed HLL sketches with UTF-8 bytes. This must be an option rather than always-on, because prior to this patch, HLL sketches used UTF-16LE encoding when hashing strings. To remain compatible with sketch images created prior to this patch -- which matters during rolling updates and when reading sketches that have been written to segments -- we must keep UTF-16LE as the default. Not currently documented, because I'm not yet sure how best to expose this functionality to users. I think the first place would be in the SQL layer: we could have it automatically select UTF-8 or UTF-16LE when building sketches at query time. We need to be careful about this, though, because UTF-8 isn't always faster. Sometimes, like for the results of expressions, UTF-16LE is faster. I expect we will sort this out in future patches. * Fix benchmark. * Fix style issues, improve test coverage. * Put round back, to make IT updates easier. * Fix test. * Fix issue with filtered aggregators and add test. * Use DS native update(ByteBuffer) method. Improve test coverage. * Add another suppression. * Fix ITAutoCompactionTest. * Update benchmarks. * Updates. * Fix conflict. * Adjustments.	2023-06-30 12:45:55 -07:00
Gian Merlino	a6cabbe10f	SQL: Avoid "intervals" for non-table-based datasources. (#14336 ) In these other cases, stick to plain "filter". This simplifies lots of logic downstream, and doesn't hurt since we don't have intervals-specific optimizations outside of tables. Fixes an issue where we couldn't properly filter on a column from an external datasource if it was named __time.	2023-06-29 09:57:11 +05:30
Gian Merlino	c798d3fb2e	Fix flaky SqlStatementResourceTest. (#14498 ) Mocks generally have state and should not be static. In particular, the "Yielder" included in one of the mocks can only be iterated once, which made the test suite order-dependent.	2023-06-29 05:42:44 +05:30
Jonathan Wei	c36f12f1d8	Support complex variance object inputs for variance SQL agg function (#14463 ) * Support complex variance object inputs for variance SQL agg function * Add test * Include complexTypeChecker, address PR comments * Checkstyle, javadoc link	2023-06-28 13:14:19 -05:00

1268 Commits