druid

Commit Graph

Author	SHA1	Message	Date
Laksh Singla	5ce536355e	Fix planning bug while using sort merge frame processor (#14450 ) sqlJoinAlgorithm is now a hint to the planner to execute the join in the specified manner. The planner can decide to ignore the hint if it deduces that the specified algorithm can be detrimental to the performance of the join beforehand.	2023-07-11 09:58:44 +00:00
Pranav	8087aa2b80	Adding the null check in combine and fold in doublesSketch (#14568 )	2023-07-11 14:28:34 +05:30
Adarsh Sanjeev	30a91be15a	Add log statements for tmpStorageBytes in MSQ (#14449 ) * Add log statements for tmpStorageBytes in MSQ * Add log * Update log message	2023-07-11 11:02:12 +05:30
imply-cheddar	66cac08a52	Refactor HllSketchBuildAggregatorFactory (#14544 ) * Refactor HllSketchBuildAggregatorFactory The usage of ColumnProcessors and HllSketchBuildColumnProcessorFactory made it very difficult to figure out what was going on from just looking at the AggregatorFactory or Aggregator code. It also didn't properly double check that you could use UTF8 ahead of time, even though it's entirely possible to validate it before trying to use it. This refactor makes keeps the general indirection that had been implemented by the Consumer<Supplier<HllSketch>> but centralizes the decision logic and makes it easier to understand the code. * Test fixes * Add test that validates the types are maintained * Add back indirection to avoid buffer calls * Cover floats and doubles are the same thing * Static checks	2023-07-10 09:57:09 -07:00
Gian Merlino	63ee69b4e8	Claim full support for Java 17. (#14384 ) * Claim full support for Java 17. No production code has changed, except the startup scripts. Changes: 1) Allow Java 17 without DRUID_SKIP_JAVA_CHECK. 2) Include the full list of opens and exports on both Java 11 and 17. 3) Document that Java 17 is both supported and preferred. 4) Switch some tests from Java 11 to 17 to get better coverage on the preferred version. * Doc update. * Update errorprone. * Update docker_build_containers.sh. * Update errorprone in licenses.yaml. * Add some more run-javas. * Additional run-javas. * Update errorprone. * Suppress new errorprone error. * Add exports and opens in ForkingTaskRunner for Java 11+. Test, doc changes. * Additional errorprone updates. * Update for errorprone. * Restore old fomatting in LdapCredentialsValidator. * Copy bin/ too. * Fix Java 15, 17 build line in docker_build_containers.sh. * Update busybox image. * One more java command. * Fix interpolation. * IT commandline refinements. * Switch to busybox 1.34.1-glibc. * POM adjustments, build and test one IT on 17. * Additional debugging. * Fix silly thing. * Adjust command line. * Add exports and opens one more place. * Additional harmonization of strong encapsulation parameters.	2023-07-07 12:52:35 -07:00
Laksh Singla	9e617373a0	Handle dimensionless group by queries with partitioning	2023-07-07 21:51:47 +05:30
Karan Kumar	afa8c7b8ab	Adding Ability for MSQ to write select results to durable storage. (#14527 ) One of the most requested features in druid is to have an ability to download big result sets. As part of #14416 , we added an ability for MSQ to be queried via a query friendly endpoint. This PR builds upon that work and adds the ability for MSQ to write select results to durable storage. We write the results to the durable storage location <prefix>/results/<queryId> in the druid frame format. This is exposed to users by /v2/sql/statements/:queryId/results.	2023-07-07 20:49:48 +05:30
Jan Werner	95115d722a	CVE fixes - update of multiple dependencies. (#14519 ) Apache Druid brings multiple direct and transitive dependencies that are affected by plethora of CVEs. This PR attempts to update all the dependencies that did not require code refactoring. This PR modifies pom files, license file and OWASP Dependency Check suppression file.	2023-07-07 20:27:30 +05:30
imply-cheddar	5fc122a144	Add window-focused tests from Drill (#13773 ) This commit borrows some test definitions from Drill's test suite and tries to use them to flesh out the full validation of window function capbilities. In order to be able to run these tests, we also add the ability to run a Scan operation against segments, which also meant an implementation of RowsAndColumns for frames.	2023-07-06 09:20:32 -07:00
Adarsh Sanjeev	27a70d569d	Add page information to SqlStatementResource API (#14512 ) * Changes the get results API in SqlStatementResource to take a page number instead of row/offset. * Adds "pages" containing information on each page to the results status. * Update the "numRows" and "sizeInByes" to "numTotalRows" and "totalSizeInBytes" respectively, which are totalled across all pages.	2023-07-03 15:20:14 +05:30
Pranav	2d5b27358e	Logging the fieldName in the coerce exceptions (#14483 ) Logging the fieldName in the coerce exceptions	2023-07-03 14:13:27 +05:30
Clint Wylie	277aaa5c57	remove druid.processing.columnCache.sizeBytes and CachingIndexed, combine string column implementations (#14500 ) * combine string column implementations changes: * generic indexed, front-coded, and auto string columns now all share the same column and index supplier implementations * remove CachingIndexed implementation, which I think is largely no longer needed by the switch of many things to directly using ByteBuffer, avoiding the cost of creating Strings * remove ColumnConfig.columnCacheSizeBytes since CachingIndexed was the only user	2023-07-02 19:37:15 -07:00
Gian Merlino	58f3faf299	SortMergeJoinFrameProcessor: Fix two bugs with buffering. (#14196 ) 1) Fix a problem where the fault wasn't reported when the left-hand side had too many buffered frames. (Instead, frames continued to be buffered, eventually running the server out of memory.) 2) Always update the mark when rewinding isn't necessary. It fixes a problem where frames would be needlessly buffered when there isn't a key match across the two sides. 3) Memory reserved for building the trackers now change based on the heap sized	2023-07-02 19:52:52 +05:30
Gian Merlino	048dbcee88	MSQ: Improve InsertTimeOutOfBounds error message. (#14511 ) Nicer and actionable error message for `InsertTimeOutOfBounds` fault	2023-07-02 01:44:19 +05:30
Gian Merlino	67fbd8e7fc	Add "stringEncoding" parameter to DataSketches HLL. (#11201 ) * Add "stringEncoding" parameter to DataSketches HLL. Builds on the concept from #11172 and adds a way to feed HLL sketches with UTF-8 bytes. This must be an option rather than always-on, because prior to this patch, HLL sketches used UTF-16LE encoding when hashing strings. To remain compatible with sketch images created prior to this patch -- which matters during rolling updates and when reading sketches that have been written to segments -- we must keep UTF-16LE as the default. Not currently documented, because I'm not yet sure how best to expose this functionality to users. I think the first place would be in the SQL layer: we could have it automatically select UTF-8 or UTF-16LE when building sketches at query time. We need to be careful about this, though, because UTF-8 isn't always faster. Sometimes, like for the results of expressions, UTF-16LE is faster. I expect we will sort this out in future patches. * Fix benchmark. * Fix style issues, improve test coverage. * Put round back, to make IT updates easier. * Fix test. * Fix issue with filtered aggregators and add test. * Use DS native update(ByteBuffer) method. Improve test coverage. * Add another suppression. * Fix ITAutoCompactionTest. * Update benchmarks. * Updates. * Fix conflict. * Adjustments.	2023-06-30 12:45:55 -07:00
Gian Merlino	a6cabbe10f	SQL: Avoid "intervals" for non-table-based datasources. (#14336 ) In these other cases, stick to plain "filter". This simplifies lots of logic downstream, and doesn't hurt since we don't have intervals-specific optimizations outside of tables. Fixes an issue where we couldn't properly filter on a column from an external datasource if it was named __time.	2023-06-29 09:57:11 +05:30
Gian Merlino	c798d3fb2e	Fix flaky SqlStatementResourceTest. (#14498 ) Mocks generally have state and should not be static. In particular, the "Yielder" included in one of the mocks can only be iterated once, which made the test suite order-dependent.	2023-06-29 05:42:44 +05:30
Jonathan Wei	c36f12f1d8	Support complex variance object inputs for variance SQL agg function (#14463 ) * Support complex variance object inputs for variance SQL agg function * Add test * Include complexTypeChecker, address PR comments * Checkstyle, javadoc link	2023-06-28 13:14:19 -05:00
Adarsh Sanjeev	233233c92d	Add query context parameter to control limiting select rows (#14476 ) * Add query context parameter to control limiting select rows * Add unit tests * Address review comments * Address review comments * Address review comments	2023-06-28 17:54:24 +05:30
Karan Kumar	cb3a9d2b57	Adding Interactive API's for MSQ engine (#14416 ) This PR aims to expose a new API called "@path("/druid/v2/sql/statements/")" which takes the same payload as the current "/druid/v2/sql" endpoint and allows users to fetch results in an async manner.	2023-06-28 17:51:58 +05:30
Adarsh Sanjeev	0335aaa279	Add query results directory and prevent the auto cleaner from cleaning it (#14446 ) Adds support for automatic cleaning of a "query-results" directory in durable storage. This directory will be cleaned up only if the task id is not known to the overlord. This will allow the storage of query results after the task has finished running.	2023-06-28 10:14:04 +05:30
Laksh Singla	f546cd64a9	MSQ: Ensure that the allocated segment aligns with the requested granularity (#14475 ) Changes: - Throw an `InsertCannotAllocateSegmentFault` if the allocated segment is not aligned with the requested granularity. - Tests to verify new behaviour	2023-06-27 09:25:32 +05:30
Gian Merlino	8211379de6	MSQ: Change default clusterStatisticsMergeMode to SEQUENTIAL. (#14310 ) * MSQ: Change default clusterStatisticsMergeMode to SEQUENTIAL. This is an undocumented parameter that controls how cluster-by statistics are merged. In PARALLEL mode, statistics are gathered from workers all at once. In SEQUENTIAL mode, statistics are gathered time chunk by time chunk. This improves accuracy for jobs with many time chunks, and reduces memory usage. The main downside of SEQUENTIAL is that it can take longer, but in most situations I've seen, PARALLEL is only really usable in cases where the sketches are small enough that SEQUENTIAL would also run relatively quickly. So it seems like SEQUENTIAL is a better default. * Switch off-test from SEQUENTIAL to PARALLEL. * Fix sequential merge for situations where there are no time chunks at all. * Add a couple more tests.	2023-06-26 10:54:28 -07:00
Laksh Singla	114380749d	MSQ: Improve the parse exception errors and the handling of null UTF characters in Strings in Frames (#14398 )	2023-06-26 18:14:29 +05:30
Laksh Singla	1647d5f4a0	Limit the subquery results by memory usage (#13952 ) Users can now add a guardrail to prevent subquery’s results from exceeding the set number of bytes by setting druid.server.http.maxSubqueryRows in Broker's config or maxSubqueryRows in the query context. This feature is experimental for now and would default back to row-based limiting in case it fails to get the accurate size of the results consumed by the query.	2023-06-26 18:12:28 +05:30
Tejaswini Bandlamudi	72cf91fbc0	Upgrade Avro to latest version (#14440 ) Upgraded Avro to 1.11.1	2023-06-24 14:51:30 +05:30
Gian Merlino	3d19b748fb	SQL OperatorConversions: Introduce.aggregatorBuilder, allow CAST-as-literal. (#14249 ) * SQL OperatorConversions: Introduce.aggregatorBuilder, allow CAST-as-literal. Four main changes: 1) Provide aggregatorBuilder, a more consistent way of defining the SqlAggFunction we need for all of our SQL aggregators. The mechanism is analogous to the one we already use for SQL functions (OperatorConversions.operatorBuilder). 2) Allow CASTs of constants to be considered as "literalOperands". This fixes an issue where various of our operators are defined with OperandTypes.LITERAL as part of their checkers, which doesn't allow casts. However, in these cases we generally _do_ want to allow casts. The important piece is that the value must be reducible to a constant, not that the SQL text is literally a literal. 3) Update DataSketches SQL aggregators to use the new aggregatorBuilder functionality. The main user-visible effect here is [2]: the aggregators would now accept, for example, "CAST(0.99 AS DOUBLE)" as a literal argument. Other aggregators could be updated in a future patch. 4) Rename "requiredOperands" to "requiredOperandCount", because the old name was confusing. (It rhymes with "literalOperands" but the arguments mean different things.) * Adjust method calls.	2023-06-23 16:25:04 -07:00
Gian Merlino	ddd0fc1b85	S3: Attach SSE key to doesObjectExist calls. (#14290 ) * S3: Attach SSE key to doesObjectExist calls. We did not previously attach the SSE key to the doesObjectExist request, leading to an inconsistency that may cause problems on "S3-compatible" implementations. This patch implements doesObjectExist using similar logic to the S3 client itself, but calls our implementation of getObjectMetadata rather than the S3 client's, ensuring the request is decorated with the SSE key. * Fix tests.	2023-06-23 15:23:59 -07:00
imply-cheddar	7e2cf35d7b	Fix compatibility issue with SqlTaskResource (#14466 ) * Fix compatibility issue with SqlTaskResource The DruidException changes broke the response format for errors coming back from the SqlTaskResource, so fix those	2023-06-23 01:15:32 -07:00
imply-cheddar	cfd07a95b7	Errors take 3 (#14004 ) Introduce DruidException, an exception whose goal in life is to be delivered to a user. DruidException itself has javadoc on it to describe how it should be used. This commit both introduces the Exception and adjusts some of the places that are generating exceptions to generate DruidException objects instead, as a way to show how the Exception should be used. This work was a 3rd iteration on top of work that was started by Paul Rogers. I don't know if his name will survive the squash-and-merge, so I'm calling it out here and thanking him for starting on this.	2023-06-19 01:11:13 -07:00
Abhishek Radhakrishnan	04fb75719e	Fail query planning if a `CLUSTERED BY` column contains descending order (#14436 ) * Throw ValidationException if CLUSTERED BY column descending order is specified. - Fails query planning * Some more tests. * fixup existing comment * Update comment * checkstyle fix: remove unused imports * Remove InsertCannotOrderByDescendingFault and deprecate the fault in readme. * move deprecated field to the bottom	2023-06-16 18:10:12 -04:00
Gian Merlino	85656a467c	MSQ: Load broadcast tables on workers. (#14437 ) They were not previously loaded because supportsQueries was false. This patch sets supportsQueries to true, and clarifies in Task javadocs that supportsQueries can be true for tasks that aren't directly queryable over HTTP.	2023-06-16 12:02:20 +05:30
Laksh Singla	4935f2470a	Limit results generated by SELECT queries in MSQ (#14370 ) * Limit select results in MSQ * reduce number of files in test * add truncated flag * avoid materializing select results to list, use iterable instead * javadocs	2023-06-15 13:13:11 +05:30
Clint Wylie	ff5ae4db6c	fix kafka input format reader schema discovery and partial schema discovery (#14421 ) * fix kafka input format reader schema discovery and partial schema discovery to actually work right, by re-using dimension filtering logic of MapInputRowParser	2023-06-15 00:11:04 -07:00
Pranav	5314db9f85	Adding the file mapper to handle v2 buffer deserialization (#14429 )	2023-06-14 19:41:44 -07:00
Clint Wylie	61120dc49a	fix Kafka input format to throw ParseException if timestamp is missing (#14413 )	2023-06-13 09:00:11 -07:00
Adarsh Sanjeev	267cbac6ff	Add logs for deleting files using storage connector (#14350 ) * Add logs for deleting files using storage connector * Address review comments * Update log message format	2023-06-11 21:24:30 +05:30
Kashif Faraz	6e158704cb	Do not retry INSERT task into metadata if max_allowed_packet limit is violated (#14271 ) Changes - Add a `DruidException` which contains a user-facing error message, HTTP response code - Make `EntryExistsException` extend `DruidException` - If metadata store max_allowed_packet limit is violated while inserting a new task, throw `DruidException` with response code 400 (bad request) to prevent retries - Add `SQLMetadataConnector.isRootCausePacketTooBigException` with impl for MySQL	2023-06-10 12:15:44 +05:30
Atul Mohan	6a4cbab4b8	Upgrade parquet-mr version (#14070 ) * Upgrade parquet version * Move parquet version to hadoop3 * Fix license * Exclude audience annotations	2023-06-07 08:54:54 -07:00
Soumyava	01b22ca022	Hll Sketch and Theta sketch estimate can now be used as an expression (#14312 ) * Hll Sketch estimate can now be used as an expression * Theta sketch estimate now can be used as an expression	2023-06-06 20:14:25 -07:00
Abhishek Radhakrishnan	2d258a95ad	Fix `EARLIEST_BY`/`LATEST_BY` signature and include function name in signature. (#14352 ) * Fix EarliestLatestBySqlAggregator signature; Include function name for all signatures. * Single quote function signatures, space between args and remove \n. * fixup UT assertion	2023-06-06 09:41:05 -07:00
Laksh Singla	5da601c47e	fix npe (#14369 )	2023-06-06 17:01:42 +05:30
Gian Merlino	a0d49baad6	MSQ: Fix issue with rollup ingestion and aggregators with multiple names. (#14367 ) The same aggregator can have two output names for a SQL like: INSERT INTO foo SELECT x, COUNT() AS y, COUNT() AS z FROM t GROUP BY 1 PARTITIONED BY ALL In this case, the SQL planner will create a query with a single "count" aggregator mapped to output names "y" and "z". The prior MSQ code did not properly handle this case, instead throwing an error like: Expected single output for query column[a0] but got [[1, 2]]	2023-06-06 10:28:41 +05:30
zachjsh	04a82da63d	Input source security fixes (#14266 ) It was found that several supported tasks / input sources did not have implementations for the methods used by the input source security feature, causing these tasks and input sources to fail when used with this feature. This pr adds the needed missing implementations. Also securing the sampling endpoint with input source security, when enabled.	2023-06-01 16:37:19 -07:00
zachjsh	e75fb8e8e3	Account for data format and compression in MSQ auto taskAssignment (#14307 ) ### Description This change allows for consideration of the input format and compression when computing how to split the input files among available tasks, in MSQ ingestion, when considering the value of the `maxInputBytesPerWorker` query context parameter. This query parameter allows users to control the maximum number of bytes, with granularity of input file / object, that ingestion tasks will be assigned to ingest. With this change, this context parameter now denotes the estimated weighted size in bytes of the input to split on, with consideration for input format and compression format, rather than the actual file size, reported by the file system. We assume uncompressed newline delimited json as a baseline, with scaling factor of `1`. This means that when computing the byte weight that a file has towards the input splitting, we take the file size as is, if uncompressed json, 1:1. It was found during testing that gzip compressed json, and parquet, has scale factors of `4` and `8` respectively, meaning that each byte of data is weighted 4x and 8x respectively, when computing input splits. This weighted byte scaling is only considered for MSQ ingestion that uses either LocalInputSource or CloudObjectInputSource at the moment. The default value of the `maxInputBytesPerWorker` query context parameter has been updated from 10 GiB, to 512 MiB	2023-06-01 12:53:49 -07:00
panhongan	c244c3de53	fix hdfs initialization issue (#14276 ) * fix hdfs initialization issue * add PR * remove conf settings * Improve comments * move hdfs storage validation to start handler * restore exception	2023-05-30 12:41:54 -07:00
Alexander Saydakov	4131c0df13	use the latest datasketches-java-4.0.0 (#14334 ) * use the latest datasketches-java-4.0.0 * updated versions of datasketches * adjusted expectation * fixed the expectations --------- Co-authored-by: AlexanderSaydakov <AlexanderSaydakov@users.noreply.github.com>	2023-05-27 22:19:18 -07:00
Karan Kumar	8d256e35b4	MSQ ignores tombstone segments for downloads. (#14342 )	2023-05-27 14:21:52 +05:30
Abhishek Radhakrishnan	a5e04d95a4	Add `TYPE_NAME` to the complex serde classes and replace the hardcoded names. (#14317 ) * Add TYPE_NAME to the serde classes and reuse them instead of hardcoded strings. * Static check fixes.	2023-05-23 00:54:47 -05:00
Adarsh Sanjeev	e8ef31fe92	Fix condition for timeout in worker task launcher (#14270 ) * Fix condition for timeout in worker task launcher	2023-05-16 08:30:00 +05:30
Adarsh Sanjeev	10bce22e68	Configure maxBytesPerWorker directly instead of using StageDefinition (#14257 ) * Configure maxBytesPerWorker directly instead of using StageDefinition	2023-05-15 16:51:57 +05:30
imply-cheddar	f9861808bc	Be able to load segments on Peons (#14239 ) * Be able to load segments on Peons This change introduces a new config on WorkerConfig that indicates how many bytes of each storage location to use for storage of a task. Said config is divided up amongst the locations and slots and then used to set TaskConfig.tmpStorageBytesPerTask The Peons use their local task dir and tmpStorageBytesPerTask as their StorageLocations for the SegmentManager such that they can accept broadcast segments.	2023-05-12 16:51:00 -07:00
Kashif Faraz	ba11b3d462	Refactor: Add OverlordDuty to replace OverlordHelper and align with CoordinatorDuty (#14235 ) Changes: - Replace `OverlordHelper` with `OverlordDuty` to align with `CoordinatorDuty` - Each duty has a `run()` method and defines a `Schedule` with an initial delay and period. - Update existing duties `TaskLogAutoCleaner` and `DurableStorageCleaner` - Add utility class `Configs` - Update log, error messages and javadocs - Other minor style improvements	2023-05-12 22:39:56 +05:30
Clint Wylie	625c4745b1	add context flag "useAutoColumnSchemas" to use new auto types for MSQ segment generation (#14175 )	2023-05-10 15:37:14 -07:00
Adarsh Sanjeev	fb38085ddb	Add wait for worker shutdown to MSQ task cancel (#14198 ) * Add wait for worker shutdown to MSQ task cancel * Fix checkstyle	2023-05-05 16:29:59 -07:00
Abhishek Radhakrishnan	46dabab36d	Fix NPE in test parse exception report. Add more tests with different thresholds. (#14209 )	2023-05-05 10:05:41 -07:00
zachjsh	48cde236c4	Add columnMappings to explain plan output (#14187 ) * Add columnMappings to explain plan output * * fix checkstyle * add tests * * improve test coverage * * temporarily remove unit-test need to run ITs * * depend on build * * temporarily lower unit test threshold * * add back dependency on unit-tests * * add license headers * * fix header order * * review comments * * fix intellij inspection errors * * revert code coverage change	2023-05-04 10:36:28 -07:00
Abhishek Radhakrishnan	68f908e511	Fix uncaught `ParseException` when reading Avro from Kafka (#14183 ) In StreamChunkParser#parseWithInputFormat, we call byteEntityReader.read() without handling a potential ParseException, which is thrown during this function call by the delegate AvroStreamReader#intermediateRowIterator. A ParseException can be thrown if an Avro stream has corrupt data or data that doesn't conform to the schema specified or for other decoding reasons. This exception if uncaught, can cause ingestion to fail.	2023-05-04 12:35:36 +05:30
Abhishek Radhakrishnan	954f3917ef	Add check for required avroBytesDecoder property that otherwise causes NPE. (#14177 )	2023-05-03 09:53:58 -07:00
Karan Kumar	6f0cdd0c3f	`TaskStartTimeoutFault` now depends on the last successful worker launch time. (#14172 ) * `TaskStartTimeoutFault` now depends on the last successful worker launch time.	2023-05-03 00:05:15 +05:30
Laksh Singla	387e682fbc	Fix memory calculations for WorkerMemoryParameters for machines with relatively less heap space (#14117 ) * update worker memory parameters	2023-05-02 09:24:56 +05:30
Karan Kumar	078d5ac590	Preference to first worker error in-case job fails with `TooManyAttemptsForWorker` (#14170 )	2023-05-01 14:47:11 +05:30
Clint Wylie	90ea192d9c	fix bugs with auto encoded long vector deserializers (#14186 ) This PR fixes an issue when using 'auto' encoded LONG typed columns and the 'vectorized' query engine. These columns use a delta based bit-packing mechanism, and errors in the vectorized reader would cause it to incorrectly read column values for some bit sizes (1 through 32 bits). This is a regression caused by #11004, which added the optimized readers to improve performance, so impacts Druid versions 0.22.0+. While writing the test I finally got sad enough about IndexSpec not having a "builder", so I made one, and switched all the things to use it. Apologies for the noise in this bug fix PR, the only real changes are in VSizeLongSerde, and the tests that have been modified to cover the buggy behavior, VSizeLongSerdeTest and ExpressionVectorSelectorsTest. Everything else is just cleanup of IndexSpec usage.	2023-05-01 11:49:27 +05:30
Suneet Saldanha	84c11df980	Make LoggingEmitter more useful by using Markers (#14121 ) * Make LoggingEmitter more useful * Skip code coverage for facade classes * fix spellcheck * code review * fix dependency * logging.md * fix checkstyle * Add back jacoco version to main pom	2023-04-27 15:06:06 -07:00
Adarsh Sanjeev	63268a5023	Relaunch track of failed workers without work orders (#14166 ) * If a worker dies after it has finished generating results, MSQ decides to not retry it as it has no active work orders. However, since we don't keep track of it further, if it is required for a future stage, the controller hangs waiting for the worker to be ready. This PR keeps tracks of any workers the controller decides to not restart immediately and while starting workers for the next stage, queues these workers for retry.	2023-04-27 19:38:05 +05:30
Adarsh Sanjeev	5aa119dfda	Add retry to opening retrying stream (#14126 ) * Add retry to opening retrying stream * Add retry to S3Entity for network issues * Fix tests and clean up code	2023-04-27 16:52:22 +05:30
TSFenwick	6c99fbea92	fix typo in s3 docs. add readme to s3 module. (#14135 ) * fix typo in s3 docs. add readme to s3 module. * Update extensions-core/s3-extensions/README.md Co-authored-by: 317brian <53799971+317brian@users.noreply.github.com> * cleanup readme for s3 extension and link to repo markdown doc instead of web docs --------- Co-authored-by: 317brian <53799971+317brian@users.noreply.github.com>	2023-04-26 14:03:11 -07:00
robo220	5db7396c78	fix(avro-json-path-expressions): allow more complex jsonpath expressions (#14149 )	2023-04-26 14:58:11 +05:30
Tejaswini Bandlamudi	774073b2e7	Update Hadoop3 as default build version (#14005 ) Hadoop 2 often causes red security scans on Druid distribution because of the dependencies it brings. We want to move away from Hadoop 2 and provide Hadoop 3 distribution available. Switch druid to building with Hadoop 3 by default. Druid will still be compatible with Hadoop 2 and users can build hadoop-2 compatible distribution using hadoop2 profile.	2023-04-26 12:52:51 +05:30
Gian Merlino	752475b799	Fix two concurrency issues with segment fetching. (#14042 ) * Fix two concurrency issues with segment fetching. 1) SegmentLocalCacheManager: Fix a concurrency issue where certain directory cleanup happened outside of directoryWriteRemoveLock. This created the possibility that segments would be deleted by one thread, while being actively downloaded by another thread. 2) TaskDataSegmentProcessor (MSQ): Fix a concurrency issue when two stages in the same process both use the same segment. For example: a self-join using distributed sort-merge. Prior to this change, the two stages could delete each others' segments. 3) ReferenceCountingResourceHolder: increment() returns a new ResourceHolder, rather than a Releaser. This allows it to be passed to callers without them having to hold on to both the original ResourceHolder and a Releaser. 4) Simplify various interfaces and implementations by using ResourceHolder instead of Pair and instead of split-up fields. * Add test. * Fix style. * Remove Releaser. * Updates from master. * Add some GuardedBys. * Use the correct GuardedBy. * Adjustments.	2023-04-25 20:49:27 -07:00
Gian Merlino	89e7948159	MSQ: Subclass CalciteJoinQueryTest, other supporting changes. (#14105 ) * MSQ: Subclass CalciteJoinQueryTest, other supporting changes. The main change is the new tests: we now subclass CalciteJoinQueryTest in CalciteSelectJoinQueryMSQTest twice, once for Broadcast and once for SortMerge. Two supporting production changes for default-value mode: 1) InputNumberDataSource is marked as concrete, to allow leftFilter to be pushed down to it. 2) In default-value mode, numeric frame field readers can now return nulls. This is necessary when stacking joins on top of joins: nulls must be preserved for semantics that match broadcast joins and native queries. 3) In default-value mode, StringFieldReader.isNull returns true on empty strings in addition to nulls. This is more consistent with the behavior of the selectors, which map empty strings to null as well in that mode. As an effect of change (2), the InsertTimeNull change from #14020 (to replace null timestamps with default timestamps) is reverted. IMO, this is fine, as either behavior is defensible, and the change from #14020 hasn't been released yet. * Adjust tests. * Style fix. * Additional tests.	2023-04-25 12:10:23 -07:00
Adarsh Sanjeev	cade80b582	Change time column name when reading from external sources in MSQ (#14148 ) When ingesting from an external source which already contains a column "__time", currently, the value is dropped. Changing the time column name in the external input slice reader resolves this.	2023-04-25 11:13:59 +05:30
Nicholas Lippis	9d4cc501f7	return task status reported by peon (#14040 ) * return task status reported by peon * Write TaskStatus to file in AbstractTask.cleanUp * Get TaskStatus from task log * Fix merge conflicts in AbstractTaskTest * Add unit tests for TaskLogPusher, TaskLogStreamer, NoopTaskLogs to satisfy code coverage * Add license headerss * Fix style * Remove unknown exception declarations	2023-04-24 12:05:39 -07:00
Parag Jain	e8674e2a60	fix npe with gs uri having underscores (#14107 ) * fix npe with gs uri having underscores * compile fix	2023-04-19 11:26:18 +05:30
zachjsh	04da0102cb	KillTask should return empty inputSource resources (#14106 ) ### Description This pr fixes a few bugs found with the inputSource security feature. 1. `KillUnusedSegmentsTask` previously had no definition for the `getInputSourceResources`, which caused an unsupportedOperationException to be thrown when this task type was submitted with the inputSource security feature enabled. This task type should not require any input source specific resources, so returning an empty set for this task type now. 2. Fixed a bug where when the input source type security feature is enabled, all of the input source type specific resources used where authenticated against: `{"resource": {"name": "EXTERNAL", "type": "{INPUT_SOURCE_TYPE}"}, "action": "READ"}` When they should be instead authenticated against: `{"resource": {"name": "{INPUT_SOURCE_TYPE}", "type": "EXTERNAL"}, "action": "READ"}` 3. fixed bug where supervisor tasks were not authenticated against the specific input source types used, if input source security feature was enabled.	2023-04-18 15:27:16 -04:00
Adarsh Sanjeev	a7d5c64aeb	Move MSQ temporary storage to a runtime parameter instead of being configured from query context (#14061 ) * Adds new run time parameter druid.indexer.task.tmpStorageBytesPerTask. This sets a limit for the amount of temporary storage disk space used by tasks. This limit is currently only respected by MSQ tasks. * Removes query context parameters intermediateSuperSorterStorageMaxLocalBytes and composedIntermediateSuperSorterStorageEnabled. Composed intermediate super sorter (which was enabled by composedIntermediateSuperSorterStorageEnabled) is now enabled automatically if durableShuffleStorage is set to true. intermediateSuperSorterStorageMaxLocalBytes is calculated from the limit set by the run time parameter druid.indexer.task.tmpStorageBytesPerTask.	2023-04-18 16:56:51 +05:30
Laksh Singla	8eb854c845	Remove maxResultsSize config property from S3OutputConfig (#14101 ) * "maxResultsSize" has been removed from the S3OutputConfig and a default "chunkSize" of 100MiB is now present. This change primarily affects users who wish to use durable storage for MSQ jobs.	2023-04-18 14:25:20 +05:30
Karan Kumar	be6745f75b	Adding more logs for sequential merge. (#14097 )	2023-04-17 18:01:24 +05:30
Gian Merlino	eb797512a0	Fix MSQSelectTest. (#14099 ) A logical conflict between #14046 and #14048 caused testJoinWithLookup to fail. This patch fixes it.	2023-04-17 01:15:38 +05:30
Gian Merlino	eeed5ed7e2	MSQ: Use the same result coercion routines as the regular SQL endpoint. (#14046 ) * MSQ: Use the same result coercion routines as the regular SQL endpoint. The main changes are to move NativeQueryMaker.coerce to SqlResults, and to formally make the list of sqlTypeNames from the MSQ results reports use SqlTypeNames. - Change the default to MSQ-compatible rather than MSQ-incompatible. The explicit marker function is now "notMsqCompatible()".	2023-04-15 06:56:23 +05:30
Gian Merlino	0884a22c41	MSQ: Support for querying lookup and inline data directly. (#14048 ) * MSQ: Support for querying lookup and inline data directly. Main changes: 1) Add of LookupInputSpec and DataSourcePlan.forLookup. 2) Add InlineInputSpec, and modify of DataSourcePlan.forInline to use this instead of an ExternalInputSpec with JSON. This allows the inline data to act as the right-hand side of a join, if needed. Supporting changes: 1) Modify JoinDataSource's leftFilter validation to be a little less strict: it's now OK with leftFilter being attached to any concrete leaf (no children) datasource, rather than requiring it be a table. This allows MSQ to create JoinDataSource with InputNumberDataSource as the base. 2) Add SegmentWranglerModule to CliIndexer, CliPeon. This allows them to query lookups and inline data directly. * Updates based on CI. * Additional tests. * Style fix. * Remove unused import.	2023-04-14 14:04:02 -07:00
Karan Kumar	bdc5477094	Adding missed s3 retry handling in storage connector. (#14086 )	2023-04-14 17:21:39 +05:30
imply-cheddar	d2f82f8dd6	Make GCP initialization truly lazy (#14077 ) The GCP initialization pulls credentials for talking to GCP. We want that to only happen when fully required and thus want the GCP-related objects lazily instantiated.	2023-04-12 23:10:50 -07:00
Gian Merlino	81074411a9	MSQ: Support multiple result columns with the same name. (#14025 ) * MSQ: Support multiple result columns with the same name. This is allowed in SQL, and is supported by the regular SQL endpoint. We retain a validation that INSERT ... SELECT does not allow multiple columns with the same name, because column names in segments must be unique.	2023-04-13 11:09:39 +05:30
zachjsh	89bdbdc3ed	Input source security feature should work for MSQ tasks (#14056 ) ### Description Previously msq controller and worker tasks did not have implementations for the `getInputSourceResources()` method. This causes the submission of these tasks to fail if the following auth config is enabled: `druid.auth.enableInputSourceSecurity=true` Added implementations of this method for these tasks that return an empty set of input sources. This means that for these task types, if `druid.auth.enableInputSourceSecurity=true` config is used, the input source types will be properly computed and authorized in the SQL layer, but not if the equivalent controller / worker tasks are submitted to the task endpoint.	2023-04-11 11:36:15 -04:00
zachjsh	2e87b5a901	Input source security sql layer can handle input source with multiple types (#14050 ) ### Description This change allows for input sources used during MSQ ingestion to be authorized for multiple input source types, instead of just 1. Such an input source that allows for multiple types is the CombiningInputSource. Also fixed bug that caused some input source specific functions to be authorized against the permissions ` [ new ResourceAction(new Resource(ResourceType.EXTERNAL, ResourceType.EXTERNAL), Action.READ), new ResourceAction(new Resource(ResourceType.EXTERNAL, {input_source_type}), Action.READ) ] ` when the inputSource based authorization feature is enabled, when it should instead be authorized against ` [ new ResourceAction(new Resource(ResourceType.EXTERNAL, {input_source_type}), Action.READ) ] `	2023-04-10 09:48:57 -04:00
Clint Wylie	1aef72aa7e	Bump up the version in pom to 27.0.0 in preparation of release (#14051 )	2023-04-10 14:56:59 +05:30
Gian Merlino	d52bc333aa	Frames: Ensure nulls are read as default values when appropriate. (#14020 ) * Frames: Ensure nulls are read as default values when appropriate. Fixes a bug where LongFieldWriter didn't write a properly transformed zero when writing out a null. This had no meaningful effect in SQL-compatible null handling mode, because the field would get treated as a null anyway. But it does have an effect in default-value mode: it would cause Long.MIN_VALUE to get read out instead of zero. Also adds NullHandling checks to the various frame-based column selectors, allowing reading of nullable frames by servers in default-value mode.	2023-04-10 05:28:46 +05:30
Clint Wylie	a769f14652	fix compile with java 8 (#14045 )	2023-04-07 07:01:38 -07:00
Abhishek Radhakrishnan	f47b05a98c	Hyphenate multi value string for consistency. Fixup extra space in javadoc. (#14043 )	2023-04-07 11:46:07 +05:30
zachjsh	5c0221375c	Allow for Input source security in native task layer (#14003 ) Fixes #13837. ### Description This change allows for input source type security in the native task layer. To enable this feature, the user must set the following property to true: `druid.auth.enableInputSourceSecurity=true` The default value for this property is false, which will continue the existing functionality of needing authorization to write to the respective datasource. When this config is enabled, the users will be required to be authorized for the following resource action, in addition to write permission on the respective datasource. `new ResourceAction(new Resource(ResourceType.EXTERNAL, {INPUT_SOURCE_TYPE}, Action.READ` where `{INPUT_SOURCE_TYPE}` is the type of the input source being used;, http, inline, s3, etc.. Only tasks that provide a non-default implementation of the `getInputSourceResources` method can be submitted when config `druid.auth.enableInputSourceSecurity=true` is set. Otherwise, a 400 error will be thrown.	2023-04-06 13:13:09 -04:00
Paul Rogers	030ed911d4	Temporarily revert extended table functions for Druid 26 (#14019 )	2023-04-05 21:09:33 -07:00
Abhishek Radhakrishnan	b98eed8fb8	Revert quoting lookup fix. (#14034 ) * Revert "Add ANSI_QUOTES propety to DBI init in lookups. (#13826)" This reverts commit `9e9976001c`. * Revert "Quote and escape literals in JDBC lookup to allow reserved identifiers. (#13632)" This reverts commit `41fdf6eafb`. * fix typo.	2023-04-05 20:52:36 -07:00
Gian Merlino	319f99db05	Always use file sizes when determining batch ingest splits (#13955 ) * Always use file sizes when determining batch ingest splits. Main changes: 1) Update CloudObjectInputSource and its subclasses (S3, GCS, Azure, Aliyun OSS) to use SplitHintSpecs in all cases. Previously, they were only used for prefixes, not uris or objects. 2) Update ExternalInputSpecSlicer (MSQ) to consider file size. Previously, file size was ignored; all files were treated as equal weight when determining splits. A side effect of these changes is that we'll make additional network calls to find the sizes of objects when users specify URIs or objects as opposed to prefixes. IMO, this is worth it because it's the only way to respect the user's split hint and task assignment settings. Secondary changes: 1) S3, Aliyun OSS: Use getObjectMetadata instead of listObjects to get metadata for a single object. This is a simpler call that is also expected to be less expensive. 2) Azure: Fix a bug where getBlobLength did not populate blob reference attributes, and therefore would not actually retrieve the blob length. 3) MSQ: Align dynamic slicing logic between ExternalInputSpecSlicer and TableInputSpecSlicer. 4) MSQ: Adjust WorkerInputs to ensure there is always at least one worker, even if it has a nil slice. * Add msqCompatible to testGroupByWithImpossibleTimeFilter. * Fix tests. * Add additional tests. * Remove unused stuff. * Remove more unused stuff. * Adjust thresholds. * Remove irrelevant test. * Fix comments. * Fix bug. * Updates.	2023-04-05 08:54:01 -07:00
Karan Kumar	e6a11707cb	Adding query stack fault to MSQ to capture native query errors. (#13926 ) * Add a new fault "QueryRuntimeError" to MSQ engine to capture native query errors. * Fixed bug in MSQ fault tolerance where worker were being retried if `UnexpectedMultiValueDimensionException` was thrown. * An exception from the query runtime with `org.apache.druid.query` as the package name is thrown as a QueryRuntimeError	2023-04-05 16:29:10 +05:30
Laksh Singla	012b49d5e5	Fix the order of aggregator finalization in GroupByPostShuffleFrameProcessor (MSQ) (#14022 ) * fix the order in which finalization is done * add comment explaining the change * null handling case	2023-04-05 11:04:06 +05:30
Clint Wylie	d21babc5b8	remix nested columns (#14014 ) changes: * introduce ColumnFormat to separate physical storage format from logical type. ColumnFormat is now used instead of ColumnCapabilities to get column handlers for segment creation * introduce new 'auto' type indexer and merger which produces a new common nested format of columns, which is the next logical iteration of the nested column stuff. Essentially this is an automatic type column indexer that produces the most appropriate column for the given inputs, making either STRING, ARRAY<STRING>, LONG, ARRAY<LONG>, DOUBLE, ARRAY<DOUBLE>, or COMPLEX<json>. * revert NestedDataColumnIndexer, NestedDataColumnMerger, NestedDataColumnSerializer to their version pre #13803 behavior (v4) for backwards compatibility * fix a bug in RoaringBitmapSerdeFactory if anything actually ever wrote out an empty bitmap using toBytes and then later tried to read it (the nerve!)	2023-04-04 17:51:59 -07:00
Karan Kumar	217b0f6832	Eagerly fetching remote s3 files leading to out of disk (OOD) (#13981 ) * Eagerly fetching remote s3 files leading to OOD.	2023-04-03 14:10:37 +05:30
Clint Wylie	e3211e3be0	actually backwards compatible frontCoded string encoding strategy (#13996 )	2023-03-31 02:24:12 -07:00
zachjsh	3bb67721f7	Allow for Input source security in SQL layer (#13989 ) This change introduces the concept of input source type security model, proposed in #13837.. With this change, this feature is only available at the SQL layer, but we will expand to native layer in a follow up PR. To enable this feature, the user must set the following property to true: druid.auth.enableInputSourceSecurity=true The default value for this property is false, which will continue the existing functionality of having the usage all external sources being authorized against the hardcoded resource action new ResourceAction(new Resource(ResourceType.EXTERNAL, ResourceType.EXTERNAL), Action.READ When this config is enabled, the users will be required to be authorized for the following resource action new ResourceAction(new Resource(ResourceType.EXTERNAL, {INPUT_SOURCE_TYPE}, Action.READ where {INPUT_SOURCE_TYPE} is the type of the input source being used;, http, inline, s3, etc.. Documentation has not been added for the feature as it is not complete at the moment, as we still need to enable this for the native layer in a follow up pr.	2023-03-29 22:15:33 -04:00

1 2 3 4 5 ...

1286 Commits