druid

Commit Graph

Author	SHA1	Message	Date
Zoltan Haindrich	b100e982a4	make/etc	2024-07-17 16:40:30 +00:00
Zoltan Haindrich	0811d801fb	make query run	2024-07-17 16:33:10 +00:00
Zoltan Haindrich	97c32ca3de	less crappy way to run it	2024-07-17 16:19:08 +00:00
Zoltan Haindrich	6790f9cf8b	move stuff	2024-07-17 16:08:32 +00:00
Zoltan Haindrich	51d465df6d	make engine load via injector for msqdrill	2024-07-17 16:04:14 +00:00
Zoltan Haindrich	0eaf4c61b9	removePrint	2024-07-17 15:52:19 +00:00
Zoltan Haindrich	f3cf778115	some stuff	2024-07-17 15:48:36 +00:00
Zoltan Haindrich	42b3086512	msq-test-0	2024-07-17 15:38:50 +00:00
Zoltan Haindrich	8ada2ff238	picked akshat's 3e0202811e05dcd07db5ab47791151fab5dd5772	2024-07-17 14:44:27 +00:00
Zoltan Haindrich	2a590eb3ae	Merge commit 'apache/master^^^' into quidem-record	2024-07-17 13:27:54 +00:00
Sree Charan Manamala	78a4a09d01	Window Function offset correction for RAC (#16718 ) * When an ArrayList RAC creates a child RAC, the start and end offsets need to have the offset of parent's start offset * Defaults the 2nd window bound to CURRENT ROW when only a single bound is specified * Removes the windowingStrictValidation warning and throws a hard exception when Order By alongside RANGE clause is not provided with UNBOUNDED or CURRENT ROW as both bounds	2024-07-15 12:43:27 +02:00
Rishabh Singh	64104533ac	Enable querying entirely cold datasources (#16676 ) Add ability to query entirely cold datasources.	2024-07-15 15:02:59 +05:30
Vishesh Garg	197c54f673	Auto-Compaction using Multi-Stage Query Engine (#16291 ) Description: Compaction operations issued by the Coordinator currently run using the native query engine. As majority of the advancements that we are making in batch ingestion are in MSQ, it is imperative that we support compaction on MSQ to make Compaction more robust and possibly faster. For instance, we have seen OOM errors in native compaction that MSQ could have handled by its auto-calculation of tuning parameters. This commit enables compaction on MSQ to remove the dependency on native engine. Main changes: * `DataSourceCompactionConfig` now has an additional field `engine` that can be one of `[native, msq]` with `native` being the default. * if engine is MSQ, `CompactSegments` duty assigns all available compaction task slots to the launched `CompactionTask` to ensure full capacity is available to MSQ. This is to avoid stalling which could happen in case a fraction of the tasks were allotted and they eventually fell short of the number of tasks required by the MSQ engine to run the compaction. * `ClientCompactionTaskQuery` has a new field `compactionRunner` with just one `engine` field. * `CompactionTask` now has `CompactionRunner` interface instance with its implementations `NativeCompactinRunner` and `MSQCompactionRunner` in the `druid-multi-stage-query` extension. The objectmapper deserializes `ClientCompactionRunnerInfo` in `ClientCompactionTaskQuery` to the `CompactionRunner` instance that is mapped to the specified type [`native`, `msq`]. * `CompactTask` uses the `CompactionRunner` instance it receives to create the indexing tasks. * `CompactionTask` to `MSQControllerTask` conversion logic checks whether metrics are present in the segment schema. If present, the task is created with a native group-by query; if not, the task is issued with a scan query. The `storeCompactionState` flag is set in the context. * Each created `MSQControllerTask` is launched in-place and its `TaskStatus` tracked to determine the final status of the `CompactionTask`. The id of each of these tasks is the same as that of `CompactionTask` since otherwise, the workers will be unable to determine the controller task's location for communication (as they haven't been launched via the overlord).	2024-07-12 16:40:20 +05:30
Sree Charan Manamala	760d70312f	Window Drill tests coverage improvement (#16722 ) Window Drill tests coverage improvement	2024-07-11 19:11:36 +05:30
Zoltan Haindrich	a9bd0eea2a	Fix queries filtering for the same condition with both an IN and EQUALS to not return empty results (#16597 ) temp fix until CALCITE-6435 gets fixed (released&upgraded to) added a custom rule (FixIncorrectInExpansionTypes) to fix-up types of the affected literals added a testcase which will alert on upgrade	2024-07-09 12:28:21 +05:30
Alberic Liu	c6c2652c89	unified the code format in NestedDataOperatorConversions (#16695 )	2024-07-08 10:06:24 +08:00
Akshat Jain	34c80ee3de	Add MSQ engine support for window function drill tests (#16665 ) * Add MSQ engine support for window function drill tests * Address review comments * Revert formatting changes in TestDataBuilder	2024-06-28 11:14:17 +05:30
Rishabh Singh	b9c7664ac3	Fix empty datasource schema on the Broker when metadata query is disabled (#16645 ) * Fix build * Fix empty datasource schema on the broker * review comment * Remove unused import	2024-06-28 11:06:56 +05:30
Clint Wylie	d4f2636325	fix greatest/least function non-vectorized processing to ignore null argument types (#16649 )	2024-06-26 12:59:42 -07:00
Zoltan Haindrich	e7141e2080	Revert "stuff" This reverts commit `1b7dd8fd3c`.	2024-06-26 10:07:48 +00:00
Zoltan Haindrich	1b7dd8fd3c	stuff	2024-06-26 10:07:47 +00:00
Tom	52c9929019	Column name in parse exceptions (#16529 ) * first pass * more changes * fix tests and formatting * fix kinesis failing tests * fix kafka tests * add dimension name to float parse errors * double and convertToType handling of dimensionName can report parse errors with dimension name * fix checkstyle issue * fix tests * more cases to have better parse exception messages * fix test * fix tests * partially address comments * annotate method parameter with nullable * address comments * fix tests * let float, double, long dimensionIndexer pass dimensionName down to dimensionHandlerUtils * fix compilation error and clean up formatting * clean up whitespace * address feedback. undo change, pass down report parse exception for convertToType * fix test	2024-06-25 13:42:52 -07:00
Zoltan Haindrich	0d76a73c4c	remove final	2024-06-25 06:33:45 +00:00
Clint Wylie	37a50e6803	Remove index_realtime and index_realtime_appenderator tasks (#16602 ) index_realtime tasks were removed from the documentation in #13107. Even at that time, they weren't really documented per se— just mentioned. They existed solely to support Tranquility, which is an obsolete ingestion method that predates migration of Druid to ASF and is no longer being maintained. Tranquility docs were also de-linked from the sidebars and the other doc pages in #11134. Only a stub remains, so people with links to the page can see that it's no longer recommended. index_realtime_appenderator tasks existed in the code base, but were never documented, nor as far as I am aware were they used for any purpose. This patch removes both task types completely, as well as removes all supporting code that was otherwise unused. It also updates the stub doc for Tranquility to be firmer that it is not compatible. (Previously, the stub doc said it wasn't recommended, and pointed out that it is built against an ancient 0.9.2 version of Druid.) ITUnionQueryTest has been migrated to the new integration tests framework and updated to use Kafka ingestion. Co-authored-by: Gian Merlino <gianmerlino@gmail.com>	2024-06-24 20:13:33 -07:00
Sree Charan Manamala	990fd5f5fb	Make use group iterator for all window frames & support for same bound kinds (#16603 ) Fixes apache/druid#15739	2024-06-24 15:52:41 +02:00
Laksh Singla	00c96432af	Materialize scan results correctly when columns are not present in the segments (#16619 ) Fixes a bug causing maxSubqueryBytes not to work when segments have missing columns.	2024-06-23 23:15:45 +05:30
Abhishek Radhakrishnan	b20c3dbadf	Fix malformed period throwing `ADMIN` persona error (#16626 ) * Turn invalid periods into user-facing exception providing more context. The current exception is targeting the ADMIN persona. Catch that and turn it into a USER persona instead. Also, provide more context in the error message. * Review comment: pass the wrapping expression and stringify. * Update processing/src/main/java/org/apache/druid/query/expression/ExprUtils.java Co-authored-by: Clint Wylie <cjwylie@gmail.com> --------- Co-authored-by: Clint Wylie <cjwylie@gmail.com>	2024-06-20 08:40:28 -07:00
Zoltan Haindrich	6e48cb86d5	move annotation	2024-06-20 14:41:45 +00:00
Zoltan Haindrich	ebb27cf462	add extension to disabel when not sql compat	2024-06-20 14:40:44 +00:00
Zoltan Haindrich	604910cead	cleanup	2024-06-20 14:27:09 +00:00
Sree Charan Manamala	7ac0862287	Grouping Engine fix when a limit spec with different order by columns is applied (#16534 )	2024-06-20 11:35:58 +02:00
Zoltan Haindrich	1a0ab2c3b1	Merge remote-tracking branch 'apache/master' into quidem-record	2024-06-19 12:59:26 +00:00
Zoltan Haindrich	dffa331018	updates/etc	2024-06-18 16:33:22 +00:00
Zoltan Haindrich	d14b7374ba	add	2024-06-18 15:59:29 +00:00
Zoltan Haindrich	e725df7110	fix loader	2024-06-18 15:43:35 +00:00
Zoltan Haindrich	f5720ce97d	u	2024-06-17 16:05:29 +00:00
Zoltan Haindrich	7eccf5b518	add some validation/etc	2024-06-17 15:42:48 +00:00
Zoltan Haindrich	e06e54631e	fix style; emitter	2024-06-17 14:26:48 +00:00
Zoltan Haindrich	47696a6108	updates	2024-06-17 13:05:11 +00:00
Laksh Singla	da1e293a57	Deserialize dimensions in group by queries to their respective types when reading from their serialized format (#16511 ) * init * tests, pair groupable * framework change * tests * update benchmarks * comments * add javadoc for the jsonMapper * remove extra deserialization * add special serde for map based result rows * revert unnecessary change --------- Co-authored-by: asdf2014 <asdf2014@apache.org>	2024-06-14 16:27:47 +08:00
Zoltan Haindrich	ac19b148c2	Upgrade calcite to 1.37.0 (#16504 ) * contains Make a full copy of the parser and apply our modifications to it #16503 * some minor api changes pair/entry * some unnecessary aggregation was removed from a set of queries in `CalciteSubqueryTest` * `AliasedOperatorConversion` was detecting `CHAR_LENGTH` as not a function ; I've removed the check * the field it was using doesn't look maintained that much * the `kind` is passed for the created `SqlFunction` so I don't think this check is actually needed * some decoupled test cases become broken - will be fixed later * some aggregate related changes: due to the fact that SUM() and COUNT() of no inputs are different * upgrade avatica to 1.25.0 * `CalciteQueryTest#testExactCountDistinctWithFilter` is now executable Close apache/druid#16503	2024-06-13 08:47:50 +02:00
Zoltan Haindrich	f8645de341	Remove incorrect utf8 conversion of ResultCache keys (#16569 )	2024-06-12 13:12:05 -07:00
Clint Wylie	fee509df2e	fix NestedDataColumnIndexerV4 to not report cardinality (#16507 ) * fix NestedDataColumnIndexerV4 to not report cardinality changes: * fix issue similar to #16489 but for NestedDataColumnIndexerV4, which can report STRING type if it only processes a single type of values. this should be less common than the auto indexer problem * fix some issues with sql benchmarks	2024-06-11 20:58:12 -07:00
zachjsh	3f5f5921e0	Fix sql syntax error user (#16583 ) This fixes an issue where in some cases, a SQL syntax error encountered when parsing / planning a query results in an error returned to the user with persona a `admin` when it should instead be `user`.	2024-06-11 18:08:35 -04:00
Zoltan Haindrich	7a65938fd6	use druidhook instead Hook	2024-06-11 15:55:55 +00:00
Clint Wylie	3fb6ba22e8	fix expression column capabilities to not report dictionary encoded unless input is string (#16577 )	2024-06-08 13:05:19 -07:00
Gian Merlino	277006446d	Fallback vectorization for FunctionExpr and BaseMacroFunctionExpr. (#16366 ) * Fallback vectorization for FunctionExpr and BaseMacroFunctionExpr. This patch adds FallbackVectorProcessor, a processor that adapts non-vectorizable operations into vectorizable ones. It is used in FunctionExpr and BaseMacroFunctionExpr. In addition: - Identifiers are updated to offer getObjectVector for ARRAY and COMPLEX in addition to STRING. ExprEvalObjectVector is updated to offer ARRAY and COMPLEX as well. - In SQL tests, cannotVectorize now fails tests if an exception is not thrown. This makes it easier to identify tests that can now vectorize. - Fix a null-matcher bug in StringObjectVectorValueMatcher. * Fix tests. * Fixes. * Fix tests. * Fix test. * Fix test.	2024-06-05 20:03:02 -07:00
Gian Merlino	b837ce565b	Simplify serialized form of JsonInputFormat. (#15691 ) * Simplify serialized form of JsonInputFormat. Use JsonInclude for keepNullColumns, assumeNewlineDelimited, and useJsonNodeReader. Because the default value of keepNullColumns is variable, we store the original configured value rather than the derived value, and include if the original value is nonnull. * Fix test.	2024-06-05 20:01:14 -07:00
Gian Merlino	1040a29bc5	Fix capabilities reported by UnnestStorageAdapter. (#16551 ) UnnestStorageAdapter and its cursors did not return capabilities correctly for the output column. This patch fixes two problems: 1) UnnestStorageAdapter returned the capabilities of the unnest virtual column prior to unnesting. It should return the post-unnest capabilities. 2) UnnestColumnValueSelectorCursor passed through isDictionaryEncoded from the unnest virtual column. This is incorrect, because the dimension selector created by this class never has a dictionary. This is the cause of #16543.	2024-06-05 15:19:42 -07:00
Akshat Jain	6d7d2ffa63	Add interface method for returning canonical lookup name (#16557 ) * Add interface method for returning canonical lookup name * Address review comment * Add test in LookupReferencesManagerTest for coverage check * Add test in LookupSerdeModuleTest for coverage check	2024-06-05 14:33:18 -07:00

1 2 3 4 5 ...

1075 Commits