2022-09-06 13:36:09 -04:00
---
id: known-issues
title: SQL-based ingestion known issues
sidebar_label: Known issues
---

<!--
  ~ Licensed to the Apache Software Foundation (ASF) under one
  ~ or more contributor license agreements. See the NOTICE file
  ~ distributed with this work for additional information
  ~ regarding copyright ownership. The ASF licenses this file
  ~ to you under the Apache License, Version 2.0 (the
  ~ "License"); you may not use this file except in compliance
  ~ with the License. You may obtain a copy of the License at
  ~
  ~ http://www.apache.org/licenses/LICENSE-2.0
  ~
  ~ Unless required by applicable law or agreed to in writing,
  ~ software distributed under the License is distributed on an
  ~ "AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY
  ~ KIND, either express or implied. See the License for the
  ~ specific language governing permissions and limitations
  ~ under the License.
  -->

> This page describes SQL-based batch ingestion using the [`druid-multi-stage-query`](../multi-stage-query/index.md)
> extension, new in Druid 24.0. Refer to the [ingestion methods](../ingestion/index.md#batch) table to determine which
> ingestion method is right for you.

## Multi-stage query task runtime

- Fault tolerance is not implemented. If any task fails, the entire query fails.

- Worker task stage outputs are stored in the working directory given by `druid.indexer.task.baseDir`. Stages that
  generate a large amount of output data may exhaust all available disk space. In this case, the query fails with
  an [UnknownError](./reference.md#error-codes) whose message includes "No space left on device".

## SELECT

- SELECT from a Druid datasource does not include unpublished real-time data.

- GROUPING SETS and UNION ALL are not implemented. Queries using these features return a
  [QueryNotSupported](reference.md#error-codes) error.

- The numeric varieties of the EARLIEST and LATEST aggregators do not work properly. Attempting to use the numeric
  varieties of these aggregators leads to an error like
  `java.lang.ClassCastException: class java.lang.Double cannot be cast to class org.apache.druid.collections.SerializablePair`.
  The string varieties, however, do work properly.

## INSERT and REPLACE

- INSERT and REPLACE with column lists, like `INSERT INTO tbl (a, b, c) SELECT ...`, are not implemented.

- `INSERT ... SELECT` and `REPLACE ... SELECT` insert columns from the SELECT statement based on column name. This
  differs from SQL standard behavior, where columns are inserted based on position.

- INSERT and REPLACE do not support all options available in [ingestion specs](../ingestion/ingestion-spec.md),
  including the `createBitmapIndex` and `multiValueHandling` [dimension](../ingestion/ingestion-spec.md#dimension-objects)
  properties, and the `indexSpec` [`tuningConfig`](../ingestion/ingestion-spec.md#tuningconfig) property.

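Because column lists are not available and columns are matched by name, one way to control which target column receives each value is to alias every SELECT expression to the target column name. The following is a minimal sketch, not a definitive recipe: the datasource names `tbl` and `source_tbl` and the source columns `ts`, `col_x`, `col_y`, and `col_z` are hypothetical.

```sql
-- Hypothetical names throughout: "tbl", "source_tbl", "ts", "col_x", "col_y", "col_z".
-- Each expression is aliased to the name of the target column it should populate.
-- Columns are matched by these names, not by their position in the SELECT list.
INSERT INTO "tbl"
SELECT
  TIME_PARSE("ts") AS __time,
  "col_z" AS c,
  "col_x" AS a,
  "col_y" AS b
FROM "source_tbl"
PARTITIONED BY DAY
```

In this sketch, `c` is listed before `a` and `b`, but each value still lands in the column named by its alias.
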
## EXTERN

- The [schemaless dimensions](../ingestion/ingestion-spec.md#inclusions-and-exclusions)
  feature is not available. All columns and their types must be specified explicitly using the `signature` parameter
  of the [EXTERN function](reference.md#extern).

- EXTERN with input sources that match large numbers of files may exhaust available memory on the controller task.

- EXTERN does not accept `druid` input sources. Use FROM instead.
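
As an illustration of the explicit `signature` requirement, the following sketch declares every input column by name and type in the third EXTERN argument. The URI, datasource name, and column names are placeholders chosen for this example, not values taken from this page.

```sql
-- Placeholder URI, datasource, and columns; adjust to your own data.
-- The third EXTERN argument is the signature: one entry per input column.
INSERT INTO "example_datasource"
SELECT
  TIME_PARSE("timestamp") AS __time,
  "page",
  "added"
FROM TABLE(
  EXTERN(
    '{"type": "http", "uris": ["https://example.com/data.json.gz"]}',
    '{"type": "json"}',
    '[{"name": "timestamp", "type": "string"}, {"name": "page", "type": "string"}, {"name": "added", "type": "long"}]'
  )
)
PARTITIONED BY DAY
```

To read from an existing Druid datasource, reference it directly in the FROM clause rather than through EXTERN. Again, the datasource and column names here are hypothetical:

```sql
-- Reads from a Druid datasource with FROM instead of a `druid` input source.
INSERT INTO "new_datasource"
SELECT
  __time,
  "page",
  "added"
FROM "existing_datasource"
PARTITIONED BY DAY
```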