druid/docs/querying/sql-data-types.md

---
id: sql-data-types
title: "SQL data types"
sidebar_label: "SQL data types"
---

<!--
  ~ Licensed to the Apache Software Foundation (ASF) under one
  ~ or more contributor license agreements.  See the NOTICE file
  ~ distributed with this work for additional information
  ~ regarding copyright ownership.  The ASF licenses this file
  ~ to you under the Apache License, Version 2.0 (the
  ~ "License"); you may not use this file except in compliance
  ~ with the License.  You may obtain a copy of the License at
  ~
  ~   http://www.apache.org/licenses/LICENSE-2.0
  ~
  ~ Unless required by applicable law or agreed to in writing,
  ~ software distributed under the License is distributed on an
  ~ "AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY
  ~ KIND, either express or implied.  See the License for the
  ~ specific language governing permissions and limitations
  ~ under the License.
  -->

> Apache Druid supports two query languages: Druid SQL and [native queries](querying.md).
> This document describes the SQL language.


Columns in Druid are associated with a specific data type. This topic describes supported data types in [Druid SQL](./sql.md). 

## Standard types

Druid natively supports five basic column types: "long" (64 bit signed int), "float" (32 bit float), "double" (64 bit
float) "string" (UTF-8 encoded strings and string arrays), and "complex" (catch-all for more exotic data types like
json, hyperUnique, and approxHistogram columns).

Timestamps (including the `__time` column) are treated by Druid as longs, with the value being the number of
milliseconds since 1970-01-01 00:00:00 UTC, not counting leap seconds. Therefore, timestamps in Druid do not carry any
timezone information, but only carry information about the exact moment in time they represent. See the
[Time functions](sql-scalar.md#date-and-time-functions) section for more information about timestamp handling.

Casts between two SQL types with the same Druid runtime type (see below table) have no effect, other than exceptions
noted in the table. Casts between two SQL types that have different Druid runtime types generate a runtime cast in
Druid. If a value cannot be cast to the target type, as in `CAST('foo' AS BIGINT)`, Druid either substitutes a default
value (when `druid.generic.useDefaultValueForNull = true`, the default mode), or substitutes [NULL](#null-values) (when
`druid.generic.useDefaultValueForNull = false`). NULL values cast to non-nullable types are also substituted with a
default value. For example, if `druid.generic.useDefaultValueForNull = true`, a null VARCHAR cast to BIGINT is converted
to a zero.

The following table describes how Druid maps SQL types onto native types when running queries.

|SQL type|Druid runtime type|Default value<sup>*</sup>|Notes|
|--------|------------------|-------------|-----|
|CHAR|STRING|`''`||
|VARCHAR|STRING|`''`|Druid STRING columns are reported as VARCHAR. Can include [multi-value strings](#multi-value-strings) as well.|
|DECIMAL|DOUBLE|`0.0`|DECIMAL uses floating point, not fixed point math|
|FLOAT|FLOAT|`0.0`|Druid FLOAT columns are reported as FLOAT|
|REAL|DOUBLE|`0.0`||
|DOUBLE|DOUBLE|`0.0`|Druid DOUBLE columns are reported as DOUBLE|
|BOOLEAN|LONG|`false`||
|TINYINT|LONG|`0`||
|SMALLINT|LONG|`0`||
|INTEGER|LONG|`0`||
|BIGINT|LONG|`0`|Druid LONG columns (except `__time`) are reported as BIGINT|
|TIMESTAMP|LONG|`0`, meaning 1970-01-01 00:00:00 UTC|Druid's `__time` column is reported as TIMESTAMP. Casts between string and timestamp types assume standard SQL formatting, e.g. `2000-01-02 03:04:05`, _not_ ISO8601 formatting. For handling other formats, use one of the [time functions](sql-scalar.md#date-and-time-functions).|
|DATE|LONG|`0`, meaning 1970-01-01|Casting TIMESTAMP to DATE rounds down the timestamp to the nearest day. Casts between string and date types assume standard SQL formatting, e.g. `2000-01-02`. For handling other formats, use one of the [time functions](sql-scalar.md#date-and-time-functions).|
|OTHER|COMPLEX|none|May represent various Druid column types such as hyperUnique, approxHistogram, etc.|

<sup>*</sup> Default value applies if `druid.generic.useDefaultValueForNull = true` (the default mode). Otherwise, the default value is `NULL` for all types.

## Multi-value strings

Druid's native type system allows strings to potentially have multiple values. These
[multi-value string dimensions](multi-value-dimensions.md) are reported in SQL as `VARCHAR` typed, and can be
syntactically used like any other VARCHAR. Regular string functions that refer to multi-value string dimensions are
applied to all values for each row individually. Multi-value string dimensions can also be treated as arrays via special
[multi-value string functions](sql-multivalue-string-functions.md), which can perform powerful array-aware operations.

Grouping by a multi-value expression observes the native Druid multi-value aggregation behavior, which is similar to
the `UNNEST` functionality available in some other SQL dialects. Refer to the documentation on
[multi-value string dimensions](multi-value-dimensions.md) for additional details.

> Because multi-value dimensions are treated by the SQL planner as `VARCHAR`, there are some inconsistencies between how
> they are handled in Druid SQL and in native queries. For example, expressions involving multi-value dimensions may be
> incorrectly optimized by the Druid SQL planner: `multi_val_dim = 'a' AND multi_val_dim = 'b'` is optimized to
> `false`, even though it is possible for a single row to have both "a" and "b" as values for `multi_val_dim`. The
> SQL behavior of multi-value dimensions will change in a future release to more closely align with their behavior
> in native queries.

## NULL values

The [`druid.generic.useDefaultValueForNull`](../configuration/index.md#sql-compatible-null-handling)
runtime property controls Druid's NULL handling mode. For the most SQL compliant behavior, set this to `false`.

When `druid.generic.useDefaultValueForNull = true` (the default mode), Druid treats NULLs and empty strings
interchangeably, rather than according to the SQL standard. In this mode Druid SQL only has partial support for NULLs.
For example, the expressions `col IS NULL` and `col = ''` are equivalent, and both evaluate to true if `col`
contains an empty string. Similarly, the expression `COALESCE(col1, col2)` returns `col2` if `col1` is an empty
string. While the `COUNT(*)` aggregator counts all rows, the `COUNT(expr)` aggregator counts the number of rows
where `expr` is neither null nor the empty string. Numeric columns in this mode are not nullable; any null or missing
values are treated as zeroes.

When `druid.generic.useDefaultValueForNull = false`, NULLs are treated more closely to the SQL standard. In this mode,
numeric NULL is permitted, and NULLs and empty strings are no longer treated as interchangeable. This property
affects both storage and querying, and must be set on all Druid service types to be available at both ingestion time
and query time. There is some overhead associated with the ability to handle NULLs; see
the [segment internals](../design/segments.md#handling-null-values) documentation for more details.

## Boolean logic

The [`druid.expressions.useStrictBooleans`](../configuration/index.md#expression-processing-configurations)
runtime property controls Druid's boolean logic mode. For the most SQL compliant behavior, set this to `true`.

When `druid.expressions.useStrictBooleans = false` (the default mode), Druid uses two-valued logic.

When `druid.expressions.useStrictBooleans = true`, Druid uses three-valued logic for
[expressions](../misc/math-expr.md) evaluation, such as `expression` virtual columns or `expression` filters.
However, even in this mode, Druid uses two-valued logic for filter types other than `expression`.

## Nested columns

Druid supports storing nested data structures in segments using the native `COMPLEX<json>` type. See [Nested columns](./nested-columns.md) for more information.

You can interact with nested data using [JSON functions](./sql-json-functions.md), which can extract nested values, parse from string, serialize to string, and create new `COMPLEX<json>` structures.

`COMPLEX` types have limited functionality outside the specialized functions that use them, so their behavior is undefined when:

* Grouping on complex values.
* Filtering directly on complex values, such as `WHERE json is NULL`.
* Used as inputs to aggregators without specialized handling for a specific complex type.

In many cases, functions are provided to translate `COMPLEX` value types to `STRING`, which serves as a workaround solution until `COMPLEX` type functionality can be improved.
Refactor SQL docs (#12239) * refactor and link fixes * add sql docs to left nav * code format for needle * updated web console script * link fixes * update earliest/latest functions * edits for grammar and style * more link fixes * another link * update with #12226 * update .spelling file 2022-02-11 17:43:30 -05:00			`---`
			`id: sql-data-types`
			`title: "SQL data types"`
			`sidebar_label: "SQL data types"`
			`---`

			`<!--`
			`~ Licensed to the Apache Software Foundation (ASF) under one`
			`~ or more contributor license agreements. See the NOTICE file`
			`~ distributed with this work for additional information`
			`~ regarding copyright ownership. The ASF licenses this file`
			`~ to you under the Apache License, Version 2.0 (the`
			`~ "License"); you may not use this file except in compliance`
			`~ with the License. You may obtain a copy of the License at`
			`~`
			`~ http://www.apache.org/licenses/LICENSE-2.0`
			`~`
			`~ Unless required by applicable law or agreed to in writing,`
			`~ software distributed under the License is distributed on an`
			`~ "AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY`
			`~ KIND, either express or implied. See the License for the`
			`~ specific language governing permissions and limitations`
			`~ under the License.`
			`-->`

			`> Apache Druid supports two query languages: Druid SQL and [native queries](querying.md).`
			`> This document describes the SQL language.`


Sql docs items (#12530) * touch up sql refactor * brush up SQL refactor * incorporate feedback * reorder sql * Update docs/querying/sql.md Co-authored-by: Victoria Lim <vtlim@users.noreply.github.com> Co-authored-by: Victoria Lim <vtlim@users.noreply.github.com> 2022-05-17 19:56:31 -04:00			`Columns in Druid are associated with a specific data type. This topic describes supported data types in [Druid SQL](./sql.md).`
Refactor SQL docs (#12239) * refactor and link fixes * add sql docs to left nav * code format for needle * updated web console script * link fixes * update earliest/latest functions * edits for grammar and style * more link fixes * another link * update with #12226 * update .spelling file 2022-02-11 17:43:30 -05:00
			`## Standard types`

			`Druid natively supports five basic column types: "long" (64 bit signed int), "float" (32 bit float), "double" (64 bit`
			`float) "string" (UTF-8 encoded strings and string arrays), and "complex" (catch-all for more exotic data types like`
basic docs for nested column query functions (#12922) * basic docs for nested column query functions 2022-08-19 20:12:19 -04:00			`json, hyperUnique, and approxHistogram columns).`
Refactor SQL docs (#12239) * refactor and link fixes * add sql docs to left nav * code format for needle * updated web console script * link fixes * update earliest/latest functions * edits for grammar and style * more link fixes * another link * update with #12226 * update .spelling file 2022-02-11 17:43:30 -05:00
			Timestamps (including the `__time` column) are treated by Druid as longs, with the value being the number of
			`milliseconds since 1970-01-01 00:00:00 UTC, not counting leap seconds. Therefore, timestamps in Druid do not carry any`
			`timezone information, but only carry information about the exact moment in time they represent. See the`
			`[Time functions](sql-scalar.md#date-and-time-functions) section for more information about timestamp handling.`

SQL: Improve docs around casts. (#13466) Main change: clarify that the "default value" for casts only applies if druid.generic.useDefaultValueForNull = true. Secondary change: adjust a bunch of wording from future to present tense. 2022-12-15 18:01:40 -05:00			`Casts between two SQL types with the same Druid runtime type (see below table) have no effect, other than exceptions`
			`noted in the table. Casts between two SQL types that have different Druid runtime types generate a runtime cast in`
			Druid. If a value cannot be cast to the target type, as in `CAST('foo' AS BIGINT)`, Druid either substitutes a default
			value (when `druid.generic.useDefaultValueForNull = true`, the default mode), or substitutes [NULL](#null-values) (when
			`druid.generic.useDefaultValueForNull = false`). NULL values cast to non-nullable types are also substituted with a
			default value. For example, if `druid.generic.useDefaultValueForNull = true`, a null VARCHAR cast to BIGINT is converted
			`to a zero.`

			`The following table describes how Druid maps SQL types onto native types when running queries.`

			`\|SQL type\|Druid runtime type\|Default value<sup>*</sup>\|Notes\|`
Refactor SQL docs (#12239) * refactor and link fixes * add sql docs to left nav * code format for needle * updated web console script * link fixes * update earliest/latest functions * edits for grammar and style * more link fixes * another link * update with #12226 * update .spelling file 2022-02-11 17:43:30 -05:00			`\|--------\|------------------\|-------------\|-----\|`
			\|CHAR\|STRING\|`''`\|\|
			\|VARCHAR\|STRING\|`''`\|Druid STRING columns are reported as VARCHAR. Can include [multi-value strings](#multi-value-strings) as well.\|
			\|DECIMAL\|DOUBLE\|`0.0`\|DECIMAL uses floating point, not fixed point math\|
			\|FLOAT\|FLOAT\|`0.0`\|Druid FLOAT columns are reported as FLOAT\|
			\|REAL\|DOUBLE\|`0.0`\|\|
			\|DOUBLE\|DOUBLE\|`0.0`\|Druid DOUBLE columns are reported as DOUBLE\|
			\|BOOLEAN\|LONG\|`false`\|\|
			\|TINYINT\|LONG\|`0`\|\|
			\|SMALLINT\|LONG\|`0`\|\|
			\|INTEGER\|LONG\|`0`\|\|
			\|BIGINT\|LONG\|`0`\|Druid LONG columns (except `__time`) are reported as BIGINT\|
			\|TIMESTAMP\|LONG\|`0`, meaning 1970-01-01 00:00:00 UTC\|Druid's `__time` column is reported as TIMESTAMP. Casts between string and timestamp types assume standard SQL formatting, e.g. `2000-01-02 03:04:05`, _not_ ISO8601 formatting. For handling other formats, use one of the [time functions](sql-scalar.md#date-and-time-functions).\|
			\|DATE\|LONG\|`0`, meaning 1970-01-01\|Casting TIMESTAMP to DATE rounds down the timestamp to the nearest day. Casts between string and date types assume standard SQL formatting, e.g. `2000-01-02`. For handling other formats, use one of the [time functions](sql-scalar.md#date-and-time-functions).\|
			`\|OTHER\|COMPLEX\|none\|May represent various Druid column types such as hyperUnique, approxHistogram, etc.\|`

SQL: Improve docs around casts. (#13466) Main change: clarify that the "default value" for casts only applies if druid.generic.useDefaultValueForNull = true. Secondary change: adjust a bunch of wording from future to present tense. 2022-12-15 18:01:40 -05:00			<sup>*</sup> Default value applies if `druid.generic.useDefaultValueForNull = true` (the default mode). Otherwise, the default value is `NULL` for all types.

Refactor SQL docs (#12239) * refactor and link fixes * add sql docs to left nav * code format for needle * updated web console script * link fixes * update earliest/latest functions * edits for grammar and style * more link fixes * another link * update with #12226 * update .spelling file 2022-02-11 17:43:30 -05:00			`## Multi-value strings`

			`Druid's native type system allows strings to potentially have multiple values. These`
SQL: Improve docs around casts. (#13466) Main change: clarify that the "default value" for casts only applies if druid.generic.useDefaultValueForNull = true. Secondary change: adjust a bunch of wording from future to present tense. 2022-12-15 18:01:40 -05:00			[multi-value string dimensions](multi-value-dimensions.md) are reported in SQL as `VARCHAR` typed, and can be
			`syntactically used like any other VARCHAR. Regular string functions that refer to multi-value string dimensions are`
Refactor SQL docs (#12239) * refactor and link fixes * add sql docs to left nav * code format for needle * updated web console script * link fixes * update earliest/latest functions * edits for grammar and style * more link fixes * another link * update with #12226 * update .spelling file 2022-02-11 17:43:30 -05:00			`applied to all values for each row individually. Multi-value string dimensions can also be treated as arrays via special`
			`[multi-value string functions](sql-multivalue-string-functions.md), which can perform powerful array-aware operations.`

SQL: Improve docs around casts. (#13466) Main change: clarify that the "default value" for casts only applies if druid.generic.useDefaultValueForNull = true. Secondary change: adjust a bunch of wording from future to present tense. 2022-12-15 18:01:40 -05:00			`Grouping by a multi-value expression observes the native Druid multi-value aggregation behavior, which is similar to`
Refactor SQL docs (#12239) * refactor and link fixes * add sql docs to left nav * code format for needle * updated web console script * link fixes * update earliest/latest functions * edits for grammar and style * more link fixes * another link * update with #12226 * update .spelling file 2022-02-11 17:43:30 -05:00			the `UNNEST` functionality available in some other SQL dialects. Refer to the documentation on
			`[multi-value string dimensions](multi-value-dimensions.md) for additional details.`

			> Because multi-value dimensions are treated by the SQL planner as `VARCHAR`, there are some inconsistencies between how
			`> they are handled in Druid SQL and in native queries. For example, expressions involving multi-value dimensions may be`
SQL: Improve docs around casts. (#13466) Main change: clarify that the "default value" for casts only applies if druid.generic.useDefaultValueForNull = true. Secondary change: adjust a bunch of wording from future to present tense. 2022-12-15 18:01:40 -05:00			> incorrectly optimized by the Druid SQL planner: `multi_val_dim = 'a' AND multi_val_dim = 'b'` is optimized to
Refactor SQL docs (#12239) * refactor and link fixes * add sql docs to left nav * code format for needle * updated web console script * link fixes * update earliest/latest functions * edits for grammar and style * more link fixes * another link * update with #12226 * update .spelling file 2022-02-11 17:43:30 -05:00			> `false`, even though it is possible for a single row to have both "a" and "b" as values for `multi_val_dim`. The
			`> SQL behavior of multi-value dimensions will change in a future release to more closely align with their behavior`
			`> in native queries.`

			`## NULL values`

Adjust "in" filter null behavior to match "selector". (#12863) * Adjust "in" filter null behavior to match "selector". Now, both of them match numeric nulls if constructed with a "null" value. This is consistent as far as native execution goes, but doesn't match the behavior of SQL = and IN. So, to address that, this patch also updates the docs to clarify that the native filters do match nulls. This patch also updates the SQL docs to describe how Boolean logic is handled in addition to how NULL values are handled. Fixes #12856. * Fix test. 2022-08-08 12:08:36 -04:00			The [`druid.generic.useDefaultValueForNull`](../configuration/index.md#sql-compatible-null-handling)
			runtime property controls Druid's NULL handling mode. For the most SQL compliant behavior, set this to `false`.

			When `druid.generic.useDefaultValueForNull = true` (the default mode), Druid treats NULLs and empty strings
			`interchangeably, rather than according to the SQL standard. In this mode Druid SQL only has partial support for NULLs.`
SQL: Improve docs around casts. (#13466) Main change: clarify that the "default value" for casts only applies if druid.generic.useDefaultValueForNull = true. Secondary change: adjust a bunch of wording from future to present tense. 2022-12-15 18:01:40 -05:00			For example, the expressions `col IS NULL` and `col = ''` are equivalent, and both evaluate to true if `col`
			contains an empty string. Similarly, the expression `COALESCE(col1, col2)` returns `col2` if `col1` is an empty
			string. While the `COUNT(*)` aggregator counts all rows, the `COUNT(expr)` aggregator counts the number of rows
Adjust "in" filter null behavior to match "selector". (#12863) * Adjust "in" filter null behavior to match "selector". Now, both of them match numeric nulls if constructed with a "null" value. This is consistent as far as native execution goes, but doesn't match the behavior of SQL = and IN. So, to address that, this patch also updates the docs to clarify that the native filters do match nulls. This patch also updates the SQL docs to describe how Boolean logic is handled in addition to how NULL values are handled. Fixes #12856. * Fix test. 2022-08-08 12:08:36 -04:00			where `expr` is neither null nor the empty string. Numeric columns in this mode are not nullable; any null or missing
SQL: Improve docs around casts. (#13466) Main change: clarify that the "default value" for casts only applies if druid.generic.useDefaultValueForNull = true. Secondary change: adjust a bunch of wording from future to present tense. 2022-12-15 18:01:40 -05:00			`values are treated as zeroes.`
Adjust "in" filter null behavior to match "selector". (#12863) * Adjust "in" filter null behavior to match "selector". Now, both of them match numeric nulls if constructed with a "null" value. This is consistent as far as native execution goes, but doesn't match the behavior of SQL = and IN. So, to address that, this patch also updates the docs to clarify that the native filters do match nulls. This patch also updates the SQL docs to describe how Boolean logic is handled in addition to how NULL values are handled. Fixes #12856. * Fix test. 2022-08-08 12:08:36 -04:00
			When `druid.generic.useDefaultValueForNull = false`, NULLs are treated more closely to the SQL standard. In this mode,
			`numeric NULL is permitted, and NULLs and empty strings are no longer treated as interchangeable. This property`
			`affects both storage and querying, and must be set on all Druid service types to be available at both ingestion time`
			`and query time. There is some overhead associated with the ability to handle NULLs; see`
Segments doc update (#12344) * Corrected heading levels in segments doc * IMPLY-18394: Updated Segments doc * Update docs/design/segments.md Co-authored-by: Charles Smith <techdocsmith@gmail.com> * Update docs/design/segments.md Co-authored-by: Charles Smith <techdocsmith@gmail.com> * Update docs/design/segments.md Co-authored-by: Charles Smith <techdocsmith@gmail.com> * Update docs/design/segments.md Co-authored-by: Charles Smith <techdocsmith@gmail.com> * Update docs/design/segments.md Co-authored-by: Charles Smith <techdocsmith@gmail.com> * Update docs/design/segments.md Co-authored-by: Charles Smith <techdocsmith@gmail.com> * Update segments.md * Updated links to changed headings in Segments doc * Corrected spelling error * Update segments.md Incorporated suggestions from Paul Rogers. * Update index.md * Update segments.md * Update segments.md * Update segments.md * Update compaction.md * Update docs/design/segments.md fix typo * Update docs/ingestion/compaction.md Co-authored-by: Victoria Lim <vtlim@users.noreply.github.com> * Update docs/design/segments.md Co-authored-by: Victoria Lim <vtlim@users.noreply.github.com> * Update docs/design/segments.md Co-authored-by: Victoria Lim <vtlim@users.noreply.github.com> * Update docs/design/segments.md Co-authored-by: Victoria Lim <vtlim@users.noreply.github.com> * Update docs/design/segments.md Co-authored-by: Victoria Lim <vtlim@users.noreply.github.com> * Update docs/design/segments.md Co-authored-by: Victoria Lim <vtlim@users.noreply.github.com> * Update docs/design/segments.md Co-authored-by: Victoria Lim <vtlim@users.noreply.github.com> * Update docs/design/segments.md Co-authored-by: Victoria Lim <vtlim@users.noreply.github.com> * Update docs/design/segments.md Co-authored-by: Victoria Lim <vtlim@users.noreply.github.com> * Update docs/design/segments.md Co-authored-by: Victoria Lim <vtlim@users.noreply.github.com> * Update docs/design/segments.md Co-authored-by: Victoria Lim <vtlim@users.noreply.github.com> * Update docs/design/segments.md Co-authored-by: Victoria Lim <vtlim@users.noreply.github.com> Co-authored-by: Charles Smith <techdocsmith@gmail.com> Co-authored-by: Victoria Lim <vtlim@users.noreply.github.com> 2022-06-16 16:25:17 -04:00			`the [segment internals](../design/segments.md#handling-null-values) documentation for more details.`
Adjust "in" filter null behavior to match "selector". (#12863) * Adjust "in" filter null behavior to match "selector". Now, both of them match numeric nulls if constructed with a "null" value. This is consistent as far as native execution goes, but doesn't match the behavior of SQL = and IN. So, to address that, this patch also updates the docs to clarify that the native filters do match nulls. This patch also updates the SQL docs to describe how Boolean logic is handled in addition to how NULL values are handled. Fixes #12856. * Fix test. 2022-08-08 12:08:36 -04:00
			`## Boolean logic`

			The [`druid.expressions.useStrictBooleans`](../configuration/index.md#expression-processing-configurations)
			runtime property controls Druid's boolean logic mode. For the most SQL compliant behavior, set this to `true`.

			When `druid.expressions.useStrictBooleans = false` (the default mode), Druid uses two-valued logic.

			When `druid.expressions.useStrictBooleans = true`, Druid uses three-valued logic for
			[expressions](../misc/math-expr.md) evaluation, such as `expression` virtual columns or `expression` filters.
			However, even in this mode, Druid uses two-valued logic for filter types other than `expression`.
basic docs for nested column query functions (#12922) * basic docs for nested column query functions 2022-08-19 20:12:19 -04:00
			`## Nested columns`
Nested columns documentation (#12946) Co-authored-by: Clint Wylie <cjwylie@gmail.com> Co-authored-by: Victoria Lim <vtlim@users.noreply.github.com> Co-authored-by: Charles Smith <techdocsmith@gmail.com> Co-authored-by: brian.le <brian.le@imply.io> 2022-09-06 17:42:18 -04:00
			Druid supports storing nested data structures in segments using the native `COMPLEX<json>` type. See [Nested columns](./nested-columns.md) for more information.

			You can interact with nested data using [JSON functions](./sql-json-functions.md), which can extract nested values, parse from string, serialize to string, and create new `COMPLEX<json>` structures.

			`COMPLEX` types have limited functionality outside the specialized functions that use them, so their behavior is undefined when:

			`* Grouping on complex values.`
			* Filtering directly on complex values, such as `WHERE json is NULL`.
			`* Used as inputs to aggregators without specialized handling for a specific complex type.`

			In many cases, functions are provided to translate `COMPLEX` value types to `STRING`, which serves as a workaround solution until `COMPLEX` type functionality can be improved.