From 9be2a5cdc2a5fbae5d6e38885f75e9f674da1e8f Mon Sep 17 00:00:00 2001 From: Lasse Krogh Mammen Date: Fri, 7 May 2021 09:12:32 +0100 Subject: [PATCH] Add documentation re alphabetical sorted of MV dimensions (#10695) --- docs/ingestion/index.md | 1 + docs/querying/multi-value-dimensions.md | 2 +- website/.spelling | 1 + 3 files changed, 3 insertions(+), 1 deletion(-) diff --git a/docs/ingestion/index.md b/docs/ingestion/index.md index 2dc903ba43b..adcfe0978d7 100644 --- a/docs/ingestion/index.md +++ b/docs/ingestion/index.md @@ -479,6 +479,7 @@ Dimension objects can have the following components: | type | Either `string`, `long`, `float`, or `double`. | `string` | | name | The name of the dimension. This will be used as the field name to read from input records, as well as the column name stored in generated segments.

Note that you can use a [`transformSpec`](#transformspec) if you want to rename columns during ingestion time. | none (required) | | createBitmapIndex | For `string` typed dimensions, whether or not bitmap indexes should be created for the column in generated segments. Creating a bitmap index requires more storage, but speeds up certain kinds of filtering (especially equality and prefix filtering). Only supported for `string` typed dimensions. | `true` | +| multiValueHandling | Specify the type of handling for [multi-value fields](../querying/multi-value-dimensions.md). Possible values are `sorted_array`, `sorted_set`, and `array`. `sorted_array` and `sorted_set` order the array upon ingestion. `sorted_set` removes duplicates. `array` ingests data as-is | `sorted_array` | #### Inclusions and exclusions diff --git a/docs/querying/multi-value-dimensions.md b/docs/querying/multi-value-dimensions.md index 2926091dab3..09d319b31b5 100644 --- a/docs/querying/multi-value-dimensions.md +++ b/docs/querying/multi-value-dimensions.md @@ -25,7 +25,7 @@ title: "Multi-value dimensions" Apache Druid supports "multi-value" string dimensions. These are generated when an input field contains an array of values instead of a single value (e.g. JSON arrays, or a TSV field containing one or more `listDelimiter` -characters). +characters). By default Druid ingests the values in alphabetical order, see [Dimension Objects](../ingestion/index.md#dimension-objects) for configuration. This document describes the behavior of groupBy (topN has similar behavior) queries on multi-value dimensions when they are used as a dimension being grouped by. See the section on multi-value columns in diff --git a/website/.spelling b/website/.spelling index d987eee224f..bce45ee6ec0 100644 --- a/website/.spelling +++ b/website/.spelling @@ -1880,3 +1880,4 @@ TiB PiB protobuf Golang +multiValueHandling \ No newline at end of file