---
id: multi-value-dimensions
title: "Multi-value dimensions"
---
<!--
~ Licensed to the Apache Software Foundation (ASF) under one
~ or more contributor license agreements. See the NOTICE file
~ distributed with this work for additional information
~ regarding copyright ownership. The ASF licenses this file
~ to you under the Apache License, Version 2.0 (the
~ "License"); you may not use this file except in compliance
~ with the License. You may obtain a copy of the License at
~
~ http://www.apache.org/licenses/LICENSE-2.0
~
~ Unless required by applicable law or agreed to in writing,
~ software distributed under the License is distributed on an
~ "AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY
~ KIND, either express or implied. See the License for the
~ specific language governing permissions and limitations
~ under the License.
-->
Apache Druid supports "multi-value" string dimensions. These are generated when an input field contains an
array of values instead of a single value (e.g. JSON arrays, or a TSV field containing one or more `listDelimiter`
characters). By default, Druid ingests the values in alphabetical order; see
[Dimension Objects](../ingestion/index.md#dimension-objects) for configuration.
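For example, with delimiter-separated input data, the `listDelimiter` option of the TSV `inputFormat` controls how a
single field is split into multiple values: a `tags` field containing `t1|t2|t3` would become a single row whose
`tags` dimension holds three values. Below is a minimal sketch of such an input format; the column names and the `|`
delimiter are illustrative, and the full set of options is described in the ingestion documentation.

```json
{
  "type": "tsv",
  "columns": ["timestamp", "tags"],
  "listDelimiter": "|",
  "findColumnsFromHeader": false
}
```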
This document describes the behavior of groupBy (topN has similar behavior) queries on multi-value dimensions when they
are used as a dimension being grouped by. See the section on multi-value columns in
[segments](../design/segments.md#multi-value-columns) for internal representation details. Examples in this document
are in the form of [native Druid queries](querying.md). Refer to the [Druid SQL documentation](sql.md) for details
about using multi-value string dimensions in SQL.
## Querying multi-value dimensions
Suppose you have a dataSource with a segment that contains the following rows, with a multi-value dimension
called `tags`.
```
{"timestamp": "2011-01-12T00:00:00.000Z", "tags": ["t1","t2","t3"]} #row1
{"timestamp": "2011-01-13T00:00:00.000Z", "tags": ["t3","t4","t5"]} #row2
{"timestamp": "2011-01-14T00:00:00.000Z", "tags": ["t5","t6","t7"]} #row3
{"timestamp": "2011-01-14T00:00:00.000Z", "tags": []} #row4
```
### Filtering
All query types, as well as [filtered aggregators](aggregations.md#filtered-aggregator), can filter on multi-value
dimensions. Filters follow these rules on multi-value dimensions:
- Value filters (like "selector", "bound", and "in") match a row if any of the values of a multi-value dimension match
  the filter.
- The Column Comparison filter will match a row if the dimensions have any overlap.
- Value filters that match `null` or `""` (empty string) will match empty cells in a multi-value dimension.
- Logical expression filters behave the same way they do on single-value dimensions: "and" matches a row if all
  underlying filters match that row; "or" matches a row if any underlying filters match that row; "not" matches a row
  if the underlying filter does not match the row.

For example, this "or" filter would match row1 and row2 of the dataset above, but not row3:
```json
{
"type": "or",
"fields": [
{
"type": "selector",
"dimension": "tags",
"value": "t1"
},
{
"type": "selector",
"dimension": "tags",
"value": "t3"
}
]
}
```
This "and" filter would match only row1 of the dataset above:
```json
{
"type": "and",
"fields": [
{
"type": "selector",
"dimension": "tags",
"value": "t1"
},
{
"type": "selector",
"dimension": "tags",
"value": "t3"
}
]
}
```
This "selector" filter would match row4 of the dataset above:
```json
{
"type": "selector",
"dimension": "tags",
"value": null
}
```
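The "in" filter behaves the same way: it matches a row if any value of the multi-value dimension appears in its list.
For example, this "in" filter would match row1 (via "t1") and row2 (via "t4"), but not row3 or row4:

```json
{
  "type": "in",
  "dimension": "tags",
  "values": ["t1", "t4"]
}
```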
### Grouping
topN and groupBy queries can group on multi-value dimensions. When grouping on a multi-value dimension, _all_ values
from matching rows will be used to generate one group per value. This can be thought of as equivalent to the `UNNEST`
operator used on an `ARRAY` type that many SQL dialects support. This means it's possible for a query to return more
groups than there are rows. For example, a topN on the dimension `tags` with filter `"t1" AND "t3"` would match only
row1, and generate a result with three groups: `t1`, `t2`, and `t3`. If you only need to include values that match
your filter, you can use a [filtered dimensionSpec](dimensionspecs.md#filtered-dimensionspecs). This can also
improve performance.
### Example: GroupBy query with no filtering
See [GroupBy querying](groupbyquery.md) for details.
```json
{
"queryType": "groupBy",
"dataSource": "test",
"intervals": [
"1970-01-01T00:00:00.000Z/3000-01-01T00:00:00.000Z"
],
"granularity": {
"type": "all"
},
"dimensions": [
{
"type": "default",
"dimension": "tags",
"outputName": "tags"
}
],
"aggregations": [
{
"type": "count",
"name": "count"
}
]
}
```
This query returns the following result:
```json
[
{
"timestamp": "1970-01-01T00:00:00.000Z",
"event": {
"count": 1,
"tags": "t1"
}
},
{
"timestamp": "1970-01-01T00:00:00.000Z",
"event": {
"count": 1,
"tags": "t2"
}
},
{
"timestamp": "1970-01-01T00:00:00.000Z",
"event": {
"count": 2,
"tags": "t3"
}
},
{
"timestamp": "1970-01-01T00:00:00.000Z",
"event": {
"count": 1,
"tags": "t4"
}
},
{
"timestamp": "1970-01-01T00:00:00.000Z",
"event": {
"count": 2,
"tags": "t5"
}
},
{
"timestamp": "1970-01-01T00:00:00.000Z",
"event": {
"count": 1,
"tags": "t6"
}
},
{
"timestamp": "1970-01-01T00:00:00.000Z",
"event": {
"count": 1,
"tags": "t7"
}
}
]
```
Notice how the original rows are "exploded" into multiple rows and then merged.
### Example: GroupBy query with a selector query filter
See [query filters](filters.md) for details of the selector query filter.
```json
{
"queryType": "groupBy",
"dataSource": "test",
"intervals": [
"1970-01-01T00:00:00.000Z/3000-01-01T00:00:00.000Z"
],
"filter": {
"type": "selector",
"dimension": "tags",
"value": "t3"
},
"granularity": {
"type": "all"
},
"dimensions": [
{
"type": "default",
"dimension": "tags",
"outputName": "tags"
}
],
"aggregations": [
{
"type": "count",
"name": "count"
}
]
}
```
This query returns the following result:
```json
[
{
"timestamp": "1970-01-01T00:00:00.000Z",
"event": {
"count": 1,
"tags": "t1"
}
},
{
"timestamp": "1970-01-01T00:00:00.000Z",
"event": {
"count": 1,
"tags": "t2"
}
},
{
"timestamp": "1970-01-01T00:00:00.000Z",
"event": {
"count": 2,
"tags": "t3"
}
},
{
"timestamp": "1970-01-01T00:00:00.000Z",
"event": {
"count": 1,
"tags": "t4"
}
},
{
"timestamp": "1970-01-01T00:00:00.000Z",
"event": {
"count": 1,
"tags": "t5"
}
}
]
```
You might be surprised to see "t1", "t2", "t4", and "t5" included in the results. This happens because the query filter
is applied to the row before it is exploded. For multi-value dimensions, the selector filter for "t3" matches row1 and
row2, and the explosion happens afterward. In general, a query filter matches a row if any individual value inside the
multi-value dimension matches the filter.
### Example: GroupBy query with a selector query filter and additional filter in "dimensions" attributes
To solve the problem above and return only values for "t3", you would have to use a "filtered dimensionSpec", as in
the query below.
See the section on filtered dimensionSpecs in [dimensionSpecs](dimensionspecs.md#filtered-dimensionspecs) for details.
```json
{
"queryType": "groupBy",
"dataSource": "test",
"intervals": [
"1970-01-01T00:00:00.000Z/3000-01-01T00:00:00.000Z"
],
"filter": {
"type": "selector",
"dimension": "tags",
"value": "t3"
},
"granularity": {
"type": "all"
},
"dimensions": [
{
"type": "listFiltered",
"delegate": {
"type": "default",
"dimension": "tags",
"outputName": "tags"
},
"values": ["t3"]
}
],
"aggregations": [
{
"type": "count",
"name": "count"
}
]
}
```
This query returns the following result:
```json
[
{
"timestamp": "1970-01-01T00:00:00.000Z",
"event": {
"count": 2,
"tags": "t3"
}
}
]
```
Note that, for groupBy queries, you could get a similar result with a [having spec](having.md), but using a filtered
dimensionSpec is much more efficient because it is applied at the lowest level of the query processing pipeline.
Having specs are applied at the outermost level of groupBy query processing.
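For comparison, here is a sketch of the having-spec alternative, using a "dimSelector" having spec (see
[having spec](having.md) for the available having spec types). It produces the same single group for "t3", but the
unwanted groups are generated first and only discarded at the end of query processing.

```json
{
  "queryType": "groupBy",
  "dataSource": "test",
  "intervals": [
    "1970-01-01T00:00:00.000Z/3000-01-01T00:00:00.000Z"
  ],
  "filter": {
    "type": "selector",
    "dimension": "tags",
    "value": "t3"
  },
  "granularity": {
    "type": "all"
  },
  "dimensions": [
    {
      "type": "default",
      "dimension": "tags",
      "outputName": "tags"
    }
  ],
  "aggregations": [
    {
      "type": "count",
      "name": "count"
    }
  ],
  "having": {
    "type": "dimSelector",
    "dimension": "tags",
    "value": "t3"
  }
}
```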