druid/docs/content/SegmentMetadataQuery.md

---
layout: doc_page
---
# Segment Metadata Queries
Segment metadata queries return per segment information about:

* Cardinality of all columns in the segment
* Estimated byte size for the segment columns in TSV format
* Interval the segment covers
* Column type of all the columns in the segment
* Estimated total segment byte size in TSV format
* Segment id

```json
{
  "queryType":"segmentMetadata",
  "dataSource":"sample_datasource",
  "intervals":["2013-01-01/2014-01-01"]
}
```

There are several main parts to a segment metadata query:

|property|description|required?|
|--------|-----------|---------|
|queryType|This String should always be "segmentMetadata"; this is the first thing Druid looks at to figure out how to interpret the query|yes|
|dataSource|A String defining the data source to query, very similar to a table in a relational database|yes|
|intervals|A JSON Object representing ISO-8601 Intervals. This defines the time ranges to run the query over.|yes|
|toInclude|A JSON Object representing what columns should be included in the result. Defaults to "all".|no|
|merge|Merge all individual segment metadata results into a single result|no|
|context|An additional JSON Object which can be used to specify certain flags.|no|

The format of the result is:

```json
[ {
  "id" : "some_id",
  "intervals" : [ "2013-05-13T00:00:00.000Z/2013-05-14T00:00:00.000Z" ],
  "columns" : {
    "__time" : { "type" : "LONG", "size" : 407240380, "cardinality" : null },
    "dim1" : { "type" : "STRING", "size" : 100000, "cardinality" : 1944 },
    "dim2" : { "type" : "STRING", "size" : 100000, "cardinality" : 1504 },
    "metric1" : { "type" : "FLOAT", "size" : 100000, "cardinality" : null }
  },
  "size" : 300000
} ]
```

Dimension columns will have type `STRING`.  
Metric columns will have type `FLOAT`.  
Timestamp column will have type `LONG`.

Only columns which are dimensions (ie, have type `STRING`) will have any cardinality. Rest of the columns (timestamp and metric columns) will show cardinality as `null`.

### toInclude

There are 3 types of toInclude objects.

#### All

The grammar is as follows:

``` json
"toInclude": { "type": "all"}
```

#### None

The grammar is as follows:

``` json
"toInclude": { "type": "none"}
```

#### List

The grammar is as follows:

``` json
"toInclude": { "type": "list", "columns": [<string list of column names>]}
```
Added prepend tag to make pages display. 2013-09-16 17:49:36 -04:00			`---`
Docs working 2013-09-26 19:22:28 -04:00			`layout: doc_page`
Added prepend tag to make pages display. 2013-09-16 17:49:36 -04:00			`---`
added titles; fixed a few typos 2014-01-16 18:37:07 -05:00			`# Segment Metadata Queries`
Add docs from github wiki 2013-09-13 18:20:39 -04:00			`Segment metadata queries return per segment information about:`

Finish converting docs over to something that displays properly 2013-09-27 20:08:34 -04:00			`* Cardinality of all columns in the segment`
			`* Estimated byte size for the segment columns in TSV format`
			`* Interval the segment covers`
			`* Column type of all the columns in the segment`
			`* Estimated total segment byte size in TSV format`
			`* Segment id`

			```json
			`{`
			`"queryType":"segmentMetadata",`
			`"dataSource":"sample_datasource",`
Removed trailing comma 2014-01-28 10:34:13 -05:00			`"intervals":["2013-01-01/2014-01-01"]`
Finish converting docs over to something that displays properly 2013-09-27 20:08:34 -04:00			`}`
			```
Add docs from github wiki 2013-09-13 18:20:39 -04:00
			`There are several main parts to a segment metadata query:`

			`\|property\|description\|required?\|`
			`\|--------\|-----------\|---------\|`
Finish converting docs over to something that displays properly 2013-09-27 20:08:34 -04:00			`\|queryType\|This String should always be "segmentMetadata"; this is the first thing Druid looks at to figure out how to interpret the query\|yes\|`
Add docs from github wiki 2013-09-13 18:20:39 -04:00			`\|dataSource\|A String defining the data source to query, very similar to a table in a relational database\|yes\|`
			`\|intervals\|A JSON Object representing ISO-8601 Intervals. This defines the time ranges to run the query over.\|yes\|`
add more segment metadata docs 2014-05-29 15:53:13 -04:00			`\|toInclude\|A JSON Object representing what columns should be included in the result. Defaults to "all".\|no\|`
Add docs from github wiki 2013-09-13 18:20:39 -04:00			`\|merge\|Merge all individual segment metadata results into a single result\|no\|`
			`\|context\|An additional JSON Object which can be used to specify certain flags.\|no\|`

			`The format of the result is:`

Finish converting docs over to something that displays properly 2013-09-27 20:08:34 -04:00			```json
			`[ {`
			`"id" : "some_id",`
			`"intervals" : [ "2013-05-13T00:00:00.000Z/2013-05-14T00:00:00.000Z" ],`
			`"columns" : {`
			`"__time" : { "type" : "LONG", "size" : 407240380, "cardinality" : null },`
			`"dim1" : { "type" : "STRING", "size" : 100000, "cardinality" : 1944 },`
			`"dim2" : { "type" : "STRING", "size" : 100000, "cardinality" : 1504 },`
			`"metric1" : { "type" : "FLOAT", "size" : 100000, "cardinality" : null }`
			`},`
			`"size" : 300000`
			`} ]`
			```
add more segment metadata docs 2014-05-29 15:53:13 -04:00
segment metadata result explained better 2014-09-16 18:56:15 -04:00			Dimension columns will have type `STRING`.
			Metric columns will have type `FLOAT`.
			Timestamp column will have type `LONG`.

			Only columns which are dimensions (ie, have type `STRING`) will have any cardinality. Rest of the columns (timestamp and metric columns) will show cardinality as `null`.

add more segment metadata docs 2014-05-29 15:53:13 -04:00			`### toInclude`

			`There are 3 types of toInclude objects.`

			`#### All`

			`The grammar is as follows:`

			``` json
			`"toInclude": { "type": "all"}`
			```

			`#### None`

			`The grammar is as follows:`

			``` json
			`"toInclude": { "type": "none"}`
			```

			`#### List`

			`The grammar is as follows:`

			``` json
			`"toInclude": { "type": "list", "columns": [<string list of column names>]}`
segment metadata result explained better 2014-09-16 18:56:15 -04:00			```