druid/docs/content/querying/timeseriesquery.md

---
layout: doc_page
title: "Timeseries queries"
---

<!--
  ~ Licensed to the Apache Software Foundation (ASF) under one
  ~ or more contributor license agreements.  See the NOTICE file
  ~ distributed with this work for additional information
  ~ regarding copyright ownership.  The ASF licenses this file
  ~ to you under the Apache License, Version 2.0 (the
  ~ "License"); you may not use this file except in compliance
  ~ with the License.  You may obtain a copy of the License at
  ~
  ~   http://www.apache.org/licenses/LICENSE-2.0
  ~
  ~ Unless required by applicable law or agreed to in writing,
  ~ software distributed under the License is distributed on an
  ~ "AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY
  ~ KIND, either express or implied.  See the License for the
  ~ specific language governing permissions and limitations
  ~ under the License.
  -->

# Timeseries queries

These types of queries take a timeseries query object and return an array of JSON objects where each object represents a value asked for by the timeseries query.

An example timeseries query object is shown below:

```json
{
  "queryType": "timeseries",
  "dataSource": "sample_datasource",
  "granularity": "day",
  "descending": "true",
  "filter": {
    "type": "and",
    "fields": [
      { "type": "selector", "dimension": "sample_dimension1", "value": "sample_value1" },
      { "type": "or",
        "fields": [
          { "type": "selector", "dimension": "sample_dimension2", "value": "sample_value2" },
          { "type": "selector", "dimension": "sample_dimension3", "value": "sample_value3" }
        ]
      }
    ]
  },
  "aggregations": [
    { "type": "longSum", "name": "sample_name1", "fieldName": "sample_fieldName1" },
    { "type": "doubleSum", "name": "sample_name2", "fieldName": "sample_fieldName2" }
  ],
  "postAggregations": [
    { "type": "arithmetic",
      "name": "sample_divide",
      "fn": "/",
      "fields": [
        { "type": "fieldAccess", "name": "postAgg__sample_name1", "fieldName": "sample_name1" },
        { "type": "fieldAccess", "name": "postAgg__sample_name2", "fieldName": "sample_name2" }
      ]
    }
  ],
  "intervals": [ "2012-01-01T00:00:00.000/2012-01-03T00:00:00.000" ]
}
```

There are 7 main parts to a timeseries query:

|property|description|required?|
|--------|-----------|---------|
|queryType|This String should always be "timeseries"; this is the first thing Apache Druid (incubating) looks at to figure out how to interpret the query|yes|
|dataSource|A String or Object defining the data source to query, very similar to a table in a relational database. See [DataSource](../querying/datasource.html) for more information.|yes|
|descending|Whether to make descending ordered result. Default is `false`(ascending).|no|
|intervals|A JSON Object representing ISO-8601 Intervals. This defines the time ranges to run the query over.|yes|
|granularity|Defines the granularity to bucket query results. See [Granularities](../querying/granularities.html)|yes|
|filter|See [Filters](../querying/filters.html)|no|
|aggregations|See [Aggregations](../querying/aggregations.html)|no|
|postAggregations|See [Post Aggregations](../querying/post-aggregations.html)|no|
|context|Can be used to modify query behavior, including [grand totals](#grand-totals) and [zero-filling](#zero-filling). See also [Context](../querying/query-context.html) for parameters that apply to all query types.|no|

To pull it all together, the above query would return 2 data points, one for each day between 2012-01-01 and 2012-01-03, from the "sample\_datasource" table. Each data point would be the (long) sum of sample\_fieldName1, the (double) sum of sample\_fieldName2 and the (double) result of sample\_fieldName1 divided by sample\_fieldName2 for the filter set. The output looks like this:

```json
[
  {
    "timestamp": "2012-01-01T00:00:00.000Z",
    "result": { "sample_name1": <some_value>, "sample_name2": <some_value>, "sample_divide": <some_value> } 
  },
  {
    "timestamp": "2012-01-02T00:00:00.000Z",
    "result": { "sample_name1": <some_value>, "sample_name2": <some_value>, "sample_divide": <some_value> }
  }
]
```

#### Grand totals

Druid can include an extra "grand totals" row as the last row of a timeseries result set. To enable this, add
`"grandTotal" : true` to your query context. For example:

```json
{
  "queryType": "timeseries",
  "dataSource": "sample_datasource",
  "intervals": [ "2012-01-01T00:00:00.000/2012-01-03T00:00:00.000" ],
  "granularity": "day",
  "aggregations": [
    { "type": "longSum", "name": "sample_name1", "fieldName": "sample_fieldName1" },
    { "type": "doubleSum", "name": "sample_name2", "fieldName": "sample_fieldName2" }
  ],
  "context": {
    "grandTotal": true
  }
}
```

The grand totals row will appear as the last row in the result array, and will have no timestamp. It will be the last
row even if the query is run in "descending" mode. Post-aggregations in the grand totals row will be computed based
upon the grand total aggregations.

#### Zero-filling

Timeseries queries normally fill empty interior time buckets with zeroes. For example, if you issue a "day" granularity
timeseries query for the interval 2012-01-01/2012-01-04, and no data exists for 2012-01-02, you will receive:

```json
[
  {
    "timestamp": "2012-01-01T00:00:00.000Z",
    "result": { "sample_name1": <some_value> }
  },
  {
   "timestamp": "2012-01-02T00:00:00.000Z",
   "result": { "sample_name1": 0 }
  },
  {
    "timestamp": "2012-01-03T00:00:00.000Z",
    "result": { "sample_name1": <some_value> }
  }
]
```

Time buckets that lie completely outside the data interval are not zero-filled.

You can disable all zero-filling with the context flag "skipEmptyBuckets". In this mode, the data point for 2012-01-02
would be omitted from the results.

A query with this context flag set would look like:

```json
{
  "queryType": "timeseries",
  "dataSource": "sample_datasource",
  "granularity": "day",
  "aggregations": [
    { "type": "longSum", "name": "sample_name1", "fieldName": "sample_fieldName1" }
  ],
  "intervals": [ "2012-01-01T00:00:00.000/2012-01-04T00:00:00.000" ],
  "context" : {
    "skipEmptyBuckets": "true"
  }
}
```
Front Matter header needs to be on the first line for md to be rendered properly by jekyll (#6733) 2018-12-13 14:47:20 -05:00			`---`
			`layout: doc_page`
			`title: "Timeseries queries"`
			`---`

add missing license headers, in particular to MD files; clean up RAT … (#6563) * add missing license headers, in particular to MD files; clean up RAT exclusions * revert inadvertent doc changes * docs * cr changes * fix modified druid-production.svg 2018-11-13 12:38:37 -05:00			`<!--`
			`~ Licensed to the Apache Software Foundation (ASF) under one`
			`~ or more contributor license agreements. See the NOTICE file`
			`~ distributed with this work for additional information`
			`~ regarding copyright ownership. The ASF licenses this file`
			`~ to you under the Apache License, Version 2.0 (the`
			`~ "License"); you may not use this file except in compliance`
			`~ with the License. You may obtain a copy of the License at`
			`~`
			`~ http://www.apache.org/licenses/LICENSE-2.0`
			`~`
			`~ Unless required by applicable law or agreed to in writing,`
			`~ software distributed under the License is distributed on an`
			`~ "AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY`
			`~ KIND, either express or implied. See the License for the`
			`~ specific language governing permissions and limitations`
			`~ under the License.`
			`-->`

Added titles and harmonized docs to improve usability and SEO (#6731) * added titles and harmonized docs * manually fixed some titles 2018-12-12 23:42:12 -05:00			`# Timeseries queries`
renaming all *.md filenames to only have lowercase and dashes so that they are editable on case-insensitive os as well 2015-05-05 17:07:32 -04:00
			`These types of queries take a timeseries query object and return an array of JSON objects where each object represents a value asked for by the timeseries query.`

			`An example timeseries query object is shown below:`

			```json
			`{`
			`"queryType": "timeseries",`
			`"dataSource": "sample_datasource",`
			`"granularity": "day",`
time-descending result of timeseries queries 2015-12-13 20:29:05 -05:00			`"descending": "true",`
renaming all *.md filenames to only have lowercase and dashes so that they are editable on case-insensitive os as well 2015-05-05 17:07:32 -04:00			`"filter": {`
			`"type": "and",`
			`"fields": [`
			`{ "type": "selector", "dimension": "sample_dimension1", "value": "sample_value1" },`
			`{ "type": "or",`
			`"fields": [`
			`{ "type": "selector", "dimension": "sample_dimension2", "value": "sample_value2" },`
			`{ "type": "selector", "dimension": "sample_dimension3", "value": "sample_value3" }`
			`]`
			`}`
			`]`
			`},`
			`"aggregations": [`
			`{ "type": "longSum", "name": "sample_name1", "fieldName": "sample_fieldName1" },`
			`{ "type": "doubleSum", "name": "sample_name2", "fieldName": "sample_fieldName2" }`
			`],`
			`"postAggregations": [`
			`{ "type": "arithmetic",`
			`"name": "sample_divide",`
			`"fn": "/",`
			`"fields": [`
postAgg filedName must match name of AGG (#2874) 2016-04-22 14:11:54 -04:00			`{ "type": "fieldAccess", "name": "postAgg__sample_name1", "fieldName": "sample_name1" },`
			`{ "type": "fieldAccess", "name": "postAgg__sample_name2", "fieldName": "sample_name2" }`
renaming all *.md filenames to only have lowercase and dashes so that they are editable on case-insensitive os as well 2015-05-05 17:07:32 -04:00			`]`
			`}`
			`],`
			`"intervals": [ "2012-01-01T00:00:00.000/2012-01-03T00:00:00.000" ]`
			`}`
			```

			`There are 7 main parts to a timeseries query:`

			`\|property\|description\|required?\|`
			`\|--------\|-----------\|---------\|`
Add more Apache branding to docs (#7515) 2019-04-19 18:52:26 -04:00			`\|queryType\|This String should always be "timeseries"; this is the first thing Apache Druid (incubating) looks at to figure out how to interpret the query\|yes\|`
renaming all *.md filenames to only have lowercase and dashes so that they are editable on case-insensitive os as well 2015-05-05 17:07:32 -04:00			`\|dataSource\|A String or Object defining the data source to query, very similar to a table in a relational database. See [DataSource](../querying/datasource.html) for more information.\|yes\|`
time-descending result of select queries 2016-01-15 02:23:07 -05:00			\|descending\|Whether to make descending ordered result. Default is `false`(ascending).\|no\|
renaming all *.md filenames to only have lowercase and dashes so that they are editable on case-insensitive os as well 2015-05-05 17:07:32 -04:00			`\|intervals\|A JSON Object representing ISO-8601 Intervals. This defines the time ranges to run the query over.\|yes\|`
			`\|granularity\|Defines the granularity to bucket query results. See [Granularities](../querying/granularities.html)\|yes\|`
			`\|filter\|See [Filters](../querying/filters.html)\|no\|`
Allow queries with no aggregators. (#3216) This is actually reasonable for a groupBy or lexicographic topNs that is being used to do a "COUNT DISTINCT" kind of query. No aggregators are needed for that query, and including a dummy aggregator wastes 8 bytes per row. It's kind of silly for timeseries, but why not. 2016-07-06 11:08:54 -04:00			`\|aggregations\|See [Aggregations](../querying/aggregations.html)\|no\|`
renaming all *.md filenames to only have lowercase and dashes so that they are editable on case-insensitive os as well 2015-05-05 17:07:32 -04:00			`\|postAggregations\|See [Post Aggregations](../querying/post-aggregations.html)\|no\|`
Timeseries: Add "grandTotal" option. (#5640) * Timeseries: Add "grandTotal" option. * Modify whitespace. * Checkstyle workaround. 2018-04-16 21:22:19 -04:00			`\|context\|Can be used to modify query behavior, including [grand totals](#grand-totals) and [zero-filling](#zero-filling). See also [Context](../querying/query-context.html) for parameters that apply to all query types.\|no\|`
renaming all *.md filenames to only have lowercase and dashes so that they are editable on case-insensitive os as well 2015-05-05 17:07:32 -04:00
minor fixes to TimeseriesQuery and Aggregations 2015-06-24 09:24:07 -04:00			`To pull it all together, the above query would return 2 data points, one for each day between 2012-01-01 and 2012-01-03, from the "sample\_datasource" table. Each data point would be the (long) sum of sample\_fieldName1, the (double) sum of sample\_fieldName2 and the (double) result of sample\_fieldName1 divided by sample\_fieldName2 for the filter set. The output looks like this:`
renaming all *.md filenames to only have lowercase and dashes so that they are editable on case-insensitive os as well 2015-05-05 17:07:32 -04:00
			```json
			`[`
			`{`
			`"timestamp": "2012-01-01T00:00:00.000Z",`
			`"result": { "sample_name1": <some_value>, "sample_name2": <some_value>, "sample_divide": <some_value> }`
			`},`
			`{`
			`"timestamp": "2012-01-02T00:00:00.000Z",`
			`"result": { "sample_name1": <some_value>, "sample_name2": <some_value>, "sample_divide": <some_value> }`
			`}`
			`]`
			```
Timeseries skipEmptyBucket docs. 2015-09-02 17:52:52 -04:00
Timeseries: Add "grandTotal" option. (#5640) * Timeseries: Add "grandTotal" option. * Modify whitespace. * Checkstyle workaround. 2018-04-16 21:22:19 -04:00			`#### Grand totals`

			`Druid can include an extra "grand totals" row as the last row of a timeseries result set. To enable this, add`
			`"grandTotal" : true` to your query context. For example:

			```json
			`{`
			`"queryType": "timeseries",`
			`"dataSource": "sample_datasource",`
			`"intervals": [ "2012-01-01T00:00:00.000/2012-01-03T00:00:00.000" ],`
			`"granularity": "day",`
			`"aggregations": [`
			`{ "type": "longSum", "name": "sample_name1", "fieldName": "sample_fieldName1" },`
			`{ "type": "doubleSum", "name": "sample_name2", "fieldName": "sample_fieldName2" }`
			`],`
			`"context": {`
			`"grandTotal": true`
			`}`
			`}`
			```

			`The grand totals row will appear as the last row in the result array, and will have no timestamp. It will be the last`
			`row even if the query is run in "descending" mode. Post-aggregations in the grand totals row will be computed based`
			`upon the grand total aggregations.`

Timeseries skipEmptyBucket docs. 2015-09-02 17:52:52 -04:00			`#### Zero-filling`

			`Timeseries queries normally fill empty interior time buckets with zeroes. For example, if you issue a "day" granularity`
			`timeseries query for the interval 2012-01-01/2012-01-04, and no data exists for 2012-01-02, you will receive:`

			```json
			`[`
			`{`
			`"timestamp": "2012-01-01T00:00:00.000Z",`
			`"result": { "sample_name1": <some_value> }`
			`},`
			`{`
			`"timestamp": "2012-01-02T00:00:00.000Z",`
			`"result": { "sample_name1": 0 }`
			`},`
			`{`
			`"timestamp": "2012-01-03T00:00:00.000Z",`
			`"result": { "sample_name1": <some_value> }`
			`}`
			`]`
			```

			`Time buckets that lie completely outside the data interval are not zero-filled.`

			`You can disable all zero-filling with the context flag "skipEmptyBuckets". In this mode, the data point for 2012-01-02`
			`would be omitted from the results.`

			`A query with this context flag set would look like:`

			```json
			`{`
			`"queryType": "timeseries",`
			`"dataSource": "sample_datasource",`
			`"granularity": "day",`
			`"aggregations": [`
			`{ "type": "longSum", "name": "sample_name1", "fieldName": "sample_fieldName1" }`
			`],`
			`"intervals": [ "2012-01-01T00:00:00.000/2012-01-04T00:00:00.000" ],`
			`"context" : {`
			`"skipEmptyBuckets": "true"`
			`}`
			`}`
			```