druid/docs/querying/granularities.md

---
id: granularities
title: "Query granularities"
sidebar_label: "Granularities"
---

<!--
  ~ Licensed to the Apache Software Foundation (ASF) under one
  ~ or more contributor license agreements.  See the NOTICE file
  ~ distributed with this work for additional information
  ~ regarding copyright ownership.  The ASF licenses this file
  ~ to you under the Apache License, Version 2.0 (the
  ~ "License"); you may not use this file except in compliance
  ~ with the License.  You may obtain a copy of the License at
  ~
  ~   http://www.apache.org/licenses/LICENSE-2.0
  ~
  ~ Unless required by applicable law or agreed to in writing,
  ~ software distributed under the License is distributed on an
  ~ "AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY
  ~ KIND, either express or implied.  See the License for the
  ~ specific language governing permissions and limitations
  ~ under the License.
  -->

> Apache Druid supports two query languages: [Druid SQL](sql.md) and [native queries](querying.md).
> This document describes the native
> language. For information about time functions available in SQL, refer to the
> [SQL documentation](sql.md#time-functions).

Granularity determines how to bucket data across the time dimension, or how to aggregate data by hour, day, minute, etc.

For example, use time granularities in [native queries](querying.md) to bucket results by time, and in the `dataSchema` \\ [`granularitySpec`](../ingestion/ingestion-spec.md#granularityspec) section of ingestion specifications to segment incoming data.

You can specify a time period as a [simple](#simple-granularities) string, as a [duration](#duration-granularities) in milliseconds, or as an arbitrary ISO8601 [period](#period-granularities).

### Simple Granularities

Simple granularities are specified as a string and bucket timestamps by their UTC time (e.g., days start at 00:00 UTC).

Supported granularity strings are: `all`, `none`, `second`, `minute`, `fifteen_minute`, `thirty_minute`, `hour`, `day`, `week`, `month`, `quarter` and `year`.

* `all` buckets everything into a single bucket
* `none` does not bucket data (it actually uses the granularity of the index - minimum here is `none` which means millisecond granularity). Using `none` in a [TimeseriesQuery](../querying/timeseriesquery.md) is currently not recommended (the system will try to generate 0 values for all milliseconds that didn’t exist, which is often a lot).

#### Example:

Suppose you have data below stored in Apache Druid with millisecond ingestion granularity,

``` json
{"timestamp": "2013-08-31T01:02:33Z", "page": "AAA", "language" : "en"}
{"timestamp": "2013-09-01T01:02:33Z", "page": "BBB", "language" : "en"}
{"timestamp": "2013-09-02T23:32:45Z", "page": "CCC", "language" : "en"}
{"timestamp": "2013-09-03T03:32:45Z", "page": "DDD", "language" : "en"}
```

After submitting a groupBy query with `hour` granularity,

``` json
{
   "queryType":"groupBy",
   "dataSource":"my_dataSource",
   "granularity":"hour",
   "dimensions":[
      "language"
   ],
   "aggregations":[
      {
         "type":"count",
         "name":"count"
      }
   ],
   "intervals":[
      "2000-01-01T00:00Z/3000-01-01T00:00Z"
   ]
}
```

you will get

``` json
[ {
  "version" : "v1",
  "timestamp" : "2013-08-31T01:00:00.000Z",
  "event" : {
    "count" : 1,
    "language" : "en"
  }
}, {
  "version" : "v1",
  "timestamp" : "2013-09-01T01:00:00.000Z",
  "event" : {
    "count" : 1,
    "language" : "en"
  }
}, {
  "version" : "v1",
  "timestamp" : "2013-09-02T23:00:00.000Z",
  "event" : {
    "count" : 1,
    "language" : "en"
  }
}, {
  "version" : "v1",
  "timestamp" : "2013-09-03T03:00:00.000Z",
  "event" : {
    "count" : 1,
    "language" : "en"
  }
} ]
```

Note that all the empty buckets are discarded.


If you change the granularity to `day`, you will get

``` json
[ {
  "version" : "v1",
  "timestamp" : "2013-08-31T00:00:00.000Z",
  "event" : {
    "count" : 1,
    "language" : "en"
  }
}, {
  "version" : "v1",
  "timestamp" : "2013-09-01T00:00:00.000Z",
  "event" : {
    "count" : 1,
    "language" : "en"
  }
}, {
  "version" : "v1",
  "timestamp" : "2013-09-02T00:00:00.000Z",
  "event" : {
    "count" : 1,
    "language" : "en"
  }
}, {
  "version" : "v1",
  "timestamp" : "2013-09-03T00:00:00.000Z",
  "event" : {
    "count" : 1,
    "language" : "en"
  }
} ]
```


If you change the granularity to `none`, you will get the same results as setting it to the ingestion granularity.

``` json
[ {
  "version" : "v1",
  "timestamp" : "2013-08-31T01:02:33.000Z",
  "event" : {
    "count" : 1,
    "language" : "en"
  }
}, {
  "version" : "v1",
  "timestamp" : "2013-09-01T01:02:33.000Z",
  "event" : {
    "count" : 1,
    "language" : "en"
  }
}, {
  "version" : "v1",
  "timestamp" : "2013-09-02T23:32:45.000Z",
  "event" : {
    "count" : 1,
    "language" : "en"
  }
}, {
  "version" : "v1",
  "timestamp" : "2013-09-03T03:32:45.000Z",
  "event" : {
    "count" : 1,
    "language" : "en"
  }
} ]
```

Having a query time `granularity` that is smaller than the `queryGranularity` parameter set at
[ingestion time](../ingestion/ingestion-spec.md#granularityspec) is unreasonable because information about that
smaller granularity is not present in the indexed data. So, if the query time granularity is smaller than the ingestion
time query granularity, Druid produces results that are equivalent to having set `granularity` to `queryGranularity`.


If you change the granularity to `all`, you will get everything aggregated in 1 bucket,

``` json
[ {
  "version" : "v1",
  "timestamp" : "2000-01-01T00:00:00.000Z",
  "event" : {
    "count" : 4,
    "language" : "en"
  }
} ]
```


### Duration Granularities

Duration granularities are specified as an exact duration in milliseconds and timestamps are returned as UTC. Duration granularity values are in millis.

They also support specifying an optional origin, which defines where to start counting time buckets from (defaults to 1970-01-01T00:00:00Z).

```javascript
{"type": "duration", "duration": 7200000}
```

This chunks up every 2 hours.

```javascript
{"type": "duration", "duration": 3600000, "origin": "2012-01-01T00:30:00Z"}
```

This chunks up every hour on the half-hour.

#### Example:

Reusing the data in the previous example, after submitting a groupBy query with 24 hours duration,

``` json
{
   "queryType":"groupBy",
   "dataSource":"my_dataSource",
   "granularity":{"type": "duration", "duration": "86400000"},
   "dimensions":[
      "language"
   ],
   "aggregations":[
      {
         "type":"count",
         "name":"count"
      }
   ],
   "intervals":[
      "2000-01-01T00:00Z/3000-01-01T00:00Z"
   ]
}
```

you will get

``` json
[ {
  "version" : "v1",
  "timestamp" : "2013-08-31T00:00:00.000Z",
  "event" : {
    "count" : 1,
    "language" : "en"
  }
}, {
  "version" : "v1",
  "timestamp" : "2013-09-01T00:00:00.000Z",
  "event" : {
    "count" : 1,
    "language" : "en"
  }
}, {
  "version" : "v1",
  "timestamp" : "2013-09-02T00:00:00.000Z",
  "event" : {
    "count" : 1,
    "language" : "en"
  }
}, {
  "version" : "v1",
  "timestamp" : "2013-09-03T00:00:00.000Z",
  "event" : {
    "count" : 1,
    "language" : "en"
  }
} ]
```

if you set the origin for the granularity to `2012-01-01T00:30:00Z`,

``` javascript
   "granularity":{"type": "duration", "duration": "86400000", "origin":"2012-01-01T00:30:00Z"}
```

you will get

``` json
[ {
  "version" : "v1",
  "timestamp" : "2013-08-31T00:30:00.000Z",
  "event" : {
    "count" : 1,
    "language" : "en"
  }
}, {
  "version" : "v1",
  "timestamp" : "2013-09-01T00:30:00.000Z",
  "event" : {
    "count" : 1,
    "language" : "en"
  }
}, {
  "version" : "v1",
  "timestamp" : "2013-09-02T00:30:00.000Z",
  "event" : {
    "count" : 1,
    "language" : "en"
  }
}, {
  "version" : "v1",
  "timestamp" : "2013-09-03T00:30:00.000Z",
  "event" : {
    "count" : 1,
    "language" : "en"
  }
} ]
```

Note that the timestamp for each bucket starts at the 30th minute.

### Period Granularities

Period granularities are specified as arbitrary period combinations of years, months, weeks, hours, minutes and seconds (e.g. P2W, P3M, PT1H30M, PT0.750S) in [ISO8601](https://en.wikipedia.org/wiki/ISO_8601) format. They support specifying a time zone which determines where period boundaries start as well as the timezone of the returned timestamps. By default, years start on the first of January, months start on the first of the month and weeks start on Mondays unless an origin is specified.

Time zone is optional (defaults to UTC). Origin is optional (defaults to 1970-01-01T00:00:00 in the given time zone).

```javascript
{"type": "period", "period": "P2D", "timeZone": "America/Los_Angeles"}
```

This will bucket by two-day chunks in the Pacific timezone.

```javascript
{"type": "period", "period": "P3M", "timeZone": "America/Los_Angeles", "origin": "2012-02-01T00:00:00-08:00"}
```

This will bucket by 3-month chunks in the Pacific timezone where the three-month quarters are defined as starting from February.

#### Example

Reusing the data in the previous example, if you submit a groupBy query with 1 day period in Pacific timezone,

``` json
{
   "queryType":"groupBy",
   "dataSource":"my_dataSource",
   "granularity":{"type": "period", "period": "P1D", "timeZone": "America/Los_Angeles"},
   "dimensions":[
      "language"
   ],
   "aggregations":[
      {
         "type":"count",
         "name":"count"
      }
   ],
   "intervals":[
      "1999-12-31T16:00:00.000-08:00/2999-12-31T16:00:00.000-08:00"
   ]
}
```

you will get

``` json
[ {
  "version" : "v1",
  "timestamp" : "2013-08-30T00:00:00.000-07:00",
  "event" : {
    "count" : 1,
    "language" : "en"
  }
}, {
  "version" : "v1",
  "timestamp" : "2013-08-31T00:00:00.000-07:00",
  "event" : {
    "count" : 1,
    "language" : "en"
  }
}, {
  "version" : "v1",
  "timestamp" : "2013-09-02T00:00:00.000-07:00",
  "event" : {
    "count" : 2,
    "language" : "en"
  }
} ]
```

Note that the timestamp for each bucket has been converted to Pacific time. Row `{"timestamp": "2013-09-02T23:32:45Z", "page": "CCC", "language" : "en"}` and
`{"timestamp": "2013-09-03T03:32:45Z", "page": "DDD", "language" : "en"}` are put in the same bucket because they are in the same day in Pacific time.

Also note that the `intervals` in groupBy query will not be converted to the timezone specified, the timezone specified in granularity is only applied on the
query results.

If you set the origin for the granularity to `1970-01-01T20:30:00-08:00`,

``` javascript
   "granularity":{"type": "period", "period": "P1D", "timeZone": "America/Los_Angeles", "origin": "1970-01-01T20:30:00-08:00"}
```

you will get

``` json
[ {
  "version" : "v1",
  "timestamp" : "2013-08-29T20:30:00.000-07:00",
  "event" : {
    "count" : 1,
    "language" : "en"
  }
}, {
  "version" : "v1",
  "timestamp" : "2013-08-30T20:30:00.000-07:00",
  "event" : {
    "count" : 1,
    "language" : "en"
  }
}, {
  "version" : "v1",
  "timestamp" : "2013-09-01T20:30:00.000-07:00",
  "event" : {
    "count" : 1,
    "language" : "en"
  }
}, {
  "version" : "v1",
  "timestamp" : "2013-09-02T20:30:00.000-07:00",
  "event" : {
    "count" : 1,
    "language" : "en"
  }
} ]
```

Note that the `origin` you specified has nothing to do with the timezone, it only serves as a starting point for locating the very first granularity bucket.
In this case, Row `{"timestamp": "2013-09-02T23:32:45Z", "page": "CCC", "language" : "en"}` and `{"timestamp": "2013-09-03T03:32:45Z", "page": "DDD", "language" : "en"}`
are not in the same bucket.

#### Supported Time Zones
Timezone support is provided by the [Joda Time library](http://www.joda.org), which uses the standard IANA time zones. See the [Joda Time supported timezones](http://joda-time.sourceforge.net/timezones.html).
-												Front Matter header needs to be on the first line for md to be rendered properly by jekyll (#6733)


											
										
										
											2018-12-13 14:47:20 -05:00
+								---
-												Docusaurus build framework + ingestion doc refresh. (#8311)

* Docusaurus build framework + ingestion doc refresh.

* stick to npm instead of yarn

* fix typos

* restore some _bin

* Adjustments.

* detect and fix redirect anchors

* update anchor lint

* Web-console: remove specific column filters (#8343)

* add clear filter

* update tool kit

* remove usless check

* auto run

* add %

* Fix resource leak (#8337)

* Fix resource leak

* Patch comments

* Enable Spotbugs NP_NONNULL_RETURN_VIOLATION (#8234)

* Fixes from PR review.

* Fix more anchors.

* Preamble nix.

* Fix more anchors, headers

* clean up placeholder page

* add to website lint to travis config

* better broken link checking

* travis fix

* Fixed more broken links

* better redirects

* unfancy catch

* fix LGTM error

* link fixes

* fix md issues

* Addl fixes

											
										
										
											2019-08-21 00:48:59 -04:00
+								id: granularities
-												Refresh query docs. (#9704)

* Refresh query docs.

Larger changes:

- New doc: querying/datasource.md describes the various kinds of
datasources you can use, and has examples for both SQL and native.
- New doc: querying/query-execution.md describes how native queries
are executed at a high level. It doesn't go into the details of specific
query engines or how queries run at a per-segment level. But I think it
would be good to add or link that content here in the future.
- Refreshed doc: querying/sql.md updated to refer to joins, reformatted
a bit, added a new "Query translation" section that explains how
queries are translated from SQL to native, and removed configuration
details (moved to configuration/index.md).
- Refreshed doc: querying/joins.md updated to refer to join datasources.

Smaller changes:

- Add helpful banners to the top of query documentation pages telling
people whether a given page describes SQL, native, or both.
- Add SQL metrics to operations/metrics.md.
- Add some color and cross-links in various places.
- Add native query component docs to the sidebar, and renamed them so
they look nicer.
- Remove Select query from the sidebar.
- Fix Broker SQL configs in configuration/index.md. Remove them from
querying/sql.md.
- Combined querying/searchquery.md and querying/searchqueryspec.md.

* Updates.

* Fix numbering.

* Fix glitches.

* Add new words to spellcheck file.

* Assorted changes.

* Further adjustments.

* Add missing punctuation.
											
										
										
											2020-04-15 19:12:20 -04:00
+								title: "Query granularities"
 								sidebar_label: "Granularities"
-												Front Matter header needs to be on the first line for md to be rendered properly by jekyll (#6733)


											
										
										
											2018-12-13 14:47:20 -05:00
+								---
-												add missing license headers, in particular to MD files; clean up RAT … (#6563)

* add missing license headers, in particular to MD files; clean up RAT exclusions

* revert inadvertent doc changes

* docs

* cr changes

* fix modified druid-production.svg

											
										
										
											2018-11-13 12:38:37 -05:00
+								<!--
 								  ~ Licensed to the Apache Software Foundation (ASF) under one
 								  ~ or more contributor license agreements.  See the NOTICE file
 								  ~ distributed with this work for additional information
 								  ~ regarding copyright ownership.  The ASF licenses this file
 								  ~ to you under the Apache License, Version 2.0 (the
 								  ~ "License"); you may not use this file except in compliance
 								  ~ with the License.  You may obtain a copy of the License at
 								  ~
 								  ~   http://www.apache.org/licenses/LICENSE-2.0
 								  ~
 								  ~ Unless required by applicable law or agreed to in writing,
 								  ~ software distributed under the License is distributed on an
 								  ~ "AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY
 								  ~ KIND, either express or implied.  See the License for the
 								  ~ specific language governing permissions and limitations
 								  ~ under the License.
 								  -->
-												Refresh query docs. (#9704)

* Refresh query docs.

Larger changes:

- New doc: querying/datasource.md describes the various kinds of
datasources you can use, and has examples for both SQL and native.
- New doc: querying/query-execution.md describes how native queries
are executed at a high level. It doesn't go into the details of specific
query engines or how queries run at a per-segment level. But I think it
would be good to add or link that content here in the future.
- Refreshed doc: querying/sql.md updated to refer to joins, reformatted
a bit, added a new "Query translation" section that explains how
queries are translated from SQL to native, and removed configuration
details (moved to configuration/index.md).
- Refreshed doc: querying/joins.md updated to refer to join datasources.

Smaller changes:

- Add helpful banners to the top of query documentation pages telling
people whether a given page describes SQL, native, or both.
- Add SQL metrics to operations/metrics.md.
- Add some color and cross-links in various places.
- Add native query component docs to the sidebar, and renamed them so
they look nicer.
- Remove Select query from the sidebar.
- Fix Broker SQL configs in configuration/index.md. Remove them from
querying/sql.md.
- Combined querying/searchquery.md and querying/searchqueryspec.md.

* Updates.

* Fix numbering.

* Fix glitches.

* Add new words to spellcheck file.

* Assorted changes.

* Further adjustments.

* Add missing punctuation.
											
										
										
											2020-04-15 19:12:20 -04:00
+								> Apache Druid supports two query languages: [Druid SQL](sql.md) and [native queries](querying.md).
 								> This document describes the native
 								> language. For information about time functions available in SQL, refer to the
 								> [SQL documentation](sql.md#time-functions).
-												Added titles and harmonized docs to improve usability and SEO (#6731)

* added titles and harmonized docs

* manually fixed some titles

											
										
										
											2018-12-12 23:42:12 -05:00
-												Docs - granularities link back to segmentGranularity (#11672)

* Update granularities.md

Link-back to the ingestion spec as well as Native queries plus examples.

* Update docs/querying/granularities.md

Co-authored-by: Charles Smith <techdocsmith@gmail.com>

* Update docs/querying/granularities.md

Co-authored-by: Charles Smith <techdocsmith@gmail.com>
											
										
										
											2021-09-10 13:40:11 -04:00
+								Granularity determines how to bucket data across the time dimension, or how to aggregate data by hour, day, minute, etc.
-												renaming all *.md filenames to only have lowercase and dashes
so that they are editable on case-insensitive os as well

											
										
										
											2015-05-05 17:07:32 -04:00
-												Docs - granularities link back to segmentGranularity (#11672)

* Update granularities.md

Link-back to the ingestion spec as well as Native queries plus examples.

* Update docs/querying/granularities.md

Co-authored-by: Charles Smith <techdocsmith@gmail.com>

* Update docs/querying/granularities.md

Co-authored-by: Charles Smith <techdocsmith@gmail.com>
											
										
										
											2021-09-10 13:40:11 -04:00
+								For example, use time granularities in [native queries](querying.md) to bucket results by time, and in the `dataSchema` \\ [`granularitySpec`](../ingestion/ingestion-spec.md#granularityspec) section of ingestion specifications to segment incoming data.
 								You can specify a time period as a [simple](#simple-granularities) string, as a [duration](#duration-granularities) in milliseconds, or as an arbitrary ISO8601 [period](#period-granularities).
-												renaming all *.md filenames to only have lowercase and dashes
so that they are editable on case-insensitive os as well

											
										
										
											2015-05-05 17:07:32 -04:00
 								### Simple Granularities
 								Simple granularities are specified as a string and bucket timestamps by their UTC time (e.g., days start at 00:00 UTC).
-												Fix formatting in granularities doc. (#3229)


											
										
										
											2016-07-08 12:29:58 -04:00
+								Supported granularity strings are: `all`, `none`, `second`, `minute`, `fifteen_minute`, `thirty_minute`, `hour`, `day`, `week`, `month`, `quarter` and `year`.
-												renaming all *.md filenames to only have lowercase and dashes
so that they are editable on case-insensitive os as well

											
										
										
											2015-05-05 17:07:32 -04:00
 								* `all` buckets everything into a single bucket
-												Docusaurus build framework + ingestion doc refresh. (#8311)

* Docusaurus build framework + ingestion doc refresh.

* stick to npm instead of yarn

* fix typos

* restore some _bin

* Adjustments.

* detect and fix redirect anchors

* update anchor lint

* Web-console: remove specific column filters (#8343)

* add clear filter

* update tool kit

* remove usless check

* auto run

* add %

* Fix resource leak (#8337)

* Fix resource leak

* Patch comments

* Enable Spotbugs NP_NONNULL_RETURN_VIOLATION (#8234)

* Fixes from PR review.

* Fix more anchors.

* Preamble nix.

* Fix more anchors, headers

* clean up placeholder page

* add to website lint to travis config

* better broken link checking

* travis fix

* Fixed more broken links

* better redirects

* unfancy catch

* fix LGTM error

* link fixes

* fix md issues

* Addl fixes

											
										
										
											2019-08-21 00:48:59 -04:00
+								* `none` does not bucket data (it actually uses the granularity of the index - minimum here is `none` which means millisecond granularity). Using `none` in a [TimeseriesQuery](../querying/timeseriesquery.md) is currently not recommended (the system will try to generate 0 values for all milliseconds that didn’t exist, which is often a lot).
-												renaming all *.md filenames to only have lowercase and dashes
so that they are editable on case-insensitive os as well

											
										
										
											2015-05-05 17:07:32 -04:00
-												add examples for duration and period granularities

											
										
										
											2015-10-15 17:23:23 -04:00
+								#### Example:
-												De-incubation cleanup in code, docs, packaging (#9108)

* De-incubation cleanup in code, docs, packaging

* remove unused docs script

											
										
										
											2020-01-03 12:33:19 -05:00
+								Suppose you have data below stored in Apache Druid with millisecond ingestion granularity,
-												add examples for duration and period granularities

											
										
										
											2015-10-15 17:23:23 -04:00
 								``` json
 								{"timestamp": "2013-08-31T01:02:33Z", "page": "AAA", "language" : "en"}
 								{"timestamp": "2013-09-01T01:02:33Z", "page": "BBB", "language" : "en"}
 								{"timestamp": "2013-09-02T23:32:45Z", "page": "CCC", "language" : "en"}
 								{"timestamp": "2013-09-03T03:32:45Z", "page": "DDD", "language" : "en"}
 								```
 								After submitting a groupBy query with `hour` granularity,
 								``` json
 								{
 								   "queryType":"groupBy",
 								   "dataSource":"my_dataSource",
 								   "granularity":"hour",
 								   "dimensions":[
 								      "language"
 								   ],
 								   "aggregations":[
 								      {
 								         "type":"count",
 								         "name":"count"
 								      }
 								   ],
 								   "intervals":[
 								      "2000-01-01T00:00Z/3000-01-01T00:00Z"
 								   ]
 								}
 								```
 								you will get
 								``` json
 								[ {
 								  "version" : "v1",
 								  "timestamp" : "2013-08-31T01:00:00.000Z",
 								  "event" : {
 								    "count" : 1,
 								    "language" : "en"
 								  }
 								}, {
 								  "version" : "v1",
 								  "timestamp" : "2013-09-01T01:00:00.000Z",
 								  "event" : {
 								    "count" : 1,
 								    "language" : "en"
 								  }
 								}, {
 								  "version" : "v1",
 								  "timestamp" : "2013-09-02T23:00:00.000Z",
 								  "event" : {
 								    "count" : 1,
 								    "language" : "en"
 								  }
 								}, {
 								  "version" : "v1",
 								  "timestamp" : "2013-09-03T03:00:00.000Z",
 								  "event" : {
 								    "count" : 1,
 								    "language" : "en"
 								  }
 								} ]
 								```
 								Note that all the empty buckets are discarded.
 								If you change the granularity to `day`, you will get
 								``` json
 								[ {
 								  "version" : "v1",
 								  "timestamp" : "2013-08-31T00:00:00.000Z",
 								  "event" : {
 								    "count" : 1,
 								    "language" : "en"
 								  }
 								}, {
 								  "version" : "v1",
 								  "timestamp" : "2013-09-01T00:00:00.000Z",
 								  "event" : {
 								    "count" : 1,
 								    "language" : "en"
 								  }
 								}, {
 								  "version" : "v1",
 								  "timestamp" : "2013-09-02T00:00:00.000Z",
 								  "event" : {
 								    "count" : 1,
 								    "language" : "en"
 								  }
 								}, {
 								  "version" : "v1",
 								  "timestamp" : "2013-09-03T00:00:00.000Z",
 								  "event" : {
 								    "count" : 1,
 								    "language" : "en"
 								  }
 								} ]
 								```
 								If you change the granularity to `none`, you will get the same results as setting it to the ingestion granularity.
 								``` json
 								[ {
 								  "version" : "v1",
 								  "timestamp" : "2013-08-31T01:02:33.000Z",
 								  "event" : {
 								    "count" : 1,
 								    "language" : "en"
 								  }
 								}, {
 								  "version" : "v1",
 								  "timestamp" : "2013-09-01T01:02:33.000Z",
 								  "event" : {
 								    "count" : 1,
 								    "language" : "en"
 								  }
 								}, {
 								  "version" : "v1",
 								  "timestamp" : "2013-09-02T23:32:45.000Z",
 								  "event" : {
 								    "count" : 1,
 								    "language" : "en"
 								  }
 								}, {
 								  "version" : "v1",
 								  "timestamp" : "2013-09-03T03:32:45.000Z",
 								  "event" : {
 								    "count" : 1,
 								    "language" : "en"
 								  }
 								} ]
 								```
-												clarify granularity docs (#7977)


											
										
										
											2019-06-27 11:51:22 -04:00
+								Having a query time `granularity` that is smaller than the `queryGranularity` parameter set at
-												Docs refactor of ingestion. Carries #11541 (#11576)

* Docs refactor of ingestion. Carries #11541

* Update docs/misc/math-expr.md

* add Apache license

* fix header, add topics to sidebar

* Update docs/ingestion/partitioning.md

* pick up changes to  and  md from c7fdf1d, #11479

Co-authored-by: Suneet Saldanha <suneet@apache.org>
Co-authored-by: Jihoon Son <jihoonson@apache.org>
											
										
										
											2021-08-13 11:42:03 -04:00
+								[ingestion time](../ingestion/ingestion-spec.md#granularityspec) is unreasonable because information about that
-												clarify granularity docs (#7977)


											
										
										
											2019-06-27 11:51:22 -04:00
+								smaller granularity is not present in the indexed data. So, if the query time granularity is smaller than the ingestion
 								time query granularity, Druid produces results that are equivalent to having set `granularity` to `queryGranularity`.
-												add examples for duration and period granularities

											
										
										
											2015-10-15 17:23:23 -04:00
 								If you change the granularity to `all`, you will get everything aggregated in 1 bucket,
 								``` json
 								[ {
 								  "version" : "v1",
 								  "timestamp" : "2000-01-01T00:00:00.000Z",
 								  "event" : {
 								    "count" : 4,
 								    "language" : "en"
 								  }
 								} ]
 								```
-												renaming all *.md filenames to only have lowercase and dashes
so that they are editable on case-insensitive os as well

											
										
										
											2015-05-05 17:07:32 -04:00
+								### Duration Granularities
 								Duration granularities are specified as an exact duration in milliseconds and timestamps are returned as UTC. Duration granularity values are in millis.
 								They also support specifying an optional origin, which defines where to start counting time buckets from (defaults to 1970-01-01T00:00:00Z).
 								```javascript
 								{"type": "duration", "duration": 7200000}
 								```
 								This chunks up every 2 hours.
 								```javascript
 								{"type": "duration", "duration": 3600000, "origin": "2012-01-01T00:30:00Z"}
 								```
 								This chunks up every hour on the half-hour.
-												add examples for duration and period granularities

											
										
										
											2015-10-15 17:23:23 -04:00
+								#### Example:
 								Reusing the data in the previous example, after submitting a groupBy query with 24 hours duration,
 								``` json
 								{
 								   "queryType":"groupBy",
 								   "dataSource":"my_dataSource",
 								   "granularity":{"type": "duration", "duration": "86400000"},
 								   "dimensions":[
 								      "language"
 								   ],
 								   "aggregations":[
 								      {
 								         "type":"count",
 								         "name":"count"
 								      }
 								   ],
 								   "intervals":[
 								      "2000-01-01T00:00Z/3000-01-01T00:00Z"
 								   ]
 								}
 								```
 								you will get
 								``` json
 								[ {
 								  "version" : "v1",
 								  "timestamp" : "2013-08-31T00:00:00.000Z",
 								  "event" : {
 								    "count" : 1,
 								    "language" : "en"
 								  }
 								}, {
 								  "version" : "v1",
 								  "timestamp" : "2013-09-01T00:00:00.000Z",
 								  "event" : {
 								    "count" : 1,
 								    "language" : "en"
 								  }
 								}, {
 								  "version" : "v1",
 								  "timestamp" : "2013-09-02T00:00:00.000Z",
 								  "event" : {
 								    "count" : 1,
 								    "language" : "en"
 								  }
 								}, {
 								  "version" : "v1",
 								  "timestamp" : "2013-09-03T00:00:00.000Z",
 								  "event" : {
 								    "count" : 1,
 								    "language" : "en"
 								  }
 								} ]
 								```
 								if you set the origin for the granularity to `2012-01-01T00:30:00Z`,
 								``` javascript
 								   "granularity":{"type": "duration", "duration": "86400000", "origin":"2012-01-01T00:30:00Z"}
 								```
 								you will get
 								``` json
 								[ {
 								  "version" : "v1",
 								  "timestamp" : "2013-08-31T00:30:00.000Z",
 								  "event" : {
 								    "count" : 1,
 								    "language" : "en"
 								  }
 								}, {
 								  "version" : "v1",
 								  "timestamp" : "2013-09-01T00:30:00.000Z",
 								  "event" : {
 								    "count" : 1,
 								    "language" : "en"
 								  }
 								}, {
 								  "version" : "v1",
 								  "timestamp" : "2013-09-02T00:30:00.000Z",
 								  "event" : {
 								    "count" : 1,
 								    "language" : "en"
 								  }
 								}, {
 								  "version" : "v1",
 								  "timestamp" : "2013-09-03T00:30:00.000Z",
 								  "event" : {
 								    "count" : 1,
 								    "language" : "en"
 								  }
 								} ]
 								```
 								Note that the timestamp for each bucket starts at the 30th minute.
-												renaming all *.md filenames to only have lowercase and dashes
so that they are editable on case-insensitive os as well

											
										
										
											2015-05-05 17:07:32 -04:00
+								### Period Granularities
 								Period granularities are specified as arbitrary period combinations of years, months, weeks, hours, minutes and seconds (e.g. P2W, P3M, PT1H30M, PT0.750S) in [ISO8601](https://en.wikipedia.org/wiki/ISO_8601) format. They support specifying a time zone which determines where period boundaries start as well as the timezone of the returned timestamps. By default, years start on the first of January, months start on the first of the month and weeks start on Mondays unless an origin is specified.
 								Time zone is optional (defaults to UTC). Origin is optional (defaults to 1970-01-01T00:00:00 in the given time zone).
 								```javascript
 								{"type": "period", "period": "P2D", "timeZone": "America/Los_Angeles"}
 								```
 								This will bucket by two-day chunks in the Pacific timezone.
 								```javascript
 								{"type": "period", "period": "P3M", "timeZone": "America/Los_Angeles", "origin": "2012-02-01T00:00:00-08:00"}
 								```
 								This will bucket by 3-month chunks in the Pacific timezone where the three-month quarters are defined as starting from February.
-												add examples for duration and period granularities

											
										
										
											2015-10-15 17:23:23 -04:00
+								#### Example
 								Reusing the data in the previous example, if you submit a groupBy query with 1 day period in Pacific timezone,
 								``` json
 								{
 								   "queryType":"groupBy",
 								   "dataSource":"my_dataSource",
 								   "granularity":{"type": "period", "period": "P1D", "timeZone": "America/Los_Angeles"},
 								   "dimensions":[
 								      "language"
 								   ],
 								   "aggregations":[
 								      {
 								         "type":"count",
 								         "name":"count"
 								      }
 								   ],
 								   "intervals":[
 								      "1999-12-31T16:00:00.000-08:00/2999-12-31T16:00:00.000-08:00"
 								   ]
 								}
 								```
 								you will get
 								``` json
 								[ {
 								  "version" : "v1",
 								  "timestamp" : "2013-08-30T00:00:00.000-07:00",
 								  "event" : {
 								    "count" : 1,
 								    "language" : "en"
 								  }
 								}, {
 								  "version" : "v1",
 								  "timestamp" : "2013-08-31T00:00:00.000-07:00",
 								  "event" : {
 								    "count" : 1,
 								    "language" : "en"
 								  }
 								}, {
 								  "version" : "v1",
 								  "timestamp" : "2013-09-02T00:00:00.000-07:00",
 								  "event" : {
 								    "count" : 2,
 								    "language" : "en"
 								  }
 								} ]
 								```
 								Note that the timestamp for each bucket has been converted to Pacific time. Row `{"timestamp": "2013-09-02T23:32:45Z", "page": "CCC", "language" : "en"}` and
 								`{"timestamp": "2013-09-03T03:32:45Z", "page": "DDD", "language" : "en"}` are put in the same bucket because they are in the same day in Pacific time.
 								Also note that the `intervals` in groupBy query will not be converted to the timezone specified, the timezone specified in granularity is only applied on the
 								query results.
 								If you set the origin for the granularity to `1970-01-01T20:30:00-08:00`,
 								``` javascript
 								   "granularity":{"type": "period", "period": "P1D", "timeZone": "America/Los_Angeles", "origin": "1970-01-01T20:30:00-08:00"}
 								```
 								you will get
 								``` json
 								[ {
 								  "version" : "v1",
 								  "timestamp" : "2013-08-29T20:30:00.000-07:00",
 								  "event" : {
 								    "count" : 1,
 								    "language" : "en"
 								  }
 								}, {
 								  "version" : "v1",
 								  "timestamp" : "2013-08-30T20:30:00.000-07:00",
 								  "event" : {
 								    "count" : 1,
 								    "language" : "en"
 								  }
 								}, {
 								  "version" : "v1",
 								  "timestamp" : "2013-09-01T20:30:00.000-07:00",
 								  "event" : {
 								    "count" : 1,
 								    "language" : "en"
 								  }
 								}, {
 								  "version" : "v1",
 								  "timestamp" : "2013-09-02T20:30:00.000-07:00",
 								  "event" : {
 								    "count" : 1,
 								    "language" : "en"
 								  }
 								} ]
 								```
 								Note that the `origin` you specified has nothing to do with the timezone, it only serves as a starting point for locating the very first granularity bucket.
 								In this case, Row `{"timestamp": "2013-09-02T23:32:45Z", "page": "CCC", "language" : "en"}` and `{"timestamp": "2013-09-03T03:32:45Z", "page": "DDD", "language" : "en"}`
 								are not in the same bucket.
-												renaming all *.md filenames to only have lowercase and dashes
so that they are editable on case-insensitive os as well

											
										
										
											2015-05-05 17:07:32 -04:00
+								#### Supported Time Zones
 								Timezone support is provided by the [Joda Time library](http://www.joda.org), which uses the standard IANA time zones. See the [Joda Time supported timezones](http://joda-time.sourceforge.net/timezones.html).