---
layout: doc_page
title: "Loading Streams"
---

<!--
  ~ Licensed to the Apache Software Foundation (ASF) under one
  ~ or more contributor license agreements.  See the NOTICE file
  ~ distributed with this work for additional information
  ~ regarding copyright ownership.  The ASF licenses this file
  ~ to you under the Apache License, Version 2.0 (the
  ~ "License"); you may not use this file except in compliance
  ~ with the License.  You may obtain a copy of the License at
  ~
  ~   http://www.apache.org/licenses/LICENSE-2.0
  ~
  ~ Unless required by applicable law or agreed to in writing,
  ~ software distributed under the License is distributed on an
  ~ "AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY
  ~ KIND, either express or implied.  See the License for the
  ~ specific language governing permissions and limitations
  ~ under the License.
  -->

# Loading Streams

Streams can be ingested into Druid using either [Tranquility](https://github.com/druid-io/tranquility) (a Druid-aware
client) or the [Kafka Indexing Service](../development/extensions-core/kafka-ingestion.html).

## Tranquility (Stream Push)

If you have a program that generates a stream, you can push that stream directly into Druid in
real time. With this approach, Tranquility is embedded in your data-producing application.
Tranquility comes with bindings for the Storm and Samza stream processors. It also has a direct
API that can be used from any JVM-based program, such as Spark Streaming or a Kafka consumer.

Tranquility handles partitioning, replication, service discovery, and schema rollover for you,
seamlessly and without downtime. You only have to define your Druid schema.

For examples and more information, please see the [Tranquility README](https://github.com/druid-io/tranquility).

A tutorial is also available at [Tutorial: Loading stream data using HTTP push](../tutorials/tutorial-tranquility.html).
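
As a rough illustration of the push model, the sketch below builds an HTTP POST request carrying newline-delimited JSON events. It assumes a Tranquility Server listening at `localhost:8200` with a `/v1/post/<dataSource>` endpoint. The dataSource name `metrics`, the event fields, and the helper `build_push_request` are hypothetical; check the tutorial and the Tranquility README for the details that apply to your setup.

```python
import json
import urllib.request


def build_push_request(events, data_source="metrics", base_url="http://localhost:8200"):
    """Build (but do not send) a POST request of newline-delimited JSON events.

    The host, port, and /v1/post/<dataSource> path are assumptions about a
    Tranquility Server deployment; adjust them to match your configuration.
    """
    body = "\n".join(json.dumps(event) for event in events).encode("utf-8")
    return urllib.request.Request(
        url=f"{base_url}/v1/post/{data_source}",
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",
    )


# Hypothetical events; the field names must match the Druid schema you defined.
events = [
    {"timestamp": "2018-12-13T00:00:00Z", "page": "home", "count": 1},
    {"timestamp": "2018-12-13T00:00:01Z", "page": "docs", "count": 3},
]
request = build_push_request(events)
# Sending requires a running server: urllib.request.urlopen(request)
```

Building the request separately from sending it keeps the sketch runnable without a server and makes the payload easy to inspect.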

## Kafka Indexing Service (Stream Pull)

Druid can pull data from Kafka streams using the [Kafka Indexing Service](../development/extensions-core/kafka-ingestion.html).

The Kafka indexing service enables the configuration of *supervisors* on the Overlord, which facilitate ingestion from
Kafka by managing the creation and lifetime of Kafka indexing tasks. These indexing tasks read events using Kafka's own
partition and offset mechanism and are therefore able to provide guarantees of exactly-once ingestion. They are also
able to read non-recent events from Kafka and are not subject to the window period considerations imposed on other
ingestion mechanisms. The supervisor oversees the state of the indexing tasks to coordinate handoffs, manage failures,
and ensure that the scalability and replication requirements are maintained.
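
To make the supervisor idea concrete, here is a minimal sketch of a supervisor spec and the request that would submit it to the Overlord. It assumes the Overlord is reachable at `localhost:8090` and accepts specs at `/druid/indexer/v1/supervisor`. The dataSource, topic, and broker address are placeholders, the helper names are hypothetical, and the `dataSchema` is deliberately incomplete, so a real spec needs the parser, metrics, and granularity settings described on the linked Kafka Indexing Service page.

```python
import json
import urllib.request


def build_supervisor_spec(data_source, topic, bootstrap_servers, task_count=1, replicas=1):
    """Return a skeleton Kafka supervisor spec as a plain dict.

    Only the fields relevant to scaling and replication are filled in;
    the dataSchema is intentionally left incomplete (assumption: you
    will add the parser, metricsSpec, and granularitySpec yourself).
    """
    return {
        "type": "kafka",
        "dataSchema": {"dataSource": data_source},
        "ioConfig": {
            "topic": topic,
            "consumerProperties": {"bootstrap.servers": bootstrap_servers},
            "taskCount": task_count,   # number of reading tasks per replica set
            "replicas": replicas,      # number of replica sets
            "taskDuration": "PT1H",    # how long a task reads before handoff
        },
    }


def build_submit_request(spec, overlord_url="http://localhost:8090"):
    """Build (but do not send) the POST request that submits the spec."""
    return urllib.request.Request(
        url=f"{overlord_url}/druid/indexer/v1/supervisor",
        data=json.dumps(spec).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


spec = build_supervisor_spec("metrics-kafka", "metrics", "localhost:9092", task_count=2)
submit_request = build_submit_request(spec)
# Sending requires a running Overlord: urllib.request.urlopen(submit_request)
```

In this sketch, `taskCount` and `replicas` are the knobs behind the scalability and replication requirements that the supervisor maintains.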

A tutorial is available at [Tutorial: Loading stream data from Kafka](../tutorials/tutorial-kafka.html).