---
layout: doc_page
title: "Druid vs Elasticsearch"
---

<!--
  ~ Licensed to the Apache Software Foundation (ASF) under one
  ~ or more contributor license agreements. See the NOTICE file
  ~ distributed with this work for additional information
  ~ regarding copyright ownership. The ASF licenses this file
  ~ to you under the Apache License, Version 2.0 (the
  ~ "License"); you may not use this file except in compliance
  ~ with the License. You may obtain a copy of the License at
  ~
  ~   http://www.apache.org/licenses/LICENSE-2.0
  ~
  ~ Unless required by applicable law or agreed to in writing,
  ~ software distributed under the License is distributed on an
  ~ "AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY
  ~ KIND, either express or implied. See the License for the
  ~ specific language governing permissions and limitations
  ~ under the License.
  -->

# Druid vs Elasticsearch

We are not experts on search systems. If anything about our portrayal is incorrect, please let us know on the mailing list or through some other means.

Elasticsearch is a search system based on Apache Lucene. It provides full text search for schema-free documents
and provides access to raw event-level data. Elasticsearch is increasingly adding more support for analytics and aggregations.
[Some members of the community](https://groups.google.com/forum/#!msg/druid-development/nlpwTHNclj8/sOuWlKOzPpYJ) have pointed out
that the resource requirements for data ingestion and aggregation in Elasticsearch are much higher than those of Druid.

Elasticsearch also does not support data summarization/roll-up at ingestion time, which can compact the data that needs to be
stored by up to 100x with real-world data sets. This leads to Elasticsearch having greater storage requirements.
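
To make the idea of roll-up concrete, here is a minimal Python sketch of summarizing events at ingestion time. It is an illustration only, not Druid's implementation: the `roll_up` helper, the field names (`timestamp`, `page`, `country`, `clicks`), and the hourly granularity are all assumptions made for this example.

```python
from collections import defaultdict
from datetime import datetime


def roll_up(events, dimensions, metrics):
    """Sum metrics over events that share the same hour and dimension values.

    Hypothetical helper for illustration; hourly granularity is an assumption.
    """
    totals = defaultdict(lambda: defaultdict(int))
    for event in events:
        # Truncate the timestamp to the start of its hour.
        hour = datetime.fromisoformat(event["timestamp"]).replace(
            minute=0, second=0, microsecond=0
        )
        key = (hour.isoformat(),) + tuple(event[d] for d in dimensions)
        for metric in metrics:
            totals[key][metric] += event[metric]
        # Track how many raw events were folded into this row.
        totals[key]["count"] += 1

    rows = []
    for key, sums in sorted(totals.items()):
        row = {"timestamp": key[0]}
        row.update(zip(dimensions, key[1:]))
        row.update(sums)
        rows.append(row)
    return rows


# Made-up sample events: four raw rows collapse into three stored rows.
events = [
    {"timestamp": "2015-05-05T17:07:32", "page": "home", "country": "US", "clicks": 1},
    {"timestamp": "2015-05-05T17:21:10", "page": "home", "country": "US", "clicks": 3},
    {"timestamp": "2015-05-05T17:45:59", "page": "home", "country": "CA", "clicks": 2},
    {"timestamp": "2015-05-05T18:02:00", "page": "home", "country": "US", "clicks": 5},
]
rolled = roll_up(events, ["page", "country"], ["clicks"])
```

The two `US` events in the 17:00 hour are stored as a single row with their `clicks` summed. The individual raw events can no longer be retrieved, which is the trade-off roll-up makes for reduced storage.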

Druid focuses on OLAP workflows. Druid is optimized for high performance (fast aggregation and ingestion) at low cost,
and supports a wide range of analytic operations. Druid has some basic search support for structured event data, but does not support
full text search. Druid also does not support completely unstructured data. Measures must be defined in a Druid schema so that
summarization/roll-up can be done.