From 1dcdee97f85c96aa71359e0c1c28a9e69c456702 Mon Sep 17 00:00:00 2001 From: ashwinkumar12345 Date: Mon, 16 Aug 2021 22:44:22 -0700 Subject: [PATCH 1/2] incorporated feedback --- _opensearch/data-streams.md | 11 +++++------ 1 file changed, 5 insertions(+), 6 deletions(-) diff --git a/_opensearch/data-streams.md b/_opensearch/data-streams.md index 8efa9702..4d14714b 100644 --- a/_opensearch/data-streams.md +++ b/_opensearch/data-streams.md @@ -6,15 +6,14 @@ nav_order: 13 # Data streams -If you're ingesting continuously generated time-series data such as logs, events, and metrics into OpenSearch, you're likely in a scenario where: +If you're ingesting continuously generated time-series data such as logs, events, and metrics into OpenSearch, you're likely in a scenario where the number of documents grows rapidly and you don't need to update older documents. -- You’re ingesting documents that grow rapidly. -- You don’t need to update older documents. -- Your searches generally target the newer documents. +A typical workflow to manage time-series data involves multiple steps such as creating a rollover index alias, defining a write index, and defining common mappings and settings for the backing indices. -A typical workflow to manage time-series data consists of setting up an alias, configuring a rollover operation, defining a write index, and creating common mapping fields in an index template. Data streams simplifies this process. +Data streams simplify this bootstrapping process and enforce a setup that best suits time-series data, such as being designed primarily for append-only data, and ensuring that each document has a timestamp field. + +A data stream is internally composed of multiple backing indices. Search requests are routed to all the backing indices, while indexing requests are routed to the latest write index. You can use [ISM]({{site.url}}{{site.baseurl}}/im-plugin/ism/index.md/) policies to automatically handle rollovers or deletion of indices in a data stream, based on your use case. -With data streams, you can store append-only time-series data across multiple indices with a single endpoint for ingesting and searching data. We recommend using data streams in place of index aliases for time-series data. ## About data streams From fd1ea4c9acb3618520ffbd04e1d1dbdf09cb2d08 Mon Sep 17 00:00:00 2001 From: ashwinkumar12345 Date: Tue, 17 Aug 2021 00:15:40 -0700 Subject: [PATCH 2/2] minor changes --- _opensearch/data-streams.md | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/_opensearch/data-streams.md b/_opensearch/data-streams.md index 4d14714b..312e5335 100644 --- a/_opensearch/data-streams.md +++ b/_opensearch/data-streams.md @@ -12,7 +12,7 @@ A typical workflow to manage time-series data involves multiple steps such as cr Data streams simplify this bootstrapping process and enforce a setup that best suits time-series data, such as being designed primarily for append-only data, and ensuring that each document has a timestamp field. -A data stream is internally composed of multiple backing indices. Search requests are routed to all the backing indices, while indexing requests are routed to the latest write index. You can use [ISM]({{site.url}}{{site.baseurl}}/im-plugin/ism/index.md/) policies to automatically handle rollovers or deletion of indices in a data stream, based on your use case. +A data stream is internally composed of multiple backing indices. Search requests are routed to all the backing indices, while indexing requests are routed to the latest write index. You can use [ISM]({{site.url}}{{site.baseurl}}/im-plugin/ism/index/) policies to automatically handle rollovers or deletion of indices in a data stream, based on your use case. ## About data streams