//lcawley Verified example output 2017-04-11
[[ml-datafeed-resource]]
==== Data Feed Resources

A data feed resource has the following properties:

`aggregations`::
  (object) If set, the data feed performs aggregation searches.
  For syntax information, see {ref}/search-aggregations.html[Aggregations].
  Support for aggregations is limited: TBD.
  For example:
  `{"@timestamp": {"histogram": {"field": "@timestamp",
  "interval": 30000,"offset": 0,"order": {"_key": "asc"},"keyed": false,
  "min_doc_count": 0}, "aggregations": {"events_per_min": {"sum": {
  "field": "events_per_min"}}}}}`.

`chunking_config`::
  (object) The chunking configuration, which specifies how data searches are
  chunked. See <<ml-datafeed-chunking-config>>.
  For example: `{"mode": "manual", "time_span": "3h"}`.

`datafeed_id`::
 (string) A numerical character string that uniquely identifies the data feed.

`frequency`::
  (time units) The interval at which scheduled queries are made while the data
  feed runs in real time. The default value is either the bucket span for short
  bucket spans, or, for longer bucket spans, a sensible fraction of the bucket
  span. For example: "150s".

`indexes` (required)::
  (array) An array of index names. For example: ["it_ops_metrics"].

`job_id` (required)::
 (string) The unique identifier for the job to which the data feed sends data.

`query`::
  (object) The Elasticsearch query domain-specific language (DSL). This value
  corresponds to the query object in an Elasticsearch search POST body. All the
  options that are supported by Elasticsearch can be used, as this object is
  passed verbatim to Elasticsearch. If this property is not specified, the
  default value is `{"match_all": {"boost": 1}}`.

`query_delay`::
  (time units) The number of seconds behind real time that data is queried. For
  example, if data from 10:04 a.m. might not be searchable in Elasticsearch
  until 10:06 a.m., set this property to 120 seconds. The default value is 60
  seconds. For example: "60s".

`scroll_size`::
  (unsigned integer) The `size` parameter that is used in Elasticsearch searches.
  The default value is `1000`.

`types` (required)::
  (array) A list of types to search for within the specified indices.
  For example: ["network","sql","kpi"].
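
As a rough illustration, the properties above can be assembled into a single
data feed object. The following Python sketch reuses the example values from
the list; the identifiers `datafeed-it-ops-kpi` and `it-ops-kpi` and the helper
`missing_required` are hypothetical and exist only for this example. The helper
checks only for the properties marked as required above and is not how
Elasticsearch validates a data feed.

```python
import json

# Properties marked "(required)" in the list above.
REQUIRED_PROPERTIES = ("indexes", "job_id", "types")


def missing_required(datafeed):
    """Return the required property names that are absent from the object."""
    return [name for name in REQUIRED_PROPERTIES if name not in datafeed]


# Values come from the examples above. The two identifiers are hypothetical.
datafeed = {
    "datafeed_id": "datafeed-it-ops-kpi",
    "job_id": "it-ops-kpi",
    "indexes": ["it_ops_metrics"],
    "types": ["network", "sql", "kpi"],
    "query": {"match_all": {"boost": 1}},
    "query_delay": "60s",
    "frequency": "150s",
    "scroll_size": 1000,
    "chunking_config": {"mode": "manual", "time_span": "3h"},
}

# Print the object as it might appear in a request or response body.
print(json.dumps(datafeed, indent=2))
```

Calling `missing_required(datafeed)` on this object returns an empty list,
because all three required properties are present.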

[[ml-datafeed-chunking-config]]
===== Chunking Configuration Objects

A chunking configuration object has the following properties:

`mode` (required)::
  There are three available modes: +
  `auto`::: The chunk size is calculated dynamically.
  `manual`::: Chunking is applied according to the specified `time_span`.
  `off`::: No chunking is applied.

`time_span`::
  (time units) The time span that each search queries.
  This setting is only applicable when the mode is set to `manual`.
  For example: "3h".
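
To make the `manual` mode more concrete, the following Python sketch splits a
time range into consecutive search windows of at most one `time_span` each. It
is a conceptual illustration only, not the data feed's actual implementation:
the function name `manual_chunks`, the half-open window boundaries, and the
sample dates are all assumptions made for this example.

```python
from datetime import datetime, timedelta


def manual_chunks(start, end, time_span):
    """Split [start, end) into consecutive windows of at most time_span."""
    if time_span <= timedelta(0):
        raise ValueError("time_span must be positive")
    chunks = []
    chunk_start = start
    while chunk_start < end:
        # The last window is shortened so that it never extends past `end`.
        chunk_end = min(chunk_start + time_span, end)
        chunks.append((chunk_start, chunk_end))
        chunk_start = chunk_end
    return chunks


# An eight-hour range with a "3h" time span (sample dates are arbitrary).
start = datetime(2017, 4, 11, 0, 0)
end = datetime(2017, 4, 11, 8, 0)
for chunk_start, chunk_end in manual_chunks(start, end, timedelta(hours=3)):
    print(chunk_start.isoformat(), "->", chunk_end.isoformat())
```

With these inputs the range is covered by three windows: two of three hours
and a final one of two hours.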

[float]
[[ml-datafeed-counts]]
==== Data Feed Counts

The get data feed statistics API provides information about the operational
progress of a data feed. It returns the following properties:

`assignment_explanation`::
  TBD. For example: " "

`datafeed_id`::
 (string) A numerical character string that uniquely identifies the data feed.

`node`::
  (object) The node on which the data feed is started.
  `id`::: TBD. For example, "0-o0tOoRTwKFZifatTWKNw".
  `name`::: TBD. For example, "0-o0tOo".
  `ephemeral_id`::: TBD. For example, "DOZltLxLS_SzYpW6hQ9hyg".
  `transport_address`::: TBD. For example, "127.0.0.1:9300".
  `attributes`::: TBD. For example, {"max_running_jobs": "10"}.

`state`::
  (string) The status of the data feed, which can be one of the following values: +
  `started`::: The data feed is actively receiving data.
  `stopped`::: The data feed is stopped and will not receive data until it is
  restarted.
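
For illustration, the following Python sketch shows how a client might
summarize one statistics object with the properties listed above. The sample
object reuses the example values from this section; the identifier
`datafeed-it-ops-kpi` and the helper `describe` are hypothetical. The sketch
also assumes that the `node` object can be absent, for example when the data
feed is stopped.

```python
def describe(stats):
    """Summarize a data feed statistics object in one line."""
    # Assumption: `node` may be missing, for example for a stopped data feed.
    node = stats.get("node")
    where = node["name"] if node else "no node"
    return "{}: {} ({})".format(stats["datafeed_id"], stats["state"], where)


# Sample object built from the example values above (identifier is hypothetical).
stats = {
    "datafeed_id": "datafeed-it-ops-kpi",
    "state": "started",
    "node": {
        "id": "0-o0tOoRTwKFZifatTWKNw",
        "name": "0-o0tOo",
        "ephemeral_id": "DOZltLxLS_SzYpW6hQ9hyg",
        "transport_address": "127.0.0.1:9300",
        "attributes": {"max_running_jobs": "10"},
    },
}

print(describe(stats))
```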