OpenSearch/docs/en/rest-api/ml/datafeedresource.asciidoc

//lcawley Verified example output 2017-04-11
[[ml-datafeed-resource]]
==== Data Feed Resources

A data feed resource has the following properties:

`aggregations`::
  (object) When set the datafeed performs aggregation searches.
  For syntax information, see {ref}search-aggregations.html[Aggregations].
  Support for aggregations is limited: TBD.
  For example:
  `{"@timestamp": {"histogram": {"field": "@timestamp",
  "interval": 30000,"offset": 0,"order": {"_key": "asc"},"keyed": false,
  "min_doc_count": 0}, "aggregations": {"events_per_min": {"sum": {
  "field": "events_per_min"}}}}}`.

`chunking_config`::
  (object) The chunking configuration, which specifies how data searches
  will be chunked. See <<ml-datafeed-chunking-config,chunking configuration objects>>.
  For example: {"mode": "manual", "time_span": "3h"}

`datafeed_id`::
 (string) A numerical character string that uniquely identifies the data feed.

`frequency`::
  (time units) Interval at which scheduled queries should be made while the datafeed
  runs in real-time. The default is either the bucket span for short bucket spans, or,
  for longer bucket spans, a sensible fraction of the bucket span.
  For example: "150s"

`indexes` (required)::
  (array) An array of index names. For example: ["it_ops_metrics"]

`job_id` (required)::
 (string) The id of the job to which the datafeed will send data.

`query`::
  (object) Elasticsearch query DSL. Corresponds to the query object in an Elasticsearch
  search POST body. All options supported by Elasticsearch may be used, as this object
  is passed verbatim to Elasticsearch. If not specified the default is “match_all”: {}
  By default, this property has the following value: `{"match_all": {"boost": 1}}`.

`query_delay`::
  (time units) How many seconds behind real-time should data be queried. For example,
  if data from 10:04am may not be searchable in Elasticsearch until 10:06am then set this to 120 seconds.
  The default is 60 seconds. For example: "60s"

`scroll_size`::
  (unsigned integer) The `size` parameter to be used in elasticsearch searches.
  The default value is `1000`.

`types` (required)::
  (array) List of types to search for within the specified indexes.
  For example: ["network","sql","kpi"]

[[ml-datafeed-chunking-config]]
===== Chunking Configuration Objects

A chunking configuration object has the following properties:

`mode` (required)::
  There are 3 available modes: +
  `auto`::: the chunk size will be dynamically calculated.
  `manual`::: chunking will be applied according to the specified `time_span`.
  `off`::: no chunking will be applied.

`time_span`::
  (time units) The time span that each search will be querying.
  This setting is only applicable when the mode is set to `manual`.
  For example: "3h"

[float]
[[ml-datafeed-counts]]
==== Data Feed Counts

The get data feed statistics API provides information about the operational
progress of a data feed. For example:

`assigment_explanation`::
  TBD. For example: " "

`datafeed_id`::
 (string) A numerical character string that uniquely identifies the data feed.

`node`::
  (object) TBD
  The node that is running the query?
  `id`::: TBD. For example, "0-o0tOoRTwKFZifatTWKNw".
  `name`::: TBD. For example, "0-o0tOo".
  `ephemeral_id`::: TBD. For example, "DOZltLxLS_SzYpW6hQ9hyg".
  `transport_address`::: TBD. For example, "127.0.0.1:9300".
  `attributes`::: TBD. For example, {"max_running_jobs": "10"}.

`state`::
  (string) The status of the data feed, which can be one of the following values: +
  `started`::: The data feed is actively receiving data.
  `stopped`::: The data feed is stopped and will not receive data until it is re-started.
[DOCS] Update all ML API examples with latest build output Original commit: elastic/x-pack-elasticsearch@f9fa3b813afc415486183895bc7168684edff0ee 2017-04-11 18:52:47 -07:00			`//lcawley Verified example output 2017-04-11`
[DOCS] Add ML documentation to master (elastic/x-pack-elasticsearch#959) Original commit: elastic/x-pack-elasticsearch@666a10bd23eaf692dcb25d8aedcb5bc4da735370 2017-04-04 15:26:39 -07:00			`[[ml-datafeed-resource]]`
			`==== Data Feed Resources`

			`A data feed resource has the following properties:`

[DOCS] Add ML data feed API examples (elastic/x-pack-elasticsearch#1016) * [DOCS] Added examples for all ML job APIs * [DOCS] Add ML datafeed API examples Original commit: elastic/x-pack-elasticsearch@96343563710d3bd3ba436bb752006215cf52e9a3 2017-04-10 08:59:27 -07:00			`aggregations`::
[DOCS] Add missing info in datafeed resource (elastic/x-pack-elasticsearch#1215) Original commit: elastic/x-pack-elasticsearch@c415bc92c2f9d750a8827c9b72abd144c6cb15a6 2017-04-26 18:05:27 +01:00			`(object) When set the datafeed performs aggregation searches.`
			`For syntax information, see {ref}search-aggregations.html[Aggregations].`
			`Support for aggregations is limited: TBD.`
[DOCS] Add ML data feed API examples (elastic/x-pack-elasticsearch#1016) * [DOCS] Added examples for all ML job APIs * [DOCS] Add ML datafeed API examples Original commit: elastic/x-pack-elasticsearch@96343563710d3bd3ba436bb752006215cf52e9a3 2017-04-10 08:59:27 -07:00			`For example:`
			`{"@timestamp": {"histogram": {"field": "@timestamp",
			`"interval": 30000,"offset": 0,"order": {"_key": "asc"},"keyed": false,`
			`"min_doc_count": 0}, "aggregations": {"events_per_min": {"sum": {`
			"field": "events_per_min"}}}}}`.

[DOCS] Update all ML API examples with latest build output Original commit: elastic/x-pack-elasticsearch@f9fa3b813afc415486183895bc7168684edff0ee 2017-04-11 18:52:47 -07:00			`chunking_config`::
[DOCS] Add missing info in datafeed resource (elastic/x-pack-elasticsearch#1215) Original commit: elastic/x-pack-elasticsearch@c415bc92c2f9d750a8827c9b72abd144c6cb15a6 2017-04-26 18:05:27 +01:00			`(object) The chunking configuration, which specifies how data searches`
			`will be chunked. See <<ml-datafeed-chunking-config,chunking configuration objects>>.`
			`For example: {"mode": "manual", "time_span": "3h"}`
[DOCS] Update all ML API examples with latest build output Original commit: elastic/x-pack-elasticsearch@f9fa3b813afc415486183895bc7168684edff0ee 2017-04-11 18:52:47 -07:00
[DOCS] Add ML data feed API examples (elastic/x-pack-elasticsearch#1016) * [DOCS] Added examples for all ML job APIs * [DOCS] Add ML datafeed API examples Original commit: elastic/x-pack-elasticsearch@96343563710d3bd3ba436bb752006215cf52e9a3 2017-04-10 08:59:27 -07:00			`datafeed_id`::
[DOCS] Remove data type formatting from API pages Original commit: elastic/x-pack-elasticsearch@fb06ece3f0700c9a71d2cc06d203553f01a993cc 2017-04-11 19:26:18 -07:00			`(string) A numerical character string that uniquely identifies the data feed.`
[DOCS] Add ML data feed API examples (elastic/x-pack-elasticsearch#1016) * [DOCS] Added examples for all ML job APIs * [DOCS] Add ML datafeed API examples Original commit: elastic/x-pack-elasticsearch@96343563710d3bd3ba436bb752006215cf52e9a3 2017-04-10 08:59:27 -07:00
			`frequency`::
[DOCS] Add missing info in datafeed resource (elastic/x-pack-elasticsearch#1215) Original commit: elastic/x-pack-elasticsearch@c415bc92c2f9d750a8827c9b72abd144c6cb15a6 2017-04-26 18:05:27 +01:00			`(time units) Interval at which scheduled queries should be made while the datafeed`
			`runs in real-time. The default is either the bucket span for short bucket spans, or,`
			`for longer bucket spans, a sensible fraction of the bucket span.`
			`For example: "150s"`
[DOCS] Add ML data feed API examples (elastic/x-pack-elasticsearch#1016) * [DOCS] Added examples for all ML job APIs * [DOCS] Add ML datafeed API examples Original commit: elastic/x-pack-elasticsearch@96343563710d3bd3ba436bb752006215cf52e9a3 2017-04-10 08:59:27 -07:00
			`indexes` (required)::
[DOCS] Remove data type formatting from API pages Original commit: elastic/x-pack-elasticsearch@fb06ece3f0700c9a71d2cc06d203553f01a993cc 2017-04-11 19:26:18 -07:00			`(array) An array of index names. For example: ["it_ops_metrics"]`
[DOCS] Add ML data feed API examples (elastic/x-pack-elasticsearch#1016) * [DOCS] Added examples for all ML job APIs * [DOCS] Add ML datafeed API examples Original commit: elastic/x-pack-elasticsearch@96343563710d3bd3ba436bb752006215cf52e9a3 2017-04-10 08:59:27 -07:00
			`job_id` (required)::
[DOCS] Add missing info in datafeed resource (elastic/x-pack-elasticsearch#1215) Original commit: elastic/x-pack-elasticsearch@c415bc92c2f9d750a8827c9b72abd144c6cb15a6 2017-04-26 18:05:27 +01:00			`(string) The id of the job to which the datafeed will send data.`
[DOCS] Add ML data feed API examples (elastic/x-pack-elasticsearch#1016) * [DOCS] Added examples for all ML job APIs * [DOCS] Add ML datafeed API examples Original commit: elastic/x-pack-elasticsearch@96343563710d3bd3ba436bb752006215cf52e9a3 2017-04-10 08:59:27 -07:00
			`query`::
[DOCS] Add missing info in datafeed resource (elastic/x-pack-elasticsearch#1215) Original commit: elastic/x-pack-elasticsearch@c415bc92c2f9d750a8827c9b72abd144c6cb15a6 2017-04-26 18:05:27 +01:00			`(object) Elasticsearch query DSL. Corresponds to the query object in an Elasticsearch`
			`search POST body. All options supported by Elasticsearch may be used, as this object`
			`is passed verbatim to Elasticsearch. If not specified the default is “match_all”: {}`
[DOCS] Add ML data feed API examples (elastic/x-pack-elasticsearch#1016) * [DOCS] Added examples for all ML job APIs * [DOCS] Add ML datafeed API examples Original commit: elastic/x-pack-elasticsearch@96343563710d3bd3ba436bb752006215cf52e9a3 2017-04-10 08:59:27 -07:00			By default, this property has the following value: `{"match_all": {"boost": 1}}`.

			`query_delay`::
[DOCS] Add missing info in datafeed resource (elastic/x-pack-elasticsearch#1215) Original commit: elastic/x-pack-elasticsearch@c415bc92c2f9d750a8827c9b72abd144c6cb15a6 2017-04-26 18:05:27 +01:00			`(time units) How many seconds behind real-time should data be queried. For example,`
			`if data from 10:04am may not be searchable in Elasticsearch until 10:06am then set this to 120 seconds.`
			`The default is 60 seconds. For example: "60s"`
[DOCS] Add ML data feed API examples (elastic/x-pack-elasticsearch#1016) * [DOCS] Added examples for all ML job APIs * [DOCS] Add ML datafeed API examples Original commit: elastic/x-pack-elasticsearch@96343563710d3bd3ba436bb752006215cf52e9a3 2017-04-10 08:59:27 -07:00
			`scroll_size`::
[DOCS] Add missing info in datafeed resource (elastic/x-pack-elasticsearch#1215) Original commit: elastic/x-pack-elasticsearch@c415bc92c2f9d750a8827c9b72abd144c6cb15a6 2017-04-26 18:05:27 +01:00			(unsigned integer) The `size` parameter to be used in elasticsearch searches.
[DOCS] Add ML data feed API examples (elastic/x-pack-elasticsearch#1016) * [DOCS] Added examples for all ML job APIs * [DOCS] Add ML datafeed API examples Original commit: elastic/x-pack-elasticsearch@96343563710d3bd3ba436bb752006215cf52e9a3 2017-04-10 08:59:27 -07:00			The default value is `1000`.

			`types` (required)::
[DOCS] Add missing info in datafeed resource (elastic/x-pack-elasticsearch#1215) Original commit: elastic/x-pack-elasticsearch@c415bc92c2f9d750a8827c9b72abd144c6cb15a6 2017-04-26 18:05:27 +01:00			`(array) List of types to search for within the specified indexes.`
			`For example: ["network","sql","kpi"]`

			`[[ml-datafeed-chunking-config]]`
			`===== Chunking Configuration Objects`

			`A chunking configuration object has the following properties:`

			`mode` (required)::
			`There are 3 available modes: +`
			`auto`::: the chunk size will be dynamically calculated.
			`manual`::: chunking will be applied according to the specified `time_span`.
			`off`::: no chunking will be applied.

			`time_span`::
			`(time units) The time span that each search will be querying.`
			This setting is only applicable when the mode is set to `manual`.
			`For example: "3h"`
[DOCS] Add ML data feed API examples (elastic/x-pack-elasticsearch#1016) * [DOCS] Added examples for all ML job APIs * [DOCS] Add ML datafeed API examples Original commit: elastic/x-pack-elasticsearch@96343563710d3bd3ba436bb752006215cf52e9a3 2017-04-10 08:59:27 -07:00
[DOCS] Update all ML API examples with latest build output Original commit: elastic/x-pack-elasticsearch@f9fa3b813afc415486183895bc7168684edff0ee 2017-04-11 18:52:47 -07:00			`[float]`
[DOCS] Add ML API results examples Original commit: elastic/x-pack-elasticsearch@60a21763eb4eff50a887bef04471670587102071 2017-04-10 16:14:26 -07:00			`[[ml-datafeed-counts]]`
			`==== Data Feed Counts`
[DOCS] Add ML data feed API examples (elastic/x-pack-elasticsearch#1016) * [DOCS] Added examples for all ML job APIs * [DOCS] Add ML datafeed API examples Original commit: elastic/x-pack-elasticsearch@96343563710d3bd3ba436bb752006215cf52e9a3 2017-04-10 08:59:27 -07:00
[DOCS] Add ML API results examples Original commit: elastic/x-pack-elasticsearch@60a21763eb4eff50a887bef04471670587102071 2017-04-10 16:14:26 -07:00			`The get data feed statistics API provides information about the operational`
			`progress of a data feed. For example:`
[DOCS] Add ML data feed API examples (elastic/x-pack-elasticsearch#1016) * [DOCS] Added examples for all ML job APIs * [DOCS] Add ML datafeed API examples Original commit: elastic/x-pack-elasticsearch@96343563710d3bd3ba436bb752006215cf52e9a3 2017-04-10 08:59:27 -07:00
			`assigment_explanation`::
[DOCS] Update all ML API examples with latest build output Original commit: elastic/x-pack-elasticsearch@f9fa3b813afc415486183895bc7168684edff0ee 2017-04-11 18:52:47 -07:00			`TBD. For example: " "`

			`datafeed_id`::
[DOCS] Remove data type formatting from API pages Original commit: elastic/x-pack-elasticsearch@fb06ece3f0700c9a71d2cc06d203553f01a993cc 2017-04-11 19:26:18 -07:00			`(string) A numerical character string that uniquely identifies the data feed.`
[DOCS] Add ML data feed API examples (elastic/x-pack-elasticsearch#1016) * [DOCS] Added examples for all ML job APIs * [DOCS] Add ML datafeed API examples Original commit: elastic/x-pack-elasticsearch@96343563710d3bd3ba436bb752006215cf52e9a3 2017-04-10 08:59:27 -07:00
			`node`::
[DOCS] Remove data type formatting from API pages Original commit: elastic/x-pack-elasticsearch@fb06ece3f0700c9a71d2cc06d203553f01a993cc 2017-04-11 19:26:18 -07:00			`(object) TBD`
[DOCS] Add ML data feed API examples (elastic/x-pack-elasticsearch#1016) * [DOCS] Added examples for all ML job APIs * [DOCS] Add ML datafeed API examples Original commit: elastic/x-pack-elasticsearch@96343563710d3bd3ba436bb752006215cf52e9a3 2017-04-10 08:59:27 -07:00			`The node that is running the query?`
[DOCS] Update all ML API examples with latest build output Original commit: elastic/x-pack-elasticsearch@f9fa3b813afc415486183895bc7168684edff0ee 2017-04-11 18:52:47 -07:00			`id`::: TBD. For example, "0-o0tOoRTwKFZifatTWKNw".
			`name`::: TBD. For example, "0-o0tOo".
[DOCS] ML API docs review (elastic/x-pack-elasticsearch#1169) * [DOCS] Fix for prelertcategory * [DOCS] _preview returns a page of data * [DOCS] Added adv options e.g. background_persist_interval" * [DOCS] Clarify meanings of model_snapshot params * [DOCS] Format fixes * [DOCS] Include _all keyword * [DOCS] Explain retain. * [DOCS] Further explanations for model size limits * [DOCS] Format fixes in quick ref * [DOCS] Update for exclude_interim * [DOCS] Update for exclude_interim * [DOCS] Update for exclude_interim Original commit: elastic/x-pack-elasticsearch@cdd2fcefdd3ea7cd2b517142c1bed1d2a02775de 2017-04-24 17:31:31 +01:00			`ephemeral_id`::: TBD. For example, "DOZltLxLS_SzYpW6hQ9hyg".
			`transport_address`::: TBD. For example, "127.0.0.1:9300".
[DOCS] Update all ML API examples with latest build output Original commit: elastic/x-pack-elasticsearch@f9fa3b813afc415486183895bc7168684edff0ee 2017-04-11 18:52:47 -07:00			`attributes`::: TBD. For example, {"max_running_jobs": "10"}.
[DOCS] Add ML data feed API examples (elastic/x-pack-elasticsearch#1016) * [DOCS] Added examples for all ML job APIs * [DOCS] Add ML datafeed API examples Original commit: elastic/x-pack-elasticsearch@96343563710d3bd3ba436bb752006215cf52e9a3 2017-04-10 08:59:27 -07:00
			`state`::
[DOCS] Remove data type formatting from API pages Original commit: elastic/x-pack-elasticsearch@fb06ece3f0700c9a71d2cc06d203553f01a993cc 2017-04-11 19:26:18 -07:00			`(string) The status of the data feed, which can be one of the following values: +`
[DOCS] ML API docs review (elastic/x-pack-elasticsearch#1169) * [DOCS] Fix for prelertcategory * [DOCS] _preview returns a page of data * [DOCS] Added adv options e.g. background_persist_interval" * [DOCS] Clarify meanings of model_snapshot params * [DOCS] Format fixes * [DOCS] Include _all keyword * [DOCS] Explain retain. * [DOCS] Further explanations for model size limits * [DOCS] Format fixes in quick ref * [DOCS] Update for exclude_interim * [DOCS] Update for exclude_interim * [DOCS] Update for exclude_interim Original commit: elastic/x-pack-elasticsearch@cdd2fcefdd3ea7cd2b517142c1bed1d2a02775de 2017-04-24 17:31:31 +01:00			`started`::: The data feed is actively receiving data.
			`stopped`::: The data feed is stopped and will not receive data until it is re-started.