[role="xpack"]
[[ml-settings]]
=== Machine learning settings in Elasticsearch
++++
<titleabbrev>Machine learning settings</titleabbrev>
++++

You do not need to configure any settings to use {ml}. It is enabled by default.

All of these settings can be added to the `elasticsearch.yml` configuration file. 
The dynamic settings can also be updated across a cluster with the 
<<cluster-update-settings,cluster update settings API>>.

TIP: Dynamic settings take precedence over settings in the `elasticsearch.yml` 
file.
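
As a rough illustration, a dynamic setting such as `xpack.ml.max_anomaly_records`
could be changed by sending a body like the one built below to the cluster update
settings API (`PUT _cluster/settings`). The helper name `build_settings_update`
and the value `1000` are hypothetical; this is a sketch of the request body only,
not a client implementation.

```python
import json


def build_settings_update(settings, persistent=True):
    """Build a request body for the cluster update settings API.

    `settings` maps dynamic setting names to their new values. The
    function name is hypothetical; only the body layout matters here.
    """
    scope = "persistent" if persistent else "transient"
    return {scope: dict(settings)}


# Example value only; the documented default is 500.
body = build_settings_update({"xpack.ml.max_anomaly_records": 1000})
print(json.dumps(body, indent=2))
```

Settings that are not marked as dynamic on this page must instead be set in
`elasticsearch.yml`.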

[float]
[[general-ml-settings]]
==== General machine learning settings

`node.ml`::
Set to `true` (default) to identify the node as a _machine learning node_. +
+
If set to `false` in `elasticsearch.yml`, the node cannot run jobs. If set to
`true` but `xpack.ml.enabled` is set to `false`, the `node.ml` setting is
ignored and the node cannot run jobs. If you want to run jobs, there must be at
least one machine learning node in your cluster. +
+
IMPORTANT: On dedicated coordinating nodes or dedicated master nodes, disable
the `node.ml` role.

`xpack.ml.enabled`::
Set to `true` (default) to enable {ml} on the node. +
+
If set to `false` in `elasticsearch.yml`, the {ml} APIs are disabled on the node.
Therefore, the node cannot open jobs, start {dfeeds}, or receive transport (internal)
communication requests related to {ml} APIs. It also affects all {kib} instances
that connect to this {es} instance; you do not need to disable {ml} in those
`kibana.yml` files. For more information about disabling {ml} in specific {kib}
instances, see
{kibana-ref}/ml-settings-kb.html[{kib} Machine Learning Settings].
+
IMPORTANT: If you want to use {ml} features in your cluster, you must have
`xpack.ml.enabled` set to `true` on all master-eligible nodes. This is the
default behavior.

`xpack.ml.max_machine_memory_percent`::
The maximum percentage of the machine's memory that {ml} may use for running
analytics processes. (These processes are separate from the {es} JVM.) Defaults to
`30` percent. The limit is based on the total memory of the machine, not current
free memory. Jobs will not be allocated to a node if doing so would cause the
estimated memory use of {ml} jobs to exceed the limit.

`xpack.ml.max_model_memory_limit`::
The maximum `model_memory_limit` property value that can be set for any job on
this node. If you try to create a job with a `model_memory_limit` property value
that is greater than this setting value, an error occurs. Existing jobs are not
affected when you update this setting. For more information about the
`model_memory_limit` property, see <<ml-apilimits>>.

`xpack.ml.max_open_jobs`::
The maximum number of jobs that can run on a node. Defaults to `20`.
The maximum number of jobs is also constrained by memory usage, so fewer
jobs than specified by this setting will run on a node if the estimated
memory use of the jobs would be higher than allowed.

`xpack.ml.node_concurrent_job_allocations`::
The maximum number of jobs that can concurrently be in the `opening` state on
each node. Typically, jobs spend a small amount of time in this state before
they move to the `open` state. Jobs that must restore large models when they are
opening spend more time in the `opening` state. Defaults to `2`.
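
The following sketch shows one way the three limits above might combine when
deciding whether a node can take another job. It is a simplified illustration
that assumes the documented defaults, not the actual {es} allocation logic. The
`NodeState` class, its fields, and the function names are hypothetical.

```python
from dataclasses import dataclass


@dataclass
class NodeState:
    """Hypothetical snapshot of one machine learning node."""
    total_memory_bytes: int      # total machine memory, not free memory
    ml_memory_in_use_bytes: int  # estimated memory of jobs already assigned
    assigned_jobs: int           # jobs assigned to the node, in any state
    opening_jobs: int            # jobs currently in the `opening` state


def ml_memory_budget(total_memory_bytes, max_machine_memory_percent=30):
    """Memory that ML may use, derived from total machine memory."""
    return total_memory_bytes * max_machine_memory_percent // 100


def can_accept_job(node, job_memory_bytes,
                   max_machine_memory_percent=30,
                   max_open_jobs=20,
                   node_concurrent_job_allocations=2):
    """Return True if none of the three limits would be exceeded."""
    # xpack.ml.max_open_jobs: cap on the number of jobs per node.
    if node.assigned_jobs >= max_open_jobs:
        return False
    # xpack.ml.node_concurrent_job_allocations: cap on concurrent `opening` jobs.
    if node.opening_jobs >= node_concurrent_job_allocations:
        return False
    # xpack.ml.max_machine_memory_percent: cap on estimated ML memory use.
    budget = ml_memory_budget(node.total_memory_bytes, max_machine_memory_percent)
    return node.ml_memory_in_use_bytes + job_memory_bytes <= budget


node = NodeState(total_memory_bytes=1000, ml_memory_in_use_bytes=250,
                 assigned_jobs=3, opening_jobs=0)
print(can_accept_job(node, job_memory_bytes=50))
```

In this sketch the memory check can reject a job even when the node is well
below `xpack.ml.max_open_jobs`, which mirrors the note that fewer jobs than the
configured maximum may run on a node.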

[float]
[[advanced-ml-settings]]
==== Advanced machine learning settings

These settings are for advanced use cases; the default values are generally 
sufficient:

`xpack.ml.max_anomaly_records`:: (<<cluster-update-settings,Dynamic>>) 
The maximum number of records that are output per bucket. The default value is 
`500`.

`xpack.ml.max_lazy_ml_nodes`:: (<<cluster-update-settings,Dynamic>>)
The number of lazily spun up {ml} nodes. This setting is useful when {ml} nodes
are not desired until the first {ml} job is opened. It defaults to `0` and has a
maximum acceptable value of `3`. If the current number of {ml} nodes is greater
than or equal to this setting, it is assumed that no more lazy nodes are
available because the desired number of nodes has already been provisioned. When
a job is opened while this setting is greater than `0` and no node can accept
the job, the job stays in the `opening` state until a new {ml} node is added to
the cluster and the job is assigned to run on that node.
+
IMPORTANT: This setting assumes that some external process is capable of adding
{ml} nodes to the cluster. The setting is only useful when used in conjunction
with such an external process.
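
The behavior described for `xpack.ml.max_lazy_ml_nodes` can be sketched as the
small decision below. This is one reading of the description above, not the
actual implementation. The function names and parameters are hypothetical.

```python
def lazy_nodes_available(current_ml_nodes, max_lazy_ml_nodes=0):
    """True while fewer ML nodes exist than the setting allows for."""
    # The documented range for the setting is 0 to 3.
    if not 0 <= max_lazy_ml_nodes <= 3:
        raise ValueError("max_lazy_ml_nodes must be between 0 and 3")
    return current_ml_nodes < max_lazy_ml_nodes


def job_waits_for_lazy_node(any_node_can_accept_job, current_ml_nodes,
                            max_lazy_ml_nodes=0):
    """True if the job stays in the opening state awaiting a new ML node."""
    if any_node_can_accept_job:
        return False  # the job can be assigned right away
    return lazy_nodes_available(current_ml_nodes, max_lazy_ml_nodes)


# No ML nodes yet and one lazy node allowed: the job waits for a node.
print(job_waits_for_lazy_node(False, current_ml_nodes=0, max_lazy_ml_nodes=1))
```

With the default of `0`, `lazy_nodes_available` always returns `False`, so a
job that no node can accept never waits for a lazy node in this sketch.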