mirror of
https://github.com/honeymoose/OpenSearch.git
synced 2025-02-08 05:58:44 +00:00
8cb6e63a0e
Following the changes of elastic/x-pack-elasticsearch#2975 the hard limit on the number of ML jobs per node is no longer the only limiting factor. Additionally there is now a limit based on the estimated memory usage of the jobs, and this is expected to provide a more sensible limit that accounts for differing resource requirements per job. As a result, it makes sense to raise the default for the hard limit on the number of jobs, on the assumption that the memory limit will prevent the node becoming overloaded if an attempt is made to run many large jobs. Increasing the hard limit will allow more small jobs to be run than was previously the case by default. Of course, this change to the default will have no effect for customers who have already overridden the default in their config files. Original commit: elastic/x-pack-elasticsearch@9fed1d1237
59 lines
2.6 KiB
Plaintext
59 lines
2.6 KiB
Plaintext
[role="xpack"]
|
|
[[ml-settings]]
|
|
=== Machine Learning Settings in Elasticsearch
|
|
++++
|
|
<titleabbrev>Machine Learning Settings</titleabbrev>
|
|
++++
|
|
|
|
You do not need to configure any settings to use {ml}. It is enabled by default.
|
|
|
|
[float]
|
|
[[general-ml-settings]]
|
|
==== General Machine Learning Settings
|
|
|
|
`node.ml`::
|
|
Set to `true` (default) to identify the node as a _machine learning node_. +
|
|
+
|
|
If set to `false` in `elasticsearch.yml`, the node cannot run jobs. If set to
|
|
`true` but `xpack.ml.enabled` is set to `false`, the `node.ml` setting is
|
|
ignored and the node cannot run jobs. If you want to run jobs, there must be at
|
|
least one machine learning node in your cluster. +
|
|
+
|
|
IMPORTANT: On dedicated coordinating nodes or dedicated master nodes, disable
|
|
the `node.ml` role.
|
|
|
|
`xpack.ml.enabled`::
|
|
Set to `true` (default) to enable {ml} on the node. +
|
|
+
|
|
If set to `false` in `elasticsearch.yml`, the {ml} APIs are disabled on the node.
|
|
Therefore the node cannot open jobs, start {dfeeds}, or receive transport (internal)
|
|
communication requests related to {ml} APIs. It also affects all {kib} instances
|
|
that connect to this {es} instance; you do not need to disable {ml} in those
|
|
`kibana.yml` files. For more information about disabling {ml} in specific {kib}
|
|
instances, see
|
|
{kibana-ref}/ml-settings-kb.html[{kib} Machine Learning Settings].
|
|
+
|
|
IMPORTANT: If you want to use {ml} features in your cluster, you must have
|
|
`xpack.ml.enabled` set to `true` on all master-eligible nodes. This is the
|
|
default behavior.
|
|
|
|
`xpack.ml.max_open_jobs`::
|
|
The maximum number of jobs that can run on a node. Defaults to `20`.
|
|
The maximum number of jobs is also constrained by memory usage, so fewer
|
|
jobs than specified by this setting will run on a node if the estimated
|
|
memory use of the jobs would be higher than allowed.
|
|
|
|
`xpack.ml.max_machine_memory_percent`::
|
|
The maximum percentage of the machine's memory that {ml} may use for running
|
|
analytics processes. (These processes are separate to the {es} JVM.) Defaults to
|
|
`30` percent. The limit is based on the total memory of the machine, not current
|
|
free memory. Jobs will not be allocated to a node if doing so would cause the
|
|
estimated memory use of {ml} jobs to exceed the limit.
|
|
|
|
`xpack.ml.max_model_memory_limit`::
|
|
The maximum `model_memory_limit` property value that can be set for any job on
|
|
this node. If you try to create a job with a `model_memory_limit` property value
|
|
that is greater than this setting value, an error occurs. Existing jobs are not
|
|
affected when you update this setting. For more information about the
|
|
`model_memory_limit` property, see <<ml-apilimits>>.
|