Add docs for filesystem health checks (#59134)
Documents the feature and settings introduced in #52680. Co-authored-by: James Rodewig <james.rodewig@elastic.co>
This commit is contained in:
parent
664b546771
commit
8f4f844e6e
|
@ -245,3 +245,22 @@ WARNING: This setting replaces the `discovery.zen.no_master_block` setting in
|
||||||
earlier versions. The `discovery.zen.no_master_block` setting is ignored.
|
earlier versions. The `discovery.zen.no_master_block` setting is ignored.
|
||||||
|
|
||||||
--
|
--
|
||||||
|
|
||||||
|
`monitor.fs.health.enabled`::
|
||||||
|
|
||||||
|
(<<cluster-update-settings,Dynamic>>, boolean) If `true`, the node runs
|
||||||
|
periodic <<cluster-fault-detection-filesystem-health,filesystem health
|
||||||
|
checks>>. Defaults to `true`.
|
||||||
|
|
||||||
|
`monitor.fs.health.refresh_interval`::
|
||||||
|
|
||||||
|
(<<time-units, Time value>>) Interval between successive
|
||||||
|
<<cluster-fault-detection-filesystem-health,filesystem health checks>>.
|
||||||
|
Defaults to `2m`.
|
||||||
|
|
||||||
|
`monitor.fs.health.slow_path_logging_threshold`::
|
||||||
|
|
||||||
|
(<<time-units, Time value>>) If a
|
||||||
|
<<cluster-fault-detection-filesystem-health,filesystem health checks>>
|
||||||
|
takes longer than this threshold then {es} logs a warning. Defaults to
|
||||||
|
`5s`.
|
||||||
|
|
|
@ -18,3 +18,10 @@ Similarly, if a node detects that the elected master has disconnected, this
|
||||||
situation is treated as an immediate failure. The node bypasses the timeout and
|
situation is treated as an immediate failure. The node bypasses the timeout and
|
||||||
retry settings and restarts its discovery phase to try and find or elect a new
|
retry settings and restarts its discovery phase to try and find or elect a new
|
||||||
master.
|
master.
|
||||||
|
|
||||||
|
[[cluster-fault-detection-filesystem-health]]
|
||||||
|
Additionally, each node periodically verifies that its data path is healthy by
|
||||||
|
writing a small file to disk and then deleting it again. If a node discovers
|
||||||
|
its data path is unhealthy then it is removed from the cluster until the data
|
||||||
|
path recovers. You can control this behavior with the
|
||||||
|
<<modules-discovery-settings,`monitor.fs.health` settings>>.
|
||||||
|
|
Loading…
Reference in New Issue