OpenSearch/docs/reference/cat/health.asciidoc

[[cat-health]]
=== cat health

Returns the health status of a cluster, similar to the <<cluster-health,cluster
health>> API.


[[cat-health-api-request]]
==== {api-request-title}

`GET /_cat/health`


[[cat-health-api-desc]]
==== {api-description-title}

You can use the cat health API to get the health status of a cluster.

[[timestamp]]
This API is often used to check malfunctioning clusters. To help you
track cluster health alongside log files and alerting systems, the API returns
timestamps in two formats:

* `HH:MM:SS`, which is human-readable but includes no date information.
* https://en.wikipedia.org/wiki/Unix_time[Unix `epoch` time], which is
machine-sortable and includes date information. This is useful for cluster
recoveries that take multiple days.

You can use the cat health API to verify cluster health across multiple nodes.
See <<cat-health-api-example-across-nodes>>.

You also can use the API to track the recovery of a large cluster
over a longer period of time. See <<cat-health-api-example-large-cluster>>.


[[cat-health-api-query-params]]
==== {api-query-parms-title}

include::{docdir}/rest-api/common-parms.asciidoc[tag=http-format]

include::{docdir}/rest-api/common-parms.asciidoc[tag=cat-h]

include::{docdir}/rest-api/common-parms.asciidoc[tag=help]

include::{docdir}/rest-api/common-parms.asciidoc[tag=local]

include::{docdir}/rest-api/common-parms.asciidoc[tag=master-timeout]

include::{docdir}/rest-api/common-parms.asciidoc[tag=cat-s]

`ts` (timestamps)::
(Optional, boolean) If `true`, returns `HH:MM:SS` and
https://en.wikipedia.org/wiki/Unix_time[Unix `epoch`] timestamps. Defaults to
`true`.

include::{docdir}/rest-api/common-parms.asciidoc[tag=cat-v]


[[cat-health-api-example]]
==== {api-examples-title}

[[cat-health-api-example-timestamp]]
===== Example with a timestamp
By default, the cat health API returns `HH:MM:SS` and
https://en.wikipedia.org/wiki/Unix_time[Unix `epoch`] timestamps. For example:

[source,js]
--------------------------------------------------
GET /_cat/health?v
--------------------------------------------------
// CONSOLE
// TEST[s/^/PUT twitter\n{"settings":{"number_of_replicas": 0}}\n/]

The API returns the following response:

[source,txt]
--------------------------------------------------
epoch      timestamp cluster       status node.total node.data shards pri relo init unassign pending_tasks max_task_wait_time active_shards_percent
1475871424 16:17:04  elasticsearch green           1         1      1   1    0    0        0             0                  -                100.0%
--------------------------------------------------
// TESTRESPONSE[s/1475871424 16:17:04/\\d+ \\d+:\\d+:\\d+/]
// TESTRESPONSE[s/elasticsearch/[^ ]+/ s/0                  -/\\d+ (-|\\d+(\\.\\d+)?[ms]+)/ non_json]

[[cat-health-api-example-no-timestamp]]
===== Example without a timestamp
You can use the `ts` (timestamps) parameter to disable timestamps. For example:

[source,js]
--------------------------------------------------
GET /_cat/health?v&ts=false
--------------------------------------------------
// CONSOLE
// TEST[s/^/PUT twitter\n{"settings":{"number_of_replicas": 0}}\n/]

The API returns the following response:

[source,txt]
--------------------------------------------------
cluster       status node.total node.data shards pri relo init unassign pending_tasks max_task_wait_time active_shards_percent
elasticsearch green           1         1      1   1    0    0        0             0                  -                100.0%
--------------------------------------------------
// TESTRESPONSE[s/elasticsearch/[^ ]+/ s/0                  -/\\d+ (-|\\d+(\\.\\d+)?[ms]+)/ non_json]

[[cat-health-api-example-across-nodes]]
===== Example across nodes
You can use the cat health API to verify the health of a cluster across nodes.
For example:

[source,sh]
--------------------------------------------------
% pssh -i -h list.of.cluster.hosts curl -s localhost:9200/_cat/health
[1] 20:20:52 [SUCCESS] es3.vm
1384309218 18:20:18 foo green 3 3 3 3 0 0 0 0
[2] 20:20:52 [SUCCESS] es1.vm
1384309218 18:20:18 foo green 3 3 3 3 0 0 0 0
[3] 20:20:52 [SUCCESS] es2.vm
1384309218 18:20:18 foo green 3 3 3 3 0 0 0 0
--------------------------------------------------
// NOTCONSOLE

[[cat-health-api-example-large-cluster]]
===== Example with a large cluster
You can use the cat health API to track the recovery of a large cluster over a
longer period of time. You can do this by including the cat health API request
in a delayed loop. For example:

[source,sh]
--------------------------------------------------
% while true; do curl localhost:9200/_cat/health; sleep 120; done
1384309446 18:24:06 foo red 3 3 20 20 0 0 1812 0
1384309566 18:26:06 foo yellow 3 3 950 916 0 12 870 0
1384309686 18:28:06 foo yellow 3 3 1328 916 0 12 492 0
1384309806 18:30:06 foo green 3 3 1832 916 4 0 0
^C
--------------------------------------------------
// NOTCONSOLE

In this example, the recovery took roughly six minutes, from `18:24:06` to
`18:30:06`. If this recovery took hours, you could continue to monitor the
number of `UNASSIGNED` shards, which should drop. If the number of `UNASSIGNED`
shards remains static, it would indicate an issue with the cluster recovery.
First pass at cat docs. 2013-11-14 20:14:39 -05:00			`[[cat-health]]`
[DOCS] Remove heading offsets for REST APIs (#44568) Several files in the REST APIs nav section are included using :leveloffset: tags. This increments headings (h2 -> h3, h3 -> h4, etc.) in those files and removes the :leveloffset: tags. Other supporting changes: * Alphabetizes top-level REST API nav items. * Change 'indices APIs' heading to 'index APIs.' * Changes 'Snapshot lifecycle management' heading to sentence case. 2019-07-19 14:35:36 -04:00			`=== cat health`
First pass at cat docs. 2013-11-14 20:14:39 -05:00
[DOCS] Reformat cat health API (#45218) 2019-08-06 08:40:52 -04:00			`Returns the health status of a cluster, similar to the <<cluster-health,cluster`
			`health>> API.`


			`[[cat-health-api-request]]`
			`==== {api-request-title}`

			`GET /_cat/health`


			`[[cat-health-api-desc]]`
			`==== {api-description-title}`

			`You can use the cat health API to get the health status of a cluster.`

			`[[timestamp]]`
			`This API is often used to check malfunctioning clusters. To help you`
			`track cluster health alongside log files and alerting systems, the API returns`
			`timestamps in two formats:`

			* `HH:MM:SS`, which is human-readable but includes no date information.
			* https://en.wikipedia.org/wiki/Unix_time[Unix `epoch` time], which is
			`machine-sortable and includes date information. This is useful for cluster`
			`recoveries that take multiple days.`

			`You can use the cat health API to verify cluster health across multiple nodes.`
			`See <<cat-health-api-example-across-nodes>>.`

			`You also can use the API to track the recovery of a large cluster`
			`over a longer period of time. See <<cat-health-api-example-large-cluster>>.`


			`[[cat-health-api-query-params]]`
			`==== {api-query-parms-title}`

			`include::{docdir}/rest-api/common-parms.asciidoc[tag=http-format]`

			`include::{docdir}/rest-api/common-parms.asciidoc[tag=cat-h]`

			`include::{docdir}/rest-api/common-parms.asciidoc[tag=help]`

			`include::{docdir}/rest-api/common-parms.asciidoc[tag=local]`

			`include::{docdir}/rest-api/common-parms.asciidoc[tag=master-timeout]`

			`include::{docdir}/rest-api/common-parms.asciidoc[tag=cat-s]`

			`ts` (timestamps)::
			(Optional, boolean) If `true`, returns `HH:MM:SS` and
			https://en.wikipedia.org/wiki/Unix_time[Unix `epoch`] timestamps. Defaults to
			`true`.

			`include::{docdir}/rest-api/common-parms.asciidoc[tag=cat-v]`


			`[[cat-health-api-example]]`
			`==== {api-examples-title}`

			`[[cat-health-api-example-timestamp]]`
			`===== Example with a timestamp`
			By default, the cat health API returns `HH:MM:SS` and
			https://en.wikipedia.org/wiki/Unix_time[Unix `epoch`] timestamps. For example:
First pass at cat docs. 2013-11-14 20:14:39 -05:00
CONSOLEify some _cat docs `/_cat/count`, `/_cat/fielddata`, and `/_cat/health`. Three more files down, 141 to go. Relates to #18160 2016-10-07 16:28:49 -04:00			`[source,js]`
			`--------------------------------------------------`
			`GET /_cat/health?v`
			`--------------------------------------------------`
			`// CONSOLE`
			`// TEST[s/^/PUT twitter\n{"settings":{"number_of_replicas": 0}}\n/]`

[DOCS] Reformat cat health API (#45218) 2019-08-06 08:40:52 -04:00			`The API returns the following response:`

Convert more docs to // CONSOLE Converts docs for `_cat/segments`, `_cat/plugins` and `_cat/repositories` from `curl` to `// CONSOLE` so they are tested as part of the build and are cleaner to use in Console. They should work fine with `curl` with the `COPY AS CURL` link. Also swaps the `source` type of the response from `js` to `txt` because that is more correct. The syntax highlighter doesn't care. It looks at the text to figure out the language. So it looks a little funny for `_cat` responses regardless. Relates to #18160 2016-10-25 10:56:30 -04:00			`[source,txt]`
CONSOLEify some _cat docs `/_cat/count`, `/_cat/fielddata`, and `/_cat/health`. Three more files down, 141 to go. Relates to #18160 2016-10-07 16:28:49 -04:00			`--------------------------------------------------`
Don't mind pending cluster tasks in docs build This removes an assertion that the cluster doesn't have any pending cluster state tasks from the `_cat/health` docs. Relates to #18160 2016-10-11 17:42:43 -04:00			`epoch timestamp cluster status node.total node.data shards pri relo init unassign pending_tasks max_task_wait_time active_shards_percent`
Default to one shard (#30539) This commit changes the default out-of-the-box configuration for the number of shards from five to one. We think this will help address a common problem of oversharding. For users with time-based indices that need a different default, this can be managed with index templates. For users with non-time-based indices that find they need to re-shard with the split API in place they no longer need to resort only to reindexing. Since this has the impact of changing the default number of shards used in REST tests, we want to ensure that we still have coverage for issues that could arise from multiple shards. As such, we randomize (rarely) the default number of shards in REST tests to two. This is managed via a global index template. However, some tests check the templates that are in the cluster state during the test. Since this template is randomly there, we need a way for tests to skip adding the template used to set the number of shards to two. For this we add the default_shards feature skip. To avoid having to write our docs in a complicated way because sometimes they might be behind one shard, and sometimes they might be behind two shards we apply the default_shards feature skip to all docs tests. That is, these tests will always run with the default number of shards (one). 2018-05-14 12:22:35 -04:00			`1475871424 16:17:04 elasticsearch green 1 1 1 1 0 0 0 0 - 100.0%`
CONSOLEify some _cat docs `/_cat/count`, `/_cat/fielddata`, and `/_cat/health`. Three more files down, 141 to go. Relates to #18160 2016-10-07 16:28:49 -04:00			`--------------------------------------------------`
Docs tests: cat/health can have max_task_wait_time Make the doc test assertions ok with a non `-` value for `max_task_wait_time`. These are rare, but possible: https://elasticsearch-ci.elastic.co/job/elastic+elasticsearch+master+multijob-unix-compatibility/os=oraclelinux/900/consoleFull 2017-04-28 09:56:19 -04:00			`// TESTRESPONSE[s/1475871424 16:17:04/\\d+ \\d+:\\d+:\\d+/]`
[DOCS] Change `// TESTRESPONSE[_cat]` to `// TESTRESPONSE[non_json]` (#43006) 2019-06-10 09:33:32 -04:00			`// TESTRESPONSE[s/elasticsearch/[^ ]+/ s/0 -/\\d+ (-\|\\d+(\\.\\d+)?[ms]+)/ non_json]`
CONSOLEify some _cat docs `/_cat/count`, `/_cat/fielddata`, and `/_cat/health`. Three more files down, 141 to go. Relates to #18160 2016-10-07 16:28:49 -04:00
[DOCS] Reformat cat health API (#45218) 2019-08-06 08:40:52 -04:00			`[[cat-health-api-example-no-timestamp]]`
			`===== Example without a timestamp`
			You can use the `ts` (timestamps) parameter to disable timestamps. For example:
CONSOLEify some _cat docs `/_cat/count`, `/_cat/fielddata`, and `/_cat/health`. Three more files down, 141 to go. Relates to #18160 2016-10-07 16:28:49 -04:00
			`[source,js]`
			`--------------------------------------------------`
Make boolean conversion strict (#22200) This PR removes all leniency in the conversion of Strings to booleans: "true" is converted to the boolean value `true`, "false" is converted to the boolean value `false`. Everything else raises an error. 2017-01-19 01:59:18 -05:00			`GET /_cat/health?v&ts=false`
CONSOLEify some _cat docs `/_cat/count`, `/_cat/fielddata`, and `/_cat/health`. Three more files down, 141 to go. Relates to #18160 2016-10-07 16:28:49 -04:00			`--------------------------------------------------`
			`// CONSOLE`
			`// TEST[s/^/PUT twitter\n{"settings":{"number_of_replicas": 0}}\n/]`

[DOCS] Reformat cat health API (#45218) 2019-08-06 08:40:52 -04:00			`The API returns the following response:`
CONSOLEify some _cat docs `/_cat/count`, `/_cat/fielddata`, and `/_cat/health`. Three more files down, 141 to go. Relates to #18160 2016-10-07 16:28:49 -04:00
Convert more docs to // CONSOLE Converts docs for `_cat/segments`, `_cat/plugins` and `_cat/repositories` from `curl` to `// CONSOLE` so they are tested as part of the build and are cleaner to use in Console. They should work fine with `curl` with the `COPY AS CURL` link. Also swaps the `source` type of the response from `js` to `txt` because that is more correct. The syntax highlighter doesn't care. It looks at the text to figure out the language. So it looks a little funny for `_cat` responses regardless. Relates to #18160 2016-10-25 10:56:30 -04:00			`[source,txt]`
First pass at cat docs. 2013-11-14 20:14:39 -05:00			`--------------------------------------------------`
Don't mind pending cluster tasks in docs build This removes an assertion that the cluster doesn't have any pending cluster state tasks from the `_cat/health` docs. Relates to #18160 2016-10-11 17:42:43 -04:00			`cluster status node.total node.data shards pri relo init unassign pending_tasks max_task_wait_time active_shards_percent`
Default to one shard (#30539) This commit changes the default out-of-the-box configuration for the number of shards from five to one. We think this will help address a common problem of oversharding. For users with time-based indices that need a different default, this can be managed with index templates. For users with non-time-based indices that find they need to re-shard with the split API in place they no longer need to resort only to reindexing. Since this has the impact of changing the default number of shards used in REST tests, we want to ensure that we still have coverage for issues that could arise from multiple shards. As such, we randomize (rarely) the default number of shards in REST tests to two. This is managed via a global index template. However, some tests check the templates that are in the cluster state during the test. Since this template is randomly there, we need a way for tests to skip adding the template used to set the number of shards to two. For this we add the default_shards feature skip. To avoid having to write our docs in a complicated way because sometimes they might be behind one shard, and sometimes they might be behind two shards we apply the default_shards feature skip to all docs tests. That is, these tests will always run with the default number of shards (one). 2018-05-14 12:22:35 -04:00			`elasticsearch green 1 1 1 1 0 0 0 0 - 100.0%`
First pass at cat docs. 2013-11-14 20:14:39 -05:00			`--------------------------------------------------`
[DOCS] Change `// TESTRESPONSE[_cat]` to `// TESTRESPONSE[non_json]` (#43006) 2019-06-10 09:33:32 -04:00			`// TESTRESPONSE[s/elasticsearch/[^ ]+/ s/0 -/\\d+ (-\|\\d+(\\.\\d+)?[ms]+)/ non_json]`
First pass at cat docs. 2013-11-14 20:14:39 -05:00
[DOCS] Reformat cat health API (#45218) 2019-08-06 08:40:52 -04:00			`[[cat-health-api-example-across-nodes]]`
			`===== Example across nodes`
			`You can use the cat health API to verify the health of a cluster across nodes.`
			`For example:`
First pass at cat docs. 2013-11-14 20:14:39 -05:00
Docs: Use "js" instead of "json" and "sh" instead of "shell" for source highlighting 2015-07-14 12:14:09 -04:00			`[source,sh]`
First pass at cat docs. 2013-11-14 20:14:39 -05:00			`--------------------------------------------------`
			`% pssh -i -h list.of.cluster.hosts curl -s localhost:9200/_cat/health`
			`[1] 20:20:52 [SUCCESS] es3.vm`
API: add pending tasks count to cluster health The number of current pending tasks is useful to detect and overloaded master. This commit adds it to the cluster health API. The complete list can be retrieved from the dedicated pending tasks API. It also adds rest tests for the cluster health variants. Closes #9877 2015-02-25 07:25:52 -05:00			`1384309218 18:20:18 foo green 3 3 3 3 0 0 0 0`
First pass at cat docs. 2013-11-14 20:14:39 -05:00			`[2] 20:20:52 [SUCCESS] es1.vm`
API: add pending tasks count to cluster health The number of current pending tasks is useful to detect and overloaded master. This commit adds it to the cluster health API. The complete list can be retrieved from the dedicated pending tasks API. It also adds rest tests for the cluster health variants. Closes #9877 2015-02-25 07:25:52 -05:00			`1384309218 18:20:18 foo green 3 3 3 3 0 0 0 0`
First pass at cat docs. 2013-11-14 20:14:39 -05:00			`[3] 20:20:52 [SUCCESS] es2.vm`
API: add pending tasks count to cluster health The number of current pending tasks is useful to detect and overloaded master. This commit adds it to the cluster health API. The complete list can be retrieved from the dedicated pending tasks API. It also adds rest tests for the cluster health variants. Closes #9877 2015-02-25 07:25:52 -05:00			`1384309218 18:20:18 foo green 3 3 3 3 0 0 0 0`
First pass at cat docs. 2013-11-14 20:14:39 -05:00			`--------------------------------------------------`
CONSOLEify some _cat docs `/_cat/count`, `/_cat/fielddata`, and `/_cat/health`. Three more files down, 141 to go. Relates to #18160 2016-10-07 16:28:49 -04:00			`// NOTCONSOLE`
First pass at cat docs. 2013-11-14 20:14:39 -05:00
[DOCS] Reformat cat health API (#45218) 2019-08-06 08:40:52 -04:00			`[[cat-health-api-example-large-cluster]]`
			`===== Example with a large cluster`
			`You can use the cat health API to track the recovery of a large cluster over a`
			`longer period of time. You can do this by including the cat health API request`
			`in a delayed loop. For example:`
First pass at cat docs. 2013-11-14 20:14:39 -05:00
Docs: Use "js" instead of "json" and "sh" instead of "shell" for source highlighting 2015-07-14 12:14:09 -04:00			`[source,sh]`
First pass at cat docs. 2013-11-14 20:14:39 -05:00			`--------------------------------------------------`
Update health.asciidoc Changing network address of curl commands to "localhost" instead of 192.x.x.x 2015-11-04 12:00:41 -05:00			`% while true; do curl localhost:9200/_cat/health; sleep 120; done`
API: add pending tasks count to cluster health The number of current pending tasks is useful to detect and overloaded master. This commit adds it to the cluster health API. The complete list can be retrieved from the dedicated pending tasks API. It also adds rest tests for the cluster health variants. Closes #9877 2015-02-25 07:25:52 -05:00			`1384309446 18:24:06 foo red 3 3 20 20 0 0 1812 0`
			`1384309566 18:26:06 foo yellow 3 3 950 916 0 12 870 0`
			`1384309686 18:28:06 foo yellow 3 3 1328 916 0 12 492 0`
First pass at cat docs. 2013-11-14 20:14:39 -05:00			`1384309806 18:30:06 foo green 3 3 1832 916 4 0 0`
			`^C`
			`--------------------------------------------------`
CONSOLEify some _cat docs `/_cat/count`, `/_cat/fielddata`, and `/_cat/health`. Three more files down, 141 to go. Relates to #18160 2016-10-07 16:28:49 -04:00			`// NOTCONSOLE`
First pass at cat docs. 2013-11-14 20:14:39 -05:00
[DOCS] Reformat cat health API (#45218) 2019-08-06 08:40:52 -04:00			In this example, the recovery took roughly six minutes, from `18:24:06` to
			`18:30:06`. If this recovery took hours, you could continue to monitor the
			number of `UNASSIGNED` shards, which should drop. If the number of `UNASSIGNED`
			`shards remains static, it would indicate an issue with the cluster recovery.`