
[[cluster-reroute]]
=== Cluster reroute API
++++
<titleabbrev>Cluster reroute</titleabbrev>
++++

Changes the allocation of shards in a cluster.


[[cluster-reroute-api-request]]
==== {api-request-title}

`POST /_cluster/reroute`


[[cluster-reroute-api-desc]]
==== {api-description-title}

The reroute command allows for manual changes to the allocation of individual
shards in the cluster. For example, a shard can be moved from one node to
another explicitly, an allocation can be cancelled, and an unassigned shard can
be explicitly allocated to a specific node.

After processing any reroute commands, {es} performs rebalancing as normal
(respecting the values of settings such as `cluster.routing.rebalance.enable`)
to remain in a balanced state. For example, if the requested allocation
includes moving a shard from `node1` to `node2`, this may cause a shard to be
moved from `node2` back to `node1` to even things out.

The cluster can be set to disable allocations using the
`cluster.routing.allocation.enable` setting. If allocations are disabled then 
the only allocations that will be performed are explicit ones given using the
`reroute` command, and consequent allocations due to rebalancing.
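
As a minimal sketch of the setup described above, the following request
disables automatic allocation through the cluster settings API. The value
`none` and the choice of a `persistent` setting are assumptions for
illustration; pick the value and scope that suit your cluster.

[source,console]
--------------------------------------------------
PUT /_cluster/settings
{
    "persistent" : {
        "cluster.routing.allocation.enable" : "none"
    }
}
--------------------------------------------------
// TEST[skip:illustrative example that changes cluster-wide settings]

With this setting in place, shards are only allocated by explicit `reroute`
commands and by any rebalancing that follows from them.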

It is possible to run `reroute` commands in "dry run" mode by using the
`?dry_run` URI query parameter, or by passing `"dry_run": true` in the request
body. This calculates the result of applying the commands to the current
cluster state and returns the resulting cluster state after the commands (and
rebalancing) have been applied, but does not actually perform the requested
changes.
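
A dry run might look like the following sketch. The index name `test` and the
node names `node1` and `node2` are placeholders; substitute values from your
own cluster.

[source,console]
--------------------------------------------------
POST /_cluster/reroute?dry_run=true
{
    "commands" : [
        {
            "move" : {
                "index" : "test", "shard" : 0,
                "from_node" : "node1", "to_node" : "node2"
            }
        }
    ]
}
--------------------------------------------------
// TEST[skip:illustrative example with hypothetical node names]

The same simulation can be requested by omitting the query parameter and
adding `"dry_run": true` as a top-level field of the request body.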

If the `?explain` URI query parameter is included then a detailed explanation
of why the commands could or could not be executed is included in the response.
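
For instance, the following sketch asks for an explanation of a single
`allocate_replica` command. The index name `test` and the node name `node3`
are placeholders.

[source,console]
--------------------------------------------------
POST /_cluster/reroute?explain=true
{
    "commands" : [
        {
            "allocate_replica" : {
                "index" : "test", "shard" : 1,
                "node" : "node3"
            }
        }
    ]
}
--------------------------------------------------
// TEST[skip:illustrative example with hypothetical node names]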

The cluster will attempt to allocate a shard a maximum of
`index.allocation.max_retries` times in a row (defaults to `5`), before giving
up and leaving the shard unallocated. This scenario can be caused by
structural problems such as having an analyzer which refers to a stopwords
file which doesn't exist on all nodes.

Once the problem has been corrected, allocation can be manually retried by
calling the `reroute` API with the `?retry_failed` URI
query parameter, which will attempt a single retry round for these shards.
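
A retry round can be requested without a request body, as in this minimal
sketch:

[source,console]
--------------------------------------------------
POST /_cluster/reroute?retry_failed=true
--------------------------------------------------
// TEST[skip:illustrative example that needs shards with failed allocations]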


[[cluster-reroute-api-query-params]]
==== {api-query-parms-title}

`dry_run`::
    (Optional, boolean) If `true`, then the request simulates the operation only 
    and returns the resulting state.
    
`explain`::
    (Optional, boolean) If `true`, then the response contains an explanation of 
    why the commands can or cannot be executed.

`metric`::
    (Optional, string) Limits the information returned to the specified metrics.
    Defaults to all but metadata. The following options are available:

+
--
`_all`::
    Shows all metrics.
        
`blocks`::
    Shows the `blocks` part of the response.

`master_node`::
    Shows the elected `master_node` part of the response.
        
`metadata`::
    Shows the `metadata` part of the response. If you supply a comma-separated
    list of indices, the returned output will only contain metadata for these
    indices.

`nodes`::
    Shows the `nodes` part of the response.

`routing_table`::
    Shows the `routing_table` part of the response.
        
`version`::
    Shows the cluster state version.
--

`retry_failed`::
    (Optional, boolean) If `true`, then retries allocation of shards that are
    blocked due to too many consecutive allocation failures.

include::{docdir}/rest-api/common-parms.asciidoc[tag=timeoutparms]
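
Query parameters can be combined. The following sketch simulates a move and
uses `metric` to limit the returned cluster state to the routing table. The
index and node names are placeholders.

[source,console]
--------------------------------------------------
POST /_cluster/reroute?dry_run=true&metric=routing_table
{
    "commands" : [
        {
            "move" : {
                "index" : "test", "shard" : 0,
                "from_node" : "node1", "to_node" : "node2"
            }
        }
    ]
}
--------------------------------------------------
// TEST[skip:illustrative example with hypothetical node names]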


[[cluster-reroute-api-request-body]]
==== {api-request-body-title}

`commands`::
    (Required, array of objects) Defines the commands to perform. Supported
    commands are:

+
--
`move`::
    Move a started shard from one node to another node. Accepts `index` and 
    `shard` for index name and shard number, `from_node` for the node to move 
    the shard from, and `to_node` for the node to move the shard to.

`cancel`::
    Cancel allocation of a shard (or recovery). Accepts `index` and `shard` for
    index name and shard number, and `node` for the node to cancel the shard
    allocation on. This can be used to force resynchronization of existing
    replicas from the primary shard by cancelling them and allowing them to be
    reinitialized through the standard recovery process. By default only
    replica shard allocations can be cancelled. If it is necessary to cancel
    the allocation of a primary shard then the `allow_primary` flag must also
    be included in the request.

`allocate_replica`::
    Allocate an unassigned replica shard to a node. Accepts `index` and `shard`
    for index name and shard number, and `node` to allocate the shard to. Takes
    <<modules-cluster,allocation deciders>> into account.
--
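
As a sketch of the `cancel` command described above, the following request
cancels a replica allocation so that the replica is rebuilt from the primary
through normal recovery. The index name `test` and the node name `node2` are
placeholders. Because `allow_primary` is not set, the request only applies to
a replica shard.

[source,console]
--------------------------------------------------
POST /_cluster/reroute
{
    "commands" : [
        {
            "cancel" : {
                "index" : "test", "shard" : 0,
                "node" : "node2"
            }
        }
    ]
}
--------------------------------------------------
// TEST[skip:illustrative example with hypothetical node names]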

Two more commands are available that allow the allocation of a primary shard to
a node. Use these commands with extreme care, because primary shard allocation
is usually handled fully automatically by {es}. Reasons why a primary shard
cannot be automatically allocated include the following:

- A new index was created but there is no node which satisfies the allocation
  deciders.
- An up-to-date shard copy of the data cannot be found on the current data
  nodes in the cluster. To prevent data loss, the system does not automatically
  promote a stale shard copy to primary.

The following two commands are dangerous and may result in data loss. They are
meant to be used in cases where the original data cannot be recovered and the
cluster administrator accepts the loss. If you have suffered a temporary issue
that can be fixed, please see the `retry_failed` flag described above. To
emphasise: if these commands are performed and then a node joins the cluster
that holds a copy of the affected shard then the copy on the newly-joined node
will be deleted or overwritten.

`allocate_stale_primary`::
    Allocate a primary shard to a node that holds a stale copy. Accepts the
    `index` and `shard` for index name and shard number, and `node` to allocate
    the shard to. Using this command may lead to data loss for the provided
    shard ID. If a node which has the good copy of the data rejoins the cluster
    later on, that data will be deleted or overwritten with the data of the
    stale copy that was forcefully allocated with this command. To ensure that
    these implications are well-understood, this command requires the flag
    `accept_data_loss` to be explicitly set to `true`.

`allocate_empty_primary`::
    Allocate an empty primary shard to a node. Accepts the `index` and `shard`
    for index name and shard number, and `node` to allocate the shard to. Using
    this command leads to a complete loss of all data that was indexed into
    this shard, if it was previously started. If a node which has a copy of the
    data rejoins the cluster later on, that data will be deleted. To ensure
    that these implications are well-understood, this command requires the flag
    `accept_data_loss` to be explicitly set to `true`.
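
The following sketch shows the shape of an `allocate_stale_primary` request.
The index name `test` and the node name `node1` are placeholders, and the
request is shown for illustration only: running it against a real cluster can
discard data, as described above. An `allocate_empty_primary` command takes
the same fields.

[source,console]
--------------------------------------------------
POST /_cluster/reroute
{
    "commands" : [
        {
            "allocate_stale_primary" : {
                "index" : "test", "shard" : 0,
                "node" : "node1",
                "accept_data_loss" : true
            }
        }
    ]
}
--------------------------------------------------
// TEST[skip:illustrative example of a destructive command]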


[[cluster-reroute-api-example]]
==== {api-examples-title}

This is a short example of a simple reroute API call:

[source,console]
--------------------------------------------------
POST /_cluster/reroute
{
    "commands" : [
        {
            "move" : {
                "index" : "test", "shard" : 0,
                "from_node" : "node1", "to_node" : "node2"
            }
        },
        {
            "allocate_replica" : {
                "index" : "test", "shard" : 1,
                "node" : "node3"
            }
        }
    ]
}
--------------------------------------------------
// TEST[skip:doc tests run with only a single node]