OpenSearch/docs/reference/modules/cluster/allocation_awareness.asciidoc

[[allocation-awareness]]
=== Shard allocation awareness

You can use custom node attributes as _awareness attributes_ to enable {es}
to take your physical hardware configuration into account when allocating shards.
If {es} knows which nodes are on the same physical server, in the same rack, or
in the same zone, it can distribute the primary shard and its replica shards to
minimise the risk of losing all shard copies in the event of a failure.

When shard allocation awareness is enabled with the
`cluster.routing.allocation.awareness.attributes` setting, shards are only
allocated to nodes that have values set for the specified awareness
attributes. If you use multiple awareness attributes, {es} considers
each attribute separately when allocating shards.

The allocation awareness settings can be configured in
`elasticsearch.yml` and updated dynamically with the
<<cluster-update-settings,cluster-update-settings>> API.

{es} prefers using shards in the same location (with the same
awareness attribute values) to process search or GET requests. Using local
shards is usually faster than crossing rack or zone boundaries.

NOTE: The number of attribute values determines how many shard copies are
allocated in each location. If the number of nodes in each location is
unbalanced and there are a lot of replicas, replica shards might be left
unassigned.

[float]
[[enabling-awareness]]
==== Enabling shard allocation awareness

To enable shard allocation awareness:

. Specify the location of each node with a custom node attribute. For example,
if you want Elasticsearch to distribute shards across different racks, you might
set an awareness attribute called `rack_id` in each node's `elasticsearch.yml`
config file.
+
[source,yaml]
--------------------------------------------------------
node.attr.rack_id: rack_one
--------------------------------------------------------
+
You can also set custom attributes when you start a node:
+
[source,sh]
--------------------------------------------------------
`./bin/elasticsearch -Enode.attr.rack_id=rack_one`
--------------------------------------------------------

. Tell {es} to take one or more awareness attributes into account when
allocating shards by setting
`cluster.routing.allocation.awareness.attributes` in *every* master-eligible
node's `elasticsearch.yml` config file.
+
--
[source,yaml]
--------------------------------------------------------
cluster.routing.allocation.awareness.attributes: rack_id <1>
--------------------------------------------------------
<1> Specify multiple attributes as a comma-separated list.
--
+
You can also use the
<<cluster-update-settings,cluster-update-settings>> API to set or update
a cluster's awareness attributes.

With this example configuration, if you start two nodes with
`node.attr.rack_id` set to `rack_one` and create an index with 5 primary
shards and 1 replica of each primary, all primaries and replicas are
allocated across the two nodes.

If you add two nodes with `node.attr.rack_id` set to `rack_two`,
{es} moves shards to the new nodes, ensuring (if possible)
that no two copies of the same shard are in the same rack.

If `rack_two` fails and takes down both its nodes, by default {es}
allocates the lost shard copies to nodes in `rack_one`. To prevent multiple
copies of a particular shard from being allocated in the same location, you can
enable forced awareness.

[float]
[[forced-awareness]]
==== Forced awareness

By default, if one location fails, Elasticsearch assigns all of the missing
replica shards to the remaining locations. While you might have sufficient
resources across all locations to host your primary and replica shards, a single
location might be unable to host *ALL* of the shards.

To prevent a single location from being overloaded in the event of a failure,
you can set `cluster.routing.allocation.awareness.force` so no replicas are
allocated until nodes are available in another location.

For example, if you have an awareness attribute called `zone` and configure nodes
in `zone1` and `zone2`, you can use forced awareness to prevent Elasticsearch
from allocating replicas if only one zone is available:

[source,yaml]
-------------------------------------------------------------------
cluster.routing.allocation.awareness.attributes: zone
cluster.routing.allocation.awareness.force.zone.values: zone1,zone2 <1>
-------------------------------------------------------------------
<1> Specify all possible values for the awareness attribute.

With this example configuration, if you start two nodes with `node.attr.zone` set
to `zone1` and create an index with 5 shards and 1 replica, Elasticsearch creates
the index and allocates the 5 primary shards but no replicas. Replicas are
only allocated once nodes with `node.attr.zone` set to `zone2` are available.
Docs: Refactored modules and index modules sections 2015-06-22 17:49:45 -04:00			`[[allocation-awareness]]`
[DOCS] Reworked the shard allocation filtering info. (#36456) * [DOCS] Reworked the shard allocation filtering info. Closes #36079 * Added multiple index allocation settings example back. * Removed extraneous space 2018-12-11 10:44:57 -05:00			`=== Shard allocation awareness`

			`You can use custom node attributes as _awareness attributes_ to enable {es}`
			`to take your physical hardware configuration into account when allocating shards.`
			`If {es} knows which nodes are on the same physical server, in the same rack, or`
			`in the same zone, it can distribute the primary shard and its replica shards to`
			`minimise the risk of losing all shard copies in the event of a failure.`

			`When shard allocation awareness is enabled with the`
			`cluster.routing.allocation.awareness.attributes` setting, shards are only
			`allocated to nodes that have values set for the specified awareness`
			`attributes. If you use multiple awareness attributes, {es} considers`
			`each attribute separately when allocating shards.`

			`The allocation awareness settings can be configured in`
			`elasticsearch.yml` and updated dynamically with the
			`<<cluster-update-settings,cluster-update-settings>> API.`
Docs: Refactored modules and index modules sections 2015-06-22 17:49:45 -04:00
[DOCS] Reworked the shard allocation filtering info. (#36456) * [DOCS] Reworked the shard allocation filtering info. Closes #36079 * Added multiple index allocation settings example back. * Removed extraneous space 2018-12-11 10:44:57 -05:00			`{es} prefers using shards in the same location (with the same`
			`awareness attribute values) to process search or GET requests. Using local`
			`shards is usually faster than crossing rack or zone boundaries.`
Docs: Refactored modules and index modules sections 2015-06-22 17:49:45 -04:00
[DOCS] Reworked the shard allocation filtering info. (#36456) * [DOCS] Reworked the shard allocation filtering info. Closes #36079 * Added multiple index allocation settings example back. * Removed extraneous space 2018-12-11 10:44:57 -05:00			`NOTE: The number of attribute values determines how many shard copies are`
			`allocated in each location. If the number of nodes in each location is`
			`unbalanced and there are a lot of replicas, replica shards might be left`
			`unassigned.`
Docs: Refactored modules and index modules sections 2015-06-22 17:49:45 -04:00
[DOCS] Reworked the shard allocation filtering info. (#36456) * [DOCS] Reworked the shard allocation filtering info. Closes #36079 * Added multiple index allocation settings example back. * Removed extraneous space 2018-12-11 10:44:57 -05:00			`[float]`
			`[[enabling-awareness]]`
			`==== Enabling shard allocation awareness`
Docs: Refactored modules and index modules sections 2015-06-22 17:49:45 -04:00
[DOCS] Reworked the shard allocation filtering info. (#36456) * [DOCS] Reworked the shard allocation filtering info. Closes #36079 * Added multiple index allocation settings example back. * Removed extraneous space 2018-12-11 10:44:57 -05:00			`To enable shard allocation awareness:`
Docs: Refactored modules and index modules sections 2015-06-22 17:49:45 -04:00
[DOCS] Reworked the shard allocation filtering info. (#36456) * [DOCS] Reworked the shard allocation filtering info. Closes #36079 * Added multiple index allocation settings example back. * Removed extraneous space 2018-12-11 10:44:57 -05:00			`. Specify the location of each node with a custom node attribute. For example,`
			`if you want Elasticsearch to distribute shards across different racks, you might`
			set an awareness attribute called `rack_id` in each node's `elasticsearch.yml`
			`config file.`
			`+`
			`[source,yaml]`
			`--------------------------------------------------------`
			`node.attr.rack_id: rack_one`
			`--------------------------------------------------------`
			`+`
			`You can also set custom attributes when you start a node:`
			`+`
Docs: Refactored modules and index modules sections 2015-06-22 17:49:45 -04:00			`[source,sh]`
[DOCS] Reworked the shard allocation filtering info. (#36456) * [DOCS] Reworked the shard allocation filtering info. Closes #36079 * Added multiple index allocation settings example back. * Removed extraneous space 2018-12-11 10:44:57 -05:00			`--------------------------------------------------------`
			`./bin/elasticsearch -Enode.attr.rack_id=rack_one`
			`--------------------------------------------------------`
Docs: Refactored modules and index modules sections 2015-06-22 17:49:45 -04:00
[DOCS] Reworked the shard allocation filtering info. (#36456) * [DOCS] Reworked the shard allocation filtering info. Closes #36079 * Added multiple index allocation settings example back. * Removed extraneous space 2018-12-11 10:44:57 -05:00			`. Tell {es} to take one or more awareness attributes into account when`
			`allocating shards by setting`
			`cluster.routing.allocation.awareness.attributes` in every master-eligible
			node's `elasticsearch.yml` config file.
			`+`
			`--`
Docs: Refactored modules and index modules sections 2015-06-22 17:49:45 -04:00			`[source,yaml]`
			`--------------------------------------------------------`
[DOCS] Reworked the shard allocation filtering info. (#36456) * [DOCS] Reworked the shard allocation filtering info. Closes #36079 * Added multiple index allocation settings example back. * Removed extraneous space 2018-12-11 10:44:57 -05:00			`cluster.routing.allocation.awareness.attributes: rack_id <1>`
Docs: Refactored modules and index modules sections 2015-06-22 17:49:45 -04:00			`--------------------------------------------------------`
[DOCS] Reworked the shard allocation filtering info. (#36456) * [DOCS] Reworked the shard allocation filtering info. Closes #36079 * Added multiple index allocation settings example back. * Removed extraneous space 2018-12-11 10:44:57 -05:00			`<1> Specify multiple attributes as a comma-separated list.`
			`--`
			`+`
			`You can also use the`
			`<<cluster-update-settings,cluster-update-settings>> API to set or update`
			`a cluster's awareness attributes.`

			`With this example configuration, if you start two nodes with`
			`node.attr.rack_id` set to `rack_one` and create an index with 5 primary
			`shards and 1 replica of each primary, all primaries and replicas are`
Update allocation awareness docs (#29116) Update allocation awareness docs Today, the docs imply that if multiple attributes are specified the the whole combination of values is considered as a single entity when performing allocation. In fact, each attribute is considered separately. This change fixes this discrepancy. It also replaces the use of the term "awareness zone" with "zone or domain", and reformats some paragraphs to the right width. Fixes #29105 2018-03-19 03:04:47 -04:00			`allocated across the two nodes.`
Docs: Refactored modules and index modules sections 2015-06-22 17:49:45 -04:00
[DOCS] Reworked the shard allocation filtering info. (#36456) * [DOCS] Reworked the shard allocation filtering info. Closes #36079 * Added multiple index allocation settings example back. * Removed extraneous space 2018-12-11 10:44:57 -05:00			If you add two nodes with `node.attr.rack_id` set to `rack_two`,
			`{es} moves shards to the new nodes, ensuring (if possible)`
			`that no two copies of the same shard are in the same rack.`
Docs: Refactored modules and index modules sections 2015-06-22 17:49:45 -04:00
[DOCS] Reworked the shard allocation filtering info. (#36456) * [DOCS] Reworked the shard allocation filtering info. Closes #36079 * Added multiple index allocation settings example back. * Removed extraneous space 2018-12-11 10:44:57 -05:00			If `rack_two` fails and takes down both its nodes, by default {es}
			allocates the lost shard copies to nodes in `rack_one`. To prevent multiple
			`copies of a particular shard from being allocated in the same location, you can`
			`enable forced awareness.`
Take initializing shards into consideration during awareness allocation It makes decision consistent. Fixes #12522 2015-07-29 23:50:10 -04:00
Docs: Refactored modules and index modules sections 2015-06-22 17:49:45 -04:00			`[float]`
			`[[forced-awareness]]`
[DOCS] Reworked the shard allocation filtering info. (#36456) * [DOCS] Reworked the shard allocation filtering info. Closes #36079 * Added multiple index allocation settings example back. * Removed extraneous space 2018-12-11 10:44:57 -05:00			`==== Forced awareness`
Docs: Refactored modules and index modules sections 2015-06-22 17:49:45 -04:00
[DOCS] Reworked the shard allocation filtering info. (#36456) * [DOCS] Reworked the shard allocation filtering info. Closes #36079 * Added multiple index allocation settings example back. * Removed extraneous space 2018-12-11 10:44:57 -05:00			`By default, if one location fails, Elasticsearch assigns all of the missing`
			`replica shards to the remaining locations. While you might have sufficient`
			`resources across all locations to host your primary and replica shards, a single`
			`location might be unable to host ALL of the shards.`
Docs: Refactored modules and index modules sections 2015-06-22 17:49:45 -04:00
[DOCS] Reworked the shard allocation filtering info. (#36456) * [DOCS] Reworked the shard allocation filtering info. Closes #36079 * Added multiple index allocation settings example back. * Removed extraneous space 2018-12-11 10:44:57 -05:00			`To prevent a single location from being overloaded in the event of a failure,`
			you can set `cluster.routing.allocation.awareness.force` so no replicas are
			`allocated until nodes are available in another location.`
Docs: Refactored modules and index modules sections 2015-06-22 17:49:45 -04:00
[DOCS] Reworked the shard allocation filtering info. (#36456) * [DOCS] Reworked the shard allocation filtering info. Closes #36079 * Added multiple index allocation settings example back. * Removed extraneous space 2018-12-11 10:44:57 -05:00			For example, if you have an awareness attribute called `zone` and configure nodes
			in `zone1` and `zone2`, you can use forced awareness to prevent Elasticsearch
			`from allocating replicas if only one zone is available:`
Docs: Refactored modules and index modules sections 2015-06-22 17:49:45 -04:00
			`[source,yaml]`
			`-------------------------------------------------------------------`
			`cluster.routing.allocation.awareness.attributes: zone`
[DOCS] Reworked the shard allocation filtering info. (#36456) * [DOCS] Reworked the shard allocation filtering info. Closes #36079 * Added multiple index allocation settings example back. * Removed extraneous space 2018-12-11 10:44:57 -05:00			`cluster.routing.allocation.awareness.force.zone.values: zone1,zone2 <1>`
Docs: Refactored modules and index modules sections 2015-06-22 17:49:45 -04:00			`-------------------------------------------------------------------`
[DOCS] Reworked the shard allocation filtering info. (#36456) * [DOCS] Reworked the shard allocation filtering info. Closes #36079 * Added multiple index allocation settings example back. * Removed extraneous space 2018-12-11 10:44:57 -05:00			`<1> Specify all possible values for the awareness attribute.`
Docs: Refactored modules and index modules sections 2015-06-22 17:49:45 -04:00
[DOCS] Reworked the shard allocation filtering info. (#36456) * [DOCS] Reworked the shard allocation filtering info. Closes #36079 * Added multiple index allocation settings example back. * Removed extraneous space 2018-12-11 10:44:57 -05:00			With this example configuration, if you start two nodes with `node.attr.zone` set
			to `zone1` and create an index with 5 shards and 1 replica, Elasticsearch creates
			`the index and allocates the 5 primary shards but no replicas. Replicas are`
			only allocated once nodes with `node.attr.zone` set to `zone2` are available.