[[mapping-routing-field]]
=== `_routing` field

A document is routed to a particular shard in an index using the following
formula:

    shard_num = hash(_routing) % num_primary_shards

The default value used for `_routing` is the document's <<mapping-id-field,`_id`>>.
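
The formula can be sketched in Python as follows. This is an illustration only: `zlib.crc32` is a stand-in for the internal hash function, so the shard numbers it produces will not match those of a real cluster, and `shard_for` is a hypothetical helper name.

```python
import zlib


def shard_for(routing: str, num_primary_shards: int) -> int:
    """Return the shard number for a routing value.

    Assumption: zlib.crc32 is only a stand-in for the real hash function.
    """
    return zlib.crc32(routing.encode("utf-8")) % num_primary_shards


# Without custom routing, the document's _id is used as the routing value.
doc_id = "1"
default_shard = shard_for(doc_id, num_primary_shards=5)

# With ?routing=user1, the routing value replaces the _id in the formula.
custom_shard = shard_for("user1", num_primary_shards=5)

print(default_shard, custom_shard)
```

Because the result depends on `num_primary_shards`, the same routing value may map to a different shard if the number of primary shards differs.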

Custom routing patterns can be implemented by specifying a custom `routing`
value per document.  For instance:

[source,js]
------------------------------
PUT my_index/_doc/1?routing=user1&refresh=true <1>
{
  "title": "This is a document"
}

GET my_index/_doc/1?routing=user1 <2>
------------------------------
// CONSOLE
// TESTSETUP

<1> This document uses `user1` as its routing value, instead of its ID.
<2> The same `routing` value needs to be provided when
    <<docs-get,getting>>, <<docs-delete,deleting>>, or <<docs-update,updating>>
    the document.

The value of the `_routing` field is accessible in queries:

[source,js]
--------------------------
GET my_index/_search
{
  "query": {
    "terms": {
      "_routing": [ "user1" ] <1>
    }
  }
}
--------------------------
// CONSOLE

<1> Querying on the `_routing` field (also see the <<query-dsl-ids-query,`ids` query>>)

==== Searching with custom routing

Custom routing can reduce the cost of a search.  Instead of fanning the request
out to every shard in the index, the search can be sent only to the shard or
shards that match the given routing value (or values):

[source,js]
------------------------------
GET my_index/_search?routing=user1,user2 <1>
{
  "query": {
    "match": {
      "title": "document"
    }
  }
}
------------------------------
// CONSOLE

<1> This search request will only be executed on the shards associated with the `user1` and `user2` routing values.
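
As a rough sketch of why this narrows the search, the set of target shards can be derived from the routing values alone. The helper names `shard_for` and `shards_to_search` are hypothetical, and `zlib.crc32` again stands in for the real hash function.

```python
import zlib
from typing import Set


def shard_for(routing: str, num_primary_shards: int) -> int:
    # Assumption: zlib.crc32 stands in for the real routing hash.
    return zlib.crc32(routing.encode("utf-8")) % num_primary_shards


def shards_to_search(routing_param: str, num_primary_shards: int) -> Set[int]:
    """Shards targeted by a comma-separated ?routing= parameter."""
    values = [v for v in routing_param.split(",") if v]
    if not values:
        # No routing given: the search fans out to every shard.
        return set(range(num_primary_shards))
    return {shard_for(v, num_primary_shards) for v in values}


targeted = shards_to_search("user1,user2", num_primary_shards=5)
print(sorted(targeted))
```

Two routing values select at most two shards, and only one if both values happen to hash to the same shard.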


==== Making a routing value required

When using custom routing, it is important to provide the routing value
whenever <<docs-index_,indexing>>, <<docs-get,getting>>,
<<docs-delete,deleting>>, or <<docs-update,updating>> a document.

Forgetting the routing value can lead to a document being indexed on more than
one shard.  As a safeguard, the `_routing` field can be configured to make a
custom `routing` value required for all CRUD operations:

[source,js]
------------------------------
PUT my_index2?include_type_name=true
{
  "mappings": {
    "_doc": {
      "_routing": {
        "required": true <1>
      }
    }
  }
}

PUT my_index2/_doc/1 <2>
{
  "text": "No routing value provided"
}
------------------------------
// CONSOLE
// TEST[catch:bad_request]
<1> Routing is required for `_doc` documents.
<2> This index request throws a `routing_missing_exception`.

==== Unique IDs with custom routing

When documents are indexed with a custom `_routing` value, the uniqueness of
the `_id` is not guaranteed across all of the shards in the index. In fact,
documents with the same `_id` might end up on different shards if indexed with
different `_routing` values.

It is up to the user to ensure that IDs are unique across the index.
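
The effect can be reproduced with a toy in-memory model in which each shard enforces `_id` uniqueness only for its own documents. Everything here is an assumption made for illustration: the `shards` dictionary, the helpers `index_doc` and `copies_of`, and the use of `zlib.crc32` in place of the real hash function.

```python
import zlib
from collections import defaultdict


def shard_for(routing: str, num_primary_shards: int) -> int:
    # Assumption: zlib.crc32 stands in for the real routing hash.
    return zlib.crc32(routing.encode("utf-8")) % num_primary_shards


NUM_SHARDS = 5
# Toy model: each shard is a dict keyed by _id, so uniqueness is per shard only.
shards = defaultdict(dict)


def index_doc(doc_id: str, routing: str, body: dict) -> int:
    shard = shard_for(routing, NUM_SHARDS)
    shards[shard][doc_id] = body  # overwrites only within this shard
    return shard


def copies_of(doc_id: str) -> int:
    """Count how many shards hold a document with this _id."""
    return sum(1 for docs in shards.values() if doc_id in docs)


first = index_doc("1", "user1", {"title": "first"})
# Pick another routing value that maps to a different shard in this toy model.
other = next(f"user{i}" for i in range(2, 1000)
             if shard_for(f"user{i}", NUM_SHARDS) != first)
second = index_doc("1", other, {"title": "second"})

print(copies_of("1"))
```

In this model the second request does not overwrite the first document. Both copies of `_id` `1` exist side by side on different shards.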

[[routing-index-partition]]
==== Routing to an index partition

An index can be configured such that custom routing values will go to a subset of the shards rather
than a single shard. This helps mitigate the risk of ending up with an imbalanced cluster while still
reducing the impact of searches.

This is done by providing the index-level setting <<routing-partition-size,`index.routing_partition_size`>> at index creation.
As the partition size increases, the data becomes more evenly distributed, at the
expense of having to search more shards per request.

When this setting is present, the formula for calculating the shard becomes:

    shard_num = (hash(_routing) + hash(_id) % routing_partition_size) % num_primary_shards

That is, the `_routing` field is used to calculate a set of shards within the index and then the
`_id` is used to pick a shard within that set.

To enable this feature, `index.routing_partition_size` must be set to a value greater than 1 and
less than `index.number_of_shards`.

Once enabled, the partitioned index will have the following limitations:

*   Mappings with <<parent-join,`join` field>> relationships cannot be created within it.
*   All mappings within the index must have the `_routing` field marked as required.