2013-08-28 19:24:34 -04:00
|
|
|
[[mapping-routing-field]]
|
2015-07-19 19:24:29 -04:00
|
|
|
=== `_routing` field
|
2013-08-28 19:24:34 -04:00
|
|
|
|
2015-07-19 19:24:29 -04:00
|
|
|
A document is routed to a particular shard in an index using the following
|
|
|
|
formula:
|
2013-08-28 19:24:34 -04:00
|
|
|
|
2015-07-19 19:24:29 -04:00
|
|
|
shard_num = hash(_routing) % num_primary_shards
|
2013-08-28 19:24:34 -04:00
|
|
|
|
2017-07-05 06:30:19 -04:00
|
|
|
The default value used for `_routing` is the document's <<mapping-id-field,`_id`>>.
|
2013-08-28 19:24:34 -04:00
|
|
|
|
2015-07-19 19:24:29 -04:00
|
|
|
Custom routing patterns can be implemented by specifying a custom `routing`
|
|
|
|
value per document. For instance:
|
2013-08-28 19:24:34 -04:00
|
|
|
|
2019-09-06 11:31:13 -04:00
|
|
|
[source,console]
|
2015-07-19 19:24:29 -04:00
|
|
|
------------------------------
|
2020-07-27 15:58:26 -04:00
|
|
|
PUT my-index-000001/_doc/1?routing=user1&refresh=true <1>
|
2015-07-19 19:24:29 -04:00
|
|
|
{
|
|
|
|
"title": "This is a document"
|
|
|
|
}
|
|
|
|
|
2020-07-27 15:58:26 -04:00
|
|
|
GET my-index-000001/_doc/1?routing=user1 <2>
|
2015-07-19 19:24:29 -04:00
|
|
|
------------------------------
|
2016-04-29 10:42:03 -04:00
|
|
|
// TESTSETUP
|
2015-07-19 19:24:29 -04:00
|
|
|
|
|
|
|
<1> This document uses `user1` as its routing value, instead of its ID.
|
2016-02-09 05:07:32 -05:00
|
|
|
<2> The same `routing` value needs to be provided when
|
2015-07-19 19:24:29 -04:00
|
|
|
<<docs-get,getting>>, <<docs-delete,deleting>>, or <<docs-update,updating>>
|
|
|
|
the document.
|
|
|
|
|
2016-08-23 07:32:14 -04:00
|
|
|
The value of the `_routing` field is accessible in queries:
|
2015-07-19 19:24:29 -04:00
|
|
|
|
2019-09-06 11:31:13 -04:00
|
|
|
[source,console]
|
2015-07-19 19:24:29 -04:00
|
|
|
--------------------------
|
2020-07-27 15:58:26 -04:00
|
|
|
GET my-index-000001/_search
|
2015-07-19 19:24:29 -04:00
|
|
|
{
|
|
|
|
"query": {
|
|
|
|
"terms": {
|
|
|
|
"_routing": [ "user1" ] <1>
|
|
|
|
}
|
|
|
|
}
|
|
|
|
}
|
|
|
|
--------------------------
|
|
|
|
|
|
|
|
<1> Querying on the `_routing` field (also see the <<query-dsl-ids-query,`ids` query>>)
|
|
|
|
|
2020-07-09 16:52:30 -04:00
|
|
|
NOTE: Data streams do not support custom routing. Instead, target the
|
|
|
|
appropriate backing index for the stream.
|
|
|
|
|
2015-07-19 19:24:29 -04:00
|
|
|
==== Searching with custom routing
|
|
|
|
|
|
|
|
Custom routing can reduce the impact of searches. Instead of having to fan
|
|
|
|
out a search request to all the shards in an index, the request can be sent to
|
|
|
|
just the shard that matches the specific routing value (or values):
|
|
|
|
|
2019-09-06 11:31:13 -04:00
|
|
|
[source,console]
|
2015-07-19 19:24:29 -04:00
|
|
|
------------------------------
|
2020-07-27 15:58:26 -04:00
|
|
|
GET my-index-000001/_search?routing=user1,user2 <1>
|
2015-07-19 19:24:29 -04:00
|
|
|
{
|
|
|
|
"query": {
|
|
|
|
"match": {
|
|
|
|
"title": "document"
|
|
|
|
}
|
|
|
|
}
|
|
|
|
}
|
|
|
|
------------------------------
|
|
|
|
|
|
|
|
<1> This search request will only be executed on the shards associated with the `user1` and `user2` routing values.
|
|
|
|
|
|
|
|
|
|
|
|
==== Making a routing value required
|
|
|
|
|
|
|
|
When using custom routing, it is important to provide the routing value
|
|
|
|
whenever <<docs-index_,indexing>>, <<docs-get,getting>>,
|
|
|
|
<<docs-delete,deleting>>, or <<docs-update,updating>> a document.
|
|
|
|
|
|
|
|
Forgetting the routing value can lead to a document being indexed on more than
|
|
|
|
one shard. As a safeguard, the `_routing` field can be configured to make a
|
|
|
|
custom `routing` value required for all CRUD operations:
|
|
|
|
|
2019-09-06 11:31:13 -04:00
|
|
|
[source,console]
|
2015-07-19 19:24:29 -04:00
|
|
|
------------------------------
|
2020-07-27 15:58:26 -04:00
|
|
|
PUT my-index-000002
|
2015-07-19 19:24:29 -04:00
|
|
|
{
|
|
|
|
"mappings": {
|
2019-01-22 09:13:52 -05:00
|
|
|
"_routing": {
|
|
|
|
"required": true <1>
|
2015-07-19 19:24:29 -04:00
|
|
|
}
|
|
|
|
}
|
|
|
|
}
|
|
|
|
|
2020-07-27 15:58:26 -04:00
|
|
|
PUT my-index-000002/_doc/1 <2>
|
2015-07-19 19:24:29 -04:00
|
|
|
{
|
|
|
|
"text": "No routing value provided"
|
|
|
|
}
|
|
|
|
------------------------------
|
2017-09-14 15:24:03 -04:00
|
|
|
// TEST[catch:bad_request]
|
2019-09-06 11:31:13 -04:00
|
|
|
|
2020-03-23 12:34:49 -04:00
|
|
|
<1> Routing is required for all documents.
|
2015-07-19 19:24:29 -04:00
|
|
|
<2> This index request throws a `routing_missing_exception`.
|
|
|
|
|
|
|
|
==== Unique IDs with custom routing
|
|
|
|
|
|
|
|
When indexing documents specifying a custom `_routing`, the uniqueness of the
|
|
|
|
`_id` is not guaranteed across all of the shards in the index. In fact,
|
|
|
|
documents with the same `_id` might end up on different shards if indexed with
|
|
|
|
different `_routing` values.
|
|
|
|
|
|
|
|
It is up to the user to ensure that IDs are unique across the index.
|
2017-01-18 02:51:23 -05:00
|
|
|
|
|
|
|
[[routing-index-partition]]
|
|
|
|
==== Routing to an index partition
|
|
|
|
|
|
|
|
An index can be configured such that custom routing values will go to a subset of the shards rather
|
|
|
|
than a single shard. This helps mitigate the risk of ending up with an imbalanced cluster while still
|
|
|
|
reducing the impact of searches.
|
|
|
|
|
|
|
|
This is done by providing the index level setting <<routing-partition-size,`index.routing_partition_size`>> at index creation.
|
|
|
|
As the partition size increases, the more evenly distributed the data will become at the
|
|
|
|
expense of having to search more shards per request.
|
|
|
|
|
|
|
|
When this setting is present, the formula for calculating the shard becomes:
|
|
|
|
|
|
|
|
shard_num = (hash(_routing) + hash(_id) % routing_partition_size) % num_primary_shards
|
|
|
|
|
|
|
|
That is, the `_routing` field is used to calculate a set of shards within the index and then the
|
|
|
|
`_id` is used to pick a shard within that set.
|
|
|
|
|
|
|
|
To enable this feature, the `index.routing_partition_size` should have a value greater than 1 and
|
|
|
|
less than `index.number_of_shards`.
|
|
|
|
|
|
|
|
Once enabled, the partitioned index will have the following limitations:
|
|
|
|
|
2017-07-05 06:30:19 -04:00
|
|
|
* Mappings with <<parent-join,`join` field>> relationships cannot be created within it.
|
2020-03-23 12:34:49 -04:00
|
|
|
* All mappings within the index must have the `_routing` field marked as required.
|