2019-07-25 14:17:51 -04:00
|
|
|
[[indices-clone-index]]
|
2019-08-01 11:21:11 -04:00
|
|
|
=== Clone Index
|
2019-07-25 14:17:51 -04:00
|
|
|
|
|
|
|
The clone index API allows you to clone an existing index into a new index,
|
|
|
|
where each original primary shard is cloned into a new primary shard in
|
|
|
|
the new index.
|
|
|
|
|
|
|
|
[float]
|
2019-08-01 11:21:11 -04:00
|
|
|
==== How does cloning work?
|
2019-07-25 14:17:51 -04:00
|
|
|
|
|
|
|
Cloning works as follows:
|
|
|
|
|
|
|
|
* First, it creates a new target index with the same definition as the source
|
|
|
|
index.
|
|
|
|
|
|
|
|
* Then it hard-links segments from the source index into the target index. (If
|
|
|
|
the file system doesn't support hard-linking, then all segments are copied
|
|
|
|
into the new index, which is a much more time consuming process.)
|
|
|
|
|
|
|
|
* Finally, it recovers the target index as though it were a closed index which
|
|
|
|
had just been re-opened.
|
|
|
|
|
|
|
|
[float]
|
2019-08-01 11:21:11 -04:00
|
|
|
==== Preparing an index for cloning
|
2019-07-25 14:17:51 -04:00
|
|
|
|
|
|
|
Create a new index:
|
|
|
|
|
2019-09-06 11:31:13 -04:00
|
|
|
[source,console]
|
2019-07-25 14:17:51 -04:00
|
|
|
--------------------------------------------------
|
|
|
|
PUT my_source_index
|
|
|
|
{
|
|
|
|
"settings": {
|
|
|
|
"index.number_of_shards" : 5
|
|
|
|
}
|
|
|
|
}
|
|
|
|
--------------------------------------------------
|
|
|
|
|
|
|
|
In order to clone an index, the index must be marked as read-only,
|
|
|
|
and have <<cluster-health,health>> `green`.
|
|
|
|
|
|
|
|
This can be achieved with the following request:
|
|
|
|
|
2019-09-06 11:31:13 -04:00
|
|
|
[source,console]
|
2019-07-25 14:17:51 -04:00
|
|
|
--------------------------------------------------
|
|
|
|
PUT /my_source_index/_settings
|
|
|
|
{
|
|
|
|
"settings": {
|
|
|
|
"index.blocks.write": true <1>
|
|
|
|
}
|
|
|
|
}
|
|
|
|
--------------------------------------------------
|
|
|
|
// TEST[continued]
|
|
|
|
|
|
|
|
<1> Prevents write operations to this index while still allowing metadata
|
|
|
|
changes like deleting the index.
|
|
|
|
|
|
|
|
[float]
|
2019-08-01 11:21:11 -04:00
|
|
|
==== Cloning an index
|
2019-07-25 14:17:51 -04:00
|
|
|
|
|
|
|
To clone `my_source_index` into a new index called `my_target_index`, issue
|
|
|
|
the following request:
|
|
|
|
|
2019-09-06 11:31:13 -04:00
|
|
|
[source,console]
|
2019-07-25 14:17:51 -04:00
|
|
|
--------------------------------------------------
|
|
|
|
POST my_source_index/_clone/my_target_index
|
|
|
|
--------------------------------------------------
|
|
|
|
// TEST[continued]
|
|
|
|
|
|
|
|
The above request returns immediately once the target index has been added to
|
|
|
|
the cluster state -- it doesn't wait for the clone operation to start.
|
|
|
|
|
|
|
|
[IMPORTANT]
|
|
|
|
=====================================
|
|
|
|
|
|
|
|
Indices can only be cloned if they satisfy the following requirements:
|
|
|
|
|
|
|
|
* the target index must not exist
|
|
|
|
|
|
|
|
* The source index must have the same number of primary shards as the target index.
|
|
|
|
|
|
|
|
* The node handling the clone process must have sufficient free disk space to
|
|
|
|
accommodate a second copy of the existing index.
|
|
|
|
|
|
|
|
=====================================
|
|
|
|
|
|
|
|
The `_clone` API is similar to the <<indices-create-index, `create index` API>>
|
|
|
|
and accepts `settings` and `aliases` parameters for the target index:
|
|
|
|
|
2019-09-06 11:31:13 -04:00
|
|
|
[source,console]
|
2019-07-25 14:17:51 -04:00
|
|
|
--------------------------------------------------
|
|
|
|
POST my_source_index/_clone/my_target_index
|
|
|
|
{
|
|
|
|
"settings": {
|
|
|
|
"index.number_of_shards": 5 <1>
|
|
|
|
},
|
|
|
|
"aliases": {
|
|
|
|
"my_search_indices": {}
|
|
|
|
}
|
|
|
|
}
|
|
|
|
--------------------------------------------------
|
|
|
|
// TEST[s/^/PUT my_source_index\n{"settings": {"index.blocks.write": true, "index.number_of_shards": "5"}}\n/]
|
|
|
|
|
|
|
|
<1> The number of shards in the target index. This must be equal to the
|
|
|
|
number of shards in the source index.
|
|
|
|
|
|
|
|
|
|
|
|
NOTE: Mappings may not be specified in the `_clone` request. The mappings of
|
|
|
|
the source index will be used for the target index.
|
|
|
|
|
|
|
|
[float]
|
2019-08-01 11:21:11 -04:00
|
|
|
==== Monitoring the clone process
|
2019-07-25 14:17:51 -04:00
|
|
|
|
|
|
|
The clone process can be monitored with the <<cat-recovery,`_cat recovery`
|
|
|
|
API>>, or the <<cluster-health, `cluster health` API>> can be used to wait
|
|
|
|
until all primary shards have been allocated by setting the `wait_for_status`
|
|
|
|
parameter to `yellow`.
|
|
|
|
|
|
|
|
The `_clone` API returns as soon as the target index has been added to the
|
|
|
|
cluster state, before any shards have been allocated. At this point, all
|
|
|
|
shards are in the state `unassigned`. If, for any reason, the target index
|
|
|
|
can't be allocated, its primary shard will remain `unassigned` until it
|
|
|
|
can be allocated on that node.
|
|
|
|
|
|
|
|
Once the primary shard is allocated, it moves to state `initializing`, and the
|
|
|
|
clone process begins. When the clone operation completes, the shard will
|
|
|
|
become `active`. At that point, Elasticsearch will try to allocate any
|
|
|
|
replicas and may decide to relocate the primary shard to another node.
|
|
|
|
|
|
|
|
[float]
|
2019-08-01 11:21:11 -04:00
|
|
|
==== Wait For Active Shards
|
2019-07-25 14:17:51 -04:00
|
|
|
|
|
|
|
Because the clone operation creates a new index to clone the shards to,
|
|
|
|
the <<create-index-wait-for-active-shards,wait for active shards>> setting
|
|
|
|
on index creation applies to the clone index action as well.
|