2013-11-14 20:14:39 -05:00
|
|
|
[[cat-recovery]]
|
2014-05-16 15:43:35 -04:00
|
|
|
== cat recovery
|
2013-11-14 20:14:39 -05:00
|
|
|
|
2014-05-13 16:04:48 -04:00
|
|
|
The `recovery` command is a view of index shard recoveries, both on-going and previously
|
|
|
|
completed. It is a more compact view of the JSON <<indices-recovery,recovery>> API.
|
2013-11-14 20:14:39 -05:00
|
|
|
|
2014-05-13 16:04:48 -04:00
|
|
|
A recovery event occurs anytime an index shard moves to a different node in the cluster.
|
|
|
|
This can happen during a snapshot recovery, a change in replication level, node failure, or
|
2015-06-23 16:10:55 -04:00
|
|
|
on node startup. This last type is called a local store recovery and is the normal
|
2014-05-13 16:04:48 -04:00
|
|
|
way for shards to be loaded from disk when a node starts up.
|
|
|
|
|
|
|
|
As an example, here is what the recovery state of a cluster may look like when there
|
|
|
|
are no shards in transit from one node to another:
|
2013-11-14 20:14:39 -05:00
|
|
|
|
2017-02-22 03:18:10 -05:00
|
|
|
[source,js]
|
2014-10-27 15:49:58 -04:00
|
|
|
----------------------------------------------------------------------------
|
2017-02-22 03:18:10 -05:00
|
|
|
GET _cat/recovery?v
|
2014-10-27 15:49:58 -04:00
|
|
|
---------------------------------------------------------------------------
|
2017-02-22 03:18:10 -05:00
|
|
|
// CONSOLE
|
|
|
|
// TEST[setup:twitter]
|
|
|
|
|
|
|
|
The response of this request will be something like:
|
|
|
|
|
|
|
|
[source,js]
|
|
|
|
---------------------------------------------------------------------------
|
|
|
|
index shard time type stage source_host source_node target_host target_node repository snapshot files files_recovered files_percent files_total bytes bytes_recovered bytes_percent bytes_total translog_ops translog_ops_recovered translog_ops_percent
|
|
|
|
twitter 0 13ms store done n/a n/a 127.0.0.1 node-0 n/a n/a 0 0 100% 13 0 0 100% 9928 0 0 100.0%
|
|
|
|
---------------------------------------------------------------------------
|
|
|
|
// TESTRESPONSE[s/store/empty_store/]
|
|
|
|
// TESTRESPONSE[s/100%/0.0%/]
|
|
|
|
// TESTRESPONSE[s/9928/0/]
|
|
|
|
// TESTRESPONSE[s/13/\\d+/ _cat]
|
2014-05-13 16:04:48 -04:00
|
|
|
|
|
|
|
In the above case, the source and target nodes are the same because the recovery
|
2015-06-23 16:10:55 -04:00
|
|
|
type was store, i.e. they were read from local storage on node start.
|
2014-05-13 16:04:48 -04:00
|
|
|
|
|
|
|
Now let's see what a live recovery looks like. By increasing the replica count
|
|
|
|
of our index and bringing another node online to host the replicas, we can see
|
|
|
|
what a live shard recovery looks like.
|
|
|
|
|
2017-02-22 03:18:10 -05:00
|
|
|
[source,js]
|
2014-10-27 15:49:58 -04:00
|
|
|
----------------------------------------------------------------------------
|
2017-02-22 03:18:10 -05:00
|
|
|
GET _cat/recovery?v&h=i,s,t,ty,st,shost,thost,f,fp,b,bp
|
|
|
|
---------------------------------------------------------------------------
|
|
|
|
// CONSOLE
|
|
|
|
// TEST[setup:twitter]
|
|
|
|
|
|
|
|
This will return a line like:
|
|
|
|
|
|
|
|
[source,js]
|
|
|
|
---------------------------------------------------------------------------
|
|
|
|
i s t ty st shost thost f fp b bp
|
|
|
|
twitter 0 1252ms peer done 192.168.1.1 192.168.1.2 0 100.0% 0 100.0%
|
2014-10-27 15:49:58 -04:00
|
|
|
----------------------------------------------------------------------------
|
2017-02-22 03:18:10 -05:00
|
|
|
// TESTRESPONSE[s/peer/empty_store/]
|
|
|
|
// TESTRESPONSE[s/192.168.1.2/127.0.0.1/]
|
|
|
|
// TESTRESPONSE[s/192.168.1.1/n\/a/]
|
|
|
|
// TESTRESPONSE[s/100.0%/0.0%/]
|
|
|
|
// TESTRESPONSE[s/1252/\\d+/ _cat]
|
2014-05-13 16:04:48 -04:00
|
|
|
|
2017-02-22 03:18:10 -05:00
|
|
|
We can see in the above listing that our thw twitter shard was recovered from another node.
|
|
|
|
Notice that the recovery type is shown as `peer`. The files and bytes copied are
|
|
|
|
real-time measurements.
|
2014-05-13 16:04:48 -04:00
|
|
|
|
|
|
|
Finally, let's see what a snapshot recovery looks like. Assuming I have previously
|
|
|
|
made a backup of my index, I can restore it using the <<modules-snapshots,snapshot and restore>>
|
|
|
|
API.
|
2013-11-14 20:14:39 -05:00
|
|
|
|
2017-02-22 03:18:10 -05:00
|
|
|
[source,js]
|
2014-10-27 15:49:58 -04:00
|
|
|
--------------------------------------------------------------------------------
|
2017-02-22 03:18:10 -05:00
|
|
|
GET _cat/recovery?v&h=i,s,t,ty,st,rep,snap,f,fp,b,bp
|
|
|
|
---------------------------------------------------------------------------
|
|
|
|
// CONSOLE
|
|
|
|
// TEST[skip:no need to execute snapshot/restore here]
|
2014-10-27 15:49:58 -04:00
|
|
|
|
2017-02-22 03:18:10 -05:00
|
|
|
This will show a recovery of type snapshot in the response
|
2014-10-27 15:49:58 -04:00
|
|
|
|
2017-02-22 03:18:10 -05:00
|
|
|
[source,js]
|
|
|
|
---------------------------------------------------------------------------
|
|
|
|
i s t ty st rep snap f fp b bp
|
|
|
|
twitter 0 1978ms snapshot done twitter snap_1 79 8.0% 12086 9.0%
|
|
|
|
--------------------------------------------------------------------------------
|
|
|
|
// TESTRESPONSE[_cat]
|