From 82bb37070379bb8e135ee566589e8e348e0bdd78 Mon Sep 17 00:00:00 2001 From: stack Date: Mon, 29 Sep 2014 10:49:03 -0700 Subject: [PATCH] HBASE-6290 Add a function a mark a server as dead and start the recovery the process (Talat Uyarer) DOC ADDENDUM --- src/main/docbkx/ops_mgt.xml | 13 +++++++++++++ 1 file changed, 13 insertions(+) diff --git a/src/main/docbkx/ops_mgt.xml b/src/main/docbkx/ops_mgt.xml index f882646ceda..1f83a155f8f 100644 --- a/src/main/docbkx/ops_mgt.xml +++ b/src/main/docbkx/ops_mgt.xml @@ -704,6 +704,19 @@ Options: RegionServer. Avoid any problems by disabling the balancer first. See below. + + Kill Node Tool + In hbase-2.0, in the bin directory, we added a script named + considerAsDead.sh that can be used to kill a regionserver. + Hardware issues could be detected by specialized monitoring tools before the + zookeeper timeout has expired. considerAsDead.sh is a + simple function to mark a RegionServer as dead. It deletes all the znodes + of the server, starting the recovery process. Plug in the script into + your monitoring/fault detection tools to initiate faster failover. Be + careful how you use this disruptive tool. Copy the script if you need to + make use of it in a version of hbase previous to hbase-2.0. + + A downside to the above stop of a RegionServer is that regions could be offline for a good period of time. Regions are closed in order. If many regions on the server, the first region to close may not be back online until all regions close and after the master notices