From 754d79e7e6e6c82cbd39510e00f813a365eeb14f Mon Sep 17 00:00:00 2001 From: Michael Stack Date: Fri, 15 Jul 2011 23:36:14 +0000 Subject: [PATCH] HBASE-4108 book.xml/troubleshooting.xml - moving troubleshooting info out of FAQ into troubleshooting chapter git-svn-id: https://svn.apache.org/repos/asf/hbase/trunk@1147346 13f79535-47bb-0310-9956-ffa450edef68 --- src/docbkx/book.xml | 25 ++++--------------------- src/docbkx/troubleshooting.xml | 23 ++++++++++++++++++++++- 2 files changed, 26 insertions(+), 22 deletions(-) diff --git a/src/docbkx/book.xml b/src/docbkx/book.xml index 77e6cc10c20..6d0f6c26ad6 100644 --- a/src/docbkx/book.xml +++ b/src/docbkx/book.xml @@ -1550,38 +1550,21 @@ When I build, why do I always get Unable to find resource 'VM_global_libra Runtime - Loading, why do I see pauses when loading HBase? + I'm having problems with my HBase cluster, how can I troubleshoot it? - If compression is enabled, see this thread up on the user list, - Long client pauses with compression. + See . - Why does my RegionServer hang inexplicitly? + How can I improve HBase cluster performance? - Are you running an old JVM (< 1.6.0_u21?)? When you look at a thread dump, - does it look like threads are BLOCKED but no one holds the lock all are - blocked on? See HBASE 3622 Deadlock in HBaseServer (JVM bug?). - Adding -XX:+UseMembar to the HBase HBASE_OPTS in conf/hbase-env.sh - may fix it. - - - - - - Why do I see double the actual count of regionservers, once by domain name and then by IP? - - - - Fix your DNS. In versions of HBase before 0.92.x, reverse DNS needs to give same answer - as forward lookup. See HBASE 3431 - Regionserver is not using the name given it by the master; double entry in master listing of servers for gorey details. + See . 
diff --git a/src/docbkx/troubleshooting.xml b/src/docbkx/troubleshooting.xml index 9f761d06c4d..5fc715acb0c 100644 --- a/src/docbkx/troubleshooting.xml +++ b/src/docbkx/troubleshooting.xml @@ -455,6 +455,9 @@ hadoop 17789 155 35.2 9067824 8604364 ? S<l Mar04 9855:48 /usr/java/j more data is packed into the same region, but performance is achieved by being able to write larger files - which is why HBase waits until the flushize before writing a new StoreFile. And smaller StoreFiles become targets for compaction. Without compression the files are much bigger and don't need as much compaction, however this is at the expense of I/O. + + For additional information, see this thread on Long client pauses with compression. + @@ -545,7 +548,17 @@ java.lang.UnsatisfiedLinkError: no gplcompression in java.library.path
Runtime Errors - + +
+ RegionServer Hanging + + Are you running an old JVM (< 1.6.0_u21?)? When you look at a thread dump, + does it look like threads are BLOCKED but no one holds the lock all are + blocked on? See HBASE 3622 Deadlock in HBaseServer (JVM bug?). + Adding -XX:+UseMembar to the HBase HBASE_OPTS in conf/hbase-env.sh + may fix it. + +
java.io.IOException...(Too many open files) @@ -649,6 +662,14 @@ ERROR org.apache.hadoop.hbase.regionserver.HRegionServer: ZooKeeper session expi and then the client goes back to .META. to find the new location of the moved region. However, if the NotServingRegionException is logged ERROR, then the client ran out of retries and something probably wrong.
+
+ Regions listed by domain name, then IP + + Fix your DNS. In versions of HBase before 0.92.x, reverse DNS needs to give the same answer + as forward lookup. See HBASE 3431 + RegionServer is not using the name given it by the master; double entry in master listing of servers for gory details. + +
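
The forward/reverse DNS requirement described in the last added section can be checked on a node before starting a RegionServer. The following is a minimal sketch, not part of this patch or of HBase: the function name `check_forward_reverse` and its injectable `forward`/`reverse` parameters are hypothetical, and it assumes an IPv4 forward lookup through Python's standard `socket` module is representative of what the host's resolver returns.

```python
import socket


def check_forward_reverse(hostname, forward=socket.gethostbyname,
                          reverse=socket.gethostbyaddr):
    """Return (ip, reverse_name, consistent) for hostname.

    forward and reverse are injectable (hypothetical parameters) so the
    check can be exercised without touching a real resolver.
    """
    ip = forward(hostname)          # forward lookup: name -> IPv4 address
    reverse_name = reverse(ip)[0]   # reverse lookup: address -> primary name

    def normalize(name):
        # DNS names are case-insensitive and may carry a trailing dot.
        return name.rstrip(".").lower()

    consistent = normalize(reverse_name) == normalize(hostname)
    return ip, reverse_name, consistent
```

Calling `check_forward_reverse(socket.getfqdn())` on each node and looking at the third element of the result gives a quick indication of whether the reverse lookup agrees with the forward one. With the default resolvers, a missing reverse entry surfaces as a `socket.herror` rather than a `False` result.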