4686 Commits

Author SHA1 Message Date
tedyu
364a57a950 HBASE-16771 VerifyReplication should increase GOODROWS counter if re-comparison passes 2016-10-09 20:51:23 -07:00
tedyu
4b75614a2e HBASE-16773 AccessController should access local region if possible 2016-10-08 21:17:15 -07:00
tedyu
e043d450ed HBASE-16768 Inconsistent results from the Append/Increment (ChiaPing Tsai) 2016-10-08 13:12:21 -07:00
Gary Helmling
bbaa0e851d HBASE-16657 addendum handle null RegionLoad 2016-10-07 14:05:30 -07:00
Dustin Pho
fa98c506d0 HBASE-16657 Expose per-region last major compaction time in RegionServer UI
Signed-off-by: Gary Helmling <garyh@apache.org>
2016-10-07 14:04:22 -07:00
Ashu Pachauri
7092dc3ec4 HBASE-16681: Flaky TestReplicationSourceManagerZkImpl
Change-Id: I6bf31eb2f3815079d346963ad78045f67e0f44b7
2016-10-06 16:58:08 -07:00
tedyu
a8fe9ed64f HBASE-16772 Add verbose option to VerifyReplication for logging good rows 2016-10-06 11:19:58 -07:00
Mikhail Antonov
afd3bc856e HBASE-16644 Errors when reading legit HFile Trailer of old (v2.0) format file 2016-10-04 17:52:08 -07:00
tedyu
252d7fae0a HBASE-16739 Timed out exception message should include encoded region name 2016-10-03 06:44:58 -07:00
tedyu
a4426a7c2d HBASE-16739 Timed out exception message should include encoded region name - Revert due to misspelled word in subject 2016-10-03 06:44:11 -07:00
tedyu
69a3fd2ebf HBASE-16739) imed out exception message should include encoded region name 2016-10-02 21:10:20 -07:00
Enis Soztutar
06cc123849 HBASE-16721 Concurrency issue in WAL unflushed seqId tracking - ADDENDUM to change WAL to extend Closeable 2016-10-01 14:50:54 -07:00
Enis Soztutar
911f9b9eb7 HBASE-16678 MapReduce jobs do not update counters from ScanMetrics
Conflicts:
	hbase-server/src/test/java/org/apache/hadoop/hbase/mapreduce/TestTableMapReduce.java
2016-09-29 17:14:55 -07:00
Enis Soztutar
bf5a7aba5c HBASE-16721 Concurrency issue in WAL unflushed seqId tracking 2016-09-29 14:51:14 -07:00
Sean Busbey
df25ebf84f HBASE-15984 Handle premature EOF treatment of WALs in replication.
In some particular deployments, the Replication code believes it has
reached EOF for a WAL prior to successfully parsing all bytes known to
exist in a cleanly closed file.

Consistently this failure happens due to an InvalidProtobufException
after some number of seeks during our attempts to tail the in-progress
RegionServer WAL. As a work-around, this patch treats cleanly closed
files differently than other execution paths. If an EOF is detected due
to parsing or other errors while there are still unparsed bytes before
the end-of-file trailer, we now reset the WAL to the very beginning and
attempt a clean read-through.

In current testing, a single such reset is sufficient to work around
observed dataloss. However, the above change will retry a given WAL file
indefinitely. On each such attempt, a log message like the below will
be emitted at the WARN level:

  Processing end of WAL file '{}'. At position {}, which is too far away
  from reported file length {}. Restarting WAL reading (see HBASE-15983
  for details).

Additionally, this patch adds more log detail at the TRACE
level about file offsets seen while handling recoverable errors. It also
adds metrics that measure the use of this recovery mechanism.
2016-09-29 10:47:57 -05:00
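The reset-and-reread behaviour described in the HBASE-15984 message above can be illustrated with a minimal Python sketch. This is not the HBase implementation: `SketchReader`, `read_closed_file`, the fixed 4-byte record layout, and the warning text are all hypothetical names and simplifications chosen for illustration. The sketch only shows the idea that, for a cleanly closed file, an apparent EOF short of the known end of data triggers a restart from the beginning, with no bound on the number of restarts.

```python
class SketchReader:
    """Hypothetical reader over fixed-size records; may fail partway through."""

    RECORD = 4  # assumed record size in bytes, for illustration only

    def __init__(self, data, fail_at=None):
        self.data = data
        self.position = 0
        self.fail_at = fail_at  # offset at which a parse failure is simulated

    def next_entry(self):
        if self.position + self.RECORD > len(self.data):
            return None  # genuine end of data
        if self.fail_at is not None and self.position >= self.fail_at:
            raise ValueError("simulated parse failure")
        entry = self.data[self.position:self.position + self.RECORD]
        self.position += self.RECORD
        return entry


def read_closed_file(open_reader, data_end, warn=print):
    """Read every entry of a cleanly closed file.

    open_reader() must return a fresh reader positioned at the start.
    data_end is the offset at which the entries are known to end.
    If reading stops before data_end, start over from the beginning.
    Returns (entries, number_of_resets).
    """
    resets = 0
    while True:  # retries indefinitely, as the message above describes
        reader = open_reader()
        entries = []
        try:
            while True:
                entry = reader.next_entry()
                if entry is None:
                    break
                entries.append(entry)
        except ValueError:
            pass  # a parse error is treated like an EOF and checked below
        if reader.position >= data_end:
            return entries, resets
        resets += 1
        warn(f"premature EOF at position {reader.position}, "
             f"expected {data_end}; restarting from the beginning")
```

In this sketch a first reader that fails after one record is followed by a clean second pass, so a single reset recovers all three records of a 12-byte input.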
tedyu
df57859258 HBASE-16725 Don't let flushThread hang in TestHRegion 2016-09-29 06:57:02 -07:00
tedyu
d4b5645a4e HBASE-16723 RMI registry is not destroyed after stopping JMX Connector Server (Pankaj Kumar) 2016-09-29 01:58:22 -07:00
Abhishek Singh Chouhan
96a8e8dce4 HBASE-16660 ArrayIndexOutOfBounds during the majorCompactionCheck in DateTieredCompaction
Signed-off-by: Andrew Purtell <apurtell@apache.org>
2016-09-27 11:08:00 -07:00
Andrew Purtell
67a43c3059 HBASE-16694 Reduce garbage for onDiskChecksum in HFileBlock (binlijin) 2016-09-26 13:55:28 -07:00
Matteo Bertozzi
4566e4df58 HBASE-16649 Truncate table with splits preserved can cause both data loss and truncated data appeared again 2016-09-26 13:09:06 -07:00
chenheng
d6f9eab4be HBASE-16677 Add table size (total store file size) to table page (Guang Yang) 2016-09-25 10:50:03 +08:00
Apekshit Sharma
9a510e9ba2 HBASE-16679 Flush throughput controller: Minor perf change and fix flaky TestFlushWithThroughputController.
Change-Id: I79f4a65a67bfdc46868d68ff8a0c32f3a02b8cb1
2016-09-23 11:47:12 -07:00
tedyu
b503843bae HBASE-16423 Add re-compare option to VerifyReplication to avoid occasional inconsistent rows (Jianwei Cui) 2016-09-23 05:08:19 -07:00
Enis Soztutar
8a797e81b8 HBASE-16604 Scanner retries on IOException can cause the scans to miss data 2016-09-22 18:48:06 -07:00
Ben Lau
6973304026 HBASE-16662 Fix open POODLE vulnerabilities
Signed-off-by: Andrew Purtell <apurtell@apache.org>
2016-09-22 19:59:08 +00:00
tedyu
ba650a8a10 HBASE-16675 Average region size may be incorrect when there is region whose RegionLoad cannot be retrieved 2016-09-22 07:34:29 -07:00
Yu Li
76da21fc59 HBASE-16670 Make RpcServer#processRequest logic more robust 2016-09-22 15:44:45 +08:00
Jonathan M Hsieh
13d6acbc7f HBASE-12088 Remove unused hadoop-1.0, hadoop-1.1 profiles from non-root poms 2016-09-21 20:52:19 -07:00
Apekshit Sharma
e7e660d5b2 HBASE-14734 Prevent BindException when setting up MiniKdc.
The port for the KDC service is selected in the constructor, but we bind to it later in MiniKdc.start() --> MiniKdc.initKDCServer() --> KdcServer.start(). In the meantime, some other service can capture the port, which results in a BindException. The solution here is to catch the exception and retry.

Testing methodology:
- Used python and intellij.
- breakpoint on kdc.start(1), in catch block(2) and just after catch block(3).
- used python to bind to the selected port on breakpoint 1 --> run the program --> stops at breakpoint 2 (catch block)
- On breakpoint 1 and after 2 failures, close the port --> run the program --> skips catch block and goes to breakpoint 3.

Change-Id: I4e06e69819d1ec9a0a7fa471bf017f3a72c75cb3
2016-09-21 16:44:14 -07:00
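The catch-and-retry approach described in the HBASE-14734 message above can be sketched in Python as follows. This is an illustration only, not the patch itself: `start_with_retry`, `FlakyService`, and the `max_attempts` parameter are hypothetical, and Python's `OSError` with `EADDRINUSE` stands in for Java's BindException. Whether the real patch picks a new port on each attempt is not stated in the message, so the sketch simply calls the start function again.

```python
import errno


def start_with_retry(start, max_attempts=3):
    """Call start() until it succeeds or max_attempts is exhausted.

    Only address-in-use errors are retried; any other error propagates.
    """
    last_error = None
    for _ in range(max_attempts):
        try:
            return start()
        except OSError as e:
            if e.errno != errno.EADDRINUSE:
                raise
            last_error = e  # port was taken by someone else; try again
    raise last_error


class FlakyService:
    """Hypothetical service whose port is occupied for the first N starts."""

    def __init__(self, failures):
        self.failures = failures
        self.calls = 0

    def start(self):
        self.calls += 1
        if self.calls <= self.failures:
            raise OSError(errno.EADDRINUSE, "Address already in use")
        return "started"
```

With `FlakyService(2)` and `max_attempts=3`, the third attempt succeeds, which mirrors the "after 2 failures, close the port" step in the testing methodology above.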
Umesh Agashe
23c5ea39bd HBASE-16294 hbck reporting "No HDFS region dir found" for replicas
Signed-off-by: Matteo Bertozzi <matteo.bertozzi@cloudera.com>
2016-09-21 14:43:15 -07:00
Jerry He
c80d671a06 HBASE-12949 Scanner can be stuck in infinite loop if the HFile is corrupted 2016-09-21 13:31:18 -07:00
Jerry He
abfb20206d HBASE-16647 hbck should do offline reference repair before online repair 2016-09-20 11:53:01 -07:00
zhangduo
91d03697d9 HBASE-16165 Decrease RpcServer.callQueueSize before writeResponse causes OOM (Guanghao Zhang) 2016-09-18 10:05:48 +08:00
tedyu
591cc4cfb8 HBASE-16349 TestClusterId may hang during cluster shutdown 2016-09-16 06:57:42 -07:00
Phil Yang
069d1f73fa HBASE-16388 Prevent client threads being blocked by only one slow region server
Signed-off-by: stack <stack@apache.org>
2016-09-14 09:28:20 -07:00
tedyu
8ad14bac67 HBASE-16616 Rpc handlers stuck on ThreadLocalMap.expungeStaleEntry (Tomu Tsuruhara) 2016-09-12 16:29:24 -07:00
chenheng
94026d0d09 HBASE-16086 TableCfWALEntryFilter and ScopeWALEntryFilter should not redundantly iterate over cells (Vincent Poon)
Conflicts:
    hbase-server/src/main/java/org/apache/hadoop/hbase/replication/ScopeWALEntryFilter.java
    hbase-server/src/main/java/org/apache/hadoop/hbase/replication/TableCfWALEntryFilter.java
    hbase-server/src/test/java/org/apache/hadoop/hbase/replication/TestReplicationWALEntryFilters.java
2016-09-11 10:48:00 +08:00
binlijin
ad67fd0816 HBASE-16530 Reduce DBE code duplication
Signed-off-by: Yu Li <liyu@apache.org>
2016-09-09 16:25:56 +08:00
zhangduo
bc4ab47147 HBASE-16309 TestDefaultCompactSelection.testCompactionRatio is flaky 2016-09-09 14:21:31 +08:00
chenheng
b4086795f2 HBASE-16570 Compute region locality in parallel at startup (binlijin) 2016-09-09 14:16:37 +08:00
Andrew Purtell
52963b3428 HBASE-16589 Adjust log level for FATAL messages from HBaseReplicationEndpoint that are not fatal 2016-09-08 13:03:49 -07:00
tedyu
354706a7d8 HBASE-16572 Sync method in RecoverableZooKeeper failed to pass callback function in (Allan Yang) 2016-09-07 11:36:18 -07:00
tedyu
b694b63ed7 HBASE-16460 Can't rebuild the BucketAllocator's data structures when BucketCache uses FileIOEngine (Guanghao Zhang) 2016-09-05 06:52:03 -07:00
tedyu
e1aab356b3 HBASE-16556 The read/write timeout are not used in HTable.delete(List), HTable.get(List), and HTable.existsAll(List) (ChiaPing Tsai) 2016-09-04 21:15:10 -07:00
Matteo Bertozzi
fa7731de6e HBASE-16552 MiniHBaseCluster#getServerWith() does not ignore stopped RSs 2016-09-02 16:52:57 -07:00
Andrew Purtell
8691520e33 HBASE-16399 Provide an API to get list of failed regions and servername in Canary (Vishal Khandelwal) 2016-09-02 10:43:20 -07:00
Jerry He
8be6f95f99 HBASE-16311 Audit log for delete snapshot operation is missing in case of snapshot owner deleting the same (Yi Liang) 2016-09-02 10:09:44 -07:00
Loknath Priyatham Teja Singamsetty
84b2431472 HBASE-16375 Mapreduce mini cluster using HBaseTestingUtility not setting correct resourcemanager and jobhistory webapp address of MapReduceTestingShim
Signed-off-by: Andrew Purtell <apurtell@apache.org>
Amending-Author: Andrew Purtell <apurtell@apache.org>

Conflicts:
	hbase-server/src/test/java/org/apache/hadoop/hbase/TestHBaseTestingUtility.java
2016-09-01 15:57:18 -07:00
tedyu
a034a2bdcb HBASE-16527 IOExceptions from DFS client still can cause CatalogJanitor to delete referenced files (Vladimir Rodionov) 2016-09-01 12:39:08 -07:00
Stephen Yuan Jiang
d91edc2ac4 HBASE-16528 Procedure-V2: ServerCrashProcedure misses owner information (Stephen Yuan Jiang)
Conflicts:
	hbase-server/src/main/java/org/apache/hadoop/hbase/master/ServerManager.java
2016-08-31 09:51:52 -07:00