Commit Graph

4846 Commits

Author SHA1 Message Date
Phil Yang dd0206ff31 Revert "HBASE-17314 Limit total buffered size for all replication sources"
This reverts commit e9444ed077.
2016-12-22 19:08:17 +08:00
tedyu 819d32f951 HBASE-17330 SnapshotFileCache will always refresh the file cache (Jianwei Cui) 2016-12-22 02:30:41 -08:00
tedyu f94180a3e9 HBASE-17341 Add a timeout during replication endpoint termination (Vincent Poon) 2016-12-21 08:27:45 -08:00
Phil Yang e9444ed077 HBASE-17314 Limit total buffered size for all replication sources 2016-12-21 13:10:29 +08:00
Vincent e79afbf0cb HBASE-17328 Properly dispose of looped replication peers
Signed-off-by: Andrew Purtell <apurtell@apache.org>

Conflicts:
	hbase-server/src/test/java/org/apache/hadoop/hbase/replication/TestMasterReplication.java
2016-12-20 18:01:33 -08:00
tedyu fa975fa382 HBASE-17347 ExportSnapshot may write snapshot info file to wrong directory when specifying target name (Jianwei Cui) 2016-12-20 15:58:28 -08:00
tedyu 0b69f59133 HBASE-17292 Add observer notification before bulk loaded hfile is moved to region directory 2016-12-20 13:23:27 -08:00
tedyu b9689808eb HBASE-17344 The regionserver web UIs miss the coprocessors of RegionServerCoprocessorHost (ChiaPing Tsai) 2016-12-20 08:55:07 -08:00
Guanghao Zhang 97d1ba4015 HBASE-17317 [branch-1] The updatePeerConfig method in ReplicationPeersZKImpl didn't update the table-cfs map 2016-12-20 18:51:26 +08:00
binlijin 4c7f0f2436 HBASE-17332 Replace HashMap to Array for DataBlockEncoding.idToEncoding 2016-12-20 12:28:04 +08:00
binlijin ed39396497 HBASE-16398 optimize HRegion computeHDFSBlocksDistribution 2016-12-16 13:57:43 +08:00
tedyu f3a3069796 HBASE-17319 Truncate table with preserve after split may cause truncation to fail (Allan Yang) 2016-12-15 13:45:11 -08:00
tedyu ffe70158cc HBASE-17318 Increment does not add new column if the increment amount is zero at first time writing (Guangxu Cheng) 2016-12-15 08:45:05 -08:00
Guanghao Zhang 4b3df0f926 HBASE-17296 Provide per peer throttling for replication
Signed-off-by: Phil Yang <yangzhe1991@apache.org>
2016-12-15 16:13:04 +08:00
tedyu e029c554bb HBASE-17302 The region flush request disappeared from flushQueue (Guangxu Cheng) 2016-12-13 12:46:42 -08:00
tedyu 1f9214bee7 HBASE-17297 Single Filter in parenthesis cannot be parsed correctly (Xuesen Liang) 2016-12-12 12:09:22 -08:00
tedyu 30576991bc HBASE-17289 Avoid adding a replication peer named "lock" 2016-12-12 09:38:55 -08:00
Josh Elser f13283bc05 HBASE-17276 Only log stacktraces for exceptions once for updates in a batch
For large batches of updates, repeatedly logging WrongRegionExceptions,
FailedSanityCheckExceptions, and/or NoSuchColumnFamilyExceptions can
easily dominate the contents of a RegionServer log. After the first
occurence of logging the full exception, switch to logging only the
message on the exception.
2016-12-11 15:40:22 -05:00
tedyu d87b05f040 HBASE-16336 Removing peers seems to be leaving spare queues (Guanghao Zhang) 2016-12-11 07:43:10 -08:00
Stephen Yuan Jiang e51584381a HBASE-17023 Region left unassigned due to AM and SSH each thinking others would do the assignment work (Stephen Yuan Jiang) 2016-12-10 15:05:00 -08:00
tedyu c2801a2ea8 HBASE-17256 Rpc handler monitoring will be removed when the task queue is full (Guangxu Cheng) 2016-12-05 12:14:20 -08:00
Ashu Pachauri cbdc9fcb8a Addendum HBASE-16209: Add an ExponentialBackOffPolicy so that we spread out the timing of open region retries in AssignmentManager.
Signed-off-by: zhangduo <zhangduo@apache.org>
2016-12-02 10:30:28 +08:00
Guanghao Zhang 682dd57cd6 HBASE-17205 Add a metric for the duration of region in transition 2016-12-01 10:32:24 -08:00
Jonathan M Hsieh 9da0d5d00e HBASE-17224 Fix lots of spelling errors in HBase logging and exception messages (Grant Sohn) 2016-12-01 09:10:03 -08:00
zhangduo c8ea82299c HBASE-17206 FSHLog may roll a new writer successfully with unflushed entries 2016-12-01 14:20:38 +08:00
Yu Li b3627ef51e HBASE-17212 Should add null checker on table name in HTable and RegionServerCallable constructor (addendum) 2016-12-01 13:32:33 +08:00
Guanghao Zhang f26b3bf5ba HBASE-17178 Add region balance throttling
Signed-off-by: Phil Yang <yangzhe1991@apache.org>
2016-11-30 18:07:02 +08:00
zhangduo af69783128 HBASE-17167 Pass mvcc to client when scan 2016-11-30 10:02:17 +08:00
Ashu Pachauri b8da9f83cb HBASE-16302 age of last shipped op and age of last applied op should be histograms
Signed-off-by: Ashish Singhi <ashishsinghi@apache.org>
2016-11-29 13:54:28 +05:30
Guanghao Zhang 7b2673db12 HBASE-16561 Add metrics about read/write/scan queue length and active read/write/scan handler count
Signed-off-by: zhangduo <zhangduo@apache.org>
2016-11-29 16:09:56 +08:00
zhangduo be042652aa Revert "HBASE-16561 Add metrics about read/write/scan queue length and active read/write/scan handler count"
Forget to add signoff

This reverts commit 5ec218dbc2.
2016-11-29 16:09:22 +08:00
Guanghao Zhang 5ec218dbc2 HBASE-16561 Add metrics about read/write/scan queue length and active read/write/scan handler count 2016-11-29 16:00:37 +08:00
Stephen Yuan Jiang cdf539a8e0 HBASE-17189 TestMasterObserver#wasModifyTableActionCalled uses wrong variables (Stephen Yuan Jiang) 2016-11-28 23:18:42 -08:00
Michael Stack 987205caf9 HBASE-17072 CPU usage starts to climb up to 90-100% when using G1GC
Removes ThreadLocal. Uses AtomicReference instead (based on patch
posted up in HBASE-10676 "Removing ThreadLocal of PrefetchedHeader in
HFileBlock.FSReaderV2 make higher perforamce of scan")

Signed-off-by: Michael Stack <stack@apache.org>
2016-11-28 13:41:17 -08:00
Jerry He 10c070825f HBASE-17116 [PerformanceEvaluation] Add option to configure block size (Yi Liang) 2016-11-27 19:13:08 -08:00
Jingcheng Du fb1ace490c HBASE-17095 The ClientSimpleScanner keeps retrying if the hfile is corrupt or cannot found 2016-11-22 11:26:57 +08:00
binlijin 2da5ed098e HBASE-17118 StoreScanner leaked in KeyValueHeap (addendum) 2016-11-21 20:55:52 +08:00
tedyu e8f056935b HBASE-17131 Avoid livelock caused by HRegion#processRowsWithLocks (ChiaPing Tsai) 2016-11-20 18:17:21 -08:00
Guanghao Zhang 3c45ff08d9 HBASE-17088 Refactor RWQueueRpcExecutor/BalancedQueueRpcExecutor/RpcExecutor
Signed-off-by: zhangduo <zhangduo@apache.org>
2016-11-19 14:18:06 +08:00
Esteban Gutierrez 19441937ea HBASE-17058 Lower epsilon used for jitter verification from HBASE-15324 2016-11-17 12:29:08 -08:00
tedyu c6f1b6e624 HBASE-17112 Prevent setting timestamp of delta operations the same as previous value's (Phil Yang) 2016-11-17 09:04:39 -08:00
tedyu d722b2aab7 HBASE-17118 StoreScanner leaked in KeyValueHeap (binlijin) 2016-11-17 08:46:11 -08:00
Andrew Purtell e54826434e HBASE-17044 Fix merge failed before creating merged region leaves meta inconsistent 2016-11-14 17:14:48 -08:00
tedyu b77bfe9d3b HBASE-17077 Don't copy the replication queue belonging to the peer which has been deleted (Guanghao Zhang) 2016-11-13 07:00:25 -08:00
tedyu 469462c850 HBASE-17062 RegionSplitter throws ClassCastException (Jeongdae Kim) 2016-11-11 07:18:22 -08:00
thiruvel 44ab659b93 HBASE-16962: Add readPoint to preCompactScannerOpen() and preFlushScannerOpen() API
Signed-off-by: anoopsamjohn <anoopsamjohn@gmail.com>
2016-11-11 11:09:55 +05:30
Yu Sun 18b31fdd32 HBASE-17020 keylen in midkey() dont computed correctly
Signed-off-by: Yu Li <liyu@apache.org>
2016-11-11 08:01:00 +08:00
Guanghao Zhang a6397e3b0c HBASE-16938 TableCFsUpdater maybe failed due to no write permission on peerNode
Signed-off-by: Enis Soztutar <enis@apache.org>
2016-11-10 14:07:20 -08:00
binlijin dac73eceb0 HBASE-16570 Compute region locality in parallel at startup (addendum)
Addendum mainly for:
1. Avoid interfering with block location cache in RegionLocationFinder
2. Avoid refreshing block lcoations during HMaster startup (or else the startup could be really slow)

Signed-off-by: Yu Li <liyu@apache.org>
2016-11-10 16:53:39 +08:00
Guanghao Zhang e929156f96 HBASE-16985 TestClusterId failed due to wrong hbase rootdir
Signed-off-by: Michael Stack <stack@apache.org>
2016-11-09 17:19:31 -08:00
tedyu 1e322e68a5 HBASE-17054 Compactor#preCreateCoprocScanner should be passed user 2016-11-09 07:00:19 -08:00
Charlie Qiangeng Xu d248d6b0b3 HBASE-17039 SimpleLoadBalancer schedules large amount of invalid region moves
Signed-off-by: Yu Li <liyu@apache.org>
2016-11-09 17:07:28 +08:00
Enis Soztutar 123d26ed90 HBASE-17017 Remove the current per-region latency histogram metrics 2016-11-08 18:31:12 -08:00
Michael Stack a70f73c1e1 HBASE-16983 TestMultiTableSnapshotInputFormat failing with Unable to create region directory: /tmp/... (Addendum by Guanghao Zhang) 2016-11-08 11:59:49 -08:00
tedyu 97276da9a7 HBASE-17010 Serial replication should handle daughter regions being assigned to another RS (Phil Yang) 2016-11-08 06:35:12 -08:00
Stephen Yuan Jiang 874cf5128f HBASE-17042 Remove 'public' keyword from MasterObserver interface (Stephen Yuan Jiang) 2016-11-07 17:51:44 -08:00
Enis Soztutar b99690ac6b HBASE-17033 LogRoller makes a lot of allocations unnecessarily 2016-11-07 14:31:30 -08:00
anoopsamjohn 2fdba6521d HBASE-16992 The usage of mutation from CP is weird. (ChiaPing Tsai) 2016-11-07 23:26:27 +05:30
tedyu ecd4de938b HBASE-17026 VerifyReplication log should distinguish whether good row key is result of revalidation 2016-11-07 07:41:55 -08:00
binlijin c4132bbf30 HBASE-16840 Reuse cell's timestamp and type in ScanQueryMatcher
Signed-off-by: Yu Li <liyu@apache.org>
2016-11-07 22:42:46 +08:00
Yu Li a67481209f Revert "HBASE-16840 Reuse cell's timestamp and type in ScanQueryMatcher" to update author
This reverts commit 4d8611fa0f.
2016-11-07 22:42:38 +08:00
Yu Li 4d8611fa0f HBASE-16840 Reuse cell's timestamp and type in ScanQueryMatcher
Signed-off-by: Yu Li <liyu@apache.org>
2016-11-07 21:38:03 +08:00
Mikhail Antonov 3063943c78 HBASE-17032 CallQueueTooBigException and CallDroppedException should not be triggering PFFE 2016-11-06 21:34:11 -08:00
tedyu 8ade73c0cb HBASE-16946 Provide Raw scan as an option in VerifyReplication (Sreekar Pallapothu) 2016-11-04 03:36:17 -07:00
binlijin 5bfec397d1 HBASE-16960 RegionServer hang when aborting
Signed-off-by: Yu Li <liyu@apache.org>
2016-11-04 14:08:42 +08:00
Apekshit Sharma 51ba7cfde3 HBASE-17006 Give name to existing threads.
Having thread names in logs and thread dumps greatly improve debugability. This patch is simply adding the names to the threads we spawn.

Change-Id: I6ff22cc3804bb81147dde3a8e9ab671633c6f6ce
2016-11-03 18:31:03 -07:00
Eddie Elizondo b4676d51ad HBASE-14329 Report region in transition should check all regions in request
Signed-off-by: Gary Helmling <garyh@apache.org>
2016-11-03 11:32:19 -07:00
Gary Helmling 4fe51ebaa7 HBASE-16964 Always clear successfully archived files from compacted files list 2016-11-03 09:28:22 -07:00
Yu Li 514add0086 HBASE-16972 Log more details for Scan#next request when responseTooSlow 2016-11-02 14:33:44 +08:00
Andrew Purtell 961b952171 HBASE-16980 TestRowProcessorEndpoint failing consistently (Yu Li) 2016-11-01 19:09:17 -07:00
Lars Hofhansl 6113f9a34f HBASE-16765 New SteppingRegionSplitPolicy, avoid too aggressive spread of regions for small tables. 2016-11-01 13:07:12 -07:00
Michael Stack b3ddea9fc4 HBASE-16983 TestMultiTableSnapshotInputFormat failing with Unable to
create region directory: /tmp/...
2016-10-31 21:48:49 -07:00
Guanghao Zhang 722be3a3d8 HBASE-16947 Some improvements for DumpReplicationQueues tool
Signed-off-by: Michael Stack <stack@apache.org>
2016-10-31 21:00:45 -07:00
Michael Stack a969f8d790 HBASE-16969 RegionCoprocessorServiceExec should override the toString() for debugging (ChiaPing Tsai) 2016-10-31 14:37:51 -07:00
Mikhail Antonov ea5b0a05d6 HBASE-16743 TestSimpleRpcScheduler#testCoDelScheduling is broke 2016-10-28 16:32:58 -07:00
Esteban Gutierrez 0a41493ac4 HBASE-16450 Shell tool to dump replication queues
New tool to dump existing replication peers, configurations and
queues when using HBase Replication. The tool provides two flags:

 --distributed  This flag will poll each RS for information about
                the replication queues being processed on this RS.
                By default this is not enabled and the information
                about the replication queues and configuration will
                be obtained from ZooKeeper.
 --hdfs         When --distributed is used, this flag will attempt
                to calculate the total size of the WAL files used
                by the replication queues. Since its possible that
                multiple peers can be configured this value can be
                overestimated.

Signed-off-by: Matteo Bertozzi <matteo.bertozzi@cloudera.com>
2016-10-27 21:38:42 -07:00
tedyu d4826e1665 HBASE-16886 hbase-client: scanner with reversed=true and small=true gets no result (huzheng) 2016-10-27 07:47:49 -07:00
Michael Stack 24a92ed63a HBASE-16948 Fix inconsistency between HRegion and Region javadoc on getRowLock 2016-10-26 16:11:26 -07:00
Guanghao Zhang 4edd8a63d2 HBASE-16939 ExportSnapshot: set owner and permission on right directory
Signed-off-by: Matteo Bertozzi <matteo.bertozzi@cloudera.com>
2016-10-25 14:11:59 -07:00
Guanghao Zhang ae502a9d5c HBASE-16910 Avoid NPE when starting StochasticLoadBalancer
Signed-off-by: Ashish Singhi <ashishsinghi@apache.org>
2016-10-25 11:58:41 +05:30
binlijin 16823ff55e HBASE-16931 Setting cell's seqId to zero in compaction flow might cause RS down.
Signed-off-by: Yu Li <liyu@apache.org>
2016-10-24 23:33:34 +08:00
Guanghao Zhang d76cc4c1f0 HBASE-16870 Add the metrics of replication sources which were transformed from other dead rs to ReplicationLoad
Signed-off-by: zhangduo <zhangduo@apache.org>
2016-10-22 14:27:01 +08:00
Guanghao Zhang e395bf91db HBASE-16815 Low scan ratio in RPC queue tuning triggers divide by zero exception
Signed-off-by: Matteo Bertozzi <matteo.bertozzi@cloudera.com>
2016-10-21 17:00:24 -07:00
Ashu Pachauri 0117ed9d78 HBASE-16752 addendum. Do not retry large request for client versions less than 1.3
Signed-off-by: Gary Helmling <garyh@apache.org>
2016-10-20 10:34:46 -07:00
Yu Li a7a4e17f1d HBASE-16698 Performance issue: handlers stuck waiting for CountDownLatch inside WALKey#getWriteEntry under high writing workload 2016-10-20 15:32:59 +08:00
Ashu Pachauri 33e89fa9cf HBASE-16752 Return error back to rpc client on exceeding rpc request size limit.
Signed-off-by: Gary Helmling <garyh@apache.org>
2016-10-19 15:50:42 -07:00
Huaxiang Sun a97aef5163 HBASE-16345 RpcRetryingCallerWithReadReplicas#call() should catch some RegionServer Exceptions
Fix logic for
1). how to handle exception while waiting for reply from the primary replica.
2). handle exception from replicas while waiting for a correct response.

Signed-off-by: Esteban Gutierrez <esteban@apache.org>
2016-10-19 14:22:42 -07:00
Gary Helmling ae151334ba HBASE-16754 All WALSplitter OutputSinks should process compaction events 2016-10-19 10:37:35 -07:00
Enis Soztutar 019c7f9303 HBASE-16824 Writer.flush() can be called on already closed streams in WAL roll 2016-10-18 19:14:20 -07:00
Guanghao Zhang 66941910bd HBASE-16653 Backport HBASE-11393 to branches which support namespace
Signed-off-by: chenheng <chenheng@apache.org>
2016-10-18 09:12:47 +08:00
tedyu 08498c6848 HBASE-16856 Exception message in SyncRunner.run() should print currentSequence (Allan Yang) 2016-10-17 07:23:02 -07:00
tedyu a4cbdede31 HBASE-16853 Regions are assigned to Region Servers in /hbase/draining after HBase Master failover (David Pope) 2016-10-16 18:54:59 -07:00
Pankaj Kumar b7f283c6f6 HBASE-16724 Snapshot owner can't clone
Signed-off-by: Ashish Singhi <ashishsinghi@apache.org>
2016-10-15 11:57:00 +05:30
tedyu 57d3e9e756 HBASE-16816 HMaster.move() should throw exception if region to move is not online (Allan Yang) 2016-10-13 11:11:11 -07:00
tedyu ca581874b9 HBASE-16810 HBase Balancer throws ArrayIndexOutOfBoundsException when regionservers are in /hbase/draining znode and unloaded (David Pope) 2016-10-13 08:42:49 -07:00
Pankaj Kumar 27398ea525 HBASE-16807, RegionServer will fail to report new active Hmaster until HMaster/RegionServer failover.
Signed-off-by: chenheng <chenheng@apache.org>
2016-10-13 22:17:28 +08:00
Phil Yang 8f9fadf021 HBASE-16664 Timeout logic in AsyncProcess is broken
Signed-off-by: chenheng <chenheng@apache.org>
2016-10-13 17:14:52 +08:00
tedyu e2278f9544 HBASE-16716 OfflineMetaRepair leaves empty directory inside /hbase/WALs which remains forever (Pankaj Kumar) 2016-10-12 19:42:18 -07:00
Phil Yang a3c29da0b6 HBASE-16505 Pass deadline to HRegion operations
Signed-off-by: Michael Stack <stack@apache.org>
2016-10-12 12:17:38 -07:00
tedyu 3830890635 HBASE-16801 The Append/Increment may return the data from future (ChiaPing Tsai) 2016-10-12 10:07:37 -07:00
tedyu bf03827196 HBASE-15109 HM/RS failed to start when "fs.hdfs.impl.disable.cache" is set to true (Pankaj Kumar) 2016-10-12 07:35:50 -07:00
Pankaj Kumar 6db4ef8479 HBASE-16663 JMX ConnectorServer stopped when unauthorized user try to stop HM/RS/cluster
Signed-off-by: Andrew Purtell <apurtell@apache.org>

Conflicts:

	hbase-server/src/main/java/org/apache/hadoop/hbase/master/MasterCoprocessorHost.java
2016-10-11 16:17:07 -07:00
Gary Helmling 4f29c23038 HBASE-16146 Remove thread local usage in Counter 2016-10-11 14:47:02 -07:00
tedyu b47ded3b42 HBASE-16803 Make hbase:acl table unsplittable - revert pending review comments 2016-10-11 12:51:58 -07:00
tedyu 408a9eb8a3 HBASE-16803 Make hbase:acl table unsplittable 2016-10-11 12:50:05 -07:00
Gary Helmling f13a21696f HBASE-16788 addendum Account for HStore archiveLock in heap size 2016-10-11 11:46:12 -07:00
Gary Helmling 89bef67d0c HBASE-16788 Guard HFile archiving under a separate lock 2016-10-10 16:06:55 -07:00
Dustin Pho 59ca4dad70 HBASE-16661 Add last major compaction age to per-region metrics
Signed-off-by: Gary Helmling <garyh@apache.org>
2016-10-10 15:21:53 -07:00
Huaxiang Sun 66038b8c1a HBASE-16699 Overflows in AverageIntervalRateLimiter's refill() and getWaitInterval()
Signed-off-by: Matteo Bertozzi <matteo.bertozzi@cloudera.com>
2016-10-10 14:12:22 -07:00
Sean Busbey acb1392b15 HBASE-16701 rely on test category timeout instead of defining one on a specific test.
Signed-off-by: Umesh Agashe <uagashe@cloudera.com>
Signed-off-by: Yu Li <liyu@apache.org>
2016-10-10 00:24:24 -05:00
tedyu 364a57a950 HBASE-16771 VerifyReplication should increase GOODROWS counter if re-comparison passes 2016-10-09 20:51:23 -07:00
tedyu 4b75614a2e HBASE-16773 AccessController should access local region if possible 2016-10-08 21:17:15 -07:00
tedyu e043d450ed HBASE-16768 Inconsistent results from the Append/Increment (ChiaPing Tsai) 2016-10-08 13:12:21 -07:00
Gary Helmling bbaa0e851d HBASE-16657 addendum handle null RegionLoad 2016-10-07 14:05:30 -07:00
Dustin Pho fa98c506d0 HBASE-16657 Expose per-region last major compaction time in RegionServer UI
Signed-off-by: Gary Helmling <garyh@apache.org>
2016-10-07 14:04:22 -07:00
Ashu Pachauri 7092dc3ec4 HBASE-16681: Flaky TestReplicationSourceManagerZkImpl
Change-Id: I6bf31eb2f3815079d346963ad78045f67e0f44b7
2016-10-06 16:58:08 -07:00
tedyu a8fe9ed64f HBASE-16772 Add verbose option to VerifyReplication for logging good rows 2016-10-06 11:19:58 -07:00
Mikhail Antonov afd3bc856e HBASE-16644 Errors when reading legit HFile Trailer of old (v2.0) format file 2016-10-04 17:52:08 -07:00
tedyu 252d7fae0a HBASE-16739 Timed out exception message should include encoded region name 2016-10-03 06:44:58 -07:00
tedyu a4426a7c2d HBASE-16739 Timed out exception message should include encoded region name - Revert due to misspelled word in subject 2016-10-03 06:44:11 -07:00
tedyu 69a3fd2ebf HBASE-16739) imed out exception message should include encoded region name 2016-10-02 21:10:20 -07:00
Enis Soztutar 06cc123849 HBASE-16721 Concurrency issue in WAL unflushed seqId tracking - ADDENDUM to change WAL to extend Closeable 2016-10-01 14:50:54 -07:00
Enis Soztutar 911f9b9eb7 HBASE-16678 MapReduce jobs do not update counters from ScanMetrics
Conflicts:
	hbase-server/src/test/java/org/apache/hadoop/hbase/mapreduce/TestTableMapReduce.java
2016-09-29 17:14:55 -07:00
Enis Soztutar bf5a7aba5c HBASE-16721 Concurrency issue in WAL unflushed seqId tracking 2016-09-29 14:51:14 -07:00
Sean Busbey df25ebf84f HBASE-15984 Handle premature EOF treatment of WALs in replication.
In some particular deployments, the Replication code believes it has
reached EOF for a WAL prior to succesfully parsing all bytes known to
exist in a cleanly closed file.

Consistently this failure happens due to an InvalidProtobufException
after some number of seeks during our attempts to tail the in-progress
RegionServer WAL. As a work-around, this patch treats cleanly closed
files differently than other execution paths. If an EOF is detected due
to parsing or other errors while there are still unparsed bytes before
the end-of-file trailer, we now reset the WAL to the very beginning and
attempt a clean read-through.

In current testing, a single such reset is sufficient to work around
observed dataloss. However, the above change will retry a given WAL file
indefinitely. On each such attempt, a log message like the below will
be emitted at the WARN level:

  Processing end of WAL file '{}'. At position {}, which is too far away
  from reported file length {}. Restarting WAL reading (see HBASE-15983
  for details).

Additionally, this patch adds some additional log detail at the TRACE
level about file offsets seen while handling recoverable errors. It also
add metrics that measure the use of this recovery mechanism.
2016-09-29 10:47:57 -05:00
tedyu df57859258 HBASE-16725 Don't let flushThread hang in TestHRegion 2016-09-29 06:57:02 -07:00
tedyu d4b5645a4e HBASE-16723 RMI registry is not destroyed after stopping JMX Connector Server (Pankaj Kumar) 2016-09-29 01:58:22 -07:00
Abhishek Singh Chouhan 96a8e8dce4 HBASE-16660 ArrayIndexOutOfBounds during the majorCompactionCheck in DateTieredCompaction
Signed-off-by: Andrew Purtell <apurtell@apache.org>
2016-09-27 11:08:00 -07:00
Andrew Purtell 67a43c3059 HBASE-16694 Reduce garbage for onDiskChecksum in HFileBlock (binlijin) 2016-09-26 13:55:28 -07:00
Matteo Bertozzi 4566e4df58 HBASE-16649 Truncate table with splits preserved can cause both data loss and truncated data appeared again 2016-09-26 13:09:06 -07:00
chenheng d6f9eab4be HBASE-16677 Add table size (total store file size) to table page (Guang Yang) 2016-09-25 10:50:03 +08:00
Apekshit Sharma 9a510e9ba2 HBASE-16679 Flush throughput controller: Minor perf change and fix flaky TestFlushWithThroughputController.
Change-Id: I79f4a65a67bfdc46868d68ff8a0c32f3a02b8cb1
2016-09-23 11:47:12 -07:00
tedyu b503843bae HBASE-16423 Add re-compare option to VerifyReplication to avoid occasional inconsistent rows (Jianwei Cui) 2016-09-23 05:08:19 -07:00
Enis Soztutar 8a797e81b8 HBASE-16604 Scanner retries on IOException can cause the scans to miss data 2016-09-22 18:48:06 -07:00
Ben Lau 6973304026 HBASE-16662 Fix open POODLE vulnerabilities
Signed-off-by: Andrew Purtell <apurtell@apache.org>
2016-09-22 19:59:08 +00:00
tedyu ba650a8a10 HBASE-16675 Average region size may be incorrect when there is region whose RegionLoad cannot be retrieved 2016-09-22 07:34:29 -07:00
Yu Li 76da21fc59 HBASE-16670 Make RpcServer#processRequest logic more robust 2016-09-22 15:44:45 +08:00
Jonathan M Hsieh 13d6acbc7f HBASE-12088 Remove unused hadoop-1.0, hadoop-1.1 profiles from non-root poms 2016-09-21 20:52:19 -07:00
Apekshit Sharma e7e660d5b2 HBASE-14734 Prevent BindException when setting up MiniKdc.
Port for kdc service gets selected in the constructor, but we bind to it later in MiniKdc.start()-->MiniKdc.initKDCServer() --> KdcServer.start(). In meantime, some other service can capture the port which results in BindException. The solution here is to catch the exception and retry.

Testing methodology:
- Used python and intellij.
- breakpoint on kdc.start(1), in catch block(2) and just after catch block(3).
- used python to bind to the selected port on breakpoint 1 --> run the program --> stops at breakpoint 2 (catch block)
- On breakpoint 1 and after 2 failures, close the port --> run the program --> skips catch block and goes to breakpoint 3.

Change-Id: I4e06e69819d1ec9a0a7fa471bf017f3a72c75cb3
2016-09-21 16:44:14 -07:00
Umesh Agashe 23c5ea39bd HBASE-16294 hbck reporting "No HDFS region dir found" for replicas
Signed-off-by: Matteo Bertozzi <matteo.bertozzi@cloudera.com>
2016-09-21 14:43:15 -07:00
Jerry He c80d671a06 HBASE-12949 Scanner can be stuck in infinite loop if the HFile is corrupted 2016-09-21 13:31:18 -07:00
Jerry He abfb20206d HBASE-16647 hbck should do offline reference repair before online repair 2016-09-20 11:53:01 -07:00
zhangduo 91d03697d9 HBASE-16165 Decrease RpcServer.callQueueSize before writeResponse causes OOM (Guanghao Zhang) 2016-09-18 10:05:48 +08:00
tedyu 591cc4cfb8 HBASE-16349 TestClusterId may hang during cluster shutdown 2016-09-16 06:57:42 -07:00
Phil Yang 069d1f73fa HBASE-16388 Prevent client threads being blocked by only one slow region server
Signed-off-by: stack <stack@apache.org>
2016-09-14 09:28:20 -07:00
tedyu 8ad14bac67 HBASE-16616 Rpc handlers stuck on ThreadLocalMap.expungeStaleEntry (Tomu Tsuruhara) 2016-09-12 16:29:24 -07:00
chenheng 94026d0d09 HBASE-16086 TableCfWALEntryFilter and ScopeWALEntryFilter should not redundantly iterate over cells (Vincent Poon)
Conflicts:
    hbase-server/src/main/java/org/apache/hadoop/hbase/replication/ScopeWALEntryFilter.java
    hbase-server/src/main/java/org/apache/hadoop/hbase/replication/TableCfWALEntryFilter.java
    hbase-server/src/test/java/org/apache/hadoop/hbase/replication/TestReplicationWALEntryFilters.java
2016-09-11 10:48:00 +08:00
binlijin ad67fd0816 HBASE-16530 Reduce DBE code duplication
Signed-off-by: Yu Li <liyu@apache.org>
2016-09-09 16:25:56 +08:00
zhangduo bc4ab47147 HBASE-16309 TestDefaultCompactSelection.testCompactionRatio is flaky 2016-09-09 14:21:31 +08:00
chenheng b4086795f2 HBASE-16570 Compute region locality in parallel at startup (binlijin) 2016-09-09 14:16:37 +08:00
Andrew Purtell 52963b3428 HBASE-16589 Adjust log level for FATAL messages from HBaseReplicationEndpoint that are not fatal 2016-09-08 13:03:49 -07:00
tedyu 354706a7d8 HBASE-16572 Sync method in RecoverableZooKeeper failed to pass callback function in (Allan Yang) 2016-09-07 11:36:18 -07:00
tedyu b694b63ed7 HBASE-16460 Can't rebuild the BucketAllocator's data structures when BucketCache uses FileIOEngine (Guanghao Zhang) 2016-09-05 06:52:03 -07:00
tedyu e1aab356b3 HBASE-16556 The read/write timeout are not used in HTable.delete(List), HTable.get(List), and HTable.existsAll(List) (ChiaPing Tsai) 2016-09-04 21:15:10 -07:00
Matteo Bertozzi fa7731de6e HBASE-16552 MiniHBaseCluster#getServerWith() does not ignore stopped RSs 2016-09-02 16:52:57 -07:00
Andrew Purtell 8691520e33 HBASE-16399 Provide an API to get list of failed regions and servername in Canary (Vishal Khandelwal) 2016-09-02 10:43:20 -07:00
Jerry He 8be6f95f99 HBASE-16311 Audit log for delete snapshot operation is missing in case of snapshot owner deleting the same (Yi Liang) 2016-09-02 10:09:44 -07:00
Loknath Priyatham Teja Singamsetty 84b2431472 HBASE-16375 Mapreduce mini cluster using HBaseTestingUtility not setting correct resourcemanager and jobhistory webapp address of MapReduceTestingShim
Signed-off-by: Andrew Purtell <apurtell@apache.org>
Amending-Author: Andrew Purtell <apurtell@apache.org>

Conflicts:
	hbase-server/src/test/java/org/apache/hadoop/hbase/TestHBaseTestingUtility.java
2016-09-01 15:57:18 -07:00
tedyu a034a2bdcb HBASE-16527 IOExceptions from DFS client still can cause CatalogJanitor to delete referenced files (Vladimir Rodionov) 2016-09-01 12:39:08 -07:00
Stephen Yuan Jiang d91edc2ac4 HBASE-16528 Procedure-V2: ServerCrashProcedure misses owner information (Stephen Yuan Jiang)
Conflicts:
	hbase-server/src/main/java/org/apache/hadoop/hbase/master/ServerManager.java
2016-08-31 09:51:52 -07:00
tedyu 9907a7e2a2 HBASE-16502 Reduce garbage in BufferedDataBlockEncoder - addendum adopts Hiroshi's suggestion (binlijin) 2016-08-30 18:06:24 -07:00
Matteo Bertozzi 57c6384b09 HBASE-16522 Procedure v2 - Cache system user and avoid IOException 2016-08-30 15:35:08 -07:00
chenheng ce9b6c0abf HBASE-16224 Reduce the number of RPCs for the large PUTs (ChiaPing Tsai) 2016-08-30 13:30:23 +08:00
anoopsamjohn c899897bc8 HBASE-16213 A new HFileBlock structure for fast random get. (binlijin) 2016-08-29 12:11:46 +05:30
Matteo Bertozzi 35fa341913 HBASE-16508 Move UnexpectedStateException to common 2016-08-26 18:37:59 -07:00
tedyu b59f18c2af HBASE-16502 Reduce garbage in BufferedDataBlockEncoder (binlijin) 2016-08-25 20:39:42 -07:00
tedyu c78a5fd650 HBASE-16409 Addendum fixes the case where delimiter is not specified by user 2016-08-25 11:54:42 -07:00
Stephen Yuan Jiang 86d570b959 HBASE-16486 Unify system table creation using the same createSystemTable API (Stephen Yuan Jiang) 2016-08-24 21:59:33 -07:00
tedyu f9490aaf43 HBASE-16304 HRegion#RegionScannerImpl#handleFileNotFoundException may lead to deadlock when trying to obtain write lock on updatesLock 2016-08-24 10:57:14 -07:00
Enis Soztutar f0385b4b83 HBASE-16270 Handle duplicate clearing of snapshot in region replicas (Robert Yokota) 2016-08-24 06:43:19 -07:00
Ashish Singhi 3606b890f8 HBASE-16471 Region Server metrics context will be wrong when machine hostname contain "master" word (Pankaj Kumar) 2016-08-24 19:01:58 +05:30
chenheng de2f653caf HBASE-16464 archive folder grows bigger and bigger due to corrupt snapshot under tmp dir 2016-08-24 13:43:18 +08:00
Geoffrey 6e9b49cac7 HBASE-16448 Custom metrics for custom replication endpoints
Signed-off-by: Andrew Purtell <apurtell@apache.org>
2016-08-23 17:17:42 -07:00
Matteo Bertozzi a1e57d942e HBASE-16360 TableMapReduceUtil addHBaseDependencyJars has the wrong class name for PrefixTreeCodec (Jing Pu Chen) 2016-08-23 14:31:58 -07:00
Apekshit Sharma a8909f5cba HBASE-16467 Move AbstractHBaseTool to hbase-common.
Change-Id: Ifb81c15b89b308f53e6ee8a76e18189b5b526cd3
2016-08-22 16:32:46 -07:00
tedyu 2a35019a31 HBASE-16455 Provide API for obtaining all the WAL files 2016-08-22 06:56:14 -07:00
tedyu d260108e39 HBASE-16409 Row key for bad row should be properly delimited in VerifyReplication 2016-08-20 17:18:27 -07:00
Yu Li e05b002ed5 HBASE-16429 FSHLog: deadlock if rollWriter called when ring buffer filled with appends 2016-08-18 10:03:46 +08:00
chenheng 3d7bcd020e HBASE-15635 Mean age of Blocks in cache (seconds) on webUI should be greater than zero
Signed-off-by: stack <stack@apache.org>
2016-08-17 10:59:58 -07:00
tedyu a944a19879 HBASE-16419 check REPLICATION_SCOPE's value more stringently (Guangxu Cheng) 2016-08-17 09:01:54 -07:00
stack 6976a7b312 HBASE-16341 Missing bit on "Regression: Random Read/WorkloadC slower in 1.x than 0.98" 2016-08-15 13:29:36 -07:00
Phil Yang e51df6723c HBASE-16368 test*WhenRegionMove in TestPartialResultsFromClientSide is flaky
Signed-off-by: zhangduo <zhangduo@apache.org>
2016-08-10 18:13:20 +08:00
Phil Yang 38044ada32 HBASE-16285 Drop RPC requests if it must be considered as timeout at client
Signed-off-by: zhangduo <zhangduo@apache.org>
2016-08-10 16:16:15 +08:00
Phil Yang 441bc050b9 HBASE-9465 Push entries to peer clusters serially
Signed-off-by: zhangduo <zhangduo@apache.org>
2016-08-09 15:26:15 +08:00
stack 25c4ff5404 HBASE-9899 for idempotent operation dups, return the result instead of throwing conflict exception (Guanghao Zhang) 2016-08-08 21:44:37 -07:00
tedyu 0a1a0a6883 HBASE-16367 Race between master and region server initialization may lead to premature server abort - addendum shortens the waiting duration 2016-08-08 10:41:14 -07:00
tedyu 225383d321 HBASE-16367 Race between master and region server initialization may lead to premature server abort 2016-08-08 08:42:05 -07:00
Phil Yang 0354dbf1ec HBASE-12770 Don't transfer all the queued hlogs of a dead server to the same alive server
Signed-off-by: zhangduo <zhangduo@apache.org>
2016-08-08 16:32:43 +08:00
Vivek ec99838b9c HBASE-15866 Split hbase.rpc.timeout into *.read.timeout and *.write.timeout
Signed-off-by: Andrew Purtell <apurtell@apache.org>
Amending-Author: Andrew Purtell <apurtell@apache.org>

Conflicts:
	hbase-client/src/main/java/org/apache/hadoop/hbase/client/AsyncProcess.java
	hbase-client/src/main/java/org/apache/hadoop/hbase/client/ConnectionImplementation.java
	hbase-client/src/main/java/org/apache/hadoop/hbase/client/HTable.java
	hbase-client/src/main/java/org/apache/hadoop/hbase/client/Table.java
	hbase-server/src/main/java/org/apache/hadoop/hbase/client/HTableWrapper.java
	hbase-server/src/test/java/org/apache/hadoop/hbase/client/HConnectionTestingUtility.java
	hbase-server/src/test/java/org/apache/hadoop/hbase/client/TestHCM.java
2016-08-06 10:38:41 -07:00
Sean Busbey 6b233c4332 HBASE-8386 deprecate TableMapReduce.addDependencyJars(Configuration, class<?> ...)
Signed-off-by: Matteo Bertozzi <matteo.bertozzi@cloudera.com>

 Conflicts:
	hbase-server/src/main/java/org/apache/hadoop/hbase/mapreduce/TableMapReduceUtil.java
2016-08-05 14:08:50 -05:00
Sean Busbey a5f98c3f9e HBASE-16321 ensure no findbugs-jsr305
Signed-off-by: Esteban Gutierrez <esteban@apache.org>

 Conflicts:
	hbase-client/src/main/java/org/apache/hadoop/hbase/MetaTableAccessor.java
	hbase-client/src/main/java/org/apache/hadoop/hbase/client/ConnectionImplementation.java
	hbase-rsgroup/pom.xml
	hbase-server/pom.xml
	hbase-server/src/main/java/org/apache/hadoop/hbase/coprocessor/ObserverContext.java
	hbase-server/src/main/java/org/apache/hadoop/hbase/master/TableStateManager.java
	hbase-server/src/main/java/org/apache/hadoop/hbase/util/FSTableDescriptors.java
	hbase-server/src/test/java/org/apache/hadoop/hbase/HBaseTestingUtility.java
	hbase-spark/pom.xml
	src/main/asciidoc/_chapters/developer.adoc
2016-08-05 14:08:50 -05:00
tedyu b1c60940da HBASE-16359 Addendum fixes TestZKBasedOpenCloseRegion 2016-08-05 10:54:20 -07:00
tedyu 00f6af2e85 HBASE-16359 NullPointerException in RSRpcServices.openRegion() 2016-08-05 03:21:23 -07:00
Ramkrishna 6278ece8b6 HBASE-16303 FilterList with MUST_PASS_ONE optimization (Ram) 2016-08-05 10:59:19 +05:30
chenheng 9c8cbd44ed HBASE-16287 LruBlockCache size should not exceed acceptableSize too many(Yu Sun) 2016-08-04 21:23:27 +08:00
Lim Chee Hau 96cce2dcc5 HBASE-15574 Fix typo in HRegionServerCommandLine
Signed-off-by: Sean Busbey <busbey@apache.org>
2016-08-03 23:45:55 -05:00
Nick Dimiduk 92b2973b62 HBASE-16317 revert all ESAPI changes
Revert "HBASE-15270 Use appropriate encoding for "filter" field in TaskMonitorTmpl.jamon."

This reverts commit ab1b2c6603.

Revert "HBASE-15122 Servlets generate XSS_REQUEST_PARAMETER_TO_SERVLET_WRITER findbugs warnings (Samir Ahmic)"

This reverts commit 29ce46a67f.

 Conflicts:
	hbase-server/pom.xml
2016-08-03 20:53:46 -07:00
Enis Soztutar d14ec71323 HBASE-16350 Undo server abort from HBASE-14968 2016-08-03 19:38:04 -07:00
Enis Soztutar 12549de43c HBASE-16301 Trigger flush without waiting when compaction is disabled on a table (huaxiang sun)
Conflicts:
	hbase-server/src/test/java/org/apache/hadoop/hbase/regionserver/TestCompactSplitThread.java
2016-08-03 10:55:08 -07:00
Andrew Purtell b06119edef HBASE-16319 Fix TestCacheOnWrite after HBASE-16288 2016-08-02 16:53:27 -07:00