Commit Graph

4701 Commits

Author SHA1 Message Date
thiruvel 44ab659b93 HBASE-16962: Add readPoint to preCompactScannerOpen() and preFlushScannerOpen() API
Signed-off-by: anoopsamjohn <anoopsamjohn@gmail.com>
2016-11-11 11:09:55 +05:30
Yu Sun 18b31fdd32 HBASE-17020 keylen in midkey() dont computed correctly
Signed-off-by: Yu Li <liyu@apache.org>
2016-11-11 08:01:00 +08:00
Guanghao Zhang a6397e3b0c HBASE-16938 TableCFsUpdater maybe failed due to no write permission on peerNode
Signed-off-by: Enis Soztutar <enis@apache.org>
2016-11-10 14:07:20 -08:00
binlijin dac73eceb0 HBASE-16570 Compute region locality in parallel at startup (addendum)
Addendum mainly for:
1. Avoid interfering with block location cache in RegionLocationFinder
2. Avoid refreshing block lcoations during HMaster startup (or else the startup could be really slow)

Signed-off-by: Yu Li <liyu@apache.org>
2016-11-10 16:53:39 +08:00
Guanghao Zhang e929156f96 HBASE-16985 TestClusterId failed due to wrong hbase rootdir
Signed-off-by: Michael Stack <stack@apache.org>
2016-11-09 17:19:31 -08:00
tedyu 1e322e68a5 HBASE-17054 Compactor#preCreateCoprocScanner should be passed user 2016-11-09 07:00:19 -08:00
Charlie Qiangeng Xu d248d6b0b3 HBASE-17039 SimpleLoadBalancer schedules large amount of invalid region moves
Signed-off-by: Yu Li <liyu@apache.org>
2016-11-09 17:07:28 +08:00
Enis Soztutar 123d26ed90 HBASE-17017 Remove the current per-region latency histogram metrics 2016-11-08 18:31:12 -08:00
Michael Stack a70f73c1e1 HBASE-16983 TestMultiTableSnapshotInputFormat failing with Unable to create region directory: /tmp/... (Addendum by Guanghao Zhang) 2016-11-08 11:59:49 -08:00
tedyu 97276da9a7 HBASE-17010 Serial replication should handle daughter regions being assigned to another RS (Phil Yang) 2016-11-08 06:35:12 -08:00
Stephen Yuan Jiang 874cf5128f HBASE-17042 Remove 'public' keyword from MasterObserver interface (Stephen Yuan Jiang) 2016-11-07 17:51:44 -08:00
Enis Soztutar b99690ac6b HBASE-17033 LogRoller makes a lot of allocations unnecessarily 2016-11-07 14:31:30 -08:00
anoopsamjohn 2fdba6521d HBASE-16992 The usage of mutation from CP is weird. (ChiaPing Tsai) 2016-11-07 23:26:27 +05:30
tedyu ecd4de938b HBASE-17026 VerifyReplication log should distinguish whether good row key is result of revalidation 2016-11-07 07:41:55 -08:00
binlijin c4132bbf30 HBASE-16840 Reuse cell's timestamp and type in ScanQueryMatcher
Signed-off-by: Yu Li <liyu@apache.org>
2016-11-07 22:42:46 +08:00
Yu Li a67481209f Revert "HBASE-16840 Reuse cell's timestamp and type in ScanQueryMatcher" to update author
This reverts commit 4d8611fa0f.
2016-11-07 22:42:38 +08:00
Yu Li 4d8611fa0f HBASE-16840 Reuse cell's timestamp and type in ScanQueryMatcher
Signed-off-by: Yu Li <liyu@apache.org>
2016-11-07 21:38:03 +08:00
Mikhail Antonov 3063943c78 HBASE-17032 CallQueueTooBigException and CallDroppedException should not be triggering PFFE 2016-11-06 21:34:11 -08:00
tedyu 8ade73c0cb HBASE-16946 Provide Raw scan as an option in VerifyReplication (Sreekar Pallapothu) 2016-11-04 03:36:17 -07:00
binlijin 5bfec397d1 HBASE-16960 RegionServer hang when aborting
Signed-off-by: Yu Li <liyu@apache.org>
2016-11-04 14:08:42 +08:00
Apekshit Sharma 51ba7cfde3 HBASE-17006 Give name to existing threads.
Having thread names in logs and thread dumps greatly improve debugability. This patch is simply adding the names to the threads we spawn.

Change-Id: I6ff22cc3804bb81147dde3a8e9ab671633c6f6ce
2016-11-03 18:31:03 -07:00
Eddie Elizondo b4676d51ad HBASE-14329 Report region in transition should check all regions in request
Signed-off-by: Gary Helmling <garyh@apache.org>
2016-11-03 11:32:19 -07:00
Gary Helmling 4fe51ebaa7 HBASE-16964 Always clear successfully archived files from compacted files list 2016-11-03 09:28:22 -07:00
Yu Li 514add0086 HBASE-16972 Log more details for Scan#next request when responseTooSlow 2016-11-02 14:33:44 +08:00
Andrew Purtell 961b952171 HBASE-16980 TestRowProcessorEndpoint failing consistently (Yu Li) 2016-11-01 19:09:17 -07:00
Lars Hofhansl 6113f9a34f HBASE-16765 New SteppingRegionSplitPolicy, avoid too aggressive spread of regions for small tables. 2016-11-01 13:07:12 -07:00
Michael Stack b3ddea9fc4 HBASE-16983 TestMultiTableSnapshotInputFormat failing with Unable to
create region directory: /tmp/...
2016-10-31 21:48:49 -07:00
Guanghao Zhang 722be3a3d8 HBASE-16947 Some improvements for DumpReplicationQueues tool
Signed-off-by: Michael Stack <stack@apache.org>
2016-10-31 21:00:45 -07:00
Michael Stack a969f8d790 HBASE-16969 RegionCoprocessorServiceExec should override the toString() for debugging (ChiaPing Tsai) 2016-10-31 14:37:51 -07:00
Mikhail Antonov ea5b0a05d6 HBASE-16743 TestSimpleRpcScheduler#testCoDelScheduling is broke 2016-10-28 16:32:58 -07:00
Esteban Gutierrez 0a41493ac4 HBASE-16450 Shell tool to dump replication queues
New tool to dump existing replication peers, configurations and
queues when using HBase Replication. The tool provides two flags:

 --distributed  This flag will poll each RS for information about
                the replication queues being processed on this RS.
                By default this is not enabled and the information
                about the replication queues and configuration will
                be obtained from ZooKeeper.
 --hdfs         When --distributed is used, this flag will attempt
                to calculate the total size of the WAL files used
                by the replication queues. Since its possible that
                multiple peers can be configured this value can be
                overestimated.

Signed-off-by: Matteo Bertozzi <matteo.bertozzi@cloudera.com>
2016-10-27 21:38:42 -07:00
tedyu d4826e1665 HBASE-16886 hbase-client: scanner with reversed=true and small=true gets no result (huzheng) 2016-10-27 07:47:49 -07:00
Michael Stack 24a92ed63a HBASE-16948 Fix inconsistency between HRegion and Region javadoc on getRowLock 2016-10-26 16:11:26 -07:00
Guanghao Zhang 4edd8a63d2 HBASE-16939 ExportSnapshot: set owner and permission on right directory
Signed-off-by: Matteo Bertozzi <matteo.bertozzi@cloudera.com>
2016-10-25 14:11:59 -07:00
Guanghao Zhang ae502a9d5c HBASE-16910 Avoid NPE when starting StochasticLoadBalancer
Signed-off-by: Ashish Singhi <ashishsinghi@apache.org>
2016-10-25 11:58:41 +05:30
binlijin 16823ff55e HBASE-16931 Setting cell's seqId to zero in compaction flow might cause RS down.
Signed-off-by: Yu Li <liyu@apache.org>
2016-10-24 23:33:34 +08:00
Guanghao Zhang d76cc4c1f0 HBASE-16870 Add the metrics of replication sources which were transformed from other dead rs to ReplicationLoad
Signed-off-by: zhangduo <zhangduo@apache.org>
2016-10-22 14:27:01 +08:00
Guanghao Zhang e395bf91db HBASE-16815 Low scan ratio in RPC queue tuning triggers divide by zero exception
Signed-off-by: Matteo Bertozzi <matteo.bertozzi@cloudera.com>
2016-10-21 17:00:24 -07:00
Ashu Pachauri 0117ed9d78 HBASE-16752 addendum. Do not retry large request for client versions less than 1.3
Signed-off-by: Gary Helmling <garyh@apache.org>
2016-10-20 10:34:46 -07:00
Yu Li a7a4e17f1d HBASE-16698 Performance issue: handlers stuck waiting for CountDownLatch inside WALKey#getWriteEntry under high writing workload 2016-10-20 15:32:59 +08:00
Ashu Pachauri 33e89fa9cf HBASE-16752 Return error back to rpc client on exceeding rpc request size limit.
Signed-off-by: Gary Helmling <garyh@apache.org>
2016-10-19 15:50:42 -07:00
Huaxiang Sun a97aef5163 HBASE-16345 RpcRetryingCallerWithReadReplicas#call() should catch some RegionServer Exceptions
Fix logic for
1). how to handle exception while waiting for reply from the primary replica.
2). handle exception from replicas while waiting for a correct response.

Signed-off-by: Esteban Gutierrez <esteban@apache.org>
2016-10-19 14:22:42 -07:00
Gary Helmling ae151334ba HBASE-16754 All WALSplitter OutputSinks should process compaction events 2016-10-19 10:37:35 -07:00
Enis Soztutar 019c7f9303 HBASE-16824 Writer.flush() can be called on already closed streams in WAL roll 2016-10-18 19:14:20 -07:00
Guanghao Zhang 66941910bd HBASE-16653 Backport HBASE-11393 to branches which support namespace
Signed-off-by: chenheng <chenheng@apache.org>
2016-10-18 09:12:47 +08:00
tedyu 08498c6848 HBASE-16856 Exception message in SyncRunner.run() should print currentSequence (Allan Yang) 2016-10-17 07:23:02 -07:00
tedyu a4cbdede31 HBASE-16853 Regions are assigned to Region Servers in /hbase/draining after HBase Master failover (David Pope) 2016-10-16 18:54:59 -07:00
Pankaj Kumar b7f283c6f6 HBASE-16724 Snapshot owner can't clone
Signed-off-by: Ashish Singhi <ashishsinghi@apache.org>
2016-10-15 11:57:00 +05:30
tedyu 57d3e9e756 HBASE-16816 HMaster.move() should throw exception if region to move is not online (Allan Yang) 2016-10-13 11:11:11 -07:00
tedyu ca581874b9 HBASE-16810 HBase Balancer throws ArrayIndexOutOfBoundsException when regionservers are in /hbase/draining znode and unloaded (David Pope) 2016-10-13 08:42:49 -07:00
Pankaj Kumar 27398ea525 HBASE-16807, RegionServer will fail to report new active Hmaster until HMaster/RegionServer failover.
Signed-off-by: chenheng <chenheng@apache.org>
2016-10-13 22:17:28 +08:00
Phil Yang 8f9fadf021 HBASE-16664 Timeout logic in AsyncProcess is broken
Signed-off-by: chenheng <chenheng@apache.org>
2016-10-13 17:14:52 +08:00
tedyu e2278f9544 HBASE-16716 OfflineMetaRepair leaves empty directory inside /hbase/WALs which remains forever (Pankaj Kumar) 2016-10-12 19:42:18 -07:00
Phil Yang a3c29da0b6 HBASE-16505 Pass deadline to HRegion operations
Signed-off-by: Michael Stack <stack@apache.org>
2016-10-12 12:17:38 -07:00
tedyu 3830890635 HBASE-16801 The Append/Increment may return the data from future (ChiaPing Tsai) 2016-10-12 10:07:37 -07:00
tedyu bf03827196 HBASE-15109 HM/RS failed to start when "fs.hdfs.impl.disable.cache" is set to true (Pankaj Kumar) 2016-10-12 07:35:50 -07:00
Pankaj Kumar 6db4ef8479 HBASE-16663 JMX ConnectorServer stopped when unauthorized user try to stop HM/RS/cluster
Signed-off-by: Andrew Purtell <apurtell@apache.org>

Conflicts:

	hbase-server/src/main/java/org/apache/hadoop/hbase/master/MasterCoprocessorHost.java
2016-10-11 16:17:07 -07:00
Gary Helmling 4f29c23038 HBASE-16146 Remove thread local usage in Counter 2016-10-11 14:47:02 -07:00
tedyu b47ded3b42 HBASE-16803 Make hbase:acl table unsplittable - revert pending review comments 2016-10-11 12:51:58 -07:00
tedyu 408a9eb8a3 HBASE-16803 Make hbase:acl table unsplittable 2016-10-11 12:50:05 -07:00
Gary Helmling f13a21696f HBASE-16788 addendum Account for HStore archiveLock in heap size 2016-10-11 11:46:12 -07:00
Gary Helmling 89bef67d0c HBASE-16788 Guard HFile archiving under a separate lock 2016-10-10 16:06:55 -07:00
Dustin Pho 59ca4dad70 HBASE-16661 Add last major compaction age to per-region metrics
Signed-off-by: Gary Helmling <garyh@apache.org>
2016-10-10 15:21:53 -07:00
Huaxiang Sun 66038b8c1a HBASE-16699 Overflows in AverageIntervalRateLimiter's refill() and getWaitInterval()
Signed-off-by: Matteo Bertozzi <matteo.bertozzi@cloudera.com>
2016-10-10 14:12:22 -07:00
Sean Busbey acb1392b15 HBASE-16701 rely on test category timeout instead of defining one on a specific test.
Signed-off-by: Umesh Agashe <uagashe@cloudera.com>
Signed-off-by: Yu Li <liyu@apache.org>
2016-10-10 00:24:24 -05:00
tedyu 364a57a950 HBASE-16771 VerifyReplication should increase GOODROWS counter if re-comparison passes 2016-10-09 20:51:23 -07:00
tedyu 4b75614a2e HBASE-16773 AccessController should access local region if possible 2016-10-08 21:17:15 -07:00
tedyu e043d450ed HBASE-16768 Inconsistent results from the Append/Increment (ChiaPing Tsai) 2016-10-08 13:12:21 -07:00
Gary Helmling bbaa0e851d HBASE-16657 addendum handle null RegionLoad 2016-10-07 14:05:30 -07:00
Dustin Pho fa98c506d0 HBASE-16657 Expose per-region last major compaction time in RegionServer UI
Signed-off-by: Gary Helmling <garyh@apache.org>
2016-10-07 14:04:22 -07:00
Ashu Pachauri 7092dc3ec4 HBASE-16681: Flaky TestReplicationSourceManagerZkImpl
Change-Id: I6bf31eb2f3815079d346963ad78045f67e0f44b7
2016-10-06 16:58:08 -07:00
tedyu a8fe9ed64f HBASE-16772 Add verbose option to VerifyReplication for logging good rows 2016-10-06 11:19:58 -07:00
Mikhail Antonov afd3bc856e HBASE-16644 Errors when reading legit HFile Trailer of old (v2.0) format file 2016-10-04 17:52:08 -07:00
tedyu 252d7fae0a HBASE-16739 Timed out exception message should include encoded region name 2016-10-03 06:44:58 -07:00
tedyu a4426a7c2d HBASE-16739 Timed out exception message should include encoded region name - Revert due to misspelled word in subject 2016-10-03 06:44:11 -07:00
tedyu 69a3fd2ebf HBASE-16739) imed out exception message should include encoded region name 2016-10-02 21:10:20 -07:00
Enis Soztutar 06cc123849 HBASE-16721 Concurrency issue in WAL unflushed seqId tracking - ADDENDUM to change WAL to extend Closeable 2016-10-01 14:50:54 -07:00
Enis Soztutar 911f9b9eb7 HBASE-16678 MapReduce jobs do not update counters from ScanMetrics
Conflicts:
	hbase-server/src/test/java/org/apache/hadoop/hbase/mapreduce/TestTableMapReduce.java
2016-09-29 17:14:55 -07:00
Enis Soztutar bf5a7aba5c HBASE-16721 Concurrency issue in WAL unflushed seqId tracking 2016-09-29 14:51:14 -07:00
Sean Busbey df25ebf84f HBASE-15984 Handle premature EOF treatment of WALs in replication.
In some particular deployments, the Replication code believes it has
reached EOF for a WAL prior to succesfully parsing all bytes known to
exist in a cleanly closed file.

Consistently this failure happens due to an InvalidProtobufException
after some number of seeks during our attempts to tail the in-progress
RegionServer WAL. As a work-around, this patch treats cleanly closed
files differently than other execution paths. If an EOF is detected due
to parsing or other errors while there are still unparsed bytes before
the end-of-file trailer, we now reset the WAL to the very beginning and
attempt a clean read-through.

In current testing, a single such reset is sufficient to work around
observed dataloss. However, the above change will retry a given WAL file
indefinitely. On each such attempt, a log message like the below will
be emitted at the WARN level:

  Processing end of WAL file '{}'. At position {}, which is too far away
  from reported file length {}. Restarting WAL reading (see HBASE-15983
  for details).

Additionally, this patch adds some additional log detail at the TRACE
level about file offsets seen while handling recoverable errors. It also
add metrics that measure the use of this recovery mechanism.
2016-09-29 10:47:57 -05:00
tedyu df57859258 HBASE-16725 Don't let flushThread hang in TestHRegion 2016-09-29 06:57:02 -07:00
tedyu d4b5645a4e HBASE-16723 RMI registry is not destroyed after stopping JMX Connector Server (Pankaj Kumar) 2016-09-29 01:58:22 -07:00
Abhishek Singh Chouhan 96a8e8dce4 HBASE-16660 ArrayIndexOutOfBounds during the majorCompactionCheck in DateTieredCompaction
Signed-off-by: Andrew Purtell <apurtell@apache.org>
2016-09-27 11:08:00 -07:00
Andrew Purtell 67a43c3059 HBASE-16694 Reduce garbage for onDiskChecksum in HFileBlock (binlijin) 2016-09-26 13:55:28 -07:00
Matteo Bertozzi 4566e4df58 HBASE-16649 Truncate table with splits preserved can cause both data loss and truncated data appeared again 2016-09-26 13:09:06 -07:00
chenheng d6f9eab4be HBASE-16677 Add table size (total store file size) to table page (Guang Yang) 2016-09-25 10:50:03 +08:00
Apekshit Sharma 9a510e9ba2 HBASE-16679 Flush throughput controller: Minor perf change and fix flaky TestFlushWithThroughputController.
Change-Id: I79f4a65a67bfdc46868d68ff8a0c32f3a02b8cb1
2016-09-23 11:47:12 -07:00
tedyu b503843bae HBASE-16423 Add re-compare option to VerifyReplication to avoid occasional inconsistent rows (Jianwei Cui) 2016-09-23 05:08:19 -07:00
Enis Soztutar 8a797e81b8 HBASE-16604 Scanner retries on IOException can cause the scans to miss data 2016-09-22 18:48:06 -07:00
Ben Lau 6973304026 HBASE-16662 Fix open POODLE vulnerabilities
Signed-off-by: Andrew Purtell <apurtell@apache.org>
2016-09-22 19:59:08 +00:00
tedyu ba650a8a10 HBASE-16675 Average region size may be incorrect when there is region whose RegionLoad cannot be retrieved 2016-09-22 07:34:29 -07:00
Yu Li 76da21fc59 HBASE-16670 Make RpcServer#processRequest logic more robust 2016-09-22 15:44:45 +08:00
Jonathan M Hsieh 13d6acbc7f HBASE-12088 Remove unused hadoop-1.0, hadoop-1.1 profiles from non-root poms 2016-09-21 20:52:19 -07:00
Apekshit Sharma e7e660d5b2 HBASE-14734 Prevent BindException when setting up MiniKdc.
Port for kdc service gets selected in the constructor, but we bind to it later in MiniKdc.start()-->MiniKdc.initKDCServer() --> KdcServer.start(). In meantime, some other service can capture the port which results in BindException. The solution here is to catch the exception and retry.

Testing methodology:
- Used python and intellij.
- breakpoint on kdc.start(1), in catch block(2) and just after catch block(3).
- used python to bind to the selected port on breakpoint 1 --> run the program --> stops at breakpoint 2 (catch block)
- On breakpoint 1 and after 2 failures, close the port --> run the program --> skips catch block and goes to breakpoint 3.

Change-Id: I4e06e69819d1ec9a0a7fa471bf017f3a72c75cb3
2016-09-21 16:44:14 -07:00
Umesh Agashe 23c5ea39bd HBASE-16294 hbck reporting "No HDFS region dir found" for replicas
Signed-off-by: Matteo Bertozzi <matteo.bertozzi@cloudera.com>
2016-09-21 14:43:15 -07:00
Jerry He c80d671a06 HBASE-12949 Scanner can be stuck in infinite loop if the HFile is corrupted 2016-09-21 13:31:18 -07:00
Jerry He abfb20206d HBASE-16647 hbck should do offline reference repair before online repair 2016-09-20 11:53:01 -07:00
zhangduo 91d03697d9 HBASE-16165 Decrease RpcServer.callQueueSize before writeResponse causes OOM (Guanghao Zhang) 2016-09-18 10:05:48 +08:00
tedyu 591cc4cfb8 HBASE-16349 TestClusterId may hang during cluster shutdown 2016-09-16 06:57:42 -07:00
Phil Yang 069d1f73fa HBASE-16388 Prevent client threads being blocked by only one slow region server
Signed-off-by: stack <stack@apache.org>
2016-09-14 09:28:20 -07:00