Commit Graph

6983 Commits

Author SHA1 Message Date
Mingliang Liu fea75742b4
HBASE-21164 reportForDuty should do backoff rather than retry
Remove unused methods from Sleeper (its ok, its @Private).
Remove notion of startTime from Sleeper handling (it is is unused).
Allow passing in how long to sleep so can maintain externally.
In HRS, use a RetryCounter to calculate backoff sleep time for when
reportForDuty is failing against a struggling Master.
2018-09-25 11:31:39 -07:00
Andrew Purtell 101205345b
Amend HBASE-20704 Sometimes some compacted storefiles are not archived on region close
Forward port small logging improvements from branch-1 version of this change.
2018-09-21 16:12:51 -07:00
Michael Stack a22aec1dad
HBASE-21214 [hbck2] setTableState just sets hbase:meta state, not in-memory state 2018-09-21 16:03:58 -07:00
openinx 5a73a1ab25 HBASE-21206 Scan with batch size may return incomplete cells 2018-09-20 22:20:02 +08:00
tianjingyun c5af7b654b HBASE-21204 NPE when scan raw DELETE_FAMILY_VERSION and codec is not set
Signed-off-by: tedyu <yuzhihong@gmail.com>
2018-09-20 06:59:43 -07:00
Umesh Agashe e6c7ed34e0
HBASE-21023 Added bypassProcedure() API to HbckService 2018-09-19 15:01:29 -07:00
Michael Stack 37cc07a772
HBASE-21156 [hbck2] Queue an assign of hbase:meta and bulk assign/unassign
Adds 'raw' assigns and unassigns methods to Hbck Service.

Fixes HbckService so it works when cluster is Kerberized.
2018-09-19 09:02:43 -07:00
Vasudevan 27b772ddc6 HBASE-21102 ServerCrashProcedure should select target server where no
other replicas exist for the current region (Ram)
2018-09-17 22:36:50 +05:30
Michael Stack 39e0b8515f HBASE-21191 Add a holding-pattern if no assign for meta or namespace (Can happen if masterprocwals have been cleared).
Add a check for hbase:meta being online before we go to read it.
If not online, move into a holding-pattern until rectified, probably
by external operator.

Incorporates bulk of patch made by Allan Yang over on HBASE-21035.

M hbase-common/src/main/java/org/apache/hadoop/hbase/util/RetryCounterFactory.java

 Add a Constructor for case where retries are for ever.

M hbase-server/src/main/java/org/apache/hadoop/hbase/master/HMaster.java
 Move stuff around so that the first hbase:meta read is the AM#loadMeta.
 Previously, checking table state and/or favored nodes could end up
 trying to read a meta that was not onlined holding up master startup.
 Do similar for the namespace table. Adds new methods isMeta and
 isNamespace which check that the regions/tables are online.. if not,
 we wait logging with a back-off that assigns need to be run.

Signed-off-by: Allan Yang <allan163@apache.org>
Signed-off-by: Duo Zhang <zhangduo@apache.org>
2018-09-16 21:12:59 -07:00
Francis Liu a925a4ce16 HBASE-20704 Sometimes some compacted storefiles are not archived on region close 2018-09-16 18:38:03 -07:00
Ted Yu 842e0c974d HBASE-21097 Flush pressure assertion may fail in testFlushThroughputTuning
Amending-Author: Duo Zhang <zhangduo@apache.org>
Signed-off-by: Duo Zhang <zhangduo@apache.org>
2018-09-15 18:39:42 +08:00
Umesh Agashe 589c1e4078
HBASE-20941 Created and implemented HbckService in master
Added API setTableStateInMeta() to update table state only in Meta. This will be used by hbck2 tool.
2018-09-12 21:31:13 -07:00
Mike Drob d81e806718 HBASE-21168 Insecure Randomness in BloomFilterUtil
Flagged by Fortify static analysis

Signed-off-by: Andrew Purtell <apurtell@apache.org>
Signed-off-by: Mingliang Liu <liuml07@apache.org>
2018-09-12 09:52:41 -05:00
Duo Zhang 2da6dbe563 HBASE-21172 Reimplement the retry backoff logic for ReopenTableRegionsProcedure 2018-09-12 16:01:55 +08:00
David Manning 75a7643b11 Backport "HBASE-21126 Add ability for HBase Canary to ignore a configurable number of ZooKeeper down nodes" to branch-2.1
Signed-off-by: Josh Elser <elserj@apache.org>
Signed-off-by: Duo Zhang <zhangduo@apache.org>
2018-09-12 10:01:28 +08:00
krish.dey 63ef89bff7 HBASE-21125 Backport 'HBASE-20942 Improve RpcServer TRACE logging' to branch-2.1
Signed-off-by: Duo Zhang <zhangduo@apache.org>
2018-09-12 09:59:28 +08:00
Duo Zhang b9d74f89ff Revert "HBASE-20942 Fix ArrayIndexOutOfBoundsException for RpcServer TRACE logging"
This reverts commit 69756da503.
2018-09-12 09:55:46 +08:00
krish.dey 69756da503 HBASE-20942 Fix ArrayIndexOutOfBoundsException for RpcServer TRACE logging
Also makes the trace log message length configurable.

Signed-off-by: Josh Elser <elserj@apache.org>
2018-09-12 09:44:22 +08:00
Guangxu Cheng 1c8c7e10f8 HBASE-21158 Empty qualifier cell is always returned when using QualifierFilter 2018-09-10 21:40:57 +08:00
Duo Zhang 6ab9997d1f HBASE-21144 AssignmentManager.waitForAssignment is not stable 2018-09-10 17:28:57 +08:00
Guangxu Cheng 12ffa086c7 HBASE-21001 ReplicationObserver fails to load in HBase 2.0.0 2018-09-07 23:43:10 +08:00
Michael Stack 5324911cd8
HBASE-21155 Save on a few log strings and some churn in wal splitter by skipping out early if no logs in dir; ADDENDUM
Address review comments.

Signed-off-by: Mike Drob <mdrob@apache.org>
2018-09-06 17:24:03 -07:00
Michael Stack 205783419c
HBASE-21155 Save on a few log strings and some churn in wal splitter by skipping out early if no logs in dir 2018-09-06 16:36:59 -07:00
Vasudevan 2051b0982d HBASE-20741 Split of a region with replicas creates all daughter regions
and its replica in same server (Ram)
2018-09-06 16:44:59 +05:30
Guangxu Cheng c64814ec96 HBASE-20892 [UI] Start / End keys are empty on table.jsp 2018-09-05 09:37:25 +08:00
Allan Yang e33591515c
HBASE-21083 Introduce a mechanism to bypass the execution of a stuck procedure 2018-08-28 20:18:47 -07:00
Michael Stack 4340930c71
HBASE-20649 Validate HFiles do not have PREFIX_TREE DataBlockEncoding; ADDEDNDUM ADD MISSING FILE 2018-08-28 07:45:27 -07:00
Balazs Meszaros 147694bb08
HBASE-20649 Validate HFiles do not have PREFIX_TREE DataBlockEncoding 2018-08-28 07:09:47 -07:00
Ted Yu c1cd6d5a89
HBASE-21088 HStoreFile should be closed in HStore#hasReferences 2018-08-27 20:31:50 -07:00
Michael Stack e826e3f2b8 HBASE-21120 MoveRegionProcedure makes no progress; goes to STUCK 2018-08-27 14:55:52 -07:00
zhangduo 625be5137e HBASE-21072 Addendum do not write lock file when running TestHBaseFsckReplication 2018-08-27 21:05:16 +08:00
Allan Yang 33fa32d711 HBASE-21113 Apply the branch-2 version of HBASE-21095, The timeout retry logic for several procedures are broken after master restarts(addendum) 2018-08-26 22:15:49 +08:00
Michael Stack d954031d50 HBASE-21078 [amv2] CODE-BUG NPE in RTP doing Unassign 2018-08-24 13:22:16 -07:00
Michael Stack e26ca63f88 Revert "Revert "HBASE-21095 The timeout retry logic for several procedures are broken after master restarts""
HBASE-21113 Apply the branch-2 version of HBASE-21095, The timeout retry
logic for several procedures are broken after master restarts

I applied the patch HBASE-21095 and then reverted it so could apply the
patch as HBASE-21113 (by reverting the HBASE-21095 revert but pushing
with this message!).

This reverts commit 4978db8102.
2018-08-24 12:35:29 -07:00
Michael Stack 4978db8102 Revert "HBASE-21095 The timeout retry logic for several procedures are broken after master restarts"
This reverts commit b82cd670c3.
2018-08-24 12:24:32 -07:00
Allan Yang b82cd670c3 HBASE-21095 The timeout retry logic for several procedures are broken after master restarts 2018-08-24 12:20:43 -07:00
Michael Stack 66add55234 HBASE-21072 Block out HBCK1 in hbase2
Write the hbase-1.x hbck1 lock file to block out hbck1 instances writing
state to an hbase-2.x cluster (could do damage).
Set hbase.write.hbck1.lock.file to false to disable this writing.
2018-08-24 09:22:53 -07:00
Duo Zhang 8a9acd4d2a HBASE-21101 Remove the waitUntilAllRegionsAssigned call after split in TestTruncateTableProcedure 2018-08-24 10:35:10 +08:00
zhangduo bf21a9dc33 HBASE-20193 Move TestCreateTableProcedure.testMRegions to a separated file 2018-08-24 10:09:31 +08:00
Duo Zhang 239d12dae8 HBASE-20194 Remove the explicit timeout config for TestTruncateTableProcedure 2018-08-23 06:27:41 +08:00
Allan Yang 16ab716134 HBASE-21041 Memstore's heap size will be decreased to minus zero after flush 2018-08-22 22:54:14 +08:00
Allan Yang c07afa8875 HBASE-21031 Memory leak if replay edits failed during region opening 2018-08-22 22:13:26 +08:00
Andrey Elenskiy 5f03be4675 HBASE-21032 ScanResponses contain only one cell each
Amending-Author: Duo Zhang <zhangduo@apache.org>
2018-08-21 13:31:18 -07:00
Andrew Purtell 798cb1d793 HBASE-20940 HStore.cansplit should not allow split to happen if it has references (Vishal Khandelwal) 2018-08-17 15:02:26 -07:00
Josh Elser 67ad0e6013 HBASE-21062 Correctly use the defaultProvider value on the Providers enum when constructing a WALProvider 2018-08-17 14:55:42 -04:00
Sakthi 48dee7e44d HBASE-20705 Having RPC quota on a table now no longer prevents Space Quota to be recreate/removed
Just added 2 test cases as the subtasks of this jira solves the issue

Signed-off-by: Josh Elser <elserj@apache.org>
2018-08-17 14:09:26 -04:00
Andrew Purtell b49941012a HBASE-21047 Object creation of StoreFileScanner thru constructor and close may leave refCount to -1 (Vishal Khandelwal) 2018-08-16 11:42:54 -07:00
Nihal Jain 145c92f3d6 HBASE-20469 Directory used for sidelining old recovered edits files should be made configurable
Signed-off-by: Andrew Purtell <apurtell@apache.org>
2018-08-15 18:08:15 -07:00
Michael Stack 2e5efa690a HBASE-20772 Controlled shutdown fills Master log with the disturbing message 'No matching procedure found for rit=OPEN, location=ZZZZ, table=YYYYY, region=XXXX transition to CLOSED'
Look for the particular case where RS does the close of region w/o
involving Master and log special message in this case. Dodgy. But
until we have Master run shutdown of all regions, better than
the message we currently show.
2018-08-13 15:59:39 -07:00
Allan Yang 161c018927 HBASE-21029 Miscount of memstore's heap/offheap size if same cell was put 2018-08-13 20:30:23 +08:00
jingyuntian 95e3dec510 HBASE-20985 add two attributes when we do normalization
Signed-off-by: Guanghao Zhang <zghao@apache.org>
2018-08-13 16:55:19 +08:00
Duo Zhang 846078f9b0 HBASE-21025 Addendum missed a 'succ = true'
Signed-off-by: Guanghao Zhang <zghao@apache.org>
2018-08-13 10:55:18 +08:00
brandboat 873d9f5082 HBASE-21012 Revert the change of serializing TimeRangeTracker
Signed-off-by: Michael Stack <stack@apache.org>
Signed-off-by: zhangduo <zhangduo@apache.org>
Signed-off-by: Chia-Ping Tsai <chia7712@gmail.com>
2018-08-11 22:28:49 +08:00
Wei-Chiu Chuang 5e12d6a98e HBASE-21018 RS crashed because AsyncFS was unable to update HDFS data encryption key 2018-08-10 19:53:22 -07:00
zhangduo ee164fcbc5 HBASE-21025 Add cache for TableStateManager 2018-08-10 21:11:53 +08:00
brandboat 8a9ba0c65b HBASE-18201 add UT and docs for DataBlockEncodingTool
Signed-off-by: Reid Chan <reidchan@apache.org>
2018-08-10 11:19:36 +08:00
meiyi e222686294 HBASE-20965 Separate region server report requests to new handlers
Signed-off-by: Guanghao Zhang <zghao@apache.org>
2018-08-09 18:27:38 +08:00
Sakthi b2fc0f48f6 HBASE-20813 Removed RPC quotas when the associated table/Namespace is dropped off
Signed-off-by: Josh Elser <elserj@apache.org>
2018-08-08 13:46:25 -04:00
jingyuntian 9d594ac86a HBASE-20986 Separate the config of block size when we do log splitting and write Hlog
Signed-off-by: Guanghao Zhang <zghao@apache.org>
2018-08-07 14:03:03 +08:00
Sakthi 7e9f8c60e2 HBASE-20885 Removed entry for RPC quota from hbase:quota when RPC quota is removed
Signed-off-by: Josh Elser <elserj@apache.org>
Signed-off-by: Mike Drob <mdrob@apache.org>
2018-08-03 11:07:01 -04:00
TAK LON WU 2e1c12ca1b HBASE-20856 PITA having to set WAL provider in two places
With this change if hbase.wal.meta_provider is not explicitly set,
it uses whatever set with hbase.wal.provider. this change avoids a use
case of unexpectedly using two different providers when only
hbase.wal.provider is set to non-default but not hbase.wal.meta_provider.

This change also include document (architecture.adoc) update

Also, this is a port from master to branch-2

Signed-off-by: Zach York <zyork@apache.org>
Signed-off-by: Michael Stack <stack@apache.org>
Signed-off-by: Sean Busbey <busbey@apache.org>
Signed-off-by: Duo Zhang <Apache9@apache.org>
2018-08-01 14:45:11 -07:00
Michael Stack 88f3148810 HBASE-20989 Minor, miscellaneous logging fixes
Signed-off-by: Zach York <zyork@amazon.com>
Signed-off-by: Mingliang Liu <liuml07@apache.org>
2018-08-01 11:20:01 -07:00
Xu Cang 9338eaee65 HBASE-20794 add INFO level log to createTable operation 2018-08-01 11:04:00 -07:00
Michael Stack 0f4e857c7a HBASE-20893 Data loss if splitting region while ServerCrashProcedure executing ADDENDUM: Rather than rollback, just do region reopens.
In split, reopen the parent if recovered.edits and in merge, reopen the
parent region or regions that happened to have recovered.edits on close.
2018-08-01 00:33:12 -07:00
Andrew Purtell daeec8657e HBASE-20935 HStore.removeCompactedFiles should log in case it is unable to delete a file (Vishal Khandelwal)
Conflicts:
	hbase-server/src/main/java/org/apache/hadoop/hbase/regionserver/HStore.java
2018-07-31 16:06:15 -07:00
zhangduo 1b66839bc4 HBASE-20538 Upgrade our hadoop versions to 2.7.7 and 3.0.3 2018-07-29 20:20:28 +08:00
zhangduo a9346982bf Revert "HBASE-20538 Upgrade our hadoop-two.version to 2.7.7 and 3.0.3"
This reverts commit 3dd83adb51.
2018-07-29 20:20:20 +08:00
zhangduo 3dd83adb51 HBASE-20538 Upgrade our hadoop-two.version to 2.7.7 and 3.0.3 2018-07-29 20:04:48 +08:00
Alex Leblang 31cbd7ab8f
HBASE-19369 Switch to Builder Pattern In WAL
This patch switches to the builder pattern by adding a helper method.
It also checks to ensure that the pattern is available (i.e. that
HBase is running on a hadoop version that supports it).

Amending-Author: Mike Drob <mdrob@apache.org>
Signed-off-by: tedyu <yuzhihong@gmail.com>
Signed-off-by: zhangduo <zhangduo@apache.org>
2018-07-27 23:43:08 -05:00
zhangduo 8bfdb19e85 HBASE-20939 There will be race when we call suspendIfNotReady and then throw ProcedureSuspendedException 2018-07-27 21:30:23 +08:00
Allan Yang 35c598db93 HBASE-20921 Possible NPE in ReopenTableRegionsProcedure 2018-07-27 09:31:12 +08:00
zhangduo 1777ea3aae HBASE-20938 Set version to 2.1.1-SNAPSHOT for branch-2.1 2018-07-25 21:45:09 +08:00
Allan Yang 3251554737 HBASE-20867 RS may get killed while master restarts 2018-07-25 18:11:28 +08:00
zhangduo 833657c46d HBASE-20846 Restore procedure locks when master restarts 2018-07-25 14:37:36 +08:00
huzheng 1dbfe92dbf HBASE-20565 ColumnRangeFilter combined with ColumnPaginationFilter can produce incorrect result 2018-07-24 10:39:36 +08:00
Allan Yang 44bf7076b7 HBASE-20878 Data loss if merging regions while ServerCrashProcedure executing 2018-07-24 09:51:46 +08:00
Allan Yang af2742fcf2 HBASE-20893 Data loss if splitting region while ServerCrashProcedure executing 2018-07-23 14:35:27 +08:00
Reid Chan 9d481f1faa HBASE-20401 Make MAX_WAIT and waitIfNotFinished in CleanerContext configurable (Contributed by Stephen Wu) 2018-07-23 10:33:26 +08:00
Michael Stack 46e5baf670 HBASE-20914 Trim Master memory usage
Add (weak reference) interning of ServerNames.

Correct Balancer regions x racks matrix.

Make smaller defaults when creating ArrayDeques.
2018-07-20 10:08:13 -07:00
Allan Yang 679698a7f2 HBASE-20870 Wrong HBase root dir in ITBLL's Search Tool 2018-07-20 11:31:21 +08:00
Yu Li 9ac26b80b2 HBASE-20907 Fix Intermittent failure on TestProcedurePriority 2018-07-19 12:01:29 +08:00
Michael Stack cecce16fad HBASE-20875 MemStoreLABImp::copyIntoCell uses 7% CPU when writing
Make the #copyCellInto method smaller so it inlines; we do it by
checking for the common type early and then taking a code path
that presumes ByteBufferExtendedCell -- avoids checks.
2018-07-18 20:40:47 -07:00
Toshihiro Suzuki e14b49080b HBASE-20865 CreateTableProcedure is stuck in retry loop in CREATE_TABLE_WRITE_FS_LAYOUT state
Signed-off-by: tedyu <yuzhihong@gmail.com>
2018-07-13 08:31:59 -07:00
Toshihiro Suzuki 881409bd0b HBASE-19572 RegionMover should use the configured default port number and not the one from HConstants
Signed-off-by: Reid Chan <reidchan@apache.org>
2018-07-13 10:46:51 +08:00
Allan Yang 368b1b1060 HBASE-20860 Merged region's RIT state may not be cleaned after master restart 2018-07-12 12:16:49 +08:00
zhangduo 8eab6d7a45 HBASE-20847 Addendum use addFront instead of addBack to add sub procedure 2018-07-12 08:31:40 +08:00
zhangduo 113652eb88 HBASE-20847 The parent procedure of RegionTransitionProcedure may not have the table lock 2018-07-11 17:37:27 +08:00
zhaoyuan 8de69db143 HBASE-20697 Can't cache All region locations of the specify table by calling table.getRegionLocator().getAllRegionLocations()
Signed-off-by: Guanghao Zhang <zghao@apache.org>
2018-07-11 11:17:41 +08:00
zhangduo 5e25bc92cf HBASE-20784 Will lose the SNAPSHOT suffix if we get the version of RS from ServerManager 2018-07-10 10:00:15 +08:00
Abhishek Singh Chouhan dddf15ae6b HBASE-20806 Split style journal for flushes and compactions 2018-07-09 12:42:20 -07:00
Balazs Meszaros da7fef6bf5
HBASE-20833 Modify pre-upgrade coprocessor validator to support table level coprocessors
- -jar parameter now accepts multiple jar files and directories of jar files.
- observer classes can be verified by -class option.
- -table parameter was added to check table level coprocessors.
- -config parameter was added to obtain the coprocessor classes from
  HBase cofiguration.
- -scan option was removed.

Signed-off-by: Mike Drob <mdrob@apache.org>
2018-07-09 14:19:12 -05:00
zhangduo 5a40606422 HBASE-20822 TestAsyncNonMetaRegionLocator is flakey 2018-07-09 14:56:45 +08:00
Nihal Jain 927ac8228f HBASE-20808 (Addendum) Remove duplicate calls for cancelling of chores
Signed-off-by: Reid Chan <reidchan@apache.org>
2018-07-07 00:21:08 +08:00
Nihal Jain 3ed9350233 HBASE-20808 Wrong shutdown order between Chores and ChoreService
Signed-off-by: Reid Chan <reidchan@apache.org>
2018-07-07 00:20:08 +08:00
zhangduo a2db3d27ff HBASE-20849 Set version as 2.1.0 in branch-2.1 in prep for first RC 2018-07-06 15:32:23 +08:00
zhangduo 159f1b4686 Revert "HBASE-20808 Wrong shutdown order between Chores and ChoreService"
For cutting 2.1.0RC0

This reverts commit ae2c858c5e.
2018-07-06 15:29:58 +08:00
Nihal Jain ae2c858c5e HBASE-20808 Wrong shutdown order between Chores and ChoreService
Signed-off-by: Reid Chan <reidchan@apache.org>
2018-07-06 11:38:17 +08:00
Yu Li d61bb64e93 HBASE-20691 Change the default WAL storage policy back to "NONE""
This reverts commit 564c193d61 and added more doc
about why we choose "NONE" as the default.
2018-07-04 13:45:54 +08:00
Guangxu Cheng 60ebdd9fd8 HBASE-20474 Show non-RPC tasks on master/regionserver Web UI by default 2018-07-04 10:54:21 +08:00
zhangduo 5dacfe9427 HBASE-20839 Fallback to FSHLog if we can not instantiated AsyncFSWAL when user does not specify AsyncFSWAL explicitly 2018-07-04 10:29:36 +08:00