hbase

Commit Graph

Author	SHA1	Message	Date
Andrew Purtell	2d591ab3c4	HBASE-25869 WAL value compression (#3244 ) WAL storage can be expensive, especially if the cell values represented in the edits are large, consisting of blobs or significant lengths of text. Such WALs might need to be kept around for a fairly long time to satisfy replication constraints on a space limited (or space-contended) filesystem. We have a custom dictionary compression scheme for cell metadata that is engaged when WAL compression is enabled in site configuration. This is fine for that application, where we can expect the universe of values and their lengths in the custom dictionaries to be constrained. For arbitrary cell values it is better to use one of the available compression codecs, which are suitable for arbitrary albeit compressible data. Signed-off-by: Bharath Vissapragada <bharathv@apache.org> Signed-off-by: Duo Zhang <zhangduo@apache.org> Signed-off-by: Nick Dimiduk <ndimiduk@apache.org>	2021-05-21 11:53:36 -07:00
Rushabh Shah	7b374b7c76	HBASE-25827 Per Cell TTL tags get duplicated with increments causing tags length overflow (#3290 )(#3210 ) Signed-off-by: Viraj Jasani <vjasani@apache.org>	2021-05-21 22:58:32 +05:30
Baiqiang Zhao	cd06870f07	HBASE-25682 Add a new command to update the configuration of all RSs in a RSGroup for branch-2 (#3106 ) * HBASE-25682 Add a new command to update the configuration of all RSs in a RSGroup Signed-off-by: Pankaj Kumar<pankajkumar@apache.org>	2021-05-21 22:49:48 +05:30
Duo Zhang	d78c164429	HBASE-25897 TestRetainAssignmentOnRestart is flaky after HBASE-25032 (#3281 ) Signed-off-by: Xiaolin Ha <haxiaolin@apache.org>	2021-05-20 21:18:09 +08:00
GeorryHuang	3040be1438	HBASE-25791 UI of master-status to show a recent history of that why balancer was rejected to run (#3278 ) Signed-off-by: Duo Zhang <zhangduo@apache.org>	2021-05-20 15:00:00 +08:00
Nick Dimiduk	d19e0587fa	HBASE-25841 Add basic jshell support Signed-off-by: Bharath Vissapragada <bharathv@apache.org> Signed-off-by: Pankaj Kumar <pankajkumar@apache.org> Signed-off-by: Duo Zhang <zhangduo@apache.org>	2021-05-19 12:02:30 -07:00
Nick Dimiduk	aab6e1d0ff	HBASE-25861 Correct the usage of Configuration#addDeprecation (#3249 ) Co-authored-by: Baiqiang Zhao <zbq.dean@gmail.com> Signed-off-by: Nick Dimiduk <ndimiduk@apache.org>	2021-05-18 06:53:31 -07:00
Duo Zhang	8110b18ab0	HBASE-25032 Do not assign regions to region server which has not called regionServerReport yet (#3268 ) Signed-off-by: Bharath Vissapragada <bharathv@apache.org> Signed-off-by: Michael Stack <stack@apache.org>	2021-05-18 10:43:01 +08:00
Pankaj	1a8d3c3b6b	HBASE-25875 RegionServer failed to start with IllegalThreadStateException due to race condition in AuthenticationTokenSecretManager (#3250 ) * HBASE-25875 RegionServer failed to start with IllegalThreadStateException due to race condition in AuthenticationTokenSecretManager's start & retrievePassword method Signed-off-by: stack <stack@apache.com> (cherry picked from commit `2126ec94f0`)	2021-05-17 12:21:47 +05:30
Duo Zhang	439fd4aff2	HBASE-25872 Add documentation for LoadBalancer about synchronization (#3267 ) Signed-off-by: Xin Sun <ddupgs@gmail.com>	2021-05-17 10:20:48 +08:00
Duo Zhang	d5d972f92e	HBASE-25883 The regionFinder and rackManager fields in BaseLoadBalancer should be volatile (#3262 ) Signed-off-by: meiyi <myimeiyi@gmail.com>	2021-05-17 09:54:40 +08:00
Viraj Jasani	1bb16fea5b	HBASE-25884 Return empty records for disabled balancer in-memory queue (#3263 ) Signed-off-by: stack <stack@apache.org> Signed-off-by: Duo Zhang <zhangduo@apache.org> Signed-off-by: Aman Poonia <apoonia@salesforce.com>	2021-05-14 12:57:24 +05:30
Duo Zhang	72d7641fb5	HBASE-25852 Move all the intialization work of LoadBalancer implementation to initialize method (#3248 ) (#3258 ) Signed-off-by: Michael Stack <stack@apache.org> Signed-off-by: Yulin Niu <niuyulin@apache.org>	2021-05-13 14:19:03 +08:00
Michael Stack	075545731a	HBASE-25876 Add retry if we fail to read all bytes of the protobuf magic marker (#3251 ) Signed-off-by: Duo Zhang <zhangduo@apache.org> Signed-off-by Anoop Sam John <anoopsamjohn@apache.org>	2021-05-12 11:20:50 -07:00
Michael Stack	61d9b46aab	HBASE-25867 Extra doc around ITBLL (#3242 ) * HBASE-25867 Extra doc around ITBLL Minor edits to a few log messages. Explain how the '-c' option works when passed to ChaosMonkeyRunner. Some added notes on ITBLL. Fix whacky 'R' and 'Not r' thing in Master (shows when you run ITBLL). In HRS, report hostname and port when it checks in (was debugging issue where Master and HRS had different notions of its hostname). Spare a dirty FNFException on startup if base dir not yet in place. * Address Review by Sean Signed-off-by: Sean Busbey <busbey@apache.org>	2021-05-11 19:24:33 +01:00
Catalin Luca	4e507ccda0	HBASE-25859 Reference class incorrectly parses the protobuf magic marker (#3236 ) Co-authored-by: Catalin Luca <luca@adobe.com> Signed-off-by: stack <stack@apache.org>	2021-05-10 14:47:18 -05:00
GeorryHuang	63d49cb7ed	HBASE-25790 NamedQueue 'BalancerRejection' for recent history of balancer skipping (#3182 ) (#3245 ) Signed-off-by: Viraj Jasani <vjasani@apache.org>	2021-05-09 00:09:23 +05:30
Rushabh Shah	90dc150b1b	HBASE-25860 Add metric for successful wal roll requests. (#3238 ) Signed-off-by: Viraj Jasani <vjasani@apache.org>	2021-05-08 13:02:52 +05:30
Andrew Purtell	d60aa0a865	HBASE-25774 ServerManager.getOnlineServer may miss some region servers when refreshing state in some procedure implementations Revert "HBASE-25032 Wait for region server to become online before adding it to online servers in Master (#2771)" This reverts commit `3bb978894d`.	2021-05-07 17:55:20 -07:00
niuyulin	5e65da64e2	HBASE-25837 TestRollingRestart is flaky (#3220 ) Signed-off-by: Duo Zhang <zhangduo@apache.org>	2021-05-07 20:28:37 +08:00
xiaozhang0319	bec79d877f	HBASE-25850 Fix spotbugs warnings on branch-2 (#3239 ) Signed-off-by: Duo Zhang <zhangduo@apache.org>	2021-05-07 20:21:53 +08:00
Andrew Purtell	2938ba421b	HBASE-25854 Remove redundant AM in-memory state changes in CatalogJanitor (#3234 ) In CatalogJanitor we schedule GCRegionProcedure to clean up both filesystem and in-memory state after a split, and GCMultipleMergedRegionsProcedure to do the same for merges. Both of these procedures clean up in-memory state, but CatalogJanitor also does this redundantly just after scheduling the procedures. The cleanup should be done in only one place. Presumably we are using the procedures to do it in a principled way. Remove the redundancy in CatalogJanitor and fix any follow on issues, like test failures. Signed-off-by: Duo Zhang <zhangduo@apache.org> Signed-off-by: Michael Stack <stack@apache.org> Signed-off-by: Viraj Jasani <vjasani@apache.org>	2021-05-06 09:17:29 -07:00
Duo Zhang	2d4904ceea	HBASE-25851 Make LoadBalancer not extend Configurable interface (#3233 ) (#3235 ) Signed-off-by: Yulin Niu <niuyulin@apache.org>	2021-05-06 23:39:43 +08:00
Andrew Purtell	b2c58331f3	HBASE-25847 More DEBUG and TRACE level logging in CatalogJanitor and HbckChore (#3230 ) Signed-off-by: Bharath Vissapragada <bharathv@apache.org> Signed-off-by: Michael Stack <stack@apache.org>	2021-05-05 17:02:12 -07:00
Nick Dimiduk	6f2842d1af	HBASE-25843 move master http-related code into o.a.h.h.master.http Signed-off-by: Duo Zhang <zhangduo@apache.org>	2021-05-05 08:36:40 -07:00
Duo Zhang	afa9836b87	HBASE-25834 Remove balanceTable method from LoadBalancer interface (#3217 ) Signed-off-by: Yulin Niu <niuyulin@apache.org>	2021-05-05 16:03:51 +08:00
Nick Dimiduk	3f8d3ba6d9	HBASE-25842 move regionserver http-related code into o.a.h.h.regionserver.http Signed-off-by: Duo Zhang <zhangduo@apache.org>	2021-05-04 15:46:53 -07:00
Andrew Purtell	36994a2c5e	HBASE-25835 Ignore duplicate split requests from regionserver reports (#3218 ) Processing of the RS report happens asynchronously from other activities which can mutate region state. For example, a split procedure may already be running. A split procedure cannot succeed if the parent region is no longer open, so we can ignore it in that case. Note that submitting more than one split procedure for a given region is harmless -- the split is fenced in the procedure handling -- but it would be noisy in the logs. Only one procedure can succeed. The other procedure(s) would abort during initialization and report failure with WARN level logging. Signed-off-by: Bharath Vissapragada <bharathv@apache.org> Signed-off-by: Viraj Jasani <vjasani@apache.org> Signed-off-by: Pankaj <pankajkumar@apache.org>	2021-05-04 10:22:57 -07:00
Andrew Purtell	c8f3dd56b7	HBASE-25836 RegionStates#getAssignmentsForBalancer should only care about OPEN or OPENING regions (#3219 ) RegionStates#getAssignmentsForBalancer is used by the HMaster to collect all regions of interest to the balancer for the next chore iteration. We check if a table is in disabled state to exclude regions that will not be of interest (because disabled regions are or will be offline) or are in a state where they shouldn't be mutated (like SPLITTING). The current checks are not actually comprehensive. Filter out regions not in OPEN or OPENING state when building the set of interesting regions for the balancer to consider. Only regions open (or opening) on the cluster are of interest to balancing calculations for the current iteration. Regions in all other states can be expected to not be of interest – either offline (OFFLINE, or FAILED_*), not subject to balancer decisions now (SPLITTING, SPLITTING_NEW, MERGING, MERGING_NEW), or will be offline shortly (CLOSING) – until at least the next chore iteration. Add TRACE level logging. Signed-off-by: Bharath Vissapragada <bharathv@apache.org> Signed-off-by: Duo Zhang <zhangduo@apache.org> Signed-off-by: Viraj Jasani <vjasani@apache.org>	2021-05-03 18:24:13 -07:00
Andrew Purtell	60055f5554	HBASE-25840 CatalogJanitor warns about skipping gc of regions during RIT, but does not actually skip (#3223 ) We claim in a WARN level log line to be "Playing-it-safe skipping merge/ split gc'ing of regions from hbase:meta while regions-in-transition (RIT)" but do not actually skip because of a missing return. Remove the warning. Signed-off-by: Duo Zhang <zhangduo@apache.org>	2021-05-03 18:17:41 -07:00
Duo Zhang	e725ef540e	HBASE-25838 Use double instead of Double in StochasticLoadBalancer (#3221 ) Signed-off-by: Yulin Niu <niuyulin@apache.org>	2021-05-04 09:11:28 +08:00
Andrew Purtell	bf43006b9d	HBASE-25824 IntegrationTestLoadCommonCrawl (#3208 ) This integration test loads successful resource retrieval records from the Common Crawl (https://commoncrawl.org/) public dataset into an HBase table and writes records that can be used to later verify the presence and integrity of those records. Run like: ./bin/hbase org.apache.hadoop.hbase.test.IntegrationTestLoadCommonCrawl \ -Dfs.s3n.awsAccessKeyId=<AWS access key> \ -Dfs.s3n.awsSecretAccessKey=<AWS secret key> \ /path/to/test-CC-MAIN-2021-10-warc.paths.gz \ /path/to/tmp/warc-loader-output Access to the Common Crawl dataset in S3 is made available to anyone by Amazon AWS, but Hadoop's S3N filesystem still requires valid access credentials to initialize. The input path can either specify a directory or a file. The file may optionally be compressed with gzip. If a directory, the loader expects the directory to contain one or more WARC files from the Common Crawl dataset. If a file, the loader expects a list of Hadoop S3N URIs which point to S3 locations for one or more WARC files from the Common Crawl dataset, one URI per line. Lines should be terminated with the UNIX line terminator. Included in hbase-it/src/test/resources/CC-MAIN-2021-10-warc.paths.gz is a list of all WARC files comprising the Q1 2021 crawl archive. There are 64,000 WARC files in this data set, each containing ~1GB of gzipped data. The WARC files contain several record types, such as metadata, request, and response, but we only load the response record types. If the HBase table schema does not specify compression (by default) there is roughly a 10x expansion. Loading the full crawl archive results in a table approximately 640 TB in size. The hadoop-aws jar will be needed at runtime to instantiate the S3N filesystem. Use the -files ToolRunner argument to add it. You can also split the Loader and Verify stages: Load with: ./bin/hbase 'org.apache.hadoop.hbase.test.IntegrationTestLoadCommonCrawl$Loader' \ -files /path/to/hadoop-aws.jar \ -Dfs.s3n.awsAccessKeyId=<AWS access key> \ -Dfs.s3n.awsSecretAccessKey=<AWS secret key> \ /path/to/test-CC-MAIN-2021-10-warc.paths.gz \ /path/to/tmp/warc-loader-output Verify with: ./bin/hbase 'org.apache.hadoop.hbase.test.IntegrationTestLoadCommonCrawl$Verify' \ /path/to/tmp/warc-loader-output Signed-off-by: Michael Stack <stack@apache.org> Conflicts: pom.xml	2021-05-03 18:01:43 -07:00
Duo Zhang	3ebcfda3c7	HBASE-25825 RSGroupBasedLoadBalancer.onConfigurationChange should chain the request to internal balancer (#3209 ) Signed-off-by: Yulin Niu <niuyulin@apache.org>	2021-04-30 22:57:18 +08:00
Duo Zhang	c52c091609	HBASE-25819 Fix style issues for StochasticLoadBalancer (#3207 ) Signed-off-by: Yulin Niu <niuyulin@apache.org>	2021-04-29 11:26:58 +08:00
Nick Dimiduk	3a01941963	HBASE-25779 HRegionServer#compactSplitThread should be private Minor refactor. Make the `compactSplitThread` member field of `HRegionServer` private, and gate all access through the getter method. Signed-off-by: Yulin Niu <niuyulin@apache.org> Signed-off-by: Pankaj Kumar <pankajkumar@apache.org>	2021-04-28 16:54:44 -07:00
stack	04c30bda88	HBASE-25792 Filter out o.a.hadoop.thirdparty building shaded jars (#3184 ) Need to add to allowed-licenses list too.... Signed-off-by: Wei-Chiu Chuang <weichiu@apache.org> Reviewed-by: Duo Zhang <zhangduo@apache.org> Reviewed-by: Nick Dimiduk <ndimiduk@apache.org>	2021-04-27 08:41:15 -07:00
Andrew Purtell	01ce44abc4	HBASE-25756 Support alternate compression for major and minor compactions (#3142 ) Signed-off-by: Duo Zhang <zhangduo@apache.org> Conflicts: hbase-client/src/main/java/org/apache/hadoop/hbase/HColumnDescriptor.java hbase-server/src/main/java/org/apache/hadoop/hbase/mob/DefaultMobStoreCompactor.java hbase-shell/src/main/ruby/hbase/admin.rb	2021-04-23 16:02:41 -07:00
Duo Zhang	f374357cc3	HBASE-25802 Miscellaneous style improvements for load balancer related classes (#3192 ) Signed-off-by: Yulin Niu <niuyulin@apache.org>	2021-04-23 16:32:10 +08:00
haxiaolin	fd7b2c6eec	HBASE-25754 StripeCompactionPolicy should support compacting cold regions (#3152 ) Signed-off-by: Duo Zhang <zhangduo@apache.org>	2021-04-23 15:31:55 +08:00
Duo Zhang	a02ce95ff1	HBASE-25793 Move BaseLoadBalancer.Cluster to a separated file (#3185 ) (#3190 ) Signed-off-by: Yulin Niu <niuyulin@apache.org>	2021-04-23 00:08:16 +08:00
Toshihiro Suzuki	f4f84302fa	HBASE-25766 Introduce RegionSplitRestriction that restricts the pattern of the split point Signed-off-by: Duo Zhang <zhangduo@apache.org> Signed-off-by: Michael Stack <stack@apache.org>	2021-04-22 14:06:28 +09:00
Baiqiang Zhao	d575c11259	HBASE-25798 typo in MetricsAssertHelper (#3187 ) Signed-off-by: Duo Zhang <zhangduo@apache.org>	2021-04-21 21:41:31 +08:00
haxiaolin	4b916ea0eb	HBASE-25763 TestRSGroupsWithACL.setupBeforeClass is flaky (#3158 ) Signed-off-by: Duo Zhang <zhangduo@apache.org> Signed-off-by: Yulin Niu <niuyulin@apache.org>	2021-04-21 14:46:12 +08:00
niuyulin	9cf8a48d20	HBASE-25777 Fix wrong initialization value in StressAssignmentManagerMonkeyFactory (#3164 ) Signed-off-by: meiyi <myimeiyi@gmail.com>	2021-04-19 17:52:53 +08:00
Duo Zhang	a177fb4c62	HBASE-25775 Use a special balancer to deal with maintenance mode (#3161 ) Signed-off-by: Wellington Chevreuil <wchevreuil@apache.org>	2021-04-16 13:39:48 +08:00
Nick Dimiduk	32207cd478	HBASE-25770 Http InfoServers should honor gzip encoding when requested (#3159 ) Signed-off-by: Duo Zhang <zhangduo@apache.org> Signed-off-by: Josh Elser <elserj@apache.org>	2021-04-15 13:46:53 -07:00
xiaoyu	819062bac3	HBASE-25776 Use Class.asSubclass to fix the warning in StochasticLoadBalancer.loadCustomCostFunctions (#3163 ) Signed-off-by: Duo Zhang <zhangduo@apache.org> Signed-off-by: Viraj Jasani <vjasani@apache.org>	2021-04-15 23:35:33 +05:30
DivyeshChandra	35520a72fb	HBASE-25653 Add units and round off region size to 2 digits after decimal (#3046 ) Signed-off-by: stack <stack@duboce.net> Reviewed-by: Viraj Jasani <vjasani@apache.org>	2021-04-13 09:06:05 -07:00
ZhiChen	46a106014e	HBASE-25762 Improvement for some debug-logging guards (#3145 ) Signed-off-by: Duo Zhang <zhangduo@apache.org>	2021-04-13 23:25:03 +08:00
Duo Zhang	15659f0e85	HBASE-25767 CandidateGenerator.getRandomIterationOrder is too slow on large cluster (#3149 ) Signed-off-by: XinSun <ddupgs@gmail.com> Signed-off-by: Yulin Niu <niuyulin@apache.org>	2021-04-13 23:14:48 +08:00

... 2 3 4 5 6 ...

17837 Commits All Branches Search

17837 Commits

All Branches