hbase

Commit Graph

Author	SHA1	Message	Date
Andrew Purtell	10471944bd	HBASE-26582 Prune use of Random and SecureRandom objects (#4118 ) Avoid the pattern where a Random object is allocated, used once or twice, and then left for GC. This pattern triggers warnings from some static analysis tools because this pattern leads to poor effective randomness. In a few cases we were legitimately suffering from this issue; in others a change is still good to reduce noise in analysis results. Use ThreadLocalRandom where there is no requirement to set the seed to gain good reuse. Where useful relax use of SecureRandom to simply Random or ThreadLocalRandom, which are unlikely to block if the system entropy pool is low, if we don't need crypographically strong randomness for the use case. The exception to this is normalization of use of Bytes#random to fill byte arrays with randomness. Because Bytes#random may be used to generate key material it must be backed by SecureRandom. Signed-off-by: Duo Zhang <zhangduo@apache.org>	2022-03-08 13:49:02 -08:00
BukrosSzabolcs	4829806220	HBASE-26707: Reduce number of renames during bulkload (#4066 ) Signed-off-by: Wellington Ramos Chevreuil <wchevreuil@apache.org>	2022-02-17 19:34:48 +00:00
Nick Dimiduk	625d610bcc	HBASE-26614 Refactor code related to "dump"ing ZK nodes (#3969 ) The code starting at `ZKUtil.dump(ZKWatcher)` is a small mess – it has cyclic dependencies woven through itself, `ZKWatcher` and `RecoverableZooKeeper`. It also initializes a static variable in `ZKUtil` through the factory for `RecoverableZooKeeper` instances. Let's decouple and clean it up. Signed-off-by: Duo Zhang <zhangduo@apache.org> Signed-off-by: Josh Elser <elserj@apache.org>	2022-01-24 11:33:18 -08:00
Duo Zhang	c14a76c4fd	HBASE-26523 Upgrade hbase-thirdparty dependency to 4.0.1 (#3987 ) Signed-off-by: GeorryHuang <huangzhuoyue@apache.org>	2021-12-31 12:08:01 +08:00
Duo Zhang	3f59f21be0	HBASE-26621 Set version as 3.0.0-alpha-3-SNAPSHOT in master (#3978 ) Signed-off-by: Peter Somogyi <psomogyi@apache.org>	2021-12-24 14:20:32 +08:00
Duo Zhang	e598f2c663	Revert "HBASE-26523 Upgrade hbase-thirdparty dependency to 4.0.0 (#3910 )" Need a new 4.0.1 release This reverts commit `139f08587a`.	2021-12-17 12:25:27 +08:00
Duo Zhang	139f08587a	HBASE-26523 Upgrade hbase-thirdparty dependency to 4.0.0 (#3910 ) Signed-off-by: Xiaolin Ha <haxiaolin@apache.org>	2021-12-17 10:22:48 +08:00
Wellington Ramos Chevreuil	a36d41af73	HBASE-26556 IT and Chaos Monkey improvements (#3932 ) Signed-off-by: Josh Elser <elserj@apache.org> Reviewed-by: Tak Lon (Stephen) Wu <taklwu@apache.org>	2021-12-14 21:22:28 +00:00
Duo Zhang	8bca21b47d	HBASE-26558 Set version as 3.0.0-alpha-2 in master in prep for first RC of 3.0.0-alpha-2 (#3935 ) Signed-off-by: Geoffrey Jacoby <gjacoby@apache.org>	2021-12-11 20:52:35 +08:00
Andrew Purtell	9e73ea878d	HBASE-26349 Improve recent change to IntegrationTestLoadCommonCrawl (#3744 ) Use a hybrid logical clock for timestamping entries. Using BufferedMutator without HLC was not good because we assign client timestamps, and the store loop is fast enough that on rare occasion two temporally adjacent URLs in the set of WARCs are equivalent and the timestamp does not advance, leading later to a rare false positive CORRUPT finding. While making changes, support direct S3N paths as input paths on the command line. Signed-off-by: Viraj Jasani <vjasani@apache.org>	2021-10-19 13:45:55 -07:00
Andrew Purtell	a384c239b9	HBASE-26335 Minor improvements to IntegrationTestLoadCommonCrawl (#3731 ) HBASE-26335 Minor improvements to IntegrationTestLoadCommonCrawl - Use BufferedMutator instead of Table. - Improve row key generator. - Improve retries and log levels. Signed-off-by: Viraj Jasani <vjasani@apache.org>	2021-10-08 10:00:51 -07:00
Duo Zhang	5f0950558f	HBASE-26096 Cleanup the deprecated methods in HBTU related classes and format code (#3503 ) Signed-off-by: Xiaolin Ha <haxiaolin@apache.org> Signed-off-by: Yulin Niu <niuyulin@apache.org>	2021-07-29 10:18:38 +08:00
Duo Zhang	16721239e7	HBASE-26100 Set version as 3.0.0-alpha-2-SNAPSHOT in master (#3508 ) Signed-off-by: Yulin Niu <niuyulin@apache.org>	2021-07-20 23:04:08 +08:00
Duo Zhang	d30cc27097	HBASE-26081 Copy HBTU to hbase-testing-util, rename the HBTU related classes in hbase-server and mark them as IA.LimitedPrivate (#3478 ) Signed-off-by: Michael Stack <stack@apache.org>	2021-07-19 09:29:08 +08:00
Duo Zhang	5118321ec9	HBASE-26059 Set version as 3.0.0-alpha-1 in master in prep for first RC of 3.0.0-alpha-1 (#3453 ) Signed-off-by: Pankaj Kumar <pankajkumar@apache.org>	2021-07-02 07:50:41 +08:00
Andrew Purtell	335305e0cf	HBASE-25911 Replace calls to System.currentTimeMillis with EnvironmentEdgeManager.currentTime (#3302 ) We introduced EnvironmentEdgeManager as a way to inject alternate clocks for unit tests. In order for this to be effective, all callers that would otherwise use System.currentTimeMillis() must call EnvironmentEdgeManager.currentTime() instead, except the implementers of EnvironmentEdge. Signed-off-by: Bharath Vissapragada <bharathv@apache.org> Signed-off-by: Duo Zhang <zhangduo@apache.org> Signed-off-by: Viraj Jasani <vjasani@apache.org>	2021-06-01 09:57:48 -07:00
Michael Stack	630c73fda4	HBASE-25867 Extra doc around ITBLL (#3242 ) * HBASE-25867 Extra doc around ITBLL Minor edits to a few log messages. Explain how the '-c' option works when passed to ChaosMonkeyRunner. Some added notes on ITBLL. Fix whacky 'R' and 'Not r' thing in Master (shows when you run ITBLL). In HRS, report hostname and port when it checks in (was debugging issue where Master and HRS had different notions of its hostname). Spare a dirty FNFException on startup if base dir not yet in place. * Address Review by Sean Signed-off-by: Sean Busbey <busbey@apache.org>	2021-05-11 19:26:57 +01:00
Andrew Purtell	6ad5b9e569	HBASE-25824 IntegrationTestLoadCommonCrawl (#3208 ) * HBASE-25824 IntegrationTestLoadCommonCrawl This integration test loads successful resource retrieval records from the Common Crawl (https://commoncrawl.org/) public dataset into an HBase table and writes records that can be used to later verify the presence and integrity of those records. Run like: ./bin/hbase org.apache.hadoop.hbase.test.IntegrationTestLoadCommonCrawl \ -Dfs.s3n.awsAccessKeyId=<AWS access key> \ -Dfs.s3n.awsSecretAccessKey=<AWS secret key> \ /path/to/test-CC-MAIN-2021-10-warc.paths.gz \ /path/to/tmp/warc-loader-output Access to the Common Crawl dataset in S3 is made available to anyone by Amazon AWS, but Hadoop's S3N filesystem still requires valid access credentials to initialize. The input path can either specify a directory or a file. The file may optionally be compressed with gzip. If a directory, the loader expects the directory to contain one or more WARC files from the Common Crawl dataset. If a file, the loader expects a list of Hadoop S3N URIs which point to S3 locations for one or more WARC files from the Common Crawl dataset, one URI per line. Lines should be terminated with the UNIX line terminator. Included in hbase-it/src/test/resources/CC-MAIN-2021-10-warc.paths.gz is a list of all WARC files comprising the Q1 2021 crawl archive. There are 64,000 WARC files in this data set, each containing ~1GB of gzipped data. The WARC files contain several record types, such as metadata, request, and response, but we only load the response record types. If the HBase table schema does not specify compression (by default) there is roughly a 10x expansion. Loading the full crawl archive results in a table approximately 640 TB in size. The hadoop-aws jar will be needed at runtime to instantiate the S3N filesystem. Use the -files ToolRunner argument to add it. You can also split the Loader and Verify stages: Load with: ./bin/hbase 'org.apache.hadoop.hbase.test.IntegrationTestLoadCommonCrawl$Loader' \ -files /path/to/hadoop-aws.jar \ -Dfs.s3n.awsAccessKeyId=<AWS access key> \ -Dfs.s3n.awsSecretAccessKey=<AWS secret key> \ /path/to/test-CC-MAIN-2021-10-warc.paths.gz \ /path/to/tmp/warc-loader-output Verify with: ./bin/hbase 'org.apache.hadoop.hbase.test.IntegrationTestLoadCommonCrawl$Verify' \ /path/to/tmp/warc-loader-output Signed-off-by: Michael Stack <stack@apache.org>	2021-05-03 17:59:00 -07:00
Duo Zhang	f6ff519dd0	HBASE-25591 Upgrade opentelemetry to 0.17.1 (#2971 ) Signed-off-by: Guanghao Zhang <zghao@apache.org>	2021-04-25 09:23:23 +08:00
Duo Zhang	302d9ea8b8	HBASE-25373 Remove HTrace completely in code base and try to make use of OpenTelemetry Signed-off-by: stack <stack@apache.org>	2021-04-25 09:23:23 +08:00
niuyulin	e8ac1fbe97	HBASE-25777 Fix wrong initialization value in StressAssignmentManagerMonkeyFactory (#3164 ) Signed-off-by: meiyi <myimeiyi@gmail.com>	2021-04-19 17:46:57 +08:00
Duo Zhang	ba3610d097	HBASE-19577 Use log4j2 instead of log4j for logging (#1708 ) Signed-off-by: stack <stack@apache.org>	2021-03-20 09:21:25 +08:00
Pankaj	48d9d196dc	HBASE-25502 IntegrationTestMTTR fails with TableNotFoundException (#2879 )	2021-01-13 11:01:26 -08:00
Mate Szalay-Beko	481662ab39	HBASE-25318 Config option for IntegrationTestImportTsv where to generate HFiles to bulkload (#2777 ) IntegrationTestImportTsv is generating HFiles under the working directory of the current hdfs user executing the tool, before bulkloading it into HBase. Assuming you encrypt the HBase root directory within HDFS (using HDFS Transparent Encryption), you can bulkload HFiles only if they sit in the same encryption zone in HDFS as the HBase root directory itself. When IntegrationTestImportTsv is executed against a real distributed cluster and the working directory of the current user (e.g. /user/hbase) is not in the same encryption zone as the HBase root directory (e.g. /hbase/data) then you will get an exception: ``` ERROR org.apache.hadoop.hbase.regionserver.HRegion: There was a partial failure due to IO when attempting to load d : hdfs://mycluster/user/hbase/test-data/22d8460d-04cc-e032-88ca-2cc20a7dd01c/ IntegrationTestImportTsv/hfiles/d/74655e3f8da142cb94bc31b64f0475cc org.apache.hadoop.ipc.RemoteException(java.io.IOException): /user/hbase/test-data/22d8460d-04cc-e032-88ca-2cc20a7dd01c/ IntegrationTestImportTsv/hfiles/d/74655e3f8da142cb94bc31b64f0475cc can't be moved into an encryption zone. ``` In this commit I make it configurable where the IntegrationTestImportTsv generates the HFiles. Co-authored-by: Mate Szalay-Beko <symat@apache.com> Signed-off-by: Peter Somogyi <psomogyi@apache.org>	2021-01-05 09:24:24 +01:00
Lokesh Khurana	f8bd22827a	HBASE-24620 : Add a ClusterManager which submits command to ZooKeeper and its Agent which picks and execute those Commands (#2299 ) Signed-off-by: Aman Poonia <apoonia@salesforce.com> Signed-off-by: Viraj Jasani <vjasani@apache.org>	2020-12-21 15:33:36 +05:30
Duo Zhang	92c3bcd9fb	HBASE-25164 Make ModifyTableProcedure support changing meta replica count (#2513 ) Signed-off-by: Michael Stack <stack@apache.org>	2020-10-13 09:43:56 +08:00
Duo Zhang	1bb19e0cdd	HBASE-25037 Lots of thread pool are changed to non daemon after HBASE-24750 which causes trouble when shutting down (#2407 ) Signed-off-by: Viraj Jasani <vjasani@apache.org>	2020-09-16 21:11:47 +08:00
Joseph295	a589e55a7b	HBASE-24992 log after Generator success when running ITBLL (#2358 ) Signed-off-by: Guanghao Zhang <zghao@apache.org>	2020-09-08 11:46:59 +08:00
Duo Zhang	57e49b3959	HBASE-23834 HBase fails to run on Hadoop 3.3.0/3.2.2/3.1.4 due to jetty version mismatch (#2222 ) Signed-off-by: Viraj Jasani <vjasani@apache.org> Signed-off-by: Josh Elser <elserj@apache.org> Signed-off-by: Peter Somogyi <psomogyi@apache.org>	2020-08-25 12:05:52 +08:00
Viraj Jasani	ea130249ae	HBASE-24750 : Adding default UncaughtExceptionHandler for Thread factories (ADDENDUM) Closes #2231	2020-08-11 17:18:47 +05:30
Viraj Jasani	0b604d921a	HBASE-24750 : All ExecutorService should use guava ThreadFactoryBuilder Closes #2196 Signed-off-by: Nick Dimiduk <ndimiduk@apache.org> Signed-off-by: Ted Yu <tyu@apache.org>	2020-08-07 20:24:36 +05:30
Duo Zhang	d2f5a5f27b	HBAE-24507 Remove HTableDescriptor and HColumnDescriptor (#2186 ) Signed-off-by: stack <stack@apache.org> Signed-off-by: Viraj Jasani <vjasani@apache.org> Signed-off-by: tedyu <yuzhihong@gmail.com>	2020-08-04 10:31:42 +08:00
Nick Dimiduk	a6e3db5ba5	HBASE-24662 Update DumpClusterStatusAction to notice changes in region server count Sometimes running chaos monkey, I've found that we lose accounting of region servers. I've taken to a manual process of checking the reported list against a known reference. It occurs to me that ChaosMonkey has a known reference, and it can do this accounting for me. Signed-off-by: Viraj Jasani <vjasani@apache.org>	2020-07-21 15:56:22 -07:00
Nick Dimiduk	7ebc617026	HBASE-24658 Update PolicyBasedChaosMonkey to handle uncaught exceptions Running `ServerKillingChaosMonkey` via `RESTApiClusterManager` for any duration of time slowly leaks region servers. I see failures on the RESTApi side go unreported on the ChaosMonkey side. It seems like `RuntimeException`s are being thrown and lost. `PolicyBasedChaosMonkey` uses a primitive means of thread management anyway. Update to use a thread pool, thread groups, and an uncaughtExceptionHandler. Signed-off-by: Bharath Vissapragada <bharathv@apache.org> Signed-off-by: Viraj Jasani <vjasani@apache.org>	2020-07-20 16:57:11 -07:00
Duo Zhang	16a25b74db	HBASE-24635 Split TestMetaWithReplicas (#1980 ) Signed-off-by: Guanghao Zhang <zghao@apache.org>	2020-06-27 10:36:07 +08:00
Sandeep Pal	f22c7a6583	HBASE-23126: Removing the un-used integration test class - IntegrationTestRSGroup Closes #1936 Signed-off-by: Duo Zhang <zhangduo@apache.org> Signed-off-by: Jan Hentschel <jan.hentschel@ultratendency.com> Signed-off-by: Viraj Jasani <vjasani@apache.org>	2020-06-20 22:44:03 +05:30
Duo Zhang	c91829bb41	HBASE-24491 Remove HRegionInfo (#1830 ) Signed-off-by: Guanghao Zhang <zghao@apache.org> Signed-off-by: Jan Hentschel <jan.hentschel@ultratendency.com>	2020-06-05 22:19:01 +08:00
Nick Dimiduk	5fb9e518ef	HBASE-24361 Make `RESTApiClusterManager` more resilient (#1701 ) * sometimes API calls return with null/empty response bodies. thus, wrap all API calls in a retry loop. * calls that submit work in the form of "commands" now retrieve the commandId from successful command submission, and track completion of that command before returning control to calling context. * model CM's process state and use that model to guide state transitions more intelligently. this guards against, for example, the start command failing with an error message like "Role must be stopped". * improvements to logging levels, avoid spamming logs with the side-effects of retries at this and higher contexts. * include references to API documentation, such as it is. Signed-off-by: stack <stack@apache.org>	2020-05-19 09:53:22 -07:00
Nick Dimiduk	0dae377f53	HBASE-24360 RollingBatchRestartRsAction loses track of dead servers `RollingBatchRestartRsAction` doesn't handle failure cases when tracking its list of dead servers. The original author believed that a failure to restart would result in a retry. However, by removing the dead server from the failed list, that state is lost, and retry never occurs. Because this action doesn't ever look back to the current state of the cluster, relying only on its local state for the current action invocation, it never realizes the abandoned server is still dead. Instead, be more careful to only remove the dead server from the list when the `startRs` invocation claims to have been successful. Signed-off-by: stack <stack@apache.org>	2020-05-18 13:01:11 -07:00
meiyi	a73132c62b	HBASE-24364 [Chaos Monkey] Invalid data block encoding in ChangeEncodingAction (#1707 ) Signed-off-by: Jan Hentschel <janh@apache.org>	2020-05-15 11:30:44 +08:00
Duo Zhang	8601416ee8	HBASE-24309 Avoid introducing log4j and slf4j-log4j dependencies for modules other than hbase-assembly (#1640 ) Signed-off-by: stack <stack@apache.org>	2020-05-12 12:03:30 +08:00
Nick Dimiduk	0e81ab08b9	HBASE-24295 [Chaos Monkey] abstract logging through the class hierarchy ; ADDENDUM Signed-off-by: Jan Hentschel <jan.hentschel@ultratendency.com>	2020-05-07 13:20:15 -07:00
Michael Stack	5488124be0	HBASE-24284 [h3/jdk11] REST server won't start Exclude transitive includes of jax-rs 1.x and then explicitly include jax-rs 2.x glassfish impl for REST context when hadoop3. (#1625 )	2020-05-05 15:36:01 -07:00
Nick Dimiduk	204a1fad92	HBASE-24295 [Chaos Monkey] abstract logging through the class hierarchy Adds `protected abstract Logger getLogger()` to `Action` so that implementation's names are logged when actions are performed. Signed-off-by: stack <stack@apache.org> Signed-off-by: Jan Hentschel <jan.hentschel@ultratendency.com>	2020-05-04 11:53:24 -07:00
Nick Dimiduk	e37aafcfc2	HBASE-24260 Add a ClusterManager that issues commands via coprocessor Implements `ClusterManager` that relies on the new `ShellExecEndpointCoprocessor` for remote shell command execution. Signed-off-by: Bharath Vissapragada <bharathv@apache.org>	2020-05-04 10:53:02 -07:00
Nick Dimiduk	a97395d4c0	HBASE-24274 `RESTApiClusterManager` attempts to deserialize response using serialization API Use the correct GSON API for deserializing service responses. Add simple unit test covering a very limited selection of the overall API surface area, just enough to ensure deserialization works. Signed-off-by: stack <stack@apache.org>	2020-04-29 13:13:03 -07:00
Duo Zhang	9f52e6b725	HBASE-24249 Move code in FSHDFSUtils to FSUtils and mark related clas… (#1586 ) Signed-off-by: stack <stack@apache.org>	2020-04-29 10:44:34 +08:00
Duo Zhang	6928674eb8	HBASE-24228 Merge the code in hbase-hadoop2-compat module to hbase-hadoop-compat (#1563 ) Signed-off-by: stack <stack@apache.org>	2020-04-29 10:34:53 +08:00
Jan Hentschel	75c717d4c2	HBASE-23848 Removed deprecated setStopRow from Scan (#1184 ) Signed-off-by: Duo Zhang <zhangduo@apache.org>	2020-04-22 15:15:17 +08:00
Duo Zhang	1f66806c96	HBASE-24170 Remove hadoop-2.0 profile (#1495 ) Signed-off-by: stack <stack@apache.org>	2020-04-16 18:57:40 +08:00

1 2 3 4 5 ...

576 Commits