hbase

Commit Graph

Author	SHA1	Message	Date
Duo Zhang	37c2ffdc2b	HBASE-25164 Make ModifyTableProcedure support changing meta replica count (#2513 ) Signed-off-by: Michael Stack <stack@apache.org>	2020-10-13 10:13:48 +08:00
Duo Zhang	7a3bb8aefe	HBASE-25037 Lots of thread pool are changed to non daemon after HBASE-24750 which causes trouble when shutting down (#2407 ) Signed-off-by: Viraj Jasani <vjasani@apache.org>	2020-09-16 22:03:42 +08:00
Joseph295	4acd6735fd	HBASE-24992 log after Generator success when running ITBLL (#2358 ) Signed-off-by: Guanghao Zhang <zghao@apache.org>	2020-09-09 11:08:26 +08:00
Duo Zhang	4455856e9c	HBASE-23834 HBase fails to run on Hadoop 3.3.0/3.2.2/3.1.4 due to jetty version mismatch (#2222 ) Signed-off-by: Viraj Jasani <vjasani@apache.org> Signed-off-by: Josh Elser <elserj@apache.org> Signed-off-by: Peter Somogyi <psomogyi@apache.org>	2020-08-25 15:02:55 +08:00
Nick Dimiduk	c0d7bfb6f7	HBASE-24662 Update DumpClusterStatusAction to notice changes in region server count Sometimes running chaos monkey, I've found that we lose accounting of region servers. I've taken to a manual process of checking the reported list against a known reference. It occurs to me that ChaosMonkey has a known reference, and it can do this accounting for me. Signed-off-by: Viraj Jasani <vjasani@apache.org>	2020-07-21 15:56:40 -07:00
Nick Dimiduk	89cf76c2cd	HBASE-24658 Update PolicyBasedChaosMonkey to handle uncaught exceptions Running `ServerKillingChaosMonkey` via `RESTApiClusterManager` for any duration of time slowly leaks region servers. I see failures on the RESTApi side go unreported on the ChaosMonkey side. It seems like `RuntimeException`s are being thrown and lost. `PolicyBasedChaosMonkey` uses a primitive means of thread management anyway. Update to use a thread pool, thread groups, and an uncaughtExceptionHandler. Signed-off-by: Bharath Vissapragada <bharathv@apache.org> Signed-off-by: Viraj Jasani <vjasani@apache.org>	2020-07-20 17:00:03 -07:00
Duo Zhang	7c78356218	HBASE-24635 Split TestMetaWithReplicas (#1980 ) Signed-off-by: Guanghao Zhang <zghao@apache.org>	2020-06-27 11:11:36 +08:00
Sandeep Pal	0527c2c70d	HBASE-23126: Removing the un-used integration test class - IntegrationTestRSGroup Closes #1936 Signed-off-by: Duo Zhang <zhangduo@apache.org> Signed-off-by: Jan Hentschel <jan.hentschel@ultratendency.com> Signed-off-by: Viraj Jasani <vjasani@apache.org>	2020-06-20 22:51:03 +05:30
meiyi	a41d2a3030	HBASE-24364 [Chaos Monkey] Invalid data block encoding in ChangeEncodingAction (#1707 ) Signed-off-by: Jan Hentschel <janh@apache.org>	2020-05-20 18:26:08 +08:00
Nick Dimiduk	fbe0da2672	HBASE-24361 Make `RESTApiClusterManager` more resilient (#1701 ) * sometimes API calls return with null/empty response bodies. thus, wrap all API calls in a retry loop. * calls that submit work in the form of "commands" now retrieve the commandId from successful command submission, and track completion of that command before returning control to calling context. * model CM's process state and use that model to guide state transitions more intelligently. this guards against, for example, the start command failing with an error message like "Role must be stopped". * improvements to logging levels, avoid spamming logs with the side-effects of retries at this and higher contexts. * include references to API documentation, such as it is. Signed-off-by: stack <stack@apache.org>	2020-05-19 09:46:37 -07:00
Nick Dimiduk	d0c7458e07	HBASE-24360 RollingBatchRestartRsAction loses track of dead servers `RollingBatchRestartRsAction` doesn't handle failure cases when tracking its list of dead servers. The original author believed that a failure to restart would result in a retry. However, by removing the dead server from the failed list, that state is lost, and retry never occurs. Because this action doesn't ever look back to the current state of the cluster, relying only on its local state for the current action invocation, it never realizes the abandoned server is still dead. Instead, be more careful to only remove the dead server from the list when the `startRs` invocation claims to have been successful. Signed-off-by: stack <stack@apache.org>	2020-05-18 12:55:19 -07:00
Nick Dimiduk	c28555c683	HBASE-24295 [Chaos Monkey] abstract logging through the class hierarchy ; ADDENDUM Signed-off-by: Jan Hentschel <jan.hentschel@ultratendency.com>	2020-05-07 13:24:23 -07:00
Nick Dimiduk	9cf541bc8d	HBASE-24295 [Chaos Monkey] abstract logging through the class hierarchy Adds `protected abstract Logger getLogger()` to `Action` so that implementation's names are logged when actions are performed. Signed-off-by: stack <stack@apache.org> Signed-off-by: Jan Hentschel <jan.hentschel@ultratendency.com>	2020-05-04 11:41:33 -07:00
Nick Dimiduk	47dca8eb45	HBASE-24260 Add a ClusterManager that issues commands via coprocessor Implements `ClusterManager` that relies on the new `ShellExecEndpointCoprocessor` for remote shell command execution. Signed-off-by: Bharath Vissapragada <bharathv@apache.org>	2020-05-04 10:52:28 -07:00
Nick Dimiduk	fdf2bd7312	HBASE-24274 `RESTApiClusterManager` attempts to deserialize response using serialization API Use the correct GSON API for deserializing service responses. Add simple unit test covering a very limited selection of the overall API surface area, just enough to ensure deserialization works. Signed-off-by: stack <stack@apache.org>	2020-04-29 13:13:14 -07:00
Duo Zhang	922921ee5f	HBASE-24249 Move code in FSHDFSUtils to FSUtils and mark related clas… (#1586 ) Signed-off-by: stack <stack@apache.org>	2020-04-29 11:31:32 +08:00
BukrosSzabolcs	f951913e24	HBASE-23891: Add an option to Actions to filter out meta RS Signed-off-by: Peter Somogyi <psomogyi@apache.org>	2020-03-17 15:02:33 +01:00
Nick Dimiduk	4f76e24755	Revert "HBASE-23891: Add an option to Actions to filter out meta RS (#1217 )" This reverts commit `7d8fa5c818`.	2020-03-10 11:48:12 -07:00
BukrosSzabolcs	7d8fa5c818	HBASE-23891: Add an option to Actions to filter out meta RS (#1217 ) Signed-off-by: Wellington Chevreuil <wchevreuil@apache.org> (cherry picked from commit `4cb60327be`)	2020-03-06 11:10:00 +00:00
BukrosSzabolcs	f9abaee50c	HBASE-23566: Fix package/packet terminology problem in chaos monkeys (#933 ) s/package/packet/g Signed-off-by: Sean Busbey <busbey@apache.org> (cherry picked from commit `413d4b2d0f`)	2019-12-12 16:34:31 -06:00
Nick Dimiduk	391be59835	HBASE-23552 Format Javadocs on ITBLL We have this nice description in the java doc on ITBLL but it's unformatted and thus illegible. Add some formatting so that it can be read by humans. Signed-off-by: Jan Hentschel <janh@apache.org> Signed-off-by: Josh Elser <elserj@apache.org>	2019-12-10 13:12:54 -08:00
BukrosSzabolcs	014a40b678	HBASE-23352: Allow chaos monkeys to access cmd line params, and improve FillDiskCommandAction (#885 ) Instead of using the default properties when checking for monkey properties, now we use the ones already extended with command line params. Change FillDiskCommandAction to try to stop the remote process if the command failed with an exception. Signed-off-by: stack <stack@apache.org>	2019-12-02 10:33:56 +08:00
Peter Somogyi	ce34d895e5	HBASE-23085 Network and Data related Actions; ADDENDUM (#871 ) Fix percentage in String.format Signed-off-by: Sean Busbey <busbey@apache.org>	2019-11-22 22:15:27 -06:00
BukrosSzabolcs	54be3d1d86	HBASE-23085 Network and Data related Actions Add monkey actions: - manipulate network packages with tc (reorder, loose,...) - add CPU load - fill the disk - corrupt or delete regionserver data files Extend HBaseClusterManager to allow sudo calls. Signed-off-by: Josh Elser <elserj@apache.org> Signed-off-by: Balazs Meszaros <meszibalu@apache.org>	2019-11-19 10:15:35 +01:00
ravowlga123	5dfa58b017	HBASE-18439 Subclasses of o.a.h.h.chaos.actions.Action all use the same logger Signed-off-by: Jan Hentschel <jan.hentschel@ultratendency.com> Signed-off-by: Guangxu Cheng <gxcheng@apache.org>	2019-11-08 20:26:43 +01:00
meiyi	d841245115	HBASE-23170 Admin#getRegionServers use ClusterMetrics.Option.SERVERS_NAME (#721 )	2019-10-18 10:09:42 +08:00
BukrosSzabolcs	cd9367512a	HBASE-22982: region server suspend/resume and graceful rolling restart actions (#592 ) * Add chaos monkey action for suspend/resume region servers * Add chaos monkey action for graceful rolling restart * Add these to relevant chaos monkeys Signed-off-by: Balazs Meszaros <meszibalu@apache.org> Signed-off-by: Peter Somogyi <psomogyi@apache.org>	2019-09-26 11:56:17 +02:00
Guanghao	78f704796e	HBASE-22624 Should sanity check table configuration when clone snapshot to a new table	2019-07-03 18:28:48 +08:00
李小保	1bd5b5cf7b	HBASE-22250 The same constants used in many places should be placed in constant classes Signed-off-by: stack <stack@apache.org>	2019-04-23 21:24:46 -07:00
Jan Hentschel	c40e6e2339	HBASE-22231 Removed unused and '*' import	2019-04-23 12:53:52 +02:00
zhanggangxue	9a0daa8cbd	HBASE-21257 misspelled words.[occured -> occurred]	2019-04-14 21:36:24 +08:00
zhangduo	b04b1ecc74	HBASE-22108 Avoid passing null in Admin methods Signed-off-by: Guanghao Zhang <zghao@apache.org>	2019-04-07 21:08:55 +08:00
Vladimir Rodionov	ae4bfabeaa	HBASE-21688 Address WAL filesystem issues Amending-Author: Josh Elser <elserj@apache.org> Signed-off-by: Josh Elser <elserj@apache.org>	2019-04-03 13:55:48 -04:00
Guanghao Zhang	607ac735c4	HBASE-21922 BloomContext#sanityCheck may failed when use ROWPREFIX_DELIMITED bloom filter	2019-02-23 23:29:53 +08:00
Duo Zhang	761aef6d9d	HBASE-20587 Replace Jackson with shaded thirdparty gson Signed-off-by: Michael Stack <stack@apache.org>	2019-02-22 16:40:45 +08:00
Duo Zhang	4e792414f6	HBASE-21731 Do not need to use ClusterConnection in IntegrationTestBigLinkedListWithVisibility Signed-off-by: Peter Somogyi <psomogyi@apache.org>	2019-01-16 20:59:37 +08:00
Duo Zhang	9ec84c235f	HBASE-21704 The implementation of DistributedHBaseCluster.getServerHoldingRegion is incorrect	2019-01-11 21:20:50 +08:00
Zephyr Guo	2b1716fd8e	HBASE-21256 Improve IntegrationTestBigLinkedList for testing huge data Signed-off-by: Duo Zhang <zhangduo@apache.org> Signed-off-by: Andrew Purtell <apurtell@apache.org>	2018-10-12 11:00:03 +08:00
Guangxu Cheng	fd68e7593e	HBASE-20636 Introduce two bloom filter type : ROWPREFIX and ROWPREFIX_DELIMITED Signed-off-by: Andrew Purtell <apurtell@apache.org> Amending-Author: Andrew Purtell <apurtell@apache.org>	2018-09-21 16:06:34 -07:00
Monani Mihir	06a92a3d20	HBASE-19036 Add action in Chaos Monkey to restart Active Namenode Signed-off-by: tedyu <yuzhihong@gmail.com>	2018-08-02 05:00:16 -07:00
Allan Yang	1a6fae74b5	HBASE-20870 Wrong HBase root dir in ITBLL's Search Tool	2018-07-20 11:22:03 +08:00
Sahil Aggarwal	e61507b9a0	HBASE-19164: Remove UUID.randomUUID in tests. Signed-off-by: Mike Drob <mdrob@apache.org>	2018-06-27 10:36:48 -05:00
Mike Drob	b04c976fe6	HBASE-20478 Update checkstyle to v8.2 Cannot go to latest (8.9) yet due to https://github.com/checkstyle/checkstyle/issues/5279 * move hbaseanti import checks to checkstyle * implment a few missing equals checks, and ignore one * fix lots of javadoc errors Signed-off-by: Sean Busbey <busbey@apache.org>	2018-06-18 14:02:40 -07:00
maoling	4c95b82b61	HBASE-19761:Fix Checkstyle errors in hbase-zookeeper Signed-off-by: Jan Hentschel <jan.hentschel@ultratendency.com>	2018-06-02 10:17:27 +02:00
Josh Elser	c3d82a283d	HBASE-20223 Update to hbase-thirdparty 2.1.0 Remove commons-cli and commons-collections4 use. Account for the newer internal protobuf version of 3.5.1. Signed-off-by: Michael Stack <stack@apache.org> Signed-off-by: Mike Drob <mdrob@apache.org>	2018-03-26 16:07:39 -04:00
Chia-Ping Tsai	95596e8ba7	HBASE-20119 Introduce a pojo class to carry coprocessor information in order to make TableDescriptorBuilder accept multiple cp at once Signed-off-by: Ted Yu <yuzhihong@gmail.com> Signed-off-by: Michael Stack <stack@apache.org>	2018-03-16 01:26:08 +08:00
Michael Stack	260ee0da60	HBASE-20173 [AMv2] DisableTableProcedure concurrent to ServerCrashProcedure can deadlock Allow that DisableTableProcedue can grab a region lock before ServerCrashProcedure can. Cater to this cricumstance where SCP was not unable to make progress by running the search for RIT against the crashed server a second time, post creation of all crashed-server assignemnts. The second run will uncover such as the above DisableTableProcedure unassign and will interrupt its suspend allowing both procedures to make progress. M hbase-protocol-shaded/src/main/protobuf/MasterProcedure.proto Add new procedure step post-assigns that reruns the RIT finder method. M hbase-server/src/main/java/org/apache/hadoop/hbase/master/assignment/AssignmentManager.java Make this important log more specific as to what is going on. M hbase-server/src/main/java/org/apache/hadoop/hbase/master/assignment/UnassignProcedure.java Better explanation as to what is going on. M hbase-server/src/main/java/org/apache/hadoop/hbase/master/procedure/ServerCrashProcedure.java Add extra step and run handleRIT a second time after we've queued up all SCP assigns. Also fix a but. SCP was adding an assign of a RIT that was actually trying to unassign (made the deadlock more likely).	2018-03-13 05:44:43 -07:00
Michael Stack	8b3ae58e18	HBASE-20043 ITBLL fails against hadoop3 Fix MoveRandomRegionOfTableAction. It depended on old AM behavior. Make it do explicit move as is required in AMv3; w/o it, it was just closing region causing test to fail. Fix pom so hadoop3 profile specifies a different netty3 version. Bunch of logging format change that came of trying trying to read the spew from this test.	2018-02-24 17:29:24 -08:00
Michael Stack	8f1e01b6e5	HBASE-19951 Cleanup the explicit timeout value for test method	2018-02-07 16:39:54 -08:00
Mike Drob	7d449892af	HBASE-19947 ITU should overwrite HTU local FS	2018-02-07 16:56:11 -06:00

1 2 3 4 5 ...

438 Commits