381 Commits

Author SHA1 Message Date
Márton Elek c533b79c32 HDDS-1710. Publish JVM metrics via Hadoop metrics
Signed-off-by: Anu Engineer <aengineer@apache.org>
2019-07-22 16:27:50 -07:00
Doroszlai, Attila c958eddcf4 HDDS-1811. Prometheus metrics are broken.
Signed-off-by: Anu Engineer <aengineer@apache.org>
2019-07-22 14:12:06 -07:00
Sammi Chen 69a46a95bb HDDS-1713. ReplicationManager fail to find proper node topology based… (#1112) 2019-07-19 15:45:26 -07:00
Shashikant Banerjee 20cf50c6d0 HDDS-1654. Ensure container state on datanode gets synced to disk whennever state change happens. Cotributed by Shashikant Banerjee. (#923) 2019-07-18 17:09:05 +05:30
Lokesh Jain 23e9bebe13 HDDS-1767: ContainerStateMachine should have its own executors for executing applyTransaction calls (#1087) 2019-07-18 14:48:13 +05:30
Nanda kumar 1502e6a22f HDDS-1756. DeleteContainerCommandHandler fails with NPE. Contributed by Nanda kumar(#1095). 2019-07-16 15:06:41 +05:30
Mukul Kumar Singh 0976f6fc30 HDDS-1766. ContainerStateMachine is unable to increment lastAppliedTermIndex. Contributed by Mukul Kumar Singh. (#1072) 2019-07-14 10:53:51 +05:30
Márton Elek 9119ed07ff HDDS-1384. TestBlockOutputStreamWithFailures is failing
Closes #1029
2019-07-12 11:00:03 +02:00
Hrishikesh Gadre acef5e0cec HDDS-1201. Reporting corrupted containers info to SCM (#1032) 2019-07-11 14:49:58 +05:30
supratimdeka ac7a8accdf HDDS-1603. Handle Ratis Append Failure in Container State Machine. Contributed by Supratim Deka (#1019) 2019-07-10 07:49:36 +05:30
Mukul Kumar Singh 96d0555913 HDDS-1718. Increase Ratis Leader election timeout default. Contributed by Aravindan Vijayan & Siddharth Wagle. (#1062) 2019-07-09 23:17:50 +05:30
Mukul Kumar Singh e5ffb88257 HDDS-1550. MiniOzoneCluster is not shutting down all the threads during shutdown. Contributed by Mukul Kumar Singh. (#1050)
* HDDS-1550. MiniOzoneCluster is not shutting down all the threads during shutdown. Contributed by Mukul Kumar Singh.
2019-07-08 20:24:12 -07:00
Mukul Kumar Singh ce91d35b29 HDDS-1728. Add metrics for leader's latency in ContainerStateMachine. Contributed by Mukul Kumar Singh. (#1022) 2019-07-08 12:19:16 +05:30
Mukul Kumar Singh 9df6275954 HDDS-1555. Disable install snapshot for ContainerStateMachine. Contributed by Siddharth Wagle. (#846) 2019-07-01 20:52:32 +05:30
Elek, Márton 004207996c HDDS-1597. Remove hdds-server-scm dependency from ozone-common (#969)
* HDDS-1597. Remove hdds-server-scm dependency from ozone-common. Contributed by Elek, Marton.

* checkstyle fixes

* revert the import reorder of HddsUtil

* add javadoc

* switch back to the commons-lang2

* fix typo

* fix metrics core classpath problem (+rebase fix)
2019-06-24 11:56:04 -07:00
Hrishikesh Gadre 8194a1196e HDDS-1690. ContainerController should provide a way to retrieve containers per volume (#986) 2019-06-21 10:23:11 -07:00
Hanisha Koneru 2dfa932818 HDDS-1684. OM should create Ratis related dirs only if ratis is enabled (#965) 2019-06-18 16:08:48 -07:00
Elek, Márton e094b3b4cd HDDS-1680. Create missing parent directories during the creation of HddsVolume dirs (#961) 2019-06-13 16:18:15 -07:00
Shweta Yakkali fcfe7a3cc0 HDDS-1655. Redundant toString() call for metaDataPath in KeyValueContainerCheck (#932) 2019-06-08 22:12:29 -07:00
Xiaoyu Yao c7e6f076df HDDS-1650. Fix Ozone tests leaking volume checker thread. Contributed by Xiaoyu Yao. (#915) 2019-06-06 23:50:04 +05:30
Xiaoyu Yao 76c0183ae3 HDDS-1652. HddsDispatcher should not shutdown volumeSet. Contributed by Xiaoyu Yao. (#916) 2019-06-06 23:47:59 +05:30
Shweta Yakkali c8276f3e76 HDDS-1201. Reporting Corruptions in Containers to SCM (#912) 2019-06-06 23:36:48 +05:30
supratimdeka 9fded678ff HDDS-1621. writeData in ChunkUtils should not use AsynchronousFileChannel. Contributed by Supratim Deka (#917) 2019-06-06 18:53:37 +05:30
Mukul Kumar Singh 277e9a835b Opening of rocksDB in datanode fails with "No locks available"
Signed-off-by: Nanda kumar <nanda@apache.org>
2019-06-04 02:12:44 +05:30
Shashikant Banerjee f3271126fc HDDS-1558. IllegalArgumentException while processing container Reports.
Signed-off-by: Nanda kumar <nanda@apache.org>
2019-06-04 00:59:02 +05:30
Shashikant Banerjee 18c1eebc08 HDDS-1502. Add metrics for Ozone Ratis performance.Contributed by Shashikant Banerjee(#833). 2019-05-30 16:17:45 +05:30
Akira Ajisaka afd844059c HADOOP-16331. Fix ASF License check in pom.xml
Signed-off-by: Takanobu Asanuma <tasanuma@apache.org>
2019-05-29 17:25:13 +09:00
supratimdeka 346c2b7980 HDDS-1559. Include committedBytes to determine Out of Space in VolumeChoosingPolicy. Contributed by Supratim Deka (#841) 2019-05-28 20:48:14 -07:00
Xiaoyu Yao 7f2e87a419 HDDS-1604. ContainerReader#initializeUsedBytes leaks DB reference. Co… (#866) 2019-05-28 18:39:23 -07:00
Elek, Márton 79d14d0d42 HDDS-1341. TestContainerReplication#testContainerReplication fails intermittently. Contributed by Elek, Marton. (#862) 2019-05-28 14:40:45 -07:00
Bharat Viswanadham d8b18e82f0 HDDS-1602. Fix TestContainerPersistence#testDeleteBlockTwice. (#858) 2019-05-28 11:27:16 -07:00
Shweta Yakkali d1ec1c52e5 HDDS-1580. Obtain Handler reference in ContainerScrubber (#842) 2019-05-28 11:14:24 -07:00
Mukul Kumar Singh 9f0d3412f8 HDDS-1533. JVM exit on TestHddsDatanodeService
Closes #859
2019-05-28 14:18:25 +02:00
Akira Ajisaka 9f933e6446 HADOOP-16323. https everywhere in Maven settings. 2019-05-27 15:24:59 +09:00
sdeka ca93760504 fixed checkstyle issue post merge 2019-05-23 20:49:52 +05:30
sdeka 72bef0f6cb fixed merge error. adapted to new signature of BlockUtils getDB 2019-05-23 11:36:40 +05:30
supratimdeka 9da62f33be Merge branch 'trunk' into HDDS-1535 2019-05-23 09:51:33 +05:30
Ajay Yadav 9c61494c02 HDDS-1065. OM and DN should persist SCM certificate as the trust root. Contributed by Ajay Kumar. (#834) 2019-05-22 11:47:32 -07:00
Mukul Kumar Singh 2fc6f8599a HDDS-1449. JVM Exit in datanode while committing a key. Contributed by Mukul Kumar Singh. (#825) 2019-05-22 17:18:40 +05:30
sdeka 456bb8a3e0 verifyContainerData also does fixup, renamed. Added a Javadoc comment, both as per review discussion 2019-05-22 09:34:21 +05:30
sdeka 64c39856d2 Fixed checkstyle issues. 2019-05-20 08:13:37 +05:30
sdeka 869a1ab41a HDDS-1535. Space tracking for Open Containers : Handle Node Startup. Contributed by Supratim Deka 2019-05-18 16:16:33 +05:30
Arpit Agarwal 732133cb2a HDDS-1422. Exception during DataNode shutdown. Contributed by Arpit A… (#725)
* HDDS-1422. Exception during DataNode shutdown. Contributed by Arpit Agarwal.

Change-Id: I6db6bdd19839a45e5341ed7e745cd38f68af8378

* Suppress spurious findbugs warning.

* Remove log file that got committed in error
2019-05-18 11:47:19 +02:00
Siddharth c183bd8e20 HDDS-1527. HDDS Datanode start fails due to datanode.id file read error. Contributed by Siddharth Wagle.
This closes #822.
2019-05-16 15:13:10 -07:00
Márton Elek 03ea8ea92e HDDS-1297. Fix IllegalArgumentException thrown with MiniOzoneCluster Initialization. Contributed by Yiqun Lin. 2019-05-16 17:04:46 +02:00
supratimdeka 9569015802 HDDS-1511. Space tracking for Open Containers in HDDS Volumes. Contributed by Supratim Deka (#812) 2019-05-15 10:26:01 -07:00
Vivek Ratnavel Subramanian b832e174b0 HDDS-1474. "ozone.scm.datanode.id" config should take path for a dir and not a file (#809) 2019-05-09 17:11:09 -07:00
Hanisha Koneru 2d31ccc1b3 Revert "HDDS-1474. ozone.scm.datanode.id config should take path for a dir (#792)" (#808)
This reverts commit 18cca583ac.
2019-05-09 11:18:46 -07:00
Vivek Ratnavel Subramanian 18cca583ac HDDS-1474. ozone.scm.datanode.id config should take path for a dir (#792) 2019-05-09 10:23:44 -07:00
avijayanhwx eb9c8900bc HDDS-1475 : Fix OzoneContainer start method. (#788) 2019-05-07 14:09:41 -07:00
Siddharth 1df679985b HDDS-1473. DataNode ID file should be human readable. (#781) 2019-05-02 13:59:14 -07:00
Siddharth 5bca062d0e HDDS-1476. Fix logIfNeeded logic in EndPointStateMachine. (#779) 2019-04-29 12:05:38 -07:00
Bharat Viswanadham 43b2a4b77b HDDS-1456. Stop the datanode, when any datanode statemachine state is… (#769) 2019-04-26 14:25:34 -07:00
Kitti Nanasi 3f787cd506 HDDS-1192. Support -conf command line argument in GenericCli
Closes #713
2019-04-24 15:34:49 +02:00
Nanda kumar 7e1f8d3a1b HDDS-1368. Cleanup old ReplicationManager code from SCM. 2019-04-23 17:35:39 +05:30
Chen, Junjie 518f47bf9b HDDS-976: Parse network topology from yaml file. Contributed by Junjie Chen. (#661)
Signed-off-by: Xiaoyu Yao <xyao@apache.org>
2019-04-18 16:53:25 -07:00
Doroszlai, Attila 0a87fd1464 HDDS-1434. TestDatanodeStateMachine is flaky (#740) 2019-04-16 19:13:01 -07:00
Hanisha Koneru e543c3b31a HDDS-1376. Datanode exits while executing client command when scmId is null (#724) 2019-04-16 13:51:39 -07:00
Arpit Agarwal 62f4808617 HDDS-1428. Remove benign warning in handleCreateContainer. Contributed by Siddharth Wagle. 2019-04-12 15:08:01 -07:00
Arpit Agarwal abace709cc HDDS-1420. Tracing exception in DataNode HddsDispatcher
Closes #723
2019-04-12 12:21:10 +02:00
Doroszlai, Attila 3b08ac4659 HDDS-1363. ozone.metadata.dirs doesn't pick multiple dirs (#691) 2019-04-12 15:00:08 +05:30
Nanda kumar e9c4109004 HDDS-1417. After successfully importing a container, datanode should delete the container tar.gz file from working directory. (#721) 2019-04-10 16:55:37 -07:00
Bharat Viswanadham 0e770a6539 HDDS-1370. Command Execution in Datanode fails because of NPE (#715) 2019-04-10 10:25:28 -07:00
Márton Elek dfb518bbf5 HDDS-1397. Avoid the usage of signal handlers in datanodes of the MiniOzoneClusters.
Closes #701
2019-04-10 18:35:36 +02:00
Mukul Kumar Singh df01469141 HDDS-1401. Static ContainerCache in Datanodes can result in overwrite of container db. Contributed by Mukul Kumar Singh. (#708) 2019-04-10 18:00:10 +05:30
Shashikant Banerjee 2d4f6b6daa HDDS-372. There are three buffer copies in BlockOutputStream. Contributed by Shashikant Banerjee. 2019-04-09 10:27:58 +05:30
Doroszlai, Attila 274ab4e86c HDDS-1332. Attempt to fix flaky test testStartStopDatanodeStateMachine (#697) 2019-04-05 13:21:58 -07:00
Yiqun Lin f96fb05a2b HDDS-1365. Fix error handling in KeyValueContainerCheck. Contributed by Supratim Deka. 2019-04-03 14:01:30 +08:00
Yiqun Lin d31c86892e HDDS-1337. Handle GroupMismatchException in OzoneClient. Contributed by Shashikant Banerjee. 2019-04-02 16:32:30 +08:00
Xiaoyu Yao d7a2f9456b HDDS-1350. Fix checkstyle issue in TestDatanodeStateMachine. Contributed by Xiaoyu Yao. (#656) 2019-03-28 14:50:34 -07:00
Arpit Agarwal b4ed81c4e6 HDDS-1332. Add some logging for flaky test testStartStopDatanodeStateMachine. Contributed by Arpit Agarwal. (#649) 2019-03-27 20:53:30 +05:30
Sandeep Nemuri 5c0a81ad3c HDDS-1310. In datanode once a container becomes unhealthy, datanode restart fails.
Signed-off-by: Nanda kumar <nanda@apache.org>
2019-03-26 15:44:18 +05:30
Bharat Viswanadham 128dd91e10 HDDS-1331. In DatanodeStateMachine join check for not null. 2019-03-23 09:43:49 -07:00
Ajay Kumar 771ea6b5e7 Revert "HDDS-1310. In datanode once a container becomes unhealthy, datanode restart fails. Contributed by Sandeep Nemuri."
This reverts commit efad5717ec.
2019-03-22 16:20:44 -07:00
Ajay Kumar efad5717ec HDDS-1310. In datanode once a container becomes unhealthy, datanode restart fails. Contributed by Sandeep Nemuri. 2019-03-22 15:53:12 -07:00
Arpit Agarwal 90afc9ab03 HDDS-1320. Update ozone to latest ratis snapshot build (0.4.0-1fc5ace-SNAPSHOT). Contributed by Mukul Kumar Singh. 2019-03-21 14:24:02 -07:00
Ajay Yadav f10d493325 HDDS-1119. DN get OM certificate from SCM CA for block token validation. Contributed by Ajay Kumar. (#601) 2019-03-18 23:08:17 -07:00
Bharat Viswanadham 926d548caa HDDS-1281. Fix the findbug issue caused by HDDS-1163. Contributed by Aravindan Vijayan. 2019-03-15 21:06:44 -07:00
Shashikant Banerjee 155ab6d5d8 HDDS-1098. Introduce Retry Policy in Ozone Client. Contributed by Shashikant Banerjee. 2019-03-15 15:40:59 +05:30
Márton Elek 3d140a537a HDDS-1247. Bump trunk ozone version to 0.5.0. Contributed by Elek, Marton.
Closes #588
2019-03-14 12:01:25 +01:00
Mukul Kumar Singh ea9fb07fd2 HDDS-1241. Update ozone to latest ratis snapshot build (0.4.0-5680cf5-SNAPSHOT). Contributed by Mukul Kumar Singh. 2019-03-14 15:18:06 +05:30
Mukul Kumar Singh d602d462a6 HDDS-1256. Fix failure in TestKeyValueContainerCheck. Contributed by Supratim Deka. 2019-03-13 16:22:19 +05:30
Arpit Agarwal 24793d2d97 HDDS-1163. Basic framework for Ozone Data Scrubber. Contributed by Supratim Deka. 2019-03-13 04:32:39 +09:00
Márton Elek d17e31e062 HDDS-1214. Enable tracing for the datanode read/write path. Contributed by Elek, Marton.
Closes #550.
2019-03-12 11:34:56 +01:00
Mukul Kumar Singh 8458ced11e HDDS-1238. Fix Ratis Snapshot creation error if the snapshot file already exists in ContainerStateMachine. Contributed by Shashikant Banerjee. 2019-03-09 22:46:20 +05:30
Ajay Yadav 064f38b3a5 HDDS-594. SCM CA: DN sends CSR and uses certificate issued by SCM. Contributed by Ajay Kumar. (#547) 2019-03-07 14:41:52 -08:00
Lokesh Jain 129fd5dd18 HDDS-1208. ContainerStateMachine should set chunk data as state machine data for ratis. Contributed by Lokesh Jain. 2019-03-06 17:00:37 +05:30
Shashikant Banerjee 62e89dc275 HDDS-1184. Parallelization of write chunks in datanodes is broken. Contributed by Shashikant Banerjee. 2019-03-06 10:00:16 +05:30
Elek, Márton 7f636b47b5 HDDS-919. Enable prometheus endpoints for Ozone datanodes (#502)
Pull request from Elek, Márton. (#502)
2019-03-05 12:04:57 -08:00
Shashikant Banerjee 5fcea54a50 HDDS-935. Avoid creating an already created container on a datanode in case of disk removal followed by datanode restart. Contributed by Shashikant Banerjee. 2019-03-05 22:09:46 +05:30
Xiaoyu Yao 9e0f3d1c52 HDDS-1038. Support Service Level Authorization for Ozone. Contributed by Xiaoyu Yao and Ajay Kumar. 2019-02-26 06:02:10 -08:00
Anu Engineer 014e17af78 HDDS-1166. Fix checkstyle line length issues.
Contributed by Nandakumar.
2019-02-23 20:31:39 -08:00
Nanda kumar 2bc3cfe28f HDDS-1126. Datanode is trying to qausi-close a container which is already closed.
Signed-off-by: Nanda kumar <nanda@apache.org>
2019-02-21 21:37:08 +05:30
Anu Engineer 75e15cc0c4 HDDS-1103.Fix rat/findbug/checkstyle errors in ozone/hdds projects.
Contributed by Elek, Marton.
2019-02-14 23:33:25 -08:00
Márton Elek e7d1ae52d2 HDDS-1017. Use distributed tracing to indentify performance problems in Ozone. Contributed by Elek, Marton. 2019-02-11 12:18:26 +01:00
Mukul Kumar Singh 214112b2d7 HDDS-1010. ContainerSet#getContainerMap should be renamed. Contributed by Supratim Deka. 2019-02-07 18:06:23 +05:30
Bharat Viswanadham 912d9f790d HDDS-1044. Client doesn't propogate correct error code to client on out of disk space. Contributed by Yiqun Lin. 2019-02-06 20:02:17 -08:00
Yiqun Lin 3efa168e1f HDDS-1029. Allow option for force in DeleteContainerCommand. Contributed by Bharat Viswanadham. 2019-02-05 10:51:52 +08:00
Arpit Agarwal c35419579b HDDS-1016. Allow marking containers as unhealthy. Contributed by Arpit Agarwal. 2019-01-30 11:40:50 -08:00
Arpit Agarwal 3b49d7aeae HDDS-989. Check Hdds Volumes for errors. Contributed by Arpit Agarwal. 2019-01-27 11:18:30 -08:00