Commit Graph

366 Commits

Author SHA1 Message Date
bshashikant 7f9073132d HDDS-2117. ContainerStateMachine#writeStateMachineData times out. (#1430) 2019-09-17 16:49:25 +05:30
Doroszlai, Attila e54977f888 HDDS-2132. TestKeyValueContainer is failing (#1457). 2019-09-17 14:31:33 +05:30
Vivek Ratnavel Subramanian 66bd1681f8 HDDS-2107. Datanodes should retry forever to connect to SCM in an unsecure environment (#1424) 2019-09-16 12:58:16 -07:00
avijayanhwx 8024818d33 HDDS-1786 : Datanodes takeSnapshot should delete previously created snapshots (#1163) 2019-09-13 23:20:31 +05:30
Shashikant Banerjee fe8cdf0ab8 HDDS-2076. Read fails because the block cannot be located in the container (#1410)
Signed-off-by: Nanda kumar <nanda@apache.org>
2019-09-12 21:18:14 +05:30
Lokesh Jain c3beeb7761 HDDS-2048: State check during container state transition in datanode should be lock protected (#1375) 2019-09-10 14:14:52 +05:30
Mukul Kumar Singh d69b811ddd HDDS-2102. HddsVolumeChecker should use java optional in place of Guava optional. Contributed by Mukul Kumar Singh. (#1416) 2019-09-09 14:17:28 -07:00
Shashikant Banerjee 469165e6f2 HDDS-1843. Undetectable corruption after restart of a datanode. Contributed by Shashikant Banerjee(#1364). 2019-09-09 22:43:20 +05:30
Vivek Ratnavel Subramanian 3b9584d12b HDDS-2087. Remove the hard coded config key in ChunkManager (#1409) 2019-09-08 20:44:07 -07:00
Lokesh Jain 6e4cdf89ef HDDS-1561: Mark OPEN containers as QUASI_CLOSED as part of Ratis groupRemove (#1401) 2019-09-06 13:15:49 +05:30
Anu Engineer acbea8d976 HDDS-1708. Add container scrubber metrics.
Contributed by Hrishikesh Gadre.
2019-09-05 14:33:06 -07:00
Nanda kumar 2b16d5377c
HDDS-1898. GrpcReplicationService#download cannot replicate the container. (#1326) 2019-09-05 18:12:36 +05:30
Hrishikesh Gadre f347c348d8 Add support for checksum verification in data scrubber
Signed-off-by: Anu Engineer <aengineer@apache.org>
2019-09-04 20:31:49 -07:00
avijayanhwx b53d19a343 HDDS-1783 : Latency metric for applyTransaction in ContainerStateMachine (#1363). 2019-09-03 15:18:50 +05:30
Doroszlai, Attila a2d083f2c5 HDDS-1413. Attempt to fix TestCloseContainerCommandHandler by adjusting timeouts
Signed-off-by: Anu Engineer <aengineer@apache.org>
2019-08-30 18:17:08 -07:00
Xiaoyu Yao ec34cee5e3
HDDS-2047. Datanodes fail to come up after 10 retries in a secure env… (#1379) 2019-08-30 09:27:37 -07:00
Márton Elek 371c9eb6a6
Revert "HDDS-1596. Create service endpoint to download configuration from SCM."
This reverts commit c0499bd704.
2019-08-29 09:25:03 +02:00
Doroszlai, Attila 872cdf48a6 HDDS-1941. Unused executor in SimpleContainerDownloader (#1367) 2019-08-28 14:59:49 -07:00
supratimdeka 1407414a52 HDDS-1094. Performance test infrastructure : skip writing user data on Datanode. Contributed by Supratim Deka (#1323) 2019-08-28 10:05:20 -07:00
Márton Elek c0499bd704 HDDS-1596. Create service endpoint to download configuration from SCM.
Signed-off-by: Anu Engineer <aengineer@apache.org>
2019-08-28 09:13:46 -07:00
bshashikant 5d31a4eff7 HDDS-1753. Datanode unable to find chunk while replication data using ratis. (#1318) 2019-08-28 10:54:44 +05:30
Vivek Ratnavel Subramanian b1eee8b52e HDDS-1946. CertificateClient should not persist keys/certs to ozone.m… (#1311) 2019-08-27 17:29:27 -07:00
Doroszlai, Attila 0883ce1021 HDDS-2026. Overlapping chunk region cannot be read concurrently
Signed-off-by: Anu Engineer <aengineer@apache.org>
2019-08-27 15:34:58 -07:00
Shashikant Banerjee 66cfa482c4 HDDS-1610. applyTransaction failure should not be lost on restart. Contributed by Shashikant Banerjee(#1226). 2019-08-27 23:38:43 +05:30
Shashikant Banerjee ce8eb1283a Revert "HDDS-1610. applyTransaction failure should not be lost on restart. Contributed by Shashikant Banerjee."
This reverts commit 62445021d5 as it has unintended changes in DirectoryWithSnapshotFeature class..
2019-08-27 23:23:44 +05:30
Lokesh Jain 4379370fb1
HDDS-1981: Datanode should sync db when container is moved to CLOSED or QUASI_CLOSED state (#1319) 2019-08-27 10:22:03 +05:30
avijayanhwx 28fb4b527a HDDS-2008 : Wrong package for RatisHelper class in hadoop-hdds/common module. (#1333) 2019-08-22 10:57:22 -07:00
Shashikant Banerjee 62445021d5 HDDS-1610. applyTransaction failure should not be lost on restart. Contributed by Shashikant Banerjee. 2019-08-20 14:37:01 +05:30
Aravindan Vijayan fc229b6490 HDDS-1832 : Improve logging for PipelineActions handling in SCM and datanode. (Change to Error logging)
Signed-off-by: Anu Engineer <aengineer@apache.org>
2019-08-13 22:04:31 -07:00
supratimdeka f8ea6e1ce1 HDDS-1798. Propagate failure in writeStateMachineData to Ratis. Contributed by Supratim Deka (#1113) 2019-08-05 13:43:41 +05:30
avijayanhwx 6e8c8413ee HDDS-1832 : Improve logging for PipelineActions handling in SCM and datanode. (#1200) 2019-08-02 22:31:00 +05:30
Doroszlai, Attila 946e301731 HDDS-1878. checkstyle error in ContainerStateMachine (#1195) 2019-08-02 22:27:54 +05:30
Lokesh Jain d4ab9aea6f HDDS-1816: ContainerStateMachine should limit number of pending apply transactions. Adds a config, uses snapshot threshold default value. (#1150) 2019-07-31 13:26:24 +05:30
Eric Yang d023663e3e HDDS-1833. Moved RefCountedDB stacktrace to log level trace.
Contributed by Siddharth Wagle
2019-07-29 12:05:24 -04:00
Doroszlai, Attila 278390f5bf HDDS-1710. Fix checkstyle errors
Signed-off-by: Anu Engineer <aengineer@apache.org>
2019-07-23 07:56:30 -07:00
Márton Elek c533b79c32 HDDS-1710. Publish JVM metrics via Hadoop metrics
Signed-off-by: Anu Engineer <aengineer@apache.org>
2019-07-22 16:27:50 -07:00
Doroszlai, Attila c958eddcf4 HDDS-1811. Prometheus metrics are broken.
Signed-off-by: Anu Engineer <aengineer@apache.org>
2019-07-22 14:12:06 -07:00
Sammi Chen 69a46a95bb HDDS-1713. ReplicationManager fail to find proper node topology based… (#1112) 2019-07-19 15:45:26 -07:00
Shashikant Banerjee 20cf50c6d0 HDDS-1654. Ensure container state on datanode gets synced to disk whennever state change happens. Cotributed by Shashikant Banerjee. (#923) 2019-07-18 17:09:05 +05:30
Lokesh Jain 23e9bebe13
HDDS-1767: ContainerStateMachine should have its own executors for executing applyTransaction calls (#1087) 2019-07-18 14:48:13 +05:30
Nanda kumar 1502e6a22f HDDS-1756. DeleteContainerCommandHandler fails with NPE. Contributed by Nanda kumar(#1095). 2019-07-16 15:06:41 +05:30
Mukul Kumar Singh 0976f6fc30
HDDS-1766. ContainerStateMachine is unable to increment lastAppliedTermIndex. Contributed by Mukul Kumar Singh. (#1072) 2019-07-14 10:53:51 +05:30
Márton Elek 9119ed07ff
HDDS-1384. TestBlockOutputStreamWithFailures is failing
Closes #1029
2019-07-12 11:00:03 +02:00
Hrishikesh Gadre acef5e0cec HDDS-1201. Reporting corrupted containers info to SCM (#1032) 2019-07-11 14:49:58 +05:30
supratimdeka ac7a8accdf HDDS-1603. Handle Ratis Append Failure in Container State Machine. Contributed by Supratim Deka (#1019) 2019-07-10 07:49:36 +05:30
Mukul Kumar Singh 96d0555913 HDDS-1718. Increase Ratis Leader election timeout default. Contributed by Aravindan Vijayan & Siddharth Wagle. (#1062) 2019-07-09 23:17:50 +05:30
Mukul Kumar Singh e5ffb88257 HDDS-1550. MiniOzoneCluster is not shutting down all the threads during shutdown. Contributed by Mukul Kumar Singh. (#1050)
* HDDS-1550. MiniOzoneCluster is not shutting down all the threads during shutdown. Contributed by Mukul Kumar Singh.
2019-07-08 20:24:12 -07:00
Mukul Kumar Singh ce91d35b29
HDDS-1728. Add metrics for leader's latency in ContainerStateMachine. Contributed by Mukul Kumar Singh. (#1022) 2019-07-08 12:19:16 +05:30
Mukul Kumar Singh 9df6275954 HDDS-1555. Disable install snapshot for ContainerStateMachine. Contributed by Siddharth Wagle. (#846) 2019-07-01 20:52:32 +05:30
Elek, Márton 004207996c HDDS-1597. Remove hdds-server-scm dependency from ozone-common (#969)
* HDDS-1597. Remove hdds-server-scm dependency from ozone-common. Contributed by Elek, Marton.

* checkstyle fixes

* revert the import reorder of HddsUtil

* add javadoc

* switch back to the commons-lang2

* fix typo

* fix metrics core classpath problem (+rebase fix)
2019-06-24 11:56:04 -07:00