OpenSearch/server
Armin Braun 22679c7932
Fix Snapshot Corruption in Edge Case (#47552) (#47620)
This fixes missing to marking shard snapshots as failures when
multiple data-nodes are lost during the snapshot process or
shard snapshot failures have occured before a node left the cluster.

The problem was that we were simply not adding any shard entries for completed
shards on node-left events. This has no effect for a successful shard, but
for a failed shard would lead to that shard not being marked as failed during
snapshot finalization. Fixed by corectly keeping track of all previous completed
shard states as well in this case.
Also, added an assertion that without this fix would trip on almost every run of the
resiliency tests and adjusted the serialization of SnapshotsInProgress.Entry so
we have a proper assertion message.

Closes #47550
2019-10-05 15:01:06 +02:00
..
licenses Update joda to 2.10.3 (#45495) 2019-08-23 10:39:39 +02:00
src Fix Snapshot Corruption in Edge Case (#47552) (#47620) 2019-10-05 15:01:06 +02:00
build.gradle Remove eclipse conditionals (#44075) 2019-10-03 11:55:00 +03:00