From 04757e5864bd4904fd5a59d143fff480814700e4 Mon Sep 17 00:00:00 2001 From: Takanobu Asanuma Date: Mon, 28 May 2018 19:04:36 +0900 Subject: [PATCH] HDFS-13628. Update Archival Storage doc for Provided Storage Signed-off-by: Akira Ajisaka --- .../hadoop-hdfs/src/site/markdown/ArchivalStorage.md | 4 +++- 1 file changed, 3 insertions(+), 1 deletion(-) diff --git a/hadoop-hdfs-project/hadoop-hdfs/src/site/markdown/ArchivalStorage.md b/hadoop-hdfs-project/hadoop-hdfs/src/site/markdown/ArchivalStorage.md index ab7975af539..3c49cb1666c 100644 --- a/hadoop-hdfs-project/hadoop-hdfs/src/site/markdown/ArchivalStorage.md +++ b/hadoop-hdfs-project/hadoop-hdfs/src/site/markdown/ArchivalStorage.md @@ -35,7 +35,7 @@ A new storage type *ARCHIVE*, which has high storage density (petabyte of storag Another new storage type *RAM\_DISK* is added for supporting writing single replica files in memory. -### Storage Policies: Hot, Warm, Cold, All\_SSD, One\_SSD and Lazy\_Persist +### Storage Policies: Hot, Warm, Cold, All\_SSD, One\_SSD, Lazy\_Persist and Provided A new concept of storage policies is introduced in order to allow files to be stored in different storage types according to the storage policy. @@ -47,6 +47,7 @@ We have the following storage policies: * **All\_SSD** - for storing all replicas in SSD. * **One\_SSD** - for storing one of the replicas in SSD. The remaining replicas are stored in DISK. * **Lazy\_Persist** - for writing blocks with single replica in memory. The replica is first written in RAM\_DISK and then it is lazily persisted in DISK. +* **Provided** - for storing data outside HDFS. See also [HDFS Provided Storage](./HdfsProvidedStorage.html). More formally, a storage policy consists of the following fields: @@ -68,6 +69,7 @@ The following is a typical storage policy table. | 7 | Hot (default) | DISK: *n* | \ | ARCHIVE | | 5 | Warm | DISK: 1, ARCHIVE: *n*-1 | ARCHIVE, DISK | ARCHIVE, DISK | | 2 | Cold | ARCHIVE: *n* | \ | \ | +| 1 | Provided | PROVIDED: 1, DISK: *n*-1 | PROVIDED, DISK | PROVIDED, DISK | Note 1: The Lazy\_Persist policy is useful only for single replica blocks. For blocks with more than one replicas, all the replicas will be written to DISK since writing only one of the replicas to RAM\_DISK does not improve the overall performance.