HADOOP-12558. distcp documentation is woefully out of date. Contributed by Dinesh Chitlangia.

(cherry picked from commit 914b0cf15f)
This commit is contained in:
Arpit Agarwal 2018-11-15 13:58:13 -08:00
parent ba75aeec28
commit 3e57adee0a
1 changed files with 1 additions and 0 deletions

View File

@ -240,6 +240,7 @@ Flag | Description | Notes
`-skipcrccheck` | Whether to skip CRC checks between source and target paths. |
`-blocksperchunk <blocksperchunk>` | Number of blocks per chunk. When specified, split files into chunks to copy in parallel | If set to a positive value, files with more blocks than this value will be split into chunks of `<blocksperchunk>` blocks to be transferred in parallel, and reassembled on the destination. By default, `<blocksperchunk>` is 0 and the files will be transmitted in their entirety without splitting. This switch is only applicable when the source file system implements getBlockLocations method and the target file system implements concat method. |
`-copybuffersize <copybuffersize>` | Size of the copy buffer to use. By default, `<copybuffersize>` is set to 8192B |
`-xtrack <path>` | Save information about missing source files to the specified path. | This option is only valid with `-update` option. This is an experimental property and it cannot be used with `-atomic` option.
Architecture of DistCp
----------------------