Signed-off-by: Wei-Chiu Chuang <weichiu@apache.org> (cherry picked from commit e2cecb681e2aab8b7c5465719cac53dce407a64c) (cherry picked from commit a7dd244a4995cbf20feb054530fc5913ccf384ac) modified: hadoop-tools/hadoop-distcp/src/main/java/org/apache/hadoop/tools/mapred/CopyCommitter.java (cherry picked from commit 84726679f0441bfa0788ad5a5ac2bfa4353afedb)
DistCp (distributed copy) is a tool used for large inter/intra-cluster copying. It uses Map/Reduce to effect its distribution, error handling and recovery, and reporting. It expands a list of files and directories into input to map tasks, each of which will copy a partition of the files specified in the source list.