a400f396a6
Contributed by Ranith Sardar. (cherry picked from commit 546c5d70efebb828389f609a89b123c4ee51f867) (cherry picked from commit c5eca3f7ee095d6a261eb411ad97aba654d67d13) Conflicts: hadoop-tools/hadoop-distcp/src/test/java/org/apache/hadoop/tools/util/TestDistCpUtils.java
DistCp (distributed copy) is a tool used for large inter/intra-cluster copying. It uses Map/Reduce to effect its distribution, error handling and recovery, and reporting. It expands a list of files and directories into input to map tasks, each of which will copy a partition of the files specified in the source list.