hadoop/hadoop-tools/hadoop-distcp
Colin Patrick Mccabe 932730df7d HADOOP-11785. Reduce the number of listStatus operation in distcp buildListing (Zoran Dimitrijevic via Colin P. McCabe) 2015-04-03 14:08:25 -07:00
src HADOOP-11785. Reduce the number of listStatus operation in distcp buildListing (Zoran Dimitrijevic via Colin P. McCabe) 2015-04-03 14:08:25 -07:00
README HADOOP-11437. Remove the version and author information from distcp's README file (Brahma Reddy Battula via aw) 2015-02-11 15:47:36 -08:00
pom.xml MAPREDUCE-5639. Port DistCp2 document to trunk (Akira AJISAKA via jeagles) 2014-04-25 15:23:42 +00:00

README

DistCp (distributed copy) is a tool for large inter-cluster and intra-cluster
copies. It uses MapReduce for its distribution, error handling and recovery,
and reporting. It expands a list of files and directories into input for map
tasks, each of which copies a partition of the files specified in the source list.
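
The idea of splitting an expanded file listing into per-map-task partitions
can be illustrated with a small sketch. This is not DistCp's actual code: the
function name partition_by_size, the greedy largest-first strategy, and the
file paths and sizes are all assumptions made for illustration. It only shows
how a list of (path, size) entries might be divided so that each map task
receives a similar number of bytes to copy.

```python
import heapq
from typing import List, Tuple


def partition_by_size(files: List[Tuple[str, int]], num_maps: int) -> List[List[str]]:
    """Greedily assign (path, size_in_bytes) entries to num_maps buckets
    so that the buckets end up with similar total sizes.

    Hypothetical helper for illustration; not part of DistCp.
    """
    if num_maps < 1:
        raise ValueError("num_maps must be at least 1")
    # Heap of (total_bytes, bucket_index); the lightest bucket is always on top.
    heap = [(0, i) for i in range(num_maps)]
    heapq.heapify(heap)
    buckets: List[List[str]] = [[] for _ in range(num_maps)]
    # Place the largest files first so that smaller files can even out the totals.
    for path, size in sorted(files, key=lambda f: f[1], reverse=True):
        total, idx = heapq.heappop(heap)
        buckets[idx].append(path)
        heapq.heappush(heap, (total + size, idx))
    return buckets


# Hypothetical expanded source listing: (path, size in bytes).
listing = [
    ("/src/a.log", 400),
    ("/src/b.log", 300),
    ("/src/dir/c.log", 200),
    ("/src/dir/d.log", 100),
]

for task_id, paths in enumerate(partition_by_size(listing, 2)):
    print(f"map task {task_id}: {paths}")
```

With the sample listing above, both partitions total 500 bytes. A real copy job
would also have to handle directories, failures, and retries, which this
sketch leaves out.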