Steve Loughran
31755c91ff
Revert "HADOOP-8143. Change distcp to have -pb on by default."
...
This reverts commit dd65eea74b
.
Change-Id: I74180cf59d5bbad8c9f66cb331535addcbea863e
2020-05-15 13:49:25 +01:00
Steve Loughran
eebd96636e
Revert "HADOOP-14557. Document HADOOP-8143 (Change distcp to have -pb on by default)."
...
This reverts commit 44350fdf49
.
It is related to the rollback of HADOOP-8143.
Change-Id: If48e3dd670c920ada702dc36461ff398fe9d35cc
2020-05-15 13:49:13 +01:00
Gabor Bota
ec6d2a8402
Preparing for 3.1.5 development
...
Change-Id: Iabc64aba7392e3b6f9e4e18109fcaa2cfc01d1f9
2020-04-29 11:18:18 +02:00
Steve Loughran
caec6a1945
HADOOP-16775. DistCp reuses the same temp file within the task for different files.
...
Contributed by Amir Shenavandeh.
This avoids overwrite consistency issues with S3 and other stores
Change-Id: Ic4d05ef3397e963ba28fd9f775bb362b0da36ad9
2020-03-13 19:34:50 +00:00
Mukund Thakur
ce6614565e
HDFS-13660. DistCp job fails when new data is appended in the file while the DistCp copy job is running
...
This uses the length of the file known at the start of the copy to determine the amount of data to copy.
* If a file is appended to during the copy, the original bytes are copied.
* If a file is truncated during a copy, or the attempt to read the data fails with a truncated stream,
distcp will now fail. Until now these failures were not detected.
Contributed by Mukund Thakur.
Change-Id: I576a49d951fa48d37a45a7e4c82c47488aa8e884
(cherry picked from commit 51c64b357d
)
2020-02-27 16:37:36 -08:00
Akira Ajisaka
2c84ea96e3
HADOOP-16808. Use forkCount and reuseForks parameters instead of forkMode in the config of maven surefire plugin. Contributed by Xieming Li.
...
(cherry picked from commit f6d20daf40
)
2020-01-21 18:05:13 +09:00
Ranith Sardar
a400f396a6
HADOOP-16032. Distcp It should clear sub directory ACL before applying new ACL on.
...
Contributed by Ranith Sardar.
(cherry picked from commit 546c5d70ef
)
(cherry picked from commit c5eca3f7ee
)
Conflicts:
hadoop-tools/hadoop-distcp/src/test/java/org/apache/hadoop/tools/util/TestDistCpUtils.java
2019-10-01 16:34:00 -07:00
Zhankun Tang
ef79d98788
Preparing for 3.1.4 development
2019-09-04 16:11:36 +08:00
KAI XIE
c1a2b29c0f
HADOOP-16158. DistCp to support checksum validation when copy blocks in parallel ( #919 )
...
* DistCp to support checksum validation when copy blocks in parallel
* address review comments
* add checksums comparison test for combine mode
(cherry picked from commit c765584eb2
)
(cherry picked from commit b3c14d4132
)
Conflicts:
hadoop-tools/hadoop-distcp/src/test/java/org/apache/hadoop/tools/util/TestDistCpUtils.java
2019-08-18 18:55:59 -07:00
Ayush Saxena
befafe9408
HADOOP-16440. Distcp can not preserve timestamp with -delete option. Contributed by ludun.
2019-07-20 13:43:38 +05:30
Takanobu Asanuma
2f6f42bc7b
HDFS-12564. Add the documents of swebhdfs configurations on the client side. Contributed by Takanobu Asanuma.
...
Signed-off-by: Wei-Chiu Chuang <weichiu@apache.org>
(cherry picked from commit 98d2065643
)
2019-06-20 20:18:00 -07:00
Takanobu Asanuma
8098ddaf40
HADOOP-16331. Fix ASF License check in pom.xml. Contributed by Akira Ajisaka.
...
Signed-off-by: Takanobu Asanuma <tasanuma@apache.org>
2019-05-29 17:39:49 +09:00
Akira Ajisaka
f8bd5deec1
HADOOP-16323. https everywhere in Maven settings.
2019-05-27 15:28:21 +09:00
Masatake Iwasaki
fd90af892c
HADOOP-14544. DistCp documentation for command line options is misaligned. Contributed by Masatake Iwasaki.
...
(cherry picked from commit bbdbc7a9a1
)
2019-04-12 12:03:11 +09:00
Eric E Payne
49d54633e0
HADOOP-15281. Distcp to add no-rename copy option.
...
Contributed by Andrew Olson.
2019-02-07 23:15:18 +00:00
Eric E Payne
352ebc6ed9
Revert "HADOOP-15281. Distcp to add no-rename copy option."
...
Revert "HADOOP-15281. Distcp to add no-rename copy option. Contributed by Andrew Olson."
This reverts commit d2765ffc2e
.
2019-02-07 20:18:32 +00:00
Andrew Olson
d2765ffc2e
HADOOP-15281. Distcp to add no-rename copy option.
...
Contributed by Andrew Olson.
(cherry picked from commit de804e53b9
)
2019-02-07 10:09:55 +00:00
Kai Xie
fee33057f0
HADOOP-16018. DistCp won't reassemble chunks when blocks per chunk > 0.
...
Contributed by Kai Xie.
(cherry picked from commit 188bebbe7e
)
2019-01-08 13:35:17 +00:00
Wangda Tan
31ea2f7806
Preparing for 3.1.3 development
...
Change-Id: I3c3d3ee47dc4fef239127b4452ff14676fa26e3d
2019-01-07 10:04:58 -08:00
Arpit Agarwal
3e57adee0a
HADOOP-12558. distcp documentation is woefully out of date. Contributed by Dinesh Chitlangia.
...
(cherry picked from commit 914b0cf15f
)
2018-11-15 13:58:38 -08:00
Ted Yu
84726679f0
HADOOP-15850. CopyCommitter#concatFileChunks should check that the blocks per chunk is not 0. Contributed by Ted Yu.
...
Signed-off-by: Wei-Chiu Chuang <weichiu@apache.org>
(cherry picked from commit e2cecb681e
)
(cherry picked from commit a7dd244a49
)
2018-10-19 13:22:31 -07:00
Wangda Tan
5583711419
Preparing for 3.1.2 release
...
Change-Id: If2793e2ed2b5b349a9e1f98f78df43f309dcfcbd
2018-07-31 13:08:55 -07:00
Steve Loughran
d54241e9c9
HADOOP-15384. distcp numListstatusThreads option doesn't get to -delete scan.
...
Contributed by Steve Loughran.
(cherry picked from commit ca8b80bf59
)
2018-07-10 10:50:40 +01:00
Xiao Chen
805e33b62c
HDFS-13056. Expose file-level composite CRCs in HDFS which are comparable across different instances/layouts. Contributed by Dennis Huo.
2018-04-16 11:58:10 -07:00
Wangda Tan
25bf1d3ac6
Updated branch-3.1 to track 3.1.1
...
Change-Id: Idd55c79921b8acacc8a8a3258450e476f3abb706
2018-04-09 10:58:50 -07:00
Vinod Kumar Vavilapalli (I am also known as @tshooter.)
0bd99621c5
Preparing for 3.1.0 development
2018-03-19 15:16:54 -07:00
Steve Loughran
1976e0066e
HADOOP-15209. DistCp to eliminate needless deletion of files under already-deleted directories.
...
Contributed by Steve Loughran.
2018-03-15 18:05:14 +00:00
Chris Douglas
45cccadd2e
HDFS-12780. Fix spelling mistake in DistCpUtils.java. Contributed by Jianfei Jiang
2018-03-13 11:08:11 -07:00
Steve Loughran
7ef4d942dd
HADOOP-15273.distcp can't handle remote stores with different checksum algorithms.
...
Contributed by Steve Loughran.
2018-03-08 11:24:06 +00:00
Steve Loughran
3bd6b1fd85
HADOOP-15292. Distcp's use of pread is slowing it down.
...
Contributed by Virajith Jalaparti.
2018-03-08 11:15:46 +00:00
fang zhenyi
4d4dde5112
HADOOP-15223. Replace Collections.EMPTY* with empty* when available
...
Signed-off-by: Akira Ajisaka <aajisaka@apache.org>
2018-02-18 22:19:39 +09:00
Wangda Tan
60f9e60b3b
Preparing for 3.2.0 development
...
Change-Id: I6d0e01f3d665d26573ef2b957add1cf0cddf7938
2018-02-11 11:17:38 +08:00
Anu Engineer
4304fcd5bd
HDFS-12990. Change default NameNode RPC port back to 8020. Contributed by Xiao Chen.
2018-02-06 13:43:45 -08:00
Arpit Agarwal
d4e13a4647
HADOOP-15198. Correct the spelling in CopyFilter.java. Contributed by Mukul Kumar Singh.
2018-02-02 11:37:51 -08:00
Surendra Singh Lilhore
00129c5314
HDFS-12833. Distcp : Update the usage of delete option for dependency with update and overwrite option. Contributed by usharani.
2017-12-12 00:28:02 +05:30
Akira Ajisaka
cc3f3eca40
MAPREDUCE-6999. Fix typo onf in DynamicInputChunk.java. Contributed by fang zhenyi.
2017-11-02 18:32:24 +09:00
Steve Loughran
f36cbc8475
HADOOP-14942. DistCp#cleanup() should check whether jobFS is null.
...
Contributed by Andras Bokor.
2017-10-20 22:27:04 +01:00
ChenSammi
e0b3c644e1
HDFS-12414. Ensure to use CLI command to enable/disable erasure coding policy. Contributed by Sammi Chen
2017-09-14 09:15:29 +08:00
Xiaoyu Yao
63720ef574
HADOOP-14839. DistCp log output should contain copied and deleted files and directories. Contributed by Yiqun Lin.
2017-09-05 23:34:55 -07:00
Andrew Wang
0d419c984f
Preparing for 3.1.0 development
2017-09-01 11:53:48 -07:00
Andrew Wang
f29a0fc288
HDFS-12303. Change default EC cell size to 1MB for better performance. Contributed by Wei Zhou.
2017-08-25 14:14:23 -07:00
Andrew Wang
dd7916d3cd
HDFS-12250. Reduce usage of FsPermissionExtension in unit tests. Contributed by Chris Douglas.
2017-08-17 09:35:36 -07:00
Sean Mackrory
1a1bf6b7d0
HADOOP-13595. Rework hadoop_usage to be broken up by clients/daemons/etc. Contributed by Allen Wittenauer.
2017-08-02 12:25:05 -06:00
Wei-Chiu Chuang
44350fdf49
HADOOP-14557. Document HADOOP-8143 (Change distcp to have -pb on by default). Contributed by Bharat Viswanadham.
2017-07-20 18:23:13 -07:00
Andrew Wang
af2773f609
Updating version for 3.0.0-beta1 development
2017-06-29 17:57:40 -07:00
Jason Lowe
dd65eea74b
HADOOP-8143. Change distcp to have -pb on by default. Contributed by Mithun Radhakrishnan
2017-06-20 09:53:47 -05:00
Andrew Wang
16ad896d5c
Update maven version for 3.0.0-alpha4 development
2017-05-26 14:09:44 -07:00
Sunil G
b6f66b0da1
YARN-6584. Correct license headers in hadoop-common, hdfs, yarn and mapreduce. Contributed by Yeliang Cang.
2017-05-22 14:10:06 +05:30
Yongjun Zhang
b4adc8392c
HADOOP-14407. DistCp - Introduce a configurable copy buffer size. (Omkar Aradhya K S via Yongjun Zhang)
2017-05-18 15:35:22 -07:00
Mingliang Liu
26172a94d6
HADOOP-14267. Make DistCpOptions immutable. Contributed by Mingliang Liu
2017-03-31 20:04:26 -07:00