843 Commits

Author SHA1 Message Date
Weiwei Yang
afe2b98daf HADOOP-16306. AliyunOSS: Remove temporary files when upload small files to OSS. Contributed by wujinhu.
(cherry picked from commit 2d8282bb8248e6984878626c4cdc7148aa2e7202)
2019-05-14 14:08:36 -07:00
Akira Ajisaka
fbbb0e9919 HADOOP-16227. Upgrade checkstyle to 8.19
(cherry picked from commit 4b4fef2f0e0ed1e185ea1058db7a65d68d4970b9)
(cherry picked from commit b45813b25915bec19ff3e31add56c6fef7cf6e15)
2019-04-15 10:59:03 +09:00
Masatake Iwasaki
7985d9b1ce HADOOP-14544. DistCp documentation for command line options is misaligned. Contributed by Masatake Iwasaki.
(cherry picked from commit bbdbc7a9a158f36955c2253acb0edb14219ccb04)

Conflicts:
	hadoop-tools/hadoop-distcp/src/site/markdown/DistCp.md.vm
2019-04-12 12:18:19 +09:00
Steve Loughran
bdf8be6374 HADOOP-16233. S3AFileStatus to declare that isEncrypted() is always true (#685)
This is needed to fix up some confusion about caching of job.addCache() handling of S3A paths; all parent dirs -the files are downloaded by the NM without using the DTs of the user submitting the job. This means that when you submit jobs to an EC2 cluster with lower IAM permissions than the user, cached resources don't get downloaded and the job doesn't start.

Production code changes:
* S3AFileStatus Adds "true" to the superclass's encrypted flag during construction.

Tests
* Base AbstractContractOpenTest can control whether zero byte files created in tests are encrypted. Not done via an XML attribute, just a subclass point. Thoughts?
* Verify that the filecache considers paths to not have the permissions which trigger reduce-privilege downloads
* And extend ITestDelegatedMRJob to test a completely different bucket (open street map), to verify that cached resources do get their tokens picked up

Docs:
* Advise FS developers to say all files are encrypted. It's otherwise harmless and it'll stop other people seeing impossible to debug error messages on app launch.

Contributed by Steve Loughran.

Change-Id: Ifaae4c9d735ccc5eafeebd2584b65daf2d4e5da3
2019-04-03 22:40:09 +01:00
Akira Ajisaka
6f0896e087 HADOOP-16232. Fix errors in the checkstyle configuration xmls. Contributed by Wanqiang Ji.
(cherry picked from commit 8b6deebb1dda49e5e35180ed5c5fb5b5221c1516)
(cherry picked from commit 875435dc7d6a7cb7925a25ca3347a7ebaabec595)
2019-04-03 19:40:52 +09:00
Weiwei Yang
9b1f13fbd2 HADOOP-16191. AliyunOSS: improvements for copyFile/copyDirectory and logging. Contributed by wujinhu.
(cherry picked from commit 568d3ab8b65d1348dec9c971feffe200e6cba2ef)
2019-03-19 10:13:57 +08:00
Steve Loughran
7f70a78ce3 HADOOP-16109. Parquet reading S3AFileSystem causes EOF (#589)
Nobody gets seek right. No matter how many times they think they have.

Reproducible test from: Dave Christianson
Fixed seek() logic: Steve Loughran

Change-Id: I39b87f3d5daa98f65de2c0a44e348821a4930573
2019-03-12 11:36:32 +00:00
Kai Xie
a959aa61b4 HADOOP-16018. DistCp won't reassemble chunks when blocks per chunk > 0.
Contributed by Kai Xie.

(cherry picked from commit 188bebbe7eaec71433548dc47f5a9a995cbea15a)
2019-01-08 13:35:41 +00:00
Weiwei Yang
ad5dd32843 HADOOP-16030. AliyunOSS: bring fixes back from HADOOP-15671. Contributed by wujinhu.
(cherry picked from commit f87b3b11c46704dcdb63089dd971e2a5ba1deaac)
2019-01-07 17:03:22 +08:00
Sunil G
a450d58b49 Revert "HADOOP-15759. AliyunOSS: Update oss-sdk version to 3.0.0. Contributed by Jinhu Wu."
This reverts commit e4fca6aae46a3c04fc56897986a4ab4e5aa98503.

Revert "HADOOP-15671. AliyunOSS: Support Assume Roles in AliyunOSS. Contributed by Jinhu Wu."

This reverts commit 2b635125fb059fc204ed35bc0e264c42dd3a9fe9.

(cherry picked from commit 1f425271a73fff1fdbe3fbfdb71e906fd1ac0574)
2019-01-05 17:50:37 +09:00
Weiwei Yang
f17ef6f451 HADOOP-15323. AliyunOSS: Improve copy file performance for AliyunOSSFileSystemStore. Contributed by wujinhu.
(cherry picked from commit 040a202b202a37f3b922cd321eb0a8ded457d88b)
2019-01-03 22:04:47 +08:00
Haibo Chen
0683468d51 YARN-7794. SLSRunner is not loading timeline service jars, causing failure. (Yufei Gu via Haibo Chen)
(cherry picked from commit fcea5a4d798a517443afd0b441883ccc4e9136ba)
(cherry picked from commit e00e08365e02e77a1d9ae3cc16e65ddb543b5b20)
2018-11-30 17:00:13 -08:00
Steve Loughran
7dac294110 HADOOP-15932. Oozie unable to create sharelib in s3a filesystem.
Contributed by Steve Loughran.

(cherry picked from commit 4c106fca0ca91536e288f11052568406a0b84300)
2018-11-27 20:57:16 +00:00
Weiwei Yang
8d930ae344 HADOOP-15943. AliyunOSS: add missing owner & group attributes for oss FileStatus. Contributed by wujinhu.
(cherry picked from commit 5ff0cf86a940fd83f1425794921cc075b19f1108)
2018-11-23 14:39:23 +08:00
Weiwei Yang
8a67f63664 HADOOP-15919. AliyunOSS: Enable Yarn to use OSS. Contributed by wujinhu.
(cherry picked from commit be0708c6eb3c759e43639e0f8df7c9910308fb97)
2018-11-19 14:50:06 +08:00
Sammi Chen
64cb97fb44 HADOOP-15917. AliyunOSS: fix incorrect ReadOps and WriteOps in statistics. Contributed by Jinhu Wu.
(cherry picked from commit 3fade865ce84dcf68bcd7de5a5ed1c7d904796e9)
2018-11-14 13:09:11 +08:00
Sammi Chen
c5a227062f HADOOP-15868. AliyunOSS: update document for properties of multiple part download, multiple part upload and directory copy. Contributed by Jinhu Wu.
(cherry picked from commit 7574d18538e838f40581519080d7c8621c65e53b)
(cherry picked from commit 366541d834f70fd6f8d4c5296a9e844236c6fd74)
2018-10-26 15:30:06 +08:00
Wei-Chiu Chuang
8360d15abc HADOOP-15850. CopyCommitter#concatFileChunks should check that the blocks per chunk is not 0. Contributed by Ted Yu.
Signed-off-by: Wei-Chiu Chuang <weichiu@apache.org>
(cherry picked from commit e2cecb681e2aab8b7c5465719cac53dce407a64c)
(cherry picked from commit a7dd244a4995cbf20feb054530fc5913ccf384ac)

	modified:   hadoop-tools/hadoop-distcp/src/main/java/org/apache/hadoop/tools/mapred/CopyCommitter.java
(cherry picked from commit 84726679f0441bfa0788ad5a5ac2bfa4353afedb)
2018-10-19 13:56:23 -07:00
Wei-Chiu Chuang
8754c0b65e Revert "HADOOP-15850. CopyCommitter#concatFileChunks should check that the blocks per chunk is not 0. Contributed by Ted Yu."
This reverts commit 437c5187cd82f45ab709f9b87dba75d2911f20e4.
2018-10-19 13:33:39 -07:00
Ted Yu
437c5187cd HADOOP-15850. CopyCommitter#concatFileChunks should check that the blocks per chunk is not 0. Contributed by Ted Yu.
Signed-off-by: Wei-Chiu Chuang <weichiu@apache.org>
(cherry picked from commit e2cecb681e2aab8b7c5465719cac53dce407a64c)
(cherry picked from commit a7dd244a4995cbf20feb054530fc5913ccf384ac)
(cherry picked from commit 84726679f0441bfa0788ad5a5ac2bfa4353afedb)
2018-10-19 13:23:02 -07:00
Steve Loughran
2abc4ab60a HADOOP-15837. DynamoDB table Update can fail S3A FS init.
Contributed by Steve Loughran.

(cherry picked from commit f1fbc563719db88c50d57820b5052880cfd6b1aa)
2018-10-12 09:49:02 +01:00
Sammi Chen
85e00477b8 HADOOP-15671. AliyunOSS: Support Assume Roles in AliyunOSS. Contributed by Jinhu Wu.
(cherry picked from commit 2b635125fb059fc204ed35bc0e264c42dd3a9fe9)
(cherry picked from commit 5da3e8359757c0c1afaccc1d3a0f2bdc453e0311)
2018-09-25 19:53:03 +08:00
Sammi Chen
f9aedf32ec HADOOP-15607. AliyunOSS: fix duplicated partNumber issue in AliyunOSSBlockOutputStream. Contributed by Jinhu Wu.
(cherry picked from commit 0857f116b754d83d3c540cd6f989087af24fef27)
2018-07-30 10:56:31 +08:00
bibinchundatt
14fd114cd5 YARN-8577. Fix the broken anchor in SLS site-doc. Contributed by Weiwei Yang.
(cherry picked from commit 3d3158cea4580eb2e3b2af635c3a7d30f4dbb873)
2018-07-25 19:01:40 +05:30
Chris Douglas
d1dcc39222 HADOOP-15533. Make WASB listStatus messages consistent. Contributed by Esfandiar Manii
(cherry picked from commit f34744603ee93e082e7ba148df1400af5ac7c30c)
(cherry picked from commit 7e655b642d0ed5b85e0e8c212e004d067318ed4c)
2018-06-17 23:13:25 -07:00
Inigo Goiri
fa9d4a7493 YARN-8422. TestAMSimulator failing with NPE. Contributed by Giovanni Matteo Fumarola.
(cherry picked from commit c35481594ffc372e3f846b0c8ebc2ff9e36ffdb0)
2018-06-12 11:01:22 -07:00
Steve Loughran
df338f2e1a HADOOP-15506. Upgrade Azure Storage Sdk version to 7.0.0 and update corresponding code blocks.
Contributed by Esfandiar Manii.

(cherry picked from commit d901be679554eb6b323f3bc6e8de267d85dd2e06)
(cherry picked from commit baac7c2b285454d71d0371505fb7a3403a548176)
2018-06-11 15:33:36 -07:00
Yongjun Zhang
f0de11ba98 Preparing for 3.0.4 development
2018-05-29 23:40:26 -07:00
Inigo Goiri
595b44e2dc HADOOP-15498. TestHadoopArchiveLogs (#testGenerateScript, #testPrepareWorkingDir) fails on Windows. Contributed by Anbang Hu.
2018-05-28 16:47:43 -07:00
Sammi Chen
ce7ebbe2cc HADOOP-14999. AliyunOSS: provide one asynchronous multi-part based uploading mechanism. Contributed by Genmao Yu.
(cherry picked from commit 6542d17ea460ec222137c4b275b13daf15d3fca3)
(cherry picked from commit e96c7bf82de1e9fd97df5fb6b763e211ebad5913)
2018-04-13 10:17:46 +08:00
Lei Xu
3717df89ee Preparing for 3.0.3 development
2018-04-12 13:57:46 -07:00
Steve Loughran
a3945e4cf1 HADOOP-14651. Update okhttp version to 2.7.5. Contributed by Ray Chiang and John Zhuge.
(cherry picked from commit 8aca46e53f5962217851dc2103211f99a4ef7258)
2018-04-02 16:38:19 +01:00
Sean Mackrory
ca92d1810b HADOOP-15342. Updating ADLS connector to use the current SDK version (2.2.7). Contributed by Atul Sikaria.
(cherry picked from commit 9d7a9031a5978efc8d97566e35ebaace20db2353)
2018-03-29 14:05:42 -06:00
Sammi Chen
7985c5fdc9 HADOOP-15262. AliyunOSS: move files under a directory in parallel when rename a directory. Contributed by Jinhu Wu.
(cherry picked from commit d67a5e2dec5c60d96b0c216182891cdfd7832ac5)
(cherry picked from commit 2285afb32e71622b3dab5051247a1d099cfcbe85)
(cherry picked from commit 322520eb76cdcef25190495ccf98b3ca39907f58)
2018-03-19 15:44:15 +08:00
Chris Douglas
a03c8ea61f HADOOP-14696. parallel tests don't work for Windows. Contributed by Allen Wittenauer
(cherry picked from commit 45d1b0fdcc04a86be91a9b72073cdc30bec04d3b)
(cherry picked from commit 09940b1eb3b7ed764149f4a993c1857e9c6ad938)
2018-03-12 20:06:22 -07:00
Steve Loughran
1771af2320 HADOOP-15273. distcp can't handle remote stores with different checksum algorithms.
Contributed by Steve Loughran.

(cherry picked from commit 7ef4d942dd96232b0743a40ed25f77065254f94d)
2018-03-09 10:49:11 +00:00
Steve Loughran
cc0f14c13c HADOOP-15267. S3A multipart upload fails when SSE-C encryption is enabled.
Contributed by Anis Elleuch.

(cherry picked from commit 1dedc68f9d8d8544d715e67ee77cd3f017c21699)
2018-03-07 19:20:53 +00:00
Gergely Novák
8b9a7b2e48 MAPREDUCE-7023. TestHadoopArchiveLogs.testCheckFilesAndSeedApps fails on rerun
Signed-off-by: Akira Ajisaka <aajisaka@apache.org>
(cherry picked from commit e99a1e77671fa0e74a652bc2221bbe01d1a579d3)
2018-03-02 16:22:06 -08:00
Steve Loughran
3c03672e87 HADOOP-15263. hadoop cloud-storage module to mark hadoop-common as provided; add azure-datalake.
Contributed by Steve Loughran

(cherry picked from commit edc9f1451b4af79069cd4b1d608b0fa68648ee72)
(cherry picked from commit abfe1eaf29a31d96f63022a59e29392b773e6bc1)
2018-02-28 15:33:21 +00:00
Steve Loughran
c2bbe22c5a HADOOP-10571. Use Log.*(Object, Throwable) overload to log exceptions.
Contributed by Andras Bokor.
2018-02-21 11:11:28 +00:00
Steve Loughran
b302f728f8 HADOOP-13972. ADLS to support per-store configuration.
Contributed by Sharad Sonker.

(cherry picked from commit 050f5287b79324b7f6231b879c0bfc608203b980)
2018-02-15 16:29:30 +00:00
Steve Loughran
1888b94806 HADOOP-15090. Add ADL troubleshooting doc.
Contributed by Steve Loughran.

(cherry picked from commit 58a2120e8a31307f19551f87be4e81d4fb626de1)
2018-02-15 14:26:18 +00:00
Anu Engineer
2d6ab20007 HDFS-12990. Change default NameNode RPC port back to 8020. Contributed by Xiao Chen.
(cherry picked from commit 4304fcd5bdf9fb7aa9181e866eea383f89bf171f)

Conflicts:
	hadoop-hdfs-project/hadoop-hdfs/src/site/markdown/ViewFs.md
	hadoop-hdfs-project/hadoop-hdfs/src/test/java/org/apache/hadoop/hdfs/tools/TestGetConf.java

(cherry picked from commit 9264f10bb35dbe30c75c648bf759e8aeb715834a)
2018-02-07 11:16:53 -08:00
Lei Xu
7b377f43d0 Preparing for 3.0.2 development
2018-02-06 10:50:37 -08:00
Sammi Chen
91184299c5 HADOOP-15027. AliyunOSS: Support multi-thread pre-read to improve sequential read from Hadoop to Aliyun OSS performance. (Contributed by Jinhu Wu)
(cherry picked from commit 9195a6e302028ed3921d1016ac2fa5754f06ebf0)
2018-01-30 15:21:27 +08:00
Steve Loughran
95a96b13e2 HADOOP-15186. Allow Azure Data Lake SDK dependency version to be set on the command line.
Contributed by Vishwajeet Dusane.
2018-01-29 09:46:53 -08:00
Steve Loughran
ec4f5f0f0d HADOOP-15039/HADOOP-15189. Move SemaphoredDelegatingExecutor to hadoop-common
Contributed by Genmao Yu
2018-01-26 15:23:08 -08:00
Jason Lowe
217f408aaa YARN-7102. NM heartbeat stuck when responseId overflows MAX_INT. Contributed by Botong Huang
(cherry picked from commit ff8378eb1b960c72d18a984c7e5d145b407ca11a)
2018-01-25 17:51:51 -06:00
Steve Loughran
1de0712578 HADOOP-15185. Update adls connector to use the current version of ADLS SDK.
Contributed by Atul Sikaria.

(cherry picked from commit f63d13f10db708228482b53883863c873c89f69c)
2018-01-23 11:37:33 -08:00
Jason Lowe
b1cc14e9cb Revert "HADOOP-15027. AliyunOSS: Support multi-thread pre-read to improve sequential read from Hadoop to Aliyun OSS performance. (Contributed by Jinhu Wu)"
This reverts commit 55142849db02a9191db0dd6f4e1401ff19ec242a.
2018-01-17 09:53:24 -06:00