19030 Commits

Author SHA1 Message Date
Akira Ajisaka
f8bd5deec1
HADOOP-16323. https everywhere in Maven settings. 2019-05-27 15:28:21 +09:00
Erik Krogen
9295e95174 HDFS-14500. NameNode StartupProgress should not allow new steps in an already-completed phase. Contributed by Erik Krogen.
(cherry-picked from 55e0c134f002f74cb4a0360b6682a1b6796d1598)
(cherry-picked from c26cf22d6b3949a4798d8133669b451660a0f050)
2019-05-24 13:19:39 -07:00
Konstantin V Shvachko
799ca572f7 HDFS-14502. keepResults option in NNThroughputBenchmark should call saveNamespace(). Contributed by Konstantin V Shvachko. 2019-05-24 12:35:32 -07:00
Shweta
a6ff20d25d HDFS-14451. Incorrect header or version mismatch log message. Contributed by Shweta.
Signed-off-by: Wei-Chiu Chuang <weichiu@apache.org>
(cherry picked from commit 93d7630062247793a4860704fad4a31437693de1)
(cherry picked from commit 7a5d9ab9eed194fabe609def69efe41bea4ed1f6)
2019-05-24 18:43:49 +02:00
Wanqiang Ji
6690955358
MAPREDUCE-7205. Treat container scheduler kill exit code as a task attempt killing event.
This closes #821

Signed-off-by: Akira Ajisaka <aajisaka@apache.org>
(cherry picked from commit 67f9a7b165edecbec7c8063758202be4d8cff0f5)
2019-05-22 14:41:20 +09:00
Siyao Meng
9196c84c9f HDFS-14507. Document -blockingDecommission option for hdfs dfsadmin -listOpenFiles
Signed-off-by: Takanobu Asanuma <tasanuma@apache.org>
(cherry picked from commit 9dff6eff819a832a5918d786675f42aa243a13e8)
2019-05-22 13:25:42 +09:00
Srinivasu Majeti
e599e3b546 HDFS-14323. Distcp fails in Hadoop 3.x when 2.x source webhdfs url has special characters in hdfs file path. Contributed by Srinivasu Majeti.
Signed-off-by: Wei-Chiu Chuang <weichiu@apache.org>
(cherry picked from commit 3e5e5b028ad7e199d08e524fe7cddeee5db51a6d)
(cherry picked from commit 85f3664bb84f393f8006d3ab781d8ce5ebb1ab62)
2019-05-17 19:22:03 +02:00
Alexis Daboville
55cc35c0e1
HADOOP-16248. MutableQuantiles leak memory under heavy load.
Contributed by Alexis Daboville,

(cherry picked from commit 4cb3da6ac703b10de59a8ca371fe957a4a338601)
2019-05-17 15:17:55 +01:00
David Mollitor
cf0d5a0e6e
HADOOP-16307. Intern User Name and Group Name in FileStatus.
Author:    David Mollitor

(cherry picked from commit 2713dcf6e9ef308ffe6102532c90b27c52d27f7c)
2019-05-16 16:03:34 +02:00
Bharat Viswanadham
d4b4d90a52
HADOOP-16247. NPE in FsUrlConnection. Contributed by Karthik Palanisamy.
(cherry picked from commit d4c8858586eeed2820f3ab21da79603b52c64594)
2019-05-15 17:50:56 -07:00
Weiwei Yang
82430eec5a HADOOP-16306. AliyunOSS: Remove temporary files when upload small files to OSS. Contributed by wujinhu.
(cherry picked from commit 2d8282bb8248e6984878626c4cdc7148aa2e7202)
2019-05-14 14:07:40 -07:00
Sunil G
f09befd2ea YARN-9519. TFile log aggregation file format is not working for yarn.log-aggregation.TFile.remote-app-log-dir config. Contributed by Adam Antal.
(cherry picked from commit 7d831eca645f93d064975ebae35a7cbea3bbad31)
2019-05-14 10:50:02 -07:00
Sunil G
8b306e34e0 YARN-9504. [UI2] Fair scheduler queue view page does not show actual capacity. Contributed by Zoltan Siegl.
(cherry picked from commit 64c7f36ab12faba26a2ecf1e6a0cab7eb2bcffdd)
2019-05-10 14:28:08 +05:30
Rajat Khandelwal
8e0fdaeb38
HADOOP-16278. With S3A Filesystem, Long Running services End up Doing lot of GC and eventually die.
Contributed by Rajat Khandelwal

(cherry picked from commit 591ca698230f25217c10c7549aff8097baa11f1e)
2019-05-09 21:15:09 +01:00
Eric Yang
bf013aa06e YARN-8622. Fixed container-executor compilation on MacOSX.
Contributed by Siyao Meng

(cherry picked from commit ef97a20831677c055aa6bff6ad0649cbb3a56a86)
2019-05-09 14:55:38 -04:00
Siyao Meng
8414353daa HADOOP-16289. Allow extra jsvc startup option in hadoop_start_secure_daemon in hadoop-functions.sh. Contributed by Siyao Meng.
Signed-off-by: Wei-Chiu Chuang <weichiu@apache.org>
(cherry picked from commit 93f2283a69ea4e07a998f2a4065f238f9574921b)
(cherry picked from commit 095815220df582a04f09f8b645f9f01128168bf9)
2019-05-06 15:48:14 -07:00
Haibo Chen
ea1f0f282b YARN-9529. Log correct cpu controller path on error while initializing CGroups. (Contributed by Jonathan Hung)
(cherry picked from commit 597fa47ad125c0871f5c4deb3a883e5b3341c67b)
(cherry picked from commit c6573562cbc43832e9332989996e5d07b8bce9b0)
2019-05-06 11:59:20 -07:00
Eric E Payne
41ffaea342 YARN-9285: RM UI progress column is of wrong type. Contributed by Ahmed Hussein.
(cherry picked from commit b094b94d43a46af9ddb910da24f792b95f614b08)
2019-05-02 19:57:44 +00:00
Siyao Meng
86fb5e1c8e HDFS-14463. Add Log Level link under NameNode and DataNode Web UI Utilities dropdown. Contributed by Siyao Meng.
Signed-off-by: Wei-Chiu Chuang <weichiu@apache.org>
(cherry picked from commit 7cb46f035a92056783bad23a9abc6a264d71285d)
(cherry picked from commit 0550aa32f1a4b1809c234f8ed785db08b2e33d13)
2019-05-01 16:49:36 -07:00
Arpit Agarwal
7b51edbab9
HDFS-13677. Dynamic refresh Disk configuration results in overwriting VolumeMap. Contributed by xuzq.
(cherry picked from commit 4b4200f1f87ad40d9c19ba160f706ffd0470a8d4)
2019-04-29 14:54:11 -07:00
Weiwei Yang
94a895b94f YARN-9307. node_partitions constraint does not work. Contributed by kyungwan nam. 2019-04-26 13:16:43 +08:00
Weiwei Yang
d242b166ed YARN-9325. TestQueueManagementDynamicEditPolicy fails intermittent. Contributed by Prabhu Joseph.
(cherry picked from commit 1c8046d67ec10710e7749ed1929b09fac4b1ba94)
2019-04-23 14:25:33 +08:00
Erik Krogen
9da70a1b4b HADOOP-16265. Fix bug causing Configuration#getTimeDuration to use incorrect units when the default value is used. Contributed by starphin.
(cherry-picked from 1ddb48872f6a4985f4d0baadbb183899226cff68)
(cherry-picked from 28ff96f367ab2260666c2177980bce7c66e6dc57)
2019-04-22 08:19:25 -07:00
Eric Yang
8b228a42e9 YARN-8587. Added retries for fetching docker exit code.
Contributed by Charo Zhang

(cherry picked from commit c16c49b8c3b8e2e42c00e79a50e7ae029ebe98e2)
2019-04-19 15:40:56 -04:00
Eric Yang
68a98be8a2 YARN-6695. Fixed NPE in publishing appFinished events to ATSv2.
Contributed by Prabhu Joseph

(cherry picked from commit df76cdc8959c51b71704ab5c38335f745a6f35d8)
2019-04-18 12:31:34 -04:00
Puleya7
546ed4937c HDFS-14432. dfs.datanode.shared.file.descriptor.paths duplicated in hdfs-default.xml
Signed-off-by: Masatake Iwasaki <iwasakims@apache.org>
(cherry picked from commit 13907d847986aa14a51cc2a3cc83151f22c43781)
2019-04-17 21:57:06 +09:00
Akira Ajisaka
b45813b259
HADOOP-16227. Upgrade checkstyle to 8.19
(cherry picked from commit 4b4fef2f0e0ed1e185ea1058db7a65d68d4970b9)
2019-04-15 10:58:41 +09:00
Masatake Iwasaki
fd90af892c HADOOP-14544. DistCp documentation for command line options is misaligned. Contributed by Masatake Iwasaki.
(cherry picked from commit bbdbc7a9a158f36955c2253acb0edb14219ccb04)
2019-04-12 12:03:11 +09:00
Weiwei Yang
c37065eae9 YARN-9463. Add queueName info when failing with queue capacity sanity check. Contributed by Aihua Xu.
(cherry picked from commit 8c1bba375b144fd515b389174ddb349f2d9246fa)
2019-04-10 23:04:27 +08:00
Weiwei Yang
bd0c9bc160 YARN-9413. Queue resource leak after app fail for CapacityScheduler. Contributed by Tao Yang.
(cherry picked from commit ec143cbf678bd65f87fdd464c23022a2d2c54c07)
2019-04-06 20:38:06 +08:00
Akira Ajisaka
a70076b3a5
HDFS-14407. Fix misuse of SLF4j logging API in DatasetVolumeChecker#checkAllVolumes. Contributed by Wanqiang Ji.
(cherry picked from commit 67020f09502a4f07342dee457e47bb52b03441ae)
2019-04-05 12:31:14 +09:00
David Mollitor
41f1bc2056
HADOOP-16208. Do Not Log InterruptedException in Client.
Contributed by David Mollitor.

(cherry picked from commit c90736350ba158c7872a39426e7a29c5e5e0bb48)
2019-04-04 21:13:09 +01:00
Stephen O'Donnell
d9899015eb HDFS-14389. getAclStatus returns incorrect permissions and owner when an iNodeAttributeProvider is configured. Contributed by Stephen O'Donnell.
Signed-off-by: Wei-Chiu Chuang <weichiu@apache.org>
(cherry picked from commit c528e427aa6745434672b1c1850738795ad1d6d2)
(cherry picked from commit 388f445dde577999b2d81f809adcfca8f0958499)
2019-04-04 04:06:33 -07:00
Steve Loughran
3210ae372d
HADOOP-16233. S3AFileStatus to declare that isEncrypted() is always true (#685)
This is needed to fix up some confusion about caching of job.addCache() handling of S3A paths; all parent dirs -the files are downloaded by the NM without  using the DTs of the user submitting the job. This means that when you submit jobs to an EC2 cluster with lower IAM permissions than the user, cached resources don't get downloaded and the job doesn't start.

Production code changes:
* S3AFileStatus Adds "true" to the superclass's encrypted flag during construction.

Tests
* Base AbstractContractOpenTest can control whether zero byte files created in tests are encrypted. Not done via an XML attribute, just a subclass point. Thoughts?
* Verify that the filecache considers paths to not have the permissions which trigger reduce-privilege downloads
* And extend ITestDelegatedMRJob to test a completely different bucket (open street map), to verify that cached resources do get their tokens picked up

Docs:
* Advise FS developers to say all files are encrypted. It's otherwise harmless and it'll stop other people seeing impossible to debug error messages on app launch.

Contributed by Steve Loughran.

Change-Id: Ifaae4c9d735ccc5eafeebd2584b65daf2d4e5da3
2019-04-03 21:42:08 +01:00
Wei-Chiu Chuang
084fb9de22 HDFS-10477. Stop decommission a rack of DataNodes caused NameNode fail over to standby. Contributed by yunjiong zhao and Wei-Chiu Chuang.
(cherry picked from commit be488b6070a124234c77f16193ee925d32ca9a20)
(cherry picked from commit c8703dda0727e17d759d7ad27f0caee88103a530)
2019-04-03 12:41:07 -07:00
Akira Ajisaka
875435dc7d
HADOOP-16232. Fix errors in the checkstyle configration xmls. Contributed by Wanqiang Ji.
(cherry picked from commit 8b6deebb1dda49e5e35180ed5c5fb5b5221c1516)
2019-04-03 19:40:23 +09:00
Akira Ajisaka
10d6e68324
HADOOP-16225. Fix links to the developer mailing lists in DownstreamDev.md. Contributed by Wanqiang Ji.
(cherry picked from commit ebd0d2153884bf78a159534ea711c0b70acc1c05)
2019-04-02 10:55:07 +09:00
Adam Antal
52536ff887 MAPREDUCE-7190. Add SleepJob additional parameter to make parallel runs distinguishable. Contributed by Adam Antal.
Signed-off-by: Wei-Chiu Chuang <weichiu@apache.org>
(cherry picked from commit 856cbf62d3fd1c885ae8374f08d7c18c83941791)
(cherry picked from commit ddf88596550be13889ff7002aadd0ebf797996a1)
2019-04-01 10:39:18 -07:00
Gabor Bota
f01d592f3b
HADOOP-16220. Add findbugs ignores for unjustified issues during update to guava to 27.0-jre in hadoop-project
This closes #665

Signed-off-by: Akira Ajisaka <aajisaka@apache.org>
(cherry picked from commit 53a86e2b8ecb83b666d4ed223fc270e1a46642c1)
2019-04-01 13:52:44 +09:00
Takanobu Asanuma
1fc74178eb HDFS-14037. Fix SSLFactory truststore reloader thread leak in URLConnectionFactory.
(cherry picked from commit 55fb3c32fb48ca26a629d4d5f3f07e2858d09594)
2019-03-27 03:29:42 +09:00
Eric Yang
dbc02bcda7 YARN-9391. Fixed node manager environment leaks into Docker containers.
Contributed by Jim Brennan

(cherry picked from commit 3c45762a0bfb403e069a03e30d35dd11432ee8b0)
2019-03-25 15:55:46 -04:00
Weiwei Yang
f34d6b937f HADOOP-16191. AliyunOSS: improvements for copyFile/copyDirectory and logging. Contributed by wujinhu.
(cherry picked from commit 568d3ab8b65d1348dec9c971feffe200e6cba2ef)
2019-03-19 10:11:17 +08:00
Erik Krogen
d4fbbc83ad HADOOP-16192. Fix CallQueue backoff bugs: perform backoff when add() is used and update backoff when refreshed.
(cherry-picked from 8c95cb9d6bef369fef6a8364f0c0764eba90e44a)
(cherry-picked from 0de8b55a095ada2b98c0a41899651bd8e524f42e)
2019-03-18 08:53:13 -07:00
Inigo Goiri
80732146c6 HDFS-14366. Improve HDFS append performance. Contributed by Chao Sun.
(cherry picked from commit ff06ef0631cb8a0f67bbc39b5b5a1b0a81ca3b3c)
2019-03-15 13:58:39 -07:00
Erik Krogen
cb76fc5618 HDFS-14346. Add better time precision to Configuration#getTimeDuration, allowing return unit and default unit to be specified independently. Contributed by Chao Sun.
(cherry picked from commit 66357574ae1da09ced735da36bf7d80a40c3fa1b)
(cherry picked from commit fec7c5f3ebbaea7b290e0904570c3a485b541a22)
2019-03-13 13:20:51 -07:00
Shweta Yakkali
a324db2cce HDFS-14081. hdfs dfsadmin -metasave metasave_test results NPE. Contributed by Shweta Yakkali.
Signed-off-by: Wei-Chiu Chuang <weichiu@apache.org>
(cherry picked from commit 1bea785020a538115b3e08f41ff88167033d2775)
(cherry picked from commit 1ceefa726e1b531ec92c2ee2212b25c327644ef6)
2019-03-12 16:06:16 -07:00
Stephen O'Donnell
327e57ae6e HDFS-14333. Datanode fails to start if any disk has errors during Namenode registration. Contributed by Stephen O'Donnell.
Signed-off-by: Wei-Chiu Chuang <weichiu@apache.org>
(cherry picked from commit 34b14061b38dccab25058dff1b8743d8a3f82734)
(cherry picked from commit a21e2e4dbc1681175130242138ced0d36b6fafae)
2019-03-12 10:19:37 -07:00
Steve Loughran
1bace86501
HADOOP-16109. Parquet reading S3AFileSystem causes EOF
Nobody gets seek right. No matter how many times they think they have.

Reproducible test from: Dave Christianson
Fixed seek() logic: Steve Loughran
2019-03-11 16:53:06 +00:00
Weiwei Yang
fdd56b5b05 MAPREDUCE-7191. JobHistoryServer should log exception when loading/parsing history file failed. Contributed by Jiandan Yang.
(cherry picked from commit f0605146b34ddbd19fc9e52d4761f9a347d5173e)
2019-03-11 16:40:01 +08:00
Erik Krogen
b0b8000240 HDFS-14317. Ensure checkpoints are created when in-progress edit log tailing is enabled with a period shorter than the log roll period. Contributed by Ekanth Sethuramalingam.
(cherry-picked from commit 1bc282e0b3f74968c92751f0972746b012e72810)
(cherry-picked from commit 6d076dd5e8be8fee5062a04d4cd2aa0c5da0ea56)
2019-03-07 08:43:39 -08:00