892 Commits

Author SHA1 Message Date
Eric E Payne
c296e3c0aa YARN-9018. Add functionality to AuxiliaryLocalPathHandler to return all locations to read for a given path. Contributed by Kuhu Shukla (kshukla)
(cherry picked from commit 93233a7d6e4d6b8098622a1aa830355cc18d9589)
2020-01-09 17:52:26 +00:00
Eric Badger
ab29b54b19 YARN-8672. TestContainerManager#testLocalingResourceWhileContainerRunning occasionally times out. Contributed by Chandni Singh and Jim Brennan. 2020-01-08 20:13:13 +00:00
Jonathan Hung
d74f64aa99 Setting version to 2.10.1-SNAPSHOT after branch-2 to branch-2.10 rename 2019-12-09 11:49:59 -08:00
Eric Badger
128631c5ac YARN-9914. Use separate configs for free disk space checking for full and not-full disks. Contributed by Jim Brennan
(cherry picked from commit eef34f2d87a75e16b2cca870d99a5e1e28c31d9b)
2019-10-25 17:45:40 +00:00
Jonathan Hung
108127238b Preparing for 2.11.0 development 2019-10-15 10:01:45 -07:00
Miklos Szegedi
655154cb45 YARN-7585. NodeManager should go unhealthy when state store throws DBException. Contributed by Wilfred Spiegelenburg.
(cherry picked from commit 7f515f57ede74dae787994f37bfafd5d20c9aa4c)
2019-08-29 19:52:36 -07:00
Wangda Tan
7025333d31 YARN-7345. GPU Isolation: Incorrect minor device numbers written to devices.deny file. (Jonathan Hung via wangda) 2019-08-29 17:02:47 -07:00
Jonathan Hung
22ba6939eb YARN-9291. Backport YARN-7637 to branch-2 2019-08-29 17:02:47 -07:00
Jonathan Hung
e102c3d74e YARN-9397. Fix empty NMResourceInfo object test failures in branch-2 2019-08-29 17:02:47 -07:00
Sunil G
c539427696 YARN-7594. TestNMWebServices#testGetNMResourceInfo fails on trunk. Contributed by Gergely Novák. 2019-08-29 17:02:47 -07:00
Wangda Tan
ca7d0fda4f YARN-7573. Gpu Information page could be empty for nodes without GPU. (Sunil G via wangda)
Change-Id: I7f614e5a589a09ce4e4286c84b706e05c29abd14
2019-08-29 17:02:47 -07:00
Jonathan Hung
7939ae1695 YARN-9289. Backport YARN-7330 for GPU in UI to branch-2 2019-08-29 17:02:47 -07:00
Jian He
4d82318eb1 YARN-7396. NPE when accessing container logs due to null dirsHandler. Contributed by Jonathan Hung 2019-08-29 17:02:47 -07:00
Jonathan Hung
16faceb0da YARN-9174. Backport YARN-7224 for refactoring of GpuDevice class 2019-08-29 17:02:47 -07:00
Jonathan Hung
7ec4d7c6ce YARN-9280. Backport YARN-6620 to YARN-8200/branch-2 for NodeManager-side GPU isolation 2019-08-29 17:02:47 -07:00
Jonathan Hung
631dfc7277 YARN-9180. Port YARN-7033 NM recovery of assigned resources to branch-2 2019-08-29 17:02:47 -07:00
Jonathan Hung
56259bcecb YARN-9187. Backport YARN-6852 for GPU-specific native changes to branch-2 2019-08-29 17:02:47 -07:00
Wangda Tan
f340726c9a YARN-7039. Fix javac and javadoc errors in YARN-3926 branch. (Sunil G via wangda)
Change-Id: I442bf6d838b3aba83f1f6779cf9dcf8596a2102d
(cherry picked from commit e490602e9b306d5b8a543b93fb15a7395bb9a03d)
2019-08-29 17:02:46 -07:00
Varun Vasudev
7d5050a768 YARN-4830. Add support for resource types in the nodemanager. Contributed by Varun Vasudev.
(cherry picked from commit 759114b0063907d4c07ea6ee261e861bf5cc3a9a)
2019-08-29 17:02:46 -07:00
Eric Badger
afa9a4084d YARN-9442. container working directory has group read permissions. Contributed by Jim Brennan.
(cherry picked from commit 2ac029b949f041da2ee04da441c5f9f85e1f2c64)

Conflicts:
	hadoop-yarn-project/hadoop-yarn/hadoop-yarn-server/hadoop-yarn-server-nodemanager/src/main/native/container-executor/test/test-container-executor.c

(cherry picked from commit cec71691be76577718b22f936aea9e2b2cd100ea)

Conflicts:
	hadoop-yarn-project/hadoop-yarn/hadoop-yarn-server/hadoop-yarn-server-nodemanager/src/main/native/container-executor/impl/container-executor.c

(cherry picked from commit db88224e8f9d164ac811fcca9efe4a350cebecd1)
2019-08-13 17:27:43 +00:00
Haibo Chen
4a9fc45f6f YARN-9559. Create AbstractContainersLauncher for pluggable ContainersLauncher logic. (Contributed by Jonathan Hung)
(cherry picked from commit f51702d5398531835b24d812f6f95094a0e0493e)
(cherry picked from commit 8d357343c4bc9f18e25543583f8f217b8a2f621b)
(cherry picked from commit f943bff2546330bc9eb0914d1a0fa1ef4c709b72)
(cherry picked from commit 6109ecf16d6f0ebf1e314e163b5c41ad75758be1)
2019-08-06 15:14:45 -07:00
Eric Badger
9aae72af84 YARN-8045. Reduce log output from container status calls. Contributed by Craig Condit
(cherry picked from commit 144a55f0e3ba302327baf2e98d1e07b953dcbbfd)
2019-08-02 21:06:54 +00:00
Haibo Chen
68c24b322d YARN-9668. UGI conf doesn't read user overridden configurations on RM and NM startup. (Contributed by Jonanthan Hung) 2019-07-19 11:00:38 -07:00
Takanobu Asanuma
e3d4f6bfed HADOOP-16331. Fix ASF License check in pom.xml. Contributed by Akira Ajisaka.
Signed-off-by: Takanobu Asanuma <tasanuma@apache.org>
2019-05-29 17:41:21 +09:00
Akira Ajisaka
cea1b0000c HADOOP-16323. https everywhere in Maven settings. 2019-05-27 15:28:54 +09:00
Haibo Chen
dc28c86270 YARN-9529. Log correct cpu controller path on error while initializing CGroups. (Contributed by Jonathan Hung)
(cherry picked from commit 597fa47ad125c0871f5c4deb3a883e5b3341c67b)
(cherry picked from commit c6573562cbc43832e9332989996e5d07b8bce9b0)
(cherry picked from commit ea1f0f282bee7ca64ce9fbe1a59bb5e231420d29)
(cherry picked from commit aa7ff8552e0f1b9439926ae3476a3227dcf339f3)
2019-05-06 12:01:26 -07:00
Eric E Payne
6e6ebc368c YARN-5714. ContainerExecutor does not order environment map. Contributed by Remi Catherinot and Jim Brennan. 2019-03-07 16:22:36 +00:00
Adam Antal
3c7c3366ff YARN-9283. Javadoc of LinuxContainerExecutor#addSchedPriorityCommand has a wrong property name as reference
Signed-off-by: Akira Ajisaka <aajisaka@apache.org>
(cherry picked from commit 9385ec45d75109a2e6565faa10527cc56637bf5f)
2019-02-15 19:00:48 +09:00
Masatake Iwasaki
83a9369f7d YARN-9282. Typo in javadoc of class LinuxContainerExecutor: hadoop.security.authetication should be 'authentication'. Contributed by Charan Hebri.
(cherry picked from commit e0ab1bdecec3d7ba7ddc0849781d7f71714f8687)
2019-02-09 00:34:57 +09:00
Botong Huang
15286d9166 YARN 9108. Fix FederationIntercepter merge home and secondary allocate response typo. Contributed by Abhishek Modi. 2018-12-22 12:46:13 -08:00
Botong Huang
b7ca20e16e YARN-8933. [AMRMProxy] Fix potential empty fields in allocation response, move SubClusterTimeout to FederationInterceptor. Contributed by Botong Huang. 2018-11-11 19:41:08 -08:00
Giovanni Matteo Fumarola
81da8b262b YARN-8893. [AMRMProxy] Fix thread leak in AMRMClientRelayer and UAM client. Contributed by Botong Huang. 2018-11-02 16:02:56 -07:00
Inigo Goiri
b0900ad310 YARN-7652. Handle AM register requests asynchronously in FederationInterceptor. Contributed by Botong Huang.
(cherry picked from commit c3d22d3b4569b7f87af4ee4abfcc284deebe90de)
2018-10-09 10:30:42 -07:00
Giovanni Matteo Fumarola
57caab6f9b YARN-8760. [AMRMProxy] Fix concurrent re-register due to YarnRM failover in AMRMClientRelayer. Contributed by Botong Huang. 2018-10-01 14:27:01 -07:00
Giovanni Matteo Fumarola
6937925838 YARN-8696. [AMRMProxy] FederationInterceptor upgrade: home sub-cluster heartbeat async. Contributed by Botong Huang. 2018-09-24 11:40:07 -07:00
Giovanni Matteo Fumarola
60565976e1 YARN-8658. [AMRMProxy] Metrics for AMRMClientRelayer inside FederationInterceptor. Contributed by Young Chen. 2018-09-21 10:36:36 -07:00
Giovanni Matteo Fumarola
d4a3be9591 HADOOP-15699. Fix some of testContainerManager failures in Windows. Contributed by Botong Huang. 2018-08-27 12:28:16 -07:00
Giovanni Matteo Fumarola
548a595027 YARN-8705. Refactor the UAM heartbeat thread in preparation for YARN-8696. Contributed by Botong Huang. 2018-08-27 11:26:31 -07:00
Jason Lowe
c4e3df2261 YARN-8649. NPE in localizer hearbeat processing if a container is killed while localizing. Contributed by lujie
(cherry picked from commit 585ebd873a55bedd2a364d256837f08ada8ba032)
2018-08-23 09:43:03 -05:00
Giovanni Matteo Fumarola
8e6807ef4a YARN-8673. [AMRMProxy] More robust responseId resync after an YarnRM master slave switch. Contributed by Botong Huang. 2018-08-21 13:09:33 -07:00
Jason Lowe
a44e53a314 YARN-8640. Restore previous state in container-executor after failure. Contributed by Jim Brennan
(cherry picked from commit d1d129aa9deecebf42261947fcb0b2ca46dacad5)
2018-08-14 10:33:27 -05:00
Jason Lowe
2024260af6 YARN-8331. Race condition in NM container launched after done. Contributed by Pradeep Ambati
(cherry picked from commit cd04e954d2db27f0a15b7d1c492b7cdb656a51db)

Conflicts:
	hadoop-yarn-project/hadoop-yarn/hadoop-yarn-server/hadoop-yarn-server-nodemanager/src/test/java/org/apache/hadoop/yarn/server/nodemanager/containermanager/container/TestContainer.java
2018-08-09 10:35:07 -05:00
Haibo Chen
1991a1d760 YARN-6966. NodeManager metrics may return wrong negative values when NM restart. (Szilard Nemeth via Haibo Chen) 2018-08-02 10:06:16 -07:00
Arun Suresh
e2b82b82e2 YARN-7542. Fix issue that causes some Running Opportunistic Containers to be recovered as PAUSED. (Sampada Dehankar via asuresh)
(cherry picked from commit a55884c68eb175f1c9f61771386c086bf1ee65a9)
(cherry picked from commit bd4dcc7772f9a6786e8ef4ef8fa97dfdd34d64d1)
2018-08-02 09:59:04 -07:00
Rohith Sharma K S
21e416ad27 YARN-8155. Improve ATSv2 client logging in RM and NM publisher. Contributed by Abhishek Modi. 2018-08-01 22:25:53 +05:30
Robert Kanter
edb9d8b554 YARN-8518. test-container-executor test_is_empty() is broken (Jim_Brennan via rkanter)
(cherry picked from commit 1bc106a738a6ce4f7ed025d556bb44c1ede022e3)
(cherry picked from commit 6e0db6fe1a8ce50977175567f2ba1f957e7b9c91)
2018-07-22 05:28:21 +00:00
Robert Kanter
92f02f97fd Only mount non-empty directories for cgroups (miklos.szegedi@cloudera.com via rkanter)
(cherry picked from commit 0838fe833738e04f5e6f6408e97866d77bebbf30)
(cherry picked from commit c1dc4ca2c6080377159157ce97bf5d72fa3285a1)
2018-07-22 05:28:20 +00:00
Robert Kanter
f5fd5aa025 Disable mounting cgroups by default (miklos.szegedi@cloudera.com via rkanter)
(cherry picked from commit 351cf87c92872d90f62c476f85ae4d02e485769c)
(cherry picked from commit d61d84279f7f22867c23dd95e8bfeb70ea7e0690)
2018-07-22 05:28:20 +00:00
Jason Lowe
0e6efe06ea YARN-8515. container-executor can crash with SIGPIPE after nodemanager restart. Contributed by Jim Brennan
(cherry picked from commit 17118f446c2387aa796849da8b69a845d9d307d3)
2018-07-13 10:11:57 -05:00
Sunil G
6cc5d49fa3 YARN-8473. Containers being launched as app tears down can leave containers in NEW state. Contributed by Jason Lowe.
(cherry picked from commit 705e2c1f7cba51496b0d019ecedffbe5fb55c28b)
2018-07-10 20:13:56 +05:30