873 Commits

Author SHA1 Message Date
Eric Badger afa9a4084d YARN-9442. container working directory has group read permissions. Contributed by Jim Brennan.
(cherry picked from commit 2ac029b949)

Conflicts:
	hadoop-yarn-project/hadoop-yarn/hadoop-yarn-server/hadoop-yarn-server-nodemanager/src/main/native/container-executor/test/test-container-executor.c

(cherry picked from commit cec71691be)

Conflicts:
	hadoop-yarn-project/hadoop-yarn/hadoop-yarn-server/hadoop-yarn-server-nodemanager/src/main/native/container-executor/impl/container-executor.c

(cherry picked from commit db88224e8f)
2019-08-13 17:27:43 +00:00
Haibo Chen 4a9fc45f6f YARN-9559. Create AbstractContainersLauncher for pluggable ContainersLauncher logic. (Contributed by Jonathan Hung)
(cherry picked from commit f51702d539)
(cherry picked from commit 8d357343c4)
(cherry picked from commit f943bff254)
(cherry picked from commit 6109ecf16d)
2019-08-06 15:14:45 -07:00
Eric Badger 9aae72af84 YARN-8045. Reduce log output from container status calls. Contributed by Craig Condit
(cherry picked from commit 144a55f0e3)
2019-08-02 21:06:54 +00:00
Haibo Chen 68c24b322d YARN-9668. UGI conf doesn't read user overridden configurations on RM and NM startup. (Contributed by Jonathan Hung) 2019-07-19 11:00:38 -07:00
Takanobu Asanuma e3d4f6bfed HADOOP-16331. Fix ASF License check in pom.xml. Contributed by Akira Ajisaka.
Signed-off-by: Takanobu Asanuma <tasanuma@apache.org>
2019-05-29 17:41:21 +09:00
Akira Ajisaka cea1b0000c HADOOP-16323. https everywhere in Maven settings. 2019-05-27 15:28:54 +09:00
Haibo Chen dc28c86270 YARN-9529. Log correct cpu controller path on error while initializing CGroups. (Contributed by Jonathan Hung)
(cherry picked from commit 597fa47ad1)
(cherry picked from commit c6573562cb)
(cherry picked from commit ea1f0f282b)
(cherry picked from commit aa7ff8552e)
2019-05-06 12:01:26 -07:00
Eric E Payne 6e6ebc368c YARN-5714. ContainerExecutor does not order environment map. Contributed by Remi Catherinot and Jim Brennan. 2019-03-07 16:22:36 +00:00
Adam Antal 3c7c3366ff YARN-9283. Javadoc of LinuxContainerExecutor#addSchedPriorityCommand has a wrong property name as reference
Signed-off-by: Akira Ajisaka <aajisaka@apache.org>
(cherry picked from commit 9385ec45d7)
2019-02-15 19:00:48 +09:00
Masatake Iwasaki 83a9369f7d YARN-9282. Typo in javadoc of class LinuxContainerExecutor: hadoop.security.authetication should be 'authentication'. Contributed by Charan Hebri.
(cherry picked from commit e0ab1bdece)
2019-02-09 00:34:57 +09:00
Botong Huang 15286d9166 YARN-9108. Fix FederationInterceptor merge home and secondary allocate response typo. Contributed by Abhishek Modi. 2018-12-22 12:46:13 -08:00
Botong Huang b7ca20e16e YARN-8933. [AMRMProxy] Fix potential empty fields in allocation response, move SubClusterTimeout to FederationInterceptor. Contributed by Botong Huang. 2018-11-11 19:41:08 -08:00
Giovanni Matteo Fumarola 81da8b262b YARN-8893. [AMRMProxy] Fix thread leak in AMRMClientRelayer and UAM client. Contributed by Botong Huang. 2018-11-02 16:02:56 -07:00
Inigo Goiri b0900ad310 YARN-7652. Handle AM register requests asynchronously in FederationInterceptor. Contributed by Botong Huang.
(cherry picked from commit c3d22d3b45)
2018-10-09 10:30:42 -07:00
Giovanni Matteo Fumarola 57caab6f9b YARN-8760. [AMRMProxy] Fix concurrent re-register due to YarnRM failover in AMRMClientRelayer. Contributed by Botong Huang. 2018-10-01 14:27:01 -07:00
Giovanni Matteo Fumarola 6937925838 YARN-8696. [AMRMProxy] FederationInterceptor upgrade: home sub-cluster heartbeat async. Contributed by Botong Huang. 2018-09-24 11:40:07 -07:00
Giovanni Matteo Fumarola 60565976e1 YARN-8658. [AMRMProxy] Metrics for AMRMClientRelayer inside FederationInterceptor. Contributed by Young Chen. 2018-09-21 10:36:36 -07:00
Giovanni Matteo Fumarola d4a3be9591 HADOOP-15699. Fix some of testContainerManager failures in Windows. Contributed by Botong Huang. 2018-08-27 12:28:16 -07:00
Giovanni Matteo Fumarola 548a595027 YARN-8705. Refactor the UAM heartbeat thread in preparation for YARN-8696. Contributed by Botong Huang. 2018-08-27 11:26:31 -07:00
Jason Lowe c4e3df2261 YARN-8649. NPE in localizer heartbeat processing if a container is killed while localizing. Contributed by lujie
(cherry picked from commit 585ebd873a)
2018-08-23 09:43:03 -05:00
Giovanni Matteo Fumarola 8e6807ef4a YARN-8673. [AMRMProxy] More robust responseId resync after a YarnRM master-slave switch. Contributed by Botong Huang. 2018-08-21 13:09:33 -07:00
Jason Lowe a44e53a314 YARN-8640. Restore previous state in container-executor after failure. Contributed by Jim Brennan
(cherry picked from commit d1d129aa9d)
2018-08-14 10:33:27 -05:00
Jason Lowe 2024260af6 YARN-8331. Race condition in NM container launched after done. Contributed by Pradeep Ambati
(cherry picked from commit cd04e954d2)

Conflicts:
	hadoop-yarn-project/hadoop-yarn/hadoop-yarn-server/hadoop-yarn-server-nodemanager/src/test/java/org/apache/hadoop/yarn/server/nodemanager/containermanager/container/TestContainer.java
2018-08-09 10:35:07 -05:00
Haibo Chen 1991a1d760 YARN-6966. NodeManager metrics may return wrong negative values when NM restart. (Szilard Nemeth via Haibo Chen) 2018-08-02 10:06:16 -07:00
Arun Suresh e2b82b82e2 YARN-7542. Fix issue that causes some Running Opportunistic Containers to be recovered as PAUSED. (Sampada Dehankar via asuresh)
(cherry picked from commit a55884c68e)
(cherry picked from commit bd4dcc7772)
2018-08-02 09:59:04 -07:00
Rohith Sharma K S 21e416ad27 YARN-8155. Improve ATSv2 client logging in RM and NM publisher. Contributed by Abhishek Modi. 2018-08-01 22:25:53 +05:30
Robert Kanter edb9d8b554 YARN-8518. test-container-executor test_is_empty() is broken (Jim_Brennan via rkanter)
(cherry picked from commit 1bc106a738)
(cherry picked from commit 6e0db6fe1a)
2018-07-22 05:28:21 +00:00
Robert Kanter 92f02f97fd Only mount non-empty directories for cgroups (miklos.szegedi@cloudera.com via rkanter)
(cherry picked from commit 0838fe8337)
(cherry picked from commit c1dc4ca2c6)
2018-07-22 05:28:20 +00:00
Robert Kanter f5fd5aa025 Disable mounting cgroups by default (miklos.szegedi@cloudera.com via rkanter)
(cherry picked from commit 351cf87c92)
(cherry picked from commit d61d84279f)
2018-07-22 05:28:20 +00:00
Jason Lowe 0e6efe06ea YARN-8515. container-executor can crash with SIGPIPE after nodemanager restart. Contributed by Jim Brennan
(cherry picked from commit 17118f446c)
2018-07-13 10:11:57 -05:00
Sunil G 6cc5d49fa3 YARN-8473. Containers being launched as app tears down can leave containers in NEW state. Contributed by Jason Lowe.
(cherry picked from commit 705e2c1f7c)
2018-07-10 20:13:56 +05:30
Giovanni Matteo Fumarola aab9bfc13c YARN-7899. [AMRMProxy] Stateful FederationInterceptor for pending requests. Contributed by Botong Huang. 2018-07-09 16:47:44 -07:00
Jason Lowe 14c7dc3c1e YARN-8451. Multiple NM heartbeat thread created when a slow NM resync with RM. Contributed by Botong Huang
(cherry picked from commit 100470140d)
2018-06-29 13:17:14 -05:00
Inigo Goiri 85f3105e61 HADOOP-15529. ContainerLaunch#testInvalidEnvVariableSubstitutionType is not supported in Windows. Contributed by Giovanni Matteo Fumarola.
(cherry picked from commit 6e756e8a62)
2018-06-12 10:26:01 -07:00
Inigo Goiri 8be1640bf6 YARN-8370. Some Node Manager tests fail on Windows due to improper path/file separator. Contributed by Anbang Hu.
(cherry picked from commit 2b2f672022)
2018-06-11 19:27:34 -07:00
Inigo Goiri b991b38f51 YARN-8359. Exclude containermanager.linux test classes on Windows. Contributed by Jason Lowe.
(cherry picked from commit 3492a1db2c0654ce5375360caa74a34f928f23be)
2018-06-07 17:11:12 -07:00
Inigo Goiri 8f43ade46a YARN-8344. Missing nm.stop() in TestNodeManagerResync to fix testKillContainersOnResync. Contributed by Giovanni Matteo Fumarola.
(cherry picked from commit e99e5bf104)
2018-05-23 14:17:20 -07:00
Arun Suresh 113e2d6801 YARN-7900. [AMRMProxy] AMRMClientRelayer for stateful FederationInterceptor. (Botong Huang via asuresh) 2018-05-21 11:26:32 -07:00
Haibo Chen e28af8b0eb YARN-8130. Race condition when container events are published for KILLED applications. (Rohith Sharma K S via Haibo Chen)
(cherry picked from commit 2d00a0c71b)
2018-05-15 11:58:56 +05:30
Jason Lowe 5e3369ccd2 YARN-8244. TestContainerSchedulerQueuing.testStartMultipleContainers failed. Contributed by Jim Brennan
(cherry picked from commit dc912994a1)
2018-05-11 14:20:53 -05:00
Sunil G a91d5c7e2c YARN-8215. ATS v2 returns invalid YARN_CONTAINER_ALLOCATED_HOST_HTTP_ADDRESS from NM. Contributed by Rohith Sharma K S.
(cherry picked from commit 7fc09c4f72)
2018-04-27 11:19:12 +05:30
Jason Lowe 5ec195edbc YARN-7189. Container-executor doesn't remove Docker containers that error out early. Contributed by Eric Badger
(cherry picked from commit 391ac5cdd2)
2018-04-17 09:53:19 -05:00
Eric Yang 0731756293 YARN-7810. Update TestDockerContainerRuntime to test with current user credential.
(contributed by Shane Kumpf)
2018-04-16 13:39:21 -04:00
Wei-Chiu Chuang cc2a2a8e06 Revert "YARN-7810. Update TestDockerContainerRuntime to test with current user credential."
This reverts commit 724bffdb89.
2018-04-13 10:13:59 -07:00
Eric E Payne b0dfb18daa YARN-8120. JVM can crash with SIGSEGV when exiting due to custom leveldb logger. Contributed by Jason Lowe.
(cherry picked from commit 6bb128dfb8)
2018-04-12 16:38:50 +00:00
Eric Yang 724bffdb89 YARN-7810. Update TestDockerContainerRuntime to test with current user credential.
(contributed by Shane Kumpf)

            Cherry-picked from: 59828be197
2018-04-11 20:06:33 -04:00
Subru Krishnan 05246b7585 YARN-8110. AMRMProxy recover should catch for all throwable to avoid premature exit. (Botong Huang via Subru).
(cherry picked from commit 00905efab2)
2018-04-06 16:38:28 -07:00
Jason Lowe 13f74a3601 YARN-8054. Improve robustness of the LocalDirsHandlerService MonitoringTimerTask thread. Contributed by Jonathan Eagles
(cherry picked from commit 5aa7052e31)
2018-03-21 17:40:05 -05:00
Yufei Gu 204674f414 YARN-8039. Clean up log dir configuration in TestLinuxContainerExecutorWithMocks.testStartLocalizer. Contributed by Miklos Szegedi.
(cherry picked from commit 49c747ab18)
2018-03-16 16:34:14 -07:00
Jason Lowe 0a3a878428 YARN-7511. NPE in ContainerLocalizer when localization failed for running container. Contributed by Tao Yang
(cherry picked from commit 83798f15f8)
2018-03-02 10:08:32 -06:00