1067 Commits

Author SHA1 Message Date
Billie Rinaldi 2a94823f32 YARN-8403. Change the log level for fail to download resource from INFO to ERROR. Contributed by Eric Yang
(cherry picked from commit 67c65da261)
2018-08-01 08:58:15 -07:00
Wangda Tan 5583711419 Preparing for 3.1.2 release
Change-Id: If2793e2ed2b5b349a9e1f98f78df43f309dcfcbd
2018-07-31 13:08:55 -07:00
Wangda Tan 7b552c9d72 YARN-8418. App local logs could leaked if log aggregation fails to initialize for the app. (Bibin A Chundatt via wangda)
Change-Id: I29a23ca4b219b48c92e7975cd44cddb8b0e04104
(cherry picked from commit 4b540bbfcf)
2018-07-31 12:13:36 -07:00
bibinchundatt 8cd2a73777 YARN-8584. Several typos in Log Aggregation related classes. Contributed by Szilard Nemeth.
(cherry picked from commit 2b39ad2698)
2018-07-31 00:07:08 +05:30
bibinchundatt 2e7876a725 YARN-8558. NM recovery level db not cleaned up properly on container finish. Contributed by Bibin A Chundatt.
(cherry picked from commit 3d586841ab)
2018-07-28 20:56:35 +05:30
Eric Yang c2c3eee69c YARN-8508. Release GPU resource for killed container.
Contributed by Chandni Singh

(cherry picked from commit ed9d60e888)
2018-07-27 19:36:21 -04:00
Haibo Chen 7e7792dd7b YARN-6966. NodeManager metrics may return wrong negative values when NM restart. (Szilard Nemeth via Haibo Chen)
(cherry picked from commit 9d3c39e9dd)
2018-07-24 12:50:43 -07:00
bibinchundatt 0710107f8d YARN-8544. [DS] AM registration fails when hadoop authorization is enabled. Contributed by Bibin A Chundatt.
(cherry picked from commit 8461278833)
2018-07-24 13:11:31 +05:30
Eric Yang 23b8546a80 YARN-8380. Support bind propagation options for mounts in docker runtime.
Contributed by Billie Rinaldi

(cherry picked from commit 8688a0c7f8)
2018-07-23 20:13:41 -04:00
Robert Kanter dfa71428ea YARN-8518. test-container-executor test_is_empty() is broken (Jim_Brennan via rkanter)
(cherry picked from commit 1bc106a738)
2018-07-18 16:07:48 -07:00
Robert Kanter 1c7d916347 Only mount non-empty directories for cgroups (miklos.szegedi@cloudera.com via rkanter)
(cherry picked from commit 0838fe8337)
2018-07-18 16:07:48 -07:00
Robert Kanter 27e2b4b364 Disable mounting cgroups by default (miklos.szegedi@cloudera.com via rkanter)
(cherry picked from commit 351cf87c92)
2018-07-18 16:07:48 -07:00
Eric Yang d82edec3c0 YARN-8538. Fixed memory leaks in container-executor and test cases.
Contributed by Billie Rinaldi
2018-07-18 13:44:49 -04:00
Jason Lowe 7cbb9597c4 YARN-8515. container-executor can crash with SIGPIPE after nodemanager restart. Contributed by Jim Brennan
(cherry picked from commit 17118f446c)
2018-07-13 10:06:38 -05:00
Sunil G 9b4ead92c8 YARN-8512. ATSv2 entities are not published to HBase from second attempt onwards. Contributed by Rohith Sharma K S.
(cherry picked from commit 7f1d3d0e9d)
2018-07-11 12:27:12 +05:30
Sunil G 6f10491e64 YARN-8473. Containers being launched as app tears down can leave containers in NEW state. Contributed by Jason Lowe.
(cherry picked from commit 705e2c1f7c)
2018-07-10 20:12:47 +05:30
Shane Kumpf 2f264764a4 YARN-8485. Priviledged container app launch is failing intermittently. Contributed by Eric Yang
(cherry picked from commit 53e267fa72)
2018-07-02 16:23:23 -06:00
Eric Yang 0aa7272199 YARN-8465. Fixed docker container status for node manager restart.
Contributed by Shane Kumpf
2018-07-02 13:40:02 -04:00
Jason Lowe dbb0a5f51f YARN-8451. Multiple NM heartbeat thread created when a slow NM resync with RM. Contributed by Botong Huang
(cherry picked from commit 100470140d)
2018-06-29 13:11:41 -05:00
Wangda Tan 8f226f4f1d YARN-8423. GPU does not get released even though the application gets killed. (Sunil G via wangda)
Change-Id: I570db7d60f8c6c21762dd618a9207d1107c486a0
(cherry picked from commit ada8f63d0b)
2018-06-26 19:34:10 -07:00
Eric Yang 3b3a85002d YARN-8326. Removed exit code file check for launched container.
Contributed by Shane Kumpf

(cherry picked from commit 8a32bc39eb)
2018-06-22 19:23:57 -04:00
Rohith Sharma K S 1aa6c9407c YARN-8155. Improve ATSv2 client logging in RM and NM publisher. Contributed by Abhishek Modi.
(cherry picked from commit 9119b3cf8f)
2018-06-14 13:51:13 +05:30
Eric Yang 934a005f72 YARN-8259. Improve privileged docker container liveliness checks.
Contributed by Shane Kumpf

(cherry picked from commit 22994889dc)
2018-06-13 19:26:23 -04:00
Inigo Goiri d5cd12c082 HADOOP-15529. ContainerLaunch#testInvalidEnvVariableSubstitutionType is not supported in Windows. Contributed by Giovanni Matteo Fumarola.
(cherry picked from commit 6e756e8a62)
2018-06-12 10:25:05 -07:00
Inigo Goiri 65d2554ad8 YARN-8370. Some Node Manager tests fail on Windows due to improper path/file separator. Contributed by Anbang Hu.
(cherry picked from commit 2b2f672022)
2018-06-11 19:26:27 -07:00
Inigo Goiri 63c20f9021 YARN-8359. Exclude containermanager.linux test classes on Windows. Contributed by Jason Lowe.
(cherry picked from commit 3492a1db2c0654ce5375360caa74a34f928f23be)
2018-06-07 17:10:09 -07:00
Rohith Sharma K S a95f216945 YARN-8399. NodeManager is giving 403 GSS exception post upgrade to 3.1 in secure mode. Contributed by Sunil Govindan.
(cherry picked from commit 58bc34f1e3)
2018-06-07 10:27:08 +05:30
Miklos Szegedi 21948b048e YARN-8382. cgroup file leak in NM. Contributed by Hu Ziqian.
(cherry picked from commit 925fdf761a513130e23c10575c7328c8681cff1d)
2018-06-04 10:56:13 -07:00
Billie Rinaldi e59e9feefa YARN-8342. Enable untrusted docker image to run with launch command. Contributed by Eric Yang
(cherry picked from commit 31998643a5)
2018-06-02 14:49:21 -07:00
Wangda Tan cc460fea2d YARN-8384. stdout.txt, stderr.txt logs of a launched docker container is coming with primary group of submit user instead of hadoop. (Eric Yang via wangda)
Change-Id: Idbb2cd250f4f62dc32993e0d3ca3ec5684616baa
(cherry picked from commit 3a6bd77550)
2018-06-01 15:47:45 -07:00
Eric E Payne 3d06d75423 YARN-8377: Javadoc build failed in hadoop-yarn-server-nodemanager. Contributed by Takanobu Asanuma
(cherry picked from commit e44c0849d7)
2018-05-30 17:28:14 +00:00
Eric Yang 03209e8966 YARN-8362. Bugfix logic in container retries in node manager.
Contributed by Chandni Singh

(cherry picked from commit 135941e00d)
2018-05-29 17:04:01 -04:00
Jason Lowe a1fd04c4f4 YARN-8329. Docker client configuration can still be set incorrectly. Contributed by Shane Kumpf
(cherry picked from commit 4827e9a908)
2018-05-29 14:48:01 -05:00
Shane Kumpf de43b41d0f YARN-6919. Add default volume mount list. Contributed by Eric Badger 2018-05-24 09:35:35 -06:00
Rohith Sharma K S 923c833da4 YARN-8319. More YARN pages need to honor yarn.resourcemanager.display.per-user-apps. Contributed by Sunil G.
(cherry picked from commit c05b5d424b)
2018-05-24 14:22:43 +05:30
Inigo Goiri 61b5b2f4f7 YARN-8344. Missing nm.stop() in TestNodeManagerResync to fix testKillContainersOnResync. Contributed by Giovanni Matteo Fumarola.
(cherry picked from commit e99e5bf104)
2018-05-23 14:16:03 -07:00
Eric Yang 7c3f236783 YARN-7960. Added security flag no-new-privileges for YARN Docker integration.
Contributed by Eric Badger

(cherry picked from commit 6176d2b35c)
2018-05-22 13:47:08 -04:00
Jason Lowe 9788c8c01e YARN-8206. Sending a kill does not immediately kill docker containers. Contributed by Eric Badger
(cherry picked from commit 5f11288e41)

Conflicts:
	hadoop-yarn-project/hadoop-yarn/hadoop-yarn-server/hadoop-yarn-server-nodemanager/src/test/java/org/apache/hadoop/yarn/server/nodemanager/containermanager/linux/runtime/TestDockerContainerRuntime.java
2018-05-22 09:49:31 -05:00
Eric Yang aafaa5f99b YARN-8141. Removed YARN_CONTAINER_RUNTIME_DOCKER_LOCAL_RESOURCE_MOUNTS flag.
Contributed by Chandni Singh

(cherry-picked from commit d45a0b7d73)
2018-05-17 17:32:29 -04:00
Jason Lowe 79b2a508e0 YARN-8284. get_docker_command refactoring. Contributed by Eric Badger
(cherry picked from commit d47c09dcb1)
2018-05-16 09:34:24 -05:00
Haibo Chen 8d3598525d YARN-8130 Race condition when container events are published for KILLED applications. (Rohith Sharma K S via Haibo Chen)
(cherry picked from commit 2d00a0c71b)
2018-05-15 11:40:47 +05:30
Eric Yang 8edcbe9460 YARN-8265. Improve DNS handling on docker IP changes.
Contributed by Billie Rinaldi

(cherry picked from commit 0ff94563b9)
2018-05-11 22:41:30 -07:00
Jason Lowe dfe73334c2 YARN-7654. Support ENTRY_POINT for docker container. Contributed by Eric Yang
(cherry picked from commit 6c8e51ca7e)
2018-05-11 18:57:20 -05:00
Jason Lowe f6d6a2ab04 YARN-8244. TestContainerSchedulerQueuing.testStartMultipleContainers failed. Contributed by Jim Brennan
(cherry picked from commit dc912994a1)
2018-05-11 14:13:55 -05:00
Eric Yang 645429ca28 YARN-8274. Fixed a bug on docker start command.
Contributed by Jason Lowe
2018-05-11 14:26:20 -04:00
Eric Yang 6ce89ea2a8 YARN-8261. Fixed a bug in creation of localized container directory.
Contributed by Jason Lowe

(cherry picked from commit af4fc2e628)
2018-05-09 13:34:54 -04:00
Jason Lowe 39236da2ff YARN-8207. Docker container launch use popen have risk of shell expansion. Contributed by Eric Yang.
(cherry picked from commit a2ea756420)
2018-05-08 15:37:24 -05:00
Eric Yang a984abc472 YARN-8223. Improved yarn auxiliary service to load jar file from HDFS.
Contributed by Zian Chen

(cherry picked from commit 8cdb032aff)
2018-05-04 12:38:28 -04:00
Billie Rinaldi 04c3f11de4 YARN-7818. Remove privileged operation warnings during container launch for the ContainerRuntimes. Contributed by Shane Kumpf
(cherry picked from commit 502914ca32)
2018-05-04 08:55:30 -07:00
Eric Yang cf46533bca YARN-7973. Added ContainerRelaunch feature for Docker containers.
Addendum patch 001
Contributed by Shane Kumpf
2018-05-03 12:46:04 -04:00
Eric Yang f729fb0baa YARN-8194. Fixed reinitialization error for LinuxContainerExecutor.
Contributed by Chandni Singh

(cherry picked from commit f4d280f02b)
2018-05-03 11:59:35 -04:00
Eric Yang e4b9981612 YARN-7973. Added ContainerRelaunch feature for Docker containers.
Contributed by Shane Kumpf
2018-05-03 11:57:01 -04:00
Eric Yang 61fac20c9e YARN-8209. Fixed NPE in Yarn Service deletion.
Contributed by Eric Badger
2018-05-02 15:05:30 -04:00
Eric Yang 0fc55a8f57 YARN-8228. Added hostname length check for docker container.
Contributed by Shane Kumpf

(cherry picked from commit a966ec6e23)
2018-04-30 19:14:37 -04:00
Sunil G a1a2f1ddc1 YARN-8215. ATS v2 returns invalid YARN_CONTAINER_ALLOCATED_HOST_HTTP_ADDRESS from NM. Contributed by Rohith Sharma K S.
(cherry picked from commit 7fc09c4f72)
2018-04-27 10:34:29 +05:30
Eric Yang 050bd4cd2a YARN-8043. Added the exception message for failed launches running under LCE.
Contributed by Shane Kumpf
2018-04-26 08:26:10 -07:00
Eric Yang 5556cf397c YARN-8064. Bugfix for clean up container-executor cmd helper file.
Contributed by Eric Badger
2018-04-20 19:12:06 -04:00
Weiwei Yang c3ca10c22a YARN-7734. Fix UT failure TestContainerLogsPage#testContainerLogPageAccess. Contributed by Tao Yang.
(cherry picked from commit 411993f6e5)
2018-04-18 19:40:34 -07:00
Shashikant Banerjee a13b3edfd8
HADOOP-15396. Some java source files are executable
Signed-off-by: Akira Ajisaka <aajisaka@apache.org>
(cherry picked from commit e4c39f3247)
2018-04-19 08:36:32 +09:00
Wangda Tan c962371430 YARN-1151. Ability to configure auxiliary services from HDFS-based JAR files. (Xuan Gong via wangda)
Change-Id: Ied37ff11e507fc86847753ba79486652c8fadfe9
2018-04-13 10:17:47 -07:00
Weiwei Yang 84531ad9b6 YARN-8153. Guaranteed containers always stay in SCHEDULED on NM after restart. Contributed by Yang Wang. 2018-04-13 13:29:43 +08:00
Eric E Payne f955d52f39 YARN-8120. JVM can crash with SIGSEGV when exiting due to custom leveldb logger. Contributed by Jason Lowe.
(cherry picked from commit 6bb128dfb8)
2018-04-12 16:11:57 +00:00
Billie Rinaldi 11f1d49823 YARN-7221. Add security check for privileged docker container. Contributed by Eric Yang
(cherry picked from commit 933477e9e0)
2018-04-11 12:15:00 -07:00
Wangda Tan f5dc7d1edf YARN-8116. Nodemanager fails with NumberFormatException: For input string: . (Chandni Singh via wangda)
Change-Id: Idd30cfca59982d3fc6e47aa1b88f844a78fae94d
(cherry picked from commit 2bf9cc2c73)
2018-04-10 17:38:31 -07:00
Wangda Tan 25bf1d3ac6 Updated branch-3.1 to track 3.1.1
Change-Id: Idd55c79921b8acacc8a8a3258450e476f3abb706
2018-04-09 10:58:50 -07:00
bibinchundatt 091db4d0eb YARN-7905. Parent directory permission incorrect during public localization. Contributed by Bilwa S T.
(cherry picked from commit eb47c3de74)
2018-04-08 14:49:36 +05:30
Subru Krishnan b9bad94596 YARN-8110. AMRMProxy recover should catch for all throwable to avoid premature exit. (Botong Huang via Subru).
(cherry picked from commit 00905efab2)
2018-04-06 16:35:29 -07:00
Jason Lowe 5d4b2c31a1 YARN-8054. Improve robustness of the LocalDirsHandlerService MonitoringTimerTask thread. Contributed by Jonathan Eagles
(cherry picked from commit 5aa7052e31)
2018-03-21 17:20:36 -05:00
Jason Lowe a4f0dbcb44 YARN-8027. Setting hostname of docker container breaks for --net=host in docker 1.13. Contributed by Jim Brennan
(cherry picked from commit f480367af6)
2018-03-19 17:44:06 -05:00
Vinod Kumar Vavilapalli (I am also known as @tshooter.) 0bd99621c5 Preparing for 3.1.0 development 2018-03-19 15:16:54 -07:00
Yufei Gu 49c747ab18 YARN-8039. Clean up log dir configuration in TestLinuxContainerExecutorWithMocks.testStartLocalizer. Contributed by Miklos Szegedi. 2018-03-16 16:25:33 -07:00
Wangda Tan 4bf622043f YARN-7952. RM should be able to recover log aggregation status after restart/fail-over. (Xuan Gong via wangda)
Change-Id: I725c9afe64831eda0aa6b0bebdbc79d2dd165707
2018-03-15 13:26:45 -07:00
fang zhenyi 5e013d50d1
HADOOP-15305. Replace FileUtils.writeStringToFile(File, String) with (File, String, Charset) to fix deprecation warnings.
Signed-off-by: Akira Ajisaka <aajisaka@apache.org>
2018-03-15 16:12:07 +09:00
Eric Yang a82be7754d YARN-7999. Added file cache initializer for Linux container-executor.
Contributed by Jason Lowe
2018-03-14 12:21:57 -04:00
Wangda Tan a5b27b3c67 YARN-5015. Support sliding window retry capability for container restart. (Chandni Singh via wangda)
Change-Id: I07addd3e4ba8d98456ee2ff1d5c540a38fe61dea
2018-03-13 17:55:17 -07:00
Miklos Szegedi a82d4a2e3a YARN-5764. NUMA awareness support for launching containers. Contributed by Devaraj K. 2018-03-13 12:36:57 -07:00
Wangda Tan e718ac597f Revert "YARN-7891. LogAggregationIndexedFileController should support read from HAR file. (Xuan Gong via wangda)"
This reverts commit 4d53ef7eef.
2018-03-07 15:46:47 -08:00
Wangda Tan 037d783483 YARN-7626. Allow regular expression matching in container-executor.cfg for devices and named docker volumes mount. (Zian Chen via wangda)
Change-Id: If461277d4557922ab7e4dce9dd8dc5d0d5f22710
(cherry picked from commit 88f9138e12d2d5a1bd13f0915acef93037c1d086)
2018-03-07 11:30:15 -08:00
Wangda Tan 4d53ef7eef YARN-7891. LogAggregationIndexedFileController should support read from HAR file. (Xuan Gong via wangda)
Change-Id: Ie16e34039d57df50128c73b37516ad0bc7c9590e
2018-03-07 11:30:06 -08:00
Jason Lowe d69b31f7f7 YARN-7677. Docker image cannot set HADOOP_CONF_DIR. Contributed by Jim Brennan 2018-03-07 09:20:34 -06:00
Billie Rinaldi 628be58a4c YARN-7915. Trusted image log message repeated multiple times. Contributed by Shane Kumpf 2018-03-05 09:08:44 -08:00
Jason Lowe 83798f15f8 YARN-7511. NPE in ContainerLocalizer when localization failed for running container. Contributed by Tao Yang 2018-03-02 09:59:46 -06:00
Ray Chiang 923e1773e1 HADOOP-14652. Update metrics-core version to 3.2.4. (rchiang) 2018-03-01 15:36:10 -08:00
Sunil G 22928c0c60 YARN-7637. GPU volume creation command fails when work preserving is disabled at NM. Contributed by Zian Chen. 2018-03-01 22:39:32 +05:30
Billie Rinaldi 727c033997 YARN-7446. Remove --user flag when running privileged mode docker container. Contributed by Eric Yang 2018-02-27 14:33:57 -08:00
Jason Lowe 8e728f39c9 YARN-5714. ContainerExecutor does not order environment map. Contributed by Remi Catherinot and Jim Brennan 2018-02-23 15:46:35 -06:00
Jason Lowe 3132709b46 YARN-7916. Remove call to docker logs on failure in container-executor. Contributed by Shane Kumpf 2018-02-21 16:54:02 -06:00
Jason Lowe b9a429bb28 Revert "YARN-7677. Docker image cannot set HADOOP_CONF_DIR. Contributed by Jim Brennan"
This reverts commit 8013475d44.
2018-02-19 08:16:25 -06:00
fang zhenyi 4d4dde5112
HADOOP-15223. Replace Collections.EMPTY* with empty* when available
Signed-off-by: Akira Ajisaka <aajisaka@apache.org>
2018-02-18 22:19:39 +09:00
Jason Lowe 8013475d44 YARN-7677. Docker image cannot set HADOOP_CONF_DIR. Contributed by Jim Brennan 2018-02-15 17:09:00 -06:00
Jason Lowe 5a1db60ab1 YARN-7914. Fix exit code handling for short lived Docker containers. Contributed by Shane Kumpf 2018-02-12 15:50:10 -06:00
Jason Lowe 3414fd1243 YARN-7917. Fix failing test TestDockerContainerRuntime#testLaunchContainerWithDockerTokens. Contributed by Shane Kumpf 2018-02-12 09:27:43 -06:00
Wangda Tan 60f9e60b3b Preparing for 3.2.0 development
Change-Id: I6d0e01f3d665d26573ef2b957add1cf0cddf7938
2018-02-11 11:17:38 +08:00
Jian He eb2449d539 YARN-5428. Allow for specifying the docker client configuration directory. Contributed by Shane Kumpf 2018-02-08 11:35:30 -08:00
Jason Lowe 456705a07c YARN-7815. Make the YARN mounts added to Docker containers more restrictive. Contributed by Shane Kumpf 2018-02-07 13:09:08 -06:00
Billie Rinaldi 266da25c04 Revert "YARN-6078. Containers stuck in Localizing state. Contributed by Billie Rinaldi."
This reverts commit e14f03dfbf.
2018-02-07 07:17:00 -08:00
Billie Rinaldi aa461f9091 YARN-7516. Add security check for trusted docker images. Contributed by Eric Yang 2018-02-07 07:09:45 -08:00
Anu Engineer 4304fcd5bd HDFS-12990. Change default NameNode RPC port back to 8020. Contributed by Xiao Chen. 2018-02-06 13:43:45 -08:00
Billie Rinaldi a196ee9362 YARN-7890. NPE during container relaunch. Contributed by Jason Lowe 2018-02-05 18:04:09 -08:00
Jason Lowe 682ea21f2b Revert "YARN-7677. Docker image cannot set HADOOP_CONF_DIR. Contributed by Jim Brennan"
This reverts commit 12eaae383a.
2018-02-01 12:14:09 -06:00