5167 Commits

Author SHA1 Message Date
Eric Yang
7640d62716 YARN-8579. Recover NMToken of previous attempted component data.
Contributed by Gour Saha
2018-07-31 18:35:31 -04:00
Wangda Tan
5583711419 Preparing for 3.1.2 release
Change-Id: If2793e2ed2b5b349a9e1f98f78df43f309dcfcbd
2018-07-31 13:08:55 -07:00
Wangda Tan
7b552c9d72 YARN-8418. App local logs could leaked if log aggregation fails to initialize for the app. (Bibin A Chundatt via wangda)
Change-Id: I29a23ca4b219b48c92e7975cd44cddb8b0e04104
(cherry picked from commit 4b540bbfcf02d828052999215c6135603d98f5db)
2018-07-31 12:13:36 -07:00
Jonathan Hung
b91cf90e1c YARN-7974. Allow updating application tracking url after registration. Contributed by Jonathan Hung 2018-07-30 17:57:25 -07:00
bibinchundatt
8cd2a73777 YARN-8584. Several typos in Log Aggregation related classes. Contributed by Szilard Nemeth.
(cherry picked from commit 2b39ad26984d641bad57db2cfcc0b7515ef95f46)
2018-07-31 00:07:08 +05:30
Sunil G
f1eb5777a0 YARN-8591. [ATSv2] NPE while checking for entity acl in non-secure cluster. Contributed by Rohith Sharma K S.
(cherry picked from commit 63e08ec071852640babea9e39780327a0907712a)
2018-07-30 14:49:03 +05:30
bibinchundatt
2e7876a725 YARN-8558. NM recovery level db not cleaned up properly on container finish. Contributed by Bibin A Chundatt.
(cherry picked from commit 3d586841aba99c7df98b2b4d3e48ec0144bad086)
2018-07-28 20:56:35 +05:30
Eric Yang
c2c3eee69c YARN-8508. Release GPU resource for killed container.
Contributed by Chandni Singh

(cherry picked from commit ed9d60e888d0acfd748fda7f66249f5b79a3ed6d)
2018-07-27 19:36:21 -04:00
Billie Rinaldi
06166f2255 YARN-8571. Validate service principal format prior to launching yarn service. Contributed by Eric Yang
(cherry picked from commit b429f19d32d8f60a3535e047ef10cfb3edeb54c8)
2018-07-27 11:35:41 -07:00
Eric Yang
d6627cda9d YARN-8429. Improve diagnostic message when artifact is not set properly.
Contributed by Gour Saha

(cherry picked from commit 8d3c068e59fdddd18e3f8260713fee83c458aa1d)
2018-07-27 11:29:43 -04:00
Eric Yang
177f6045ac YARN-8545. Return allocated resource to RM for failed container.
Contributed by Chandni Singh

(cherry picked from commit 40fad32824d2f8f960c779d78357e62103453da0)
2018-07-26 18:25:41 -04:00
Eric Yang
8e3807afe0 YARN-8330. Improved publishing ALLOCATED events to ATS.
Contributed by Suma Shivaprasad

(cherry picked from commit f93ecf5c1e0b3db27424963814fc01ec43eb76e0)
2018-07-25 18:51:42 -04:00
Eric E Payne
830ef12af8 YARN-4606. CapacityScheduler: applications could get starved because computation of #activeUsers considers pending apps. Contributed by Manikandan R
(cherry picked from commit 9485c9aee6e9bb935c3e6ae4da81d70b621781de)
2018-07-25 16:30:30 +00:00
bibinchundatt
8e65057eb1 YARN-8541. RM startup failure on recovery after user deletion. Contributed by Bibin A Chundatt. 2018-07-25 15:54:32 +05:30
Weiwei Yang
b89624a943 YARN-8546. Resource leak caused by a reserved container being released more than once under async scheduling. Contributed by Tao Yang.
(Cherry-picked from commit 5be9f4a5d05c9cb99348719fe35626b1de3055db)
2018-07-25 17:53:40 +08:00
Haibo Chen
7e7792dd7b YARN-6966. NodeManager metrics may return wrong negative values when NM restart. (Szilard Nemeth via Haibo Chen)
(cherry picked from commit 9d3c39e9dd88b8f32223c01328581bb68507d415)
2018-07-24 12:50:43 -07:00
Sunil G
4488fd8295 YARN-7748. TestContainerResizing.testIncreaseContainerUnreservedWhenApplicationCompleted fails due to multiple container fail events. Contributed by Weiwei Yang.
(cherry picked from commit 35ce6eb1f526ce3db7e015fb1761eee15604100c)
2018-07-24 22:21:15 +05:30
bibinchundatt
a684a2efb8 YARN-8548. AllocationRespose proto setNMToken initBuilder not done. Contributed by Bilwa S T.
(cherry picked from commit ff7c2eda34c2c40ad71b50df6462a661bd213fbd)
2018-07-24 16:30:31 +05:30
bibinchundatt
0710107f8d YARN-8544. [DS] AM registration fails when hadoop authorization is enabled. Contributed by Bibin A Chundatt.
(cherry picked from commit 84612788339392fcda1aef0e27c43f5c6b2a19e5)
2018-07-24 13:11:31 +05:30
Eric Yang
23b8546a80 YARN-8380. Support bind propagation options for mounts in docker runtime.
Contributed by Billie Rinaldi

(cherry picked from commit 8688a0c7f88f2adf1a7fce695e06f3dd1f745080)
2018-07-23 20:13:41 -04:00
Eric Yang
e665c0a9dd YARN-8360. Improve YARN service restart policy and node manager auto restart policy.
Contributed by Suma Shivaprasad

(cherry picked from commit 84d7bf1eeff6b9418361afa4aa713e5e6f771365)
2018-07-23 12:59:37 -04:00
Eric Yang
4f2a129f2e YARN-8301. Added YARN service upgrade instructions.
Contributed by Chandni Singh

(cherry picked from commit 10014a4d88f239d3c072e51bc0739cba1fca9406)
2018-07-20 19:48:19 -04:00
Weiwei Yang
004e1f248e YARN-8528. Final states in ContainerAllocation might be modified externally causing unexpected allocation results. Contributed by Xintong Song. 2018-07-20 22:43:47 +08:00
Eric Yang
76b8beb289 YARN-8501. Reduce complexity of RMWebServices getApps method.
Contributed by Szilard Nemeth

(cherry picked from commit 5836e0a46bf9793e0a61bb8ec46536f4a67d38d7)
2018-07-19 12:32:55 -04:00
Robert Kanter
dfa71428ea YARN-8518. test-container-executor test_is_empty() is broken (Jim_Brennan via rkanter)
(cherry picked from commit 1bc106a738a6ce4f7ed025d556bb44c1ede022e3)
2018-07-18 16:07:48 -07:00
Robert Kanter
1c7d916347 Only mount non-empty directories for cgroups (miklos.szegedi@cloudera.com via rkanter)
(cherry picked from commit 0838fe833738e04f5e6f6408e97866d77bebbf30)
2018-07-18 16:07:48 -07:00
Robert Kanter
27e2b4b364 Disable mounting cgroups by default (miklos.szegedi@cloudera.com via rkanter)
(cherry picked from commit 351cf87c92872d90f62c476f85ae4d02e485769c)
2018-07-18 16:07:48 -07:00
Eric Yang
d82edec3c0 YARN-8538. Fixed memory leaks in container-executor and test cases.
Contributed by Billie Rinaldi
2018-07-18 13:44:49 -04:00
Wangda Tan
44beab0b63 YARN-8511. When AM releases a container, RM removes allocation tags before it is released by NM. (Weiwei Yang via wangda)
Change-Id: I6f9f409f2ef685b405cbff547dea9623bf3322d9
(cherry picked from commit 752dcce5f4cf0f6ebcb40a61f622f1a885c4bda7)
2018-07-16 11:04:08 -07:00
Eric E Payne
9a79e893f7 YARN-8421: when moving app, activeUsers is increased, even though app does not have outstanding request. Contributed by Kyungwan Nam
(cherry picked from commit 937ef39b3ff90f72392b7a319e4346344db34e03)
2018-07-16 16:32:05 +00:00
bibinchundatt
677bbdcdc4 YARN-8434. Update federation documentation of Nodemanager configurations. Contributed by Bibin A Chundatt.
(cherry picked from commit 4523cc5637bc3558aa5796150b358ca8471773bb)
2018-07-15 13:57:14 +05:30
Jason Lowe
7cbb9597c4 YARN-8515. container-executor can crash with SIGPIPE after nodemanager restart. Contributed by Jim Brennan
(cherry picked from commit 17118f446c2387aa796849da8b69a845d9d307d3)
2018-07-13 10:06:38 -05:00
bibinchundatt
2aaad4000a YARN-8491. TestServiceCLI#testEnableFastLaunch fail when umask is 077. Contributed by K G Bakthavachalam.
(cherry picked from commit 52e1bc8539ce769f47743d8b2d318a54c3887ba0)
2018-07-11 16:24:37 +05:30
Sunil G
9b4ead92c8 YARN-8512. ATSv2 entities are not published to HBase from second attempt onwards. Contributed by Rohith Sharma K S.
(cherry picked from commit 7f1d3d0e9dbe328fae0d43421665e0b6907b33fe)
2018-07-11 12:27:12 +05:30
Sunil G
6f10491e64 YARN-8473. Containers being launched as app tears down can leave containers in NEW state. Contributed by Jason Lowe.
(cherry picked from commit 705e2c1f7cba51496b0d019ecedffbe5fb55c28b)
2018-07-10 20:12:47 +05:30
Wangda Tan
8ad82ea5b5 YARN-8506. Make GetApplicationsRequestPBImpl thread safe. (wangda)
Change-Id: I2c006965375823c83036e7f45f7163d13c0bdf90
2018-07-09 11:35:15 -07:00
Rohith Sharma K S
785ed3f8ae YARN-8302. ATS v2 should handle HBase connection issue properly. Contributed by Billie Rinaldi.
(cherry picked from commit ba683204498c97654be4727ab9e128c433a45498)
2018-07-06 15:20:39 -07:00
Xiaoyu Yao
ffe2019887 HADOOP-15571. Multiple FileContexts created with the same configuration object should be allowed to have different umask. Contributed by Vinod Kumar Vavilapalli.
(cherry picked from commit 629f7008b531bad469903e6a1f6f8cb4be691f36)
2018-07-06 11:56:51 -07:00
Sunil G
844538d669 YARN-8459. Improve Capacity Scheduler logs to debug invalid states. Contributed by Wangda Tan.
(cherry picked from commit 51654a3962bcd0482c0e1dd32765e9533e3f9158)
2018-07-03 10:00:05 -07:00
Junping Du
3d6ba2dd4e yarn.resourcemanager.fail-fast is used inconsistently. Contributed by Yuanbo Liu.
(cherry picked from commit d9ba6f3656e8dc97d2813181e27d12e52dca4328)
2018-07-03 22:36:24 +08:00
Sunil G
8ffc191f61 YARN-8415. TimelineWebServices.getEntity should throw ForbiddenException instead of 404 when ACL checks fail. Contributed by Suma Shivaprasad.
(cherry picked from commit fa9ef15ecd6dc30fb260e1c342a2b51505d39b6b)
2018-07-02 15:35:45 -07:00
Shane Kumpf
2f264764a4 YARN-8485. Priviledged container app launch is failing intermittently. Contributed by Eric Yang
(cherry picked from commit 53e267fa7232add3c21174382d91b2607aa6becf)
2018-07-02 16:23:23 -06:00
Eric Yang
0aa7272199 YARN-8465. Fixed docker container status for node manager restart.
Contributed by Shane Kumpf
2018-07-02 13:40:02 -04:00
Jason Lowe
dbb0a5f51f YARN-8451. Multiple NM heartbeat thread created when a slow NM resync with RM. Contributed by Botong Huang
(cherry picked from commit 100470140d86eede0fa240a9aa93226f274ee4f5)
2018-06-29 13:11:41 -05:00
Rohith Sharma K S
00406f583b YARN-8469. [UI2] URL needs to be trimmed to handle index.html redirection while accessing via knox. Contributed by Sunil Govindan.
(cherry picked from commit a820738e347854aed7cdaa7758d9c0830df624e3)
2018-06-29 10:39:30 -07:00
Sunil G
e7f2c9886a YARN-8455. Add basic ACL check for all ATS v2 REST APIs. Contributed by Rohith Sharma K S.
(cherry picked from commit 469b29c0817b7bf1902c9195c4f8d031a909e1c9)
2018-06-29 10:03:53 -07:00
Sunil G
122cb81b65 YARN-8379. Improve balancing resources in already satisfied queues by using Capacity Scheduler preemption. Contributed by Zian Chen.
(cherry picked from commit 291194302cc1a875d6d94ea93cf1184a3f1fc2cc)
2018-06-28 10:24:03 -07:00
Eric Yang
3ce3571a34 YARN-8409. Fixed NPE in ActiveStandbyElectorBasedElectorService.
Contributed by Chandni Singh
2018-06-28 13:11:56 -04:00
Sunil G
3c68d4ce45 YARN-8401. [UI2] new ui is not accessible with out internet connection. Contributed by Bibin A Chundatt.
(cherry picked from commit fbaff369e9b9022723a7b2c6f25e71122a8f8a15)
2018-06-27 10:35:50 -07:00
Wangda Tan
ec37e02bc9 YARN-8464. Async scheduling thread could be interrupted when there are no NodeManagers in cluster. (Sunil G via wangda)
Change-Id: I4f5f856373378685713e77752ba6cf0988a66065
(cherry picked from commit bedc4fe0799cf3b161100acc521fc62a97793427)
2018-06-26 19:34:15 -07:00