Commit Graph

4907 Commits

Author SHA1 Message Date
Rohith Sharma K S a3d4a25bbf YARN-8679. [ATSv2] If HBase cluster is down for long time, high chances that NM ContainerManager dispatcher get blocked. Contributed by Wangda Tan.
(cherry picked from commit 4aacbfff60)
2018-08-18 11:04:09 +05:30
Rohith Sharma K S 3532bd5c8b YARN-8612. Fix NM Collector Service Port issue in YarnConfiguration. Contributed by Prabha Manepalli.
(cherry picked from commit 1697a02306)
2018-08-17 11:13:25 +05:30
Eric Yang 5237bdfb5a YARN-8667. Cleanup symlinks when container restarted by NM.
Contributed by Chandni Singh

(cherry picked from commit d42806160e)
2018-08-16 18:44:47 -04:00
Eric Yang 97c1934247 YARN-8474. Fixed ApiServiceClient kerberos negotiation.
Contributed by Billie Rinaldi

(cherry picked from commit 8990eaf592)
2018-08-16 12:50:52 -04:00
Jason Lowe 819a2a6f10 YARN-8656. container-executor should not write cgroup tasks files for docker containers. Contributed by Jim Brennan
(cherry picked from commit cb21eaa026)
2018-08-16 10:09:56 -05:00
Weiwei Yang 05547b1e0b YARN-8614. Fix few annotation typos in YarnConfiguration. Contributed by Sen Zhao.
(cherry picked from commit 8dc07b4723)
2018-08-15 12:38:55 +08:00
Jason Lowe 95cd6de5c6 YARN-8640. Restore previous state in container-executor after failure. Contributed by Jim Brennan
(cherry picked from commit d1d129aa9d)
2018-08-14 10:26:21 -05:00
Shane Kumpf ba6b48e19f YARN-8520. Document best practice for user management. Contributed by Eric Yang
(cherry picked from commit e7951c69cb)
2018-08-10 14:47:14 -06:00
Weiwei Yang 734bc42289 YARN-8575. Avoid committing allocation proposal to unavailable nodes in async scheduling. Contributed by Tao Yang.
(cherry picked from commit 0a71bf1452)
2018-08-10 15:10:27 +08:00
Weiwei Yang 991514f7c3 YARN-8521. NPE in AllocationTagsManager when a container is removed more than once. Contributed by Weiwei Yang.
(cherry picked from commit 08d5060605)
2018-08-10 08:44:53 +08:00
Wangda Tan beea18736b YARN-8136. Add version attribute to site doc examples and quickstart. (Eric Yang via wangda)
Change-Id: I4541b239f490ca0a6edf9698e0d3deaf83669151
(cherry picked from commit 8244abb7ae)
2018-08-09 12:04:33 -07:00
Wangda Tan 68279fcd65 YARN-8588. Logging improvements for better debuggability. (Suma Shivaprasad via wangda)
Change-Id: I66aa4b0ec031ae5ce0fae558e2f8cbcbbfebc442
(cherry picked from commit 344c335a92)
2018-08-09 12:04:25 -07:00
Weiwei Yang 0ee7e80047 YARN-8559. Expose mutable-conf scheduler's configuration in RM /scheduler-conf endpoint. Contributed by Weiwei Yang.
(cherry picked from commit d352f167eb)
2018-08-10 00:43:53 +08:00
Jason Lowe 3dd299a770 YARN-8331. Race condition in NM container launched after done. Contributed by Pradeep Ambati
(cherry picked from commit cd04e954d2)
2018-08-09 10:23:02 -05:00
Sunil G c6013b060d YARN-8633. Update DataTables version in yarn-common in line with JQuery 3 upgrade. Contributed by Akhil PB.
(cherry picked from commit 00013d6ef7)
2018-08-09 12:35:21 +05:30
Bharat Viswanadham 5430aab133 YARN-8426:Upgrade jquery-ui to 1.12.1 in YARN. Contributed by Sunil Govindan
(cherry picked from commit 361ffb26be)
2018-08-09 12:27:50 +05:30
Robert Kanter 0ed91309d3 YARN-8568. Replace the deprecated zk-address property in the HA config example in ResourceManagerHA.md (bsteinbach via rkanter)
(cherry picked from commit 8478732bb2)
2018-08-08 15:10:52 -07:00
Wangda Tan f16c311486 Make 3.1.1 awared by other branches - adding missing files
(cherry picked from commit 3214cd75ac)
2018-08-08 13:06:14 -07:00
Wangda Tan 781620a44d Make 3.1.1 awared by other branches
(cherry picked from commit 49c687608b)
2018-08-08 13:04:05 -07:00
bibinchundatt b4f957e655 YARN-8601. Print ExecutionType in Container report CLI. Contributed by Bilwa S T.
(cherry picked from commit ff06bd1be8)
2018-08-08 22:46:16 +05:30
Wangda Tan f2768eaa38 YARN-8407. Container launch exception in AM log should be printed in ERROR level. (Yesha Vora via wangda)
Change-Id: I154e873df1df3503a09c41d6b3874ca195af91d9
(cherry picked from commit 861095f761)
2018-08-07 13:07:28 -07:00
Wangda Tan 450c791ecf YARN-8629. Container cleanup fails while trying to delete Cgroups. (Suma Shivaprasad via wangda)
Change-Id: I392ef4f8baa84d5d7b1f2e438c560b5426b6d4f2
(cherry picked from commit d4258fcad7)
2018-08-07 12:41:55 -07:00
Eric Yang 947dc34229 YARN-8624. Updated verbiage around entry point support.
Contributed by Craig Condit
2018-08-06 12:39:22 -04:00
Shane Kumpf 7773904974 YARN-8287. Update documentation and yarn-default related to the Docker runtime. Contributed by Craig Condit
(cherry picked from commit bcfc9851f4)
2018-08-03 18:23:37 -06:00
Sunil G d8fdcf8b42 YARN-8615. [UI2] Resource Usage tab shows only memory related info. No info available for vcores/gpu. Contributed by Akhil PB.
(cherry picked from commit 3426f4062a)
2018-08-03 19:36:19 +05:30
Sunil G 1bfce639e8 YARN-8620. [UI2] YARN Services UI new submission failures are not debuggable. Contributed by Akhil PB.
(cherry picked from commit 29417dbbf4)
2018-08-03 19:19:17 +05:30
Rohith Sharma K S 4816d10ddb YARN-8608. [UI2] No information available per application appAttempt about 'Total Outstanding Resource Requests'. Contributed by Akhil PB.
(cherry picked from commit 022592ae79)
2018-08-03 15:33:36 +05:30
Rohith Sharma K S 83ccb75b91 YARN-8603. [UI2] Latest run application should be listed first in the RM UI. Contributed by Akhil PB.
(cherry picked from commit 33482d35e8)
2018-08-03 15:33:24 +05:30
Jason Lowe 619019ccca YARN-8263. DockerClient still touches hadoop.tmp.dir. Contributed by Craig Condit
(cherry picked from commit 7526815e32)
2018-08-02 10:45:52 -05:00
Sunil G d76ab31961 YARN-8318. [UI2] IP address in component page shows N/A. Contributed by Yesha Vora.
(cherry picked from commit 5033d7da8f)
2018-08-02 20:10:08 +05:30
Sunil G b1fcbe7c2d YARN-8592. [UI2] rmip:port/ui2 endpoint shows a blank page in windows OS and Chrome browser. Contributed by Akhil PB.
(cherry picked from commit 97870ec1f6)
2018-08-02 16:12:12 +05:30
Sunil G 35b0686f24 YARN-8594. [UI2] Display current logged in user. Contributed by Akhil PB.
(cherry picked from commit 1ea81169ba)
2018-08-02 12:41:59 +05:30
Sunil G 1f77b20f08 YARN-8593. Add RM web service endpoint to get user information. Contributed by Akhil PB.
(cherry picked from commit 735b492556)
2018-08-02 08:35:54 +05:30
Eric Yang 680dac26d8 YARN-8610. Fixed initiate upgrade error message.
Contributed by Chandni Singh

(cherry picked from commit 23f394240e)
2018-08-01 20:43:22 -04:00
Shane Kumpf 62cc373dc5 YARN-8600. RegistryDNS hang when remote lookup does not reply. Contributed by Eric Yang
(cherry picked from commit 603a57476c)
2018-08-01 12:31:28 -06:00
Billie Rinaldi 2a94823f32 YARN-8403. Change the log level for fail to download resource from INFO to ERROR. Contributed by Eric Yang
(cherry picked from commit 67c65da261)
2018-08-01 08:58:15 -07:00
Sunil G a6525e0744 YARN-8595. [UI2] Container diagnostic information is missing from container page. Contributed by Akhil PB.
(cherry picked from commit d920b9db77)
2018-08-01 14:28:35 +05:30
Sunil G ff35f0c308 YARN-8606. Opportunistic scheduling does not work post RM failover. Contributed by Bibin A Chundatt.
(cherry picked from commit a48a0cc7fd)
2018-08-01 12:17:53 +05:30
Wangda Tan f4a3b26f9a YARN-8522. Application fails with InvalidResourceRequestException. (Zian Chen via wangda)
Change-Id: I34dd7fa49bd4d10580c4a78051033b1068d28f1e
(cherry picked from commit 5cc8e99147)
2018-07-31 22:35:16 -07:00
Sunil G cbfd7358d2 YARN-8397. Potential thread leak in ActivitiesManager. Contributed by Rohith Sharma K S.
(cherry picked from commit 6310c0d17d)
2018-08-01 08:34:09 +05:30
Eric Yang 7640d62716 YARN-8579. Recover NMToken of previous attempted component data.
Contributed by Gour Saha
2018-07-31 18:35:31 -04:00
Wangda Tan 5583711419 Preparing for 3.1.2 release
Change-Id: If2793e2ed2b5b349a9e1f98f78df43f309dcfcbd
2018-07-31 13:08:55 -07:00
Wangda Tan 7b552c9d72 YARN-8418. App local logs could leaked if log aggregation fails to initialize for the app. (Bibin A Chundatt via wangda)
Change-Id: I29a23ca4b219b48c92e7975cd44cddb8b0e04104
(cherry picked from commit 4b540bbfcf)
2018-07-31 12:13:36 -07:00
Jonathan Hung b91cf90e1c YARN-7974. Allow updating application tracking url after registration. Contributed by Jonathan Hung 2018-07-30 17:57:25 -07:00
bibinchundatt 8cd2a73777 YARN-8584. Several typos in Log Aggregation related classes. Contributed by Szilard Nemeth.
(cherry picked from commit 2b39ad2698)
2018-07-31 00:07:08 +05:30
Sunil G f1eb5777a0 YARN-8591. [ATSv2] NPE while checking for entity acl in non-secure cluster. Contributed by Rohith Sharma K S.
(cherry picked from commit 63e08ec071)
2018-07-30 14:49:03 +05:30
bibinchundatt 2e7876a725 YARN-8558. NM recovery level db not cleaned up properly on container finish. Contributed by Bibin A Chundatt.
(cherry picked from commit 3d586841ab)
2018-07-28 20:56:35 +05:30
Eric Yang c2c3eee69c YARN-8508. Release GPU resource for killed container.
Contributed by Chandni Singh

(cherry picked from commit ed9d60e888)
2018-07-27 19:36:21 -04:00
Billie Rinaldi 06166f2255 YARN-8571. Validate service principal format prior to launching yarn service. Contributed by Eric Yang
(cherry picked from commit b429f19d32)
2018-07-27 11:35:41 -07:00
Eric Yang d6627cda9d YARN-8429. Improve diagnostic message when artifact is not set properly.
Contributed by Gour Saha

(cherry picked from commit 8d3c068e59)
2018-07-27 11:29:43 -04:00
Eric Yang 177f6045ac YARN-8545. Return allocated resource to RM for failed container.
Contributed by Chandni Singh

(cherry picked from commit 40fad32824)
2018-07-26 18:25:41 -04:00
Eric Yang 8e3807afe0 YARN-8330. Improved publishing ALLOCATED events to ATS.
Contributed by Suma Shivaprasad

(cherry picked from commit f93ecf5c1e)
2018-07-25 18:51:42 -04:00
Eric E Payne 830ef12af8 YARN-4606. CapacityScheduler: applications could get starved because computation of #activeUsers considers pending apps. Contributed by Manikandan R
(cherry picked from commit 9485c9aee6)
2018-07-25 16:30:30 +00:00
bibinchundatt 8e65057eb1 YARN-8541. RM startup failure on recovery after user deletion. Contributed by Bibin A Chundatt. 2018-07-25 15:54:32 +05:30
Weiwei Yang b89624a943 YARN-8546. Resource leak caused by a reserved container being released more than once under async scheduling. Contributed by Tao Yang.
(Cherry-picked from commit 5be9f4a5d0)
2018-07-25 17:53:40 +08:00
Haibo Chen 7e7792dd7b YARN-6966. NodeManager metrics may return wrong negative values when NM restart. (Szilard Nemeth via Haibo Chen)
(cherry picked from commit 9d3c39e9dd)
2018-07-24 12:50:43 -07:00
Sunil G 4488fd8295 YARN-7748. TestContainerResizing.testIncreaseContainerUnreservedWhenApplicationCompleted fails due to multiple container fail events. Contributed by Weiwei Yang.
(cherry picked from commit 35ce6eb1f5)
2018-07-24 22:21:15 +05:30
bibinchundatt a684a2efb8 YARN-8548. AllocationRespose proto setNMToken initBuilder not done. Contributed by Bilwa S T.
(cherry picked from commit ff7c2eda34)
2018-07-24 16:30:31 +05:30
bibinchundatt 0710107f8d YARN-8544. [DS] AM registration fails when hadoop authorization is enabled. Contributed by Bibin A Chundatt.
(cherry picked from commit 8461278833)
2018-07-24 13:11:31 +05:30
Eric Yang 23b8546a80 YARN-8380. Support bind propagation options for mounts in docker runtime.
Contributed by Billie Rinaldi

(cherry picked from commit 8688a0c7f8)
2018-07-23 20:13:41 -04:00
Eric Yang e665c0a9dd YARN-8360. Improve YARN service restart policy and node manager auto restart policy.
Contributed by Suma Shivaprasad

(cherry picked from commit 84d7bf1eef)
2018-07-23 12:59:37 -04:00
Eric Yang 4f2a129f2e YARN-8301. Added YARN service upgrade instructions.
Contributed by Chandni Singh

(cherry picked from commit 10014a4d88f239d3c072e51bc0739cba1fca9406)
2018-07-20 19:48:19 -04:00
Weiwei Yang 004e1f248e YARN-8528. Final states in ContainerAllocation might be modified externally causing unexpected allocation results. Contributed by Xintong Song. 2018-07-20 22:43:47 +08:00
Eric Yang 76b8beb289 YARN-8501. Reduce complexity of RMWebServices getApps method.
Contributed by Szilard Nemeth

(cherry picked from commit 5836e0a46b)
2018-07-19 12:32:55 -04:00
Robert Kanter dfa71428ea YARN-8518. test-container-executor test_is_empty() is broken (Jim_Brennan via rkanter)
(cherry picked from commit 1bc106a738)
2018-07-18 16:07:48 -07:00
Robert Kanter 1c7d916347 Only mount non-empty directories for cgroups (miklos.szegedi@cloudera.com via rkanter)
(cherry picked from commit 0838fe8337)
2018-07-18 16:07:48 -07:00
Robert Kanter 27e2b4b364 Disable mounting cgroups by default (miklos.szegedi@cloudera.com via rkanter)
(cherry picked from commit 351cf87c92)
2018-07-18 16:07:48 -07:00
Eric Yang d82edec3c0 YARN-8538. Fixed memory leaks in container-executor and test cases.
Contributed by Billie Rinaldi
2018-07-18 13:44:49 -04:00
Wangda Tan 44beab0b63 YARN-8511. When AM releases a container, RM removes allocation tags before it is released by NM. (Weiwei Yang via wangda)
Change-Id: I6f9f409f2ef685b405cbff547dea9623bf3322d9
(cherry picked from commit 752dcce5f4)
2018-07-16 11:04:08 -07:00
Eric E Payne 9a79e893f7 YARN-8421: when moving app, activeUsers is increased, even though app does not have outstanding request. Contributed by Kyungwan Nam
(cherry picked from commit 937ef39b3f)
2018-07-16 16:32:05 +00:00
bibinchundatt 677bbdcdc4 YARN-8434. Update federation documentation of Nodemanager configurations. Contributed by Bibin A Chundatt.
(cherry picked from commit 4523cc5637)
2018-07-15 13:57:14 +05:30
Jason Lowe 7cbb9597c4 YARN-8515. container-executor can crash with SIGPIPE after nodemanager restart. Contributed by Jim Brennan
(cherry picked from commit 17118f446c)
2018-07-13 10:06:38 -05:00
bibinchundatt 2aaad4000a YARN-8491. TestServiceCLI#testEnableFastLaunch fail when umask is 077. Contributed by K G Bakthavachalam.
(cherry picked from commit 52e1bc8539)
2018-07-11 16:24:37 +05:30
Sunil G 9b4ead92c8 YARN-8512. ATSv2 entities are not published to HBase from second attempt onwards. Contributed by Rohith Sharma K S.
(cherry picked from commit 7f1d3d0e9d)
2018-07-11 12:27:12 +05:30
Sunil G 6f10491e64 YARN-8473. Containers being launched as app tears down can leave containers in NEW state. Contributed by Jason Lowe.
(cherry picked from commit 705e2c1f7c)
2018-07-10 20:12:47 +05:30
Wangda Tan 8ad82ea5b5 YARN-8506. Make GetApplicationsRequestPBImpl thread safe. (wangda)
Change-Id: I2c006965375823c83036e7f45f7163d13c0bdf90
2018-07-09 11:35:15 -07:00
Rohith Sharma K S 785ed3f8ae YARN-8302. ATS v2 should handle HBase connection issue properly. Contributed by Billie Rinaldi.
(cherry picked from commit ba68320449)
2018-07-06 15:20:39 -07:00
Xiaoyu Yao ffe2019887 HADOOP-15571. Multiple FileContexts created with the same configuration object should be allowed to have different umask. Contributed by Vinod Kumar Vavilapalli.
(cherry picked from commit 629f7008b531bad469903e6a1f6f8cb4be691f36)
2018-07-06 11:56:51 -07:00
Sunil G 844538d669 YARN-8459. Improve Capacity Scheduler logs to debug invalid states. Contributed by Wangda Tan.
(cherry picked from commit 51654a3962)
2018-07-03 10:00:05 -07:00
Junping Du 3d6ba2dd4e yarn.resourcemanager.fail-fast is used inconsistently. Contributed by Yuanbo Liu.
(cherry picked from commit d9ba6f3656)
2018-07-03 22:36:24 +08:00
Sunil G 8ffc191f61 YARN-8415. TimelineWebServices.getEntity should throw ForbiddenException instead of 404 when ACL checks fail. Contributed by Suma Shivaprasad.
(cherry picked from commit fa9ef15ecd)
2018-07-02 15:35:45 -07:00
Shane Kumpf 2f264764a4 YARN-8485. Priviledged container app launch is failing intermittently. Contributed by Eric Yang
(cherry picked from commit 53e267fa72)
2018-07-02 16:23:23 -06:00
Eric Yang 0aa7272199 YARN-8465. Fixed docker container status for node manager restart.
Contributed by Shane Kumpf
2018-07-02 13:40:02 -04:00
Jason Lowe dbb0a5f51f YARN-8451. Multiple NM heartbeat thread created when a slow NM resync with RM. Contributed by Botong Huang
(cherry picked from commit 100470140d)
2018-06-29 13:11:41 -05:00
Rohith Sharma K S 00406f583b YARN-8469. [UI2] URL needs to be trimmed to handle index.html redirection while accessing via knox. Contributed by Sunil Govindan.
(cherry picked from commit a820738e34)
2018-06-29 10:39:30 -07:00
Sunil G e7f2c9886a YARN-8455. Add basic ACL check for all ATS v2 REST APIs. Contributed by Rohith Sharma K S.
(cherry picked from commit 469b29c081)
2018-06-29 10:03:53 -07:00
Sunil G 122cb81b65 YARN-8379. Improve balancing resources in already satisfied queues by using Capacity Scheduler preemption. Contributed by Zian Chen.
(cherry picked from commit 291194302c)
2018-06-28 10:24:03 -07:00
Eric Yang 3ce3571a34 YARN-8409. Fixed NPE in ActiveStandbyElectorBasedElectorService.
Contributed by Chandni Singh
2018-06-28 13:11:56 -04:00
Sunil G 3c68d4ce45 YARN-8401. [UI2] new ui is not accessible with out internet connection. Contributed by Bibin A Chundatt.
(cherry picked from commit fbaff369e9)
2018-06-27 10:35:50 -07:00
Wangda Tan ec37e02bc9 YARN-8464. Async scheduling thread could be interrupted when there are no NodeManagers in cluster. (Sunil G via wangda)
Change-Id: I4f5f856373378685713e77752ba6cf0988a66065
(cherry picked from commit bedc4fe079)
2018-06-26 19:34:15 -07:00
Wangda Tan 8f226f4f1d YARN-8423. GPU does not get released even though the application gets killed. (Sunil G via wangda)
Change-Id: I570db7d60f8c6c21762dd618a9207d1107c486a0
(cherry picked from commit ada8f63d0b)
2018-06-26 19:34:10 -07:00
Sunil G aa8db00fc3 HADOOP-15483. Upgrade jquery to version 3.3.1. Contributed by Lokesh Jain, Mukul Kumar Singh and Sunil Govindan. 2018-06-26 15:02:49 -07:00
Eric Yang f6aa6a6981 YARN-8108. Added option to disable loading existing filters to prevent
security filter from initialize twice.
            Contributed by Sunil Govindan

(cherry picked from commit b69ba0f330)
2018-06-26 17:45:13 -04:00
Eric Yang 9e9530f505 YARN-8214. Change default RegistryDNS port.
Contributed by Billie Rinaldi

(cherry picked from commit 3e586330eb)
2018-06-26 14:26:26 -04:00
Rohith Sharma K S e925de6483 YARN-8457. Compilation is broken with -Pyarn-ui.
(cherry picked from commit 4ffe68a6f7)
2018-06-25 10:39:26 -07:00
Weiwei Yang bfdbc9dea2 YARN-8443. Total #VCores in cluster metrics is wrong when CapacityScheduler reserved some containers. Contributed by Tao Yang.
(Cherry picked from commit 440140cea6)
2018-06-25 09:28:57 +08:00
Eric Yang 3b3a85002d YARN-8326. Removed exit code file check for launched container.
Contributed by Shane Kumpf

(cherry picked from commit 8a32bc39eb)
2018-06-22 19:23:57 -04:00
Eric Yang 63ce762959 YARN-8445. Improved error message for duplicated service and component names.
Contributed by Chandni Singh

(cherry picked from commit 9f15483c5d)
2018-06-21 11:21:34 -04:00
Eric Yang d69c2e7867 YARN-8410. Fixed a bug in A record lookup by CNAME record.
Contributed by Shane Kumpf

(cherry picked from commit 9591765040)
2018-06-14 15:55:33 -04:00
Rohith Sharma K S 1aa6c9407c YARN-8155. Improve ATSv2 client logging in RM and NM publisher. Contributed by Abhishek Modi.
(cherry picked from commit 9119b3cf8f)
2018-06-14 13:51:13 +05:30