Commit Graph

2570 Commits

Author SHA1 Message Date
Ashutosh Gupta 1c99810b89 YARN-8234. Improve RM system metrics publisher's performance by pushing events to timeline server in batch (#3793)
Signed-off-by: Akira Ajisaka <aajisaka@apache.org>
(cherry picked from commit 00e2405fbd)
2021-12-23 17:16:09 +09:00
Eric Payne ccaba2561a YARN-10178: Global Scheduler async thread crash caused by 'Comparison method violates its general contract. Contributed by Andras Gyori (gandras) and Qi Zhu (zhuqi).
(cherry picked from commit e2d6fd075d)
2021-12-21 19:20:21 +00:00
Viraj Jasani b0c1158829
HADOOP-18033. Upgrade fasterxml Jackson to 2.13.0 (#3764)
Signed-off-by: Akira Ajisaka <aajisaka@apache.org>
2021-12-13 13:52:44 +09:00
Shubham Gupta c44f109860
YARN-10438. Handle null containerId in ClientRMService#getContainerReport() (#2313)
Co-authored-by: Shubham Gupta <gshubham@microsoft.com>
(cherry picked from commit e3cd627069)
2021-11-19 00:23:48 +09:00
Chao Sun e079fa6577 Preparing for 3.3.3 development 2021-11-16 16:02:34 -08:00
Ahmed Hussein 742d88b1c6 YARN-1115: Provide optional means for a scheduler to check real user ACLs. Contributed by Eric Payne (epayne) 2021-10-21 17:04:29 +00:00
Benjamin Teke 700045896c
YARN-10869. CS considers only the default maximum-allocation-mb/vcore property as a maximum when it creates dynamic queues (#3504)
Co-authored-by: Benjamin Teke <bteke@cloudera.com>
2021-10-12 18:05:50 +02:00
Neil 88deac0479
YARN-10970. Standby RM should expose prom endpoint (#3480)
Reviewed-by: Adam Antal <adamantal@apache.org>
Signed-off-by: Akira Ajisaka <aajisaka@apache.org>
(cherry picked from commit 4bd0c36189)
2021-09-29 15:48:02 +09:00
Eric Badger 52ba50fd3c YARN-10935. AM Total Queue Limit goes below per-user AM Limit if parent is full. Contributed by Eric Payne.
(cherry picked from commit 43f0a34dd4)
2021-09-16 16:46:44 +00:00
Szilard Nemeth 6c68211062 YARN-10870. Missing user filtering check -> yarn.webapp.filter-entity-list-by-user for RM Scheduler page. Contributed by Gergely Pollak 2021-09-14 18:08:34 +02:00
Szilard Nemeth 0a726250ea
YARN-10428. Zombie applications in the YARN queue using FAIR + sizebasedweight. Contributed by Guang Yang, Andras Gyori
(cherry picked from commit 79a46599f7)
2021-09-01 10:44:15 +09:00
Szilard Nemeth a272adc5fa YARN-10789. RM HA startup can fail due to race conditions in ZKConfigurationStore. Contributed by Tarun Parimi 2021-07-29 19:21:58 +02:00
Szilard Nemeth 72801be13a YARN-10813. Set default capacity of root for node labels. Contributed by Andras Gyori 2021-07-28 14:55:19 +02:00
zhuqi-lucas c31618e6b9 YARN-10860. Make max container per heartbeat configs refreshable. Contributed by Eric Badger. 2021-07-22 10:12:32 +08:00
Jim Brennan b3481062e0 YARN-10456. RM PartitionQueueMetrics records are named QueueMetrics in Simon metrics registry. Contributed by Eric Payne.
(cherry picked from commit 632f64cadb)
2021-07-15 14:26:03 +00:00
Jim Brennan 47b3939009 YARN-10834. Intra-queue preemption: apps that don't use defined custom resource won't be preempted. Contributed by Eric Payne.
(cherry picked from commit dc6f456e95)
2021-06-28 14:55:26 +00:00
Wei-Chiu Chuang 86c28f0639
Revert "HADOOP-17669. Backport HADOOP-17079, HADOOP-17505 to branch-3.3 (#2959)"
This reverts commit 4ffe5eb1dd.
2021-05-24 17:37:18 +08:00
Jim Brennan 53a1c7653f YARN-10337. Amendment to fix import as in HADOOP-17100 2021-05-19 22:00:55 +00:00
Prabhu Joseph 1b3e4cf9ce YARN-10337. Fix failing testcase TestRMHATimelineCollectors.
Contributed by Bilwa S T.

(cherry picked from commit 2bbd00dff4)
2021-05-19 21:19:05 +00:00
Wei-Chiu Chuang fa4915fdbb
Preparing for 3.3.2 development 2021-05-19 21:52:37 +08:00
zhuqi-lucas 7d2eeaecc8 YARN-10701. The yarn.resource-types should support multi types without trimmed. Contributed by Qi Zhu. 2021-05-19 21:24:26 +08:00
Wei-Chiu Chuang 4ffe5eb1dd
HADOOP-17669. Backport HADOOP-17079, HADOOP-17505 to branch-3.3 (#2959)
* HADOOP-17079. Optimize UGI#getGroups by adding UGI#getGroupsSet.

Co-authored-by: Wei-Chiu Chuang <weichiu@apache.org>
Change-Id: I0f31409923ece24a82dfba4c4610d8a38c52d9fb

* HADOOP-17505. public interface GroupMappingServiceProvider needs default impl for getGroupsSet() (#2661). Contributed by Vinayakumar B.

(cherry picked from commit c4c0683dff)

Co-authored-by: Xiaoyu Yao <xyao@apache.org>
Co-authored-by: Vinayakumar B <vinayakumarb@apache.org>
2021-05-17 18:57:46 -07:00
lujiefsi 137e20cc9b
YARN-10555. Missing access check before getAppAttempts (#2608)
Co-authored-by: lujie <lujie@foxmail.com>
Signed-off-by: Akira Ajisaka <aajisaka@apache.org>
(cherry picked from commit d92a25b790)
2021-05-17 13:53:27 +09:00
Peter Bacsko 051a5068dd YARN-9615. Add dispatcher metrics to RM. Contributed by Jonathan Hung and Qi Zhu. 2021-05-11 19:23:45 +02:00
Szilard Nemeth 3d715c2e4c
YARN-9333. TestFairSchedulerPreemption.testRelaxLocalityPreemptionWithNoLessAMInRemainingNodes fails intermittently. Contributed by Peter Bacsko
(cherry picked from commit eacbe07b56)
2021-05-07 14:32:17 +09:00
Szilard Nemeth 3303aa5947
YARN-10515. Fix flaky test TestCapacitySchedulerAutoQueueCreation.testDynamicAutoQueueCreationWithTags. Contributed by Peter Bacsko
(cherry picked from commit 8620984b8d)
2021-05-07 14:26:58 +09:00
Wei-Chiu Chuang 670205c541
HADOOP-17653. Do not use guava's Files.createTempDir(). (#2945)
Reviewed-by: Steve Loughran <stevel@apache.org>
Signed-off-by: Akira Ajisaka <aajisaka@apache.org>
(cherry picked from commit f1e1809029)
2021-05-02 11:12:37 +09:00
Eric Badger 003deeeecf YARN-10479. Can't remove all node labels after add node label without
nodemanager port, broken by YARN-10647. Contributed by D M Murali Krishna Reddy

(cherry picked from commit 6857a05d6a)
2021-04-23 22:14:57 +00:00
Eric Badger 1960924d07 YARN-10723. Change CS nodes page in UI to support custom resource. Contributed by Qi Zhu
(cherry picked from commit 6cb90005a7)
2021-04-20 17:46:05 +00:00
Eric Badger 1658a5140a YARN-10503. Support queue capacity in terms of absolute resources with custom
resourceType. Contributed by Qi Zhu.
2021-04-09 17:51:01 +00:00
Eric Badger fb5809984e YARN-10702. Add cluster metric for amount of CPU used by RM Event Processor.
Contributed by Jim Brennan.
2021-04-06 23:34:35 +00:00
Eric Badger 65bba8c3ed YARN-10713. ClusterMetrics should support custom resource capacity related metrics. Contributed by Qi Zhu.
(cherry picked from commit 19e418c10d)
2021-03-25 22:35:19 +00:00
Jim Brennan 78bddd0d9f YARN-10697. Resources are displayed in bytes in UI for schedulers other than capacity. Contributed by Bilwa S T.
(cherry picked from commit 174f3a96b1)
2021-03-23 18:23:50 +00:00
Eric Badger cd417f17ae YARN-10688. ClusterMetrics should support GPU capacity related metrics.. Contributed by Qi Zhu.
(cherry picked from commit 49f89f1d3d)
2021-03-17 18:16:59 +00:00
Eric Payne f5810ea83c YARN-10588. Percentage of queue and cluster is zero in WebUI . Contributed by Bilwa S T
(cherry picked from commit aa4c17b9d7)
2021-03-15 19:14:19 +00:00
Brahma Reddy Battula f12293fba2 YARN-10671.Fix Typo in TestSchedulingRequestContainerAllocation. Contributed by D M Murali Krishna Reddy.
(cherry picked from commit b2a565629d)
2021-03-09 20:27:07 +05:30
Peter Bacsko 066f89af01 YARN-10672. All testcases in TestReservations are flaky. Contributed by Szilard Nemeth. 2021-03-08 11:42:59 +01:00
Neil 0396a721e3 YARN-10649. Fix RMNodeImpl.updateExistContainers leak (#2719). Contributed by Max Xie
(cherry picked from commit d615e2d3bd)
2021-03-04 14:54:28 +05:30
Jonathan Hung be6e99963d YARN-10651. CapacityScheduler crashed with NPE in AbstractYarnScheduler.updateNodeResource(). Contributed by Haibo Chen
(cherry picked from commit f348ab3f2f468751af329a1ffce4917cb000fcbf)
2021-02-25 15:09:33 -08:00
Jim Brennan db457b056a [YARN-10613] Config to allow Intra- and Inter-queue preemption to enable/disable conservativeDRF. Contributed by Eric Payne
(cherry picked from commit c373da9f88)
2021-02-25 16:48:46 +00:00
Inigo Goiri 8c8ef2f444 YARN-9017. PlacementRule order is not maintained in CS. Contributed by Bilwa S T.
(cherry picked from commit 35010120fb)
2021-02-18 20:42:26 +05:30
Prabhu Joseph 72904c014d YARN-10361. Make custom DAO classes configurable into RMWebApp#JAXBContextResolver.
Contributed by Bilwa ST.

(cherry picked from commit c7e71a6c0b)
2021-02-18 14:25:16 +05:30
Prabhu Joseph 0c46ab51b5 YARN-8047. RMWebApp make external class pluggable.
Contributed by Bilwa S T.

(cherry picked from commit 3a4d05b850)
2021-02-18 13:59:50 +05:30
Masatake Iwasaki 4468378e4b YARN-10500. TestDelegationTokenRenewer fails intermittently. (#2619)
(cherry picked from commit f9a073c6c1)
2021-02-11 20:26:09 +00:00
bibinchundatt 1520b84b36 YARN-10519. Refactor QueueMetricsForCustomResources class to move to yarn-common package. Contributed by Minni Mittal
(cherry picked from commit 8bc2dfbf36)
2021-01-22 08:30:12 +05:30
Neil cd5ee0014f YARN-10541. capture the performance metrics of ZKRMStateStore (#2568)
(cherry picked from commit fa4cf91b57)
2021-01-08 10:38:08 -08:00
Szilard Nemeth f6b9f82b3f YARN-10528. maxAMShare should only be accepted for leaf queues, not parent queues. Contributed by Siddharth Ahuja 2021-01-08 12:41:17 +01:00
srinivasst 98565b6c60 YARN-10538: Add RECOMMISSIONING nodes to the list of updated nodes returned to the AM (#2564)
Contributed by Srinivas S T

(cherry picked from commit 1b1791075a)
2021-01-08 10:57:37 +05:30
Ayush Saxena 8378ab9f92 HADOOP-17288. Use shaded guava from thirdparty. Contributed by Ayush Saxena. #2505 2020-12-10 05:50:55 +05:30
Eric Payne 1fd6d81617 YARN-10278: CapacityScheduler test framework ProportionalCapacityPreemptionPolicyMockFramework. Contributed by Szilard Nemeth (snemeth)
(cherry picked from commit fa773a8326)
2020-12-01 22:51:20 +00:00