Jian He
9ebbf1bfce
YARN-3361. CapacityScheduler side changes to support non-exclusive node labels. Contributed by Wangda Tan
...
(cherry picked from commit 0fefda645b
)
2015-04-14 11:46:35 -07:00
Jian He
81bbee6852
YARN-3266. RMContext#inactiveNodes should have NodeId as map key. Contributed by Chengbing Liu
...
(cherry picked from commit b46ee1e7a3
)
2015-04-14 11:31:42 -07:00
Jian He
d58f5c8894
YARN-3472. Fixed possible leak in DelegationTokenRenewer#allTokens. Contributed by Rohith Sharmaks
...
(cherry picked from commit a1afbc48b5
)
2015-04-13 14:07:51 -07:00
Junping Du
0891de0712
YARN-1376. NM need to notify the log aggregation status to RM through Node heartbeat. Contributed by Xuan Gong.
...
(cherry picked from commit 92431c9617
)
2015-04-10 08:58:07 -07:00
Xuan
f5b49160d9
YARN-3293. Track and display capacity scheduler health metrics in web UI. Contributed by Varun Vasudev
...
(cherry picked from commit afa5d4715a
)
2015-04-09 23:40:02 -07:00
Vinod Kumar Vavilapalli
1ff3fd33ed
YARN-3055. Fixed ResourceManager's DelegationTokenRenewer to not stop token renewal of applications part of a bigger workflow. Contributed by Daryn Sharp.
...
(cherry picked from commit 9c5911294e
)
2015-04-09 13:10:06 -07:00
Robert Kanter
354d2c75cf
YARN-2429. TestAMRMTokens.testTokenExpiry fails intermittently with error message: Invalid AMRMToken (zxu via rkanter)
...
(cherry picked from commit 99b08a748e
)
2015-04-06 14:11:50 -07:00
Tsuyoshi Ozawa
d2b2d76cce
TestFairScheduler.testContinuousScheduling fails Intermittently. Contributed by Zhihai Xu.
...
(cherry picked from commit 53959e69f7
)
2015-04-06 20:19:47 +09:00
Sandy Ryza
6286cfd711
YARN-3415. Non-AM containers can be counted towards amResourceUsage of a fairscheduler queue (Zhihai Xu via Sandy Ryza)
2015-04-02 13:57:53 -07:00
Xuan
e26b6e55e9
YARN-3248. Display count of nodes blacklisted by apps in the web UI. Contributed by Varun Vasudev
...
(cherry picked from commit 4728bdfa15
)
2015-04-01 04:20:37 -07:00
Karthik Kambatla
85c420a4d4
YARN-3412. RM tests should use MockRM where possible. (kasha)
...
(cherry picked from commit 79f7f2aabf
)
2015-03-31 09:22:16 -07:00
Wangda Tan
cba4ed1678
YARN-2495. Allow admin to specify labels from each NM (Distributed configuration for node label). (Naganarasimha G R via wangda)
...
(cherry picked from commit 2a945d24f7
)
2015-03-30 12:05:54 -07:00
Karthik Kambatla
75591e4131
YARN-3241. FairScheduler handles invalid queue names inconsistently. (Zhihai Xu via kasha)
...
(cherry picked from commit 2bc097cd14
)
2015-03-23 13:24:22 -07:00
cnauroth
342c525eaa
YARN-3336. FileSystem memory leak in DelegationTokenRenewer.
...
(cherry picked from commit 6ca1f12024
)
2015-03-23 10:46:06 -07:00
Jian He
9c494cedaf
YARN-3345. Add non-exclusive node label API. Contributed by Wangda Tan
...
(cherry picked from commit e1feb4ea1a
)
2015-03-20 19:05:14 -07:00
Jian He
a3a155a34e
YARN-3356. Capacity Scheduler FiCaSchedulerApp should use ResourceUsage to track used-resources-by-label. Contributed by Wangda Tan
...
(cherry picked from commit 586348e4cb
)
2015-03-20 13:54:46 -07:00
Devaraj K
a888dac344
YARN-3357. Move TestFifoScheduler to FIFO package. Contributed by Rohith Sharmaks.
...
(cherry picked from commit 93d0f4acc8
)
2015-03-19 12:18:08 +05:30
Jian He
15ebacf03a
YARN-3273. Improve scheduler UI to facilitate scheduling analysis and debugging. Contributed by Rohith Sharmaks
...
(cherry picked from commit 658097d6da
)
2015-03-17 21:31:20 -07:00
Tsuyoshi Ozawa
b00b216a91
YARN-3205. FileSystemRMStateStore should disable FileSystem Cache to avoid getting a FileSystem with an old configuration. Contributed by Zhihai Xu.
...
(cherry picked from commit 3bc72cc16d
)
2015-03-18 11:53:40 +09:00
Jian He
ab34e6975b
YARN-3305. Normalize AM resource request on app submission. Contributed by Rohith Sharmaks
...
(cherry picked from commit 968425e9f7
)
2015-03-17 13:50:44 -07:00
Jian He
1c601e492f
YARN-3243. CapacityScheduler should pass headroom from parent to children to make sure ParentQueue obeys its capacity limits. Contributed by Wangda Tan.
...
(cherry picked from commit 487374b7fe
)
2015-03-17 10:25:07 -07:00
Tsuyoshi Ozawa
a20427d339
YARN-3349. Treat all exceptions as failure in TestFSRMStateStore#testFSRMStateStoreClientRetry. Contributed by Zhihai Xu.
...
(cherry picked from commit 7522a643fa
)
2015-03-17 08:10:12 +09:00
Vinod Kumar Vavilapalli
53aa3a4d1f
YARN-3154. Added additional APIs in LogAggregationContext to avoid aggregating running logs of application when rolling is enabled. Contributed by Xuan Gong.
...
(cherry picked from commit 863079bb87
)
2015-03-12 13:33:42 -07:00
Zhijie Shen
426535007b
YARN-1884. Added nodeHttpAddress into ContainerReport and fixed the link to NM web page. Contributed by Xuan Gong.
...
(cherry picked from commit 85f6d67fa7
)
2015-03-11 19:39:46 -07:00
Jason Lowe
ef3d9bdf6b
YARN-3275. CapacityScheduler: Preemption happening on non-preemptable queues. Contributed by Eric Payne
...
(cherry picked from commit 27e8ea820f
)
2015-03-06 22:45:09 +00:00
Jian He
a5f3fb4dc1
YARN-1809. Synchronize RM and TimeLineServer Web-UIs. Contributed by Zhijie Shen and Xuan Gong
...
(cherry picked from commit 95bfd087dc
)
2015-03-05 21:27:03 -08:00
Karthik Kambatla
0d62e94887
YARN-3242. Asynchrony in ZK-close can lead to ZKRMStateStore watcher receiving events for old client. (Zhihai Xu via kasha)
...
(cherry picked from commit 8d88691d16
)
2015-03-04 19:49:21 -08:00
Karthik Kambatla
721d7b5741
YARN-3231. FairScheduler: Changing queueMaxRunningApps interferes with pending jobs. (Siqi Li via kasha)
...
(cherry picked from commit 22426a1c9f
)
2015-03-04 18:07:33 -08:00
Jian He
888a445638
YARN-3222. Fixed NPE on RMNodeImpl#ReconnectNodeTransition when a node is reconnected with a different port. Contributed by Rohith Sharmaks
...
(cherry picked from commit b2f1ec312e
)
2015-03-03 16:28:55 -08:00
Wangda Tan
0a502c665b
YARN-3272. Surface container locality info in RM web UI (Jian He via wangda)
...
(cherry picked from commit e17e5ba9d7
)
2015-03-03 11:49:45 -08:00
Vinod Kumar Vavilapalli
253c78548b
YARN-3265. Fixed a deadlock in CapacityScheduler by always passing a queue's available resource-limit from the parent queue. Contributed by Wangda Tan.
...
(cherry picked from commit 14dd647c55
)
2015-03-02 17:54:01 -08:00
Wangda Tan
0b0be0056b
YARN-3262. Surface application outstanding resource requests table in RM web UI. (Jian He via wangda)
...
(cherry picked from commit edcecedc1c
)
2015-02-27 16:14:35 -08:00
Tsuyoshi Ozawa
79f73f4613
YARN-2820. Retry in FileSystemRMStateStore when FS's operations fail due to IOException. Contributed by Zhihai Xu.
...
(cherry picked from commit 01a1621930
)
2015-02-28 00:57:01 +09:00
Devaraj K
f83d2e4410
YARN-3256. TestClientToAMTokens#testClientTokenRace is not running against all Schedulers even when using ParameterizedSchedulerTestBase. Contributed by Anubhav Dhoot.
...
(cherry picked from commit 0d4296f0e0
)
2015-02-26 15:47:32 +05:30
Tsuyoshi Ozawa
13e213c41a
YARN-3247. TestQueueMappings should use CapacityScheduler explicitly. Contributed by Zhihai Xu.
...
(cherry picked from commit 6cbd9f1113
)
2015-02-25 11:23:34 +09:00
Xuan
0ca974d0b0
YARN-2797. TestWorkPreservingRMRestart should use ParametrizedSchedulerTestBase. Contributed by Karthik Kambatla
...
(cherry picked from commit fe7a302473
)
2015-02-21 19:19:08 -08:00
Jason Lowe
3e8dfd1299
YARN-3194. RM should handle NMContainerStatuses sent by NM while registering if NM is Reconnected node. Contributed by Rohith
...
(cherry picked from commit a64dd3d24b
)
2015-02-20 15:12:08 +00:00
Jian He
eaeaf80d39
YARN-933. Fixed InvalidStateTransitonException at FINAL_SAVING state in RMApp. Contributed by Rohith Sharmaks
...
(cherry picked from commit c0d9b93953
)
2015-02-19 15:43:14 -08:00
Wangda Tan
cd5eb9c1de
YARN-3076. Add API/Implementation to YarnClient to retrieve label-to-node mapping (Varun Saxena via wangda)
...
(cherry picked from commit d49ae725d5
)
2015-02-19 11:01:47 -08:00
Jian He
b8fe1a747e
YARN-1514. Utility to benchmark ZKRMStateStore#loadState for RM HA. Contributed by Tsuyoshi OZAWA
...
(cherry picked from commit 1c03376300
)
2015-02-18 16:34:42 -08:00
Jian He
8f06eaa251
YARN-3132. RMNodeLabelsManager should remove node from node-to-label mapping when node becomes deactivated. Contributed by Wangda Tan
...
(cherry picked from commit f5da5566d9
)
2015-02-18 11:52:28 -08:00
Jian He
b5d6f76c6a
YARN-3104. Fixed RM to not generate new AMRM tokens on every heartbeat between rolling and activation. Contributed by Jason Lowe
...
(cherry picked from commit 18297e0972
)
2015-02-12 16:03:12 -08:00
Jian He
1be2d64ddd
YARN-3124. Fixed CS LeafQueue/ParentQueue to use QueueCapacities to track capacities-by-label. Contributed by Wangda Tan
...
(cherry picked from commit 18a594257e
)
Conflicts:
hadoop-yarn-project/CHANGES.txt
2015-02-12 15:01:14 -08:00
Xuan
cac6fe217a
YARN-3151. On Failover tracking url wrong in application cli for KILLED application. Contributed by Rohith
...
(cherry picked from commit 65c69e296e
)
2015-02-11 21:21:06 -08:00
Zhijie Shen
fd75b8c9ca
YARN-2246. Made the proxy tracking URL always be http(s)://proxy addr:port/proxy/<appId> to avoid duplicate sections. Contributed by Devaraj K.
...
(cherry picked from commit d5855c0e46
)
2015-02-10 15:29:09 -08:00
Zhijie Shen
24d4d933f7
YARN-3100. Made YARN authorization pluggable. Contributed by Jian He.
...
(cherry picked from commit 23bf6c7207
)
2015-02-09 20:37:58 -08:00
Jian He
6146680955
YARN-3094. Reset timer for liveness monitors after RM recovery. Contributed by Jun Gong
...
(cherry picked from commit 0af6a99a3f
)
2015-02-09 13:47:33 -08:00
Karthik Kambatla
1cb1c8b615
YARN-2990. FairScheduler's delay-scheduling always waits for node-local and rack-local delays, even for off-rack-only requests. (kasha)
...
(cherry picked from commit 7e42088abf
)
2015-02-08 22:48:29 -08:00
Jason Lowe
61fdd862be
YARN-3143. RM Apps REST API can return NPE or entries missing id and other fields. Contributed by Jason Lowe
...
(cherry picked from commit da2fb2bc46
)
2015-02-06 21:51:49 +00:00
Jian He
3ddafaa7c8
YARN-2694. Ensure only single node label specified in ResourceRequest. Contributed by Wangda Tan
...
(cherry picked from commit c1957fef29
)
2015-02-06 11:34:56 -08:00
Jason Lowe
ebdd88ec81
YARN-1582. Capacity Scheduler: add a maximum-allocation-mb setting per queue. Contributed by Thomas Graves
...
(cherry picked from commit 69c8a7f45b
)
2015-02-05 19:30:06 +00:00
Sandy Ryza
84df660af4
YARN-3101. In Fair Scheduler, fix canceling of reservations for exceeding max share (Anubhav Dhoot via Sandy Ryza)
2015-02-05 09:45:17 -08:00
Zhijie Shen
3ab3caa9ea
YARN-2543. Made resource usage be published to the timeline server too. Contributed by Naganarasimha G R.
...
(cherry picked from commit e5b56e2af6
)
2015-02-03 17:35:40 -08:00
Wangda Tan
205e15c4a3
YARN-3075. NodeLabelsManager implementation to retrieve label to node mapping (Varun Saxena via wangda)
...
(cherry picked from commit 5bd984691b
)
2015-02-03 12:55:25 -08:00
Jian He
c0b1311a93
YARN-3098. Created common QueueCapacities class in Capacity Scheduler to track capacities-by-labels of queues. Contributed by Wangda Tan
...
(cherry picked from commit 21d80b3dd9
)
2015-02-03 11:43:40 -08:00
Jason Lowe
3828fb5fab
YARN-3085. Application summary should include the application type. Contributed by Rohith
...
(cherry picked from commit a761bf8726
)
2015-02-03 14:57:25 +00:00
Jian He
4941b8f30d
YARN-3077. Fixed RM to create zk root path recursively. Contributed by Chun Chen
...
(cherry picked from commit 054a947989
)
2015-01-30 17:35:26 -08:00
Jian He
cabf97ae4f
YARN-3099. Capacity Scheduler LeafQueue/ParentQueue should use ResourceUsage to track used-resources-by-label. Contributed by Wangda Tan
...
(cherry picked from commit 86358221fc
)
2015-01-30 15:16:22 -08:00
Wangda Tan
58d7d1efc6
YARN-3079. Scheduler should also update maximumAllocation when updateNodeResource. (Zhihai Xu via wangda)
...
(cherry picked from commit 7882bc0f14
)
2015-01-28 21:56:17 -08:00
Wangda Tan
4718af4352
YARN-2932. Add entry for preemptable status (enabled/disabled) to scheduler web UI and queue initialize/refresh logging. (Eric Payne via wangda)
...
(cherry picked from commit 18741adf97
)
2015-01-27 15:37:02 -08:00
Jian He
61b4116b4b
YARN-3092. Created a common ResourceUsage class to track labeled resource usages in Capacity Scheduler. Contributed by Wangda Tan
...
(cherry picked from commit 6f9fe76918
)
2015-01-26 15:38:27 -08:00
Tsuyoshi Ozawa
25f2cf8a4e
YARN-2800. Remove MemoryNodeLabelsStore and add a way to enable/disable node labels feature. Contributed by Wangda Tan.
...
(cherry picked from commit 24aa462673
)
2015-01-23 20:37:26 +09:00
Wangda Tan
ef6fc24dfb
YARN-2933. Capacity Scheduler preemption policy should only consider capacity without labels temporarily. Contributed by Mayank Bansal
...
(cherry picked from commit 0a2d3e717d
)
2015-01-19 16:57:17 -08:00
Junping Du
7cddec31d7
YARN-3064. TestRMRestart/TestContainerResourceUsage/TestNodeManagerResync failure with allocation timeout. (Contributed by Jian He)
...
(cherry picked from commit 5d1cca34fa
)
2015-01-16 00:10:36 -08:00
Jian He
4931600030
YARN-2637. Fixed max-am-resource-percent calculation in CapacityScheduler when activating applications. Contributed by Craig Welch
...
(cherry picked from commit c53420f583
)
2015-01-13 17:32:36 -08:00
Robert Kanter
e5059b91bb
YARN-3027. Scheduler should use totalAvailable resource from node instead of availableResource for maxAllocation. (adhoot via rkanter)
...
(cherry picked from commit ae7bf31fe1
)
2015-01-12 10:48:45 -08:00
Zhijie Shen
8a81312062
YARN-2427. Added the API of moving apps between queues in RM web services. Contributed by Varun Vasudev.
...
(cherry picked from commit 60103fca04
)
2015-01-06 14:39:44 -08:00
Karthik Kambatla
7adffad2bb
YARN-2881. [YARN-2574] Implement PlanFollower for FairScheduler. (Anubhav Dhoot via kasha)
...
(cherry picked from commit 0c4b112677
)
2015-01-06 04:42:55 +05:30
Zhijie Shen
c6cf748985
YARN-2958. Made RMStateStore not update the last sequence number when updating the delegation token. Contributed by Varun Saxena.
...
(cherry picked from commit 562a701945
)
2015-01-05 13:35:09 -08:00
Tsuyoshi Ozawa
c116743bdd
YARN-2922. ConcurrentModificationException in CapacityScheduler's LeafQueue. Contributed by Rohith Sharmaks.
...
(cherry picked from commit ddc5be48fc
)
2015-01-05 00:11:01 +09:00
Karthik Kambatla
798ab51289
YARN-2998. Abstract out scheduler independent PlanFollower components. (Anubhav Dhoot via kasha)
...
(cherry picked from commit e7257acd8a
)
2014-12-30 19:55:50 -08:00
Jian He
cde5bfe3ec
YARN-2987. Fixed ClientRMService#getQueueInfo to check against queue and app ACLs. Contributed by Varun Saxena
...
(cherry picked from commit e2351c7ae2
)
2014-12-30 17:16:05 -08:00
Jian He
143e48c25a
YARN-2493. Added node-labels page on RM web UI. Contributed by Wangda Tan
...
(cherry picked from commit b7442bf92e
)
2014-12-30 16:49:59 -08:00
Jian He
60530a6c4f
YARN-2946. Fixed potential deadlock in RMStateStore. Contributed by Rohith Sharmaks
...
(cherry picked from commit 4f18018b7a
)
2014-12-23 22:15:28 -08:00
Jian He
e3a54b1b8b
YARN-2837. Support TimeLine server to recover delegation token when restarting. Contributed by Zhijie Shen
...
(cherry picked from commit 149512a837
)
2014-12-23 18:26:37 -08:00
Jian He
8b398a66ca
YARN-2340. Fixed NPE when queue is stopped during RM restart. Contributed by Rohith Sharmaks
...
(cherry picked from commit 0d89859b51
)
2014-12-22 21:55:06 -08:00
Jian He
411836b74c
YARN-2920. Changed CapacityScheduler to kill containers on nodes where node labels are changed. Contributed by Wangda Tan
...
(cherry picked from commit fdf042dfff
)
2014-12-22 16:52:15 -08:00
Karthik Kambatla
2abec14ec6
YARN-2975. FSLeafQueue app lists are accessed without required locks. (kasha)
...
(cherry picked from commit 24ee9e3431
)
2014-12-20 12:20:29 -08:00
Jian He
9180d11b3b
YARN-2952. Fixed incorrect version check in StateStore. Contributed by Rohith Sharmaks
...
(cherry picked from commit 808cba3821
)
2014-12-19 16:56:55 -08:00
Karthik Kambatla
8ee40a1580
YARN-2738. [YARN-2574] Add FairReservationSystem for FairScheduler. (Anubhav Dhoot via kasha)
...
(cherry picked from commit a22ffc3188
)
2014-12-19 15:38:27 -08:00
Jason Lowe
173664d70f
YARN-2964. RM prematurely cancels tokens for jobs that submit jobs (oozie). Contributed by Jian He
...
(cherry picked from commit 0402bada19
)
2014-12-18 23:31:11 +00:00
Steve Loughran
a858d726c8
YARN-2912 Jersey Tests failing with port in use. (varun saxena via stevel)
2014-12-12 17:10:54 +00:00
Jian He
0f47e4e745
YARN-2930. Fixed TestRMRestart#testRMRestartRecoveringNodeLabelManager intermittent failure. Contributed by Wangda Tan
...
(cherry picked from commit 2ed90a57fd
)
2014-12-09 16:48:46 -08:00
Karthik Kambatla
1986ea8dd2
YARN-2910. FSLeafQueue can throw ConcurrentModificationException. (Wilfred Spiegelenburg via kasha)
...
(cherry picked from commit a2e07a5456
)
2014-12-09 14:00:52 -08:00
Jian He
4cc0abe4fe
YARN-2869. CapacityScheduler should trim sub queue names when parse configuration. Contributed by Wangda Tan
...
(cherry picked from commit e69af836f3
)
2014-12-05 17:34:15 -08:00
Jason Lowe
b72fb6c774
YARN-2056. Disable preemption at Queue level. Contributed by Eric Payne
...
(cherry picked from commit 4b13082199
)
2014-12-05 21:08:14 +00:00
Jian He
1d1e7682c9
YARN-2301. Improved yarn container command. Contributed by Naganarasimha G R
...
(cherry picked from commit 258623ff8b
)
2014-12-04 12:53:52 -08:00
Jian He
58c971164c
YARN-2880. Added a test to make sure node labels will be recovered if RM restart is enabled. Contributed by Rohith Sharmaks
...
(cherry picked from commit 73fbb3c66b
)
2014-12-03 17:17:40 -08:00
Jian He
d6f3d4893d
YARN-2894. Fixed a bug regarding application view acl when RM fails over. Contributed by Rohith Sharmaks
...
(cherry picked from commit 392c3aaea8
)
2014-12-02 17:17:16 -08:00
Jian He
5067ac098b
YARN-2136. Changed RMStateStore to ignore store operations when fenced. Contributed by Varun Saxena
...
(cherry picked from commit 52bcefca8b
)
2014-12-02 11:02:00 -08:00
Jian He
d208c90148
YARN-2765. Added leveldb-based implementation for RMStateStore. Contributed by Jason Lowe
...
(cherry picked from commit a7fba0bc28
)
2014-12-01 16:45:17 -08:00
Junping Du
c7bd22974a
YARN-2907. SchedulerNode#toString should print all resource detail instead of only memory. (Contributed by Rohith)
...
(cherry picked from commit c732ed760e
)
2014-12-01 05:41:56 -08:00
Jian He
2863056530
YARN-2404. Removed ApplicationAttemptState and ApplicationState class in RMStateStore. Contributed by Tsuyoshi OZAWA
...
(cherry picked from commit 5805a81efb
)
2014-11-25 12:51:03 -08:00
Sandy Ryza
3e2e8eac22
YARN-2669. FairScheduler: queue names shouldn't allow periods (Wei Yan via Sandy Ryza)
2014-11-21 16:08:22 -08:00
Karthik Kambatla
e9db0aa35c
YARN-2604. Scheduler should consider max-allocation-* in conjunction with the largest node. (Robert Kanter via kasha)
...
(cherry picked from commit 3114d4731d
)
2014-11-21 10:42:47 -08:00
Karthik Kambatla
7ba5bb0c5b
YARN-2315. FairScheduler: Set current capacity in addition to capacity. (Zhihai Xu via kasha)
...
(cherry picked from commit a9a0cc3679
)
2014-11-19 20:19:54 -08:00
Karthik Kambatla
564deb75a8
YARN-2802. ClusterMetrics to include AM launch and register delays. (Zhihai Xu via kasha)
...
(cherry picked from commit f9f8e750edbe6db453f0a845e2ed49ede66e0e8a)
2014-11-19 19:50:35 -08:00
Jian He
db31ef7e7f
YARN-2865. Fixed RM to always create a new RMContext when it transitions from StandBy to Active. Contributed by Rohith Sharmaks
...
(cherry picked from commit 9cb8b75ba5
)
2014-11-19 19:49:44 -08:00
Karthik Kambatla
8df4c04c3a
YARN-2690. [YARN-2574] Make ReservationSystem and its dependent classes independent of Scheduler type. (Anubhav Dhoot via kasha)
...
(cherry picked from commit 2fce6d6141
)
2014-11-17 16:46:18 -08:00
Jason Lowe
242fd0e39a
YARN-2414. RM web UI: app page will crash if app is failed before any attempt has been created. Contributed by Wangda Tan
...
(cherry picked from commit 81c9d17af8
)
2014-11-17 21:16:54 +00:00
Sandy Ryza
c4c77669f0
YARN-2811. In Fair Scheduler, reservation fulfillments shouldn't ignore max share (Siqi Li via Sandy Ryza)
2014-11-14 15:20:37 -08:00
Jian He
beb184ac58
YARN-2856. Fixed RMAppImpl to handle ATTEMPT_KILLED event at ACCEPTED state on app recovery. Contributed by Rohith Sharmaks
...
(cherry picked from commit d005404ef7
)
2014-11-13 15:34:56 -08:00
Vinod Kumar Vavilapalli
ed3e5cb164
YARN-2853. Fixed a bug in ResourceManager causing apps to hang when the user kill request races with ApplicationMaster finish. Contributed by Jian He.
...
(cherry picked from commit 3651fe1b08
)
2014-11-13 08:13:57 -08:00
Jason Lowe
6984d899e3
YARN-2780. Log aggregated resource allocation in rm-appsummary.log. Contributed by Eric Payne
...
(cherry picked from commit f8aefa5e9c
)
2014-11-12 17:02:30 +00:00
Vinod Kumar Vavilapalli
d76fc94b21
YARN-2834. Fixed ResourceManager to ignore token-renewal failures on recovery consistent with the (somewhat incorrect) behaviour in the non-recovery case. Contributed by Jian He.
...
(cherry picked from commit e76faebc95
)
2014-11-09 18:57:48 -08:00
Arun C. Murthy
175d222bfc
YARN-2830. Add backwards-compatible ContainerId.newInstance constructor. Contributed by Jonathan Eagles.
...
(cherry picked from commit 43cd07b408
)
2014-11-09 15:03:59 -08:00
Zhijie Shen
1880a5a7c3
YARN-2505. Supported get/add/remove/change labels in RM REST API. Contributed by Craig Welch.
...
(cherry picked from commit 9a4e0d343e
)
2014-11-07 20:39:53 -08:00
Vinod Kumar Vavilapalli
a50345f654
YARN-2753. Fixed a bunch of bugs in the NodeLabelsManager classes. Contributed by Zhihai Xu.
...
(cherry picked from commit 4cfd5bc7c1
)
2014-11-07 14:16:54 -08:00
Vinod Kumar Vavilapalli
d5d2905262
YARN-2824. Fixed Capacity Scheduler to not crash when some node-labels are not mapped to queues by making default capacities per label to be zero. Contributed by Wangda Tan.
...
(cherry picked from commit 2ac1be7dec
)
2014-11-07 10:44:25 -08:00
Xuan
bf79541868
YARN-2810. TestRMProxyUsersConf fails on Windows VMs. Contributed by Varun Vasudev
...
(cherry picked from commit 1e97f2f094
)
2014-11-07 09:46:04 -08:00
Vinod Kumar Vavilapalli
eb01c602ed
YARN-2823. Fixed ResourceManager app-attempt state machine to inform schedulers about previous finished attempts of a running application to avoid expectation mismatch w.r.t. transferred containers. Contributed by Jian He.
...
(cherry picked from commit a5657182a7
)
2014-11-07 09:30:04 -08:00
Vinod Kumar Vavilapalli
52e57a95d9
YARN-2744. Fixed CapacityScheduler to validate node-labels correctly against queues. Contributed by Wangda Tan.
...
(cherry picked from commit a3839a9fbf
)
2014-11-06 17:29:12 -08:00
Jian He
f92ff24f5e
YARN-2579. Fixed a deadlock issue when EmbeddedElectorService and FatalEventDispatcher try to transition RM to StandBy at the same time. Contributed by Rohith Sharmaks
...
(cherry picked from commit 395275af86
)
2014-11-05 17:03:01 -08:00
Zhijie Shen
f3ed97de3a
YARN-2767. Added a test case to verify that http static user cannot kill or submit apps in the secure mode. Contributed by Varun Vasudev.
...
(cherry picked from commit 7a4c92a9d55fcecef066053ac30dff0fcd4ec90c)
2014-11-05 10:59:05 -08:00
Karthik Kambatla
3e4b280de7
YARN-2010. Handle app-recovery failures gracefully. (Jian He and Karthik Kambatla via kasha)
...
(cherry picked from commit b2cd269802
)
2014-11-04 17:48:49 -08:00
Vinod Kumar Vavilapalli
277141b82d
YARN-2795. Fixed ResourceManager to not crash loading node-label data from HDFS in secure mode. Contributed by Wangda Tan.
...
(cherry picked from commit ec6cbece8e
)
2014-11-03 13:45:31 -08:00
Zhijie Shen
caae0a0a08
YARN-2785. Fixed intermittent TestContainerResourceUsage failure. Contributed by Varun Vasudev.
...
(cherry picked from commit 27715ec63b
)
2014-11-02 15:22:54 -08:00
Vinod Kumar Vavilapalli
0ba8d8c19c
YARN-2698. Moved some node label APIs to be correctly placed in client protocol. Contributed by Wangda Tan.
...
(cherry picked from commit e0233c16eb
)
2014-10-30 23:02:06 -07:00
Karthik Kambatla
d2ba115f06
YARN-2712. TestWorkPreservingRMRestart: Augment FS tests with queue and headroom checks. (Tsuyoshi Ozawa via kasha)
...
(cherry picked from commit 179cab81e0
)
2014-10-30 00:39:34 -07:00
Karthik Kambatla
e88832dfb3
YARN-2742. FairSchedulerConfiguration should allow extra spaces between value and unit. (Wei Yan via kasha)
...
(cherry picked from commit 782971ae7a
)
2014-10-29 10:25:29 -07:00
Jian He
720de7eb4c
YARN-2503. Added node labels in web UI. Contributed by Wangda Tan
...
(cherry picked from commit d5e0a09721
)
2014-10-28 18:00:59 -07:00
Vinod Kumar Vavilapalli
0ad33e1483
YARN-2704. Changed ResourceManager to optionally obtain tokens itself for the sake of localization and log-aggregation for long-running services. Contributed by Jian He.
...
(cherry picked from commit a16d022ca4
)
2014-10-27 15:50:51 -07:00
Vinod Kumar Vavilapalli
36d3f1fcca
YARN-2743. Fixed a bug in ResourceManager that was causing RMDelegationToken identifiers to be tampered and thus causing app submission failures in secure mode. Contributed by Jian He.
...
(cherry picked from commit 0186645505
)
2014-10-26 11:15:45 -07:00
Jian He
96a6e02d16
YARN-1915. Fixed a race condition that client could use the ClientToAMToken to contact with AM before AM actually receives the ClientToAMTokenMasterKey. Contributed by Jason Lowe
...
(cherry picked from commit 5864dd99a4
)
2014-10-24 22:48:48 -07:00
Zhijie Shen
3b03ea6b50
YARN-2209. Replaced AM resync/shutdown command with corresponding exceptions and made related MR changes. Contributed by Jian He.
...
(cherry picked from commit 0f3b6900be
)
2014-10-23 22:03:30 -07:00
Vinod Kumar Vavilapalli
77dc9afe86
YARN-2715. Fixed ResourceManager to respect common configurations for proxy users/groups beyond just the YARN level config. Contributed by Zhijie Shen.
...
(cherry picked from commit c0e034336c
)
2014-10-21 20:10:45 -07:00
Vinod Kumar Vavilapalli
ea841eb25d
YARN-2676. Enhanced Timeline auth-filter to support proxy users. Contributed by Zhijie Shen.
...
(cherry picked from commit 39063cd36f
)
2014-10-17 22:03:43 -07:00
Vinod Kumar Vavilapalli
1c52b6551b
YARN-2705. Fixed bugs in ResourceManager node-label manager that were causing test-failures: added a dummy in-memory labels-manager. Contributed by Wangda Tan.
...
(cherry picked from commit e9c66e8fd2
)
2014-10-17 18:27:04 -07:00
Jian He
78a57087bf
YARN-1879. Marked Idempotent/AtMostOnce annotations to ApplicationMasterProtocol for RM fail over. Contributed by Tsuyoshi OZAWA
...
(cherry picked from commit c3de2412eb
)
2014-10-17 16:36:15 -07:00
Jian He
79428e021f
YARN-2588. Standby RM fails to transitionToActive if previous transitionToActive failed with ZK exception. Contributed by Rohith Sharmaks
...
(cherry picked from commit a6aa6e42ca
)
2014-10-17 10:55:04 -07:00
Vinod Kumar Vavilapalli
99ce4277a8
YARN-2699. Fixed a bug in CommonNodeLabelsManager that caused tests to fail when using ephemeral ports on NodeIDs. Contributed by Wangda Tan.
...
(cherry picked from commit abae63caf9
)
2014-10-17 08:59:22 -07:00
Vinod Kumar Vavilapalli
76cf2250de
YARN-2685. Fixed a bug in CommonNodeLabelsManager that caused wrong resource tracking per label when a host runs multiple node-managers. Contributed by Wangda Tan.
...
(cherry picked from commit b3056c266a
)
2014-10-15 18:48:36 -07:00
Vinod Kumar Vavilapalli
e8e3a36213
YARN-2496. Enhanced Capacity Scheduler to have basic support for allocating resources based on node-labels. Contributed by Wangda Tan.
...
YARN-2500. Enhanced ResourceManager to support schedulers allocating resources based on node-labels. Contributed by Wangda Tan.
(cherry picked from commit f2ea555ac6
)
2014-10-15 18:34:19 -07:00
Jian He
f93d2ea27e
YARN-2312. Deprecated old ContainerId#getId API and updated MapReduce to use ContainerId#getContainerId instead. Contributed by Tsuyoshi OZAWA
2014-10-15 15:28:26 -07:00
Zhijie Shen
bec165e163
YARN-2656. Made RM web services authentication filter support proxy user. Contributed by Varun Vasudev and Zhijie Shen.
...
(cherry picked from commit 1220bb72d4
)
2014-10-14 21:56:44 -07:00
Zhijie Shen
8aeda20b25
HADOOP-11181. Generalized o.a.h.s.t.d.DelegationTokenManager to handle all sub-classes of AbstractDelegationTokenIdentifier. Contributed by Zhijie Shen.
...
(cherry picked from commit cdce88376a
)
2014-10-14 11:37:28 -07:00
Karthik Kambatla
a1116b56a4
YARN-2641. Decommission nodes on -refreshNodes instead of next NM-RM heartbeat. (Zhihai Xu via kasha)
...
(cherry picked from commit da709a2eac
)
2014-10-13 16:24:00 -07:00
Jian He
536254f5e7
YARN-2308. Changed CapacityScheduler to explicitly throw exception if the queue to which the apps were submitted is changed across RM restart. Contributed by Craig Welch & Chang Li
...
(cherry picked from commit f9680d9a16
)
2014-10-13 14:16:12 -07:00
Zhijie Shen
e51ae64761
YARN-2651. Spun off LogRollingInterval from LogAggregationContext. Contributed by Xuan Gong.
...
(cherry picked from commit 4aed2d8e91
)
2014-10-13 10:55:09 -07:00
Vinod Kumar Vavilapalli
8ef83a9480
YARN-2494. Added NodeLabels Manager internal API and implementation. Contributed by Wangda Tan.
...
(cherry picked from commit db7f165319
)
2014-10-10 11:45:15 -07:00
Jian He
5e6f86e3aa
YARN-2649. Fixed TestAMRMRPCNodeUpdates test failure. Contributed by Ming Ma
...
(cherry picked from commit e16e25ab1b
)
2014-10-08 11:03:00 -07:00
Jian He
9ff7c0c96c
YARN-1857. CapacityScheduler headroom doesn't account for other AM's running. Contributed by Chen He and Craig Welch
...
(cherry picked from commit 30d56fdbb4
)
2014-10-07 13:48:03 -07:00
Jian He
5c33e91229
YARN-2644. Fixed CapacityScheduler to return up-to-date headroom when AM allocates. Contributed by Craig Welch
...
(cherry picked from commit 519e5a7dd2
)
2014-10-06 15:51:38 -07:00
Jian He
43358be60b
YARN-2615. Changed ClientToAMTokenIdentifier/RM(Timeline)DelegationTokenIdentifier to use protobuf as payload. Contributed by Junping Du
...
(cherry picked from commit ea26cc0b4a
)
2014-10-06 10:48:27 -07:00
subru
fb5e9df7fd
YARN-2611. Fixing jenkins findbugs warning and TestRMWebServicesCapacitySched for branch YARN-1051. Contributed by Subru Krishnan and Carlo Curino.
...
(cherry picked from commit c47464aba4
)
(cherry picked from commit a2986234be
)
2014-10-06 10:21:07 -07:00
subru
ae0f16ccc8
YARN-2576. Making test patch pass in branch. Contributed by Subru Krishnan and Carlo Curino.
...
(cherry picked from commit 90ac0be86b
)
(cherry picked from commit 5e10a13bb4
)
2014-10-06 10:21:06 -07:00
subru
cbfbdf60d6
YARN-2080. Integrating reservation system with ResourceManager and client-RM protocol. Contributed by Subru Krishnan and Carlo Curino.
...
(cherry picked from commit 8baeaead85
)
Conflicts:
hadoop-yarn-project/hadoop-yarn/hadoop-yarn-server/hadoop-yarn-server-resourcemanager/src/main/java/org/apache/hadoop/yarn/server/resourcemanager/rmapp/RMAppImpl.java
(cherry picked from commit 6261f7cc69
)
2014-10-06 10:21:06 -07:00
subru
fec639cda1
YARN-1712. Plan follower that synchronizes the current state of reservation subsystem with the scheduler. Contributed by Subru Krishnan and Carlo Curino.
...
(cherry picked from commit 169085319b
)
(cherry picked from commit 3418c56bcf
)
2014-10-06 10:21:06 -07:00
carlo curino
6bfdaf06c4
YARN-1711. Policy to enforce instantaneous and over-time quotas on user reservation. Contributed by Carlo Curino and Subru Krishnan.
...
(cherry picked from commit c4918cb4cb
)
(cherry picked from commit b6df0dddcd
)
2014-10-06 10:21:05 -07:00
carlo curino
6a3c167175
YARN-1710. Logic to find allocations within a Plan that satisfy user ReservationRequest(s). Contributed by Carlo Curino and Subru Krishnan.
...
(cherry picked from commit aef7928899
)
(cherry picked from commit f66ffcf832
)
2014-10-06 10:21:05 -07:00
subru
63250ef9d6
YARN-1709. In-memory data structures used to track resources over time to enable reservations.
...
(cherry picked from commit 0d8b2cd88b
)
(cherry picked from commit cf4b34282a
)
2014-10-06 10:21:05 -07:00
carlo curino
b81f571e60
YARN-2475. Logic for responding to capacity drops for the ReservationSystem. Contributed by Carlo Curino and Subru Krishnan.
...
(cherry picked from commit f83a07f266
)
(cherry picked from commit 1c6950354f
)
2014-10-06 10:21:05 -07:00
carlo curino
cf5ef00b96
YARN-1707. Introduce APIs to add/remove/resize queues in the CapacityScheduler. Contributed by Carlo Curino and Subru Krishnan
...
(cherry picked from commit aac47fda7f
)
Conflicts:
hadoop-yarn-project/hadoop-yarn/hadoop-yarn-server/hadoop-yarn-server-resourcemanager/src/main/java/org/apache/hadoop/yarn/server/resourcemanager/scheduler/capacity/CapacityScheduler.java
(cherry picked from commit eb3e40b833
)
2014-10-06 10:21:04 -07:00
Karthik Kambatla
b6ce0a1f69
YARN-2635. TestRM, TestRMRestart, TestClientToAMTokens should run with both CS and FS. (Wei Yan and kasha via kasha)
...
(cherry picked from commit 80d11eb68e
)
2014-10-03 11:52:05 -07:00
Jian He
6f4c77409b
YARN-2628. Capacity scheduler with DominantResourceCalculator carries out reservation even though slots are free. Contributed by Varun Vasudev
...
(cherry picked from commit 054f285526
)
2014-10-02 15:16:07 -07:00
Karthik Kambatla
b6f4d1ef42
YARN-2254. TestRMWebServicesAppsModification should run against both CS and FS. (Zhihai Xu via kasha)
...
(cherry picked from commit 5e0b49da9c
)
2014-10-02 10:15:41 -07:00
Zhijie Shen
4b50e23271
YARN-2630. Prevented previous AM container status from being acquired by the current restarted AM. Contributed by Jian He.
...
(cherry picked from commit 52bbe0f11b
)
2014-10-01 15:39:36 -07:00
Jian He
823f02725f
YARN-2602. Fixed possible NPE in ApplicationHistoryManagerOnTimelineStore. Contributed by Zhijie Shen
...
(cherry picked from commit bbff96be48
)
2014-09-30 16:44:44 -07:00
Jason Lowe
4f6a367166
YARN-1769. CapacityScheduler: Improve reservations. Contributed by Thomas Graves
...
(cherry picked from commit 9c22065109
)
2014-09-29 14:13:21 +00:00
Jian He
cb08ed1484
YARN-668. Changed NMTokenIdentifier/AMRMTokenIdentifier/ContainerTokenIdentifier to use protobuf object as the payload. Contributed by Junping Du.
...
(cherry picked from commit 5391919b09
)
2014-09-26 17:53:35 -07:00
Jason Lowe
e4d46e5ff6
YARN-2523. ResourceManager UI showing negative value for "Decommissioned Nodes" field. Contributed by Rohith
...
(cherry picked from commit 8269bfa613
)
2014-09-25 22:39:57 +00:00
Zhijie Shen
83926b3c57
YARN-2546. Made REST API for application creation/submission use numeric and boolean types instead of the string of them. Contributed by Varun Vasudev.
...
(cherry picked from commit 72b0881ca6
)
2014-09-24 17:58:48 -07:00
Zhijie Shen
3a2e400377
YARN-2581. Passed LogAggregationContext to NM via ContainerTokenIdentifier. Contributed by Xuan Gong.
...
(cherry picked from commit c86674a3a4
)
2014-09-24 17:51:54 -07:00
Karthik Kambatla
024bcfdc0d
YARN-2252. Intermittent failure of TestFairScheduler.testContinuousScheduling. (Ratandeep Ratti and kasha via kasha)
...
(cherry picked from commit f5578207d2
)
2014-09-23 00:03:47 -07:00
Karthik Kambatla
8ba8521de5
YARN-1959. Fix headroom calculation in FairScheduler. (Anubhav Dhoot via kasha)
...
(cherry picked from commit 568d3dc2bb
)
2014-09-22 23:51:45 -07:00
Karthik Kambatla
3984435661
YARN-2539. FairScheduler: Set the default value for maxAMShare to 0.5. (Wei Yan via kasha)
...
(cherry picked from commit 43efdd30b5
)
2014-09-22 16:10:20 -07:00
Jian He
3ce97a9efd
YARN-1372. Ensure all completed containers are reported to the AMs across RM restart. Contributed by Anubhav Dhoot
...
(cherry picked from commit 0a641496c7
)
2014-09-22 10:32:44 -07:00
Karthik Kambatla
c1a3819a4d
YARN-2453. TestProportionalCapacityPreemptionPolicy fails with FairScheduler. (Zhihai Xu via kasha)
...
(cherry picked from commit 9721e2c1fe
)
2014-09-21 23:14:27 -07:00
Karthik Kambatla
1714af0618
YARN-2452. TestRMApplicationHistoryWriter fails with FairScheduler. (Zhihai Xu via kasha)
...
(cherry picked from commit c50fc92502
)
2014-09-21 13:15:45 -07:00
Jian He
07981c8f60
YARN-2565. Fixed RM to not use FileSystemApplicationHistoryStore unless explicitly set. Contributed by Zhijie Shen
...
(cherry picked from commit 444acf8ea7
)
2014-09-19 11:28:22 -07:00
Zhijie Shen
9333ee3068
YARN-2568. Fixed the potential test failures due to race conditions when RM work-preserving recovery is enabled. Contributed by Jian He.
...
(cherry picked from commit 6fe5c6b746
)
2014-09-18 22:02:53 -07:00
Jason Lowe
d9273a9547
YARN-2561. MR job client cannot reconnect to AM after NM restart. Contributed by Junping Du
...
(cherry picked from commit a337f0e354
)
2014-09-18 21:36:20 +00:00
Jason Lowe
44c22c3d8d
YARN-2363. Submitted applications occasionally lack a tracking URL. Contributed by Jason Lowe
...
(cherry picked from commit 9ea7b6c063
)
2014-09-18 20:15:46 +00:00
Vinod Kumar Vavilapalli
d24ef142b7
YARN-2001. Added a time threshold for RM to wait before starting container allocations after restart/failover. Contributed by Jian He.
...
(cherry picked from commit 485c96e3cb
)
2014-09-18 11:04:27 -07:00
Jian He
d61cdd66db
YARN-2559. Fixed NPE in SystemMetricsPublisher when retrieving FinalApplicationStatus. Contributed by Zhijie Shen
...
(cherry picked from commit ee21b13cbd
)
2014-09-17 21:44:49 -07:00
junping_du
1e1f4c2fdf
YARN-1250. Generic history service should support application-acls. (Contributed by Zhijie Shen)
...
(cherry picked from commit 90a0c03f0a
)
2014-09-16 18:30:06 -07:00
Vinod Kumar Vavilapalli
5cdb24d4b1
YARN-611. Added an API to let apps specify an interval beyond which AM failures should be ignored towards counting max-attempts. Contributed by Xuan Gong.
...
(cherry picked from commit 14e2639fd0
)
2014-09-13 18:05:02 -07:00
XuanGong
add5ac6a73
YARN-2456. Possible livelock in CapacityScheduler when RM is recovering apps. Contributed by Jian He
2014-09-12 15:24:02 -07:00
Jian He
f21a5236e5
YARN-2229. Changed the integer field of ContainerId to be long type. Contributed by Tsuyoshi OZAWA
...
(cherry picked from commit 3122daa802
)
Conflicts:
hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-app/src/main/java/org/apache/hadoop/mapreduce/v2/app/rm/preemption/CheckpointAMPreemptionPolicy.java
2014-09-12 10:45:42 -07:00
junping_du
efa9ab3638
YARN-2033. Merging generic-history into the Timeline Store (Contributed by Zhijie Shen)
...
(cherry picked from commit 6b8b1608e6
)
2014-09-12 10:08:06 +08:00
Karthik Kambatla
6f8b2a8e2e
YARN-2534. FairScheduler: Potential integer overflow calculating totalMaxShare. (Zhihai Xu via kasha)
...
(cherry picked from commit c11ada5ea6
)
2014-09-11 12:06:28 -07:00
Jian He
ceae7be6b2
YARN-415. Capture aggregate memory allocation at the app-level for chargeback. Contributed by Eric Payne & Andrey Klochkov
...
(cherry picked from commit 83be3ad444
)
2014-09-10 18:22:07 -07:00
Jian He
f58a076dbc
YARN-2158. Fixed TestRMWebServicesAppsModification#testSingleAppKill test failure. Contributed by Varun Vasudev
...
(cherry picked from commit cbfe26370b
)
2014-09-10 12:48:40 -07:00
XUAN
b91ef0c502
YARN-2459. RM crashes if App gets rejected for any reason and HA is enabled. Contributed by Jian He
2014-09-10 11:51:57 -07:00
Vinod Kumar Vavilapalli
b4b59ef749
YARN-2448. Changed ApplicationMasterProtocol to expose RM-recognized resource types to the AMs. Contributed by Varun Vasudev.
...
(cherry picked from commit b67d5ba784
)
2014-09-10 10:17:30 -07:00
Karthik Kambatla
a19694f19d
YARN-1458. FairScheduler: Zero weight can lead to livelock. (Zhihai Xu via kasha)
...
(cherry picked from commit 3072c83b38
)
2014-09-10 08:33:31 -07:00
Karthik Kambatla
96a13c6d0c
YARN-2394. FairScheduler: Configure fairSharePreemptionThreshold per queue. (Wei Yan via kasha)
...
(cherry picked from commit 1dcaba9a7a
)
2014-09-03 10:27:47 -07:00
Karthik Kambatla
7b9e763138
YARN-2395. FairScheduler: Preemption timeout should be configurable per queue. (Wei Yan via kasha)
...
(cherry picked from commit 0f34e6f387
)
2014-08-30 01:22:44 -07:00
Jian He
38853c97c2
YARN-1506. Changed RMNode/SchedulerNode to update resource with event notification. Contributed by Junping Du
...
(cherry picked from commit 5c14bc426b
)
Conflicts:
hadoop-yarn-project/CHANGES.txt
2014-08-29 23:39:58 -07:00
Jian He
73a0e46657
YARN-2447. RM web service app submission doesn't pass secrets correctly. Contributed by Varun Vasudev
...
(cherry picked from commit c686aa3533
)
2014-08-29 11:44:27 -07:00
Karthik Kambatla
f237dee156
YARN-2405. NPE in FairSchedulerAppsBlock. (Tsuyoshi Ozawa via kasha)
...
(cherry picked from commit fa80ca49bd
)
2014-08-28 23:22:04 -07:00
Karthik Kambatla
9c43760e59
YARN-1326. RM should log using RMStore at startup time. (Tsuyoshi Ozawa via kasha)
...
(cherry picked from commit d16bfd1d0f
)
2014-08-27 01:44:28 -07:00
Karthik Kambatla
21843592db
YARN-2393. FairScheduler: Add the notion of steady fair share. (Wei Yan via kasha)
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/branches/branch-2@1619852 13f79535-47bb-0310-9956-ffa450edef68
2014-08-22 16:00:57 +00:00
Jason Darrell Lowe
fab20109ae
svn merge -c 1619614 FIXES: YARN-2434. RM should not recover containers from previously failed attempt when AM restart is not enabled. Contributed by Jian He
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/branches/branch-2@1619616 13f79535-47bb-0310-9956-ffa450edef68
2014-08-21 22:45:35 +00:00
Zhijie Shen
400f6131d4
YARN-2249. Avoided AM release requests being lost on work preserving RM restart. Contributed by Jian He.
...
svn merge --ignore-ancestry -c 1618972 ../../trunk/
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/branches/branch-2@1618973 13f79535-47bb-0310-9956-ffa450edef68
2014-08-19 20:40:37 +00:00
Jian He
aeab638ce2
Merge r1618915 from trunk. YARN-2409. RM ActiveToStandBy transition missing stopping previous rmDispatcher. Contributed by Rohith
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/branches/branch-2@1618916 13f79535-47bb-0310-9956-ffa450edef68
2014-08-19 17:50:48 +00:00
Jian He
f9c07bd3f9
Merge r1618542 from trunk. YARN-2411. Support simple user and group mappings to queues. Contributed by Ram Venkatesh
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/branches/branch-2@1618543 13f79535-47bb-0310-9956-ffa450edef68
2014-08-18 06:10:14 +00:00
Jian He
832cddebea
Merge r1618294 from trunk. YARN-2389. Added functionality for schedulers to kill all applications in a queue. Contributed by Subramaniam Venkatraman Krishnan
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/branches/branch-2@1618295 13f79535-47bb-0310-9956-ffa450edef68
2014-08-16 00:02:17 +00:00
Jian He
24102a4e6c
Merge r1618106 from trunk. YARN-2378. Added support for moving applications across queues in CapacityScheduler. Contributed by Subramaniam Venkatraman Krishnan
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/branches/branch-2@1618107 13f79535-47bb-0310-9956-ffa450edef68
2014-08-15 06:02:16 +00:00
Zhijie Shen
54429fd08a
YARN-2397. Avoided loading two authentication filters for RM and TS web interfaces. Contributed by Varun Vasudev.
...
svn merge --ignore-ancestry -c 1618054 ../../trunk/
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/branches/branch-2@1618055 13f79535-47bb-0310-9956-ffa450edef68
2014-08-14 21:19:21 +00:00
Karthik Kambatla
e6799af4d3
YARN-1370. Fair scheduler to re-populate container allocation state. (Anubhav Dhoot via kasha)
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/branches/branch-2@1617647 13f79535-47bb-0310-9956-ffa450edef68
2014-08-13 01:45:07 +00:00
Karthik Kambatla
d0c3ca05de
YARN-2399. FairScheduler: Merge AppSchedulable and FSSchedulerApp into FSAppAttempt. (kasha)
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/branches/branch-2@1617623 13f79535-47bb-0310-9956-ffa450edef68
2014-08-12 23:04:06 +00:00
Junping Du
e8d20ad77c
Merge r1617448 from trunk: YARN-1337. Recover containers upon nodemanager restart. (Contributed by Jason Lowe)
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/branches/branch-2@1617450 13f79535-47bb-0310-9956-ffa450edef68
2014-08-12 11:02:38 +00:00
Jian He
285267b2dc
Merge r1617341 from trunk. YARN-2138. Cleaned up notifyDone* APIs in RMStateStore. Contributed by Varun Saxena
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/branches/branch-2@1617342 13f79535-47bb-0310-9956-ffa450edef68
2014-08-11 18:25:51 +00:00
Xuan Gong
b7958c5df5
YARN-2400: Addendum fix for TestAMRestart failure. Contributed by Jian He
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/branches/branch-2@1617334 13f79535-47bb-0310-9956-ffa450edef68
2014-08-11 17:43:19 +00:00
Xuan Gong
775c42d3ea
YARN-2400. Fixed TestAMRestart fails intermittently. Contributed by Jian He
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/branches/branch-2@1617029 13f79535-47bb-0310-9956-ffa450edef68
2014-08-09 23:32:08 +00:00
Karthik Kambatla
a6611f2bd1
YARN-2026. Fair scheduler: Consider only active queues for computing fairshare. (Ashwin Shankar via kasha)
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/branches/branch-2@1616916 13f79535-47bb-0310-9956-ffa450edef68
2014-08-09 02:12:52 +00:00
Xuan Gong
4365c4530b
YARN-2212 ApplicationMaster needs to find a way to update the AMRMToken periodically. Contributed by Xuan Gong
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/branches/branch-2@1616891 13f79535-47bb-0310-9956-ffa450edef68
2014-08-08 21:37:35 +00:00
Karthik Kambatla
a0bf22947d
YARN-2352. FairScheduler: Collect metrics on duration of critical methods that affect performance. (kasha)
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/branches/branch-2@1616785 13f79535-47bb-0310-9956-ffa450edef68
2014-08-08 14:40:46 +00:00
Jian He
06a0772d62
Merge r1616580 from trunk. YARN-2008. Fixed CapacityScheduler to calculate headroom based on max available capacity instead of configured max capacity. Contributed by Craig Welch
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/branches/branch-2@1616581 13f79535-47bb-0310-9956-ffa450edef68
2014-08-07 20:01:14 +00:00
Karthik Kambatla
290d4e3ec8
YARN-2359. Application hangs when it fails to launch AM container. (Zhihai Xu via kasha)
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/branches/branch-2@1616376 13f79535-47bb-0310-9956-ffa450edef68
2014-08-07 00:08:27 +00:00
Junping Du
fc5bb235f2
Merge r1615550 from trunk: YARN-1354. Recover applications upon nodemanager restart. (Contributed by Jason Lowe)
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/branches/branch-2@1615554 13f79535-47bb-0310-9956-ffa450edef68
2014-08-04 13:35:49 +00:00
Xuan Gong
ecc257e384
YARN-1994. Expose YARN/MR endpoints on multiple interfaces. Contributed by Craig Welch, Milan Potocnik and Arpit Agarwal
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/branches/branch-2@1615010 13f79535-47bb-0310-9956-ffa450edef68
2014-07-31 21:29:01 +00:00
Zhijie Shen
f52092be46
YARN-2347. Consolidated RMStateVersion and NMDBSchemaVersion into Version in yarn-server-common. Contributed by Junping Du.
...
svn merge --ignore-ancestry -c 1614838 ../../trunk/
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/branches/branch-2@1614839 13f79535-47bb-0310-9956-ffa450edef68
2014-07-31 09:31:22 +00:00
Karthik Kambatla
d23e7d4acb
YARN-2328. FairScheduler: Verify update and continuous scheduling threads are stopped when the scheduler is stopped. (kasha)
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/branches/branch-2@1614435 13f79535-47bb-0310-9956-ffa450edef68
2014-07-29 17:46:58 +00:00
Zhijie Shen
b41896f0a2
YARN-2247. Made RM web services authenticate users via kerberos and delegation token. Contributed by Varun Vasudev.
...
svn merge --ignore-ancestry -c 1613821 ../../trunk/
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/branches/branch-2@1613822 13f79535-47bb-0310-9956-ffa450edef68
2014-07-27 17:57:44 +00:00
Jian He
eeb024c034
Merge r1613515 from trunk. YARN-2211. Persist AMRMToken master key in RMStateStore for RM recovery. Contributed by Xuan Gong
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/branches/branch-2@1613516 13f79535-47bb-0310-9956-ffa450edef68
2014-07-25 20:44:30 +00:00
Karthik Kambatla
6cca715334
YARN-2214. FairScheduler: preemptContainerPreCheck() in FSParentQueue delays convergence towards fairness. (Ashwin Shankar via kasha)
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/branches/branch-2@1613464 13f79535-47bb-0310-9956-ffa450edef68
2014-07-25 16:33:55 +00:00
Jason Darrell Lowe
0315458f2e
svn merge -c 1612950 FIXES: YARN-2147. client lacks delegation token exception details when application submit fails. Contributed by Chen He
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/branches/branch-2@1612955 13f79535-47bb-0310-9956-ffa450edef68
2014-07-23 21:46:46 +00:00
Sanford Ryza
aeb12ce4bf
YARN-2313. Livelock can occur in FairScheduler when there are lots of running apps (Tsuyoshi Ozawa via Sandy Ryza)
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/branches/branch-2@1612770 13f79535-47bb-0310-9956-ffa450edef68
2014-07-23 05:01:35 +00:00
Karthik Kambatla
a24001475c
YARN-2273. NPE in ContinuousScheduling thread when we lose a node. (Wei Yan via kasha)
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/branches/branch-2@1612721 13f79535-47bb-0310-9956-ffa450edef68
2014-07-22 22:45:05 +00:00
Zhijie Shen
7b4f8a74ec
YARN-2319. Made the MiniKdc instance start/close before/after the class of TestRMWebServicesDelegationTokens. Contributed by Wenwu Peng.
...
svn merge --ignore-ancestry -c 1612588 ../../trunk/
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/branches/branch-2@1612590 13f79535-47bb-0310-9956-ffa450edef68
2014-07-22 15:17:13 +00:00
Junping Du
1ad0a08c20
Merge r1612565 from trunk: YARN-2242. Addendum patch. Improve exception information on AM launch crashes. (Contributed by Li Lu)
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/branches/branch-2@1612566 13f79535-47bb-0310-9956-ffa450edef68
2014-07-22 13:11:51 +00:00
Karthik Kambatla
fcb0fe0548
YARN-2244. FairScheduler missing handling of containers for unknown application attempts. (Anubhav Dhoot via kasha)
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/branches/branch-2@1611841 13f79535-47bb-0310-9956-ffa450edef68
2014-07-19 00:20:16 +00:00
Xuan Gong
5b0492f7c5
YARN-2208. AMRMTokenManager needs to have a way to roll over AMRMToken. Contributed by Xuan Gong
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/branches/branch-2@1611819 13f79535-47bb-0310-9956-ffa450edef68
2014-07-18 21:44:40 +00:00
Jian He
ad43e5bd1a
Merge r1611240 from trunk. Addendum patch for YARN-2219
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/branches/branch-2@1611241 13f79535-47bb-0310-9956-ffa450edef68
2014-07-17 03:33:12 +00:00
Vinod Kumar Vavilapalli
63e374060e
YARN-2219. Changed ResourceManager to avoid AMs and NMs getting exceptions after RM recovery but before scheduler learns about apps and app-attempts. Contributed by Jian He.
...
svn merge --ignore-ancestry -c 1611222 ../../trunk/
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/branches/branch-2@1611223 13f79535-47bb-0310-9956-ffa450edef68
2014-07-17 00:15:28 +00:00
Mayank Bansal
8260d48df9
YARN-1408 Preemption caused Invalid State Event: ACQUIRED at KILLED and caused a task timeout for 30mins. (Sunil G via mayank)
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/branches/branch-2@1610884 13f79535-47bb-0310-9956-ffa450edef68
2014-07-15 23:44:48 +00:00
Vinod Kumar Vavilapalli
9463cbd1ec
YARN-2233. Implemented ResourceManager web-services to create, renew and cancel delegation tokens. Contributed by Varun Vasudev.
...
svn merge --ignore-ancestry -c 1610876 ../../trunk/
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/branches/branch-2@1610877 13f79535-47bb-0310-9956-ffa450edef68
2014-07-15 23:00:48 +00:00
Vinod Kumar Vavilapalli
66af4bf54f
YARN-2260. Fixed ResourceManager's RMNode to correctly remember containers when nodes resync during work-preserving RM restart. Contributed by Jian He.
...
svn merge --ignore-ancestry -c 1610557 ../../trunk/
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/branches/branch-2@1610559 13f79535-47bb-0310-9956-ffa450edef68
2014-07-14 23:32:36 +00:00
Zhijie Shen
9cc4e93fcd
YARN-2022 Preempting an Application Master container can be kept as least priority when multiple applications are marked for preemption by ProportionalCapacityPreemptionPolicy (Sunil G via mayank)
...
svn merge --ignore-ancestry -c 1607227 ../../trunk/
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/branches/branch-2@1609615 13f79535-47bb-0310-9956-ffa450edef68
2014-07-11 02:48:34 +00:00
Jian He
d90b47f8bc
Merge r1609561 from trunk. YARN-2181. Added preemption info to logs and RM web UI. Contributed by Wangda Tan
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/branches/branch-2@1609562 13f79535-47bb-0310-9956-ffa450edef68
2014-07-10 20:05:20 +00:00
Karthik Kambatla
02f72d7a6b
YARN-2131. Add a way to format the RMStateStore. (Robert Kanter via kasha)
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/branches/branch-2@1609280 13f79535-47bb-0310-9956-ffa450edef68
2014-07-09 20:03:18 +00:00
Zhijie Shen
077278fa9e
YARN-2158. Improved assertion messages of TestRMWebServicesAppsModification. Contributed by Varun Vasudev.
...
svn merge --ignore-ancestry -c 1608667 ../../trunk/
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/branches/branch-2@1608668 13f79535-47bb-0310-9956-ffa450edef68
2014-07-08 06:09:09 +00:00
Sanford Ryza
92de5cae9b
YARN-2250. FairScheduler.findLowestCommonAncestorQueue returns null when queues not identical (Krisztian Horvath via Sandy Ryza)
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/branches/branch-2@1607873 13f79535-47bb-0310-9956-ffa450edef68
2014-07-04 15:17:14 +00:00
Junping Du
0d50c2f697
Merge r1607655 from trunk: YARN-2242. Improve exception information on AM launch crashes. (Contributed by Li Lu)
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/branches/branch-2@1607656 13f79535-47bb-0310-9956-ffa450edef68
2014-07-03 14:19:18 +00:00
Vinod Kumar Vavilapalli
0ac01d7cfc
YARN-2232. Fixed ResourceManager to allow DelegationToken owners to be able to cancel their own tokens in secure mode. Contributed by Varun Vasudev.
...
svn merge --ignore-ancestry -c 1607484 ../../trunk/
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/branches/branch-2@1607485 13f79535-47bb-0310-9956-ffa450edef68
2014-07-02 21:37:20 +00:00
Vinod Kumar Vavilapalli
8f1f458729
YARN-1713. Added get-new-app and submit-app functionality to RM web services. Contributed by Varun Vasudev.
...
svn merge --ignore-ancestry -c 1607216 ../../trunk/
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/branches/branch-2@1607217 13f79535-47bb-0310-9956-ffa450edef68
2014-07-02 00:23:49 +00:00
Xuan Gong
c2eaeadaab
TestRMApplicationHistoryWriter sometimes fails in trunk. Contributed by Zhijie Shen
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/branches/branch-2@1606834 13f79535-47bb-0310-9956-ffa450edef68
2014-06-30 16:51:04 +00:00
Jian He
a385b77df7
Merge r1606557 from trunk. YARN-2052. Embedded an epoch number in container id to ensure the uniqueness of container id after RM restarts. Contributed by Tsuyoshi OZAWA
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/branches/branch-2@1606558 13f79535-47bb-0310-9956-ffa450edef68
2014-06-29 18:26:08 +00:00
Jian He
c63e3f027b
Merge r1606407 from trunk. YARN-614. Changed ResourceManager to not count disk failure, node loss and RM restart towards app failures. Contributed by Xuan Gong
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/branches/branch-2@1606408 13f79535-47bb-0310-9956-ffa450edef68
2014-06-28 23:41:51 +00:00
Zhijie Shen
705a5cede9
YARN-2201. Made TestRMWebServicesAppsModification be independent of the changes on yarn-default.xml. Contributed by Varun Vasudev.
...
svn merge --ignore-ancestry -c 1606285 ../../trunk/
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/branches/branch-2@1606286 13f79535-47bb-0310-9956-ffa450edef68
2014-06-28 03:33:46 +00:00
Karthik Kambatla
94ac07e4a5
YARN-2204. Addendum patch. TestAMRestart#testAMRestartWithExistingContainers assumes CapacityScheduler. (Robert Kanter via kasha)
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/branches/branch-2@1606170 13f79535-47bb-0310-9956-ffa450edef68
2014-06-27 18:18:12 +00:00
Vinod Kumar Vavilapalli
8317fd5de6
YARN-2171. Improved CapacityScheduling to not lock on nodemanager-count when AMs heartbeat in. Contributed by Jason Lowe.
...
svn merge --ignore-ancestry -c 1605616 ../../trunk/
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/branches/branch-2@1605617 13f79535-47bb-0310-9956-ffa450edef68
2014-06-25 21:57:28 +00:00
Karthik Kambatla
7e252055bf
YARN-2204. TestAMRestart#testAMRestartWithExistingContainers assumes CapacityScheduler. (Robert Kanter via kasha)
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/branches/branch-2@1605550 13f79535-47bb-0310-9956-ffa450edef68
2014-06-25 18:54:15 +00:00
Jian He
63940d6e13
Merge r1605263 from trunk. YARN-1365. Changed ApplicationMasterService to allow an app to re-register after RM restart. Contributed by Anubhav Dhoot
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/branches/branch-2@1605264 13f79535-47bb-0310-9956-ffa450edef68
2014-06-25 04:45:50 +00:00
Vinod Kumar Vavilapalli
a2e2c8ad97
YARN-2152. Added missing information into ContainerTokenIdentifier so that NodeManagers can report the same to RM when RM restarts. Contributed by Jian He.
...
svn merge --ignore-ancestry -c 1605205 ../../trunk/
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/branches/branch-2@1605206 13f79535-47bb-0310-9956-ffa450edef68
2014-06-24 21:44:00 +00:00
Thomas Graves
1c2052e200
YARN-2072. RM/NM UIs and webservices are missing vcore information. (Nathan Roberts via tgraves)
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/branches/branch-2@1605166 13f79535-47bb-0310-9956-ffa450edef68
2014-06-24 19:41:56 +00:00
Karthik Kambatla
e9cf83174a
YARN-2109. Fix TestRM to work with both schedulers. (Anubhav Dhoot via kasha)
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/branches/branch-2@1605144 13f79535-47bb-0310-9956-ffa450edef68
2014-06-24 17:34:29 +00:00
Karthik Kambatla
1307f2aeae
YARN-2192. TestRMHA fails when run with a mix of Schedulers. (Anubhav Dhoot via kasha)
...
git-svn-id: https://svn.apache.org/repos/asf/hadoop/common/branches/branch-2@1605139 13f79535-47bb-0310-9956-ffa450edef68
2014-06-24 17:14:22 +00:00