1301 Commits

Author SHA1 Message Date
Jason Lowe
baf9e22284 YARN-3850. NM fails to read files from full disks which can lead to container logs being lost and other issues. Contributed by Varun Saxena
(cherry picked from commit 40b256949ad6f6e0dbdd248f2d257b05899f4332)
2015-06-26 15:48:34 +00:00
Jason Lowe
41d9677740 YARN-3832. Resource Localization fails on a cluster due to existing cache directories. Contributed by Brahma Reddy Battula
(cherry picked from commit 8d58512d6e6d9fe93784a9de2af0056bcc316d96)
2015-06-24 16:38:40 +00:00
Jason Lowe
37b89deccf YARN-3809. Failed to launch new attempts because ApplicationMasterLauncher's threads all hang. Contributed by Jun Gong
(cherry picked from commit 2a20dd9b61ba3833460cbda0e8c3e8b6366fc3ab)
2015-06-24 16:25:22 +00:00
Karthik Kambatla
6b1a156e27 YARN-3842. NMProxy should retry on NMNotYetReadyException. (Robert Kanter via kasha)
(cherry picked from commit 5ebf2817e58e1be8214dc1916a694a912075aa0a)
(cherry picked from commit 9656ee4ee763dc48435ba8bb3ac9b10f6b306fe3)
2015-06-22 17:49:55 -07:00
Xuan
c0a419b134 YARN-3804. Both RM are on standBy state when kerberos user not in yarn.admin.acl. Contributed by Varun Saxena 2015-06-17 16:27:01 -07:00
Jian He
a78ca0fadc YARN-3764. CapacityScheduler should forbid moving LeafQueue from one parent to another. Contributed by Wangda Tan
(cherry picked from commit 6ad4e59cfc111a92747fdb1fb99cc6378044832a)
2015-06-04 10:53:47 -07:00
Wangda Tan
e74e4d7bb9 YARN-3733. Fix DominantRC#compare() does not work as expected if cluster resource is empty. (Rohith Sharmaks via wangda)
(cherry picked from commit ebd797c48fe236b404cf3a125ac9d1f7714e291e)
2015-06-04 10:26:06 -07:00
Jason Lowe
3d2c3f8648 YARN-3585. NodeManager cannot exit on SHUTDOWN event triggered and NM recovery is enabled. Contributed by Rohith Sharmaks
(cherry picked from commit e13b671aa510f553f4a6a232b4694b6a4cce88ae)

Conflicts:

	hadoop-yarn-project/CHANGES.txt
2015-06-03 19:46:51 +00:00
Xuan
b34825b0cb YARN-3753. RM failed to come up with "java.io.IOException: Wait for ZKClient creation timed out". Contributed by Jian He 2015-06-02 10:28:14 -07:00
Xuan
1ad0d43fb4 Revert "YARN-1462. Made RM write application tags to timeline server and exposed them to users via generic history web UI and REST API. Contributed by Xuan Gong."
This reverts commit 2032e8d1a0c6abe154a64df69b0124cdae9f78e0.
2015-06-01 11:29:36 -07:00
Wangda Tan
3f926f4b20 YARN-3725. App submission via REST API is broken in secure mode due to Timeline DT service address is empty. (Zhijie Shen via wangda)
(cherry picked from commit 5cc3fced957a8471733e0e9490878bd68429fe24)
2015-05-31 16:35:25 -07:00
Xuan
0943e0f5a8 YARN-2900. Application (Attempt and Container) Not Found in AHS results
in Internal Server Error (500). Contributed by Zhijie Shen and Mit Desai
2015-05-31 15:41:13 -07:00
Zhijie Shen
2032e8d1a0 YARN-1462. Made RM write application tags to timeline server and exposed them to users via generic history web UI and REST API. Contributed by Xuan Gong. 2015-05-30 09:39:49 -07:00
Vinod Kumar Vavilapalli
9449b72429 Fixed more FilesSystemRMStateStore issues. Contributed by Vinod Kumar Vavilapalli.
(cherry picked from commit 9acd24fec408c182683968ae2c75ca2c38866a60)
2015-05-29 13:29:44 -07:00
Wangda Tan
bb8350388b YARN-3489. RMServerUtils.validateResourceRequests should only obtain queue info once. (Varun Saxena via wangda) 2015-05-28 17:04:20 -07:00
cnauroth
02ab202a2b YARN-3626. On Windows localized resources are not moved to the front of the classpath when they should be. Contributed by Craig Welch.
(cherry picked from commit 7bc33b63c78c6ee9a04c0c1511ef16e63f7f3074)
(cherry picked from commit 6ed8989a6f897437ab53c85c42fa8b58fe556359)
2015-05-27 14:21:26 -07:00
Wangda Tan
fb57a1aac8 YARN-3686. CapacityScheduler should trim default_node_label_expression. (Sunil G via wangda)
(cherry picked from commit cdbd66be111c93c85a409d47284e588c453ecae9)
2015-05-26 16:29:27 -07:00
Xuan
eb4d1ed612 YARN-3701. Isolating the error of generating a single app report when
getting all apps from generic history service. Contributed by Zhijie
Shen

(cherry picked from commit 455b3acf0e82b214e06bd7b538968252945cd3c4)
(cherry picked from commit 33be070a5e944bdc04693e7be6f39fd3d145d8d1)
2015-05-22 14:36:30 -07:00
Karthik Kambatla
c60054743f YARN-3675. FairScheduler: RM quits when node removal races with continuous-scheduling on the same node. (Anubhav Dhoot via kasha)
(cherry picked from commit a8b50e46737c11936ba72c427da69b2365a07aac)
(cherry picked from commit e8ac88d4fe8aaef9d2e5fb76e6bc50223ff0e495)
2015-05-21 13:43:22 -07:00
Jian He
ce45e4e82e YARN-3609. Load node labels from storage inside RM serviceStart. Contributed by Wangda Tan 2015-05-20 17:06:59 -07:00
Wangda Tan
114d41aeb2 Backport "YARN-2918. RM should not fail on startup if queue's configured labels do not exist in cluster-node-labels. Contributed by Wangda Tan" to 2.7.1 2015-05-20 13:27:01 -07:00
Tsuyoshi Ozawa
6c7840f5b5 YARN-3677. Fix findbugs warnings in yarn-server-resourcemanager. Contributed by Vinod Kumar Vavilapalli.
(cherry picked from commit 7401e5b5e8060b6b027d714b5ceb641fcfe5b598)
2015-05-20 09:02:30 +09:00
Xuan
411c09b613 YARN-3526. ApplicationMaster tracking URL is incorrectly redirected on a QJM cluster. Contributed by Weiwei Yang
(cherry picked from commit b0ad644083a0dfae3a39159ac88b6fc09d846371)
(cherry picked from commit 802676e1be350785d8c0ad35f6676eeb85b2467b)
2015-05-15 22:41:46 -07:00
Vinod Kumar Vavilapalli
3c51654d57 Fixing HDFS state-store. Contributed by Arun Suresh.
(cherry picked from commit 9a2a9553eee454ecd18120535d3e845f86fc3584)
(cherry picked from commit ad3196e01667bd6798a1988fddb0c0ae32f6687c)
2015-05-14 18:33:51 -07:00
Jason Lowe
5161751433 YARN-3641. NodeManager: stopRecoveryStore() shouldn't be skipped when exceptions happen in stopping NM's sub-services. Contributed by Junping Du
(cherry picked from commit 711d77cc54a64b2c3db70bdacc6bf2245c896a4b)
2015-05-13 21:09:03 +00:00
Jason Lowe
7110499817 YARN-3537. NPE when NodeManager.serviceInit fails and stopRecoveryStore invoked. Contributed by Brahma Reddy Battula
(cherry picked from commit 5e093f0d400f82f67d9b2d24253c79e4a5abacf9)
2015-05-13 20:38:52 +00:00
Jason Lowe
bfd28d6f7e YARN-3457. NPE when NodeManager.serviceInit fails and stopRecoveryStore called. Contributed by Bibin A Chundatt.
(cherry picked from commit ac32fa187cf37e5a51fd579e052105662ab3c411)

Conflicts:

	hadoop-yarn-project/CHANGES.txt
2015-05-13 20:38:52 +00:00
Xuan
9527cdd12d YARN-3626. On Windows localized resources are not moved to the front of the classpath when they should be. Contributed by Craig Welch
(cherry picked from commit 0f95921447ea547bdf9caf18f7fde46bc66031f8)
(cherry picked from commit 487d9b0f3f40a1f1afbcb3cc7b760982131b4902)
2015-05-13 13:12:38 -07:00
Zhijie Shen
071e21cacd YARN-3539. Updated timeline server documentation and marked REST APIs evolving. Contributed by Steve Loughran.
(cherry picked from commit fcd0702c10ce574b887280476aba63d6682d5271)

Conflicts:
	hadoop-yarn-project/CHANGES.txt
2015-05-12 21:19:33 -07:00
Wangda Tan
6b4adbf460 backport YARN-3493. RM fails to come up with error "Failed to load/recover state" when mem settings are changed (Jian He) to branch-2.7 2015-05-11 18:05:32 -07:00
Wangda Tan
84f7641f7e YARN-3434. Interaction between reservations and userlimit can result in significant ULF violation. (Thomas Graves via wangda) 2015-05-11 15:14:06 -07:00
Jason Lowe
a75f4bed6e YARN-3476. Nodemanager can fail to delete local logs if log aggregation fails. Contributed by Rohith
(cherry picked from commit 25e2b02122c4ed760227ab33c49d3445c23b9276)
2015-05-08 22:47:18 +00:00
Devaraj K
ada2f74b17 YARN-3358. Audit log not present while refreshing Service ACLs.
Contributed by Varun Saxena.

(cherry picked from commit ef3d66d4624d360e75c016e36824a6782d6a9746)
2015-05-08 12:16:13 +05:30
Vinod Kumar Vavilapalli
3d1d551c33 YARN-3385. Fixed a race-condition in ResourceManager's ZooKeeper based state-store to avoid crashing on duplicate deletes. Contributed by Zhihai Xu.
(cherry picked from commit 4c7b9b6abe2452c9752a11214762be2e7665fb32)
2015-05-06 17:57:49 -07:00
Karthik Kambatla
fbd4bbb07b YARN-3469. ZKRMStateStore: Avoid setting watches that are not required. (Jun Hong via kasha)
(cherry picked from commit e516706b896743e47e2852be81944eb5613e3e76)
2015-05-06 17:57:13 -07:00
Jian He
4e8e9d717c YARN-3301. Fixed the format issue of the new RM attempt web page. Contributed by Xuan Gong 2015-05-06 14:00:29 -07:00
Jian He
a38dbbc5b1 YARN-3243. CapacityScheduler should pass headroom from parent to children to make sure ParentQueue obey its capacity limits. Contributed by Wangda Tan.
(cherry picked from commit 487374b7fe0c92fc7eb1406c568952722b5d5b15)

(cherry picked from commit 1c601e492f4cd80e012aa78b796383ee9de161fd)
2015-04-30 13:51:13 -07:00
Zhijie Shen
04dd05989e YARN-3544. Got back AM logs link on the RM web UI for a completed app. Contributed by Xuan Gong. 2015-04-30 13:07:15 -07:00
Karthik Kambatla
396c41b1d4 YARN-3485. FairScheduler headroom calculation doesn't consider maxResources for Fifo and FairShare policies. (kasha)
(cherry picked from commit 8f82970e0c247b37b2bf33aa21f6a39afa07efde)
(cherry picked from commit 4cc38df7de89a361493826c057f3363f1738a7e2)
2015-04-28 21:07:32 -07:00
Karthik Kambatla
4ddcc7e5b5 YARN-3464. Race condition in LocalizerRunner kills localizer before localizing all resources. (Zhihai Xu via kasha)
(cherry picked from commit 47279c3228185548ed09c36579b420225e4894f5)
(cherry picked from commit 4045c41afe440b773d006e962bf8a5eae3fdc284)
2015-04-27 13:37:06 -07:00
Xuan
bec78f9b85 YARN-3516. killing ContainerLocalizer action doesn't take effect when
private localizer receives FETCH_FAILURE status. Contributed by zhihai
xu

(cherry picked from commit 0b3f8957a87ada1a275c9904b211fdbdcefafb02)
(cherry picked from commit 8f6053ae517d30dedece4f0577341ec3c2482252)
2015-04-27 13:35:38 -07:00
Karthik Kambatla
b5cc78e57e YARN-3465. Use LinkedHashMap to preserve order of resource requests. (Zhihai Xu via kasha)
(cherry picked from commit 6495940eae09418a939882a8955845f9241a6485)
(cherry picked from commit 53e0bf5c172c396780b96fda8dd31ad799a25fed)
2015-04-27 12:39:21 -07:00
Jian He
5cf35b98e1 YARN-3472. Fixed possible leak in DelegationTokenRenewer#allTokens. Contributed by Rohith Sharmaks
(cherry picked from commit a1afbc48b53f6bdbd30dc8eb56a7621d49c5d6db)
2015-04-23 17:43:41 -07:00
Jian He
6cc0e9baa1 YARN-3382. Some of UserMetricsInfo metrics are incorrectly set to root queue metrics. Contributed by Rohit Agarwal
(cherry picked from commit 944a16579fdb54c1c420162ef830ef2ac5451c69)
2015-04-23 17:40:56 -07:00
Karthik Kambatla
e1843c9a46 YARN-3351. AppMaster tracking URL is broken in HA. (Anubhav Dhoot via kasha)
(cherry picked from commit 20b49224eb90c796f042ac4251508f3979fd4787)
2015-04-23 17:39:50 -07:00
Steve Loughran
b3f2826bfb YARN-2605. [RM HA] Rest api endpoints doing redirect incorrectly. (Xuan Gong via stevel) 2015-04-23 09:22:09 +01:00
Wangda Tan
d6105d944c YARN-3487. CapacityScheduler scheduler lock obtained unnecessarily when calling getQueue (Jason Lowe via wangda)
(cherry picked from commit f47a5763acd55cb0b3f16152c7f8df06ec0e09a9)
2015-04-17 17:23:34 -07:00
Akira Ajisaka
48d6e994bf MAPREDUCE-6300. Task list sort by task id broken. Contributed by Siqi Li.
(cherry picked from commit 1fa8075794894b3e0053d251788277fb88e85533)
(cherry picked from commit 51c458a5c4562c54ff147824816f7b44c4b9b8f6)
2015-04-16 10:16:48 -05:00
Harsh J
d4a462c02e YARN-3462. Patches applied for YARN-2424 are inconsistent between trunk and branch-2. Contributed by Naganarasimha G R. 2015-04-15 09:42:20 +05:30
Jason Lowe
70ac8cda41 YARN-3466. Fix RM nodes web page to sort by node HTTP-address, #containers and node-label column. (Jason Lowe via wangda)
(cherry picked from commit 73d9e4c77b2aa4734845376d20fb76e6b91b6db0)

Conflicts:

	hadoop-yarn-project/CHANGES.txt
2015-04-10 13:52:34 +00:00