Commit Graph

1301 Commits

Author SHA1 Message Date
Jason Lowe baf9e22284 YARN-3850. NM fails to read files from full disks which can lead to container logs being lost and other issues. Contributed by Varun Saxena
(cherry picked from commit 40b256949a)
2015-06-26 15:48:34 +00:00
Jason Lowe 41d9677740 YARN-3832. Resource Localization fails on a cluster due to existing cache directories. Contributed by Brahma Reddy Battula
(cherry picked from commit 8d58512d6e)
2015-06-24 16:38:40 +00:00
Jason Lowe 37b89deccf YARN-3809. Failed to launch new attempts because ApplicationMasterLauncher's threads all hang. Contributed by Jun Gong
(cherry picked from commit 2a20dd9b61)
2015-06-24 16:25:22 +00:00
Karthik Kambatla 6b1a156e27 YARN-3842. NMProxy should retry on NMNotYetReadyException. (Robert Kanter via kasha)
(cherry picked from commit 5ebf2817e5)
(cherry picked from commit 9656ee4ee7)
2015-06-22 17:49:55 -07:00
Xuan c0a419b134 YARN-3804. Both RM are on standBy state when kerberos user not in yarn.admin.acl. Contributed by Varun Saxena 2015-06-17 16:27:01 -07:00
Jian He a78ca0fadc YARN-3764. CapacityScheduler should forbid moving LeafQueue from one parent to another. Contributed by Wangda Tan
(cherry picked from commit 6ad4e59cfc)
2015-06-04 10:53:47 -07:00
Wangda Tan e74e4d7bb9 YARN-3733. Fix DominantRC#compare() does not work as expected if cluster resource is empty. (Rohith Sharmaks via wangda)
(cherry picked from commit ebd797c48f)
2015-06-04 10:26:06 -07:00
Jason Lowe 3d2c3f8648 YARN-3585. NodeManager cannot exit on SHUTDOWN event triggered and NM recovery is enabled. Contributed by Rohith Sharmaks
(cherry picked from commit e13b671aa5)

Conflicts:

	hadoop-yarn-project/CHANGES.txt
2015-06-03 19:46:51 +00:00
Xuan b34825b0cb YARN-3753. RM failed to come up with "java.io.IOException: Wait for ZKClient creation timed out". Contributed by Jian He 2015-06-02 10:28:14 -07:00
Xuan 1ad0d43fb4 Revert "YARN-1462. Made RM write application tags to timeline server and exposed them to users via generic history web UI and REST API. Contributed by Xuan Gong."
This reverts commit 2032e8d1a0.
2015-06-01 11:29:36 -07:00
Wangda Tan 3f926f4b20 YARN-3725. App submission via REST API is broken in secure mode due to Timeline DT service address is empty. (Zhijie Shen via wangda)
(cherry picked from commit 5cc3fced95)
2015-05-31 16:35:25 -07:00
Xuan 0943e0f5a8 YARN-2900. Application (Attempt and Container) Not Found in AHS results
in Internal Server Error (500). Contributed by Zhijie Shen and Mit Desai
2015-05-31 15:41:13 -07:00
Zhijie Shen 2032e8d1a0 YARN-1462. Made RM write application tags to timeline server and exposed them to users via generic history web UI and REST API. Contributed by Xuan Gong. 2015-05-30 09:39:49 -07:00
Vinod Kumar Vavilapalli 9449b72429 Fixed more FilesSystemRMStateStore issues. Contributed by Vinod Kumar Vavilapalli.
(cherry picked from commit 9acd24fec4)
2015-05-29 13:29:44 -07:00
Wangda Tan bb8350388b YARN-3489. RMServerUtils.validateResourceRequests should only obtain queue info once. (Varun Saxena via wangda) 2015-05-28 17:04:20 -07:00
cnauroth 02ab202a2b YARN-3626. On Windows localized resources are not moved to the front of the classpath when they should be. Contributed by Craig Welch.
(cherry picked from commit 7bc33b63c78c6ee9a04c0c1511ef16e63f7f3074)
(cherry picked from commit 6ed8989a6f)
2015-05-27 14:21:26 -07:00
Wangda Tan fb57a1aac8 YARN-3686. CapacityScheduler should trim default_node_label_expression. (Sunil G via wangda)
(cherry picked from commit cdbd66be11)
2015-05-26 16:29:27 -07:00
Xuan eb4d1ed612 YARN-3701. Isolating the error of generating a single app report when
getting all apps from generic history service. Contributed by Zhijie
Shen

(cherry picked from commit 455b3acf0e)
(cherry picked from commit 33be070a5e)
2015-05-22 14:36:30 -07:00
Karthik Kambatla c60054743f YARN-3675. FairScheduler: RM quits when node removal races with continuous-scheduling on the same node. (Anubhav Dhoot via kasha)
(cherry picked from commit a8b50e46737c11936ba72c427da69b2365a07aac)
(cherry picked from commit e8ac88d4fe)
2015-05-21 13:43:22 -07:00
Jian He ce45e4e82e YARN-3609. Load node labels from storage inside RM serviceStart. Contributed by Wangda Tan 2015-05-20 17:06:59 -07:00
Wangda Tan 114d41aeb2 Backport "YARN-2918. RM should not fail on startup if queue's configured labels do not exist in cluster-node-labels. Contributed by Wangda Tan" to 2.7.1 2015-05-20 13:27:01 -07:00
Tsuyoshi Ozawa 6c7840f5b5 YARN-3677. Fix findbugs warnings in yarn-server-resourcemanager. Contributed by Vinod Kumar Vavilapalli.
(cherry picked from commit 7401e5b5e8)
2015-05-20 09:02:30 +09:00
Xuan 411c09b613 YARN-3526. ApplicationMaster tracking URL is incorrectly redirected on a QJM cluster. Contributed by Weiwei Yang
(cherry picked from commit b0ad644083)
(cherry picked from commit 802676e1be)
2015-05-15 22:41:46 -07:00
Vinod Kumar Vavilapalli 3c51654d57 Fixing HDFS state-store. Contributed by Arun Suresh.
(cherry picked from commit 9a2a9553ee)
(cherry picked from commit ad3196e016)
2015-05-14 18:33:51 -07:00
Jason Lowe 5161751433 YARN-3641. NodeManager: stopRecoveryStore() shouldn't be skipped when exceptions happen in stopping NM's sub-services. Contributed by Junping Du
(cherry picked from commit 711d77cc54)
2015-05-13 21:09:03 +00:00
Jason Lowe 7110499817 YARN-3537. NPE when NodeManager.serviceInit fails and stopRecoveryStore invoked. Contributed by Brahma Reddy Battula
(cherry picked from commit 5e093f0d40)
2015-05-13 20:38:52 +00:00
Jason Lowe bfd28d6f7e YARN-3457. NPE when NodeManager.serviceInit fails and stopRecoveryStore called. Contributed by Bibin A Chundatt.
(cherry picked from commit ac32fa187c)

Conflicts:

	hadoop-yarn-project/CHANGES.txt
2015-05-13 20:38:52 +00:00
Xuan 9527cdd12d YARN-3626. On Windows localized resources are not moved to the front of the classpath when they should be. Contributed by Craig Welch
(cherry picked from commit 0f95921447)
(cherry picked from commit 487d9b0f3f)
2015-05-13 13:12:38 -07:00
Zhijie Shen 071e21cacd YARN-3539. Updated timeline server documentation and marked REST APIs evolving. Contributed by Steve Loughran.
(cherry picked from commit fcd0702c10)

Conflicts:
	hadoop-yarn-project/CHANGES.txt
2015-05-12 21:19:33 -07:00
Wangda Tan 6b4adbf460 backport YARN-3493. RM fails to come up with error "Failed to load/recover state" when mem settings are changed (Jian He) to branch-2.7 2015-05-11 18:05:32 -07:00
Wangda Tan 84f7641f7e YARN-3434. Interaction between reservations and userlimit can result in significant ULF violation. (Thomas Graves via wangda) 2015-05-11 15:14:06 -07:00
Jason Lowe a75f4bed6e YARN-3476. Nodemanager can fail to delete local logs if log aggregation fails. Contributed by Rohith
(cherry picked from commit 25e2b02122)
2015-05-08 22:47:18 +00:00
Devaraj K ada2f74b17 YARN-3358. Audit log not present while refreshing Service ACLs.
Contributed by Varun Saxena.

(cherry picked from commit ef3d66d462)
2015-05-08 12:16:13 +05:30
Vinod Kumar Vavilapalli 3d1d551c33 YARN-3385. Fixed a race-condition in ResourceManager's ZooKeeper based state-store to avoid crashing on duplicate deletes. Contributed by Zhihai Xu.
(cherry picked from commit 4c7b9b6abe)
2015-05-06 17:57:49 -07:00
Karthik Kambatla fbd4bbb07b YARN-3469. ZKRMStateStore: Avoid setting watches that are not required. (Jun Hong via kasha)
(cherry picked from commit e516706b89)
2015-05-06 17:57:13 -07:00
Jian He 4e8e9d717c YARN-3301. Fixed the format issue of the new RM attempt web page. Contributed by Xuan Gong 2015-05-06 14:00:29 -07:00
Jian He a38dbbc5b1 YARN-3243. CapacityScheduler should pass headroom from parent to children to make sure ParentQueue obey its capacity limits. Contributed by Wangda Tan.
(cherry picked from commit 487374b7fe)

(cherry picked from commit 1c601e492f)
2015-04-30 13:51:13 -07:00
Zhijie Shen 04dd05989e YARN-3544. Got back AM logs link on the RM web UI for a completed app. Contributed by Xuan Gong. 2015-04-30 13:07:15 -07:00
Karthik Kambatla 396c41b1d4 YARN-3485. FairScheduler headroom calculation doesn't consider maxResources for Fifo and FairShare policies. (kasha)
(cherry picked from commit 8f82970e0c)
(cherry picked from commit 4cc38df7de)
2015-04-28 21:07:32 -07:00
Karthik Kambatla 4ddcc7e5b5 YARN-3464. Race condition in LocalizerRunner kills localizer before localizing all resources. (Zhihai Xu via kasha)
(cherry picked from commit 47279c3228)
(cherry picked from commit 4045c41afe)
2015-04-27 13:37:06 -07:00
Xuan bec78f9b85 YARN-3516. killing ContainerLocalizer action doesn't take effect when
private localizer receives FETCH_FAILURE status. Contributed by zhihai
xu

(cherry picked from commit 0b3f8957a8)
(cherry picked from commit 8f6053ae51)
2015-04-27 13:35:38 -07:00
Karthik Kambatla b5cc78e57e YARN-3465. Use LinkedHashMap to preserve order of resource requests. (Zhihai Xu via kasha)
(cherry picked from commit 6495940eae)
(cherry picked from commit 53e0bf5c17)
2015-04-27 12:39:21 -07:00
Jian He 5cf35b98e1 YARN-3472. Fixed possible leak in DelegationTokenRenewer#allTokens. Contributed by Rohith Sharmaks
(cherry picked from commit a1afbc48b5)
2015-04-23 17:43:41 -07:00
Jian He 6cc0e9baa1 YARN-3382. Some of UserMetricsInfo metrics are incorrectly set to root queue metrics. Contributed by Rohit Agarwal
(cherry picked from commit 944a16579f)
2015-04-23 17:40:56 -07:00
Karthik Kambatla e1843c9a46 YARN-3351. AppMaster tracking URL is broken in HA. (Anubhav Dhoot via kasha)
(cherry picked from commit 20b49224eb)
2015-04-23 17:39:50 -07:00
Steve Loughran b3f2826bfb YARN-2605. [RM HA] Rest api endpoints doing redirect incorrectly. (Xuan Gong via stevel) 2015-04-23 09:22:09 +01:00
Wangda Tan d6105d944c YARN-3487. CapacityScheduler scheduler lock obtained unnecessarily when calling getQueue (Jason Lowe via wangda)
(cherry picked from commit f47a5763ac)
2015-04-17 17:23:34 -07:00
Akira Ajisaka 48d6e994bf MAPREDUCE-6300. Task list sort by task id broken. Contributed by Siqi Li.
(cherry picked from commit 1fa8075794)
(cherry picked from commit 51c458a5c4)
2015-04-16 10:16:48 -05:00
Harsh J d4a462c02e YARN-3462. Patches applied for YARN-2424 are inconsistent between trunk and branch-2. Contributed by Naganarasimha G R. 2015-04-15 09:42:20 +05:30
Jason Lowe 70ac8cda41 YARN-3466. Fix RM nodes web page to sort by node HTTP-address, #containers and node-label column. (Jason Lowe via wangda)
(cherry picked from commit 73d9e4c77b)

Conflicts:

	hadoop-yarn-project/CHANGES.txt
2015-04-10 13:52:34 +00:00