Commit Graph

1677 Commits

Author SHA1 Message Date
Tsuyoshi Ozawa d9427a16b1 YARN-3170. YARN architecture document needs updating. Contirubted by Brahma Reddy Battula.
(cherry picked from commit edcaae44c1)
2015-07-15 15:43:49 +09:00
Jian He 9aa4411325 Revert "YARN-3878. AsyncDispatcher can hang while stopping if it is configured for draining events on stop. (Varun Saxena via kasha)"
This reverts commit aa067c6aa4.
(cherry picked from commit 2466460d4c)
2015-07-13 14:32:18 -07:00
Karthik Kambatla 8bb8006b71 YARN-3878. AsyncDispatcher can hang while stopping if it is configured for draining events on stop. (Varun Saxena via kasha)
(cherry picked from commit aa067c6aa4)
(cherry picked from commit ccf18705f7)
2015-07-09 09:50:14 -07:00
Akira Ajisaka 148b4db7e0 YARN-3690. [JDK8] 'mvn site' fails. Contributed by Brahma Reddy Battula.
(cherry picked from commit d6325745e2)
(cherry picked from commit d260478d3a)
2015-07-08 15:43:21 +09:00
Vinod Kumar Vavilapalli (I am also known as @tshooter.) dcafd3ccd2 Release process for 2.7.1: Set the release date for 2.7.1.
(cherry picked from commit bf89ddb9b8)
2015-07-06 16:41:38 -07:00
Wangda Tan a6f6ba95ef YARN-3508. Prevent processing preemption events on the main RM dispatcher. (Varun Saxena via wangda) 2015-07-02 11:08:21 -07:00
Jason Lowe 055d9292a7 YARN-3793. Several NPEs when deleting local files on NM recovery. Contributed by Varun Saxena
(cherry picked from commit b5cdf78e8e)
2015-07-01 21:15:30 +00:00
Vinod Kumar Vavilapalli 2e99675c72 Adding release 2.7.2 to CHANGES.txt.
(cherry picked from commit aad6a7d5db)
2015-06-28 16:33:49 -07:00
Jason Lowe baf9e22284 YARN-3850. NM fails to read files from full disks which can lead to container logs being lost and other issues. Contributed by Varun Saxena
(cherry picked from commit 40b256949a)
2015-06-26 15:48:34 +00:00
Jason Lowe 41d9677740 YARN-3832. Resource Localization fails on a cluster due to existing cache directories. Contributed by Brahma Reddy Battula
(cherry picked from commit 8d58512d6e)
2015-06-24 16:38:40 +00:00
Jason Lowe 37b89deccf YARN-3809. Failed to launch new attempts because ApplicationMasterLauncher's threads all hang. Contributed by Jun Gong
(cherry picked from commit 2a20dd9b61)
2015-06-24 16:25:22 +00:00
Karthik Kambatla 6b1a156e27 YARN-3842. NMProxy should retry on NMNotYetReadyException. (Robert Kanter via kasha)
(cherry picked from commit 5ebf2817e5)
(cherry picked from commit 9656ee4ee7)
2015-06-22 17:49:55 -07:00
Xuan c0a419b134 YARN-3804. Both RM are on standBy state when kerberos user not in yarn.admin.acl. Contributed by Varun Saxena 2015-06-17 16:27:01 -07:00
Tsuyoshi Ozawa 6c527f5514 YARN-3711. Documentation of ResourceManager HA should explain configurations about listen addresses. Contributed by Masatake Iwasaki.
(cherry picked from commit e8c514373f)

Conflicts:
	hadoop-yarn-project/CHANGES.txt
2015-06-16 10:16:15 +09:00
Jian He a78ca0fadc YARN-3764. CapacityScheduler should forbid moving LeafQueue from one parent to another. Contributed by Wangda Tan
(cherry picked from commit 6ad4e59cfc)
2015-06-04 10:53:47 -07:00
Wangda Tan e74e4d7bb9 YARN-3733. Fix DominantRC#compare() does not work as expected if cluster resource is empty. (Rohith Sharmaks via wangda)
(cherry picked from commit ebd797c48f)
2015-06-04 10:26:06 -07:00
Jason Lowe 3d2c3f8648 YARN-3585. NodeManager cannot exit on SHUTDOWN event triggered and NM recovery is enabled. Contributed by Rohith Sharmaks
(cherry picked from commit e13b671aa5)

Conflicts:

	hadoop-yarn-project/CHANGES.txt
2015-06-03 19:46:51 +00:00
Xuan b34825b0cb YARN-3753. RM failed to come up with "java.io.IOException: Wait for ZKClient creation timed out". Contributed by Jian He 2015-06-02 10:28:14 -07:00
Xuan 1ad0d43fb4 Revert "YARN-1462. Made RM write application tags to timeline server and exposed them to users via generic history web UI and REST API. Contributed by Xuan Gong."
This reverts commit 2032e8d1a0.
2015-06-01 11:29:36 -07:00
Wangda Tan 3f926f4b20 YARN-3725. App submission via REST API is broken in secure mode due to Timeline DT service address is empty. (Zhijie Shen via wangda)
(cherry picked from commit 5cc3fced95)
2015-05-31 16:35:25 -07:00
Xuan 0943e0f5a8 YARN-2900. Application (Attempt and Container) Not Found in AHS results
in Internal Server Error (500). Contributed by Zhijie Shen and Mit Desai
2015-05-31 15:41:13 -07:00
Zhijie Shen 2032e8d1a0 YARN-1462. Made RM write application tags to timeline server and exposed them to users via generic history web UI and REST API. Contributed by Xuan Gong. 2015-05-30 09:39:49 -07:00
Wangda Tan bb8350388b YARN-3489. RMServerUtils.validateResourceRequests should only obtain queue info once. (Varun Saxena via wangda) 2015-05-28 17:04:20 -07:00
Xuan 3f188272af YARN-3723. Need to clearly document primaryFilter and otherInfo value
type. Contributed by Zhijie Shen

(cherry picked from commit 3077c299da)
(cherry picked from commit 550b55146d)
2015-05-28 10:21:06 -07:00
Wangda Tan fb57a1aac8 YARN-3686. CapacityScheduler should trim default_node_label_expression. (Sunil G via wangda)
(cherry picked from commit cdbd66be11)
2015-05-26 16:29:27 -07:00
Xuan 23994e824a YARN-2238. Filtering on UI sticks even if I move away from the page.
Contributed by Jian He

(cherry picked from commit 39077dba2e)
(cherry picked from commit 84245ff3b2)
2015-05-25 22:41:41 -07:00
Xuan eb4d1ed612 YARN-3701. Isolating the error of generating a single app report when
getting all apps from generic history service. Contributed by Zhijie
Shen

(cherry picked from commit 455b3acf0e)
(cherry picked from commit 33be070a5e)
2015-05-22 14:36:30 -07:00
Karthik Kambatla c60054743f YARN-3675. FairScheduler: RM quits when node removal races with continuous-scheduling on the same node. (Anubhav Dhoot via kasha)
(cherry picked from commit a8b50e46737c11936ba72c427da69b2365a07aac)
(cherry picked from commit e8ac88d4fe)
2015-05-21 13:43:22 -07:00
Devaraj K b68c338b17 YARN-3646. Applications are getting stuck some times in case of retry
policy forever. Contributed by Raju Bairishetti.

(cherry picked from commit 0305316d69)
2015-05-21 20:16:53 +05:30
Akira Ajisaka 7a8e076ffb YARN-3694. Fix dead link for TimelineServer REST API. Contributed by Jagadesh Kiran N.
(cherry picked from commit a5def58087)
(cherry picked from commit 6d7e7ef1c4)
2015-05-21 23:15:59 +09:00
Jian He ce45e4e82e YARN-3609. Load node labels from storage inside RM serviceStart. Contributed by Wangda Tan 2015-05-20 17:06:59 -07:00
Xuan b275818925 YARN-3681. yarn cmd says "could not find main class 'queue'" in windows. Contributed by Craig Welch and Varun Saxena
(cherry picked from commit a665b22cfa)
2015-05-20 14:44:31 -07:00
Wangda Tan 114d41aeb2 Backport "YARN-2918. RM should not fail on startup if queue's configured labels do not exist in cluster-node-labels. Contributed by Wangda Tan" to 2.7.1 2015-05-20 13:27:01 -07:00
Tsuyoshi Ozawa 6c7840f5b5 YARN-3677. Fix findbugs warnings in yarn-server-resourcemanager. Contributed by Vinod Kumar Vavilapalli.
(cherry picked from commit 7401e5b5e8)
2015-05-20 09:02:30 +09:00
Xuan f0399f56e5 YARN-3601. Fix UT TestRMFailover.testRMWebAppRedirect. Contributed by Weiwei Yang
(cherry picked from commit 5009ad4a7f)
(cherry picked from commit d39039d54d)
2015-05-19 09:57:44 -07:00
Xuan 411c09b613 YARN-3526. ApplicationMaster tracking URL is incorrectly redirected on a QJM cluster. Contributed by Weiwei Yang
(cherry picked from commit b0ad644083)
(cherry picked from commit 802676e1be)
2015-05-15 22:41:46 -07:00
Jason Lowe 5161751433 YARN-3641. NodeManager: stopRecoveryStore() shouldn't be skipped when exceptions happen in stopping NM's sub-services. Contributed by Junping Du
(cherry picked from commit 711d77cc54)
2015-05-13 21:09:03 +00:00
Jason Lowe 7110499817 YARN-3537. NPE when NodeManager.serviceInit fails and stopRecoveryStore invoked. Contributed by Brahma Reddy Battula
(cherry picked from commit 5e093f0d40)
2015-05-13 20:38:52 +00:00
Jason Lowe bfd28d6f7e YARN-3457. NPE when NodeManager.serviceInit fails and stopRecoveryStore called. Contributed by Bibin A Chundatt.
(cherry picked from commit ac32fa187c)

Conflicts:

	hadoop-yarn-project/CHANGES.txt
2015-05-13 20:38:52 +00:00
Xuan 9527cdd12d YARN-3626. On Windows localized resources are not moved to the front of the classpath when they should be. Contributed by Craig Welch
(cherry picked from commit 0f95921447)
(cherry picked from commit 487d9b0f3f)
2015-05-13 13:12:38 -07:00
Zhijie Shen 071e21cacd YARN-3539. Updated timeline server documentation and marked REST APIs evolving. Contributed by Steve Loughran.
(cherry picked from commit fcd0702c10)

Conflicts:
	hadoop-yarn-project/CHANGES.txt
2015-05-12 21:19:33 -07:00
Wangda Tan 6b4adbf460 backport YARN-3493. RM fails to come up with error "Failed to load/recover state" when mem settings are changed (Jian He) to branch-2.7 2015-05-11 18:05:32 -07:00
Wangda Tan 84f7641f7e YARN-3434. Interaction between reservations and userlimit can result in significant ULF violation. (Thomas Graves via wangda) 2015-05-11 15:14:06 -07:00
Jason Lowe a75f4bed6e YARN-3476. Nodemanager can fail to delete local logs if log aggregation fails. Contributed by Rohith
(cherry picked from commit 25e2b02122)
2015-05-08 22:47:18 +00:00
Jason Lowe 1b95bf9e1b YARN-3554. Default value for maximum nodemanager connect wait time is too high. Contributed by Naganarasimha G R
(cherry picked from commit 9757864fd6)
2015-05-08 13:48:21 +00:00
Devaraj K ada2f74b17 YARN-3358. Audit log not present while refreshing Service ACLs.
Contributed by Varun Saxena.

(cherry picked from commit ef3d66d462)
2015-05-08 12:16:13 +05:30
Vinod Kumar Vavilapalli 3d1d551c33 YARN-3385. Fixed a race-condition in ResourceManager's ZooKeeper based state-store to avoid crashing on duplicate deletes. Contributed by Zhihai Xu.
(cherry picked from commit 4c7b9b6abe)
2015-05-06 17:57:49 -07:00
Karthik Kambatla 419b8f68f4 Fix up author name to Jun Gong in CHANGES.txt for YARN-3469
(cherry picked from commit 5cda6fffd3)
2015-05-06 17:57:36 -07:00
Karthik Kambatla fbd4bbb07b YARN-3469. ZKRMStateStore: Avoid setting watches that are not required. (Jun Hong via kasha)
(cherry picked from commit e516706b89)
2015-05-06 17:57:13 -07:00
Jian He 4e8e9d717c YARN-3301. Fixed the format issue of the new RM attempt web page. Contributed by Xuan Gong 2015-05-06 14:00:29 -07:00