924 Commits

Author SHA1 Message Date
Jian He
7029fc356f YARN-4127. RM fail with noAuth error if switched from failover mode to non-failover mode. Contributed by Varun Saxena 2015-11-03 15:33:13 -08:00
Vinod Kumar Vavilapalli
09348c4bce Preparing for 2.7.3 development 2015-11-03 15:18:10 -08:00
Wangda Tan
04abe7e9e0 YARN-3136. getTransferredContainers can be a bottleneck during AM registration. (Sunil G via jianhe) 2015-11-03 12:59:06 -08:00
Jian He
ece1b1645b YARN-4321. Incessant retries if NoAuthException is thrown by Zookeeper in non HA mode. Contributed by Varun Saxena 2015-11-02 11:22:30 -08:00
Tsuyoshi Ozawa
6636441911 YARN-4312. TestSubmitApplicationWithRMHA fails on branch-2.7 and branch-2.6 as some of the test cases time out. Contributed by Varun Saxena. 2015-10-30 16:25:52 +09:00
Junping Du
da564ef9a7 YARN-3580. [JDK8] TestClientRMService.testGetLabelsToNodes fails. Contributed by Robert Kanter.
(cherry picked from commit 31b627b2a8755dec0820d01f6f7ac96e4c6194e2)

Conflicts:
	hadoop-yarn-project/CHANGES.txt
2015-10-30 15:59:40 +09:00
Jonathan Eagles
3287a50cfe YARN-4183. Enabling generic application history forces every job to get a timeline service delegation token (jeagles)
(cherry picked from commit c293c58954cdab25c8c69418b0e839883b563fa4)
2015-10-29 16:45:31 -05:00
Jason Lowe
2b6ce20e1c YARN-4041. Slow delegation token renewal can severely prolong RM recovery. Contributed by Sunil G
(cherry picked from commit d3a34a4f388155f6a7ef040e244ce7be788cd28b)

Conflicts:

	hadoop-yarn-project/CHANGES.txt
2015-10-23 20:59:28 +00:00
Jonathan Eagles
49a7d70f53 YARN-4009. CORS support for ResourceManager REST API. (Varun Vasudev via jeagles)
(cherry picked from commit f8adeb712dc834c27cec15c04a986f2f635aba83)
2015-10-23 11:18:32 -05:00
Jian He
7021e015d7 YARN-4000. RM crashes with NPE if leaf queue becomes parent queue during restart. Contributed by Varun Saxena 2015-10-21 16:37:21 -07:00
Jason Lowe
29838ff5b1 YARN-4281. 2.7 RM app page is broken. Contributed by Chang Li 2015-10-21 13:43:25 +00:00
Tsuyoshi Ozawa
08c02199ff YARN-3798. ZKRMStateStore shouldn't create new session without occurrence of SESSIONEXPIRED. (ozawa and Varun Saxena) 2015-10-19 10:30:28 +09:00
Jason Lowe
fe874e9f69 YARN-3896. RMNode transitioned from RUNNING to REBOOTED because its response id has not been reset synchronously. (Jun Gong via rohithsharmaks)
(cherry picked from commit feaf0349949e831ce3f25814c1bbff52f17bfe8f)

Conflicts:

	hadoop-yarn-project/CHANGES.txt
2015-10-08 16:08:55 +00:00
Jason Lowe
784d00e0e2 YARN-3802. Two RMNodes for the same NodeId are used in RM sometimes
after NM is reconnected. Contributed by zhihai xu
(cherry picked from commit 5b5bb8dcdc888ba1ebc7e4eba0fa0e7e79edda9a)

Conflicts:

	hadoop-yarn-project/CHANGES.txt
2015-10-08 15:56:44 +00:00
Jason Lowe
aaff9dc39b YARN-3780. Should use equals when compare Resource in
RMNodeImpl#ReconnectNodeTransition. Contributed by zhihai xu.
(cherry picked from commit c7ee6c151c5771043a6de3b8a951cea13f59dd7b)

Conflicts:

	hadoop-yarn-project/CHANGES.txt
2015-10-08 15:30:25 +00:00
Rohith Sharma K S
a231075964 YARN-4209. RMStateStore FENCED state doesn’t work due to updateFencedState called by stateMachine.doTransition. (Zhihai Xu via rohithsharmaks) 2015-10-07 09:35:10 +05:30
Anubhav Dhoot
7c9a368b45 YARN-4180. AMLauncher does not retry on failures when talking to NM. (adhoot)
(cherry picked from commit 9735afe967a660f356e953348cb6c34417f41055)

Conflicts:
	hadoop-yarn-project/hadoop-yarn/hadoop-yarn-server/hadoop-yarn-server-resourcemanager/src/test/java/org/apache/hadoop/yarn/server/resourcemanager/TestApplicationMasterLauncher.java

(cherry picked from commit 22f2501476d987afb7bc19080a7a0db94ea72be6)

Conflicts:
	hadoop-yarn-project/hadoop-yarn/hadoop-yarn-server/hadoop-yarn-server-resourcemanager/src/test/java/org/apache/hadoop/yarn/server/resourcemanager/MockRM.java
	hadoop-yarn-project/hadoop-yarn/hadoop-yarn-server/hadoop-yarn-server-resourcemanager/src/test/java/org/apache/hadoop/yarn/server/resourcemanager/TestApplicationMasterLauncher.java
2015-09-28 20:57:08 -07:00
Xuan
dce9223622 YARN-3248. Display count of nodes blacklisted by apps in the web UI.
Contributed by Varun Vasudev

(cherry picked from commit 4728bdfa15809db4b8b235faa286c65de4a48cf6)
2015-09-23 14:44:42 -07:00
Karthik Kambatla
f3fd5e30cd YARN-3697. FairScheduler: ContinuousSchedulingThread can fail to shutdown. (Zhihai Xu via kasha)
(cherry picked from commit 332b520a480994b7bd56c135f7941aad30b05e9c)

Conflicts:
	hadoop-yarn-project/CHANGES.txt
	hadoop-yarn-project/hadoop-yarn/hadoop-yarn-server/hadoop-yarn-server-resourcemanager/src/test/java/org/apache/hadoop/yarn/server/resourcemanager/scheduler/fair/TestFairScheduler.java
2015-09-17 23:35:47 -07:00
Xuan
b55fb0ac44 YARN-4087. Followup fixes after YARN-2019 regarding RM behavior when
state-store error occurs. Contributed by Jian He

(cherry picked from commit 9b78e6e33d8c117c1e909df414f20d9db56efe4b)
(cherry picked from commit a0b7ef15d0663076b65ae3f53271b54e42308bfb)
2015-09-07 17:47:32 -07:00
Xuan
96b9455c6f YARN-2019. Retrospect on decision of making RM crashed if any exception throw in ZKRMStateStore. Contributed by Jian He. 2015-09-07 17:23:56 -07:00
Jason Lowe
a7eed5128b YARN-4105. Capacity Scheduler headroom for DRF is wrong. Contributed by Chang Li 2015-09-04 16:24:03 +00:00
Jason Lowe
1275ba2cac Revert "YARN-4105. Capacity Scheduler headroom for DRF is wrong. Contributed by Chang Li"
This reverts commit 71172a0bfcf549f13bf17c1f1cbb4996272dd470.
2015-09-04 16:06:19 +00:00
Jason Lowe
71172a0bfc YARN-4105. Capacity Scheduler headroom for DRF is wrong. Contributed by Chang Li
(cherry picked from commit 6eaca2e3634a88dc55689e8960352d6248c424d9)
2015-09-04 15:32:14 +00:00
Varun Vasudev
5002f885fc YARN-4103. RM WebServices missing scheme for appattempts logLinks. Contributed by Jonathan Eagles.
(cherry picked from commit 1b2e2ca9104dfc05e9aa7f93954ae2e00b3e62c4)

Conflicts:
	hadoop-yarn-project/hadoop-yarn/hadoop-yarn-server/hadoop-yarn-server-resourcemanager/src/main/java/org/apache/hadoop/yarn/server/resourcemanager/webapp/RMAppBlock.java
	hadoop-yarn-project/hadoop-yarn/hadoop-yarn-server/hadoop-yarn-server-resourcemanager/src/main/java/org/apache/hadoop/yarn/server/resourcemanager/webapp/RMWebServices.java
	hadoop-yarn-project/hadoop-yarn/hadoop-yarn-server/hadoop-yarn-server-resourcemanager/src/main/java/org/apache/hadoop/yarn/server/resourcemanager/webapp/dao/AppAttemptInfo.java
2015-09-04 11:45:51 +05:30
Jian He
2d1ff2e1ca YARN-4101. RM should print alert messages if Zookeeper and Resourcemanager gets connection issue. Contributed by Xuan Gong
(cherry picked from commit 09c64ba1ba8be7a2ac31f4e42efb8c99b682399f)
2015-09-02 17:46:24 -07:00
Rohith Sharma K S
c2ed7e4a09 YARN-3893. Both RM in active state when Admin#transitionToActive failure from refreshAll() (Bibin A Chundatt via rohithsharmaks)
(cherry picked from commit 7d6687fe76f6152a577ff2298c358dd30fce41fb)
2015-09-02 15:29:22 +05:30
Jian He
f44ed4f4b0 YARN-4092. Fixed UI redirection to print useful messages when both RMs are in standby mode. Contributed by Xuan Gong
(cherry picked from commit a3fd2ccc869dfc1f04d1cf0a8678d4d90a43a80f)
(cherry picked from commit 48f5161cd5d4c2f4e385b253a5bea066b2e23b9e)
2015-08-31 17:46:27 -07:00
Zhihai Xu
7409ed783e YARN-3857: Memory leak in ResourceManager with SIMPLE mode. Contributed by mujunchao. 2015-08-18 12:08:20 -07:00
Jian He
de1c4121a1 YARN-4047. ClientRMService getApplications has high scheduler lock contention. Contributed by Jason Lowe
(cherry picked from commit 7a445fcfabcf9c6aae219051f65d3f6cb8feb87c)

Conflicts:
	hadoop-yarn-project/CHANGES.txt
2015-08-13 16:15:44 -07:00
Xuan
617e294a63 YARN-3999. RM hangs on draining events. Contributed by Jian He 2015-08-11 18:27:23 -07:00
Jonathan Eagles
dcffe43419 YARN-3978. Configurably turn off the saving of container info in Generic AHS (Eric Payne via jeagles)
(cherry picked from commit 3cd02b95224e9d43fd63a4ef9ac5c44f113f710d)

Conflicts:
	hadoop-yarn-project/hadoop-yarn/hadoop-yarn-server/hadoop-yarn-server-resourcemanager/src/test/java/org/apache/hadoop/yarn/server/resourcemanager/scheduler/capacity/TestLeafQueue.java
	hadoop-yarn-project/hadoop-yarn/hadoop-yarn-server/hadoop-yarn-server-resourcemanager/src/test/java/org/apache/hadoop/yarn/server/resourcemanager/scheduler/fifo/TestFifoScheduler.java
2015-08-03 10:53:43 -05:00
Jason Lowe
e1ae722a7d YARN-3990. AsyncDispatcher may overloaded with RMAppNodeUpdateEvent when Node is connected/disconnected. Contributed by Bibin A Chundatt
(cherry picked from commit 32e490b6c035487e99df30ce80366446fe09bd6c)
2015-07-31 17:39:32 +00:00
carlo curino
76339bfb9f YARN-3969. Allow jobs to be submitted to reservation that is active but does not have any allocations. (subru via curino)
(cherry picked from commit 0fcb4a8cf2add3f112907ff4e833e2f04947b53e)
(cherry picked from commit 6a71a0a230b337c8d62cd116a9742c54f00728e9)
2015-07-24 08:54:59 -07:00
Arun Suresh
63e4ada517 YARN-3535. Scheduler must re-request container resources when RMContainer transitions from ALLOCATED to KILLED (rohithsharma and peng.zhang via asuresh)
(cherry picked from commit 9b272ccae78918e7d756d84920a9322187d61eed)

Conflicts:
	hadoop-yarn-project/hadoop-yarn/hadoop-yarn-server/hadoop-yarn-server-resourcemanager/src/main/java/org/apache/hadoop/yarn/server/resourcemanager/rmcontainer/RMContainerImpl.java
	hadoop-yarn-project/hadoop-yarn/hadoop-yarn-server/hadoop-yarn-server-resourcemanager/src/test/java/org/apache/hadoop/yarn/server/resourcemanager/scheduler/TestAbstractYarnScheduler.java
	hadoop-yarn-project/hadoop-yarn/hadoop-yarn-server/hadoop-yarn-server-resourcemanager/src/test/java/org/apache/hadoop/yarn/server/resourcemanager/scheduler/fair/TestFairScheduler.java
2015-07-18 13:59:19 -07:00
Wangda Tan
a6f6ba95ef YARN-3508. Prevent processing preemption events on the main RM dispatcher. (Varun Saxena via wangda) 2015-07-02 11:08:21 -07:00
Vinod Kumar Vavilapalli
a36f090c55 Preparing for 2.7.2 development: mvn versions:set -DnewVersion=2.7.2 2015-06-28 16:41:12 -07:00
Jason Lowe
37b89deccf YARN-3809. Failed to launch new attempts because ApplicationMasterLauncher's threads all hang. Contributed by Jun Gong
(cherry picked from commit 2a20dd9b61ba3833460cbda0e8c3e8b6366fc3ab)
2015-06-24 16:25:22 +00:00
Xuan
c0a419b134 YARN-3804. Both RM are on standBy state when kerberos user not in yarn.admin.acl. Contributed by Varun Saxena 2015-06-17 16:27:01 -07:00
Jian He
a78ca0fadc YARN-3764. CapacityScheduler should forbid moving LeafQueue from one parent to another. Contributed by Wangda Tan
(cherry picked from commit 6ad4e59cfc111a92747fdb1fb99cc6378044832a)
2015-06-04 10:53:47 -07:00
Wangda Tan
e74e4d7bb9 YARN-3733. Fix DominantRC#compare() does not work as expected if cluster resource is empty. (Rohith Sharmaks via wangda)
(cherry picked from commit ebd797c48fe236b404cf3a125ac9d1f7714e291e)
2015-06-04 10:26:06 -07:00
Xuan
b34825b0cb YARN-3753. RM failed to come up with "java.io.IOException: Wait for ZKClient creation timed out". Contributed by Jian He 2015-06-02 10:28:14 -07:00
Xuan
1ad0d43fb4 Revert "YARN-1462. Made RM write application tags to timeline server and exposed them to users via generic history web UI and REST API. Contributed by Xuan Gong."
This reverts commit 2032e8d1a0c6abe154a64df69b0124cdae9f78e0.
2015-06-01 11:29:36 -07:00
Zhijie Shen
2032e8d1a0 YARN-1462. Made RM write application tags to timeline server and exposed them to users via generic history web UI and REST API. Contributed by Xuan Gong. 2015-05-30 09:39:49 -07:00
Vinod Kumar Vavilapalli
9449b72429 Fixed more FileSystemRMStateStore issues. Contributed by Vinod Kumar Vavilapalli.
(cherry picked from commit 9acd24fec408c182683968ae2c75ca2c38866a60)
2015-05-29 13:29:44 -07:00
Wangda Tan
bb8350388b YARN-3489. RMServerUtils.validateResourceRequests should only obtain queue info once. (Varun Saxena via wangda) 2015-05-28 17:04:20 -07:00
Wangda Tan
fb57a1aac8 YARN-3686. CapacityScheduler should trim default_node_label_expression. (Sunil G via wangda)
(cherry picked from commit cdbd66be111c93c85a409d47284e588c453ecae9)
2015-05-26 16:29:27 -07:00
Karthik Kambatla
c60054743f YARN-3675. FairScheduler: RM quits when node removal races with continuous-scheduling on the same node. (Anubhav Dhoot via kasha)
(cherry picked from commit a8b50e46737c11936ba72c427da69b2365a07aac)
(cherry picked from commit e8ac88d4fe8aaef9d2e5fb76e6bc50223ff0e495)
2015-05-21 13:43:22 -07:00
Jian He
ce45e4e82e YARN-3609. Load node labels from storage inside RM serviceStart. Contributed by Wangda Tan 2015-05-20 17:06:59 -07:00
Wangda Tan
114d41aeb2 Backport "YARN-2918. RM should not fail on startup if queue's configured labels do not exist in cluster-node-labels. Contributed by Wangda Tan" to 2.7.1 2015-05-20 13:27:01 -07:00