1157 Commits

Author SHA1 Message Date
Jason Lowe
570d52e53c YARN-2964. RM prematurely cancels tokens for jobs that submit jobs (oozie). Contributed by Jian He
(cherry picked from commit 0402bada1989258ecbfdc437cb339322a1f55a97)

(cherry picked from commit 173664d70f0ed3b1852b6703d32e796778fb1c78)
(cherry picked from commit 04e71db1ce9572ae0641234a02b7db5d174668fd)
2015-08-30 20:06:07 -07:00
Karthik Kambatla
6e954bc25c YARN-2910. FSLeafQueue can throw ConcurrentModificationException. (Wilfred Spiegelenburg via kasha)
(cherry picked from commit a2e07a54561a57a83b943628ebbc53ed5ba52718)
(cherry picked from commit 1986ea8dd223267ced3e3aef69980b46e2fef740)
(cherry picked from commit 2b827a18d7b4eb41dc0095ea7277239273e7e396)
2015-08-30 13:44:52 -07:00
Karthik Kambatla
7d686eccc3 YARN-2874. Deadlock in DelegationTokenRenewer which blocks RM from executing any further apps. (Naganarasimha G R via kasha)
(cherry picked from commit 799353e2c7db5af6e40e3521439b5c8a3c5c6a51)

(cherry picked from commit 25be97808b99148412c0efd4d87fc750db4d6607)
(cherry picked from commit d82bf536d44c6e7ba06a01105545b3979b731d80)
2015-08-30 13:25:37 -07:00
Jian He
3600f30c35 YARN-2894. Fixed a bug regarding application view acl when RM fails over. Contributed by Rohith Sharmaks
(cherry picked from commit 392c3aaea8e8f156b76e418157fa347256283c56)

(cherry picked from commit d6f3d4893d750f19dd8c539fe28eecfab2a54576)
(cherry picked from commit 61efbc1cba0c4a81b8aafb1d45c2f7b3cf7857d8)
2015-08-30 13:08:42 -07:00
Zhijie Shen
8a47d1aa55 YARN-2890. MiniYARNCluster should start the timeline server based on the configuration. Contributed by Mit Desai.
(cherry picked from commit 51af8d367de94689770f57c64bea3b244d7755f6)
(cherry picked from commit d21ef79707a0f32939d9a5af4fed2d9f5fe6f2ec)
(cherry picked from commit db325c053c6ef3ee8731d7273d1d92da7e5deee7)
2015-08-27 19:47:43 -07:00
Jian He
888ab4a6e7 YARN-2906. CapacitySchedulerPage shows HTML tags for a queue's Active Users. Contributed by Jason Lowe
(cherry picked from commit 8a7ca13b13c0c3f008a6490cc96d4d48a051d1f7)

(cherry picked from commit ae35b0e14d3438237f4b5d3b5d5268d45e549846)
(cherry picked from commit 65acee3e19a147e5c5a8688319ab75357bdf51b5)
2015-08-27 19:10:43 -07:00
Jian He
7f97189bcf YARN-2865. Fixed RM to always create a new RMContext when it transitions from StandBy to Active. Contributed by Rohith Sharmaks
(cherry picked from commit 9cb8b75ba57f18639492bfa3b7e7c11c00bb3d3b)

(cherry picked from commit db31ef7e7f55436bbf88c6d93e2273c4463ca9f0)
(cherry picked from commit e669974ae94c03914c9181a4481b4879fd4acc0d)
2015-08-27 18:56:22 -07:00
Jason Lowe
f307b426f3 YARN-2414. RM web UI: app page will crash if app is failed before any attempt has been created. Contributed by Wangda Tan
(cherry picked from commit 81c9d17af84ed87b9ded7057cb726a3855ddd32d)

(cherry picked from commit 242fd0e39ad1c5d51719cd0f6c197166066e3288)
(cherry picked from commit a9d5acd898b34e1050a78f2d70ed62fdb82948a6)
2015-08-27 18:36:50 -07:00
Jason Lowe
f83d898944 YARN-2816. NM fail to start with NPE during container recovery. Contributed by Zhihai Xu
(cherry picked from commit 49c38898b0be64fc686d039ed2fb2dea1378df02)

(cherry picked from commit ad140d1fc831735fb9335e27b38d2fc040847af1)
(cherry picked from commit 85b23c323c80c5303bd0b7bdb066258792ca67d8)
2015-08-27 18:32:59 -07:00
Jian He
81ba30211e YARN-2856. Fixed RMAppImpl to handle ATTEMPT_KILLED event at ACCEPTED state on app recovery. Contributed by Rohith Sharmaks
(cherry picked from commit d005404ef7211fe96ce1801ed267a249568540fd)

(cherry picked from commit beb184ac580b0d89351a3f3a7201da34a26db1c1)
(cherry picked from commit 325bb33988743d60cb333002f9da60314241632e)
2015-08-27 18:29:39 -07:00
Wangda Tan
881084fe5c YARN-3251. Fixed a deadlock in CapacityScheduler when computing absoluteMaxAvailableCapacity in LeafQueue (Craig Welch via wangda)
2015-02-26 17:05:25 -08:00
Vinod Kumar Vavilapalli
56020955fd YARN-2853. Fixed a bug in ResourceManager causing apps to hang when the user kill request races with ApplicationMaster finish. Contributed by Jian He.
(cherry picked from commit 3651fe1b089851b38be351c00a9899817166bf3e)

Conflicts:
	hadoop-yarn-project/CHANGES.txt
2014-11-13 10:40:09 -08:00
Karthik Kambatla
b579c3d405 YARN-2635. TestRM, TestRMRestart, TestClientToAMTokens should run with both CS and FS. (Wei Yan and kasha via kasha)
(cherry picked from commit 80d11eb68e60f88e16d7d41edecbddfc935a6b10)

Conflicts:
	hadoop-yarn-project/CHANGES.txt
	hadoop-yarn-project/hadoop-yarn/hadoop-yarn-server/hadoop-yarn-server-resourcemanager/src/test/java/org/apache/hadoop/yarn/server/resourcemanager/security/TestClientToAMTokens.java
2014-11-13 10:36:13 -08:00
Jason Lowe
685790e027 YARN-2846. Incorrect persisted exit code for running containers in reacquireContainer() that is interrupted by NodeManager restart. Contributed by Junping Du
(cherry picked from commit 33ea5ae92b9dd3abace104903d9a94d17dd75af5)
2014-11-13 16:12:30 +00:00
Zhijie Shen
1535e7211e YARN-2794. Fixed log messages about distributing system-credentials. Contributed by Jian He.
(cherry picked from commit be7bf956e96dd0fd9b521ca71df9124b9cc5ebd0)
2014-11-12 11:10:47 -08:00
Xuan
a690629ffc YARN-2841. RMProxy should retry EOFException. Contributed by Jian He
2014-11-12 10:28:55 -08:00
Ravi Prakash
9613a57e83 YARN-1964. Create Docker analog of the LinuxContainerExecutor in YARN
2014-11-12 09:33:42 -08:00
Arun C. Murthy
60584c732f Preparing to release hadoop-2.6.0: Set version in branch-2.6 to 2.6.1-SNAPSHOT.
2014-11-09 19:21:11 -08:00
Vinod Kumar Vavilapalli
6a9534e9cf YARN-2834. Fixed ResourceManager to ignore token-renewal failures on recovery consistent with the (somewhat incorrect) behaviour in the non-recovery case. Contributed by Jian He.
Fixed a minor import issue in the test during cherry-pick from trunk.

(cherry picked from commit e76faebc9589654e83c8244ef9aff88391e56b80)

Conflicts:
	hadoop-yarn-project/hadoop-yarn/hadoop-yarn-server/hadoop-yarn-server-resourcemanager/src/test/java/org/apache/hadoop/yarn/server/resourcemanager/TestWorkPreservingRMRestart.java
2014-11-09 19:09:48 -08:00
Arun C. Murthy
c2cc879f01 YARN-2830. Add backwards compatible ContainerId.newInstance constructor. Contributed by Jonathan Eagles.
(cherry picked from commit 43cd07b408c6613d2c9aa89203cfa3110d830538)
2014-11-09 15:04:55 -08:00
Zhijie Shen
33da0f8ecf YARN-2505. Supported get/add/remove/change labels in RM REST API. Contributed by Craig Welch.
(cherry picked from commit 9a4e0d343e9e891c10ef6682e7b2231a59e69ade)
2014-11-07 20:40:56 -08:00
Vinod Kumar Vavilapalli
6ff765bea2 YARN-2826. Fixed user-groups mappings' refresh bug caused by YARN-2826. Contributed by Wangda Tan.
(cherry picked from commit df36edf751202db00d8f43103d7120ec56d70a04)
2014-11-07 19:45:23 -08:00
Xuan
629274c09b YARN-2819. NPE in ATS Timeline Domains when upgrading from 2.4 to 2.6. Contributed by Zhijie Shen
(cherry picked from commit 4a114dd67aae83e5bb2d65470166de954acf36a2)
(cherry picked from commit 7d26cffb0cb21c16b335e8ae9b02523565ad3b5d)
2014-11-07 16:13:38 -08:00
Jason Lowe
e5c0fa2c95 YARN-2825. Container leak on NM. Contributed by Jian He
(cherry picked from commit c3d475070a1ec54c4b05923f4782cef204effd2c)
2014-11-07 23:17:49 +00:00
Vinod Kumar Vavilapalli
6593aaf117 YARN-2753. Fixed a bunch of bugs in the NodeLabelsManager classes. Contributed by Zhihai Xu.
(cherry picked from commit 4cfd5bc7c18bb9a828f573b5c4d2b13fa28e732a)
2014-11-07 14:17:23 -08:00
cnauroth
9adb31b6a8 YARN-2803. MR distributed cache not working correctly on Windows after NodeManager privileged account changes. Contributed by Craig Welch.
(cherry picked from commit 06b797947c980d7d21864eb8b700cf565756aac1)
(cherry picked from commit c16f7182937dc4e8bbf46dfedd9db427a0a33357)
2014-11-07 12:45:25 -08:00
Vinod Kumar Vavilapalli
71a18a5303 YARN-2824. Fixed Capacity Scheduler to not crash when some node-labels are not mapped to queues by making default capacities per label to be zero. Contributed by Wangda Tan.
(cherry picked from commit 2ac1be7dec4aef001e3162e364249933b2c4a6c4)
2014-11-07 10:45:01 -08:00
Xuan
09955ea2c3 YARN-2810. TestRMProxyUsersConf fails on Windows VMs. Contributed by Varun Vasudev
(cherry picked from commit 1e97f2f09464e871773188f642f3a01b744c580f)
(cherry picked from commit bf795418686e1559db4c37c0b107bb5c08bbf525)
2014-11-07 09:46:53 -08:00
Vinod Kumar Vavilapalli
9f7396be55 YARN-2823. Fixed ResourceManager app-attempt state machine to inform schedulers about previous finished attempts of a running application to avoid expectation mismatch w.r.t. transferred containers. Contributed by Jian He.
(cherry picked from commit a5657182a7accebe08cd86e46b4cdeb163d4d1f2)
2014-11-07 09:30:31 -08:00
Vinod Kumar Vavilapalli
21ef5afafa YARN-2744. Fixed CapacityScheduler to validate node-labels correctly against queues. Contributed by Wangda Tan.
(cherry picked from commit a3839a9fbfb8eec396b9bf85472d25e0ffc3aab2)
2014-11-06 17:29:39 -08:00
Vinod Kumar Vavilapalli
b557f689b4 YARN-2818. Removed the now unnecessary user entity injection from Timeline service given we now have domains. Contributed by Zhijie Shen.
(cherry picked from commit f5b19bed7d71979dc8685b03152188902b6e45e9)
2014-11-06 11:50:12 -08:00
Xuan
43854764a9 YARN-2812. TestApplicationHistoryServer is likely to fail on less powerful machine. Contributed by Zhijie Shen
(cherry picked from commit b0b52c4e11336ca2ad6a02d64c0b5d5a8f1339ae)
(cherry picked from commit 4aa98d599194a444c9d2e1fe95262e32bf744d35)
2014-11-05 20:45:09 -08:00
Xuan
888fc4af4e YARN-2813. Fixed NPE from MemoryTimelineStore.getDomains. Contributed by Zhijie Shen
2014-11-05 18:27:41 -08:00
Jian He
e29e864c51 YARN-2579. Fixed a deadlock issue when EmbeddedElectorService and FatalEventDispatcher try to transition RM to StandBy at the same time. Contributed by Rohith Sharmaks
(cherry picked from commit 395275af8622c780b9071c243422b0780e096202)
2014-11-05 17:03:26 -08:00
Vinod Kumar Vavilapalli
812ddc3991 YARN-2805. Fixed ResourceManager to load HA configs correctly before kerberos login. Contributed by Wangda Tan.
(cherry picked from commit 834e931d8efe4d806347b266e7e62929ce05389b)
2014-11-05 15:32:49 -08:00
Zhijie Shen
a1764e4d33 YARN-2767. Added a test case to verify that http static user cannot kill or submit apps in the secure mode. Contributed by Varun Vasudev.
(cherry picked from commit 7a4c92a9d55fcecef066053ac30dff0fcd4ec90c)
2014-11-05 11:04:38 -08:00
Vinod Kumar Vavilapalli
87e880b580 YARN-2804. Fixed Timeline service to not fill the logs with JAXB bindings exceptions. Contributed by Zhijie Shen.
(cherry picked from commit b76179895dd2ef4d56e8de31e9f673375faa2afa)

Conflicts:
	hadoop-yarn-project/CHANGES.txt
2014-11-04 18:04:37 -08:00
Karthik Kambatla
36993e39d0 YARN-2010. Handle app-recovery failures gracefully. (Jian He and Karthik Kambatla via kasha)
(cherry picked from commit b2cd2698028118b6384904732dbf94942f644732)
2014-11-04 17:49:56 -08:00
Zhijie Shen
1550617e48 YARN-2752. Made ContainerExecutor append "nice -n" arg only when priority adjustment flag is set. Contributed by Xuan Gong.
(cherry picked from commit e06c23a6c92ef783cdb45447fa2abd1ab48d166f)
2014-11-04 15:54:28 -08:00
Vinod Kumar Vavilapalli
0b73606b1c YARN-1922. Fixed NodeManager to kill process-trees correctly in the presence of races between the launch and the stop-container call and when root processes crash. Contributed by Billie Rinaldi.
(cherry picked from commit c5a46d4c8ca236ff641a309f256bbbdf4dd56db5)
2014-11-03 16:41:21 -08:00
Vinod Kumar Vavilapalli
f2ef8c7b48 YARN-2795. Fixed ResourceManager to not crash loading node-label data from HDFS in secure mode. Contributed by Wangda Tan.
(cherry picked from commit ec6cbece8e7772868ce8ad996135d3136bd32245)
2014-11-03 13:46:05 -08:00
Vinod Kumar Vavilapalli
c205841d49 YARN-2788. Fixed backwards compatibility issues with log-aggregation feature that were caused when adding log-upload-time via YARN-2703. Contributed by Xuan Gong.
(cherry picked from commit 58e9f24e0f06efede21085b7ffe36af042fa7b38)
2014-11-03 13:20:01 -08:00
Jason Lowe
2098c68acb YARN-2730. DefaultContainerExecutor runs only one localizer at a time. Contributed by Siqi Li
(cherry picked from commit 6157ace5475fff8d2513fd3cd99134b532b0b406)
2014-11-03 20:40:02 +00:00
Zhijie Shen
3e41828639 YARN-2785. Fixed intermittent TestContainerResourceUsage failure. Contributed by Varun Vasudev.
(cherry picked from commit 27715ec63bd77f1d31ee922b7daba85071da54ca)
2014-11-02 15:24:07 -08:00
Vinod Kumar Vavilapalli
5a0aac5506 YARN-2790. Fixed a NodeManager bug that was causing log-aggregation to fail beyond HDFS delegation-token expiry even when RM is a proxy-user (YARN-2704). Contributed by Jian He.
(cherry picked from commit 5c0381c96aa79196829edbca497c649eb6776944)
2014-11-01 16:34:11 -07:00
Zhijie Shen
5492370a4c YARN-2711. Fixed TestDefaultContainerExecutor#testContainerLaunchError failure on Windows. Contributed by Varun Vasudev.
(cherry picked from commit 1cd088fd9dac3015df0b6281974fc6b6c3ece20d)
2014-10-31 17:51:03 -07:00
Vinod Kumar Vavilapalli
f71d940e42 YARN-2789. Re-instated the NodeReport.newInstance private unstable API modified in YARN-2698 so that tests in YARN frameworks don't break. Contributed by Wangda Tan.
(cherry picked from commit 6ce32f593bff6788084ce9bc1e11ade74ed3dbaf)
2014-10-31 15:34:07 -07:00
Xuan
aa13977001 YARN-2701. Addendum patch. Potential race condition in startLocalizer when using LinuxContainerExecutor. Contributed by Xuan Gong
2014-10-31 14:39:49 -07:00
Jian He
a859adcc23 YARN-2770. Added functionality to renew/cancel TimeLineDelegationToken. Contributed by Zhijie Shen
(cherry picked from commit 1b4be918664b09272b120bc42de3e5fc02d79047)
2014-10-31 13:17:45 -07:00
Vinod Kumar Vavilapalli
a3f032031d YARN-2779. Fixed ResourceManager to not require delegation tokens for communicating with Timeline Service. Contributed by Zhijie Shen.
(cherry picked from commit d1828d94435eca21761b0ba8458f9de2f125d012)
2014-10-30 23:17:50 -07:00