Commit Graph

1559 Commits

Author SHA1 Message Date
Jian He 92742b4402 YARN-2301. Improved yarn container command. Contributed by Naganarasimha G R
(cherry picked from commit 258623ff8b)

(cherry picked from commit 1d1e7682c9)
2015-09-08 22:57:28 -07:00
Jian He 2336264900 YARN-2918. RM should not fail on startup if queue's configured labels do not exist in cluster-node-labels. Contributed by Wangda Tan
(cherry picked from commit f489a4ec96)

(cherry picked from commit d817fbb34d)
2015-09-06 14:15:33 -07:00
Jian He ee2b6bc248 YARN-3124. Fixed CS LeafQueue/ParentQueue to use QueueCapacities to track capacities-by-label. Contributed by Wangda Tan
(cherry picked from commit 18a594257e)

(cherry picked from commit 1be2d64ddd)
2015-09-06 11:54:40 -07:00
Jian He 637e7f9e39 YARN-2694. Ensure only single node label specified in ResourceRequest. Contributed by Wangda Tan
(cherry picked from commit c1957fef29)

(cherry picked from commit 3ddafaa7c8)
2015-09-05 21:07:51 -07:00
Jian He 4c94f07140 YARN-3098. Created common QueueCapacities class in Capacity Scheduler to track capacities-by-labels of queues. Contributed by Wangda Tan
(cherry picked from commit 21d80b3dd9)

(cherry picked from commit c0b1311a93)
2015-09-05 20:54:20 -07:00
Jian He d9281fbbab YARN-3099. Capacity Scheduler LeafQueue/ParentQueue should use ResourceUsage to track used-resources-by-label. Contributed by Wangda Tan
(cherry picked from commit 86358221fc)

(cherry picked from commit cabf97ae4f)
2015-09-05 20:54:20 -07:00
Jian He b0ad553841 YARN-3092. Created a common ResourceUsage class to track labeled resource usages in Capacity Scheduler. Contributed by Wangda Tan
(cherry picked from commit 6f9fe76918)

(cherry picked from commit 61b4116b4b)
2015-09-05 20:54:19 -07:00
Jian He 419e18cb37 YARN-2978. Fixed potential NPE while getting queue info. Contributed by Varun Saxena
(cherry picked from commit dd57c2047b)

(cherry picked from commit c61e8a7bfa)
2015-09-05 20:54:19 -07:00
Jian He 88f022da24 YARN-2920. Changed CapacityScheduler to kill containers on nodes where node labels are changed. Contributed by Wangda Tan
(cherry picked from commit fdf042dfff)

(cherry picked from commit 411836b74c)
2015-09-05 20:54:18 -07:00
Wangda Tan 2073fc0f84 Add missing test file of YARN-3733
(cherry picked from commit 405bbcf68c)
(cherry picked from commit 344b750915)
2015-09-03 17:43:03 -07:00
Wangda Tan 85d92721a4 YARN-3733. Fix DominantRC#compare() does not work as expected if cluster resource is empty. (Rohith Sharmaks via wangda)
(cherry picked from commit ebd797c48f)
(cherry picked from commit 78d626fa89)
2015-09-03 17:43:01 -07:00
Jian He f1b35ffd4c YARN-2637. Fixed max-am-resource-percent calculation in CapacityScheduler when activating applications. Contributed by Craig Welch
(cherry picked from commit c53420f583)

(cherry picked from commit 4931600030)
2015-09-03 17:40:24 -07:00
Jason Lowe ca7fe71000 YARN-3990. AsyncDispatcher may overloaded with RMAppNodeUpdateEvent when Node is connected/disconnected. Contributed by Bibin A Chundatt
(cherry picked from commit 32e490b6c0)

(cherry picked from commit c31e3ba921)
(cherry picked from commit 07d31d4c0808a169f4770187d655f38aa105255c)
2015-09-03 14:40:20 -07:00
Jason Lowe fe5877a49e YARN-3850. NM fails to read files from full disks which can lead to container logs being lost and other issues. Contributed by Varun Saxena
(cherry picked from commit 40b256949a)

(cherry picked from commit 0221d19f4e)
(cherry picked from commit 87d2204f28f192a964c04a5fa1e2e31644d74b59)
2015-09-03 14:35:01 -07:00
Jason Lowe f21fb808f1 YARN-3832. Resource Localization fails on a cluster due to existing cache directories. Contributed by Brahma Reddy Battula
(cherry picked from commit 8d58512d6e)

(cherry picked from commit 15b1800b12)
(cherry picked from commit 38400507e3352d83c2a1f364de137366249b7983)
2015-09-03 14:26:43 -07:00
Jason Lowe 193d8d3667 YARN-3585. NodeManager cannot exit on SHUTDOWN event triggered and NM recovery is enabled. Contributed by Rohith Sharmaks
(cherry picked from commit e13b671aa5)

(cherry picked from commit 752caa95a4)
(cherry picked from commit 13c4db632b0e7f19dcfa883c2492431c2c7d0799)
2015-09-03 14:09:16 -07:00
Wangda Tan ae0fac3efa YARN-3725. App submission via REST API is broken in secure mode due to Timeline DT service address is empty. (Zhijie Shen via wangda)
(cherry picked from commit 5cc3fced95)
(cherry picked from commit a3734f67d3)
(cherry picked from commit 9ccc22e2ac89990f3e7997f1d89594523c66e76a)
2015-09-03 13:50:14 -07:00
Xuan 1c6a287bf5 YARN-2900. Application (Attempt and Container) Not Found in AHS results
in Internal Server Error (500). Contributed by Zhijie Shen and Mit Desai

(cherry picked from commit 06f8e9caba)
(cherry picked from commit 4fee8b3202)
(cherry picked from commit 6c7b625138ce3b262a8c8aa28077074b553638ed)
2015-09-03 13:45:06 -07:00
Zhijie Shen 42ce052585 YARN-3700. Made generic history service load a number of latest applications according to the parameter or the configuration. Contributed by Xuan Gong.
(cherry picked from commit 54504133f4)
(cherry picked from commit 839f81a632)
(cherry picked from commit 058380d9ef35f35e8c624fb8783eac0904c4d1f5)
2015-09-03 12:59:33 -07:00
Zhijie Shen 0f33fcd507 YARN-2766. Made ApplicationHistoryManager return a sorted list of apps, attempts and containers. Contributed by Robert Kanter.
(cherry picked from commit 3648cb57c9)
(cherry picked from commit 53d6c91df9)
2015-09-03 12:58:42 -07:00
Xuan 7b1a71a7ad YARN-3526. ApplicationMaster tracking URL is incorrectly redirected on a QJM cluster. Contributed by Weiwei Yang
(cherry picked from commit b0ad644083)
(cherry picked from commit 802676e1be)
(cherry picked from commit 2cadeb9e017c6a75db16e1f23b2accda04f12298)
2015-09-03 11:54:23 -07:00
Jason Lowe 778da79e6f YARN-3641. NodeManager: stopRecoveryStore() shouldn't be skipped when exceptions happen in stopping NM's sub-services. Contributed by Junping Du
(cherry picked from commit 711d77cc54)

(cherry picked from commit a81ad81461)
(cherry picked from commit aa82b0684554be8d09f6fcd88826f167922280cc)
2015-09-03 11:50:30 -07:00
Karthik Kambatla 6ade6b5051 YARN-3464. Race condition in LocalizerRunner kills localizer before localizing all resources. (Zhihai Xu via kasha)
(cherry picked from commit 47279c3228)
(cherry picked from commit 4045c41afe)
(cherry picked from commit 6f2cc0dfa8f21984ecdab59dc087ccf525934930)
2015-09-02 15:03:51 -07:00
Xuan 9af5b1dcd0 YARN-3024. LocalizerRunner should give DIE action when all resources are
localized. Contributed by Chengbing Liu

(cherry picked from commit 0d6bd62102)
(cherry picked from commit a7696b3fbf)
(cherry picked from commit 9e30232004ab7c3c3bfde3b8b27c37fa7065f6be)
2015-09-02 14:52:06 -07:00
Wangda Tan e081593042 YARN-3487. CapacityScheduler scheduler lock obtained unnecessarily when calling getQueue (Jason Lowe via wangda)
(cherry picked from commit f47a5763ac)
(cherry picked from commit 3316cd4357)
(cherry picked from commit 24d45ee9544abcfcf9e611ab835ec2f824333670)
2015-09-02 11:28:22 -07:00
Wangda Tan 61f2ddb125 YARN-3493. RM fails to come up with error "Failed to load/recover state" when mem settings are changed. (Jian He via wangda)
(cherry picked from commit f65eeb412d)
(cherry picked from commit e7cbecddc3)
(cherry picked from commit 9d47d5aa5bffe427c4a77260f7ccc039d446e1fd)
2015-09-02 11:14:35 -07:00
Vinod Kumar Vavilapalli 752e3da738 YARN-3055. Fixed ResourceManager's DelegationTokenRenewer to not stop token renewal of applications part of a bigger workflow. Contributed by Daryn Sharp.
(cherry picked from commit 9c5911294e)
(cherry picked from commit 1ff3fd33ed)
(cherry picked from commit 82c722aae86669325672dd10840447434f15e7fd)
2015-09-01 21:31:00 -07:00
Xuan 914cc8f4a4 YARN-3393. Getting application(s) goes wrong when app finishes before
starting the attempt. Contributed by Zhijie Shen

(cherry picked from commit 9fae455e26)
(cherry picked from commit cbdcdfad6d)
(cherry picked from commit 61aafdcfa589cbae8363976c745ea528b03f152d)
2015-09-01 18:14:51 -07:00
Wangda Tan 005d865494 YARN-3369. Missing NullPointer check in AppSchedulingInfo causes RM to die. (Brahma Reddy Battula via wangda)
(cherry picked from commit 6bc7710ec7)
(cherry picked from commit 8e142d27cb)
(cherry picked from commit 4d43be3c01b1bc0deb31a9081fca5395d0eb4e0d)
2015-09-01 17:10:42 -07:00
Jonathan Eagles a9bb641d51 YARN-3267. Timelineserver applies the ACL rules after applying the limit on the number of records (Chang Li via jeagles)
(cherry picked from commit 8180e676ab)
(cherry picked from commit 44aedad5dd)
(cherry picked from commit f4bbf2c8f97d3601132504453f61e472950a433e)
2015-09-01 16:04:19 -07:00
Zhijie Shen 9005b141a5 YARN-3287. Made TimelineClient put methods do as the correct login context. Contributed by Daryn Sharp and Jonathan Eagles.
(cherry picked from commit d6e05c5ee2)
(cherry picked from commit a94d23762e)
(cherry picked from commit 68e07eb50b872ec8a78923df8f5f640f08a72aa2)
2015-09-01 15:24:36 -07:00
Xuan a57ada6c1f YARN-3227. Timeline renew delegation token fails when RM user's TGT is
expired. Contributed by Zhijie Shen

(cherry picked from commit d1abc5d4fc)
(cherry picked from commit 56c2050ab7)
(cherry picked from commit 780a9b1a98827a692e0ea9fbc92f9d1ab979e3e0)
2015-09-01 15:21:46 -07:00
Jian He 7ffdf7d105 YARN-1809. Synchronize RM and TimeLineServer Web-UIs. Contributed by Zhijie Shen and Xuan Gong
(cherry picked from commit 95bfd087dc)

(cherry picked from commit a5f3fb4dc1)
(cherry picked from commit 27a2f0acb84202cc082090eef7eea57f6e42f9bb)
2015-09-01 15:12:53 -07:00
Tsuyoshi Ozawa 81417f7572 YARN-3249. Add a 'kill application' button to Resource Manager's Web UI. Contributed by Ryu Kobayashi.
(cherry picked from commit 1b67209612)
(cherry picked from commit 6660c2f83b)
(cherry picked from commit 6ea859e435e7cd6bc342f67e1551ccb86fbd976f)
2015-09-01 14:37:21 -07:00
Wangda Tan 8b5bdac98e YARN-3230. Clarify application states on the web UI. (Jian He via wangda)
(cherry picked from commit ce5bf927c3)
(cherry picked from commit a1963968d2)
(cherry picked from commit 591e261ccf1fb5dd25e87665c8d5c0341ff6fb24)
2015-09-01 14:34:07 -07:00
Karthik Kambatla 5a6755cc0f YARN-3242. Asynchrony in ZK-close can lead to ZKRMStateStore watcher receiving events for old client. (Zhihai Xu via kasha)
(cherry picked from commit 8d88691d16)
(cherry picked from commit 0d62e94887)
(cherry picked from commit 4a5b0e708d42fbff571229a43d1762d1767e2db5)
2015-09-01 14:06:34 -07:00
Karthik Kambatla dbc5bab9fd YARN-3231. FairScheduler: Changing queueMaxRunningApps interferes with pending jobs. (Siqi Li via kasha)
(cherry picked from commit 22426a1c9f)
(cherry picked from commit 721d7b5741)
(cherry picked from commit 5dfa25f22a989222e8b3d1013117b3350a48b2c5)
2015-09-01 13:54:04 -07:00
Vinod Kumar Vavilapalli 2692401dfb YARN-3222. Added the missing CHANGES.txt entry.
(cherry picked from commit 4620767156)
(cherry picked from commit 015696fb81)
2015-09-01 13:49:04 -07:00
Jian He db92b09e03 YARN-3222. Fixed NPE on RMNodeImpl#ReconnectNodeTransition when a node is reconnected with a different port. Contributed by Rohith Sharmaks
(cherry picked from commit b2f1ec312e)

(cherry picked from commit 888a445638)
(cherry picked from commit b78f87825bd593e30b2f2ea76f37c7a4fd673ab2)
2015-09-01 13:39:35 -07:00
Jason Lowe a4b8897b30 YARN-3239. WebAppProxy does not support a final tracking url which has query fragments and params. Contributed by Jian He
(cherry picked from commit 1a68fc4346)

(cherry picked from commit 257087417e)
(cherry picked from commit 49468108c203bf093acdc93c1798d90c480c3a17)
2015-09-01 13:32:21 -07:00
Xuan 4fcf71c1e7 YARN-3238. Connection timeouts to nodemanagers are retried at multiple
levels. Contributed by Jason Lowe

(cherry picked from commit 92d67ace32)
(cherry picked from commit fefeba4ac8)
(cherry picked from commit d8f02e1c5c3bcc230d942554b2f4cfbc3ed21526)
2015-09-01 11:19:37 -07:00
Xuan 95edb6e64f YARN-3207. Secondary filter matches entites which do not have the key
being filtered for. Contributed by Zhijie Shen

(cherry picked from commit 57db50cbe3)
(cherry picked from commit ba18adbb27)
(cherry picked from commit 9fd18e94849600ec66832df5ae424eeb0116330c)
2015-08-31 17:44:42 -07:00
Zhijie Shen 28160a0bd6 YARN-2246. Made the proxy tracking URL always be http(s)://proxy addr:port/proxy/<appId> to avoid duplicate sections. Contributed by Devaraj K.
(cherry picked from commit d5855c0e46)
(cherry picked from commit fd75b8c9ca)
(cherry picked from commit a62891971380e5f8e4a645ed36bd88aa6fe0e47a)
2015-08-31 17:38:51 -07:00
Jian He a703952d39 YARN-3094. Reset timer for liveness monitors after RM recovery. Contributed by Jun Gong
(cherry picked from commit 0af6a99a3f)

(cherry picked from commit 6146680955)
(cherry picked from commit ab654746fbad2da12b24b13425dc9bf17c46b50c)
2015-08-31 17:17:47 -07:00
Jian He 8658945b3a YARN-3103. AMRMClientImpl does not update AMRM token properly. Contributed by Jason Lowe
(cherry picked from commit 6d2bdbd7da)

(cherry picked from commit 12522fd9cb)
(cherry picked from commit f50f5ad49d3b70448647384fc5f020214cb58f10)
2015-08-31 15:42:03 -07:00
Jian He 994c3d049a YARN-3011. Possible IllegalArgumentException in ResourceLocalizationService might lead NM to crash. Contributed by Varun Saxena
(cherry picked from commit 4e15fc0841)

(cherry picked from commit 8100c8a68c)
(cherry picked from commit 10a6c4f349e6f32ed2a520bf669a0cbfff31c824)
2015-08-31 15:38:45 -07:00
Jian He 3f8da2a9eb YARN-2997. Fixed NodeStatusUpdater to not send alreay-sent completed container statuses on heartbeat. Contributed by Chengbing Liu
(cherry picked from commit cc2a745f7e)

(cherry picked from commit e7e6173049)
(cherry picked from commit 3c4ed2497b14140f09b3cae4959be6474c4cdc99)
2015-08-30 20:45:45 -07:00
Tsuyoshi Ozawa 03f9ac2de7 YARN-2922. ConcurrentModificationException in CapacityScheduler's LeafQueue. Contributed by Rohith Sharmaks.
(cherry picked from commit ddc5be48fc)
(cherry picked from commit c116743bdd)
(cherry picked from commit 3c9d26ae14625de3e9437c07eceda0d05f1985b2)
2015-08-30 20:43:14 -07:00
Jian He e7fc071906 YARN-2992. ZKRMStateStore crashes due to session expiry. Contributed by Karthik Kambatla
(cherry picked from commit 1454efe5d4)

(cherry picked from commit ca0349b87a)
(cherry picked from commit 2f6be218fa41fd0f39633ec5ed0df6e0fa0f54b6)
2015-08-30 20:42:06 -07:00
Jian He 8cdcb792a1 YARN-2340. Fixed NPE when queue is stopped during RM restart. Contributed by Rohith Sharmaks
(cherry picked from commit 0d89859b51)

(cherry picked from commit 8b398a66ca)
(cherry picked from commit 37f1789f419fa741c2fbae9855549d405e856bff)
2015-08-30 20:40:09 -07:00