1345 Commits

Author SHA1 Message Date
Varun Saxena
354afbb9df YARN-5521. Fix random failure of TestCapacityScheduler#testKillAllAppsInQueue (sandflee via Varun Saxena) 2016-08-16 00:04:40 +05:30
Varun Saxena
3c7cf5cd77 YARN-5491. Fix random failure of TestCapacityScheduler#testCSQueueBlocked (Bibin A Chundatt via Varun Saxena) 2016-08-15 03:56:31 +05:30
Varun Saxena
918a0f12fa YARN-5476. Non existent application reported as ACCEPTED by YarnClientImpl (Junping Du via Varun Saxena) 2016-08-13 00:19:00 +05:30
Naganarasimha
b63d3dcac9 YARN-4833. For Queue AccessControlException client retries multiple times on both RM. Contributed by Bibin A Chundatt 2016-08-12 20:16:15 +05:30
Rohith Sharma K S
0937715f7f YARN-5492. TestSubmitApplicationWithRMHA is failing sporadically during precommit builds. Contributed by Vrushali C.
(cherry picked from commit 5199db387d59f7233a0e52ac298df31e8ed8af20)
2016-08-11 11:51:56 +05:30
Jason Lowe
2b31f6fb2c YARN-5382. RM does not audit log kill request for active applications. Contributed by Vrushali C
(cherry picked from commit 5c95bb315ba605b3bed77966a99a63854234e2c5)
2016-08-10 18:27:25 +00:00
Jason Lowe
3d401206cf YARN-5483. Optimize RMAppAttempt#pullJustFinishedContainers. Contributed by sandflee 2016-08-10 18:06:56 +00:00
Karthik Kambatla
447ba47640 YARN-5343. TestContinuousScheduling#testSortedNodes fails intermittently. (Yufei Gu via kasha)
(cherry picked from commit 7992c0b42ceb10fd3ca6c4ced4f59b8e8998e046)
2016-08-09 16:51:27 -07:00
Arun Suresh
5f7edb79d1 YARN-5457. Refactor DistributedScheduling framework to pull out common functionality. (asuresh)
(cherry picked from commit 82c9e061017c32e633e0b0cbb7978749a6df4fb2)
2016-08-09 00:46:08 -07:00
Naganarasimha
991823482f YARN-4624. NPE in PartitionQueueCapacitiesInfo while accessing Scheduler UI. Contributed by Brahma Reddy Battula
(cherry picked from commit d81b8163b4e5c0466a6af6e1068f512c5fd24a61)
2016-08-06 07:04:30 +05:30
Wangda Tan
a73e118437 YARN-4888. Changes in scheduler to identify resource-requests explicitly by allocation-id. (Subru Krishnan via wangda)
(cherry picked from commit 3f100d76ff5df020dbb8ecd4f5b4f9736a0a8270)
2016-08-05 10:44:07 -07:00
Rohith Sharma K S
69da77c357 YARN-5333. Some recovered apps are put into default queue when RM HA. Contributed by Jun Gong.
(cherry picked from commit d9a354c2f39274b2810144d1ae133201e44e3bfc)
2016-08-05 21:37:05 +05:30
Jason Lowe
9a0ac56a5c YARN-4280. CapacityScheduler reservations may not prevent indefinite postponement on a busy cluster. Contributed by Kuhu Shukla
(cherry picked from commit 4d92aefd35d4517d9435d81bafdec0d77905a7a0)
2016-08-03 19:00:06 +00:00
Arun Suresh
f197378f81 YARN-5113. Refactoring and other clean-up for distributed scheduling. (Konstantinos Karanasos via asuresh)
(cherry picked from commit e5766b1dbee02ae0ef89618e172f3fb227af19e8)
2016-07-31 11:49:09 -07:00
Subru Krishnan
2f4df2b851 YARN-5203. Return ResourceRequest JAXB object in ResourceManager Cluster Applications REST API. Contributed by Ellen Hui.
(cherry picked from commit 4e756d72719ec3c6d64a1e3daccbc0b8e8de998c)
2016-07-28 16:04:30 -07:00
Wangda Tan
77e0b6d1bf YARN-5195. RM intermittently crashed with NPE while handling APP_ATTEMPT_REMOVED event when async-scheduling enabled in CapacityScheduler. (sandflee via wangda)
(cherry picked from commit d62e121ffc0239e7feccc1e23ece92c5fac685f6)
2016-07-26 21:36:37 -07:00
Wangda Tan
55fa19d576 YARN-5342. Improve non-exclusive node partition resource allocation in Capacity Scheduler. (Sunil G via wangda)
(cherry picked from commit 49969b16cdba0f251b9f8bf3d8df9906e38b5c61)
2016-07-26 21:16:12 -07:00
Arun Suresh
df509a8066 YARN-5392. Replace use of Priority in the Scheduling infrastructure with an opaque SchedulerRequestKey. (asuresh and subru)
(cherry picked from commit 5aace38b748ba71aaadd2c4d64eba8dc1f816828)
2016-07-26 14:58:54 -07:00
Rohith Sharma K S
eff65463de YARN-5092. TestRMDelegationTokens fails intermittently. Contributed by Jason Lowe.
(cherry picked from commit 557a245d83c358211f51b3ab1950d707a70679d9)
2016-07-21 12:49:06 +05:30
Akira Ajisaka
2bf5c4eb1a YARN-4883. Make consistent operation name in AdminService. Contributed by Kai Sasaki.
(cherry picked from commit c63afdbe14df7359841aa4b213252a8a275e8c4b)
2016-07-20 16:51:44 -07:00
Varun Saxena
2153c9f04c YARN-4996. Make TestNMReconnect.testCompareRMNodeAfterReconnect() scheduler agnostic (Kai Sasaki via Varun Saxena) 2016-07-19 16:07:56 +05:30
Ray Chiang
66a501be5e YARN-5272. Handle queue names consistently in FairScheduler. (Wilfred Spiegelenburg via rchiang)
(cherry picked from commit 2df1b1e72b18422606e38a9fbe311988637cb9a5)
2016-07-15 14:39:28 -07:00
Arun Suresh
9d31c0b53f YARN-5181. ClusterNodeTracker: add method to get list of nodes matching a specific resourceName. (kasha via asuresh)
(cherry picked from commit e905a42a2c6c88210b5c2120d7daf0bb3739cd1d)
2016-07-15 14:38:26 -07:00
Wangda Tan
e34e1aa4fe YARN-4484. Available Resource calculation for a queue is not correct when used with labels. (Sunil G via wangda)
(cherry picked from commit 24db9167f16ba643a186624b33a6b9b80020f476)
2016-07-15 11:41:35 -07:00
Rohith Sharma K S
694b4abbc0 YARN-5362. TestRMRestart#testFinishedAppRemovalAfterRMRestart can fail. Contributed by sandflee.
(cherry picked from commit d6d41e820ac7b3ba73f5e4ea1ee72715dc1ffc9f)
2016-07-13 19:15:41 +05:30
Varun Saxena
dc3e077301 YARN-5353. ResourceManager can leak delegation tokens when they are shared across apps. (Jason Lowe via Varun Saxena). 2016-07-13 07:46:00 +05:30
Jason Lowe
1f96baf4ad YARN-5317. testAMRestartNotLostContainerCompleteMsg may fail. Contributed by sandflee
(cherry picked from commit 10b704c5946afe7bfd4a6be40192ce7ca745d817)
2016-07-12 20:38:26 +00:00
Jian He
e94e6be772 YARN-5270. Solve miscellaneous issues caused by YARN-4844. Contributed by Wangda Tan 2016-07-11 22:37:52 -07:00
Varun Saxena
5ef27b65b9 YARN-5037. Fix random failure of TestRMRestart#testQueueMetricsOnRMRestart (sandflee via Varun Saxena). 2016-07-10 22:13:53 +05:30
Varun Saxena
f515678b67 YARN-5318. Fix intermittent test failure of TestRMAdminService#testRefreshNodesResourceWithFileSystemBasedConfigurationProvider. (Jun Gong via Varun Saxena). 2016-07-09 01:43:09 +05:30
Varun Saxena
550abb53e5 YARN-5297. Avoid printing a stack trace when recovering an app after the RM restarts. (Junping Du via Varun Saxena). 2016-07-09 00:14:51 +05:30
Junping Du
5c0386d535 YARN-4939. The decommissioning Node should keep alive during NM restart. Contributed by sandflee.
(cherry picked from commit 30ee57ceb1e80c30ea3adfe7736d4d4c7d5c8386)
2016-07-08 04:15:47 -07:00
Wangda Tan
e246cf74c9 YARN-5294. Pass remote ip address down to YarnAuthorizationProvider. (Jian He via wangda) 2016-07-06 16:42:35 -07:00
Varun Saxena
516aa58c1b YARN-5286. Add RPC port info in RM web service's response when getting app status. (Jun Gong via Varun Saxena). 2016-07-05 23:10:08 +05:30
Jian He
d2559ca307 YARN-5023. TestAMRestart#testShouldNotCountFailureToMaxAttemptRetry fails. Contributed by sandflee
(cherry picked from commit c35a5a7a8d85b42498e6981a6b1f09f2bdd56459)
2016-07-01 14:29:28 -07:00
Varun Saxena
527cd9852a YARN-5182. MockNodes.newNodes creates one more node per rack than requested. (Karthik Kambatla via Varun Saxena). 2016-06-30 00:06:17 +05:30
Rohith Sharma K S
d3fc1bddea YARN-5262. Optimize sending RMNodeFinishedContainersPulledByAMEvent for every AM heartbeat.
(cherry picked from commit 26b5e6116f392b6be91dca57968259f87554ce33)
2016-06-29 10:13:17 +05:30
Arun Suresh
e64d21b297 YARN-5171. Extend DistributedSchedulerProtocol to notify RM of containers allocated by the Node. (Inigo Goiri via asuresh)
(cherry picked from commit 99e5dd68d0f44109c169d74824fa45a7396a5990)
2016-06-22 19:05:28 -07:00
Junping Du
ea4cab2330 YARN-5246. NMWebAppFilter web redirects drop query parameters. Contributed by Varun Vasudev.
(cherry picked from commit d0162f2040a4d98fbac51527dfc9447888cb32ab)
2016-06-19 17:45:57 -07:00
Karthik Kambatla
25be8105cd YARN-5077. Fix FSLeafQueue#getFairShare() for queues with zero fairshare. (Yufei Gu via kasha)
(cherry picked from commit 20f2799938c5cad59f59617cdb0f86e137c3c2c0)
2016-06-17 22:25:04 -07:00
Karthik Kambatla
e9c1155233 YARN-5082. Limit ContainerId increase in fair scheduler if the num of node app reserved reached the limit. Addendum to fix javac warning. (Arun Suresh via kasha)
(cherry picked from commit fbbe0bb627c1b5f333eb5d5390e709cc43517420)
2016-06-17 22:13:18 -07:00
Wangda Tan
863bfa4d6a YARN-1942. Deprecate toString/fromString methods from ConverterUtils and move them to records classes like ContainerId/ApplicationId, etc. (wangda) 2016-06-14 15:12:00 -07:00
Rohith Sharma K S
c540b9bd7b YARN-4989. TestWorkPreservingRMRestart#testCapacitySchedulerRecovery fails intermittently. Contributed by Ajith S.
(cherry picked from commit 28b66ae919e348123f4c05a4787c9ec56c087c25)
2016-06-13 11:23:10 +05:30
Arun Suresh
5985221b46 YARN-5082. Limit ContainerId increase in fair scheduler if the num of node app reserved reached the limit (sandflee via asuresh)
(cherry picked from commit 5279af7cd4afb090da742a96b5786d9dee6224bc)
2016-06-10 22:34:33 -07:00
Rohith Sharma K S
efdfa68911 YARN-5197. RM leaks containers if running container disappears from node update. Contributed by Jason Lowe.
(cherry picked from commit e0f4620cc7db3db4b781e6042ab7dd754af28f18)
2016-06-11 10:31:00 +05:30
Wangda Tan
a4094ddb5a YARN-5208. Run TestAMRMClient TestNMClient TestYarnClient TestClientRMTokens TestAMAuthorization tests with hadoop.security.token.service.use_ip enabled. (Rohith Sharma K S via wangda)
(cherry picked from commit 244506f9c8953029283aa8e0eb2006ae2f30025e)
2016-06-10 09:37:35 -07:00
Wangda Tan
aef6e455b1 YARN-4837. User facing aspects of 'AM blacklisting' feature need fixing. (vinodkv via wangda) 2016-06-09 10:08:31 -07:00
Arun Suresh
d3655f4123 YARN-4525. Fix bug in RLESparseResourceAllocation.getRangeOverlapping(). (Ishai Menache and Carlo Curino via asuresh)
(cherry picked from commit 3a154f75ed85d864b3ffd35818992418f2b6aa59)
2016-06-06 21:18:53 -07:00
Arun Suresh
f0a869b52a YARN-5185. StageAllocatorGreedyRLE: Fix NPE in corner case. (Carlo Curino via asuresh)
(cherry picked from commit 7a9b7372a1a917c7b5e1beca7e13c0419e3dbfef)
2016-06-06 21:07:24 -07:00
Ming Ma
74e2b5efa2 MAPREDUCE-5044. Have AM trigger jstack on task attempts that timeout before killing them. (Eric Payne and Gera Shegalov via mingma)
(cherry picked from commit 4a1cedc010d3fa1d8ef3f2773ca12acadfee5ba5)
2016-06-06 14:34:47 -07:00