2344 Commits

Author SHA1 Message Date
Ming Ma db5b4c292b YARN-5797. Add metrics to the node manager for cleaning the PUBLIC and PRIVATE caches. (Chris Trezzo via mingma) 2017-04-06 17:08:59 -07:00
Jason Lowe 84b7f2e956 YARN-6288. Exceptions during aggregated log writes are mishandled. Contributed by Akira Ajisaka
(cherry picked from commit 1b081ca27e)
2017-04-06 16:25:41 -05:00
Daniel Templeton 59ad5cd68d YARN-6381. FSAppAttempt has several variables that should be final
(Contributed by Ameet Zaveri via Daniel Templeton)

(cherry picked from commit 984df31a31c25ffc8541467dc41913f62f69e1df)
2017-04-05 16:44:35 -07:00
Karthik Kambatla 7c0f9bd5e7 YARN-6448. Continuous scheduling thread crashes while sorting nodes. (Yufei Gu via kasha)
(cherry picked from commit b4c4f36594)
2017-04-05 16:00:30 -07:00
Varun Saxena 0a7ec1fcf7 YARN-6437. TestSignalContainer#testSignalRequestDeliveryToNM fails intermittently (Jason Lowe via Varun Saxena)
(cherry picked from commit 7d963c477a)
2017-04-06 01:58:32 +05:30
Jason Lowe 903278559a YARN-6403. Invalid local resource request can raise NPE and make NM exit. Contributed by Tao Yang
(cherry picked from commit e8071aa249)
2017-04-05 12:31:48 -05:00
Varun Saxena 05c008256a YARN-6420. RM startup failure due to wrong order in nodelabel editlog (Bibin A Chundatt via Varun Saxena)
(cherry picked from commit 19b89c4c7b)
2017-04-05 09:40:09 +05:30
Ming Ma 7507ccd38a YARN-6004. Refactor TestResourceLocalizationService#testDownloadingResourcesOnContainer so that it is less than 150 lines. (Chris Trezzo via mingma) 2017-04-04 18:05:09 -07:00
Karthik Kambatla 1938f97c0b YARN-6436. TestSchedulingPolicy#testParseSchedulingPolicy timeout is too low. (Eric Badger via kasha)
(cherry picked from commit 9cc04b4708)
2017-04-04 17:08:59 -07:00
Wangda Tan 70a810adc0 YARN-6109. Add an ability to convert ChildQueue to ParentQueue. (Xuan Gong via wangda)
(cherry picked from commit 3fdae0a2b6)
2017-04-04 14:39:38 -07:00
Eric Payne 4396c8e938 YARN-6354. LeveldbRMStateStore can parse invalid keys when recovering reservations. Contributed by Jason Lowe
(cherry picked from commit 318bfb01bc)
2017-03-31 12:36:01 -05:00
Jason Lowe 3ae529ba54 YARN-6411. Clean up the overwrite of createDispatcher() in subclass of MockRM. Contributed by Yufei Gu
(cherry picked from commit 4d1fac5df2)

Conflicts:
	hadoop-yarn-project/hadoop-yarn/hadoop-yarn-client/src/test/java/org/apache/hadoop/yarn/client/api/impl/TestAMRMClientOnRMRestart.java
	hadoop-yarn-project/hadoop-yarn/hadoop-yarn-server/hadoop-yarn-server-resourcemanager/src/test/java/org/apache/hadoop/yarn/server/resourcemanager/security/TestClientToAMTokens.java
2017-03-31 10:21:44 -05:00
Varun Saxena 3fe7d36e72 YARN-6352. Header injections are possible in application proxy servlet (Naganarasimha G R via Varun Saxena) 2017-03-30 03:49:32 +05:30
Daniel Templeton c2636468d7 YARN-5685. RM configuration allows all failover methods to disabled when automatic failover is enabled
(cherry picked from commit 640ba1d23f)
2017-03-29 12:38:50 -07:00
Akira Ajisaka 554bc8342b YARN-6329. Remove unnecessary TODO comment from AppLogAggregatorImpl.java. Contributed by victor bertschinger.
(cherry picked from commit 82fb9ce8df)
2017-03-29 10:37:52 +09:00
Robert Kanter a65011a128 YARN-6050. AMs can't be scheduled on racks or nodes (rkanter) 2017-03-28 14:33:08 -07:00
Varun Saxena 97c83f2498 YARN-5368. Memory leak in timeline server (Jonathan Eagles via Varun Saxena)
(cherry picked from commit 01aca54a22)
2017-03-29 01:56:59 +05:30
Jason Lowe bea8f8190f YARN-6359. TestRM#testApplicationKillAtAcceptedState fails rarely due to race condition. Contributed by Robert Kanter 2017-03-28 09:45:18 -05:00
Wangda Tan c97da481d7 YARN-6339. Improve performance for createAndGetApplicationReport. (Yunjiong Zhao via wangda)
(cherry picked from commit cd014d57aa)
2017-03-27 13:59:36 -07:00
Akira Ajisaka e4320305d3 YARN-6379. Remove unused argument in ClientRMService. Contributed by Kai Sasaki.
(cherry picked from commit b63ae854f9)
2017-03-27 11:26:23 +09:00
Sunil G f51ee482a7 YARN-5956. Refactor ClientRMService for unify error handling across apis. Contributed by Kai Sasaki. 2017-03-26 17:00:06 +05:30
Daniel Templeton d4bba011cc YARN-6284. hasAlreadyRun should be final in ResourceManager.StandByTransitionRunnable
(Contributed by Laura Adams via Daniel Templeton)

(cherry picked from commit 0a05c5c598)
2017-03-21 14:27:27 -07:00
Varun Saxena e2cabb4d21 YARN-6353. Clean up OrderingPolicy javadoc (Daniel Templeton via Varun Saxena)
(cherry picked from commit 35034653d0)
2017-03-20 23:51:15 +05:30
Jason Lowe 75e6b7428e YARN-6217. TestLocalCacheDirectoryManager test timeout is too aggressive. Contributed by Miklos Szegedi
(cherry picked from commit 4a8e304502)
2017-03-17 14:07:48 -05:00
Jason Lowe a16ba4296e YARN-4051. ContainerKillEvent lost when container is still recovering and application finishes. Contributed by sandflee 2017-03-16 09:31:20 -05:00
Sunil G caaa6fa96b YARN-6328. Fix a spelling mistake in CapacityScheduler. Contributed by Jin Yibo. This closes #202
(cherry picked from commit 2d0e24eb01)
2017-03-15 14:58:20 +05:30
Ray Chiang fa59f4e490 YARN-6042. Dump scheduler and queue state information into FairScheduler DEBUG log. (Yufei Gu via rchiang) 2017-03-14 14:45:13 -07:00
Junping Du f254002f1d YARN-6314. Potential infinite redirection on YARN log redirection web service. Contributed by Xuan Gong.
(cherry picked from commit 5a9dda796f0e73060ada794ad5752cc6a237ab2e)
2017-03-14 02:58:44 -07:00
Jason Lowe ef95e74eb1 YARN-6321. TestResources test timeouts are too aggressive. Contributed by Eric Badger
(cherry picked from commit 9649c27864)
2017-03-10 13:07:20 -06:00
Karthik Kambatla 4218671f3c YARN-6264. AM not launched when a single vcore is available on the cluster. (Yufei Gu via kasha)
(cherry picked from commit a96afae125)
2017-03-09 23:16:19 -08:00
Daniel Templeton c60cd88ad1 YARN-1047. Expose # of pre-emptions as a queue counter (Contributed by Karthik Kambatla via Daniel Templeton) 2017-03-09 18:18:03 -08:00
Daniel Templeton 6774f9c3c2 YARN-6300. NULL_UPDATE_REQUESTS is redundant in TestFairScheduler (Contributed by Yuanbo Liu via Daniel Templeton)
(cherry picked from commit 8bc8804be1b96f8a4fc33b5d24dd7f7af93a437e)
2017-03-09 17:53:24 -08:00
Sidharta S 3b6e5ef917 YARN-5669. Add support for docker pull command (Contribtued by luhuichun) 2017-03-09 16:23:21 -08:00
Jason Lowe 4a1187238a YARN-6165. Intra-queue preemption occurs even when preemption is turned off for a specific queue. Contributed by Eric Payne
(cherry picked from commit d7762a5511)
2017-03-08 16:47:31 -06:00
Robert Kanter 11d4cdc3a6 YARN-6297. TestAppLogAggregatorImp.verifyFilesUploaded() should check # of filed uploaded with that of files expected (haibochen via rkanter)
(cherry picked from commit 287ba4ffa6)
2017-03-08 10:45:59 -08:00
Sunil G ec4c2d42fe YARN-6207. Move application across queues should handle delayed event processing. Contributed by Bibin A Chundatt.
(cherry picked from commit 1eb8186703)
2017-03-08 12:13:14 +05:30
Daniel Templeton 83fbb051bb YARN-6287. RMCriticalThreadUncaughtExceptionHandler.rmContext should be final (Contributed by Corey Barker via Daniel Templeton)
(cherry picked from commit e0c239cdbd)
2017-03-07 12:01:38 -08:00
Junping Du a58dfcb421 YARN-6271. yarn rmadin -getGroups returns information from standby RM. Contributed by Jian He.
(cherry picked from commit 8db7a8c3ae)
2017-03-03 14:40:32 -08:00
Ray Chiang 303ee13e3c YARN-6218. Fix TestAMRMClient when using FairScheduler. (Miklos Szegedi via rchiang)
(cherry picked from commit 30b91ff9540e35195af834d1bd5983114a556c6d)
2017-03-03 12:57:17 -08:00
Jason Lowe 5eca427da2 YARN-6263. NMTokenSecretManagerInRM.createAndGetNMToken is not thread safe. Contributed by Haibo Chen
(cherry picked from commit e58fc76030)
2017-03-03 08:24:05 -06:00
Jian He c7ddf95d4a YARN-6153. KeepContainer does not work when AM retry window is set. Contributed by kyungwan nam
(cherry picked from commit 235203dffd)
2017-02-28 13:26:55 -08:00
Xuan 95bd3c3d55 YARN-6189: Improve application status log message when RM restarted when
app is in NEW state. Contributed by Junping Du

(cherry picked from commit e0bb867c3f)
2017-02-28 11:06:07 -08:00
Karthik Kambatla 58eefc71c7 YARN-6231. FairSchedulerTestBase helper methods should call scheduler.update to avoid flakiness. (kasha)
(cherry picked from commit f187d63816)
2017-02-27 22:16:32 -08:00
Naganarasimha 1eec911cd9 YARN-5703. ReservationAgents are not correctly configured. Contributed by Manikandan R.
(cherry picked from commit 5f5b031d1f)
2017-02-27 20:42:10 +05:30
Sunil G 68b08e96a0 YARN-6123. [YARN-5864] Add a test to make sure queues of orderingPolicy will be updated when childQueues is added or removed. Contributed by Wangda Tan. 2017-02-27 17:54:47 +05:30
Karthik Kambatla c24325bbcb YARN-6172. FSLeafQueue demand update needs to be atomic. (Miklos Szegedi via kasha)
(cherry picked from commit fbfe86deea)
2017-02-26 20:37:18 -08:00
Karthik Kambatla a95d3e1fe7 YARN-6215. FairScheduler preemption and update should not run concurrently. (Tao Jie via kasha)
(cherry picked from commit 815d53506f)
2017-02-26 20:31:03 -08:00
Xuan 19c5b3d622 YARN-6228: EntityGroupFSTimelineStore should allow configurable cache
stores. Contributed by Li Lu

(cherry picked from commit 53d372a255)
2017-02-24 11:00:16 -08:00
Wangda Tan 650ff95e00 YARN-5959. RM changes to support change of container ExecutionType. (Arun Suresh via wangda)
(cherry picked from commit 0a55bd841e)
2017-02-23 17:22:25 -08:00
Karthik Kambatla 40bc9e7ddb YARN-6222. TestFairScheduler.testReservationMetrics is flaky. (Yufei Gu via kasha)
(cherry picked from commit 694e680d20)
2017-02-23 15:30:29 -08:00
Wangda Tan a82480c7d4 YARN-6143. Fix incompatible issue caused by YARN-3583. (Sunil G via wangda) 2017-02-23 13:59:20 -08:00
Sunil G 801357726d YARN-6211. Synchronization improvement for moveApplicationAcrossQueues and updateApplicationPriority. Contributed by Bibin A Chundatt.
(cherry picked from commit a207aa9930)
2017-02-23 14:25:14 +05:30
Karthik Kambatla 37edbd35f1 YARN-6194. Cluster capacity in SchedulingPolicy is updated only on allocation file reload. (Yufei Gu via kasha)
(cherry picked from commit b10e962224)
2017-02-22 15:59:11 -08:00
Karthik Kambatla ca7a6a7365 YARN-6210. FairScheduler: Node reservations can interfere with preemption. (kasha)
(cherry picked from commit 718ad9f6ee)
2017-02-22 15:52:44 -08:00
Daniel Templeton 93219b6454 YARN-6125. The application attempt's diagnostic message should have a maximum size
(Contributed by Andras Piros via Daniel Templeton)

(cherry picked from commit c7a36e6130)
2017-02-17 14:38:06 -08:00
Karthik Kambatla 5575328bb6 YARN-6193. FairScheduler might not trigger preemption when using DRF. (kasha)
(cherry picked from commit dbbfcf74ab)
2017-02-17 14:07:47 -08:00
Daniel Templeton 10fd5627bb YARN-6188. Fix OOM issue with decommissioningNodesWatcher in the case of clusters with
large number of nodes (Contributed by Ajay Jadhav via Daniel Templeton)

(cherry picked from commit 585168cc1bdb87398d4f388f3eeccd1c71114982)
2017-02-17 13:14:37 -08:00
Karthik Kambatla afc8124ff6 YARN-6171. ConcurrentModificationException on FSAppAttempt.containersToPreempt. (Miklos Szegedi via kasha)
(cherry picked from commit a77f432449)
2017-02-16 15:03:35 -08:00
Rohith Sharma K S f07b52dc5a YARN-6200. Reverting since the same functionality achieved by YARN-1623.
Revert "YARN-5068. Expose scheduler queue to application master. (Harish Jaiprakash via rohithsharmaks)"

This reverts commit a37b3694ea.
2017-02-16 17:58:38 +05:30
Karthik Kambatla 9fecd5063f YARN-5798. Set UncaughtExceptionHandler for all FairScheduler threads. (Yufei Gu via kasha)
(cherry picked from commit 74dd142250)
2017-02-16 00:03:26 -08:00
Karthik Kambatla fefac1276a YARN-4212. FairScheduler: Can't create a DRF queue under a FAIR policy queue. (Yufei Gu via kasha)
(cherry picked from commit 11be3f70e0)
2017-02-15 23:51:51 -08:00
Karthik Kambatla 46b6c95e0a YARN-6163. FS Preemption is a trickle for severely starved applications. (kasha)
(cherry picked from commit 6c25dbcdc0)
2017-02-15 23:23:18 -08:00
Junping Du 2aa6f317f6 YARN-6174. Log files pattern should be same for both running and finished container. Contributed by Xuan Gong.
(cherry picked from commit ce2d5bfa5f)
2017-02-15 09:08:54 -08:00
Varun Saxena 9cab25a63d YARN-6156. AM blacklisting to consider node label partition (Bibin A Chundatt via Varun Saxena)
(cherry picked from commit b7613e0f40)
2017-02-15 14:58:39 +05:30
Karthik Kambatla 58acb676a2 YARN-6061. Addendum. Remove extraneous change.
(cherry picked from commit 353a9b2d91)
2017-02-14 15:20:11 -08:00
Karthik Kambatla 37921b3fef YARN-6061. Add an UncaughtExceptionHandler for critical threads in RM. (Yufei Gu via kasha)
(cherry picked from commit 652679aa8a)
2017-02-14 13:40:06 -08:00
Karthik Kambatla 30ff5bff1a YARN-3933. FairScheduler: Multiple calls to completedContainer are not safe. (Shiwei Guo and Miklos Szegedi via kasha)
(cherry picked from commit 646c6d6509)
2017-02-13 11:27:02 -08:00
Junping Du 205fc1f10c YARN-6113. Re-direct NM Web Service to get container logs for finished applications. Contributed by Xuan Gong.
(cherry picked from commit 464ff479ce)

Conflicts:

	hadoop-yarn-project/hadoop-yarn/hadoop-yarn-server/hadoop-yarn-server-nodemanager/src/test/java/org/apache/hadoop/yarn/server/nodemanager/webapp/TestNMWebServices.java
2017-02-13 06:16:04 -08:00
Karthik Kambatla e259a818ad YARN-6144. FairScheduler: preempted resources can become negative. (Miklos Szegedi via kasha)
(cherry picked from commit 08f93978f3)
2017-02-09 14:45:21 -08:00
Karthik Kambatla 41361ec39e YARN-6112. UpdateCallDuration is calculated only when debug logging is enabled. (Yufei Gu via kasha) 2017-02-09 13:12:21 -08:00
Junping Du ab41da7e64 YARN-6108. Improve AHS webservice to accept NM address as a parameter to get container logs. Contributed by Xuan Gong.
(cherry picked from commit 9dbfab1284)
2017-02-07 02:14:09 -08:00
Junping Du c699ce70ea YARN-6100. Improve YARN webservice to output aggregated container logs. Contributed by Xuan Gong. 2017-02-02 00:45:27 -08:00
Daniel Templeton be14bda5c0 YARN-4658. Typo in o.a.h.yarn.server.resourcemanager.scheduler.fair.TestFairScheduler comment (Contributed by Udai Potluri via Daniel Templeton)
(cherry picked from commit 3e06475307)
2017-01-31 16:26:41 -08:00
Daniel Templeton ccf33bc36c YARN-6103. Log updates for ZKRMStateStore (Contributed by Daniel Sturman via Daniel Templeton)
(cherry picked from commit 87852b6ef4)
2017-01-30 21:46:38 -08:00
Karthik Kambatla c9824c5ff6 YARN-6131. FairScheduler: Lower update interval for faster tests. (kasha)
(cherry picked from commit 11e44bdda7)
2017-01-30 13:39:35 -08:00
Jason Lowe 4703f5d20b YARN-5641. Localizer leaves behind tarballs after container is complete. Contributed by Eric Badger 2017-01-27 15:25:57 +00:00
Karthik Kambatla 1b1a25cca6 YARN-6000. Make AllocationFileLoaderService.Listener public. (Tao Jie via kasha)
(cherry picked from commit 4d3f73acc0)
2017-01-26 14:11:50 -08:00
Daniel Templeton c4eeac02de YARN-4975. Fair Scheduler: exception thrown when a parent queue marked 'parent' has configured child queues
(Contributed by Yufei Gu via Daniel Templeton)

(cherry picked from commit f85b74ccf9)
2017-01-26 14:08:46 -08:00
Daniel Templeton 443f2803ac YARN-4997. Update fair scheduler to use pluggable auth provider (Contributed by Tao Jie via Daniel Templeton)
(cherry picked from commit b3befc021b)
2017-01-26 14:07:46 -08:00
Jason Lowe b806201f0a Revert "YARN-5641. Localizer leaves behind tarballs after container is complete. Contributed by Eric Badger"
This reverts commit 45407acaed.
2017-01-26 14:42:22 +00:00
Jason Lowe 45407acaed YARN-5641. Localizer leaves behind tarballs after container is complete. Contributed by Eric Badger
(cherry picked from commit 9e19f758c1)
2017-01-25 21:54:17 +00:00
Karthik Kambatla 73497f08fc YARN-5830. FairScheduler: Avoid preempting AM containers. (Yufei Gu via kasha)
(cherry picked from commit abedb8a9d8)
2017-01-25 12:19:27 -08:00
Karthik Kambatla 71026cc3f6 YARN-5831. FairScheduler: Propagate allowPreemptionFrom flag all the way down to the app. (Yufei Gu via kasha)
(cherry picked from commit e224c96234)
2017-01-25 12:19:20 -08:00
Karthik Kambatla 0ceb7149d9 YARN-4752. Improved preemption in FairScheduler. (kasha) 2017-01-25 10:45:02 -08:00
Junping Du 8027c3e8b9 YARN-6099. Improve webservice to list aggregated log files. Contributed by Xuan Gong. 2017-01-24 15:28:52 -08:00
Wangda Tan 1309accd68 YARN-5864. Capacity Scheduler - Queue Priorities. (wangda) 2017-01-24 14:44:42 -08:00
Jason Lowe 1672a06135 YARN-5547. NMLeveldbStateStore should be more tolerant of unknown keys. Contributed by Ajith S 2017-01-24 16:19:02 +00:00
Rohith Sharma K S 26c4cfb653 YARN-6082. Invalid REST api response for getApps since queueUsagePercentage is coming as INF. Contributed by Sunil G.
(cherry picked from commit 0101267d9d)
2017-01-24 11:01:52 +05:30
Jason Lowe 94b326f2c8 YARN-5910. Support for multi-cluster delegation tokens. Contributed by Jian He
(cherry picked from commit 69fa81679f)

Conflicts:

	hadoop-yarn-project/hadoop-yarn/hadoop-yarn-server/hadoop-yarn-server-resourcemanager/src/main/java/org/apache/hadoop/yarn/server/resourcemanager/RMAppManager.java
2017-01-23 17:23:32 +00:00
Sunil G f8e49bb343 YARN-6031. Application recovery has failed when node label feature is turned off during RM recovery. Contributed by Ying Zhang.
(cherry picked from commit 3fa0d540df)
2017-01-23 12:24:39 +05:30
Chris Nauroth 5251de00fa YARN-5456. container-executor support for FreeBSD, NetBSD, and others if conf path is absolute. Contributed by Allen Wittenauer.
(cherry picked from commit b913677365)
2017-01-19 20:48:27 +05:30
Chris Nauroth 198bd84b33 YARN-5121. fix some container-executor portability issues. Contributed by Allen Wittenauer.
(cherry picked from commit ef501b1a0b)
2017-01-19 20:48:27 +05:30
Wangda Tan 8d6fa4569b YARN-5556. CapacityScheduler: Support deleting queues without requiring a RM restart. (Naganarasimha G R via wangda)
(cherry picked from commit 72054a817d)
2017-01-18 13:35:30 -08:00
Subru Krishnan fb639aed90 YARN-6016. Fix minor bugs in handling of local AMRMToken in AMRMProxy. (Botong Huang via Subru).
(cherry picked from commit 4d1f3d9020)
2017-01-17 14:51:27 -08:00
Junping Du 861e275646 YARN-6011. Add a new web service to list the files on a container in AHSWebService. Contributed by Xuan Gong.
(cherry picked from commit cf695577aa)
2017-01-16 16:41:16 -08:00
Sunil G f166bb8f09 YARN-6081. LeafQueue#getTotalPendingResourcesConsideringUserLimit should deduct reserved from pending to avoid unnecessary preemption of reserved container. Contributed by Wangda Tan.
(cherry picked from commit d3170f9eba)
2017-01-13 18:30:31 +05:30
Naganarasimha 4bd05840fc YARN-6072. RM unable to start in secure mode. Contributed by Ajith S.
(cherry picked from commit a6b06f7179)
2017-01-12 06:42:18 +05:30
Daniel Templeton 718ae2d477 YARN-5849. Automatically create YARN control group for pre-mounted cgroups (Contributed by Miklos Szegedi via Daniel Templeton)
(cherry picked from commit 0fffebe51a91023c4949ab560f5c828f1b568072)
2017-01-11 15:36:18 -08:00
Jason Lowe cc09b1a49a YARN-5416. TestRMRestart#testRMRestartWaitForPreviousAMToFinish failed intermittently due to not wait SchedulerApplicationAttempt to be stopped. Contributed by Junping Du
(cherry picked from commit 357eab9566)
2017-01-11 17:15:14 +00:00
Junping Du ba6a013341 YARN-6079. Fix simple spelling errors in yarn test code. Contributed by vijay.
(cherry picked from commit 4db119b7b5)
2017-01-10 18:22:45 -08:00