Commit Graph

3270 Commits

Author SHA1 Message Date
Haibo Chen 894521673b YARN-6510. Fix profs stat file warning caused by process names that includes parenthesis. (Wilfred Spiegelenburg via Haibo Chen)
(cherry picked from commit 4f3ca0396a)
2017-04-26 11:47:23 -07:00
Wangda Tan 503dd650bf YARN-6392. Add submit time to Application Summary log. (Zhihai Xu via wangda)
(cherry picked from commit 2ba21d6376)
2017-04-24 13:18:47 -07:00
Haibo Chen 49a2f286f7 YARN-6500. Do not mount inaccessible cgroups directories in CgroupsLCEResourcesHandler. (Miklos Szegedi via Haibo Chen)
(cherry picked from commit 8ac50e1322)
2017-04-24 11:39:55 -07:00
Eric Payne 421e51cb9c YARN-5617. AMs only intended to run one attempt can be run more than once? Contributed by Jason Lowe.
(cherry picked from commit 52adf71914)
2017-04-24 12:08:35 -05:00
Sunil G a9495b206b YARN-6164. Expose Queue Configurations per Node Label through YARN client api. Contributed by Benson Qiu. 2017-04-24 12:16:18 +05:30
Daniel Templeton d77137a03f YARN-6453. fairscheduler-statedump.log gets generated regardless of service
(Contributed by Yufei Gu via Daniel Templeton)

(cherry picked from commit e74ca7216aecc998d277e5ca1f4d0abae43c7e31)
2017-04-20 14:14:38 -07:00
Jason Lowe f5a9287bb1 YARN-6501. FSSchedulerNode.java fails to compile with JDK7. Contributed by John Zhuge 2017-04-20 09:52:44 -05:00
Junping Du 9289f4527d YARN-6304. Skip rm.transitionToActive call to RM if RM is already active. Contributed by Rohith Sharma K S.
(cherry picked from commit 6ed9d36224)
2017-04-17 15:52:02 -07:00
Karthik Kambatla 6f5912c106 YARN-6432. FairScheduler: Reserve preempted resources for corresponding applications. (Miklos Szegedi via kasha) 2017-04-17 14:34:28 -07:00
Wangda Tan 6ade9e6ce6 YARN-6406. Remove SchedulerRequestKeys when no more pending ResourceRequest. (Arun Suresh via wangda)
(cherry picked from commit 87e2ef8c98)
2017-04-15 21:34:29 -07:00
Jason Lowe 1699b20526 YARN-6480. Timeout is too aggressive for TestAMRestart.testPreemptedAMRestartOnRMRestart. Contributed by Eric Badger
(cherry picked from commit 4168805502)
2017-04-14 17:17:46 -05:00
Karthik Kambatla 53da65ef8b YARN-6433. Only accessible cgroup mount directories should be selected for a controller. (Miklos Szegedi via kasha)
(cherry picked from commit 8a1d7480f7)
2017-04-14 15:07:54 -07:00
Eric Payne 157a9f3104 YARN-5994. TestCapacityScheduler.testAMLimitUsage fails intermittently. Contributed by Eric Badger
(cherry-picked from commit a41f8dd58e)
2017-04-14 11:04:18 -05:00
Wangda Tan 5756256280 YARN-6216. Unify Container Resizing code paths with Container Updates making it scheduler agnostic. (Arun Suresh via wangda)
(cherry picked from commit eac6b4c35c)
2017-04-13 13:16:31 -07:00
Wangda Tan 408d23477f YARN-6040. Introduce api independent PendingAsk to replace usage of ResourceRequest within Scheduler classes. (wangda) 2017-04-13 11:11:57 -07:00
Junping Du d77cada182 YARN-3760. FSDataOutputStream leak in AggregatedLogFormat.LogWriter.close(). Contributed by Haibo Chen.
(cherry picked from commit c26ccf1adb3a72df3f68e1150b86b813c691203a)
(cherry picked from commit 9d9087a67a)
2017-04-12 13:54:58 -07:00
Eric Payne 607295d21f YARN-6450. TestContainerManagerWithLCE requires override for each new test added to ContainerManagerTest? Contributed by Jason Lowe.
(cherry picked from commit a16ab2be91)
2017-04-12 10:26:31 -05:00
Wangda Tan 2f08c86b7a YARN-6439. Fix ReservationSystem creation of default ReservationQueue. (Carlo Curino via wangda)
(cherry picked from commit 4d4ad0ebb7)
2017-04-11 14:59:34 -07:00
Naganarasimha b04c09163b YARN-6463. Correct spelling mistake in FileSystemRMStateStore. Contributed by Yeliang Cang.
(cherry picked from commit 7d873c465b)
2017-04-11 22:46:09 +05:30
Jason Lowe b29fb55798 YARN-6461. TestRMAdminCLI has very low test timeouts. Contributed by Eric Badger
(cherry picked from commit 90d97372ed)
2017-04-11 09:17:42 -05:00
Jason Lowe 0cb22a0f15 YARN-6195. Export UsedCapacity and AbsoluteUsedCapacity to JMX. Contributed by Benson Qiu
(cherry picked from commit 0e065f2ede)
2017-04-11 08:45:16 -05:00
Konstantinos Karanasos 9b3ba25980 YARN-6344. Add parameter for rack locality delay in CapacityScheduler. (kkaranasos)
(cherry picked from commit 3587b46164514e6751d7cebb09c8ab5e988853b4)
2017-04-10 17:22:29 -07:00
Daniel Templeton c1ae753de1 YARN-6372. Add default value for NM disk validator (Contributed by Yufei Gu via Daniel Templeton)
(cherry picked from commit e486569197e10bf48731d9e7ef5c031987c26acc)
2017-04-10 14:57:19 -07:00
Daniel Templeton 2dae9299ef YARN-6343. Docker docs MR example is broken (Contributed by Prashant Jha via Daniel Templeton)
(cherry picked from commit 14a39907c6)
2017-04-10 10:03:06 -07:00
Karthik Kambatla e0b143e51a Revert "YARN-6298. Metric preemptCall is not used in new preemption. (Yufei Gu via kasha)"
This reverts commit dcfce5fcdd.
2017-04-07 17:37:31 -07:00
Karthik Kambatla dcfce5fcdd YARN-6298. Metric preemptCall is not used in new preemption. (Yufei Gu via kasha)
(cherry picked from commit 2aa8967809)
2017-04-07 17:36:39 -07:00
Robert Kanter cbb1c76195 YARN-6368. Decommissioning an NM results in a -1 exit code (miklos.szegedi@cloudera.com via rkanter)
(cherry picked from commit 63f7322522)
2017-04-07 14:28:39 -07:00
Akira Ajisaka a467e5412e
YARN-6288. Addendum patch to fix compile error. 2017-04-07 12:19:46 +09:00
Ming Ma db5b4c292b YARN-5797. Add metrics to the node manager for cleaning the PUBLIC and PRIVATE caches. (Chris Trezzo via mingma) 2017-04-06 17:08:59 -07:00
Jason Lowe 84b7f2e956 YARN-6288. Exceptions during aggregated log writes are mishandled. Contributed by Akira Ajisaka
(cherry picked from commit 1b081ca27e)
2017-04-06 16:25:41 -05:00
Daniel Templeton 59ad5cd68d YARN-6381. FSAppAttempt has several variables that should be final
(Contributed by Ameet Zaveri via Daniel Templeton)

(cherry picked from commit 984df31a31c25ffc8541467dc41913f62f69e1df)
2017-04-05 16:44:35 -07:00
Karthik Kambatla 7c0f9bd5e7 YARN-6448. Continuous scheduling thread crashes while sorting nodes. (Yufei Gu via kasha)
(cherry picked from commit b4c4f36594)
2017-04-05 16:00:30 -07:00
Varun Saxena 0a7ec1fcf7 YARN-6437. TestSignalContainer#testSignalRequestDeliveryToNM fails intermittently (Jason Lowe via Varun Saxena)
(cherry picked from commit 7d963c477a)
2017-04-06 01:58:32 +05:30
Jason Lowe 903278559a YARN-6403. Invalid local resource request can raise NPE and make NM exit. Contributed by Tao Yang
(cherry picked from commit e8071aa249)
2017-04-05 12:31:48 -05:00
Varun Saxena 05c008256a YARN-6420. RM startup failure due to wrong order in nodelabel editlog (Bibin A Chundatt via Varun Saxena)
(cherry picked from commit 19b89c4c7b)
2017-04-05 09:40:09 +05:30
Ming Ma 7507ccd38a YARN-6004. Refactor TestResourceLocalizationService#testDownloadingResourcesOnContainer so that it is less than 150 lines. (Chris Trezzo via mingma) 2017-04-04 18:05:09 -07:00
Karthik Kambatla 1938f97c0b YARN-6436. TestSchedulingPolicy#testParseSchedulingPolicy timeout is too low. (Eric Badger via kasha)
(cherry picked from commit 9cc04b4708)
2017-04-04 17:08:59 -07:00
Wangda Tan 70a810adc0 YARN-6109. Add an ability to convert ChildQueue to ParentQueue. (Xuan Gong via wangda)
(cherry picked from commit 3fdae0a2b6)
2017-04-04 14:39:38 -07:00
Eric Payne 4396c8e938 YARN-6354. LeveldbRMStateStore can parse invalid keys when recovering reservations. Contributed by Jason Lowe
(cherry picked from commit 318bfb01bc)
2017-03-31 12:36:01 -05:00
Jason Lowe 3ae529ba54 YARN-6411. Clean up the overwrite of createDispatcher() in subclass of MockRM. Contributed by Yufei Gu
(cherry picked from commit 4d1fac5df2)

Conflicts:
	hadoop-yarn-project/hadoop-yarn/hadoop-yarn-client/src/test/java/org/apache/hadoop/yarn/client/api/impl/TestAMRMClientOnRMRestart.java
	hadoop-yarn-project/hadoop-yarn/hadoop-yarn-server/hadoop-yarn-server-resourcemanager/src/test/java/org/apache/hadoop/yarn/server/resourcemanager/security/TestClientToAMTokens.java
2017-03-31 10:21:44 -05:00
Varun Saxena 3fe7d36e72 YARN-6352. Header injections are possible in application proxy servlet (Naganarasimha G R via Varun Saxena) 2017-03-30 03:49:32 +05:30
Daniel Templeton c2636468d7 YARN-5685. RM configuration allows all failover methods to disabled when automatic failover is enabled
(cherry picked from commit 640ba1d23f)
2017-03-29 12:38:50 -07:00
Akira Ajisaka 554bc8342b
YARN-6329. Remove unnecessary TODO comment from AppLogAggregatorImpl.java. Contributed by victor bertschinger.
(cherry picked from commit 82fb9ce8df)
2017-03-29 10:37:52 +09:00
Robert Kanter a65011a128 YARN-6050. AMs can't be scheduled on racks or nodes (rkanter) 2017-03-28 14:33:08 -07:00
Varun Saxena 97c83f2498 YARN-5368. Memory leak in timeline server (Jonathan Eagles via Varun Saxena)
(cherry picked from commit 01aca54a22)
2017-03-29 01:56:59 +05:30
Jason Lowe bea8f8190f YARN-6359. TestRM#testApplicationKillAtAcceptedState fails rarely due to race condition. Contributed by Robert Kanter 2017-03-28 09:45:18 -05:00
Wangda Tan c97da481d7 YARN-6339. Improve performance for createAndGetApplicationReport. (Yunjiong Zhao via wangda)
(cherry picked from commit cd014d57aa)
2017-03-27 13:59:36 -07:00
Akira Ajisaka e4320305d3 YARN-6379. Remove unused argument in ClientRMService. Contributed by Kai Sasaki.
(cherry picked from commit b63ae854f9)
2017-03-27 11:26:23 +09:00
Sunil G f51ee482a7 YARN-5956. Refactor ClientRMService for unify error handling across apis. Contributed by Kai Sasaki. 2017-03-26 17:00:06 +05:30
Junping Du 3ad4d37363 Add COMMON/MAPREDUCE/YARN jdiff of 2.8.0 release.
(cherry picked from commit 6df029db36)
2017-03-25 14:33:20 -07:00
Daniel Templeton 6209e4c913 YARN-6334. TestRMFailover#testAutomaticFailover always passes even when it should fail
(Contributed by Yufei Gu via Daniel Templeton)
2017-03-24 14:02:39 -07:00
Daniel Templeton d4bba011cc YARN-6284. hasAlreadyRun should be final in ResourceManager.StandByTransitionRunnable
(Contributed by Laura Adams via Daniel Templeton)

(cherry picked from commit 0a05c5c598)
2017-03-21 14:27:27 -07:00
Junping Du dabe96a72e YARN-6367. YARN logs CLI needs alway check containerLogsInfo/containerLogInfo before parse the JSON object from NMWebService. Contributed by Xuan Gong.
(cherry picked from commit c01d15ab27)

Conflicts:

	hadoop-yarn-project/hadoop-yarn/hadoop-yarn-client/src/main/java/org/apache/hadoop/yarn/client/cli/LogsCLI.java
2017-03-21 10:55:19 -07:00
Daniel Templeton f7177d91d6 YARN-6309. Fair scheduler docs should have the queue and queuePlacementPolicy
elements listed in bold so that they're easier to see
(Contributed by Esmaeil Mirzaee via Daniel Templeton)

(cherry picked from commit 948afe2c92977d6924a62c1a5a1ffca315cd3e68)
2017-03-20 14:32:01 -07:00
Varun Saxena e2cabb4d21 YARN-6353. Clean up OrderingPolicy javadoc (Daniel Templeton via Varun Saxena)
(cherry picked from commit 35034653d0)
2017-03-20 23:51:15 +05:30
Jason Lowe 75e6b7428e YARN-6217. TestLocalCacheDirectoryManager test timeout is too aggressive. Contributed by Miklos Szegedi
(cherry picked from commit 4a8e304502)
2017-03-17 14:07:48 -05:00
Jason Lowe a16ba4296e YARN-4051. ContainerKillEvent lost when container is still recovering and application finishes. Contributed by sandflee 2017-03-16 09:31:20 -05:00
Jian He 9f9ccb2784 YARN-6332. Make RegistrySecurity use short user names for ZK ACLs. Contributed by Billie Rinaldi 2017-03-16 13:00:44 +08:00
Junping Du 8b31ff6dba YARN-6294. ATS client should better handle Socket closed case. Contributed by Li Lu. 2017-03-15 11:49:51 -07:00
Sunil G caaa6fa96b YARN-6328. Fix a spelling mistake in CapacityScheduler. Contributed by Jin Yibo. This closes #202
(cherry picked from commit 2d0e24eb01)
2017-03-15 14:58:20 +05:30
Ray Chiang fa59f4e490 YARN-6042. Dump scheduler and queue state information into FairScheduler DEBUG log. (Yufei Gu via rchiang) 2017-03-14 14:45:13 -07:00
Junping Du 0e7879052a YARN-6313. YARN logs cli should provide logs for a completed container even when application is still running. Contributed by Xuan Gong.
(cherry picked from commit b88f5e0f7858d1d89b79dfd325b767c34416052d)
2017-03-14 12:58:41 -07:00
Junping Du f254002f1d YARN-6314. Potential infinite redirection on YARN log redirection web service. Contributed by Xuan Gong.
(cherry picked from commit 5a9dda796f0e73060ada794ad5752cc6a237ab2e)
2017-03-14 02:58:44 -07:00
Jason Lowe ef95e74eb1 YARN-6321. TestResources test timeouts are too aggressive. Contributed by Eric Badger
(cherry picked from commit 9649c27864)
2017-03-10 13:07:20 -06:00
Jian He 8c44a9db21 HADOOP-14062. ApplicationMasterProtocolPBClientImpl.allocate fails with EOFException when RPC privacy is enabled. Contributed by Steven Rand 2017-03-10 09:26:19 -08:00
Jason Lowe 1a0358b59a YARN-6310. OutputStreams in AggregatedLogFormat.LogWriter can be left open upon exceptions. Contributed by Haibo Chen
(cherry picked from commit deb9f56946)
2017-03-10 11:09:04 -06:00
Karthik Kambatla 4218671f3c YARN-6264. AM not launched when a single vcore is available on the cluster. (Yufei Gu via kasha)
(cherry picked from commit a96afae125)
2017-03-09 23:16:19 -08:00
Daniel Templeton c60cd88ad1 YARN-1047. Expose # of pre-emptions as a queue counter (Contributed by Karthik Kambatla via Daniel Templeton) 2017-03-09 18:18:03 -08:00
Daniel Templeton 6774f9c3c2 YARN-6300. NULL_UPDATE_REQUESTS is redundant in TestFairScheduler (Contributed by Yuanbo Liu via Daniel Templeton)
(cherry picked from commit 8bc8804be1b96f8a4fc33b5d24dd7f7af93a437e)
2017-03-09 17:53:24 -08:00
Sidharta S 3b6e5ef917 YARN-5669. Add support for docker pull command (Contribtued by luhuichun) 2017-03-09 16:23:21 -08:00
Jason Lowe 4a1187238a YARN-6165. Intra-queue preemption occurs even when preemption is turned off for a specific queue. Contributed by Eric Payne
(cherry picked from commit d7762a5511)
2017-03-08 16:47:31 -06:00
Jian He ea8d53de6a Revert "HADOOP-14062. ApplicationMasterProtocolPBClientImpl.allocate fails with EOFException when RPC privacy is enabled. Contributed by Steven Rand"
This reverts commit 36eda5af75.
2017-03-08 13:20:45 -08:00
Jian He 36eda5af75 HADOOP-14062. ApplicationMasterProtocolPBClientImpl.allocate fails with EOFException when RPC privacy is enabled. Contributed by Steven Rand 2017-03-08 10:49:07 -08:00
Robert Kanter 11d4cdc3a6 YARN-6297. TestAppLogAggregatorImp.verifyFilesUploaded() should check # of filed uploaded with that of files expected (haibochen via rkanter)
(cherry picked from commit 287ba4ffa6)
2017-03-08 10:45:59 -08:00
Sunil G ec4c2d42fe YARN-6207. Move application across queues should handle delayed event processing. Contributed by Bibin A Chundatt.
(cherry picked from commit 1eb8186703)
2017-03-08 12:13:14 +05:30
Daniel Templeton 83fbb051bb YARN-6287. RMCriticalThreadUncaughtExceptionHandler.rmContext should be final (Contributed by Corey Barker via Daniel Templeton)
(cherry picked from commit e0c239cdbd)
2017-03-07 12:01:38 -08:00
Jian He 712434e744 YARN-6270. WebUtils.getRMWebAppURLWithScheme() needs to honor RM HA setting. Contributed by Xuan Gong 2017-03-03 15:24:23 -08:00
Junping Du a58dfcb421 YARN-6271. yarn rmadin -getGroups returns information from standby RM. Contributed by Jian He.
(cherry picked from commit 8db7a8c3ae)
2017-03-03 14:40:32 -08:00
Ray Chiang 303ee13e3c YARN-6218. Fix TestAMRMClient when using FairScheduler. (Miklos Szegedi via rchiang)
(cherry picked from commit 30b91ff9540e35195af834d1bd5983114a556c6d)
2017-03-03 12:57:17 -08:00
Jason Lowe 5eca427da2 YARN-6263. NMTokenSecretManagerInRM.createAndGetNMToken is not thread safe. Contributed by Haibo Chen
(cherry picked from commit e58fc76030)
2017-03-03 08:24:05 -06:00
Jian He c7ddf95d4a YARN-6153. KeepContainer does not work when AM retry window is set. Contributed by kyungwan nam
(cherry picked from commit 235203dffd)
2017-02-28 13:26:55 -08:00
Xuan 95bd3c3d55 YARN-6189: Improve application status log message when RM restarted when
app is in NEW state. Contributed by Junping Du

(cherry picked from commit e0bb867c3f)
2017-02-28 11:06:07 -08:00
Gera Shegalov df35ba81fe YARN-1728. Workaround guice3x-undecoded pathInfo in YARN WebApp. (Yuanbo Liu via gera) 2017-02-28 09:46:57 -08:00
Karthik Kambatla 58eefc71c7 YARN-6231. FairSchedulerTestBase helper methods should call scheduler.update to avoid flakiness. (kasha)
(cherry picked from commit f187d63816)
2017-02-27 22:16:32 -08:00
Naganarasimha 1eec911cd9 YARN-5703. ReservationAgents are not correctly configured. Contributed by Manikandan R.
(cherry picked from commit 5f5b031d1f)
2017-02-27 20:42:10 +05:30
Sunil G 68b08e96a0 YARN-6123. [YARN-5864] Add a test to make sure queues of orderingPolicy will be updated when childQueues is added or removed. Contributed by Wangda Tan. 2017-02-27 17:54:47 +05:30
Karthik Kambatla c24325bbcb YARN-6172. FSLeafQueue demand update needs to be atomic. (Miklos Szegedi via kasha)
(cherry picked from commit fbfe86deea)
2017-02-26 20:37:18 -08:00
Karthik Kambatla a95d3e1fe7 YARN-6215. FairScheduler preemption and update should not run concurrently. (Tao Jie via kasha)
(cherry picked from commit 815d53506f)
2017-02-26 20:31:03 -08:00
Xuan 19c5b3d622 YARN-6228: EntityGroupFSTimelineStore should allow configurable cache
stores. Contributed by Li Lu

(cherry picked from commit 53d372a255)
2017-02-24 11:00:16 -08:00
Wangda Tan 650ff95e00 YARN-5959. RM changes to support change of container ExecutionType. (Arun Suresh via wangda)
(cherry picked from commit 0a55bd841e)
2017-02-23 17:22:25 -08:00
Karthik Kambatla 40bc9e7ddb YARN-6222. TestFairScheduler.testReservationMetrics is flaky. (Yufei Gu via kasha)
(cherry picked from commit 694e680d20)
2017-02-23 15:30:29 -08:00
Wangda Tan a82480c7d4 YARN-6143. Fix incompatible issue caused by YARN-3583. (Sunil G via wangda) 2017-02-23 13:59:20 -08:00
Sunil G 801357726d YARN-6211. Synchronization improvement for moveApplicationAcrossQueues and updateApplicationPriority. Contributed by Bibin A Chundatt.
(cherry picked from commit a207aa9930)
2017-02-23 14:25:14 +05:30
Karthik Kambatla 37edbd35f1 YARN-6194. Cluster capacity in SchedulingPolicy is updated only on allocation file reload. (Yufei Gu via kasha)
(cherry picked from commit b10e962224)
2017-02-22 15:59:11 -08:00
Karthik Kambatla ca7a6a7365 YARN-6210. FairScheduler: Node reservations can interfere with preemption. (kasha)
(cherry picked from commit 718ad9f6ee)
2017-02-22 15:52:44 -08:00
Daniel Templeton 93219b6454 YARN-6125. The application attempt's diagnostic message should have a maximum size
(Contributed by Andras Piros via Daniel Templeton)

(cherry picked from commit c7a36e6130)
2017-02-17 14:38:06 -08:00
Karthik Kambatla 5575328bb6 YARN-6193. FairScheduler might not trigger preemption when using DRF. (kasha)
(cherry picked from commit dbbfcf74ab)
2017-02-17 14:07:47 -08:00
Daniel Templeton 10fd5627bb YARN-6188. Fix OOM issue with decommissioningNodesWatcher in the case of clusters with
large number of nodes (Contributed by Ajay Jadhav via Daniel Templeton)

(cherry picked from commit 585168cc1bdb87398d4f388f3eeccd1c71114982)
2017-02-17 13:14:37 -08:00
Karthik Kambatla afc8124ff6 YARN-6171. ConcurrentModificationException on FSAppAttempt.containersToPreempt. (Miklos Szegedi via kasha)
(cherry picked from commit a77f432449)
2017-02-16 15:03:35 -08:00
Li Lu 8fc67e5973 YARN-6177. Yarn client should exit with an informative error message if an incompatible Jersey library is used at client. Contributed by Weiwei Yang.
(cherry picked from commit 5d339c46f5)
2017-02-16 13:46:30 -08:00