399 Commits

Author SHA1 Message Date
Junping Du 086ee0adf8 YARN-3269. yarn.nodemanager.remote-app-log-dir could not be configured to fully qualified path. Contributed by Xuan Gong
(cherry picked from commit d81109e588)

(cherry picked from commit f40f17489c)
2017-02-02 09:57:14 -08:00
Naganarasimha 7d87e11be4 YARN-4355. NPE while processing localizer heartbeat. Contributed by Varun Saxena & Jonathan Hung 2016-11-15 17:36:14 +05:30
Jason Lowe 5194a9bb6f YARN-5001. Aggregated Logs root directory is created with wrong group if nonexistent. Contributed by Haibo Chen
(cherry picked from commit 76893a4100)
2016-11-01 20:26:07 +00:00
Jason Lowe 22b9a6ce4a YARN-4717. TestResourceLocalizationService.testPublicResourceInitializesLocalDir fails Intermittently due to IllegalArgumentException from cleanup (templedf via rkanter)
(cherry picked from commit 22ca176dfe)
2016-08-03 19:37:23 +00:00
Jason Lowe 4948fce1c1 YARN-5462. TestNodeStatusUpdater.testNodeStatusUpdaterRetryAndNMShutdown fails intermittently. Contributed by Eric Badger
(cherry picked from commit db646540f0)
2016-08-03 19:21:27 +00:00
Jason Lowe f8f995629d YARN-4393. Fix intermittent test failure for TestResourceLocalizationService#testFailedDirsResourceRelease (Varun Saxena via rohithsharmaks)
(cherry picked from commit 791c1639ae)

Conflicts:

	hadoop-yarn-project/CHANGES.txt
2016-07-12 19:30:53 +00:00
Jason Lowe 6759cbc56a HADOOP-13362. DefaultMetricsSystem leaks the source name when a source unregisters. Contributed by Junping Du 2016-07-11 20:58:40 +00:00
Vinod Kumar Vavilapalli f4b58eae94 Preparing for 2.7.4 development: Updating snapshot version to 2.7.4 on branch-2.7. 2016-06-14 19:14:02 -07:00
Jason Lowe f0520a2c94 YARN-4459. container-executor should only kill process groups. Contributed by Jun Gong
(cherry picked from commit 1ba31fe9e9)

Conflicts:

	hadoop-yarn-project/hadoop-yarn/hadoop-yarn-server/hadoop-yarn-server-nodemanager/src/main/native/container-executor/test/test-container-executor.c
2016-05-25 21:49:21 +00:00
Robert Kanter b0bcb4e728 Remove parent's env vars from child processes 2016-05-06 09:33:36 -07:00
Jason Lowe e12c9e3348 YARN-5009. NMLeveldbStateStoreService database can grow substantially leading to longer recovery times. Contributed by Jason Lowe
(cherry picked from commit 4a8508501b)

Conflicts:

	hadoop-yarn-project/hadoop-yarn/hadoop-yarn-api/src/main/java/org/apache/hadoop/yarn/conf/YarnConfiguration.java
	hadoop-yarn-project/hadoop-yarn/hadoop-yarn-common/src/main/resources/yarn-default.xml
2016-04-29 15:07:07 +00:00
Jason Lowe dbb7458735 Revert "YARN-5009. NMLeveldbStateStoreService database can grow substantially leading to longer recovery times. Contributed by Jason Lowe"
This reverts commit 77ef473981.
2016-04-29 14:38:30 +00:00
Jian He 77ef473981 YARN-5009. NMLeveldbStateStoreService database can grow substantially leading to longer recovery times. Contributed by Jason Lowe
(cherry picked from commit 4a8508501b)
2016-04-28 22:00:42 -07:00
Jason Lowe a996889313 YARN-4924. NM recovery race can lead to container not cleaned up. Contributed by sandflee
(cherry picked from commit 9b5c5bd42f)
2016-04-14 19:44:31 +00:00
Jason Lowe 49428ab6bb YARN-4773. Log aggregation performs extraneous filesystem operations when rolling log aggregation is disabled. Contributed by Jun Gong 2016-04-05 13:51:46 +00:00
Eric Payne f0c278469b YARN-4686. MiniYARNCluster.start() returns before cluster is completely started. Contributed by Eric Badger. 2016-03-18 21:01:53 +00:00
Jason Lowe b406dcaff2 YARN-2046. Out of band heartbeats are sent only on container kill and possibly too early. Contributed by Ming Ma 2016-02-23 20:53:30 +00:00
Jason Lowe f264939c47 YARN-3602. TestResourceLocalizationService.testPublicResourceInitializesLocalDir fails Intermittently due to IOException from cleanup. Contributed by zhihai xu
(cherry picked from commit 333f9a896d)

Conflicts:

	hadoop-yarn-project/CHANGES.txt
2016-02-09 16:51:36 +00:00
Jason Lowe 9601a3b2ce YARN-4598. Invalid event: RESOURCE_FAILED at CONTAINER_CLEANEDUP_AFTER_KILL. Contributed by tangshangwen
(cherry picked from commit 46e5ea81e0)

Conflicts:

	hadoop-yarn-project/CHANGES.txt
2016-01-22 21:58:12 +00:00
Jason Lowe 4b829c41ce YARN-4414. Nodemanager connection errors are retried at multiple levels. Contributed by Chang Li
(cherry picked from commit 13de8359a1)

Conflicts:

	hadoop-yarn-project/CHANGES.txt
2016-01-12 15:59:14 +00:00
Jason Lowe a30b8ef59e YARN-3695. ServerProxy (NMProxy, etc.) shouldn't retry forever for non network exception. Contributed by Raju Bairishetti
(cherry picked from commit 62e583c7dc)

Conflicts:

	hadoop-yarn-project/CHANGES.txt
2016-01-12 15:04:43 +00:00
Jian He 6a30d93aa4 YARN-3840. Resource Manager web ui issue when sorting application by id (with application having id > 9999). Contributed by Mohammad Shahid Khan and Varun Saxena
(cherry picked from commit 9f77ccad73)

Conflicts:
	hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-app/src/main/java/org/apache/hadoop/mapreduce/v2/app/webapp/TaskPage.java
	hadoop-yarn-project/hadoop-yarn/hadoop-yarn-server/hadoop-yarn-server-applicationhistoryservice/src/test/java/org/apache/hadoop/yarn/server/applicationhistoryservice/webapp/TestAHSWebApp.java
	hadoop-yarn-project/hadoop-yarn/hadoop-yarn-server/hadoop-yarn-server-common/src/main/java/org/apache/hadoop/yarn/server/webapp/WebPageUtils.java
2015-12-03 14:11:48 -08:00
Tsuyoshi Ozawa f50f889c17 YARN-4380. TestResourceLocalizationService.testDownloadingResourcesOnContainerKill fails intermittently. Contributed by Varun Saxena.
(cherry picked from commit 0656d2dc83)

Conflicts:
	hadoop-yarn-project/hadoop-yarn/hadoop-yarn-server/hadoop-yarn-server-nodemanager/src/test/java/org/apache/hadoop/yarn/server/nodemanager/containermanager/localizer/TestResourceLocalizationService.java
2015-11-26 01:22:11 +09:00
Jian He 27b786660e Revert "YARN-3840. Resource Manager web ui issue when sorting application by id (with application having id > 9999) Contributed by Mohammad Shahid Khan"
This reverts commit cb4d4d5918.

Conflicts:
	hadoop-yarn-project/CHANGES.txt
2015-11-16 20:20:46 -08:00
Jason Lowe e1716c7cf9 YARN-4354. Public resource localization fails with NPE. Contributed by Jason Lowe.
(cherry picked from commit 855d52927b)

Conflicts:

	hadoop-yarn-project/CHANGES.txt
	hadoop-yarn-project/hadoop-yarn/hadoop-yarn-server/hadoop-yarn-server-nodemanager/src/test/java/org/apache/hadoop/yarn/server/nodemanager/containermanager/localizer/TestLocalResourcesTrackerImpl.java
2015-11-16 15:02:56 +00:00
Jason Lowe 38f37ca2eb Revert "YARN-4354. Public resource localization fails with NPE. Contributed by Jason Lowe."
This reverts commit 8ea88ab0b8.
2015-11-16 14:30:22 +00:00
Junping Du 8ea88ab0b8 YARN-4354. Public resource localization fails with NPE. Contributed by Jason Lowe.
(cherry picked from commit 855d52927b)
(cherry picked from commit 1d107d8058)

Conflicts:

	hadoop-yarn-project/CHANGES.txt
	hadoop-yarn-project/hadoop-yarn/hadoop-yarn-server/hadoop-yarn-server-nodemanager/src/test/java/org/apache/hadoop/yarn/server/nodemanager/containermanager/localizer/TestLocalResourcesTrackerImpl.java
2015-11-15 04:50:28 -08:00
Jian He cb4d4d5918 YARN-3840. Resource Manager web ui issue when sorting application by id (with application having id > 9999) Contributed by Mohammad Shahid Khan
(cherry picked from commit 8fbea531d7)
(cherry picked from commit a4ff03b200)

Conflicts:
	hadoop-mapreduce-project/hadoop-mapreduce-client/hadoop-mapreduce-client-app/src/main/java/org/apache/hadoop/mapreduce/v2/app/webapp/TaskPage.java
	hadoop-yarn-project/CHANGES.txt
	hadoop-yarn-project/hadoop-yarn/hadoop-yarn-server/hadoop-yarn-server-applicationhistoryservice/src/test/java/org/apache/hadoop/yarn/server/applicationhistoryservice/webapp/TestAHSWebApp.java
	hadoop-yarn-project/hadoop-yarn/hadoop-yarn-server/hadoop-yarn-server-common/src/main/java/org/apache/hadoop/yarn/server/webapp/WebPageUtils.java
2015-11-09 14:30:47 -08:00
Vinod Kumar Vavilapalli 09348c4bce Preparing for 2.7.3 development 2015-11-03 15:18:10 -08:00
Jason Lowe c43cc8fd67 YARN-2902. Killing a container that is localizing can orphan resources in the DOWNLOADING state. Contributed by Varun Saxena
(cherry picked from commit e2267de207)

Conflicts:

	hadoop-yarn-project/hadoop-yarn/hadoop-yarn-server/hadoop-yarn-server-nodemanager/src/main/java/org/apache/hadoop/yarn/server/nodemanager/DockerContainerExecutor.java
	hadoop-yarn-project/hadoop-yarn/hadoop-yarn-server/hadoop-yarn-server-nodemanager/src/test/java/org/apache/hadoop/yarn/server/nodemanager/containermanager/localizer/TestResourceLocalizationService.java
2015-10-29 16:48:57 +00:00
Jonathan Eagles 49a7d70f53 YARN-4009. CORS support for ResourceManager REST API. (Varun Vasudev via jeagles)
(cherry picked from commit f8adeb712d)
2015-10-23 11:18:32 -05:00
Jason Lowe 84f1d99d02 YARN-4005. Completed container whose app is finished is possibly not removed from NMStateStore. Contributed by Jun Gong
(cherry picked from commit 38aed1a94e)

Conflicts:

	hadoop-yarn-project/CHANGES.txt
2015-10-08 15:06:58 +00:00
Jason Lowe 9f6ed41b95 YARN-3619. ContainerMetrics unregisters during getMetrics and leads to ConcurrentModificationException. Contributed by Zhihai Xu 2015-10-02 20:19:14 +00:00
Jason Lowe 493f072008 YARN-3727. For better error recovery, check if the directory exists before using it for localization. Contributed by Zhihai Xu
(cherry picked from commit 854d25b0c3)

Conflicts:

	hadoop-yarn-project/hadoop-yarn/hadoop-yarn-server/hadoop-yarn-server-nodemanager/src/test/java/org/apache/hadoop/yarn/server/nodemanager/containermanager/localizer/TestLocalResourcesTrackerImpl.java
2015-09-30 15:08:10 +00:00
Jason Lowe 30ebb792f9 YARN-4158. Remove duplicate close for LogWriter in AppLogAggregatorImpl#uploadLogsForContainers. Contributed by Zhihai Xu
(cherry picked from commit 8c1cdb17a0)
2015-09-15 20:23:11 +00:00
Zhihai Xu 4b5767e456 YARN-4096. App local logs are leaked if log aggregation fails to initialize for the app. Contributed by Jason Lowe.
(cherry picked from commit 16b9037dc1)
2015-09-08 12:43:55 -07:00
Jason Lowe e8410c0175 YARN-3925. ContainerLogsUtils#getContainerLogFile fails to read container log files from full disks. Contributed by zhihai xu
(cherry picked from commit ff9c13e0a7)

Conflicts:

	hadoop-yarn-project/hadoop-yarn/hadoop-yarn-server/hadoop-yarn-server-nodemanager/src/test/java/org/apache/hadoop/yarn/server/nodemanager/webapp/TestContainerLogsPage.java
2015-07-24 22:27:52 +00:00
Jason Lowe 055d9292a7 YARN-3793. Several NPEs when deleting local files on NM recovery. Contributed by Varun Saxena
(cherry picked from commit b5cdf78e8e)
2015-07-01 21:15:30 +00:00
Vinod Kumar Vavilapalli a36f090c55 Preparing for 2.7.2 development: mvn versions:set -DnewVersion=2.7.2 2015-06-28 16:41:12 -07:00
Jason Lowe baf9e22284 YARN-3850. NM fails to read files from full disks which can lead to container logs being lost and other issues. Contributed by Varun Saxena
(cherry picked from commit 40b256949a)
2015-06-26 15:48:34 +00:00
Jason Lowe 41d9677740 YARN-3832. Resource Localization fails on a cluster due to existing cache directories. Contributed by Brahma Reddy Battula
(cherry picked from commit 8d58512d6e)
2015-06-24 16:38:40 +00:00
Karthik Kambatla 6b1a156e27 YARN-3842. NMProxy should retry on NMNotYetReadyException. (Robert Kanter via kasha)
(cherry picked from commit 5ebf2817e5)
(cherry picked from commit 9656ee4ee7)
2015-06-22 17:49:55 -07:00
Jason Lowe 3d2c3f8648 YARN-3585. NodeManager cannot exit on SHUTDOWN event triggered and NM recovery is enabled. Contributed by Rohith Sharmaks
(cherry picked from commit e13b671aa5)

Conflicts:

	hadoop-yarn-project/CHANGES.txt
2015-06-03 19:46:51 +00:00
cnauroth 02ab202a2b YARN-3626. On Windows localized resources are not moved to the front of the classpath when they should be. Contributed by Craig Welch.
(cherry picked from commit 7bc33b63c78c6ee9a04c0c1511ef16e63f7f3074)
(cherry picked from commit 6ed8989a6f)
2015-05-27 14:21:26 -07:00
Jason Lowe 5161751433 YARN-3641. NodeManager: stopRecoveryStore() shouldn't be skipped when exceptions happen in stopping NM's sub-services. Contributed by Junping Du
(cherry picked from commit 711d77cc54)
2015-05-13 21:09:03 +00:00
Jason Lowe 7110499817 YARN-3537. NPE when NodeManager.serviceInit fails and stopRecoveryStore invoked. Contributed by Brahma Reddy Battula
(cherry picked from commit 5e093f0d40)
2015-05-13 20:38:52 +00:00
Jason Lowe bfd28d6f7e YARN-3457. NPE when NodeManager.serviceInit fails and stopRecoveryStore called. Contributed by Bibin A Chundatt.
(cherry picked from commit ac32fa187c)

Conflicts:

	hadoop-yarn-project/CHANGES.txt
2015-05-13 20:38:52 +00:00
Xuan 9527cdd12d YARN-3626. On Windows localized resources are not moved to the front of the classpath when they should be. Contributed by Craig Welch
(cherry picked from commit 0f95921447)
(cherry picked from commit 487d9b0f3f)
2015-05-13 13:12:38 -07:00
Jason Lowe a75f4bed6e YARN-3476. Nodemanager can fail to delete local logs if log aggregation fails. Contributed by Rohith
(cherry picked from commit 25e2b02122)
2015-05-08 22:47:18 +00:00
Karthik Kambatla 4ddcc7e5b5 YARN-3464. Race condition in LocalizerRunner kills localizer before localizing all resources. (Zhihai Xu via kasha)
(cherry picked from commit 47279c3228)
(cherry picked from commit 4045c41afe)
2015-04-27 13:37:06 -07:00