524 Commits

Author SHA1 Message Date
Jian He
23eb3c7ceb YARN-5270. Solve miscellaneous issues caused by YARN-4844. Contributed by Wangda Tan 2016-07-11 22:38:35 -07:00
Vinod Kumar Vavilapalli
4ea87cb38c YARN-5214. Fixed locking in DirectoryCollection to avoid hanging NMs when various code-paths hit slow disks. Contributed by Junping Du.
(cherry picked from commit ce9c006430d13a28bc1ca57c5c70cc1b7cba1692)
2016-07-05 17:12:37 -07:00
Junping Du
810470508b YARN-5237. Fix missing log files issue in rolling log aggregation. Contributed by Xuan Gong. 2016-06-16 07:18:36 -07:00
Wangda Tan
d838c6443d YARN-1942. Deprecate toString/fromString methods from ConverterUtils and move them to records classes like ContainerId/ApplicationId, etc. (wangda) 2016-06-14 15:21:41 -07:00
Junping Du
2be48e7d15 YARN-5199. Close LogReader in NMWebServices#getLogs. Contributed by Xuan Gong. 2016-06-09 12:29:25 -07:00
Xuan
11b4d1e486 Revert "YARN-4920. ATS/NM should support a link to dowload/get the logs in text format. Contributed by Xuan Gong."
This reverts commit 22ac37615a933f9cee8cf19ad0182586a037b690.
2016-06-08 11:23:12 -07:00
Wangda Tan
19e578870d YARN-4844. Add getMemorySize/getVirtualCoresSize to o.a.h.y.api.records.Resource. (wangda) 2016-06-07 12:41:50 -07:00
Ming Ma
ec4f9a14f9 MAPREDUCE-5044. Have AM trigger jstack on task attempts that timeout before killing them. (Eric Payne and Gera Shegalov via mingma)
(cherry picked from commit 4a1cedc010d3fa1d8ef3f2773ca12acadfee5ba5)
(cherry picked from commit 74e2b5efa26f27027fed212b4b2108f0e95587fb)
2016-06-06 14:49:43 -07:00
Jian He
3c2bd19fa5 YARN-5190. Registering/unregistering container metrics in ContainerMonitorImpl and ContainerImpl causing uncaught exception in ContainerMonitorImpl. Contributed by Junping Du
(cherry picked from commit 99cc439e29794f8e61bebe03b2a7ca4b6743ec92)
2016-06-03 11:11:49 -07:00
Wangda Tan
2f3e1d965d Revert "YARn-4844. Add getMemoryLong/getVirtualCoreLong to o.a.h.y.api.records.Resource. Contributed by Wangda Tan."
This reverts commit 457884737f75c796413ce860b1859a31cc5292ca.
2016-05-31 22:16:53 -07:00
Varun Vasudev
457884737f YARn-4844. Add getMemoryLong/getVirtualCoreLong to o.a.h.y.api.records.Resource. Contributed by Wangda Tan. 2016-05-29 20:57:56 +05:30
Carlo Curino
7982933c09 YARN-4957. Add getNewReservation in ApplicationClientProtocol (Sean Po via curino)
(cherry picked from commit 013532a95e63d7c53e601be530021d6d5a15ab7f)
(cherry picked from commit c656977961e2ba0f9dfd349ed59bf1d0d41c57f5)
2016-05-25 17:02:22 -07:00
Jason Lowe
fe10caee8d YARN-4459. container-executor should only kill process groups. Contributed by Jun Gong
(cherry picked from commit 1ba31fe9e906dbd093afd4b254216601967a4a7b)

Conflicts:

	hadoop-yarn-project/hadoop-yarn/hadoop-yarn-server/hadoop-yarn-server-nodemanager/src/main/native/container-executor/test/test-container-executor.c
2016-05-25 21:37:31 +00:00
Varun Vasudev
35456bb7c9 YARN-857. Localization failures should be available in container diagnostics. Contributed by Vinod Kumar Vavilapalli.
(cherry picked from commit f440a9d8c4a177bc5062d21d4b4bc4d9b2944344)
(cherry picked from commit 36f2ae0692d73a865a5c0c520d1346b6d4498c25)
2016-05-25 19:03:33 +05:30
Jason Lowe
8e404b4321 YARN-5103. With NM recovery enabled, restarting NM multiple times results in AM restart. Contributed by Junping Du
(cherry picked from commit d1df0266cf4e9ff0ec70813c156556ca4e74f791)
2016-05-23 15:17:26 +00:00
Rohith Sharma K S
726c1f14b8 YARN-3840. Resource Manager web ui issue when sorting application by id (with application having id > 9999). Contributed by Mohammad Shahid Khan and Varun Saxena 2016-05-19 10:50:32 +05:30
Jason Lowe
70faa87ccf YARN-4325. Nodemanager log handlers fail to send finished/failed events in some cases. Contributed by Junping Du
(cherry picked from commit 81effb7dcde2b31423438d6f1b8b8204d4ca05b3)
2016-05-16 15:43:42 +00:00
Wangda Tan
3620d0e623 YARN-4984. LogAggregationService shouldn't swallow exception in handling createAppDir() which cause thread leak. (Junping Du via wangda)
(cherry picked from commit 7bd418e48c71590fc8026d69f9b8f8ad42f2aade)
2016-05-05 10:08:37 -07:00
Junping Du
22ac37615a YARN-4920. ATS/NM should support a link to dowload/get the logs in text format. Contributed by Xuan Gong.
(cherry picked from commit 3cf223166d452a0f58f92676837a9edb8ddc1139)
(cherry picked from commit c79dc07dc193904f2586a5d64ea2f4e56d2396b8)
2016-05-04 09:49:08 -07:00
Robert Kanter
3c3d003402 Remove parent's env vars from child processes
(cherry picked from commit ac8fb579c6058fec60caf30682f902413d68edf3)
2016-04-29 15:30:06 -07:00
Jian He
5ba79d77fb YARN-5009. NMLeveldbStateStoreService database can grow substantially leading to longer recovery times. Contributed by Jason Lowe
(cherry picked from commit 4a8508501bc753858693dacdafba61d604702f71)
2016-04-28 21:54:53 -07:00
Jason Lowe
9b5c5bd42f YARN-4924. NM recovery race can lead to container not cleaned up. Contributed by sandflee
(cherry picked from commit 3150ae8108a1fc40a67926be6254824c1e37cb38)

Conflicts:

	hadoop-yarn-project/hadoop-yarn/hadoop-yarn-server/hadoop-yarn-server-nodemanager/src/main/java/org/apache/hadoop/yarn/server/nodemanager/containermanager/ContainerManagerImpl.java
	hadoop-yarn-project/hadoop-yarn/hadoop-yarn-server/hadoop-yarn-server-nodemanager/src/main/java/org/apache/hadoop/yarn/server/nodemanager/recovery/NMLeveldbStateStoreService.java
2016-04-14 19:40:10 +00:00
Vinod Kumar Vavilapalli
3589b9e10e YARN-4168. Fixed a failing test TestLogAggregationService.testLocalFileDeletionOnDiskFull. Contributed by Takashi Ohnishi.
(cherry picked from commit 44bbc50d919388e4ad08be2e9ba80ac7502d2579)
2016-04-11 12:13:37 -07:00
Akira Ajisaka
13be7a849d YARN-4630. Remove useless boxing/unboxing code. Contributed by Kousuke Saruta.
(cherry picked from commit 1ff27f9d12e8124c1b9a722708264c5b07fd0fde)
(cherry picked from commit 8cf6630fc6cedbd86eff9da6f35ce1da4ed7ed2f)
2016-04-11 14:55:54 +09:00
Karthik Kambatla
ddb1407980 YARN-4756. Unnecessary wait in Node Status Updater during reboot. (Eric Badger via kasha)
(cherry picked from commit e82f961a3925aadf9e53a009820a48ba9e4f78b6)
(cherry picked from commit 2b97a50eec8e9f7167a44b8ca0391fce0aae571c)
2016-04-07 17:35:06 -07:00
naganarasimha
9bd089ac64 YARN-4746. yarn web services should convert parse failures of appId, appAttemptId and containerId to 400. Contributed by Bibin A Chundatt
(cherry picked from commit 5092c94195a63bd2c3e36d5a74b4c061cea1b847)
2016-04-04 18:16:23 +05:30
Jason Lowe
35f9cfda61 YARN-4773. Log aggregation performs extraneous filesystem operations when rolling log aggregation is disabled. Contributed by Jun Gong
(cherry picked from commit 948b75807068c304ffe789e32f2b850c0d653e0a)
2016-03-28 23:03:16 +00:00
Jian He
7c81e374da YARN-4117. End to end unit test with mini YARN cluster for AMRMProxy Service. Contributed by Giovanni Matteo Fumarola 2016-03-28 13:23:53 -07:00
Eric Payne
878e1cfc77 YARN-4686. MiniYARNCluster.start() returns before cluster is completely started. Contributed by Eric Badger.
(cherry picked from commit 92b7e0d41302b6b110927f99de5c2b4a4a93c5fd)
2016-03-18 17:19:06 +00:00
Jason Lowe
53ec7c9243 YARN-4744. Too many signal to container failure in case of LCE. Contributed by Sidharta Seethana
(cherry picked from commit 059caf99891943d9587cac19b48e82efbed06b2d)
2016-03-07 15:48:06 +00:00
Haohui Mai
69b195d619 HADOOP-12813. Migrate TestRPC and related codes to rebase on ProtobufRpcEngine. Contributed by Kai Zheng. 2016-02-29 14:15:25 -08:00
Ming Ma
6f5ca1b293 YARN-4720. Skip unnecessary NN operations in log aggregation. (Jun Gong via mingma)
(cherry picked from commit 7f3139e54da2c496327446a5eac43f8421fc8839)
2016-02-26 08:49:51 -08:00
Jason Lowe
6f3f5a8c38 YARN-2046. Out of band heartbeats are sent only on container kill and possibly too early. Contributed by Ming Ma
(cherry picked from commit d284e187b8db43056236032ebc2114ee462c27f6)
2016-02-23 20:51:41 +00:00
Varun Vasudev
5f68f640a5 YARN-4709. NMWebServices produces incorrect JSON for containers. Contributed by Varun Saxena.
(cherry picked from commit 140cb5d74565aad848b40a251ec02b597a3a20d4)
(cherry picked from commit 2c218ca8a8da9bdea3ba052da10d87e5716c1d16)
2016-02-23 12:33:11 +05:30
Wangda Tan
c487453b91 YARN-4340. Add list API to reservation system. (Sean Po via wangda)
(cherry picked from commit 9875325d5c63f343809907d06bf48a298035a611)
2016-02-02 10:23:25 +08:00
Jason Lowe
290b7db656 YARN-4643. Container recovery is broken with delegating container runtime. Contributed by Sidharta Seethana
(cherry picked from commit 61382ff8fabc76b3a51f227646573cdf367fea1a)
2016-01-28 19:01:28 +00:00
Jian He
389a7fb33f YARN-4520. Finished app info is unnecessarily persisted in NM state-store if container is acquired but not lunched on this node. Contributed by sandflee 2016-01-25 15:36:59 -08:00
Jason Lowe
c1953e5e91 YARN-4598. Invalid event: RESOURCE_FAILED at CONTAINER_CLEANEDUP_AFTER_KILL. Contributed by tangshangwen
(cherry picked from commit 46e5ea81e0e41646ba02391aea31481ead28aaf8)
2016-01-22 21:56:37 +00:00
Colin Patrick Mccabe
de5175d216 HADOOP-12712. Fix some cmake plugin and native build warnings (cmccabe)
(cherry picked from commit b2c155f810888480d058512a326a5083e1bb4eaa)

 Conflicts:
	hadoop-common-project/hadoop-common/CHANGES.txt
	hadoop-maven-plugins/src/main/java/org/apache/hadoop/maven/plugin/cmakebuilder/CompileMojo.java
	hadoop-maven-plugins/src/main/java/org/apache/hadoop/maven/plugin/cmakebuilder/TestMojo.java
	hadoop-maven-plugins/src/main/java/org/apache/hadoop/maven/plugin/util/Exec.java
2016-01-21 16:04:05 +05:30
Wangda Tan
5dc2e78c97 HADOOP-12356. Fix computing CPU usage statistics on Windows. (Inigo Goiri via wangda)
(cherry picked from commit 89d1fd5dac4bccf42d82686e146b02eb60d14736)

Conflicts:
	hadoop-common-project/hadoop-common/CHANGES.txt
2016-01-19 21:31:35 +08:00
Jason Lowe
a8292161fc YARN-4414. Nodemanager connection errors are retried at multiple levels. Contributed by Chang Li
(cherry picked from commit 13de8359a1c6d9fc78cd5013c860c1086d86176f)
2016-01-12 15:57:44 +00:00
Akira Ajisaka
852033ca62 YARN-4544. All the log messages about rolling monitoring interval are shown with WARN level. Contributed by Takashi Ohnishi.
(cherry picked from commit 52b77577c44538c26fb9d0144f0b280be3593a38)
(cherry picked from commit 3905435d3d592b980425169bf1355d3d2b5c616f)
2016-01-08 01:13:44 +09:00
Uma Mahesh
6f30919336 YARN-4480. Clean up some inappropriate imports. (Kai Zheng via umamahesh)
(cherry picked from commit 0f82b5d878a76b1626c9e07b2fbb55ce2a79232a)
2015-12-19 23:28:51 -08:00
Wangda Tan
9ed9eb265f YARN-4293. ResourceUtilization should be a part of yarn node CLI. (Sunil G via wangda)
(cherry picked from commit 79c41b1d83e981ae74cb8b58ffcf7907b7612ad4)
(cherry picked from commit 9f4a6ec6630f3776859f8d677532370095a58d0c)
2015-12-18 16:34:45 -08:00
Wangda Tan
c7e10d70de YARN-4309. Add container launch related debug information to container logs when a container fails. (Varun Vasudev via wangda)
(cherry picked from commit dfcbbddb0963c89c0455d41223427165b9f9e537)
(cherry picked from commit 9f979107c22f365633811a38363d0dbd34573374)
2015-12-18 16:34:44 -08:00
Jian He
bc7603e393 YARN-4402. TestNodeManagerShutdown And TestNodeManagerResync fails with bind exception. Contributed by Brahma Reddy Battula 2015-12-14 15:06:13 -08:00
rohithsharmaks
6a6bbc51c4 YARN-4431. Not necessary to do unRegisterNM() if NM get stop due to failed to connect to RM. (Junpin Du via rohithsharmaks) 2015-12-09 11:06:31 +05:30
Junping Du
6af5255b02 YARN-4408. Fix issue that NodeManager still reports negative running containers. Contributed by Robert Kanter.
(cherry picked from commit 62e9348bc10bb97a5fcb4281f7996a09d8e69c60)
(cherry picked from commit e76ba91fa58dae6750f608c0f8b86e9a43e80005)
2015-12-03 06:39:07 -08:00
Tsuyoshi Ozawa
d76b523b02 YARN-4380. TestResourceLocalizationService.testDownloadingResourcesOnContainerKill fails intermittently. Contributed by Varun Saxena.
(cherry picked from commit 0656d2dc83af6a48a8d8d0e37cdf1f813124f366)
2015-11-26 01:10:41 +09:00
Jason Lowe
9e54433c6c YARN-4132. Separate configs for nodemanager to resourcemanager connection timeout and retries. Contributed by Chang Li
(cherry picked from commit 4ac6799d4a8b071e0d367c2d709e84d8ea06942d)
2015-11-24 22:36:19 +00:00