371 Commits

Author SHA1 Message Date
Wangda Tan
4e77ee2f6f YARN-3375. NodeHealthScriptRunner.shouldRun() check is performing 3 times for starting NodeHealthScriptRunner (Devaraj K via wangda)
(cherry picked from commit 71f4de220c74bf2c90630bd0442979d92380d304)
2015-05-04 16:02:57 -07:00
Allen Wittenauer
02e650248d YARN-2980. Move health check script related functionality to hadoop-common (Varun Saxena via aw)
(cherry picked from commit d4ac6822e1c5dfac504ced48f10ab57a55b49e93)

Conflicts:
	hadoop-yarn-project/hadoop-yarn/hadoop-yarn-server/hadoop-yarn-server-nodemanager/src/main/java/org/apache/hadoop/yarn/server/nodemanager/NodeManager.java
2015-05-04 16:00:20 -07:00
Jason Lowe
99b49a8181 YARN-3097. Logging of resource recovery on NM restart has redundancies. Contributed by Eric Payne
(cherry picked from commit 8f65c793f2930bfd16885a2ab188a9970b754974)
2015-05-04 10:32:33 -05:00
Robert Kanter
32b3b8e2ef YARN-3363. add localization and container launch time to ContainerMetrics at NM to show these timing information for each active container. (zxu via rkanter)
(cherry picked from commit ac7d152901e29b1f444507fe4e421eb6e1402b5a)
2015-05-01 16:39:59 -07:00
Vinod Kumar Vavilapalli
90f6e8c90f YARN-2619. Added NodeManager support for disk io isolation through cgroups. Contributed by Varun Vasudev and Wei Yan.
(cherry picked from commit 1b3b9e5c31c38388c1ce4208c65e8dd5f956da82)
2015-04-30 21:43:22 -07:00
Karthik Kambatla
4045c41afe YARN-3464. Race condition in LocalizerRunner kills localizer before localizing all resources. (Zhihai Xu via kasha)
(cherry picked from commit 47279c3228185548ed09c36579b420225e4894f5)
2015-04-26 09:14:12 -07:00
Jason Lowe
cf4154676b YARN-3537. NPE when NodeManager.serviceInit fails and stopRecoveryStore invoked. Contributed by Brahma Reddy Battula
(cherry picked from commit 5e093f0d400f82f67d9b2d24253c79e4a5abacf9)
2015-04-24 22:04:03 +00:00
Xuan
8f6053ae51 YARN-3516. killing ContainerLocalizer action doesn't take effect when private localizer receives FETCH_FAILURE status. Contributed by zhihai xu

(cherry picked from commit 0b3f8957a87ada1a275c9904b211fdbdcefafb02)
2015-04-23 16:41:52 -07:00
Vinod Kumar Vavilapalli
04783b0402 YARN-3366. Enhanced NodeManager to support classifying/shaping outgoing network bandwidth traffic originating from YARN containers Contributed by Sidharta Seethana.
(cherry picked from commit a100be685cc4521e9949589948219231aa5d2733)
2015-04-22 17:36:55 -07:00
Jian He
613a783380 YARN-3503. Expose disk utilization percentage and bad local and log dir counts in NM metrics. Contributed by Varun Vasudev
(cherry picked from commit 674c7ef64916fabbe59c8d6cdd50ca19cf7ddb7c)

Conflicts:
	hadoop-yarn-project/hadoop-yarn/hadoop-yarn-server/hadoop-yarn-server-nodemanager/src/main/java/org/apache/hadoop/yarn/server/nodemanager/NodeManager.java
2015-04-21 21:06:06 -07:00
Junping Du
b244701fad YARN-1402. Update related Web UI and CLI with exposing client API to check log aggregation status. Contributed by Xuan Gong.
(cherry picked from commit 1db355a875c3ecc40a244045c6812e00c8d36ef1)
2015-04-17 13:20:27 -07:00
Jian He
6fed2c2a79 YARN-3354. Add node label expression in ContainerTokenIdentifier to support RM recovery. Contributed by Wangda Tan
(cherry picked from commit 1b89a3e173f8e905074ed6714a7be5c003c0e2c4)
2015-04-15 14:03:29 -07:00
Harsh J
01af29106a YARN-3462. Patches applied for YARN-2424 are inconsistent between trunk and branch-2. Contributed by Naganarasimha G R.
2015-04-15 09:41:43 +05:30
Junping Du
7c072bf092 YARN-3443. Create a 'ResourceHandler' subsystem to ease addition of support for new resource types on the NM. Contributed by Sidharta Seethana.
(cherry picked from commit 838b06ac87339494cea706d4a97e7f5383bdc442)
2015-04-13 18:37:39 -07:00
Junping Du
0891de0712 YARN-1376. NM need to notify the log aggregation status to RM through Node heartbeat. Contributed by Xuan Gong.
(cherry picked from commit 92431c961741747b5d6442f4025016d48d9a6863)
2015-04-10 08:58:07 -07:00
Karthik Kambatla
53e0bf5c17 YARN-3465. Use LinkedHashMap to preserve order of resource requests. (Zhihai Xu via kasha)
(cherry picked from commit 6495940eae09418a939882a8955845f9241a6485)
2015-04-09 00:10:23 -07:00
Tsuyoshi Ozawa
ac32fa187c YARN-3457. NPE when NodeManager.serviceInit fails and stopRecoveryStore called. Contributed by Bibin A Chundatt.
(cherry picked from commit dd852f5b8c8fe9e52d15987605f36b5b60f02701)
2015-04-08 15:56:33 +09:00
Wangda Tan
405aead879 YARN-2901. Add errors and warning metrics page to RM, NM web UI. (Varun Vasudev via wangda)
(cherry picked from commit bad070fe15a642cc6f3a165612fbd272187e03cb)
2015-04-02 17:23:44 -07:00
Vinod Kumar Vavilapalli
d8e17c58bc YARN-3365. Enhanced NodeManager to support using the 'tc' tool via container-executor for outbound network traffic control. Contributed by Sidharta Seethana.
(cherry picked from commit b21c72777ae664b08fd1a93b4f88fa43f2478d94)
2015-04-02 16:55:00 -07:00
Tsuyoshi Ozawa
80755edb7a YARN-3424. Change logs for ContainerMonitorImpl's resourse monitoring from info to debug. Contributed by Anubhav Dhoot.
(cherry picked from commit c69ba81497ae4da329ddb34ba712a64a7eec479f)
2015-04-01 17:44:40 +09:00
Karthik Kambatla
93b29d962d YARN-3428. Debug log resources to be localized for a container. (kasha)
(cherry picked from commit 2daa478a6420585dc13cea2111580ed5fe347bc1)
2015-03-31 17:35:11 -07:00
Wangda Tan
cba4ed1678 YARN-2495. Allow admin specify labels from each NM (Distributed configuration for node label). (Naganarasimha G R via wangda)
(cherry picked from commit 2a945d24f7de1a7ae6e7bd6636188ce3b55c7f52)
2015-03-30 12:05:54 -07:00
Vinod Kumar Vavilapalli
c5bc48946d YARN-3304. Cleaning up ResourceCalculatorProcessTree APIs for public use and removing inconsistencies in the default values. Contributed by Junping Du and Karthik Kambatla.
(cherry picked from commit c358368f511963ad8e35f030b9babee541e1bd01)
2015-03-30 10:11:12 -07:00
Ravi Prakash
b1b4951452 YARN-3288. Document and fix indentation in the DockerContainerExecutor code
(cherry picked from commit e0ccea33c9e12f6930b2867e14b1b37569fed659)
2015-03-28 08:01:26 -07:00
Tsuyoshi Ozawa
cbacf20755 YARN-3384. TestLogAggregationService.verifyContainerLogs fails after YARN-2777. Contributed by Naganarasimha G R.
(cherry picked from commit 82eda771e05cf2b31788ee1582551e65f1c0f9aa)
2015-03-24 00:25:52 +09:00
Junping Du
f40f17489c YARN-3269. Yarn.nodemanager.remote-app-log-dir could not be configured to fully qualified path. Contributed by Xuan Gong
(cherry picked from commit d81109e588493cef31e68508a3d671203bd23e12)
2015-03-20 13:42:31 -07:00
Karthik Kambatla
05627dd96d YARN-3351. AppMaster tracking URL is broken in HA. (Anubhav Dhoot via kasha)
(cherry picked from commit 20b49224eb90c796f042ac4251508f3979fd4787)
2015-03-18 16:33:57 -07:00
Ravi Prakash
9f227ad696 YARN-3339. TestDockerContainerExecutor should pull a single image and not the entire centos repository. (Ravindra Kumar Naik via raviprak)
(cherry picked from commit 56085203c43b8f2561bf3745910e03f8ac176a67)
2015-03-16 16:18:42 -07:00
Tsuyoshi Ozawa
d6e3ef2506 YARN-1453. [JDK8] Fix Javadoc errors caused by incorrect or illegal tags in doc comments. Contributed by Akira AJISAKA, Andrew Purtell, and Allen Wittenauer.
(cherry picked from commit 3da9a97cfbcc3a1c50aaf85b1a129d4d269cd5fd)
2015-03-16 23:20:17 +09:00
Vinod Kumar Vavilapalli
53aa3a4d1f YARN-3154. Added additional APIs in LogAggregationContext to avoid aggregating running logs of application when rolling is enabled. Contributed by Xuan Gong.
(cherry picked from commit 863079bb874ba77918ca1c0741eae10e245995c8)
2015-03-12 13:33:42 -07:00
Vinod Kumar Vavilapalli
d9416317a9 Preparing for 2.8.0 development. Changing SNAPSHOT version to be 2.8 on branch-2.
2015-03-08 20:32:51 -07:00
Jian He
6cef2c16de YARN-2190. Added CPU and memory limit options to the default container executor for Windows containers. Contributed by Chuan Liu
(cherry picked from commit 21101c01f242439ec8ec40fb3a9ab1991ae0adc7)
2015-03-06 14:18:56 -08:00
Karthik Kambatla
1749094fab YARN-3122. Metrics for container's actual CPU usage. (Anubhav Dhoot via kasha)
(cherry picked from commit 53947f37c7a84a84ef4ab1a3cab63ff27c078385)
2015-03-04 17:39:39 -08:00
Konstantin V Shvachko
9e67f2cb0d YARN-3255. RM, NM, JobHistoryServer, and WebAppProxyServer's main() should support generic options. Contributed by Konstantin Shvachko.
2015-02-26 17:26:08 -08:00
Xuan
9cbe9bcbaa YARN-3237. AppLogAggregatorImpl fails to log error cause. Contributed by Rushabh S Shah

(cherry picked from commit f56c65bb3eb9436b67de2df63098e26589e70e56)
2015-02-20 14:03:46 -08:00
Junping Du
4a87a61fe9 YARN-2799. Cleanup TestLogAggregationService based on the change in YARN-90. Contributed by Zhihai Xu
(cherry picked from commit c33ae271c24f0770c9735ccd2086cafda4f4e0b2)
2015-02-20 09:44:31 -08:00
Tsuyoshi Ozawa
fd8f7e2c7d YARN-3203. Correct a log message in AuxServices. Contributed by Brahma Reddy Battula.
(cherry picked from commit 447bd7b5a61a5788dc2a5d29cedfc19f0e99c0f5)
2015-02-16 23:56:32 +09:00
Junping Du
eee89bd18c YARN-2749. Fix some testcases from TestLogAggregationService fails in trunk. (Contributed by Xuan Gong)
(cherry picked from commit ab0b958a522d502426b91b6e4ab6dd29caccc372)
2015-02-15 06:48:29 -08:00
cnauroth
48302e687a YARN-2899. Run TestDockerContainerExecutorWithMocks on Linux only. Contributed by Ming Ma.
(cherry picked from commit 6804d689019e4d2dfb2b35fd82f775a13e85960c)
2015-02-13 21:59:14 -08:00
Jason Lowe
c9cd58d711 YARN-2847. Linux native container executor segfaults if default banned user detected. Contributed by Olaf Flebbe
(cherry picked from commit 1a0f508b6386b1c26ec606f6d73afddaa191b7d8)
2015-02-13 20:22:12 +00:00
Akira Ajisaka
9d8dbf8ea4 YARN-3191. Log object should be initialized with its own class. Contributed by Rohith.
(cherry picked from commit 99c7f2b8fa417303e8f42ea3641c302135879280)
2015-02-12 17:59:04 -08:00
Junping Du
380cc4dbed YARN-2079. Recover NonAggregatingLogHandler state upon nodemanager restart. (Contributed by Jason Lowe)
(cherry picked from commit 04f5ef18f7877ce30b12b1a3c1e851c420531b72)
2015-02-12 11:48:24 -08:00
Jason Lowe
38333c8f29 YARN-3074. Nodemanager dies when localizer runner tries to write to a full disk. Contributed by Varun Saxena
(cherry picked from commit b379972ab39551d4b57436a54c0098a63742c7e1)
2015-02-11 16:34:42 +00:00
Jason Lowe
ca11ffa5de YARN-2809. Implement workaround for linux kernel panic when removing cgroup. Contributed by Nathan Roberts
(cherry picked from commit 3f5431a22fcef7e3eb9aceeefe324e5b7ac84049)
2015-02-10 17:28:18 +00:00
Jason Lowe
9ffd985dc0 YARN-3090. DeletionService can silently ignore deletion task failures. Contributed by Varun Saxena
(cherry picked from commit 4eb5f7fa32bab1b9ce3fb58eca51e2cd2e194cd5)
2015-02-10 16:55:21 +00:00
Jason Lowe
83449a4e4d YARN-3089. LinuxContainerExecutor does not handle file arguments to deleteAsUser. Contributed by Eric Payne
(cherry picked from commit 4c484320b430950ce195cfad433a97099e117bad)
2015-02-06 20:40:33 +00:00
Arun C. Murthy
92ff524182 YARN-1537. Fix race condition in TestLocalResourcesTrackerImpl.testLocalResourceCache. Contributed by Xuan Gong.
2015-02-05 23:59:34 -08:00
Xuan
c22dcdd191 YARN-3056. Add verification for containerLaunchDuration in TestNodeManagerMetrics. Contributed by zhihai Xu

(cherry picked from commit b73e776abc3e2479e397729c2bab9668f469e593)
2015-02-03 15:14:23 -08:00
Robert Kanter
410830fe8c YARN-3022. Expose Container resource information from NodeManager for monitoring (adhoot via ranter)
(cherry picked from commit f7a77819a1e4ff394e110941c1f8dd80f47dd38f)
2015-02-03 10:39:51 -08:00
Akira Ajisaka
1081b4c4a7 HADOOP-9907. Webapp http://hostname:port/metrics link is not working. (aajisaka)
(cherry picked from commit 342efa110af9989829b8a8caa699658a2ba01607)
2015-01-30 02:50:46 +09:00