Commit Graph

3826 Commits

Author SHA1 Message Date
Eric E Payne 6fce24fb40 YARN-9285: RM UI progress column is of wrong type. Contributed by Ahmed Hussein.
(cherry picked from commit b094b94d43)
2019-05-02 19:48:06 +00:00
Weiwei Yang cc0c85f04a YARN-9325. TestQueueManagementDynamicEditPolicy fails intermittent. Contributed by Prabhu Joseph.
(cherry picked from commit 1c8046d67e)
2019-04-23 14:24:15 +08:00
Eric Yang ac85aa80d9 YARN-8587. Added retries for fetching docker exit code.
Contributed by Charo Zhang

(cherry picked from commit c16c49b8c3)
2019-04-19 15:40:23 -04:00
Eric Yang 4a64dab0dd YARN-8622. Fixed container-executor compilation on MacOSX.
Contributed by Siyao Meng

(cherry picked from commit ef97a20831)
2019-04-18 19:01:11 -04:00
Eric Yang 2503409977 YARN-6695. Fixed NPE in publishing appFinished events to ATSv2.
Contributed by Prabhu Joseph

(cherry picked from commit df76cdc895)
2019-04-18 12:30:55 -04:00
Siyao Meng 742a3ad24b YARN-9487. NodeManager native build shouldn't link against librt on macOS. Contributed by Siyao Meng.
Signed-off-by: Wei-Chiu Chuang <weichiu@apache.org>
(cherry picked from commit 6e4399ea61)
2019-04-17 22:57:33 -07:00
Weiwei Yang db185de31c YARN-9463. Add queueName info when failing with queue capacity sanity check. Contributed by Aihua Xu.
(cherry picked from commit 8c1bba375b)
2019-04-10 23:02:24 +08:00
Weiwei Yang 7a80b1b481 YARN-9413. Queue resource leak after app fail for CapacityScheduler. Contributed by Tao Yang.
(cherry picked from commit ec143cbf67)
2019-04-06 20:19:03 +08:00
Eric Yang 10642a6205 YARN-9391. Fixed node manager environment leaks into Docker containers.
Contributed by Jim Brennan

(cherry picked from commit 3c45762a0b)
2019-03-25 15:54:52 -04:00
Sunil G d721634fea YARN-9138. Improve test coverage for nvidia-smi binary execution of GpuDiscoverer. Contributed by Szilard Nemeth.
(cherry picked from commit 46045c5cb3)
2019-03-06 16:01:56 +05:30
bibinchundatt 63ed16e076 Revert "YARN-8132. Final Status of applications shown as UNDEFINED in ATS app queries. Contributed by Prabhu Joseph"
This reverts commit cf1944eb6e.
2019-03-04 17:01:40 +05:30
Sunil G d045f02a8d YARN-9139. Simplify initializer code of GpuDiscoverer. Contributed by Szilard Nemeth. 2019-03-01 19:27:03 +05:30
Weiwei Yang 7575e3090d YARN-9324. TestSchedulingRequestContainerAllocation(Async) fails with junit-4.11. Contributed by Prabhu Joseph. 2019-02-28 09:32:07 +08:00
Weiwei Yang 7fa5373ec4 YARN-9248. RMContainerImpl:Invalid event: ACQUIRED at KILLED. Contributed by lujie.
(cherry picked from commit 8c30114b00)
2019-02-27 17:35:09 +08:00
Sunil G 809e3f2453 YARN-9121. Replace GpuDiscoverer.getInstance() to a readable object for easy access control. Contributed by Szilard Nemeth.
(cherry picked from commit 5e91ebd91a)
2019-02-27 12:03:58 +05:30
Sunil G a95a0cbf2f YARN-9087. Improve logging for initialization of Resource plugins. Contributed by Szilard Nemeth. 2019-02-27 11:54:43 +05:30
Weiwei Yang bdde6a612e YARN-9329. updatePriority is blocked when using FairScheduler. Contributed by Jiandan Yang.
(cherry picked from commit 3e1739d589)
2019-02-26 00:18:24 +08:00
Sunil G f282f9c362 YARN-9213. RM Web UI v1 does not show custom resource allocations for containers page. Contributed by Szilard Nemeth. 2019-02-25 11:37:42 +05:30
Weiwei Yang cdce1c17a0 YARN-9316. TestPlacementConstraintsUtil#testInterAppConstraintsByAppID fails intermittently. Contributed by Prabhu Joseph.
(cherry picked from commit 9cd5c5447f)
2019-02-24 22:48:55 +08:00
Weiwei Yang 604a915bab YARN-9238. Avoid allocating opportunistic containers to previous/removed/non-exist application attempt. Contributed by lujie.
(cherry picked from commit 9c88695bcd)
2019-02-24 22:21:53 +08:00
bibinchundatt 3e1bd53a37 YARN-9317. Avoid repeated YarnConfiguration#timelineServiceV2Enabled check. Contributed by Prabhu Joseph 2019-02-23 07:59:51 +05:30
bibinchundatt cf1944eb6e YARN-8132. Final Status of applications shown as UNDEFINED in ATS app queries. Contributed by Prabhu Joseph 2019-02-22 20:51:47 +05:30
Sunil G d75aa33612 YARN-9118. Handle exceptions with parsing user defined GPU devices in GpuDiscoverer. Contributed by Szilard Nemeth.
(cherry picked from commit 95fbbfed75)
2019-02-22 20:23:24 +05:30
Weiwei Yang c2ef443359 YARN-9315. TestCapacitySchedulerMetrics fails intermittently. Contributed by Prabhu Joseph. 2019-02-21 18:06:26 +08:00
bibinchundatt e6f2b8730f YARN-9286. [Timeline Server] Sorting based on FinalStatus shows pop-up message. Contributed by Bilwa S T.
(cherry picked from commit b8de78c570)
2019-02-20 01:20:15 +05:30
Adam Antal 830aaac023
YARN-9283. Javadoc of LinuxContainerExecutor#addSchedPriorityCommand has a wrong property name as reference
Signed-off-by: Akira Ajisaka <aajisaka@apache.org>
(cherry picked from commit 9385ec45d7)
2019-02-15 18:48:21 +09:00
Weiwei Yang fbd03543d8 YARN-8555. Parameterize TestSchedulingRequestContainerAllocation(Async) to cover both PC handler options. Contributed by Prabhu Joseph.
(cherry picked from commit 0a1637c750)
2019-02-11 15:56:34 +08:00
Masatake Iwasaki 6229469574 YARN-9282. Typo in javadoc of class LinuxContainerExecutor: hadoop.security.authetication should be 'authentication'. Contributed by Charan Hebri.
(cherry picked from commit e0ab1bdece)
2019-02-09 00:28:59 +09:00
Eric E Payne 55dde827e6 YARN-7171: RM UI should sort memory / cores numerically. Contributed by Ahmed Hussein
(cherry picked from commit d1ca9432dd)
2019-02-07 16:47:15 +00:00
Vinayakumar B e2b91b2ccb YARN-8498. Yarn NodeManager OOM Listener Fails Compilation on Ubuntu 18.04. Contributed by Ayush Saxena. 2019-02-07 13:03:42 +05:30
Weiwei Yang b64e9df949 YARN-9262. TestRMAppAttemptTransitions is failing with an NPE. Contributed by lujie.
(cherry picked from commit 28ad20a711)
2019-02-04 14:00:30 +05:30
Sunil G 99876a5ab8 YARN-9206. RMServerUtils does not count SHUTDOWN as an accepted state. Contributed by Kuhu Shukla.
(cherry picked from commit 604b2489a9)
2019-02-04 12:49:06 +05:30
Weiwei Yang a0fafbc3ef YARN-9263. TestConfigurationNodeAttributesProvider fails after Mockito updated. Contributed by Weiwei Yang.
(cherry picked from commit f20b043a02)
2019-02-04 12:45:40 +05:30
Sunil G 0e7060a1d5 YARN-9099. GpuResourceAllocator#getReleasingGpus calculates number of GPUs in a wrong way. Contributed by Szilard Nemeth.
(cherry picked from commit 71c49fa60f)
2019-01-31 09:26:07 +05:30
Eric E Payne 4052b7ee60 YARN-6616: YARN AHS shows submitTime for jobs same as startTime. Contributed by Prabhu Joseph
(cherry picked from commit 04105bbfdb)
2019-01-29 17:52:54 +00:00
Weiwei Yang 6b8dd8d113 YARN-9237. NM should ignore sending finished apps to RM during RM fail-over. Contributed by Jiandan Yang.
(cherry picked from commit 4f63ffe444)
2019-01-29 10:42:09 +08:00
Jonathan Hung bf760e7e81 YARN-9222. Print launchTime in ApplicationSummary
(cherry picked from commit 6cace58e21)
2019-01-25 13:23:37 -08:00
Weiwei Yang bc6374f282 YARN-9205. When using custom resource type, application will fail to run due to the CapacityScheduler throws InvalidResourceRequestException(GREATER_THEN_MAX_ALLOCATION). Contributed by Zhankun Tang. 2019-01-23 18:10:28 +08:00
Weiwei Yang 8ad7711605 YARN-8101. Add UT to verify node-attributes in RM nodes rest API. Contributed by Prabhu Joseph.
(cherry picked from commit 721d5c2a5f)
2019-01-23 18:07:45 +08:00
Weiwei Yang 9114489566 YARN-9210. RM nodes web page can not display node info. Contributed by Jiandan Yang.
(cherry picked from commit d43df31751)
2019-01-22 10:46:37 +08:00
Weiwei Yang ac2f4b64f9 YARN-9204. RM fails to start if absolute resource is specified for partition capacity in CS queues. Contributed by Jiandan Yang.
(cherry picked from commit abde1e1f58)
2019-01-21 21:20:01 +08:00
Wangda Tan fe7cb2d84a YARN-9194. Invalid event: REGISTERED and LAUNCH_FAILED at FAILED, and NullPointerException happens in RM while shutdown a NM. (lujie via wangda)
Change-Id: I4359f59a73a278a941f4bb9d106dd38c9cb471fe
(cherry picked from commit 6d7eedfd28)
2019-01-17 15:13:42 -08:00
Wangda Tan 1dc2b49bfd YARN-8822. Nvidia-docker v2 support for YARN GPU feature. (Charo Zhang via wangda)
Change-Id: Ib8044307a4241f6b1b7b9b8266b9256f39b16384
2019-01-07 12:21:33 -08:00
Weiwei Yang 2b549e32e1 YARN-9173. FairShare calculation broken for large values after YARN-8833. Contributed by Wilfred Spiegelenburg.
(cherry picked from commit 944cf87223)
2019-01-07 16:05:57 +08:00
Weiwei Yang a24cca11f2 YARN-9164. Shutdown NM may cause NPE when opportunistic container scheduling is enabled. Contributed by lujie.
(cherry picked from commit cfe89e6f96)
2019-01-04 01:04:39 +08:00
Weiwei Yang 7deef08eb8 YARN-8925. Updating distributed node attributes only when necessary. Contributed by Tao Yang. 2018-12-21 16:31:03 +08:00
Eric Yang 29c9c8a893 YARN-9126. Fix container clean up for reinitialization.
Contributed by Chandni Singh

(cherry picked from commit e815fd9c49)
2018-12-19 14:58:19 -05:00
Eric Yang 28ca14e71b YARN-9040. Fixed memory leak in LevelDBCacheTimelineStore and DBIterator.
Contributed by Tarun Parimi

(cherry picked from commit 71e0b0d800)
2018-12-17 12:08:09 -05:00
Eric Yang 52aafb9789 YARN-9125. Fixed Carriage Return detection in Docker container launch command.
Contributed by Billie Rinaldi

(cherry picked from commit b2d7204ed0)
2018-12-14 17:55:10 -05:00
Weiwei Yang 2b3c3d2a32 YARN-9009. Fix flaky test TestEntityGroupFSTimelineStore.testCleanLogs. Contributed by OrDTesters.
(cherry picked from commit 1c09a10e96)
2018-12-10 12:07:23 +08:00
Jonathan Hung 3ab6ea7aca YARN-9085. Add Guaranteed and MaxCapacity to CSQueueMetrics
(cherry picked from commit 978ab3e958227220cb6f1a08ae6e7cdb8a46628b)
2018-12-07 10:45:47 -08:00
Eric Yang 8c70728f7f YARN-9071. Improved status update for reinitialized containers.
Contributed by Chandni Singh

(cherry picked from commit 1b790f4dd1)
2018-12-05 19:04:55 -05:00
Jonathan Hung 6b01e4d2a8 YARN-9036. Escape newlines in health report in YARN UI. Contributed by Keqiu Hu
(cherry picked from commit 1c8bd7128c99d8215ef16438bd2ce6b1f025a966)
2018-11-30 10:16:00 -08:00
bibinchundatt 183ec39c4b YARN-9069. Fix SchedulerInfo#getSchedulerType for custom schedulers. Contributed by Bilwa S T.
(cherry picked from commit 07142f54a8)
2018-11-29 22:16:32 +05:30
Jason Lowe df0e7766e4 YARN-8812. Containers fail during creating a symlink which started with hyphen for a resource file. Contributed by Oleksandr Shevchenko
(cherry picked from commit 3ce99e32f7)
2018-11-28 08:50:18 -06:00
Eric Yang 838190482d YARN-8986. Added port publish for Docker container running with bridge.
Contributed by Charo Zhang
2018-11-27 14:27:13 -05:00
Weiwei Yang 650581a19d YARN-8833. Avoid potential integer overflow when computing fair shares. Contributed by liyakun.
(cherry picked from commit d027a24f03)
2018-11-18 23:23:09 +08:00
Rohith Sharma K S 13e3670e7f YARN-8303. YarnClient should contact TimelineReader for application/attempt/container report.
(cherry picked from commit ee3355be3c)
2018-11-16 18:37:20 +05:30
Weiwei Yang 2415f8a5be YARN-8987. Usability improvements node-attributes CLI. Contributed by Bibin A Chundatt.
(cherry picked from commit c741109522)
2018-11-12 18:21:11 +08:00
Weiwei Yang b3321d9933 YARN-8988. Reduce the verbose log on RM heartbeat path when distributed node-attributes is enabled. Contributed by Tao Yang.
(cherry picked from commit e1bbf7dcdf)
2018-11-08 17:50:16 +08:00
Weiwei Yang b10ec0aa14 YARN-8977. Remove unnecessary type casting when calling AbstractYarnScheduler#getSchedulerNode. Contributed by Wanqiang Ji.
(cherry picked from commit c96cbe8659)
2018-11-07 22:44:01 +08:00
Akira Ajisaka 41c4e9583d
YARN-8233. NPE in CapacityScheduler#tryCommit when handling allocate/reserve proposal whose allocatedOrReservedContainer is null. Contributed by Tao Yang.
(cherry picked from commit 951c98f890)
2018-11-07 11:18:48 +09:00
Jason Lowe 9265934201 YARN-8865. RMStateStore contains large number of expired RMDelegationToken. Contributed by Wilfred Spiegelenburg
(cherry picked from commit ab6aa4c726)
2018-11-06 08:47:30 -06:00
Weiwei Yang f00125e2d9 YARN-8970. Improve the debug message in CS#allocateContainerOnSingleNode. Contributed by Zhankun Tang.
(cherry picked from commit 5d6554c722)
2018-11-06 14:52:10 +08:00
Weiwei Yang b474239e0b YARN-8969. AbstractYarnScheduler#getNodeTracker should return generic type to avoid type casting. Contributed by Wanqiang Ji.
(cherry picked from commit c7fcca0d7e)
2018-11-06 13:17:19 +08:00
Jonathan Hung b9a3c988c0 YARN-7225. Add queue and partition info to RM audit log. Contributed by Eric Payne
(cherry picked from commit 2ab611d48b)
2018-11-01 14:31:09 -07:00
Rohith Sharma K S 7920f79049 YARN-8950. Fix compilation issue due to dependency convergence error for hbase.profile=2.0.
(cherry picked from commit 4ec4ec6971)
2018-10-30 11:50:33 +05:30
Weiwei Yang 5db4272d91 YARN-8944. TestContainerAllocation.testUserLimitAllocationMultipleContainers failure after YARN-8896. Contributed by Wilfred Spiegelenburg.
(cherry picked from commit 1d90a0dd23)
2018-10-29 11:55:22 +08:00
Jason Lowe 7097755925 YARN-8904. TestRMDelegationTokens can fail in testRMDTMasterKeyStateOnRollingMasterKey. Contributed by Wilfred Spiegelenburg
(cherry picked from commit 93fb3b4b9c)
2018-10-23 12:54:04 -05:00
Rohith Sharma K S 660fff3138 YARN-8826. Fix lingering timeline collector after serviceStop in TimelineCollectorManager. Contributed by Prabha Manepalli.
(cherry picked from commit 0b62983c5a)
2018-10-23 14:06:13 +05:30
Sunil G 30998fea28 YARN-8868. Set HTTPOnly attribute to Cookie. Contributed by Chandni Singh.
(cherry picked from commit 2202e00ba8)
2018-10-23 09:57:26 +05:30
Eric Yang 04600cf5da YARN-8910. Fixed misleading log statement when container max retries is infinite.
Contributed by Chandni Singh

(cherry picked from commit 47ad98b2e1)
2018-10-19 13:49:51 -04:00
Wangda Tan 9ed9e185d7 YARN-8916. Define a constant docker string in ContainerRuntimeConstants.java for better maintainability. (Zhankun Tang via wangda)
Change-Id: I1349e740037f81afdbe30edbe741f20e88fd0a90
(cherry picked from commit 5e02b4915b)
2018-10-19 09:50:18 -07:00
Weiwei Yang 042f2df19b YARN-8907. Fix incorrect logging message in TestCapacityScheduler. Contributed by Zhankun Tang.
(cherry picked from commit 13cc0f50ea)
2018-10-19 10:01:45 +08:00
Wangda Tan 7252f8e117 YARN-8896. Limit the maximum number of container assignments per heartbeat. (Zhankun Tang via wangda)
Change-Id: I6e72f8362bd7f5c2a844cb9e3c4732492314e9f1
(cherry picked from commit 780be14f07)
2018-10-18 12:12:19 -07:00
Wangda Tan 5f7ed043c8 YARN-8456. Fix a configuration handling bug when user leave FPGA discover executable path configuration default but set OpenCL SDK path environment variable. (Zhankun Tang via wangda)
Change-Id: Iff150ea98ba0c60d448474fd940eb121afce6965
(cherry picked from commit a457a8951a)
2018-10-18 12:12:14 -07:00
Sunil G bde4fd5ed9 Preparing for 3.2.0 release 2018-10-18 17:07:45 +05:30
Sunil G 6380ee5512 YARN-8759. Copy of resource-types.xml is not deleted if test fails, causes other test failures. Contributed by Antal Bálint Steinbach.
(cherry picked from commit 5085e5fa9e)
2018-10-17 16:06:07 +05:30
Vrushali C 62d329cac0 YARN-5742 Serve aggregated logs of historical apps from timeline service. Contributed by Rohith Sharma KS
(cherry picked from commit 8d1981806f)
2018-10-12 07:18:13 +05:30
Jason Lowe cdbca8b133 YARN-8861. executorLock is misleading in ContainerLaunch. Contributed by Chandni Singh
(cherry picked from commit e787d65a08)
2018-10-11 10:58:48 -05:00
Jason Lowe 145c7aa663 YARN-7644. NM gets backed up deleting docker containers. Contributed by Chandni Singh
(cherry picked from commit 5ce70e1211)
2018-10-10 10:01:52 -05:00
Weiwei Yang eb0147a4c7 YARN-8858. CapacityScheduler should respect maximum node resource when per-queue maximum-allocation is being used. Contributed by Wangda Tan.
(cherry picked from commit edce866489)
2018-10-10 09:42:45 +08:00
Wangda Tan b3ac886933 YARN-8844. TestNMProxy unit test is failing. (Eric Yang via wangda)
Change-Id: I241fa8701b6f1dbcad87fd2e9a429e32e7aa40f5
2018-10-04 10:48:47 -07:00
Shane Kumpf 5edb9d3b97 YARN-8785. Improve the error message when a bind mount is not whitelisted. Contributed by Simon Prewo 2018-10-02 07:16:29 -06:00
Haibo Chen d0ee6fbe28 YARN-8621. Add test coverage of custom Resource Types for the apps/<appId> REST API endpoint. (Contributed by Szilard Nemeth) 2018-10-01 14:46:42 -07:00
Giovanni Matteo Fumarola 59d5af21b7 YARN-8760. [AMRMProxy] Fix concurrent re-register due to YarnRM failover in AMRMClientRelayer. Contributed by Botong Huang. 2018-10-01 13:12:38 -07:00
Weiwei Yang fd6be5898a YARN-8468. Enable the use of queue based maximum container allocation limit and implement it in FairScheduler. Contributed by Antal Bálint Steinbach. 2018-09-29 17:47:12 +08:00
Eric E Payne 8598b498bc YARN-8774. Memory leak when CapacityScheduler allocates from reserved container with non-default label. Contributed by Tao Yang. 2018-09-28 15:32:07 +00:00
bibinchundatt 7093afd874 YARN-8829. Cluster metrics can fail with IndexOutOfBoundsException. Contributed by Akshay Agarwal. 2018-09-28 12:35:33 +05:30
Vrushali C 90e2e493b3 YARN-8270 Adding JMX Metrics for Timeline Collector and Reader. Contributed by Sushil Ks. 2018-09-27 15:53:39 -07:00
Eric Yang b237a0dd44 YARN-6456. Added config to set default container runtimes.
Contributed by Craig Condit
2018-09-27 15:31:18 -04:00
Jason Lowe 6b988d821e YARN-8804. resourceLimits may be wrongly calculated when leaf-queue is blocked in cluster with 3+ level queues. Contributed by Tao Yang 2018-09-26 14:43:00 -07:00
Eric Yang 913f87dada YARN-8665. Added Yarn service cancel upgrade option.
Contributed by Chandni Singh
2018-09-26 14:51:35 -04:00
Rohith Sharma K S e5287a4fe0 YARN-8824. App Nodelabel missed after RM restart for finished apps. Contributed by Bibin A Chundatt. 2018-09-26 12:30:26 +05:30
Akira Ajisaka 44edcdfd6a
YARN-8745. Misplaced the TestRMWebServicesFairScheduler.java file. Contributed by Y. SREENIVASULU REDDY. 2018-09-26 10:09:11 +09:00
Rohith Sharma K S 50bc7746d7 YARN-8815. RM fails to recover finished unmanaged AM. Contributed by Bibin A Chundatt. 2018-09-25 11:31:14 +05:30
Haibo Chen 29dad7d258 YARN-8616. systemClock should be used in RMAppImpl instead of System.currentTimeMills(), to be consistent. (Contributed by Szilard Nemeth) 2018-09-24 16:04:28 -07:00
Giovanni Matteo Fumarola 3090922805 YARN-8696. [AMRMProxy] FederationInterceptor upgrade: home sub-cluster heartbeat async. Contributed by Botong Huang. 2018-09-24 11:37:05 -07:00
Eric Yang aa4bd493c3 YARN-8801. Fixed header comments for docker utility functions.
Contributed by Zian Chen
2018-09-20 13:08:59 -04:00
Jason Lowe 6b5838ed32 YARN-8784. DockerLinuxContainerRuntime prevents access to distributed cache entries on a full disk. Contributed by Eric Badger 2018-09-19 16:44:51 -05:00
Eric Yang efdea85ad1 YARN-8791. Trim docker inspect output for line feed for STOPSIGNAL parsing.
Contributed by Chandni Singh
2018-09-19 13:16:11 -04:00
Weiwei Yang 0712537e79 YARN-8771. CapacityScheduler fails to unreserve when cluster resource contains empty resource type. Contributed by Tao Yang. 2018-09-19 19:31:07 +08:00
Jason Lowe 2df0a8dcb3 YARN-8648. Container cgroups are leaked when using docker. Contributed by Jim Brennan 2018-09-18 15:36:45 -05:00
Shane Kumpf 144a55f0e3 YARN-8045. Reduce log output from container status calls. Contributed by Craig Condit 2018-09-14 10:41:55 -06:00
Shane Kumpf 78902f0250 YARN-8748. Javadoc warnings within the nodemanager package. Contributed by Craig Condit 2018-09-14 10:28:36 -06:00
Eric Yang 99237607bf YARN-8706. Allow additional flag in docker inspect call.
Contributed by Chandni Singh
2018-09-14 11:46:59 -04:00
Weiwei Yang f1a893fdbc YARN-8720. CapacityScheduler does not enforce max resource allocation check at queue level. Contributed by Tarun Parimi. 2018-09-14 16:33:51 +08:00
Jason Lowe 250b50018e YARN-8680. YARN NM: Implement Iterable Abstraction for LocalResourceTracker state. Contributed by Pradeep Ambati 2018-09-13 13:28:54 -05:00
Weiwei Yang 39c1ea1ed4 YARN-8729. Node status updater thread could be lost after it is restarted. Contributed by Tao Yang. 2018-09-13 22:21:35 +08:00
Sunil G f4bda5e8e9 YARN-8630. ATSv2 REST APIs should honor filter-entity-list-by-user in non-secure cluster when ACls are enabled. Contributed by Rohith Sharma K S. 2018-09-13 17:47:21 +05:30
Shane Kumpf 8e9afbfb66 YARN-8768. Javadoc error in node attributes. Contributed by Sunil Govindan. 2018-09-12 15:12:28 -06:00
Giovanni Matteo Fumarola 02b9bfdf9e YARN-8658. [AMRMProxy] Metrics for AMRMClientRelayer inside FederationInterceptor. Contributed by Young Chen. 2018-09-12 11:46:35 -07:00
Sunil G 5e64e62dee YARN-8740. Clear node attribute path after each test run. Contributed by Bibin A Chundatt. 2018-09-12 16:01:01 +05:30
bibinchundatt c44088ac19 YARN-8739. Fix jenkins issues for Node Attributes branch. Contributed by Sunil Govindan. 2018-09-12 16:01:01 +05:30
Weiwei Yang 52194351e7 YARN-8721. Relax NE node-attribute check when attribute doesn't exist on a node. Contributed by Sunil Govindan. 2018-09-12 16:01:01 +05:30
Naganarasimha 67ae81f0e0 YARN-7863. Modify placement constraints to support node attributes. Contributed by Sunil Govindan. 2018-09-12 16:01:01 +05:30
Naganarasimha eb08543c7a YARN-8103. Add CLI interface to query node attributes. Contributed by Bibin A Chundatt. 2018-09-12 16:01:01 +05:30
Sunil G 76183428b7 YARN-8351. Node attribute manager logs are flooding RM logs. Contributed by Weiwei Yang. 2018-09-12 16:01:00 +05:30
bibinchundatt 8cf6a9a2bd YARN-7892. Revisit NodeAttribute class structure. Contributed by Naganarasimha G R. 2018-09-12 16:01:00 +05:30
Naganarasimha 5dc7d6e0f3 YARN-8104. Add API to fetch node to attribute mapping. Contributed by Bibin A Chundatt. 2018-09-12 16:01:00 +05:30
Naganarasimha 0a01b1350d YARN-8100. Support API interface to query cluster attributes and attribute to nodes. Contributed by Bibin A Chundatt. 2018-09-12 16:01:00 +05:30
Sunil G b9890d1f66 YARN-7875. Node Attribute store for storing and recovering attributes. Contributed by Bibin A Chundatt. 2018-09-12 16:01:00 +05:30
bibinchundatt a6590c1f1f YARN-8117. Fix TestRMWebServicesNodes test failure. Contributed by Bibin A Chundatt. 2018-09-12 16:01:00 +05:30
bibinchundatt 901e85238d YARN-8033. CLI Integration with NodeAttributesManagerImpl. Contributed by Naganarasimha G R. 2018-09-12 16:01:00 +05:30
Sunil G 89b3ebd11e YARN-8092. Expose Node Attributes info via RM nodes REST API. Contributed by Weiwei Yang. 2018-09-12 16:01:00 +05:30
Sunil G 440ff7f563 YARN-8094. Support configuration based Node Attribute provider. Contributed by Weiwei Yang. 2018-09-12 16:01:00 +05:30
Sunil G 6f4bc49c6d YARN-7988. Refactor FSNodeLabelStore code for Node Attributes store support. Contributed by Bibin A Chundatt. 2018-09-12 16:01:00 +05:30
Naganarasimha 3b3b6efe21 YARN-7871. Node attributes reporting from NM to RM. Contributed by Weiwei Yang. 2018-09-12 16:01:00 +05:30
Naganarasimha 86d024ef2a YARN-7965. NodeAttributeManager add/get API is not working properly. Contributed by Weiwei Yang. 2018-09-12 16:01:00 +05:30
Sunil G ffcabd24c3 YARN-7856. Validate Node Attributes from NM. Contributed by Weiwei Yang. 2018-09-12 16:01:00 +05:30
Sunil G 2f7712be09 YARN-6858. Attribute Manager to store and provide node attributes in RM. Contributed by Naganarasimha G R. 2018-09-12 16:01:00 +05:30
Naganarasimha d312b5cf9f YARN-7757. Refactor NodeLabelsProvider to be more generic and reusable for node attributes providers. Contributed by Weiwei Yang. 2018-09-12 16:01:00 +05:30
Weiwei Yang d9d93e3925 YARN-7842. PB changes to carry node-attributes in NM heartbeat. Contributed by Weiwei Yang. 2018-09-12 16:00:59 +05:30
Naganarasimha 1f42ce907a YARN-6855. [YARN-3409] CLI Proto Modifications to support Node Attributes. Contributed by Naganarasimha G R. 2018-09-12 16:00:59 +05:30
Eric E Payne 987d8191ad YARN-8709: CS preemption monitor always fails since one under-served queue was deleted. Contributed by Tao Yang. 2018-09-10 19:55:20 +00:00
Eric Yang bf8a1750e9 YARN-8706. Updated docker container stop logic to avoid double kill.
Contributed by Chandni Singh
2018-09-07 20:18:09 -04:00
Eric Yang 7d62334387 YARN-8751. Reduce conditions that mark node manager as unhealthy.
Contributed by Craig Condit
2018-09-07 19:46:15 -04:00
Giovanni Matteo Fumarola 3dc2988a37 YARN-8699. Add Yarnclient#yarnclusterMetrics API implementation in router. Contributed by Bibin A Chundatt. 2018-09-07 11:32:03 -07:00
Giovanni Matteo Fumarola 9af96d4ed4 HADOOP-15707. Add IsActiveServlet to be used for Load Balancers. Contributed by Lukas Majercak. 2018-09-05 10:50:25 -07:00
Shane Kumpf dffb7bfe6c YARN-8638. Allow linux container runtimes to be pluggable. Contributed by Craig Condit 2018-09-05 06:47:54 -06:00
bibinchundatt eed8415dc1 YARN-8535. Fix DistributedShell unit tests. Contributed by Abhishek Modi. 2018-09-02 13:35:52 +05:30
Shane Kumpf 73625168c0 YARN-8642. Add support for tmpfs mounts with the Docker runtime. Contributed by Craig Condit 2018-08-29 07:08:37 -06:00
Weiwei Yang 3fa4639421 YARN-8723. Fix a typo in CS init error message when resource calculator is not correctly set. Contributed by Abhishek Modi. 2018-08-29 11:13:44 +08:00
Giovanni Matteo Fumarola 7ed458b255 YARN-8697. LocalityMulticastAMRMProxyPolicy should fallback to random sub-cluster when cannot resolve resource. Contributed by Botong Huang. 2018-08-28 16:01:35 -07:00
Giovanni Matteo Fumarola 602d13844a HADOOP-15699. Fix some of testContainerManager failures in Windows. Contributed by Botong Huang. 2018-08-27 12:25:46 -07:00
Billie Rinaldi 05b2bbeb35 YARN-8675. Remove default hostname for docker containers when net=host. Contributed by Suma Shivaprasad 2018-08-27 11:34:33 -07:00
Giovanni Matteo Fumarola f152582562 YARN-8705. Refactor the UAM heartbeat thread in preparation for YARN-8696. Contributed by Botong Huang. 2018-08-27 10:32:22 -07:00
Jason Lowe 585ebd873a YARN-8649. NPE in localizer hearbeat processing if a container is killed while localizing. Contributed by lujie 2018-08-23 09:29:46 -05:00
Sunil G 1ac01444a2 YARN-8015. Support all types of placement constraint support for Capacity Scheduler. Contributed by Weiwei Yang. 2018-08-23 10:05:43 +05:30
Weiwei Yang 9c3fc3ef28 YARN-7494. Add muti-node lookup mechanism and pluggable nodes sorting policies to optimize placement decision. Contributed by Sunil Govindan. 2018-08-21 22:42:28 +08:00
Weiwei Yang 54d0bf8935 YARN-8683. Support to display pending scheduling requests in RM app attempt page. Contributed by Tao Yang. 2018-08-21 19:00:31 +08:00
Rohith Sharma K S d3fef7a5c5 YARN-8129. Improve error message for invalid value in fields attribute. Contributed by Abhishek Modi. 2018-08-21 11:58:07 +05:30
Giovanni Matteo Fumarola e0f6ffdbad YARN-8581. [AMRMProxy] Add sub-cluster timeout in LocalityMulticastAMRMProxyPolicy. Contributed by Botong Huang. 2018-08-20 14:33:16 -07:00
Giovanni Matteo Fumarola 8736fc39ac YARN-8673. [AMRMProxy] More robust responseId resync after an YarnRM master slave switch. Contributed by Botong Huang. 2018-08-20 12:22:36 -07:00
Jason Lowe 65e7469712 YARN-8242. YARN NM: OOM error while reading back the state store on recovery. Contributed by Pradeep Ambati and Kanwaljeet Sachdev 2018-08-20 10:14:40 -05:00
Rohith Sharma K S 4aacbfff60 YARN-8679. [ATSv2] If HBase cluster is down for long time, high chances that NM ContainerManager dispatcher get blocked. Contributed by Wangda Tan. 2018-08-18 10:26:55 +05:30
Eric Yang d42806160e YARN-8667. Cleanup symlinks when container restarted by NM.
Contributed by Chandni Singh
2018-08-16 18:41:58 -04:00
Jason Lowe cb21eaa026 YARN-8656. container-executor should not write cgroup tasks files for docker containers. Contributed by Jim Brennan 2018-08-16 10:06:17 -05:00
Wei-Chiu Chuang bdd0e01ed9 HADOOP-14212. Addendum patch: Expose SecurityEnabled boolean field in JMX for other services besides NameNode. Contributed by Adam Antal. 2018-08-14 18:25:38 -07:00
Wei-Chiu Chuang 363bd16e31 HADOOP-14212. Expose SecurityEnabled boolean field in JMX for other services besides NameNode. Contributed by Adam Antal. 2018-08-14 17:19:00 -07:00
Eric Yang 4cba0741d5 YARN-8160. Support upgrade of service that use docker containers.
Contributed by Chandni Singh
2018-08-14 11:36:26 -04:00
Jason Lowe d1d129aa9d YARN-8640. Restore previous state in container-executor after failure. Contributed by Jim Brennan 2018-08-14 10:21:03 -05:00
Weiwei Yang 0a71bf1452 YARN-8575. Avoid committing allocation proposal to unavailable nodes in async scheduling. Contributed by Tao Yang. 2018-08-10 14:37:45 +08:00
Weiwei Yang 08d5060605 YARN-8521. NPE in AllocationTagsManager when a container is removed more than once. Contributed by Weiwei Yang. 2018-08-10 08:32:02 +08:00
Robert Kanter b2517dd66b YARN-4946. RM should not consider an application as COMPLETED when log aggregation is not in a terminal state (snemeth via rkanter) 2018-08-09 14:58:04 -07:00
Wangda Tan 344c335a92 YARN-8588. Logging improvements for better debuggability. (Suma Shivaprasad via wangda)
Change-Id: I66aa4b0ec031ae5ce0fae558e2f8cbcbbfebc442
2018-08-09 11:04:02 -07:00
Weiwei Yang d352f167eb YARN-8559. Expose mutable-conf scheduler's configuration in RM /scheduler-conf endpoint. Contributed by Weiwei Yang. 2018-08-09 23:46:53 +08:00
Jason Lowe cd04e954d2 YARN-8331. Race condition in NM container launched after done. Contributed by Pradeep Ambati 2018-08-09 10:17:34 -05:00
Giovanni Matteo Fumarola d838179d8d YARN-8626. Create HomePolicyManager that sends all the requests to the home subcluster. Contributed by Inigo Goiri. 2018-08-07 15:33:16 -07:00
Wangda Tan d4258fcad7 YARN-8629. Container cleanup fails while trying to delete Cgroups. (Suma Shivaprasad via wangda)
Change-Id: I392ef4f8baa84d5d7b1f2e438c560b5426b6d4f2
2018-08-07 12:36:55 -07:00
Haibo Chen 889df6f194 YARN-7948. Enable fair scheduler to refresh maximum allocation for multiple resource types. (Szilard Nemeth via Haibo Chen) 2018-08-02 10:10:12 -07:00
Sunil G 12a095a496 YARN-7159. Normalize unit of resource objects in RM to avoid unit conversion in critical path. Contributed by Manikandan R. 2018-08-02 22:29:21 +05:30
Jason Lowe 7526815e32 YARN-8263. DockerClient still touches hadoop.tmp.dir. Contributed by Craig Condit 2018-08-02 10:43:48 -05:00
Sunil G 735b492556 YARN-8593. Add RM web service endpoint to get user information. Contributed by Akhil PB. 2018-08-02 08:34:09 +05:30
Billie Rinaldi 67c65da261 YARN-8403. Change the log level for fail to download resource from INFO to ERROR. Contributed by Eric Yang 2018-08-01 08:51:40 -07:00
Sunil G a48a0cc7fd YARN-8606. Opportunistic scheduling does not work post RM failover. Contributed by Bibin A Chundatt. 2018-08-01 12:17:18 +05:30
Sunil G 6310c0d17d YARN-8397. Potential thread leak in ActivitiesManager. Contributed by Rohith Sharma K S. 2018-08-01 08:33:30 +05:30
Eric Yang c7ebcd76bf YARN-8579. Recover NMToken of previous attempted component data.
Contributed by Gour Saha
2018-07-31 18:01:02 -04:00
Wangda Tan 4b540bbfcf YARN-8418. App local logs could leaked if log aggregation fails to initialize for the app. (Bibin A Chundatt via wangda)
Change-Id: I29a23ca4b219b48c92e7975cd44cddb8b0e04104
2018-07-31 12:08:00 -07:00
Haibo Chen 8aa93a575e YARN-8605. TestDominantResourceFairnessPolicy.testModWhileSorting is flaky. (Wilfred Spiegelenburg via Haibo Chen) 2018-07-31 11:32:40 -07:00
Jonathan Hung 3e06a5dcea YARN-7974. Allow updating application tracking url after registration. Contributed by Jonathan Hung 2018-07-30 17:44:18 -07:00
bibinchundatt 2b39ad2698 YARN-8584. Several typos in Log Aggregation related classes. Contributed by Szilard Nemeth. 2018-07-30 23:25:19 +05:30
Sunil G 63e08ec071 YARN-8591. [ATSv2] NPE while checking for entity acl in non-secure cluster. Contributed by Rohith Sharma K S. 2018-07-30 14:48:04 +05:30
bibinchundatt 3d586841ab YARN-8558. NM recovery level db not cleaned up properly on container finish. Contributed by Bibin A Chundatt. 2018-07-28 20:52:39 +05:30
Eric Yang ed9d60e888 YARN-8508. Release GPU resource for killed container.
Contributed by Chandni Singh
2018-07-27 19:33:58 -04:00
Inigo Goiri 79091cf76f YARN-8596. Allow SQLFederationStateStore to submit the same app in the same subcluster. Contributed by Giovanni Matteo Fumarola. 2018-07-27 15:23:57 -07:00
Robert Kanter 2cccf4061c YARN-8517. getContainer and getContainers ResourceManager REST API methods are not documented (snemeth via rkanter) 2018-07-27 14:35:03 -07:00
Eric Yang f93ecf5c1e YARN-8330. Improved publishing ALLOCATED events to ATS.
Contributed by Suma Shivaprasad
2018-07-25 18:49:30 -04:00
Eric E Payne 9485c9aee6 YARN-4606. CapacityScheduler: applications could get starved because computation of #activeUsers considers pending apps. Contributed by Manikandan R 2018-07-25 16:22:04 +00:00
Weiwei Yang 5be9f4a5d0 YARN-8546. Resource leak caused by a reserved container being released more than once under async scheduling. Contributed by Tao Yang. 2018-07-25 17:35:27 +08:00
Haibo Chen ea2c6c8c9a YARN-7133. Clean up lock-try order in fair scheduler. (Szilard Nemeth via Haibo Chen) 2018-07-24 12:46:59 -07:00
bibinchundatt e673dd1d4d YARN-8541. RM startup failure on recovery after user deletion. Contributed by Bibin A Chundatt. 2018-07-24 23:26:59 +05:30
Sunil G 35ce6eb1f5 YARN-7748. TestContainerResizing.testIncreaseContainerUnreservedWhenApplicationCompleted fails due to multiple container fail events. Contributed by Weiwei Yang. 2018-07-24 22:20:17 +05:30
bibinchundatt ff7c2eda34 YARN-8548. AllocationRespose proto setNMToken initBuilder not done. Contributed by Bilwa S T. 2018-07-24 16:17:20 +05:30
bibinchundatt 8461278833 YARN-8544. [DS] AM registration fails when hadoop authorization is enabled. Contributed by Bibin A Chundatt. 2018-07-24 13:09:17 +05:30
Eric Yang 8688a0c7f8 YARN-8380. Support bind propagation options for mounts in docker runtime.
Contributed by Billie Rinaldi
2018-07-23 20:12:04 -04:00
Haibo Chen 9d3c39e9dd YARN-6966. NodeManager metrics may return wrong negative values when NM restart. (Szilard Nemeth via Haibo Chen) 2018-07-23 11:07:24 -07:00
Haibo Chen 8a6bb8409c YARN-6964. Fair scheduler misuses Resources operations. (Daniel Templeton and Szilard Nemeth via Haibo Chen) 2018-07-20 10:46:28 -07:00
Weiwei Yang cbf2026483 YARN-8528. Final states in ContainerAllocation might be modified externally causing unexpected allocation results. Contributed by Xintong Song. 2018-07-20 22:34:06 +08:00
Haibo Chen e6873dfde0 YARN-7300. DiskValidator is not used in LocalDirAllocator. (Szilard Nemeth via Haibo Chen) 2018-07-19 16:27:11 -07:00
Haibo Chen 2564884757 YARN-8436. FSParentQueue: Comparison method violates its general contract. (Wilfred Spiegelenburg via Haibo Chen) 2018-07-19 13:22:31 -07:00
Eric Yang 5836e0a46b YARN-8501. Reduce complexity of RMWebServices getApps method.
Contributed by Szilard Nemeth
2018-07-19 12:30:38 -04:00
Eric Yang efb4e274e5 YARN-8538. Fixed memory leaks in container-executor and test cases.
Contributed by Billie Rinaldi
2018-07-16 17:38:49 -04:00
Wangda Tan 238ffff999 YARN-8524. Single parameter Resource / LightWeightResource constructor looks confusing. (Szilard Nemeth via wangda)
Change-Id: I4ae97548b5b8d76a6bcebb2d3d70bf8e0be3c125
2018-07-16 10:58:00 -07:00
Wangda Tan a2e49f41a8 YARN-8361. Change App Name Placement Rule to use App Name instead of App Id for configuration. (Zian Chen via wangda)
Change-Id: I17e5021f8f611a9c5e3bd4b38f25e08585afc6b1
2018-07-16 10:57:37 -07:00
Wangda Tan 752dcce5f4 YARN-8511. When AM releases a container, RM removes allocation tags before it is released by NM. (Weiwei Yang via wangda)
Change-Id: I6f9f409f2ef685b405cbff547dea9623bf3322d9
2018-07-16 10:54:41 -07:00
Eric E Payne 937ef39b3f YARN-8421: when moving app, activeUsers is increased, even though app does not have outstanding request. Contributed by Kyungwan Nam 2018-07-16 16:24:21 +00:00
Akira Ajisaka 88625f5cd9
HADOOP-15531. Use commons-text instead of commons-lang in some classes to fix deprecation warnings. Contributed by Takanobu Asanuma. 2018-07-13 11:42:12 -04:00
Jason Lowe 17118f446c YARN-8515. container-executor can crash with SIGPIPE after nodemanager restart. Contributed by Jim Brennan 2018-07-13 10:05:25 -05:00
Robert Kanter 1bc106a738 YARN-8518. test-container-executor test_is_empty() is broken (Jim_Brennan via rkanter) 2018-07-12 16:38:46 -07:00
Sunil G 7f1d3d0e9d YARN-8512. ATSv2 entities are not published to HBase from second attempt onwards. Contributed by Rohith Sharma K S. 2018-07-11 12:26:32 +05:30
Giovanni Matteo Fumarola 82ac3aa6d0 YARN-8502. Use path strings consistently for webservice endpoints in RMWebServices. Contributed by Szilard Nemeth. 2018-07-10 10:36:17 -07:00
Sunil G 705e2c1f7c YARN-8473. Containers being launched as app tears down can leave containers in NEW state. Contributed by Jason Lowe. 2018-07-10 20:11:47 +05:30
Giovanni Matteo Fumarola ea9b608237 YARN-7899. [AMRMProxy] Stateful FederationInterceptor for pending requests. Contributed by Botong Huang. 2018-07-09 12:27:36 -07:00
Robert Kanter 0838fe8337 Only mount non-empty directories for cgroups (miklos.szegedi@cloudera.com via rkanter) 2018-07-09 10:37:20 -07:00
Rohith Sharma K S ba68320449 YARN-8302. ATS v2 should handle HBase connection issue properly. Contributed by Billie Rinaldi. 2018-07-06 15:19:01 -07:00
Sunil G e4bf38cf50 YARN-8492. ATSv2 HBase tests are failing with ClassNotFoundException. Contributed by Rohith Sharma K S. 2018-07-06 12:05:32 -07:00
Robert Kanter a129e3e74e YARN-7451. Add missing tests to verify the presence of custom resources of RM apps and scheduler webservice endpoints (snemeth via rkanter) 2018-07-06 11:04:00 -07:00
Haibo Chen 9edc74f64a YARN-7556. Fair scheduler configuration should allow resource types in the minResources and maxResources properties. (Daniel Templeton and Szilard Nemeth via Haibo Chen) 2018-07-06 11:03:48 -07:00
Anu Engineer 39ad98903a Revert "Merge branch 'trunk' of https://git-wip-us.apache.org/repos/asf/hadoop into trunk"
This reverts commit c163d1797a, reversing
changes made to 0d9804dcef.
2018-07-05 12:22:18 -07:00
Giovanni Matteo Fumarola c163d1797a Merge branch 'trunk' of https://git-wip-us.apache.org/repos/asf/hadoop into trunk 2018-07-05 10:55:05 -07:00
Giovanni Matteo Fumarola 0d9804dcef YARN-8435. Fix NPE when the same client simultaneously contact for the first time Yarn Router. Contributed by Rang Jiaheng. 2018-07-05 10:54:31 -07:00
Robert Kanter 99febe7fd5 YARN-7451. Add missing tests to verify the presence of custom resources of RM apps and scheduler webservice endpoints (snemeth via rkanter) 2018-07-05 10:54:19 -07:00
Haibo Chen 1726247024 YARN-7556. Fair scheduler configuration should allow resource types in the minResources and maxResources properties. (Daniel Templeton and Szilard Nemeth via Haibo Chen) 2018-07-05 10:42:39 -07:00
Sunil G 51654a3962 YARN-8459. Improve Capacity Scheduler logs to debug invalid states. Contributed by Wangda Tan. 2018-07-03 09:59:20 -07:00
Junping Du d9ba6f3656 yarn.resourcemanager.fail-fast is used inconsistently. Contributed by Yuanbo Liu. 2018-07-03 14:46:44 +08:00
Sunil G fa9ef15ecd YARN-8415. TimelineWebServices.getEntity should throw ForbiddenException instead of 404 when ACL checks fail. Contributed by Suma Shivaprasad. 2018-07-02 15:34:47 -07:00
Shane Kumpf 53e267fa72 YARN-8485. Priviledged container app launch is failing intermittently. Contributed by Eric Yang 2018-07-02 16:18:32 -06:00
Eric Yang 5cc2541a16 YARN-8465. Fixed docker container status for node manager restart.
Contributed by Shane Kumpf
2018-07-02 13:37:51 -04:00
Giovanni Matteo Fumarola cdb084426b YARN-8481. AMRMProxyPolicies should accept heartbeat response from new/unknown subclusters. Contributed by Botong Huang. 2018-06-29 11:47:30 -07:00
Jason Lowe 100470140d YARN-8451. Multiple NM heartbeat thread created when a slow NM resync with RM. Contributed by Botong Huang 2018-06-29 13:06:28 -05:00
Sunil G 469b29c081 YARN-8455. Add basic ACL check for all ATS v2 REST APIs. Contributed by Rohith Sharma K S. 2018-06-29 10:02:53 -07:00
Sunil G 291194302c YARN-8379. Improve balancing resources in already satisfied queues by using Capacity Scheduler preemption. Contributed by Zian Chen. 2018-06-28 10:23:31 -07:00
Eric Yang 384764cdea YARN-8409. Fixed NPE in ActiveStandbyElectorBasedElectorService.
Contributed by Chandni Singh
2018-06-28 12:39:49 -04:00
Wangda Tan bedc4fe079 YARN-8464. Async scheduling thread could be interrupted when there are no NodeManagers in cluster. (Sunil G via wangda)
Change-Id: I4f5f856373378685713e77752ba6cf0988a66065
2018-06-26 19:27:17 -07:00
Wangda Tan ada8f63d0b YARN-8423. GPU does not get released even though the application gets killed. (Sunil G via wangda)
Change-Id: I570db7d60f8c6c21762dd618a9207d1107c486a0
2018-06-26 19:25:57 -07:00
Miklos Szegedi 62d83ca536 YARN-8461. Support strict memory control on individual container with elastic control memory mechanism. Contributed by Haibo Chen. 2018-06-26 15:21:35 -07:00
Eric Yang b69ba0f330 YARN-8108. Added option to disable loading existing filters to prevent
security filter from initialize twice.
            Contributed by Sunil Govindan
2018-06-26 17:34:57 -04:00
Miklos Szegedi 35ec9401e8 YARN-8438. TestContainer.testKillOnNew flaky on trunk. Contributed by Szilard Nemeth. 2018-06-25 15:47:54 -07:00
Weiwei Yang 440140cea6 YARN-8443. Total #VCores in cluster metrics is wrong when CapacityScheduler reserved some containers. Contributed by Tao Yang. 2018-06-25 09:15:31 +08:00
Eric Yang 8a32bc39eb YARN-8326. Removed exit code file check for launched container.
Contributed by Shane Kumpf
2018-06-22 19:12:48 -04:00
Yufei Gu 1cdce86d33 YARN-8184. Too many metrics if containerLocalizer/ResourceLocalizationService uses ReadWriteDiskValidator. Contributed by Yufei Gu 2018-06-22 14:03:55 -07:00
Inigo Goiri 99948565cb YARN-8412. Move ResourceRequest.clone logic everywhere into a proper API. Contributed by Botong Huang. 2018-06-21 18:24:10 -07:00
Miklos Szegedi 46f9058164 YARN-8441. Typo in CSQueueUtils local variable names: queueGuranteedResource. Contributed by Szilard Nemeth. 2018-06-20 11:58:18 -07:00
Miklos Szegedi 9a9e969570 YARN-8391. Investigate AllocationFileLoaderService.reloadListener locking issue. Contributed by Szilard Nemeth. 2018-06-20 10:59:33 -07:00
Haibo Chen 4939ffedb1 YARN-8437. Build oom-listener fails on older versions. (Miklos Szegedi via Haibo Chen) 2018-06-20 10:42:13 -07:00
Dedunu Dhananjaya f386e78a4b
YARN-7668. Remove unused variables from ContainerLocalizer
This closes #364

Signed-off-by: Akira Ajisaka <aajisaka@apache.org>
2018-06-18 16:45:50 -07:00
Inigo Goiri b272b71095 HADOOP-15537. Clean up ContainerLaunch and ContainerExecutor pre-HADOOP-15528. Contributed by Giovanni Matteo Fumarola. 2018-06-15 09:40:45 -07:00
Rohith Sharma K S 9119b3cf8f YARN-8155. Improve ATSv2 client logging in RM and NM publisher. Contributed by Abhishek Modi. 2018-06-14 12:38:10 +05:30
Eric Yang 22994889dc YARN-8259. Improve privileged docker container liveliness checks.
Contributed by Shane Kumpf
2018-06-13 19:24:31 -04:00
Sunil G 6307962b93 YARN-8404. Timeline event publish need to be async to avoid Dispatcher thread leak in case ATS is down. Contributed by Rohith Sharma K S 2018-06-13 16:09:16 +05:30
Weiwei Yang 29024a6203 YARN-8394. Improve data locality documentation for Capacity Scheduler. Contributed by Weiwei Yang. 2018-06-13 09:28:05 +08:00
Inigo Goiri 6e756e8a62 HADOOP-15529. ContainerLaunch#testInvalidEnvVariableSubstitutionType is not supported in Windows. Contributed by Giovanni Matteo Fumarola. 2018-06-12 10:24:34 -07:00
Haibo Chen 10d0e4be6e YARN-8325. Miscellaneous QueueManager code clean up. (Szilard Nemeth via Haibo Chen) 2018-06-12 10:11:30 -07:00
Haibo Chen 24a89825f0 YARN-6931. Make the aggregation interval in AppLevelTimelineCollector configurable. (Abhishek Modi via Haibo Chen) 2018-06-12 10:03:34 -07:00
Akira Ajisaka 652bcbb3e4
YARN-8363. Upgrade commons-lang version to 3.7 in hadoop-yarn-project. Contributed by Takanobu Asanuma. 2018-06-13 00:35:37 +09:00
Rohith Sharma K S 2df73dace0 YARN-8405. RM zk-state-store.parent-path ACLs has been changed since HADOOP-14773. Contributed by Íñigo Goiri. 2018-06-12 17:13:09 +05:30
Inigo Goiri 2b2f672022 YARN-8370. Some Node Manager tests fail on Windows due to improper path/file separator. Contributed by Anbang Hu. 2018-06-11 19:25:56 -07:00
Haibo Chen 180b3c960b YARN-8321. AllocationFileLoaderService.getAllocationFile() should be declared as VisibleForTest. (Szilard Nemeth via Haibo Chen) 2018-06-11 11:19:33 -07:00
Haibo Chen 676dcffff5 YARN-8322. Change log level when there is an IOException when the allocation file is loaded. (Szilard Nemeth via Haibo Chen) 2018-06-11 11:16:21 -07:00
Haibo Chen c190ac2be8 YARN-8323. FairScheduler.allocConf should be declared as volatile. (Szilard Nemeth via Haibo Chen) 2018-06-11 11:12:44 -07:00
Inigo Goiri 3b88fe25ba YARN-8359. Exclude containermanager.linux test classes on Windows. Contributed by Jason Lowe. 2018-06-07 17:13:03 -07:00
Robert Kanter 351cf87c92 Disable mounting cgroups by default (miklos.szegedi@cloudera.com via rkanter) 2018-06-07 17:09:34 -07:00
Miklos Szegedi d5eca1a6a0 YARN-6677. Preempt opportunistic containers when root container cgroup goes over memory limit. Contributed by Haibo Chen. 2018-06-07 16:38:23 -07:00
Rohith Sharma K S 58bc34f1e3 YARN-8399. NodeManager is giving 403 GSS exception post upgrade to 3.1 in secure mode. Contributed by Sunil Govindan. 2018-06-07 10:25:56 +05:30
Robert Kanter 0cd145a443 YARN-4677. RMNodeResourceUpdateEvent update from scheduler can lead to race condition (wilfreds and gphillips via rkanter) 2018-06-04 15:32:03 -07:00
Haibo Chen 04cf699dd5 YARN-8388. TestCGroupElasticMemoryController.testNormalExit() hangs on Linux. (Miklos Szegedi via Haibo Chen) 2018-06-04 14:23:08 -07:00
Miklos Szegedi e2c172dc9f YARN-8382. cgroup file leak in NM. Contributed by Hu Ziqian. 2018-06-04 10:41:10 -07:00
Haibo Chen ba12f87dcb YARN-8390. Fix API incompatible changes in FairScheduler's AllocationFileLoaderService. (Gergo Repas via Haibo Chen) 2018-06-04 09:15:23 -07:00
Billie Rinaldi 31998643a5 YARN-8342. Enable untrusted docker image to run with launch command. Contributed by Eric Yang 2018-06-02 14:46:32 -07:00
Rohith Sharma K S 8261f9e571 More YARN pages need to honor yarn.resourcemanager.display.per-user-apps(addendum). Contributed by Sunil G. 2018-06-02 09:29:06 +05:30
Haibo Chen 4880d890ee YARN-8375. TestCGroupElasticMemoryController fails surefire build. (Miklos Szegedi via Haibo Chen) 2018-06-01 16:29:13 -07:00
Wangda Tan ff583d3fa3 YARN-8349. Remove YARN registry entries when a service is killed by the RM. (Billie Rinaldi via wangda)
Change-Id: Ia58db3637789a8921482f564aa9bdf99c45cc36c
2018-06-01 14:49:18 -07:00
Wangda Tan 931f78718f YARN-7962. Race Condition When Stopping DelegationTokenRenewer causes RM crash during failover. (BELUGA BEHR via wangda)
Change-Id: I617e2645f60a57080058ad5f06af860fb3f682c8
2018-06-01 14:49:18 -07:00
Wangda Tan 3a6bd77550 YARN-8384. stdout.txt, stderr.txt logs of a launched docker container is coming with primary group of submit user instead of hadoop. (Eric Yang via wangda)
Change-Id: Idbb2cd250f4f62dc32993e0d3ca3ec5684616baa
2018-06-01 14:49:17 -07:00
Yufei Gu 6b21a599fb YARN-7340. Fix the missing time stamp in exception message in Class
NoOverCommitPolicy. Contributed by Dinesh Chitlangia.
2018-05-31 22:30:12 -07:00
Vinod Kumar Vavilapalli (I am also known as @tshooter.) 6b74f5d7fc YARN-8197. Fixed AM IP Filter and Webapp proxy to redirect app tracking-URLs correctly when UI is secure. Contributed by Sunil Govindan. 2018-05-31 16:48:33 -07:00
Weiwei Yang 6468071f13 YARN-8367. Fix NPE in SingleConstraintAppPlacementAllocator when placement constraint in SchedulingRequest is null. Contributed by Weiwei Yang. 2018-05-31 20:46:39 +08:00
Eric E Payne e44c0849d7 YARN-8377: Javadoc build failed in hadoop-yarn-server-nodemanager. Contributed by Takanobu Asanuma 2018-05-30 16:50:19 +00:00
Eric Yang 135941e00d YARN-8362. Bugfix logic in container retries in node manager.
Contributed by Chandni Singh
2018-05-29 16:56:58 -04:00
Jason Lowe 4827e9a908 YARN-8329. Docker client configuration can still be set incorrectly. Contributed by Shane Kumpf 2018-05-29 14:43:17 -05:00
Wangda Tan 17aa40f669 YARN-8369. Javadoc build failed due to 'bad use of >'. (Takanobu Asanuma via wangda)
Change-Id: I79a42154e8f86ab1c3cc939b3745024b8eebe5f4
2018-05-29 09:27:36 -07:00
Jason Lowe 31ab960f4f YARN-8338. TimelineService V1.5 doesn't come up after HADOOP-15406. Contributed by Vinod Kumar Vavilapalli 2018-05-29 11:00:30 -05:00
Sunil G 7c343669ba YARN-4781. Support intra-queue preemption for fairness ordering policy. Contributed by Eric Payne. 2018-05-28 16:32:53 +05:30
Wangda Tan f24c842d52 YARN-8213. Add Capacity Scheduler performance metrics. (Weiwei Yang via wangda)
Change-Id: Ieea6f3eeb83c90cd74233fea896f0fcd0f325d5f
2018-05-25 21:53:20 -07:00
Eric E Payne 8d5509c681 YARN-8292: Fix the dominant resource preemption cannot happen when some of the resource vector becomes negative. Contributed by Wangda Tan. 2018-05-25 16:06:09 +00:00
Haibo Chen 86bc6425d4 YARN-8191. Fair scheduler: queue deletion without RM restart. (Gergo Repas via Haibo Chen) 2018-05-24 17:12:34 -07:00
Shane Kumpf 1388de18ad YARN-6919. Add default volume mount list. Contributed by Eric Badger 2018-05-24 09:30:39 -06:00
Rohith Sharma K S c05b5d424b YARN-8319. More YARN pages need to honor yarn.resourcemanager.display.per-user-apps. Contributed by Sunil G. 2018-05-24 14:19:46 +05:30
Haibo Chen d996479954 YARN-4599. Set OOM control for memory cgroups. (Miklos Szegedi via Haibo Chen) 2018-05-23 16:35:37 -07:00
Inigo Goiri d72615611c YARN-8348. Incorrect and missing AfterClass in HBase-tests to fix NPE failures. Contributed by Giovanni Matteo Fumarola. 2018-05-23 14:43:59 -07:00
Inigo Goiri e99e5bf104 YARN-8344. Missing nm.stop() in TestNodeManagerResync to fix testKillContainersOnResync. Contributed by Giovanni Matteo Fumarola. 2018-05-23 14:15:26 -07:00
Shane Kumpf 9837ca9cc7 YARN-8285. Remove unused environment variables from the Docker runtime. Contributed by Eric Badger 2018-05-23 06:43:44 -06:00
Robert Kanter b22f56c471 YARN-8273. Log aggregation does not warn if HDFS quota in target directory is exceeded (grepas via rkanter) 2018-05-22 14:24:38 -07:00
Wangda Tan bd15d2396e YARN-8290. SystemMetricsPublisher.appACLsUpdated should be invoked after application information is published to ATS to avoid "User is not set in the application report" Exception. (Eric Yang via wangda)
Change-Id: I0ac6ddd19740d1aa7dd07111cd11af71ddc2fcaf
2018-05-22 13:33:33 -07:00
Eric Yang 6176d2b35c YARN-7960. Added security flag no-new-privileges for YARN Docker integration.
Contributed by Eric Badger
2018-05-22 13:44:58 -04:00
Jason Lowe 5f11288e41 YARN-8206. Sending a kill does not immediately kill docker containers. Contributed by Eric Badger 2018-05-22 09:27:08 -05:00
Eric E Payne 0b4c44bdee YARN-8179: Preemption does not happen due to natural_termination_factor when DRF is used. Contributed by Kyungwan Nam. 2018-05-21 20:14:58 +00:00
Haibo Chen f48fec83d0 YARN-8248. Job hangs when a job requests a resource that its queue does not have. (Szilard Nemeth via Haibo Chen) 2018-05-21 08:10:41 -07:00
Arun Suresh 3159bffce2 YARN-7900. [AMRMProxy] AMRMClientRelayer for stateful FederationInterceptor. (Botong Huang via asuresh) 2018-05-17 20:00:52 -07:00