Szilard Nemeth
df616370f0
YARN-8586. Extract log aggregation related fields and methods from RMAppImpl. Contributed by Peter Bacsko
2019-08-16 11:52:51 +02:00
Szilard Nemeth
8fee3808c5
YARN-9749. TestAppLogAggregatorImpl#testDFSQuotaExceeded fails on trunk. Contributed by Adam Antal
...
(cherry picked from commit 2a05e0ff3b
)
2019-08-16 08:52:34 +02:00
Szilard Nemeth
e616037d1f
YARN-9488. Skip YARNFeatureNotEnabledException from ClientRMService. Contributed by Prabhu Joseph
...
(cherry picked from commit 1845a83cec
)
2019-08-15 17:16:06 +02:00
Adam Antal
d5446b3a23
YARN-9676. Add DEBUG and TRACE level messages to AppLogAggregatorImpl… ( #1261 )
...
* YARN-9676. Add DEBUG and TRACE level messages to AppLogAggregatorImpl and connected classes
* Using {} placeholder, and increasing loglevel if log aggregation failed.
(cherry picked from commit c89bdfacc8
)
2019-08-14 17:36:41 +02:00
Szilard Nemeth
4bb238c480
YARN-9133. Make tests more easy to comprehend in TestGpuResourceHandler. Contributed by Peter Bacsko
2019-08-14 17:16:54 +02:00
Szilard Nemeth
4dc477b606
YARN-9140. Code cleanup in ResourcePluginManager.initialize and in TestResourcePluginManager. Contributed by Peter Bacsko
2019-08-14 17:01:41 +02:00
Eric Badger
cec71691be
YARN-9442. container working directory has group read permissions. Contributed by Jim Brennan.
...
(cherry picked from commit 2ac029b949
)
Conflicts:
hadoop-yarn-project/hadoop-yarn/hadoop-yarn-server/hadoop-yarn-server-nodemanager/src/main/native/container-executor/test/test-container-executor.c
2019-08-13 16:34:29 +00:00
Szilard Nemeth
c5aea8ca56
YARN-9723. ApplicationPlacementContext is not required for terminated jobs during recovery. Contributed by Prabhu Joseph
...
(cherry picked from commit e4b538bbda
)
2019-08-12 15:16:18 +02:00
Szilard Nemeth
b20fd9e212
YARN-9135. NM State store ResourceMappings serialization are tested with Strings instead of real Device objects. Contributed by Peter Bacsko
2019-08-12 14:02:17 +02:00
Szilard Nemeth
2e6beb1550
Logging fileSize of log files under NM Local Dir. Contributed by Prabhu Joseph
...
(cherry picked from commit 54ac80176e
)
2019-08-09 13:20:10 +02:00
Szilard Nemeth
02d0e54596
YARN-9092. Create an object for cgroups mount enable and cgroups mount path as they belong together. Contributed by Gergely Pollak
...
(cherry picked from commit e0c21c6da9
)
2019-08-09 10:23:10 +02:00
Szilard Nemeth
f0dfb8b832
YARN-9096: Some GpuResourcePlugin and ResourcePluginManager methods are synchronized unnecessarily. Contributed by Gergely Pollak
...
(cherry picked from commit 742e30b473
)
2019-08-09 10:02:35 +02:00
Szilard Nemeth
3bcf44f070
YARN-9094: Remove unused interface method: NodeResourceUpdaterPlugin#handleUpdatedResourceFromRM. Contributed by Gergely Pollak
...
(cherry picked from commit 72d7e570a7
)
2019-08-09 09:50:32 +02:00
Eric E Payne
e47c483d9f
YARN-9685: NPE when rendering the info table of leaf queue in non-accessible partitions. Contributed by Tao Yang.
...
(cherry picked from commit 3b38f2019e
)
2019-08-08 12:54:31 +00:00
Haibo Chen
8d357343c4
YARN-9559. Create AbstractContainersLauncher for pluggable ContainersLauncher logic. (Contributed by Jonathan Hung)
...
(cherry picked from commit f51702d539
)
2019-08-06 14:59:49 -07:00
Eric E Payne
168dc3f258
YARN-9596: QueueMetrics has incorrect metrics when labelled partitions are involved. Contributed by Muhammad Samir Khan.
...
(cherry picked from commit 42683aef1a
)
2019-07-30 19:19:33 +00:00
Jonathan Hung
15344006bc
YARN-9668. UGI conf doesn't read user overridden configurations on RM and NM startup. (Contributed by Jonanthan Hung)
2019-07-22 10:46:45 -07:00
bibinchundatt
4866735cde
YARN-9645. Fix Invalid event FINISHED_CONTAINERS_PULLED_BY_AM at NEW on NM restart. Contributed by Bilwa S T.
...
(cherry picked from commit 7a93be0f60
)
2019-07-16 14:06:36 +05:30
Szilard Nemeth
28d6a453a9
YARN-9127. Create more tests to verify GpuDeviceInformationParser. Contributed by Peter Bacsko
...
(cherry picked from commit 18ee1092b4
)
2019-07-15 12:02:39 +02:00
Szilard Nemeth
2fcbdf4131
YARN-9337. Addendum to fix compilation error due to mockito spy call
...
(cherry picked from commit bb37c6cb7f
)
2019-07-13 00:45:38 +02:00
Szilard Nemeth
0ede873090
YARN-9337. GPU auto-discovery script runs even when the resource is given by hand. Contributed by Adam Antal
...
(cherry picked from commit 61b0c2bb7c
)
2019-07-12 17:29:47 +02:00
Szilard Nemeth
c61c969668
YARN-9235. If linux container executor is not set for a GPU cluster GpuResourceHandlerImpl is not initialized and NPE is thrown. Contributed by Antal Balint Steinbach, Adam Antal
...
(cherry picked from commit c416284bb7
)
2019-07-12 16:53:26 +02:00
bibinchundatt
5f8395f393
YARN-9557. Application fails in diskchecker when ReadWriteDiskValidator is configured. Contributed by Bilwa S T.
2019-07-10 10:34:39 +05:30
Szilard Nemeth
4638fa00fc
YARN-9629. Support configurable MIN_LOG_ROLLING_INTERVAL. Contributed by Adam Antal.
...
(cherry picked from commit a2a8be18cb
)
2019-07-04 10:26:29 +02:00
Sunil G
d18986e4e8
YARN-9644. First RMContext object is always leaked during switch over. Contributed by Bibin A Chundatt.
2019-07-04 11:05:54 +05:30
Weiwei Yang
c9bccaf148
YARN-9655. AllocateResponse in FederationInterceptor lost applicationPriority. Contributed by hunshenshi.
...
(cherry picked from commit 570eee30e5
)
2019-07-02 10:05:22 +08:00
Erik Krogen
49d7bb6a92
HDFS-13286. [SBN read] Add haadmin commands to transition between standby and observer. Contributed by Chao Sun.
2019-06-28 14:20:01 -07:00
bibinchundatt
a2f4e4698b
YARN-9639. DecommissioningNodesWatcher cause memory leak. Contributed by Bilwa S T.
...
(cherry picked from commit be80334cdf
)
2019-06-27 10:04:40 +05:30
Weiwei Yang
1944a7d844
YARN-9209. When nodePartition is not set in Placement Constraints, containers are allocated only in default partition. Contributed by Tarun Parimi.
...
(cherry picked from commit 83dcb9d87e
)
2019-06-21 17:52:22 +08:00
Zhankun Tang
1e7201f9aa
YARN-9584. Should put initializeProcessTrees method call before get pid. Contributed by Wanqiang Ji.
...
(cherry picked from commit 67414a1a80
)
2019-06-18 13:18:27 +08:00
Inigo Goiri
65f7ec2f39
YARN-8856. TestTimelineReaderWebServicesHBaseStorage tests failing with NoClassDefFoundError. Contributed by Sushil Ks.
...
(cherry picked from commit eeaf8edaa7
)
2019-06-13 14:22:16 -07:00
Sean Mackrory
e0b3cbd221
HADOOP-16213. Update guava to 27.0-jre. Contributed by Gabor Bota.
2019-06-13 07:53:40 -06:00
Sunil G
72203f7a12
YARN-9545. Create healthcheck REST endpoint for ATSv2. Contributed by Zoltan Siegl.
2019-06-12 19:23:40 +05:30
Sunil G
f1ead03672
Revert "YARN-9545. Create healthcheck REST endpoint for ATSv2. Contributed by Zoltan Siegl."
...
This reverts commit f1d3a17d3e
.
2019-06-12 19:10:23 +05:30
bibinchundatt
3303723f55
YARN-9547. ContainerStatusPBImpl default execution type is not returned. Contributed by Bilwa S T.
2019-06-11 23:42:29 +05:30
bibinchundatt
d9284d4a57
YARN-9565. RMAppImpl#ranNodes not cleared on FinalTransition. Contributed by Bilwa S T.
...
(cherry picked from commit 60c95e9b6a
)
2019-06-11 23:13:18 +05:30
bibinchundatt
a37011bd5e
YARN-9594. Fix missing break statement in ContainerScheduler#handle. Contributed by lujie.
...
(cherry picked from commit 6d80b9bc3f
)
Conflicts:
hadoop-yarn-project/hadoop-yarn/hadoop-yarn-server/hadoop-yarn-server-nodemanager/src/main/java/org/apache/hadoop/yarn/server/nodemanager/containermanager/scheduler/ContainerScheduler.java
2019-06-11 23:01:03 +05:30
Sunil G
f1d3a17d3e
YARN-9545. Create healthcheck REST endpoint for ATSv2. Contributed by Zoltan Siegl.
2019-06-06 06:24:01 +05:30
Weiwei Yang
6e2b091515
YARN-9580. Fulfilled reservation information in assignment is lost when transferring in ParentQueue#assignContainers. Contributed by Tao Yang.
2019-06-04 15:24:37 +08:00
Weiwei Yang
e027c87da2
YARN-9507. Fix NPE in NodeManager#serviceStop on startup failure. Contributed by Bilwa S T.
...
(cherry picked from commit 4530f4500d
)
2019-06-03 14:15:20 +08:00
Eric Yang
b2a39e8883
YARN-9542. Fix LogsCLI guessAppOwner ignores custome file format suffix.
...
Contributed by Prabhu Joseph
2019-05-29 18:04:13 -04:00
Eric E Payne
2e561cef47
YARN-8625. Aggregate Resource Allocation for each job is not present in ATS. Contributed by Prabhu Joseph.
...
(cherry picked from commit 3c63551101
)
2019-05-29 18:43:13 +00:00
Ahmed Hussein
777f7345ef
YARN-9563. Resource report REST API could return NaN or Inf (Ahmed Hussein via jeagles)
...
Signed-off-by: Jonathan Eagles <jeagles@gmail.com>
(cherry picked from commit abf76ac371
)
2019-05-29 12:14:01 -05:00
Takanobu Asanuma
a9a3450560
HADOOP-16331. Fix ASF License check in pom.xml. Contributed by Akira Ajisaka.
...
Signed-off-by: Takanobu Asanuma <tasanuma@apache.org>
2019-05-29 17:34:16 +09:00
Akira Ajisaka
855dc997d6
HADOOP-16323. https everywhere in Maven settings.
2019-05-27 15:27:33 +09:00
bibinchundatt
71f5bfb822
YARN-9508. YarnConfiguration areNodeLabel enabled is costly in allocation flow. Contributed by Bilwa S T.
...
(cherry picked from commit 570fa2da20
)
2019-05-15 13:31:07 +05:30
Haibo Chen
c6573562cb
YARN-9529. Log correct cpu controller path on error while initializing CGroups. (Contributed by Jonathan Hung)
...
(cherry picked from commit 597fa47ad1
)
2019-05-06 11:58:31 -07:00
Eric E Payne
6fce24fb40
YARN-9285: RM UI progress column is of wrong type. Contributed by Ahmed Hussein.
...
(cherry picked from commit b094b94d43
)
2019-05-02 19:48:06 +00:00
Weiwei Yang
cc0c85f04a
YARN-9325. TestQueueManagementDynamicEditPolicy fails intermittent. Contributed by Prabhu Joseph.
...
(cherry picked from commit 1c8046d67e
)
2019-04-23 14:24:15 +08:00
Eric Yang
ac85aa80d9
YARN-8587. Added retries for fetching docker exit code.
...
Contributed by Charo Zhang
(cherry picked from commit c16c49b8c3
)
2019-04-19 15:40:23 -04:00
Eric Yang
4a64dab0dd
YARN-8622. Fixed container-executor compilation on MacOSX.
...
Contributed by Siyao Meng
(cherry picked from commit ef97a20831
)
2019-04-18 19:01:11 -04:00
Eric Yang
2503409977
YARN-6695. Fixed NPE in publishing appFinished events to ATSv2.
...
Contributed by Prabhu Joseph
(cherry picked from commit df76cdc895
)
2019-04-18 12:30:55 -04:00
Siyao Meng
742a3ad24b
YARN-9487. NodeManager native build shouldn't link against librt on macOS. Contributed by Siyao Meng.
...
Signed-off-by: Wei-Chiu Chuang <weichiu@apache.org>
(cherry picked from commit 6e4399ea61
)
2019-04-17 22:57:33 -07:00
Weiwei Yang
db185de31c
YARN-9463. Add queueName info when failing with queue capacity sanity check. Contributed by Aihua Xu.
...
(cherry picked from commit 8c1bba375b
)
2019-04-10 23:02:24 +08:00
Weiwei Yang
7a80b1b481
YARN-9413. Queue resource leak after app fail for CapacityScheduler. Contributed by Tao Yang.
...
(cherry picked from commit ec143cbf67
)
2019-04-06 20:19:03 +08:00
Eric Yang
10642a6205
YARN-9391. Fixed node manager environment leaks into Docker containers.
...
Contributed by Jim Brennan
(cherry picked from commit 3c45762a0b
)
2019-03-25 15:54:52 -04:00
Sunil G
d721634fea
YARN-9138. Improve test coverage for nvidia-smi binary execution of GpuDiscoverer. Contributed by Szilard Nemeth.
...
(cherry picked from commit 46045c5cb3
)
2019-03-06 16:01:56 +05:30
bibinchundatt
63ed16e076
Revert "YARN-8132. Final Status of applications shown as UNDEFINED in ATS app queries. Contributed by Prabhu Joseph"
...
This reverts commit cf1944eb6e
.
2019-03-04 17:01:40 +05:30
Sunil G
d045f02a8d
YARN-9139. Simplify initializer code of GpuDiscoverer. Contributed by Szilard Nemeth.
2019-03-01 19:27:03 +05:30
Weiwei Yang
7575e3090d
YARN-9324. TestSchedulingRequestContainerAllocation(Async) fails with junit-4.11. Contributed by Prabhu Joseph.
2019-02-28 09:32:07 +08:00
Weiwei Yang
7fa5373ec4
YARN-9248. RMContainerImpl:Invalid event: ACQUIRED at KILLED. Contributed by lujie.
...
(cherry picked from commit 8c30114b00
)
2019-02-27 17:35:09 +08:00
Sunil G
809e3f2453
YARN-9121. Replace GpuDiscoverer.getInstance() to a readable object for easy access control. Contributed by Szilard Nemeth.
...
(cherry picked from commit 5e91ebd91a
)
2019-02-27 12:03:58 +05:30
Sunil G
a95a0cbf2f
YARN-9087. Improve logging for initialization of Resource plugins. Contributed by Szilard Nemeth.
2019-02-27 11:54:43 +05:30
Weiwei Yang
bdde6a612e
YARN-9329. updatePriority is blocked when using FairScheduler. Contributed by Jiandan Yang.
...
(cherry picked from commit 3e1739d589
)
2019-02-26 00:18:24 +08:00
Sunil G
f282f9c362
YARN-9213. RM Web UI v1 does not show custom resource allocations for containers page. Contributed by Szilard Nemeth.
2019-02-25 11:37:42 +05:30
Weiwei Yang
cdce1c17a0
YARN-9316. TestPlacementConstraintsUtil#testInterAppConstraintsByAppID fails intermittently. Contributed by Prabhu Joseph.
...
(cherry picked from commit 9cd5c5447f
)
2019-02-24 22:48:55 +08:00
Weiwei Yang
604a915bab
YARN-9238. Avoid allocating opportunistic containers to previous/removed/non-exist application attempt. Contributed by lujie.
...
(cherry picked from commit 9c88695bcd
)
2019-02-24 22:21:53 +08:00
bibinchundatt
3e1bd53a37
YARN-9317. Avoid repeated YarnConfiguration#timelineServiceV2Enabled check. Contributed by Prabhu Joseph
2019-02-23 07:59:51 +05:30
bibinchundatt
cf1944eb6e
YARN-8132. Final Status of applications shown as UNDEFINED in ATS app queries. Contributed by Prabhu Joseph
2019-02-22 20:51:47 +05:30
Sunil G
d75aa33612
YARN-9118. Handle exceptions with parsing user defined GPU devices in GpuDiscoverer. Contributed by Szilard Nemeth.
...
(cherry picked from commit 95fbbfed75
)
2019-02-22 20:23:24 +05:30
Weiwei Yang
c2ef443359
YARN-9315. TestCapacitySchedulerMetrics fails intermittently. Contributed by Prabhu Joseph.
2019-02-21 18:06:26 +08:00
bibinchundatt
e6f2b8730f
YARN-9286. [Timeline Server] Sorting based on FinalStatus shows pop-up message. Contributed by Bilwa S T.
...
(cherry picked from commit b8de78c570
)
2019-02-20 01:20:15 +05:30
Adam Antal
830aaac023
YARN-9283. Javadoc of LinuxContainerExecutor#addSchedPriorityCommand has a wrong property name as reference
...
Signed-off-by: Akira Ajisaka <aajisaka@apache.org>
(cherry picked from commit 9385ec45d7
)
2019-02-15 18:48:21 +09:00
Weiwei Yang
fbd03543d8
YARN-8555. Parameterize TestSchedulingRequestContainerAllocation(Async) to cover both PC handler options. Contributed by Prabhu Joseph.
...
(cherry picked from commit 0a1637c750
)
2019-02-11 15:56:34 +08:00
Masatake Iwasaki
6229469574
YARN-9282. Typo in javadoc of class LinuxContainerExecutor: hadoop.security.authetication should be 'authentication'. Contributed by Charan Hebri.
...
(cherry picked from commit e0ab1bdece
)
2019-02-09 00:28:59 +09:00
Eric E Payne
55dde827e6
YARN-7171: RM UI should sort memory / cores numerically. Contributed by Ahmed Hussein
...
(cherry picked from commit d1ca9432dd
)
2019-02-07 16:47:15 +00:00
Vinayakumar B
e2b91b2ccb
YARN-8498. Yarn NodeManager OOM Listener Fails Compilation on Ubuntu 18.04. Contributed by Ayush Saxena.
2019-02-07 13:03:42 +05:30
Weiwei Yang
b64e9df949
YARN-9262. TestRMAppAttemptTransitions is failing with an NPE. Contributed by lujie.
...
(cherry picked from commit 28ad20a711
)
2019-02-04 14:00:30 +05:30
Sunil G
99876a5ab8
YARN-9206. RMServerUtils does not count SHUTDOWN as an accepted state. Contributed by Kuhu Shukla.
...
(cherry picked from commit 604b2489a9
)
2019-02-04 12:49:06 +05:30
Weiwei Yang
a0fafbc3ef
YARN-9263. TestConfigurationNodeAttributesProvider fails after Mockito updated. Contributed by Weiwei Yang.
...
(cherry picked from commit f20b043a02
)
2019-02-04 12:45:40 +05:30
Sunil G
0e7060a1d5
YARN-9099. GpuResourceAllocator#getReleasingGpus calculates number of GPUs in a wrong way. Contributed by Szilard Nemeth.
...
(cherry picked from commit 71c49fa60f
)
2019-01-31 09:26:07 +05:30
Eric E Payne
4052b7ee60
YARN-6616: YARN AHS shows submitTime for jobs same as startTime. Contributed by Prabhu Joseph
...
(cherry picked from commit 04105bbfdb
)
2019-01-29 17:52:54 +00:00
Weiwei Yang
6b8dd8d113
YARN-9237. NM should ignore sending finished apps to RM during RM fail-over. Contributed by Jiandan Yang.
...
(cherry picked from commit 4f63ffe444
)
2019-01-29 10:42:09 +08:00
Jonathan Hung
bf760e7e81
YARN-9222. Print launchTime in ApplicationSummary
...
(cherry picked from commit 6cace58e21
)
2019-01-25 13:23:37 -08:00
Weiwei Yang
bc6374f282
YARN-9205. When using custom resource type, application will fail to run due to the CapacityScheduler throws InvalidResourceRequestException(GREATER_THEN_MAX_ALLOCATION). Contributed by Zhankun Tang.
2019-01-23 18:10:28 +08:00
Weiwei Yang
8ad7711605
YARN-8101. Add UT to verify node-attributes in RM nodes rest API. Contributed by Prabhu Joseph.
...
(cherry picked from commit 721d5c2a5f
)
2019-01-23 18:07:45 +08:00
Weiwei Yang
9114489566
YARN-9210. RM nodes web page can not display node info. Contributed by Jiandan Yang.
...
(cherry picked from commit d43df31751
)
2019-01-22 10:46:37 +08:00
Weiwei Yang
ac2f4b64f9
YARN-9204. RM fails to start if absolute resource is specified for partition capacity in CS queues. Contributed by Jiandan Yang.
...
(cherry picked from commit abde1e1f58
)
2019-01-21 21:20:01 +08:00
Wangda Tan
fe7cb2d84a
YARN-9194. Invalid event: REGISTERED and LAUNCH_FAILED at FAILED, and NullPointerException happens in RM while shutdown a NM. (lujie via wangda)
...
Change-Id: I4359f59a73a278a941f4bb9d106dd38c9cb471fe
(cherry picked from commit 6d7eedfd28
)
2019-01-17 15:13:42 -08:00
Wangda Tan
1dc2b49bfd
YARN-8822. Nvidia-docker v2 support for YARN GPU feature. (Charo Zhang via wangda)
...
Change-Id: Ib8044307a4241f6b1b7b9b8266b9256f39b16384
2019-01-07 12:21:33 -08:00
Weiwei Yang
2b549e32e1
YARN-9173. FairShare calculation broken for large values after YARN-8833. Contributed by Wilfred Spiegelenburg.
...
(cherry picked from commit 944cf87223
)
2019-01-07 16:05:57 +08:00
Weiwei Yang
a24cca11f2
YARN-9164. Shutdown NM may cause NPE when opportunistic container scheduling is enabled. Contributed by lujie.
...
(cherry picked from commit cfe89e6f96
)
2019-01-04 01:04:39 +08:00
Weiwei Yang
7deef08eb8
YARN-8925. Updating distributed node attributes only when necessary. Contributed by Tao Yang.
2018-12-21 16:31:03 +08:00
Eric Yang
29c9c8a893
YARN-9126. Fix container clean up for reinitialization.
...
Contributed by Chandni Singh
(cherry picked from commit e815fd9c49
)
2018-12-19 14:58:19 -05:00
Eric Yang
28ca14e71b
YARN-9040. Fixed memory leak in LevelDBCacheTimelineStore and DBIterator.
...
Contributed by Tarun Parimi
(cherry picked from commit 71e0b0d800
)
2018-12-17 12:08:09 -05:00
Eric Yang
52aafb9789
YARN-9125. Fixed Carriage Return detection in Docker container launch command.
...
Contributed by Billie Rinaldi
(cherry picked from commit b2d7204ed0
)
2018-12-14 17:55:10 -05:00
Weiwei Yang
2b3c3d2a32
YARN-9009. Fix flaky test TestEntityGroupFSTimelineStore.testCleanLogs. Contributed by OrDTesters.
...
(cherry picked from commit 1c09a10e96
)
2018-12-10 12:07:23 +08:00
Jonathan Hung
3ab6ea7aca
YARN-9085. Add Guaranteed and MaxCapacity to CSQueueMetrics
...
(cherry picked from commit 978ab3e958227220cb6f1a08ae6e7cdb8a46628b)
2018-12-07 10:45:47 -08:00
Eric Yang
8c70728f7f
YARN-9071. Improved status update for reinitialized containers.
...
Contributed by Chandni Singh
(cherry picked from commit 1b790f4dd1
)
2018-12-05 19:04:55 -05:00
Jonathan Hung
6b01e4d2a8
YARN-9036. Escape newlines in health report in YARN UI. Contributed by Keqiu Hu
...
(cherry picked from commit 1c8bd7128c99d8215ef16438bd2ce6b1f025a966)
2018-11-30 10:16:00 -08:00
bibinchundatt
183ec39c4b
YARN-9069. Fix SchedulerInfo#getSchedulerType for custom schedulers. Contributed by Bilwa S T.
...
(cherry picked from commit 07142f54a8
)
2018-11-29 22:16:32 +05:30
Jason Lowe
df0e7766e4
YARN-8812. Containers fail during creating a symlink which started with hyphen for a resource file. Contributed by Oleksandr Shevchenko
...
(cherry picked from commit 3ce99e32f7
)
2018-11-28 08:50:18 -06:00
Eric Yang
838190482d
YARN-8986. Added port publish for Docker container running with bridge.
...
Contributed by Charo Zhang
2018-11-27 14:27:13 -05:00
Weiwei Yang
650581a19d
YARN-8833. Avoid potential integer overflow when computing fair shares. Contributed by liyakun.
...
(cherry picked from commit d027a24f03
)
2018-11-18 23:23:09 +08:00
Rohith Sharma K S
13e3670e7f
YARN-8303. YarnClient should contact TimelineReader for application/attempt/container report.
...
(cherry picked from commit ee3355be3c
)
2018-11-16 18:37:20 +05:30
Weiwei Yang
2415f8a5be
YARN-8987. Usability improvements node-attributes CLI. Contributed by Bibin A Chundatt.
...
(cherry picked from commit c741109522
)
2018-11-12 18:21:11 +08:00
Weiwei Yang
b3321d9933
YARN-8988. Reduce the verbose log on RM heartbeat path when distributed node-attributes is enabled. Contributed by Tao Yang.
...
(cherry picked from commit e1bbf7dcdf
)
2018-11-08 17:50:16 +08:00
Weiwei Yang
b10ec0aa14
YARN-8977. Remove unnecessary type casting when calling AbstractYarnScheduler#getSchedulerNode. Contributed by Wanqiang Ji.
...
(cherry picked from commit c96cbe8659
)
2018-11-07 22:44:01 +08:00
Akira Ajisaka
41c4e9583d
YARN-8233. NPE in CapacityScheduler#tryCommit when handling allocate/reserve proposal whose allocatedOrReservedContainer is null. Contributed by Tao Yang.
...
(cherry picked from commit 951c98f890
)
2018-11-07 11:18:48 +09:00
Jason Lowe
9265934201
YARN-8865. RMStateStore contains large number of expired RMDelegationToken. Contributed by Wilfred Spiegelenburg
...
(cherry picked from commit ab6aa4c726
)
2018-11-06 08:47:30 -06:00
Weiwei Yang
f00125e2d9
YARN-8970. Improve the debug message in CS#allocateContainerOnSingleNode. Contributed by Zhankun Tang.
...
(cherry picked from commit 5d6554c722
)
2018-11-06 14:52:10 +08:00
Weiwei Yang
b474239e0b
YARN-8969. AbstractYarnScheduler#getNodeTracker should return generic type to avoid type casting. Contributed by Wanqiang Ji.
...
(cherry picked from commit c7fcca0d7e
)
2018-11-06 13:17:19 +08:00
Jonathan Hung
b9a3c988c0
YARN-7225. Add queue and partition info to RM audit log. Contributed by Eric Payne
...
(cherry picked from commit 2ab611d48b
)
2018-11-01 14:31:09 -07:00
Rohith Sharma K S
7920f79049
YARN-8950. Fix compilation issue due to dependency convergence error for hbase.profile=2.0.
...
(cherry picked from commit 4ec4ec6971
)
2018-10-30 11:50:33 +05:30
Weiwei Yang
5db4272d91
YARN-8944. TestContainerAllocation.testUserLimitAllocationMultipleContainers failure after YARN-8896. Contributed by Wilfred Spiegelenburg.
...
(cherry picked from commit 1d90a0dd23
)
2018-10-29 11:55:22 +08:00
Jason Lowe
7097755925
YARN-8904. TestRMDelegationTokens can fail in testRMDTMasterKeyStateOnRollingMasterKey. Contributed by Wilfred Spiegelenburg
...
(cherry picked from commit 93fb3b4b9c
)
2018-10-23 12:54:04 -05:00
Rohith Sharma K S
660fff3138
YARN-8826. Fix lingering timeline collector after serviceStop in TimelineCollectorManager. Contributed by Prabha Manepalli.
...
(cherry picked from commit 0b62983c5a
)
2018-10-23 14:06:13 +05:30
Sunil G
30998fea28
YARN-8868. Set HTTPOnly attribute to Cookie. Contributed by Chandni Singh.
...
(cherry picked from commit 2202e00ba8
)
2018-10-23 09:57:26 +05:30
Eric Yang
04600cf5da
YARN-8910. Fixed misleading log statement when container max retries is infinite.
...
Contributed by Chandni Singh
(cherry picked from commit 47ad98b2e1
)
2018-10-19 13:49:51 -04:00
Wangda Tan
9ed9e185d7
YARN-8916. Define a constant docker string in ContainerRuntimeConstants.java for better maintainability. (Zhankun Tang via wangda)
...
Change-Id: I1349e740037f81afdbe30edbe741f20e88fd0a90
(cherry picked from commit 5e02b4915b
)
2018-10-19 09:50:18 -07:00
Weiwei Yang
042f2df19b
YARN-8907. Fix incorrect logging message in TestCapacityScheduler. Contributed by Zhankun Tang.
...
(cherry picked from commit 13cc0f50ea
)
2018-10-19 10:01:45 +08:00
Wangda Tan
7252f8e117
YARN-8896. Limit the maximum number of container assignments per heartbeat. (Zhankun Tang via wangda)
...
Change-Id: I6e72f8362bd7f5c2a844cb9e3c4732492314e9f1
(cherry picked from commit 780be14f07
)
2018-10-18 12:12:19 -07:00
Wangda Tan
5f7ed043c8
YARN-8456. Fix a configuration handling bug when user leave FPGA discover executable path configuration default but set OpenCL SDK path environment variable. (Zhankun Tang via wangda)
...
Change-Id: Iff150ea98ba0c60d448474fd940eb121afce6965
(cherry picked from commit a457a8951a
)
2018-10-18 12:12:14 -07:00
Sunil G
bde4fd5ed9
Preparing for 3.2.0 release
2018-10-18 17:07:45 +05:30
Sunil G
6380ee5512
YARN-8759. Copy of resource-types.xml is not deleted if test fails, causes other test failures. Contributed by Antal Bálint Steinbach.
...
(cherry picked from commit 5085e5fa9e
)
2018-10-17 16:06:07 +05:30
Vrushali C
62d329cac0
YARN-5742 Serve aggregated logs of historical apps from timeline service. Contributed by Rohith Sharma KS
...
(cherry picked from commit 8d1981806f
)
2018-10-12 07:18:13 +05:30
Jason Lowe
cdbca8b133
YARN-8861. executorLock is misleading in ContainerLaunch. Contributed by Chandni Singh
...
(cherry picked from commit e787d65a08
)
2018-10-11 10:58:48 -05:00
Jason Lowe
145c7aa663
YARN-7644. NM gets backed up deleting docker containers. Contributed by Chandni Singh
...
(cherry picked from commit 5ce70e1211
)
2018-10-10 10:01:52 -05:00
Weiwei Yang
eb0147a4c7
YARN-8858. CapacityScheduler should respect maximum node resource when per-queue maximum-allocation is being used. Contributed by Wangda Tan.
...
(cherry picked from commit edce866489
)
2018-10-10 09:42:45 +08:00
Wangda Tan
b3ac886933
YARN-8844. TestNMProxy unit test is failing. (Eric Yang via wangda)
...
Change-Id: I241fa8701b6f1dbcad87fd2e9a429e32e7aa40f5
2018-10-04 10:48:47 -07:00
Shane Kumpf
5edb9d3b97
YARN-8785. Improve the error message when a bind mount is not whitelisted. Contributed by Simon Prewo
2018-10-02 07:16:29 -06:00
Haibo Chen
d0ee6fbe28
YARN-8621. Add test coverage of custom Resource Types for the apps/<appId> REST API endpoint. (Contributed by Szilard Nemeth)
2018-10-01 14:46:42 -07:00
Giovanni Matteo Fumarola
59d5af21b7
YARN-8760. [AMRMProxy] Fix concurrent re-register due to YarnRM failover in AMRMClientRelayer. Contributed by Botong Huang.
2018-10-01 13:12:38 -07:00
Weiwei Yang
fd6be5898a
YARN-8468. Enable the use of queue based maximum container allocation limit and implement it in FairScheduler. Contributed by Antal Bálint Steinbach.
2018-09-29 17:47:12 +08:00
Eric E Payne
8598b498bc
YARN-8774. Memory leak when CapacityScheduler allocates from reserved container with non-default label. Contributed by Tao Yang.
2018-09-28 15:32:07 +00:00
bibinchundatt
7093afd874
YARN-8829. Cluster metrics can fail with IndexOutOfBoundsException. Contributed by Akshay Agarwal.
2018-09-28 12:35:33 +05:30
Vrushali C
90e2e493b3
YARN-8270 Adding JMX Metrics for Timeline Collector and Reader. Contributed by Sushil Ks.
2018-09-27 15:53:39 -07:00
Eric Yang
b237a0dd44
YARN-6456. Added config to set default container runtimes.
...
Contributed by Craig Condit
2018-09-27 15:31:18 -04:00
Jason Lowe
6b988d821e
YARN-8804. resourceLimits may be wrongly calculated when leaf-queue is blocked in cluster with 3+ level queues. Contributed by Tao Yang
2018-09-26 14:43:00 -07:00
Eric Yang
913f87dada
YARN-8665. Added Yarn service cancel upgrade option.
...
Contributed by Chandni Singh
2018-09-26 14:51:35 -04:00
Rohith Sharma K S
e5287a4fe0
YARN-8824. App Nodelabel missed after RM restart for finished apps. Contributed by Bibin A Chundatt.
2018-09-26 12:30:26 +05:30
Akira Ajisaka
44edcdfd6a
YARN-8745. Misplaced the TestRMWebServicesFairScheduler.java file. Contributed by Y. SREENIVASULU REDDY.
2018-09-26 10:09:11 +09:00
Rohith Sharma K S
50bc7746d7
YARN-8815. RM fails to recover finished unmanaged AM. Contributed by Bibin A Chundatt.
2018-09-25 11:31:14 +05:30
Haibo Chen
29dad7d258
YARN-8616. systemClock should be used in RMAppImpl instead of System.currentTimeMills(), to be consistent. (Contributed by Szilard Nemeth)
2018-09-24 16:04:28 -07:00
Giovanni Matteo Fumarola
3090922805
YARN-8696. [AMRMProxy] FederationInterceptor upgrade: home sub-cluster heartbeat async. Contributed by Botong Huang.
2018-09-24 11:37:05 -07:00
Eric Yang
aa4bd493c3
YARN-8801. Fixed header comments for docker utility functions.
...
Contributed by Zian Chen
2018-09-20 13:08:59 -04:00
Jason Lowe
6b5838ed32
YARN-8784. DockerLinuxContainerRuntime prevents access to distributed cache entries on a full disk. Contributed by Eric Badger
2018-09-19 16:44:51 -05:00
Eric Yang
efdea85ad1
YARN-8791. Trim docker inspect output for line feed for STOPSIGNAL parsing.
...
Contributed by Chandni Singh
2018-09-19 13:16:11 -04:00
Weiwei Yang
0712537e79
YARN-8771. CapacityScheduler fails to unreserve when cluster resource contains empty resource type. Contributed by Tao Yang.
2018-09-19 19:31:07 +08:00
Jason Lowe
2df0a8dcb3
YARN-8648. Container cgroups are leaked when using docker. Contributed by Jim Brennan
2018-09-18 15:36:45 -05:00
Shane Kumpf
144a55f0e3
YARN-8045. Reduce log output from container status calls. Contributed by Craig Condit
2018-09-14 10:41:55 -06:00
Shane Kumpf
78902f0250
YARN-8748. Javadoc warnings within the nodemanager package. Contributed by Craig Condit
2018-09-14 10:28:36 -06:00
Eric Yang
99237607bf
YARN-8706. Allow additional flag in docker inspect call.
...
Contributed by Chandni Singh
2018-09-14 11:46:59 -04:00
Weiwei Yang
f1a893fdbc
YARN-8720. CapacityScheduler does not enforce max resource allocation check at queue level. Contributed by Tarun Parimi.
2018-09-14 16:33:51 +08:00
Jason Lowe
250b50018e
YARN-8680. YARN NM: Implement Iterable Abstraction for LocalResourceTracker state. Contributed by Pradeep Ambati
2018-09-13 13:28:54 -05:00
Weiwei Yang
39c1ea1ed4
YARN-8729. Node status updater thread could be lost after it is restarted. Contributed by Tao Yang.
2018-09-13 22:21:35 +08:00
Sunil G
f4bda5e8e9
YARN-8630. ATSv2 REST APIs should honor filter-entity-list-by-user in non-secure cluster when ACls are enabled. Contributed by Rohith Sharma K S.
2018-09-13 17:47:21 +05:30
Shane Kumpf
8e9afbfb66
YARN-8768. Javadoc error in node attributes. Contributed by Sunil Govindan.
2018-09-12 15:12:28 -06:00
Giovanni Matteo Fumarola
02b9bfdf9e
YARN-8658. [AMRMProxy] Metrics for AMRMClientRelayer inside FederationInterceptor. Contributed by Young Chen.
2018-09-12 11:46:35 -07:00
Sunil G
5e64e62dee
YARN-8740. Clear node attribute path after each test run. Contributed by Bibin A Chundatt.
2018-09-12 16:01:01 +05:30
bibinchundatt
c44088ac19
YARN-8739. Fix jenkins issues for Node Attributes branch. Contributed by Sunil Govindan.
2018-09-12 16:01:01 +05:30
Weiwei Yang
52194351e7
YARN-8721. Relax NE node-attribute check when attribute doesn't exist on a node. Contributed by Sunil Govindan.
2018-09-12 16:01:01 +05:30
Naganarasimha
67ae81f0e0
YARN-7863. Modify placement constraints to support node attributes. Contributed by Sunil Govindan.
2018-09-12 16:01:01 +05:30
Naganarasimha
eb08543c7a
YARN-8103. Add CLI interface to query node attributes. Contributed by Bibin A Chundatt.
2018-09-12 16:01:01 +05:30
Sunil G
76183428b7
YARN-8351. Node attribute manager logs are flooding RM logs. Contributed by Weiwei Yang.
2018-09-12 16:01:00 +05:30
bibinchundatt
8cf6a9a2bd
YARN-7892. Revisit NodeAttribute class structure. Contributed by Naganarasimha G R.
2018-09-12 16:01:00 +05:30
Naganarasimha
5dc7d6e0f3
YARN-8104. Add API to fetch node to attribute mapping. Contributed by Bibin A Chundatt.
2018-09-12 16:01:00 +05:30
Naganarasimha
0a01b1350d
YARN-8100. Support API interface to query cluster attributes and attribute to nodes. Contributed by Bibin A Chundatt.
2018-09-12 16:01:00 +05:30
Sunil G
b9890d1f66
YARN-7875. Node Attribute store for storing and recovering attributes. Contributed by Bibin A Chundatt.
2018-09-12 16:01:00 +05:30
bibinchundatt
a6590c1f1f
YARN-8117. Fix TestRMWebServicesNodes test failure. Contributed by Bibin A Chundatt.
2018-09-12 16:01:00 +05:30
bibinchundatt
901e85238d
YARN-8033. CLI Integration with NodeAttributesManagerImpl. Contributed by Naganarasimha G R.
2018-09-12 16:01:00 +05:30
Sunil G
89b3ebd11e
YARN-8092. Expose Node Attributes info via RM nodes REST API. Contributed by Weiwei Yang.
2018-09-12 16:01:00 +05:30
Sunil G
440ff7f563
YARN-8094. Support configuration based Node Attribute provider. Contributed by Weiwei Yang.
2018-09-12 16:01:00 +05:30
Sunil G
6f4bc49c6d
YARN-7988. Refactor FSNodeLabelStore code for Node Attributes store support. Contributed by Bibin A Chundatt.
2018-09-12 16:01:00 +05:30
Naganarasimha
3b3b6efe21
YARN-7871. Node attributes reporting from NM to RM. Contributed by Weiwei Yang.
2018-09-12 16:01:00 +05:30
Naganarasimha
86d024ef2a
YARN-7965. NodeAttributeManager add/get API is not working properly. Contributed by Weiwei Yang.
2018-09-12 16:01:00 +05:30
Sunil G
ffcabd24c3
YARN-7856. Validate Node Attributes from NM. Contributed by Weiwei Yang.
2018-09-12 16:01:00 +05:30
Sunil G
2f7712be09
YARN-6858. Attribute Manager to store and provide node attributes in RM. Contributed by Naganarasimha G R.
2018-09-12 16:01:00 +05:30
Naganarasimha
d312b5cf9f
YARN-7757. Refactor NodeLabelsProvider to be more generic and reusable for node attributes providers. Contributed by Weiwei Yang.
2018-09-12 16:01:00 +05:30
Weiwei Yang
d9d93e3925
YARN-7842. PB changes to carry node-attributes in NM heartbeat. Contributed by Weiwei Yang.
2018-09-12 16:00:59 +05:30
Naganarasimha
1f42ce907a
YARN-6855. [YARN-3409] CLI Proto Modifications to support Node Attributes. Contributed by Naganarasimha G R.
2018-09-12 16:00:59 +05:30
Eric E Payne
987d8191ad
YARN-8709: CS preemption monitor always fails since one under-served queue was deleted. Contributed by Tao Yang.
2018-09-10 19:55:20 +00:00
Eric Yang
bf8a1750e9
YARN-8706. Updated docker container stop logic to avoid double kill.
...
Contributed by Chandni Singh
2018-09-07 20:18:09 -04:00
Eric Yang
7d62334387
YARN-8751. Reduce conditions that mark node manager as unhealthy.
...
Contributed by Craig Condit
2018-09-07 19:46:15 -04:00
Giovanni Matteo Fumarola
3dc2988a37
YARN-8699. Add Yarnclient#yarnclusterMetrics API implementation in router. Contributed by Bibin A Chundatt.
2018-09-07 11:32:03 -07:00
Giovanni Matteo Fumarola
9af96d4ed4
HADOOP-15707. Add IsActiveServlet to be used for Load Balancers. Contributed by Lukas Majercak.
2018-09-05 10:50:25 -07:00
Shane Kumpf
dffb7bfe6c
YARN-8638. Allow linux container runtimes to be pluggable. Contributed by Craig Condit
2018-09-05 06:47:54 -06:00
bibinchundatt
eed8415dc1
YARN-8535. Fix DistributedShell unit tests. Contributed by Abhishek Modi.
2018-09-02 13:35:52 +05:30
Shane Kumpf
73625168c0
YARN-8642. Add support for tmpfs mounts with the Docker runtime. Contributed by Craig Condit
2018-08-29 07:08:37 -06:00
Weiwei Yang
3fa4639421
YARN-8723. Fix a typo in CS init error message when resource calculator is not correctly set. Contributed by Abhishek Modi.
2018-08-29 11:13:44 +08:00
Giovanni Matteo Fumarola
7ed458b255
YARN-8697. LocalityMulticastAMRMProxyPolicy should fallback to random sub-cluster when cannot resolve resource. Contributed by Botong Huang.
2018-08-28 16:01:35 -07:00
Giovanni Matteo Fumarola
602d13844a
HADOOP-15699. Fix some of testContainerManager failures in Windows. Contributed by Botong Huang.
2018-08-27 12:25:46 -07:00
Billie Rinaldi
05b2bbeb35
YARN-8675. Remove default hostname for docker containers when net=host. Contributed by Suma Shivaprasad
2018-08-27 11:34:33 -07:00
Giovanni Matteo Fumarola
f152582562
YARN-8705. Refactor the UAM heartbeat thread in preparation for YARN-8696. Contributed by Botong Huang.
2018-08-27 10:32:22 -07:00
Jason Lowe
585ebd873a
YARN-8649. NPE in localizer hearbeat processing if a container is killed while localizing. Contributed by lujie
2018-08-23 09:29:46 -05:00
Sunil G
1ac01444a2
YARN-8015. Support all types of placement constraint support for Capacity Scheduler. Contributed by Weiwei Yang.
2018-08-23 10:05:43 +05:30
Weiwei Yang
9c3fc3ef28
YARN-7494. Add muti-node lookup mechanism and pluggable nodes sorting policies to optimize placement decision. Contributed by Sunil Govindan.
2018-08-21 22:42:28 +08:00
Weiwei Yang
54d0bf8935
YARN-8683. Support to display pending scheduling requests in RM app attempt page. Contributed by Tao Yang.
2018-08-21 19:00:31 +08:00
Rohith Sharma K S
d3fef7a5c5
YARN-8129. Improve error message for invalid value in fields attribute. Contributed by Abhishek Modi.
2018-08-21 11:58:07 +05:30
Giovanni Matteo Fumarola
e0f6ffdbad
YARN-8581. [AMRMProxy] Add sub-cluster timeout in LocalityMulticastAMRMProxyPolicy. Contributed by Botong Huang.
2018-08-20 14:33:16 -07:00