Commit Graph

5563 Commits

Author SHA1 Message Date
Sunil G 69dc329acc YARN-4901. QueueMetrics needs to be cleared before MockRM is initialized. Contributed by Peter Bacsko.
(cherry picked from commit 002dcc4ebf)
2019-12-08 14:40:45 -08:00
Szilard Nemeth 2ad7b90505 YARN-9993. Remove incorrectly committed files from YARN-9011. Contributed by Wilfred Spiegelenburg 2019-11-28 12:39:54 +01:00
Sunil G f9b872b6ec YARN-9949. Add missing queue configs for root queue in RMWebService#CapacitySchedulerInfo.
Contributed by Prabhu Joseph.
2019-11-27 23:14:33 +05:30
Szilard Nemeth 8eda9fcab8 YARN-9937. Add missing queue configs in RMWebService#CapacitySchedulerQueueInfo.
Contributed by Prabhu Joseph.
2019-11-27 22:24:06 +05:30
Szilard Nemeth 3fc8930129 YARN-9011. Race condition during decommissioning. Contributed by Peter Bacsko 2019-11-26 14:26:58 +01:00
HUAN-PING SU 59a6261e81
YARN-9966. Code duplication in UserGroupMappingPlacementRule (#1709)
(cherry picked from commit f8e36e03b4)
2019-11-25 15:29:37 +09:00
Szilard Nemeth dcc453b4b8 YARN-9968. Public Localizer is exiting in NodeManager due to NullPointerException. Contributed by Tarun Parimi 2019-11-22 12:59:35 +01:00
Tao Yang af495192a5 YARN-9838. Fix resource inconsistency for queues when moving app with reserved container to another queue. Contributed by jiulongzhu. 2019-11-22 16:14:16 +08:00
Eric Yang 6951689f4c YARN-9983. Fixed typo in YARN Service overview.
Contributed by Denes Gerencser
2019-11-19 14:19:49 -05:00
Abhishek Modi 31591bb296 YARN-9791. Queue Mutation API does not allow to remove a config. Contributed by Prabhu Joseph.
(cherry picked from commit 751b5a1ac8)
2019-11-19 16:49:14 +05:30
Sunil G b04f152876 YARN-9909. Offline Format of YarnConfigurationStore. Contributed by Prabhu Joseph 2019-11-19 16:10:02 +05:30
Sunil G c1ec51696c YARN-8373. RM Received RMFatalEvent of type CRITICAL_THREAD_CRASH. Contributed by Wilfred Spiegelenburg.
(cherry picked from commit ea68756c0c)
2019-11-19 14:12:03 +05:30
Sunil G 049279bb66 YARN-9984. FSPreemptionThread can cause NullPointerException while app is unregistered with containers running on a node. Contributed by Wilfred Spiegelenburg.
(cherry picked from commit 215f2052fc)
2019-11-19 14:05:11 +05:30
Jonathan Eagles 254e18dcaf Revert "YARN-9949. Add missing queue configs for root queue in RMWebService#CapacitySchedulerInfo. Contributed by Prabhu Joseph."
This reverts commit 11c763c220.
2019-11-05 15:10:01 -06:00
Sunil G 597b315811 YARN-9950. Unset Ordering Policy of Leaf/Parent queue converted from Parent/Leaf queue respectively. Contributed by Prabhu Joseph.
(cherry picked from commit 51e7d1b37e)
2019-11-04 23:28:39 +05:30
Sunil G 11c763c220 YARN-9949. Add missing queue configs for root queue in RMWebService#CapacitySchedulerInfo. Contributed by Prabhu Joseph.
(cherry picked from commit d462308e04)
2019-11-03 08:48:04 +05:30
Jonathan Hung 5d2ffcc7aa Make upstream aware of 2.10.0 release
(cherry picked from commit 7663db59c097c82eeed2df7a91168a4d7123c96b)
2019-10-30 20:59:20 -07:00
Eric Badger fa6b27ea8d YARN-9914. Use separate configs for free disk space checking for full and not-full disks. Contributed by Jim Brennan
(cherry picked from commit eef34f2d87)
2019-10-25 17:15:48 +00:00
Eric E Payne ea574087d1 YARN-9915: Fix FindBug issue in QueueMetrics. Contributed by Prabhu Joseph.
(cherry picked from commit 83d148074f)
2019-10-21 20:56:40 +00:00
Eric E Payne 23b72d8ae1 YARN-9773: Add QueueMetrics for Custom Resources. Contributed by Manikandan R.
(cherry picked from commit a5034c7988)
2019-10-16 21:13:02 +00:00
Sunil G 9672b81fa3 YARN-9900. Revert to previous state when Invalid Config is applied and Refresh Support in SchedulerConfig Format. Contributed by Prabhu Joseph.
(cherry picked from commit 090f73a9aa)
2019-10-16 18:15:34 +05:30
Haibo Chen 3a5474c61e YARN-8842. Expose metrics for custom resource types in QueueMetrics. (Contributed by Szilard Nemeth)
(cherry picked from commit 84e22a6af4)
2019-10-15 22:14:33 +00:00
Haibo Chen 1344823d4d YARN-8750. Refactor TestQueueMetrics. (Contributed by Szilard Nemeth)
(cherry picked from commit e60b797c88)
2019-10-15 15:32:01 +00:00
Szilard Nemeth b10fdd136a YARN-8453. Additional Unit tests to verify queue limit and max-limit with multiple resource types. Contributed by Adam Antal 2019-10-15 13:24:59 +02:00
Akira Ajisaka eb4bd54938
YARN-7243. Moving logging APIs over to slf4j in hadoop-yarn-server-resourcemanager. (#1634)
Signed-off-by: Akira Ajisaka <aajisaka@apache.org>
(cherry picked from commit e40e2d6ad5)

 Conflicts:
	hadoop-yarn-project/hadoop-yarn/hadoop-yarn-server/hadoop-yarn-server-resourcemanager/src/main/java/org/apache/hadoop/yarn/server/resourcemanager/rmnode/RMNodeImpl.java
	hadoop-yarn-project/hadoop-yarn/hadoop-yarn-server/hadoop-yarn-server-resourcemanager/src/main/java/org/apache/hadoop/yarn/server/resourcemanager/scheduler/AbstractYarnScheduler.java
	hadoop-yarn-project/hadoop-yarn/hadoop-yarn-server/hadoop-yarn-server-resourcemanager/src/main/java/org/apache/hadoop/yarn/server/resourcemanager/scheduler/capacity/CapacityScheduler.java
	hadoop-yarn-project/hadoop-yarn/hadoop-yarn-server/hadoop-yarn-server-resourcemanager/src/main/java/org/apache/hadoop/yarn/server/resourcemanager/scheduler/capacity/LeafQueue.java
	hadoop-yarn-project/hadoop-yarn/hadoop-yarn-server/hadoop-yarn-server-resourcemanager/src/main/java/org/apache/hadoop/yarn/server/resourcemanager/volume/csi/VolumeManagerImpl.java
	hadoop-yarn-project/hadoop-yarn/hadoop-yarn-server/hadoop-yarn-server-resourcemanager/src/main/java/org/apache/hadoop/yarn/server/resourcemanager/volume/csi/lifecycle/VolumeImpl.java
	hadoop-yarn-project/hadoop-yarn/hadoop-yarn-server/hadoop-yarn-server-resourcemanager/src/test/java/org/apache/hadoop/yarn/server/resourcemanager/ApplicationMasterServiceTestBase.java
	hadoop-yarn-project/hadoop-yarn/hadoop-yarn-server/hadoop-yarn-server-resourcemanager/src/test/java/org/apache/hadoop/yarn/server/resourcemanager/TestApplicationMasterLauncher.java
	hadoop-yarn-project/hadoop-yarn/hadoop-yarn-server/hadoop-yarn-server-resourcemanager/src/test/java/org/apache/hadoop/yarn/server/resourcemanager/TestApplicationMasterServiceInterceptor.java
	hadoop-yarn-project/hadoop-yarn/hadoop-yarn-server/hadoop-yarn-server-resourcemanager/src/test/java/org/apache/hadoop/yarn/server/resourcemanager/TestResourceManager.java
	hadoop-yarn-project/hadoop-yarn/hadoop-yarn-server/hadoop-yarn-server-resourcemanager/src/test/java/org/apache/hadoop/yarn/server/resourcemanager/rmapp/TestRMAppTransitions.java
	hadoop-yarn-project/hadoop-yarn/hadoop-yarn-server/hadoop-yarn-server-resourcemanager/src/test/java/org/apache/hadoop/yarn/server/resourcemanager/scheduler/fair/TestFairSchedulerConfiguration.java
2019-10-10 15:27:54 +09:00
Szilard Nemeth da35a22083 Revert "YARN-9128. Use SerializationUtils from apache commons to serialize / deserialize ResourceMappings. Contributed by Zoltan Siegl"
This reverts commit 42177e8b78.
2019-10-09 19:58:46 +02:00
Szilard Nemeth 05966ce204 YARN-9552. FairScheduler: NODE_UPDATE can cause NoSuchElementException. Contributed by Peter Bacsko. 2019-10-09 14:18:06 +02:00
Szilard Nemeth 42177e8b78 YARN-9128. Use SerializationUtils from apache commons to serialize / deserialize ResourceMappings. Contributed by Zoltan Siegl
(cherry picked from commit 6f1ab95168)
2019-10-09 13:28:01 +02:00
Szilard Nemeth 0ddb48a303 YARN-9356. Add more tests to ratio method in TestResourceCalculator. Contributed by Zoltan Siegl
(cherry picked from commit 35f093f5b3)
2019-10-09 13:12:03 +02:00
Sunil G 9cb3ab058f YARN-9873. Mutation API Config Change need to update Version Number. Contributed by Prabhu Joseph
(cherry picked from commit be901f4962)
2019-10-09 15:54:09 +05:30
Jonathan Hung ad3c98456d YARN-9760. Support configuring application priorities on a workflow level. Contributed by Varun Saxena
(cherry picked from commit eebd313d76ed742fe82292bd8c0184970cdc5692)
2019-10-08 11:17:05 -07:00
Szilard Nemeth 9b4aba49d1 YARN-6715. Fix documentation about NodeHealthScriptRunner. Contributed by Peter Bacsko 2019-10-08 17:41:46 +02:00
Sunil G 5704f15589 Revert "YARN-9873. Mutation API Config Change updates Version Number. Contributed by Prabhu Joseph"
This reverts commit 3a0afcfb7f.
2019-10-05 09:16:04 +05:30
Sunil G 3a0afcfb7f YARN-9873. Mutation API Config Change updates Version Number. Contributed by Prabhu Joseph
(cherry picked from commit 4510970e2f)
2019-10-04 21:49:49 +05:30
Sunil G 312cfa994a YARN-9801. SchedConfCli does not work wiwith https mode. Contributed by Prabhu Joseph
(cherry picked from commit 99cd7572f1)
2019-10-01 20:07:18 +05:30
bibinchundatt 5cd6eb2a18 YARN-9858. Optimize RMContext getExclusiveEnforcedPartitions. Contributed by Jonathan Hung. 2019-10-01 16:04:58 +05:30
Sunil G 52f815d39d YARN-9864. Format CS Configuration present in Configuration Store. Contributeed by Prabhu Joseph
(cherry picked from commit 137546a78a)
2019-10-01 09:09:22 +05:30
Jonathan Hung f4f210d2e5 Addendum to YARN-9730. Support forcing configured partitions to be exclusive based on app node label
(cherry picked from commit d86a1acc866cbda845fb3896dc824baf12217383)
2019-09-25 17:49:37 -07:00
Jonathan Hung 806c7b7dfb YARN-9730. Support forcing configured partitions to be exclusive based on app node label
(cherry picked from commit 73a044a63822303f792183244e25432528ecfb1e)
2019-09-24 13:51:54 -07:00
Jonathan Hung a1fa9a8a7f YARN-9762. Add submission context label to audit logs. Contributed by Manoj Kumar
(cherry picked from commit 3d78b1223d)
2019-09-23 13:12:57 -07:00
Sunil G 3e0025d877 YARN-9833. Race condition when DirectoryCollection.checkDirs() runs during container launch. Contributed by Peter Bacsko.
(cherry picked from commit c474e24c0b)
2019-09-18 09:22:48 +05:30
Weiwei Yang 7ec229244a YARN-2255. YARN Audit logging not added to log4j.properties. Contributed by Aihua Xu.
(cherry picked from commit f8c14326ee)
2019-09-18 09:19:38 +08:00
Eric Yang 345ef049df YARN-9837. Fixed reading YARN Service JSON spec file larger than 128k.
Contributed by Tarun Parimi

(cherry picked from commit eefe9bc85c)
2019-09-17 13:22:53 -04:00
Jonathan Hung 1dbf87c9ff YARN-9824. Fall back to configured queue ordering policy class name
(cherry picked from commit f8f8598ea5)
2019-09-10 15:26:57 -07:00
bibinchundatt e10050678d YARN-8948. PlacementRule interface should be for all YarnSchedulers. Contributed by Bibin A Chundatt.
(cherry picked from commit a68d766e87)
2019-09-09 19:03:40 -07:00
Abhishek Modi f6cc887f35 YARN-9821. NM hangs at serviceStop when ATSV2 Backend Hbase is Down. Contributed by Prabhu Joseph. 2019-09-09 15:44:45 +05:30
Jonathan Hung 1f0449ddfb YARN-9820. RM logs InvalidStateTransitionException when app is submitted. Contributed by Prabhu Joseph 2019-09-09 00:24:17 -07:00
Jonathan Hung 45220d1157 YARN-9764. Print application submission context label in application summary. Contributed by Manoj Kumar
(cherry picked from commit 43e389b980)
2019-09-08 19:11:47 -07:00
Rohith Sharma K S 7d5bb2ebb7 Preparing for 3.2.2-SNAPSHOT development. 2019-09-07 08:52:08 +05:30
Wangda Tan 0e77347972 YARN-9813. RM does not start on JDK11 when UIv2 is enabled. (Adam Antal/Eric Yang via wangda)
Change-Id: I18b8edc930b2efa0652f59c246931ad0d46827f3
(cherry picked from commit 34b82e6da0)
2019-09-06 19:19:05 -07:00
Tao Yang 9ee257e353 YARN-8995. Log events info in AsyncDispatcher when event queue size cumulatively reaches a certain number every time(addendum). Contributed by Jonathan Hung. 2019-09-07 08:25:15 +08:00
Tao Yang cfce39023d YARN-9795. ClusterMetrics to include AM allocation delay. Contributed by Fengnan Li. 2019-09-07 07:53:46 +08:00
Jonathan Hung 9c9ff07249 YARN-9763. Print application tags in application summary. Contributed by Manoj Kumar 2019-09-06 10:49:43 -07:00
Jonathan Hung 1f685efc73 YARN-9761. Allow overriding application submissions based on server side configs. Contributed by Pralabh Kumar 2019-09-06 10:13:12 -07:00
Jonathan Hung 79ca399a30 YARN-9810. Add queue capacity/maxcapacity percentage metrics. Contributed by Shubham Gupta
(cherry picked from commit 0ccf4b0fe1)
2019-09-05 14:05:45 -07:00
Billie Rinaldi 66627749d0 YARN-9718. Fixed yarn.service.am.java.opts shell injection. Contributed by Eric Yang
(cherry picked from commit 2e2e5401f2)
2019-09-05 12:54:20 -07:00
Rohith Sharma K S 4d9c5300e2 YARN-8567. Fetching yarn logs fails for long running application if it is not present in timeline store. Contributed by Tarun Parimi. 2019-09-05 18:16:35 +05:30
Eric Yang b87a727ff4 YARN-9374. Improve Timeline service resilience when HBase is unavailable.
Contributed by Prabhu Joseph and Szilard Nemeth
2019-09-05 16:32:18 +05:30
Eric Yang 02779cdc3a YARN-8499 ATSv2 Generalize TimelineStorageMonitor.
Contributed by Prabhu Joseph
2019-09-05 16:31:35 +05:30
Eric Yang 6110af2d1d YARN-7537. Add ability to load hbase config from distributed file system.
Contributed by Prabhu Joseph
2019-09-05 16:28:04 +05:30
Vrushali C bcacb57114 YARN-9335 [atsv2] Restrict the number of elements held in timeline collector when backend is unreachable for async calls. Contributed by Abhishesk Modi. 2019-09-05 16:27:39 +05:30
Vrushali C 6acc1a2bd0 YARN-9382 Publish container killed, paused and resumed events to ATSv2. Contributed by Abhishesk Modi. 2019-09-05 15:39:38 +05:30
Vrushali C f52a88fdc8 YARN-9303 Username splits won't help timelineservice.app_flow table. Contributed by Prabhu Joseph. 2019-09-05 15:39:38 +05:30
Giovanni Matteo Fumarola 998aa3de2c YARN-9418. ATSV2 /apps//entities/YARN_CONTAINER rest api does not show metrics. Contributed by Prabhu Joseph. 2019-09-05 15:39:38 +05:30
Rohith Sharma K S 8de93fca3c YARN-9389. FlowActivity and FlowRun table prefix is wrong. Contributed by Prabhu Joseph. 2019-09-05 15:39:38 +05:30
Rohith Sharma K S 0ccc5a2695 YARN-9387. Update document for ATS HBase Custom tablenames (-entityTableName). Contributed by Prabhu Joseph. 2019-09-05 15:39:38 +05:30
Vrushali C d451ff7534 YARN-3841 [atsv2 Storage implementation] Adding retry semantics to HDFS backing storage. Contributed by Abhishek Modi. 2019-09-05 15:39:38 +05:30
Vrushali C 66e1599761 YARN-3879 [Storage implementation] Create HDFS backing storage implementation for ATS reads. Contributed by Abhishek Modi. 2019-09-05 15:39:38 +05:30
Tao Yang 6f9764076a YARN-8995. Log events info in AsyncDispatcher when event queue size cumulatively reaches a certain number every time. Contributed by zhuqi. 2019-09-05 16:53:16 +08:00
Vrushali C 84a9c3f999 YARN-5336 Limit the flow name size & consider cleanup for hex chars. Contributed by Sushil Ks 2019-09-05 12:43:02 +05:30
Rohith Sharma K S 108c569e3b YARN-6735. Have a way to turn off container metrics from NMs. Contributed by Abhishek Modi. 2019-09-05 12:42:06 +05:30
Rohith Sharma K S 5345508fa3 YARN-6149. Allow port range to be specified while starting NM Timeline collector manager. Contributed by Abhishek Modi. 2019-09-05 12:38:37 +05:30
Suma Shivaprasad 0a6f90d4fc YARN-9034. ApplicationCLI should have option to take clusterId. Contributed by Rohith Sharma K S. 2019-09-05 12:38:07 +05:30
Rohith Sharma K S 4a4a892d32 YARN-7754. [Atsv2] Update document for running v1 and v2 TS. Contributed by Suma Shivaprasad. 2019-09-05 12:31:37 +05:30
Rohith Sharma K S a3496a368b YARN-8871. Document ATSv2 integrated LogWebService. Contributed by Suma Shivaprasad. 2019-09-05 12:30:50 +05:30
Rohith Sharma K S 252afdc8e6 YARN-9804. Update ATSv2 document for latest feature supports. 2019-09-05 09:00:22 +05:30
Zhankun Tang 269aa7ebfe YARN-9785. Fix DominantResourceCalculator when one resource is zero. Contributed by Bibin A Chundatt, Sunil Govindan, Bilwa S T.
(cherry picked from commit bb26514ba9)
2019-09-03 15:02:15 +08:00
bibinchundatt 1e6095f16b YARN-9797. LeafQueue#activateApplications should use resourceCalculator#fitsIn. Contributed by Bilwa S T.
(cherry picked from commit 03489124ea)
2019-09-03 11:55:13 +05:30
Akira Ajisaka a453f38015 YARN-9162. Fix TestRMAdminCLI#testHelp. Contributed by Ayush Saxena.
(cherry picked from commit 5db7c49062)
2019-08-30 17:52:07 -07:00
Rohith Sharma K S 2fc4123fe0 YARN-9714. ZooKeeper connection in ZKRMStateStore leaks after RM transitioned to standby. Contributed by Tao Yang. 2019-08-30 10:36:23 +05:30
Rohith Sharma K S 7616495fb7 YARN-9796. Fix ASF license issue in branch-3.2. Contributed by Prabhu Joseph. 2019-08-29 12:01:38 +05:30
Rohith Sharma K S 81c0809463 YARN-9640. Slow event processing could cause too many attempt unregister events. Contributed by Bibin A Chundatt. 2019-08-29 09:30:20 +05:30
Eric E Payne d562050cec YARN-9756: Create metric that sums total memory/vcores preempted per round. Contributed by Manikandan R (manirajv06). 2019-08-28 20:53:43 +00:00
Jonathan Hung f36ccf0ac1 YARN-9438. launchTime not written to state store for running applications
(cherry picked from commit 9568656cd21d9c02168e18ce35c6726077bbf3a1)
2019-08-27 15:54:22 -07:00
Akira Ajisaka 2d8799f4bc HADOOP-15832. Upgrade BouncyCastle to 1.60. Contributed by Robert Kanter. 2019-08-27 19:08:39 +00:00
Jonathan Hung e4249c3202 YARN-9775. RMWebServices /scheduler-conf GET returns all hadoop configurations for ZKConfigurationStore. Contributed by Prabhu Joseph
(cherry picked from commit 8660e48ca1)
2019-08-26 15:51:38 -07:00
bibinchundatt 7f20c31e31 YARN-9642. Fix Memory Leak in AbstractYarnScheduler caused by timer. Contributed by Bibin A Chundatt.
(cherry picked from commit d3ce53e507)
2019-08-26 23:23:49 +05:30
Rohith Sharma K S ab98f91638 YARN-8917. Absolute (maximum) capacity of level3+ queues is wrongly calculated for absolute resource. Contributed by Tao Yang. 2019-08-26 21:13:02 +05:30
Szilard Nemeth 6980f1740f YARN-9217. Nodemanager will fail to start if GPU is misconfigured on the node or GPU drivers missing. Contributed by Peter Bacsko 2019-08-21 16:49:34 +02:00
Szilard Nemeth a83718f130 YARN-9100. Add tests for GpuResourceAllocator and do minor code cleanup. Contributed by Peter Bacsko 2019-08-16 15:24:44 +02:00
Szilard Nemeth df616370f0 YARN-8586. Extract log aggregation related fields and methods from RMAppImpl. Contributed by Peter Bacsko 2019-08-16 11:52:51 +02:00
Szilard Nemeth 8fee3808c5 YARN-9749. TestAppLogAggregatorImpl#testDFSQuotaExceeded fails on trunk. Contributed by Adam Antal
(cherry picked from commit 2a05e0ff3b)
2019-08-16 08:52:34 +02:00
Szilard Nemeth e616037d1f YARN-9488. Skip YARNFeatureNotEnabledException from ClientRMService. Contributed by Prabhu Joseph
(cherry picked from commit 1845a83cec)
2019-08-15 17:16:06 +02:00
Adam Antal d5446b3a23 YARN-9676. Add DEBUG and TRACE level messages to AppLogAggregatorImpl… (#1261)
* YARN-9676. Add DEBUG and TRACE level messages to AppLogAggregatorImpl and connected classes

* Using {} placeholder, and increasing loglevel if log aggregation failed.

(cherry picked from commit c89bdfacc8)
2019-08-14 17:36:41 +02:00
Szilard Nemeth 4bb238c480 YARN-9133. Make tests more easy to comprehend in TestGpuResourceHandler. Contributed by Peter Bacsko 2019-08-14 17:16:54 +02:00
Szilard Nemeth 4dc477b606 YARN-9140. Code cleanup in ResourcePluginManager.initialize and in TestResourcePluginManager. Contributed by Peter Bacsko 2019-08-14 17:01:41 +02:00
Szilard Nemeth 9a87e74e54 YARN-9134. No test coverage for redefining FPGA / GPU resource types in TestResourceUtils. Contributed by Peter Bacsko 2019-08-14 16:46:34 +02:00
Eric Badger cec71691be YARN-9442. container working directory has group read permissions. Contributed by Jim Brennan.
(cherry picked from commit 2ac029b949)

Conflicts:
	hadoop-yarn-project/hadoop-yarn/hadoop-yarn-server/hadoop-yarn-server-nodemanager/src/main/native/container-executor/test/test-container-executor.c
2019-08-13 16:34:29 +00:00
Szilard Nemeth c5aea8ca56 YARN-9723. ApplicationPlacementContext is not required for terminated jobs during recovery. Contributed by Prabhu Joseph
(cherry picked from commit e4b538bbda)
2019-08-12 15:16:18 +02:00
Szilard Nemeth 844259203f YARN-9451. AggregatedLogsBlock shows wrong NM http port. Contributed by Prabhu Joseph
(cherry picked from commit b91099efd6)
2019-08-12 15:06:16 +02:00
Szilard Nemeth b20fd9e212 YARN-9135. NM State store ResourceMappings serialization are tested with Strings instead of real Device objects. Contributed by Peter Bacsko 2019-08-12 14:02:17 +02:00
Sunil G 02b4635ff0 YARN-9729. [UI2] Fix error message for logs when ATSv2 is offline. Contributed by Zoltan Siegl.
(cherry picked from commit 1c5b28659f)
2019-08-11 11:49:25 +05:30
Szilard Nemeth 2e6beb1550 Logging fileSize of log files under NM Local Dir. Contributed by Prabhu Joseph
(cherry picked from commit 54ac80176e)
2019-08-09 13:20:10 +02:00
Sunil G 9fb6c6e2a1 YARN-9715. [UI2] yarn-container-log URI need to be encoded to avoid potential misuses. Contributed by Akhil PB.
(cherry picked from commit acffec7a92)
2019-08-09 16:07:04 +05:30
Szilard Nemeth 3e9071207a SUBMARINE-57. Add more elaborate message if submarine command is not recognized. Contributed by Adam Antal
(cherry picked from commit e5f4cd0fda)
2019-08-09 12:14:49 +02:00
Adam Antal 4c4f7d9c80 YARN-9124. Resolve contradiction in ResourceUtils: addMandatoryResources / checkMandatoryResources work differently (#1121)
(cherry picked from commit cbcada804d)
2019-08-09 11:43:30 +02:00
Szilard Nemeth 02d0e54596 YARN-9092. Create an object for cgroups mount enable and cgroups mount path as they belong together. Contributed by Gergely Pollak
(cherry picked from commit e0c21c6da9)
2019-08-09 10:23:10 +02:00
Szilard Nemeth f0dfb8b832 YARN-9096: Some GpuResourcePlugin and ResourcePluginManager methods are synchronized unnecessarily. Contributed by Gergely Pollak
(cherry picked from commit 742e30b473)
2019-08-09 10:02:35 +02:00
Szilard Nemeth 3bcf44f070 YARN-9094: Remove unused interface method: NodeResourceUpdaterPlugin#handleUpdatedResourceFromRM. Contributed by Gergely Pollak
(cherry picked from commit 72d7e570a7)
2019-08-09 09:50:32 +02:00
Eric E Payne e47c483d9f YARN-9685: NPE when rendering the info table of leaf queue in non-accessible partitions. Contributed by Tao Yang.
(cherry picked from commit 3b38f2019e)
2019-08-08 12:54:31 +00:00
Haibo Chen 8d357343c4 YARN-9559. Create AbstractContainersLauncher for pluggable ContainersLauncher logic. (Contributed by Jonathan Hung)
(cherry picked from commit f51702d539)
2019-08-06 14:59:49 -07:00
Eric E Payne 168dc3f258 YARN-9596: QueueMetrics has incorrect metrics when labelled partitions are involved. Contributed by Muhammad Samir Khan.
(cherry picked from commit 42683aef1a)
2019-07-30 19:19:33 +00:00
Jonathan Hung 15344006bc YARN-9668. UGI conf doesn't read user overridden configurations on RM and NM startup. (Contributed by Jonanthan Hung) 2019-07-22 10:46:45 -07:00
Weiwei Yang bf3d9f6282 YARN-9682. Wrong log message when finalizing the upgrade. Contributed by kyungwan nam.
(cherry picked from commit 85d9111a88)
2019-07-17 10:47:25 +08:00
bibinchundatt 4866735cde YARN-9645. Fix Invalid event FINISHED_CONTAINERS_PULLED_BY_AM at NEW on NM restart. Contributed by Bilwa S T.
(cherry picked from commit 7a93be0f60)
2019-07-16 14:06:36 +05:30
Szilard Nemeth 7c9cfc0996 YARN-9326. Fair Scheduler configuration defaults are not documented in case of min and maxResources. Contributed by Adam Antal
(cherry picked from commit 5446308360)
2019-07-15 13:30:58 +02:00
Szilard Nemeth 28d6a453a9 YARN-9127. Create more tests to verify GpuDeviceInformationParser. Contributed by Peter Bacsko
(cherry picked from commit 18ee1092b4)
2019-07-15 12:02:39 +02:00
Szilard Nemeth 2fcbdf4131 YARN-9337. Addendum to fix compilation error due to mockito spy call
(cherry picked from commit bb37c6cb7f)
2019-07-13 00:45:38 +02:00
Szilard Nemeth 4fa0de9f04 YARN-9626. UI2 - Fair scheduler queue apps page issues. Contributed by Zoltan Siegl
(cherry picked from commit 557056e18e)
2019-07-12 17:40:57 +02:00
Szilard Nemeth 0ede873090 YARN-9337. GPU auto-discovery script runs even when the resource is given by hand. Contributed by Adam Antal
(cherry picked from commit 61b0c2bb7c)
2019-07-12 17:29:47 +02:00
Szilard Nemeth c61c969668 YARN-9235. If linux container executor is not set for a GPU cluster GpuResourceHandlerImpl is not initialized and NPE is thrown. Contributed by Antal Balint Steinbach, Adam Antal
(cherry picked from commit c416284bb7)
2019-07-12 16:53:26 +02:00
Szilard Nemeth 3e3bbb7f5e YARN-9625. UI2 - No link to a queue on the Queues page for Fair Scheduler. Contributed by Zoltan Siegl
(cherry picked from commit 9cec023186)
2019-07-11 20:01:52 +02:00
Szilard Nemeth 4216090f19 YARN-9573. DistributedShell cannot specify LogAggregationContext. Contributed by Adam Antal. 2019-07-11 19:24:11 +02:00
bibinchundatt 5f8395f393 YARN-9557. Application fails in diskchecker when ReadWriteDiskValidator is configured. Contributed by Bilwa S T. 2019-07-10 10:34:39 +05:30
Szilard Nemeth 4638fa00fc YARN-9629. Support configurable MIN_LOG_ROLLING_INTERVAL. Contributed by Adam Antal.
(cherry picked from commit a2a8be18cb)
2019-07-04 10:26:29 +02:00
Sunil G d18986e4e8 YARN-9644. First RMContext object is always leaked during switch over. Contributed by Bibin A Chundatt. 2019-07-04 11:05:54 +05:30
Sunil G bea79e7645 YARN-9327. Improve synchronisation in ProtoUtils#convertToProtoFormat block. Contributed by Bibin A Chundatt.
(cherry picked from commit 0c8813f135)
2019-07-02 12:15:05 +05:30
Weiwei Yang c9bccaf148 YARN-9655. AllocateResponse in FederationInterceptor lost applicationPriority. Contributed by hunshenshi.
(cherry picked from commit 570eee30e5)
2019-07-02 10:05:22 +08:00
Erik Krogen 49d7bb6a92 HDFS-13286. [SBN read] Add haadmin commands to transition between standby and observer. Contributed by Chao Sun. 2019-06-28 14:20:01 -07:00
Eric Yang 860606fc67 YARN-9581. Add support for get multiple RM webapp URLs.
Contributed by Prabhu Joseph

(cherry picked from commit f02b0e1994)
2019-06-28 14:57:50 -04:00
bibinchundatt a2f4e4698b YARN-9639. DecommissioningNodesWatcher cause memory leak. Contributed by Bilwa S T.
(cherry picked from commit be80334cdf)
2019-06-27 10:04:40 +05:30
Weiwei Yang 1944a7d844 YARN-9209. When nodePartition is not set in Placement Constraints, containers are allocated only in default partition. Contributed by Tarun Parimi.
(cherry picked from commit 83dcb9d87e)
2019-06-21 17:52:22 +08:00
Wanqiang Ji f148b29508 YARN-9630. [UI2] Add a link in docs's top page
Signed-off-by: Masatake Iwasaki <iwasakims@apache.org>
(cherry picked from commit eb6be4643f)
2019-06-18 14:57:01 +09:00
Zhankun Tang 1e7201f9aa YARN-9584. Should put initializeProcessTrees method call before get pid. Contributed by Wanqiang Ji.
(cherry picked from commit 67414a1a80)
2019-06-18 13:18:27 +08:00
Inigo Goiri 65f7ec2f39 YARN-8856. TestTimelineReaderWebServicesHBaseStorage tests failing with NoClassDefFoundError. Contributed by Sushil Ks.
(cherry picked from commit eeaf8edaa7)
2019-06-13 14:22:16 -07:00
Sean Mackrory e0b3cbd221 HADOOP-16213. Update guava to 27.0-jre. Contributed by Gabor Bota. 2019-06-13 07:53:40 -06:00
Sunil G 253dcde517 YARN-9543. [UI2] Handle ATSv2 server down or failures cases gracefully in YARN UI v2. Contributed by Zoltan Siegl and Akhil P B.
(cherry picked from commit 52128e352a)
2019-06-12 19:25:02 +05:30
Sunil G 72203f7a12 YARN-9545. Create healthcheck REST endpoint for ATSv2. Contributed by Zoltan Siegl. 2019-06-12 19:23:40 +05:30
Sunil G f1ead03672 Revert "YARN-9545. Create healthcheck REST endpoint for ATSv2. Contributed by Zoltan Siegl."
This reverts commit f1d3a17d3e.
2019-06-12 19:10:23 +05:30
bibinchundatt 3303723f55 YARN-9547. ContainerStatusPBImpl default execution type is not returned. Contributed by Bilwa S T. 2019-06-11 23:42:29 +05:30
bibinchundatt d9284d4a57 YARN-9565. RMAppImpl#ranNodes not cleared on FinalTransition. Contributed by Bilwa S T.
(cherry picked from commit 60c95e9b6a)
2019-06-11 23:13:18 +05:30
bibinchundatt a37011bd5e YARN-9594. Fix missing break statement in ContainerScheduler#handle. Contributed by lujie.
(cherry picked from commit 6d80b9bc3f)

 Conflicts:
	hadoop-yarn-project/hadoop-yarn/hadoop-yarn-server/hadoop-yarn-server-nodemanager/src/main/java/org/apache/hadoop/yarn/server/nodemanager/containermanager/scheduler/ContainerScheduler.java
2019-06-11 23:01:03 +05:30
Eric Yang 68aec0a98d YARN-9581. Fixed yarn logs cli to access RM2.
Contributed by Prabhu Joseph

(cherry picked from commit cb9bc6e64c)
2019-06-06 16:43:25 -04:00
Sunil G f1d3a17d3e YARN-9545. Create healthcheck REST endpoint for ATSv2. Contributed by Zoltan Siegl. 2019-06-06 06:24:01 +05:30
Weiwei Yang 6e2b091515 YARN-9580. Fulfilled reservation information in assignment is lost when transferring in ParentQueue#assignContainers. Contributed by Tao Yang. 2019-06-04 15:24:37 +08:00
Sunil G 2f012044ff YARN-8906. [UI2] NM hostnames not displayed correctly in Node Heatmap Chart. Contributed by Akhil PB.
(cherry picked from commit 59719dc560)
2019-06-03 15:54:07 +05:30
Sunil G 58042dadc3 YARN-8947. [UI2] Active User info missing from UI2. Contributed by Akhil PB.
(cherry picked from commit 7f46dda513)
2019-06-03 12:25:16 +05:30
Weiwei Yang e027c87da2 YARN-9507. Fix NPE in NodeManager#serviceStop on startup failure. Contributed by Bilwa S T.
(cherry picked from commit 4530f4500d)
2019-06-03 14:15:20 +08:00
Eric Yang b2a39e8883 YARN-9542. Fix LogsCLI guessAppOwner ignores custome file format suffix.
Contributed by Prabhu Joseph
2019-05-29 18:04:13 -04:00
Eric E Payne 2e561cef47 YARN-8625. Aggregate Resource Allocation for each job is not present in ATS. Contributed by Prabhu Joseph.
(cherry picked from commit 3c63551101)
2019-05-29 18:43:13 +00:00
Ahmed Hussein 777f7345ef YARN-9563. Resource report REST API could return NaN or Inf (Ahmed Hussein via jeagles)
Signed-off-by: Jonathan Eagles <jeagles@gmail.com>
(cherry picked from commit abf76ac371)
2019-05-29 12:14:01 -05:00
Takanobu Asanuma a9a3450560 HADOOP-16331. Fix ASF License check in pom.xml. Contributed by Akira Ajisaka.
Signed-off-by: Takanobu Asanuma <tasanuma@apache.org>
2019-05-29 17:34:16 +09:00
Akira Ajisaka c917ba782e
YARN-9500. Fix typos in ResourceModel.md. Contributed by leiqiang.
(cherry picked from commit 4a692bc3be)
2019-05-28 16:54:43 +09:00
Akira Ajisaka 855dc997d6
HADOOP-16323. https everywhere in Maven settings. 2019-05-27 15:27:33 +09:00
bibinchundatt 71f5bfb822 YARN-9508. YarnConfiguration areNodeLabel enabled is costly in allocation flow. Contributed by Bilwa S T.
(cherry picked from commit 570fa2da20)
2019-05-15 13:31:07 +05:30
Sunil G f4ee38df29 YARN-9519. TFile log aggregation file format is not working for yarn.log-aggregation.TFile.remote-app-log-dir config. Contributed by Adam Antal.
(cherry picked from commit 7d831eca64)
2019-05-14 10:49:09 -07:00
Sunil G b5b152aded YARN-9504. [UI2] Fair scheduler queue view page does not show actual capacity. Contributed by Zoltan Siegl.
(cherry picked from commit 64c7f36ab1)
2019-05-10 14:28:33 +05:30
Haibo Chen c6573562cb YARN-9529. Log correct cpu controller path on error while initializing CGroups. (Contributed by Jonathan Hung)
(cherry picked from commit 597fa47ad1)
2019-05-06 11:58:31 -07:00
Eric E Payne 6fce24fb40 YARN-9285: RM UI progress column is of wrong type. Contributed by Ahmed Hussein.
(cherry picked from commit b094b94d43)
2019-05-02 19:48:06 +00:00
Weiwei Yang cc0c85f04a YARN-9325. TestQueueManagementDynamicEditPolicy fails intermittent. Contributed by Prabhu Joseph.
(cherry picked from commit 1c8046d67e)
2019-04-23 14:24:15 +08:00
Eric Yang ac85aa80d9 YARN-8587. Added retries for fetching docker exit code.
Contributed by Charo Zhang

(cherry picked from commit c16c49b8c3)
2019-04-19 15:40:23 -04:00
Eric Yang 4a64dab0dd YARN-8622. Fixed container-executor compilation on MacOSX.
Contributed by Siyao Meng

(cherry picked from commit ef97a20831)
2019-04-18 19:01:11 -04:00
Eric Yang 2503409977 YARN-6695. Fixed NPE in publishing appFinished events to ATSv2.
Contributed by Prabhu Joseph

(cherry picked from commit df76cdc895)
2019-04-18 12:30:55 -04:00
Siyao Meng 742a3ad24b YARN-9487. NodeManager native build shouldn't link against librt on macOS. Contributed by Siyao Meng.
Signed-off-by: Wei-Chiu Chuang <weichiu@apache.org>
(cherry picked from commit 6e4399ea61)
2019-04-17 22:57:33 -07:00
Weiwei Yang db185de31c YARN-9463. Add queueName info when failing with queue capacity sanity check. Contributed by Aihua Xu.
(cherry picked from commit 8c1bba375b)
2019-04-10 23:02:24 +08:00
Weiwei Yang 7a80b1b481 YARN-9413. Queue resource leak after app fail for CapacityScheduler. Contributed by Tao Yang.
(cherry picked from commit ec143cbf67)
2019-04-06 20:19:03 +08:00
Weiwei Yang 06cf8ce30f YARN-9394. Use new API of RackResolver to get better performance. Contributed by Lantao Jin.
(cherry picked from commit 945e8c6064)
2019-04-04 18:41:04 +08:00
Eric Yang 10642a6205 YARN-9391. Fixed node manager environment leaks into Docker containers.
Contributed by Jim Brennan

(cherry picked from commit 3c45762a0b)
2019-03-25 15:54:52 -04:00
Sunil G aff5973401 YARN-8803. [UI2] Show flow runs in the order of recently created time in graph widgets. Contributed by Akhil PB.
(cherry picked from commit c79f139519)
2019-03-06 16:49:49 +05:30
Sunil G d721634fea YARN-9138. Improve test coverage for nvidia-smi binary execution of GpuDiscoverer. Contributed by Szilard Nemeth.
(cherry picked from commit 46045c5cb3)
2019-03-06 16:01:56 +05:30
bibinchundatt 63ed16e076 Revert "YARN-8132. Final Status of applications shown as UNDEFINED in ATS app queries. Contributed by Prabhu Joseph"
This reverts commit cf1944eb6e.
2019-03-04 17:01:40 +05:30
Weiwei Yang 4ceb4e4f05 YARN-9332. RackResolver tool should accept multiple hosts. Contributed by Lantao Jin.
(cherry picked from commit fe6b2b2f23e69f0643e870d9c500117088983209)
2019-03-02 16:04:24 +00:00
Sunil G d045f02a8d YARN-9139. Simplify initializer code of GpuDiscoverer. Contributed by Szilard Nemeth. 2019-03-01 19:27:03 +05:30
Eric Yang 3f3548b66a YARN-9334. Allow YARN Service client to send SPNEGO challenge header when authentication type is not simple.
Contributed by Billie Rinaldi

(cherry picked from commit 04b228e43b)
2019-02-28 09:33:05 -08:00
Weiwei Yang 7575e3090d YARN-9324. TestSchedulingRequestContainerAllocation(Async) fails with junit-4.11. Contributed by Prabhu Joseph. 2019-02-28 09:32:07 +08:00
Weiwei Yang 7fa5373ec4 YARN-9248. RMContainerImpl:Invalid event: ACQUIRED at KILLED. Contributed by lujie.
(cherry picked from commit 8c30114b00)
2019-02-27 17:35:09 +08:00
Sunil G 809e3f2453 YARN-9121. Replace GpuDiscoverer.getInstance() to a readable object for easy access control. Contributed by Szilard Nemeth.
(cherry picked from commit 5e91ebd91a)
2019-02-27 12:03:58 +05:30
Sunil G a95a0cbf2f YARN-9087. Improve logging for initialization of Resource plugins. Contributed by Szilard Nemeth. 2019-02-27 11:54:43 +05:30
Weiwei Yang bdde6a612e YARN-9329. updatePriority is blocked when using FairScheduler. Contributed by Jiandan Yang.
(cherry picked from commit 3e1739d589)
2019-02-26 00:18:24 +08:00
Sunil G 359e459df1 YARN-9168. DistributedShell client timeout should be -1 by default. Contributed by Zhankun Tang.
(cherry picked from commit 6cec90653d)
2019-02-25 15:29:31 +05:30
Sunil G f282f9c362 YARN-9213. RM Web UI v1 does not show custom resource allocations for containers page. Contributed by Szilard Nemeth. 2019-02-25 11:37:42 +05:30
Weiwei Yang cdce1c17a0 YARN-9316. TestPlacementConstraintsUtil#testInterAppConstraintsByAppID fails intermittently. Contributed by Prabhu Joseph.
(cherry picked from commit 9cd5c5447f)
2019-02-24 22:48:55 +08:00
Weiwei Yang 604a915bab YARN-9238. Avoid allocating opportunistic containers to previous/removed/non-exist application attempt. Contributed by lujie.
(cherry picked from commit 9c88695bcd)
2019-02-24 22:21:53 +08:00
bibinchundatt 3e1bd53a37 YARN-9317. Avoid repeated YarnConfiguration#timelineServiceV2Enabled check. Contributed by Prabhu Joseph 2019-02-23 07:59:51 +05:30
bibinchundatt cf1944eb6e YARN-8132. Final Status of applications shown as UNDEFINED in ATS app queries. Contributed by Prabhu Joseph 2019-02-22 20:51:47 +05:30
Sunil G d75aa33612 YARN-9118. Handle exceptions with parsing user defined GPU devices in GpuDiscoverer. Contributed by Szilard Nemeth.
(cherry picked from commit 95fbbfed75)
2019-02-22 20:23:24 +05:30
Weiwei Yang fb2735de16 YARN-9258. Support to specify allocation tags without constraint in distributed shell CLI. Contributed by Prabhu Joseph.
(cherry picked from commit 6c4ab0312b)
2019-02-22 00:23:21 +08:00
Weiwei Yang c2ef443359 YARN-9315. TestCapacitySchedulerMetrics fails intermittently. Contributed by Prabhu Joseph. 2019-02-21 18:06:26 +08:00
bibinchundatt e6f2b8730f YARN-9286. [Timeline Server] Sorting based on FinalStatus shows pop-up message. Contributed by Bilwa S T.
(cherry picked from commit b8de78c570)
2019-02-20 01:20:15 +05:30
Sunil G a060e8cb51 YARN-7824. [UI2] Yarn Component Instance page should include link to container logs. Contributed by Akhil PB. 2019-02-17 20:19:46 +05:30
Adam Antal 830aaac023
YARN-9283. Javadoc of LinuxContainerExecutor#addSchedPriorityCommand has a wrong property name as reference
Signed-off-by: Akira Ajisaka <aajisaka@apache.org>
(cherry picked from commit 9385ec45d7)
2019-02-15 18:48:21 +09:00
Sunil G b4dc62a120 YARN-8295. [UI2] Improve Resource Usage tab error message when there are no data available. Contributed by Charan Hebri.
(cherry picked from commit 5b55f3538c)
2019-02-15 12:43:01 +05:30
Akira Ajisaka f0b476bb31
YARN-9284. Fix the unit of yarn.service.am-resource.memory in the document. Contributed by Masahiro Tanaka.
(cherry picked from commit 3a39d9a2d2)
2019-02-15 15:43:32 +09:00
bibinchundatt a8563d23f5 YARN-9295. [UI2] Fix label typo in Cluster Overview page. Contributed by Charan Hebri.
(cherry picked from commit b66d5ae9e2)
2019-02-14 23:10:21 +05:30
Sunil G d321d0e747 YARN-7761. [UI2] Clicking 'master container log' or 'Link' next to 'log' under application's appAttempt goes to Old UI's Log link. Contributed by Akhil PB. 2019-02-14 20:55:43 +05:30
Weiwei Yang 4d4968646a YARN-9252. Allocation Tag Namespace support in Distributed Shell. Contributed by Prabhu Joseph.
(cherry picked from commit 2b7f828d46)
2019-02-12 16:31:38 +08:00
Weiwei Yang 9cfa5ce0c1 YARN-9253. Add UT to verify Placement Constraint in Distributed Shell. Contributed by Prabhu Joseph.
(cherry picked from commit 711d22f166)
2019-02-12 16:22:34 +08:00
Giovanni Matteo Fumarola 314c559971 YARN-9191. Add cli option in DS to support enforceExecutionType in resource requests. Contributed by Abhishek Modi.
(cherry picked from commit f738b397ae)
2019-02-12 14:21:06 +08:00
Weiwei Yang fbd03543d8 YARN-8555. Parameterize TestSchedulingRequestContainerAllocation(Async) to cover both PC handler options. Contributed by Prabhu Joseph.
(cherry picked from commit 0a1637c750)
2019-02-11 15:56:34 +08:00
Eric Yang 3d552b24ba YARN-8761. Service AM support for decommissioning component instances.
Contributed by Billie Rinaldi

(cherry picked from commit 4c465f5535)
2019-02-10 09:26:27 -08:00
Masatake Iwasaki 6229469574 YARN-9282. Typo in javadoc of class LinuxContainerExecutor: hadoop.security.authetication should be 'authentication'. Contributed by Charan Hebri.
(cherry picked from commit e0ab1bdece)
2019-02-09 00:28:59 +09:00
Sunil G fbc08145cf YARN-9257. Distributed Shell client throws a NPE for a non-existent queue. Contributed by Charan Hebri. 2019-02-08 11:22:44 +05:30
Eric E Payne 55dde827e6 YARN-7171: RM UI should sort memory / cores numerically. Contributed by Ahmed Hussein
(cherry picked from commit d1ca9432dd)
2019-02-07 16:47:15 +00:00
Vinayakumar B e2b91b2ccb YARN-8498. Yarn NodeManager OOM Listener Fails Compilation on Ubuntu 18.04. Contributed by Ayush Saxena. 2019-02-07 13:03:42 +05:30
Wangda Tan 649da5af04 Make upstream aware of 3.1.2 release
Change-Id: I397bc6ef75498726df4763bd07a8bf8fe1c38365
(cherry picked from commit 308f3168fa)
2019-02-05 14:05:51 -08:00
Weiwei Yang 3a51e50fd8 YARN-9275. Add link to NodeAttributes doc in PlacementConstraints document. Contributed by Masatake Iwasaki.
(cherry picked from commit aa7ce50e55)
2019-02-05 16:09:15 +08:00
Weiwei Yang b64e9df949 YARN-9262. TestRMAppAttemptTransitions is failing with an NPE. Contributed by lujie.
(cherry picked from commit 28ad20a711)
2019-02-04 14:00:30 +05:30
Sunil G 99876a5ab8 YARN-9206. RMServerUtils does not count SHUTDOWN as an accepted state. Contributed by Kuhu Shukla.
(cherry picked from commit 604b2489a9)
2019-02-04 12:49:06 +05:30
Weiwei Yang a0fafbc3ef YARN-9263. TestConfigurationNodeAttributesProvider fails after Mockito updated. Contributed by Weiwei Yang.
(cherry picked from commit f20b043a02)
2019-02-04 12:45:40 +05:30
Sunil G 0e7060a1d5 YARN-9099. GpuResourceAllocator#getReleasingGpus calculates number of GPUs in a wrong way. Contributed by Szilard Nemeth.
(cherry picked from commit 71c49fa60f)
2019-01-31 09:26:07 +05:30
Eric E Payne 4052b7ee60 YARN-6616: YARN AHS shows submitTime for jobs same as startTime. Contributed by Prabhu Joseph
(cherry picked from commit 04105bbfdb)
2019-01-29 17:52:54 +00:00
Weiwei Yang 6b8dd8d113 YARN-9237. NM should ignore sending finished apps to RM during RM fail-over. Contributed by Jiandan Yang.
(cherry picked from commit 4f63ffe444)
2019-01-29 10:42:09 +08:00
Eric Yang 8c332affa8 YARN-8901. Fixed restart policy NEVER/ON_FAILURE with component dependency.
Contributed by Suma Shivaprasad

(cherry picked from commit f5a95f7998)
2019-01-28 18:12:04 -05:00
Jonathan Hung bf760e7e81 YARN-9222. Print launchTime in ApplicationSummary
(cherry picked from commit 6cace58e21)
2019-01-25 13:23:37 -08:00
Sunil G 4aea4b7c9c YARN-8961. [UI2] Flow Run End Time shows 'Invalid date'. Contributed by Akhil PB
(cherry picked from commit c726445990)
2019-01-24 15:02:21 +05:30
Weiwei Yang bc6374f282 YARN-9205. When using custom resource type, application will fail to run due to the CapacityScheduler throws InvalidResourceRequestException(GREATER_THEN_MAX_ALLOCATION). Contributed by Zhankun Tang. 2019-01-23 18:10:28 +08:00
Weiwei Yang 8ad7711605 YARN-8101. Add UT to verify node-attributes in RM nodes rest API. Contributed by Prabhu Joseph.
(cherry picked from commit 721d5c2a5f)
2019-01-23 18:07:45 +08:00
Weiwei Yang 9114489566 YARN-9210. RM nodes web page can not display node info. Contributed by Jiandan Yang.
(cherry picked from commit d43df31751)
2019-01-22 10:46:37 +08:00
Weiwei Yang ac2f4b64f9 YARN-9204. RM fails to start if absolute resource is specified for partition capacity in CS queues. Contributed by Jiandan Yang.
(cherry picked from commit abde1e1f58)
2019-01-21 21:20:01 +08:00
Wangda Tan fe7cb2d84a YARN-9194. Invalid event: REGISTERED and LAUNCH_FAILED at FAILED, and NullPointerException happens in RM while shutdown a NM. (lujie via wangda)
Change-Id: I4359f59a73a278a941f4bb9d106dd38c9cb471fe
(cherry picked from commit 6d7eedfd28)
2019-01-17 15:13:42 -08:00
rahul3 4aee7946dc
YARN-9203. Fix typos in yarn-default.xml.
This closes #437

Signed-off-by: Akira Ajisaka <aajisaka@apache.org>
(cherry picked from commit 0a46baecd3)
2019-01-17 16:19:49 +09:00
Akira Ajisaka a415446705
YARN-8747. [UI2] YARN UI2 page loading failed due to js error under some time zone configuration. Contributed by collinma.
(cherry picked from commit 104ef5df36)
2019-01-16 14:37:43 +09:00
Wangda Tan 1dc2b49bfd YARN-8822. Nvidia-docker v2 support for YARN GPU feature. (Charo Zhang via wangda)
Change-Id: Ib8044307a4241f6b1b7b9b8266b9256f39b16384
2019-01-07 12:21:33 -08:00
Eric Yang bba76b6f31 HADOOP-16031. Fixed TestSecureLogins unit test. Contributed by Akira Ajisaka 2019-01-07 13:24:01 -05:00
Weiwei Yang 2b549e32e1 YARN-9173. FairShare calculation broken for large values after YARN-8833. Contributed by Wilfred Spiegelenburg.
(cherry picked from commit 944cf87223)
2019-01-07 16:05:57 +08:00
Wangda Tan cffe5c1ba0 YARN-9160. [Submarine] Document 'PYTHONPATH' environment variable setting when using -localization options. (Zhankun Tang via wangda)
Change-Id: I115b6710eb11501e21ab3e9e7cd1a87f322e5cf8
(cherry picked from commit 2c02aa6ec2)
2019-01-06 11:10:47 -08:00
Wangda Tan 1ad1ab5d7a YARN-9141. [Submarine] JobStatus outputs with system UTC clock, not local clock. (Zac Zhou via wangda)
Change-Id: Ib2b17cf7860ddc9188f06498086de3326bc09519
(cherry picked from commit 8f004fe4e0)
2019-01-06 11:10:41 -08:00
Weiwei Yang a24cca11f2 YARN-9164. Shutdown NM may cause NPE when opportunistic container scheduling is enabled. Contributed by lujie.
(cherry picked from commit cfe89e6f96)
2019-01-04 01:04:39 +08:00
Sunil G 9bfea98746 YARN-9090. [Submarine] Update submarine installation script document. Contributed by Xun Liu.
(cherry picked from commit 742fd978b5)
2018-12-26 08:27:50 +05:30
Akira Ajisaka d6baf9d5bf
YARN-9154. Fix itemization in YARN service quickstart document. Contributed by Ayush Saxena.
(cherry picked from commit e98e963e51)
2018-12-22 13:26:41 +09:00
Weiwei Yang 7deef08eb8 YARN-8925. Updating distributed node attributes only when necessary. Contributed by Tao Yang. 2018-12-21 16:31:03 +08:00
Eric Yang 29c9c8a893 YARN-9126. Fix container clean up for reinitialization.
Contributed by Chandni Singh

(cherry picked from commit e815fd9c49)
2018-12-19 14:58:19 -05:00
Eric Yang 4e1cef3625 YARN-9084. Reset container state and defer readiness check for upgrade.
Contributed by Chandni Singh

(cherry picked from commit ccdd982e51)
2018-12-18 18:05:40 -05:00
Eric Yang 28ca14e71b YARN-9040. Fixed memory leak in LevelDBCacheTimelineStore and DBIterator.
Contributed by Tarun Parimi

(cherry picked from commit 71e0b0d800)
2018-12-17 12:08:09 -05:00
Akira Ajisaka 71d8ffb551
YARN-9122. Add table of contents to YARN Service API document. Contributed by Zhankun Tang.
(cherry picked from commit 26e55d4ceb)
2018-12-15 10:30:37 +09:00
Eric Yang 52aafb9789 YARN-9125. Fixed Carriage Return detection in Docker container launch command.
Contributed by Billie Rinaldi

(cherry picked from commit b2d7204ed0)
2018-12-14 17:55:10 -05:00
Wangda Tan 29e4e5f62c YARN-9112. [Submarine] Support polling applicationId when it's not ready in cluster. (Zhankun Tang via wangda)
Change-Id: I73d73f3d631b28fb9866faa56571839b13824a97
(cherry picked from commit 9fba6cc247)
2018-12-12 11:52:16 -08:00
Wangda Tan ef0f835537 YARN-8714. [Submarine] Support files/tarballs to be localized for a training job. (Zhankun Tang via wangda)
Change-Id: I845131273e52a9d81dbc813ea6d4af06b205e334
(cherry picked from commit c771fe6e10)
2018-12-12 11:52:07 -08:00
Weiwei Yang 2b3c3d2a32 YARN-9009. Fix flaky test TestEntityGroupFSTimelineStore.testCleanLogs. Contributed by OrDTesters.
(cherry picked from commit 1c09a10e96)
2018-12-10 12:07:23 +08:00
Jonathan Hung 3ab6ea7aca YARN-9085. Add Guaranteed and MaxCapacity to CSQueueMetrics
(cherry picked from commit 978ab3e958227220cb6f1a08ae6e7cdb8a46628b)
2018-12-07 10:45:47 -08:00
Eric Yang 8c70728f7f YARN-9071. Improved status update for reinitialized containers.
Contributed by Chandni Singh

(cherry picked from commit 1b790f4dd1)
2018-12-05 19:04:55 -05:00
Wangda Tan 559cb99bf6 Revert "YARN-8870. [Submarine] Add submarine installation scripts. (Xun Liu via wangda)"
This reverts commit 920401b296.
2018-12-04 14:13:39 -08:00
Jonathan Hung 6b01e4d2a8 YARN-9036. Escape newlines in health report in YARN UI. Contributed by Keqiu Hu
(cherry picked from commit 1c8bd7128c99d8215ef16438bd2ce6b1f025a966)
2018-11-30 10:16:00 -08:00
Eric Yang a8f67ad7c9 YARN-9067. Fixed Resource Manager resource leak via YARN service.
Contributed by Eric Yang
2018-11-29 13:50:06 -05:00
bibinchundatt 183ec39c4b YARN-9069. Fix SchedulerInfo#getSchedulerType for custom schedulers. Contributed by Bilwa S T.
(cherry picked from commit 07142f54a8)
2018-11-29 22:16:32 +05:30
Wangda Tan ee9deb6e9c YARN-9030. Log aggregation changes to handle filesystems which do not support setting permissions. (Suma Shivaprasad via wangda)
Change-Id: I80f1e8196b8624e24d74494719fdedfd7061dced
(cherry picked from commit 9de8e8d049)
2018-11-28 15:06:35 -08:00
Jason Lowe df0e7766e4 YARN-8812. Containers fail during creating a symlink which started with hyphen for a resource file. Contributed by Oleksandr Shevchenko
(cherry picked from commit 3ce99e32f7)
2018-11-28 08:50:18 -06:00
Eric Yang db8b2a130c YARN-9056. Improved YARN service upgrade state logic for readiness check.
Contributed by Chandni Singh

(cherry picked from commit f657a2a661)
2018-11-27 19:45:23 -05:00
Eric Yang 838190482d YARN-8986. Added port publish for Docker container running with bridge.
Contributed by Charo Zhang
2018-11-27 14:27:13 -05:00
Weiwei Yang 01477108d9 YARN-8984. AMRMClient#OutstandingSchedRequests leaks when AllocationTags is null or empty. Contributed by Yang Wang.
(cherry picked from commit 176bb3f812)
2018-11-22 17:04:16 +08:00