Hadoop YARN Change Log
|
||
|
||
Release 2.7.3 - UNRELEASED
|
||
|
||
INCOMPATIBLE CHANGES
|
||
|
||
NEW FEATURES
|
||
|
||
IMPROVEMENTS
|
||
|
||
YARN-4287. Capacity Scheduler: Rack Locality improvement. (Nathan Roberts via wangda)
|
||
|
||
YARN-4492. Add documentation for preemption supported in Capacity
|
||
scheduler (Naganarasimha G R via jlowe)
|
||
|
||
OPTIMIZATIONS
|
||
|
||
BUG FIXES
|
||
|
||
YARN-4326. Fix TestDistributedShell timeout as AHS in MiniYarnCluster no longer
|
||
binds to default port 8188. (Meng Ding via wangda)
|
||
|
||
YARN-4347. Resource manager fails with Null pointer exception. (Jian He via wangda)
|
||
|
||
YARN-4374. RM capacity scheduler UI rounds user limit factor (Chang Li via
|
||
jlowe)
|
||
|
||
YARN-3849. Too much preemption activity causing continuous killing of containers
|
||
across queues. (Sunil G via wangda)
|
||
|
||
YARN-3769. Consider user limit when calculating total pending resource for
|
||
preemption policy in Capacity Scheduler. (Eric Payne via wangda)
|
||
|
||
YARN-4344. NMs reconnecting with changed capabilities can lead to wrong
|
||
cluster resource calculations (Varun Vasudev via jlowe)
|
||
|
||
YARN-4365. FileSystemNodeLabelStore should check for root dir existence on
|
||
startup (Kuhu Shukla via jlowe)
|
||
|
||
YARN-4380. TestResourceLocalizationService.testDownloadingResourcesOnContainerKill
|
||
fails intermittently. (Varun Saxena via ozawa)
|
||
|
||
YARN-4398. Remove unnecessary synchronization in RMStateStore. (Ning Ding via jianhe)
|
||
|
||
YARN-3840. Resource Manager web ui issue when sorting application by id
|
||
(with application having id > 9999) (Mohammad Shahid Khan & Varun Saxena
|
||
via jianhe)
|
||
|
||
YARN-4422. Generic AHS sometimes doesn't show started, node, or logs on App page
|
||
(Eric Payne via jeagles)
|
||
|
||
YARN-4439. Clarify NMContainerStatus#toString method. (Jian He via xgong)
|
||
|
||
YARN-4452. NPE when submitting an unmanaged application. (Naganarasimha G R via
|
||
junping_du)
|
||
|
||
YARN-4546. ResourceManager crash due to scheduling opportunity overflow.
|
||
(Jason Lowe via junping_du)
|
||
|
||
YARN-3695. ServerProxy (NMProxy, etc.) shouldn't retry forever for non
|
||
network exception. (Raju Bairishetti via jianhe)
|
||
|
||
YARN-4414. Nodemanager connection errors are retried at multiple levels
|
||
(Chang Li via jlowe)
|
||
|
||
YARN-4121. Fix typos in capacity scheduler documentation.
|
||
(Kai Sasaki via vvasudev)
|
||
|
||
YARN-4581. AHS writer thread leak makes RM crash while RM is recovering.
|
||
(sandflee via junping_du)
|
||
|
||
YARN-4610. Reservations continue looking for one app causes other apps to
|
||
starve (jlowe)
|
||
|
||
YARN-4623. TestSystemMetricsPublisher#testPublishAppAttemptMetricsForUnmanagedAM
|
||
fails with NPE on branch-2.7. (Naganarasimha G R via junping_du)
|
||
|
||
YARN-4598. Invalid event: RESOURCE_FAILED at
|
||
CONTAINER_CLEANEDUP_AFTER_KILL (tangshangwen via jlowe)
|
||
|
||
Release 2.7.2 - UNRELEASED
|
||
|
||
INCOMPATIBLE CHANGES
|
||
|
||
NEW FEATURES
|
||
|
||
IMPROVEMENTS
|
||
|
||
YARN-4009. CORS support for ResourceManager REST API. (Varun Vasudev via jeagles)
|
||
|
||
YARN-3170. YARN architecture document needs updating. (Brahma Reddy Battula
|
||
via ozawa)
|
||
|
||
YARN-3967. Fetch the application report from the AHS if the RM does not know about it.
|
||
(Mit Desai via xgong)
|
||
|
||
YARN-2801. Add documentation for node labels feature. (Wangda Tan and Naganarasimha
|
||
G R via ozawa)
|
||
|
||
YARN-2513. Host framework UIs in YARN for use with the ATS (jeagles)
|
||
|
||
OPTIMIZATIONS
|
||
|
||
BUG FIXES
|
||
|
||
YARN-3793. Several NPEs when deleting local files on NM recovery (Varun
|
||
Saxena via jlowe)
|
||
|
||
YARN-3508. Prevent processing preemption events on the main RM dispatcher.
|
||
(Varun Saxena via wangda)
|
||
|
||
YARN-3690. [JDK8] 'mvn site' fails. (Brahma Reddy Battula via aajisaka)
|
||
|
||
YARN-3905. Application History Server UI NPEs when accessing apps run after
|
||
RM restart (Eric Payne via jeagles)
|
||
|
||
YARN-3535. Scheduler must re-request container resources when RMContainer transitions
|
||
from ALLOCATED to KILLED (rohithsharma and peng.zhang via asuresh)
|
||
|
||
YARN-3878. AsyncDispatcher can hang while stopping if it is configured for
|
||
draining events on stop. (Varun Saxena via jianhe)
|
||
|
||
YARN-3969. Allow jobs to be submitted to reservation that is active
|
||
but does not have any allocations. (subru via curino)
|
||
|
||
YARN-3925. ContainerLogsUtils#getContainerLogFile fails to read container
|
||
log files from full disks. (zhihai xu via jlowe)
|
||
|
||
YARN-3857. Memory leak in ResourceManager with SIMPLE mode.
|
||
(mujunchao via zxu)
|
||
|
||
YARN-3893. Both RM in active state when Admin#transitionToActive failure
|
||
from refreshAll() (Bibin A Chundatt via rohithsharmaks)
|
||
|
||
YARN-4103. RM WebServices missing scheme for appattempts logLinks.
|
||
(Jonathan Eagles via vvasudev)
|
||
|
||
YARN-4105. Capacity Scheduler headroom for DRF is wrong (Chang Li via
|
||
jlowe)
|
||
|
||
YARN-4096. App local logs are leaked if log aggregation fails to initialize
|
||
for the app. (Jason Lowe via zxu)
|
||
|
||
YARN-4153. TestAsyncDispatcher failed at branch-2.7 (Zhihai Xu via jianhe)
|
||
|
||
YARN-3697. FairScheduler: ContinuousSchedulingThread can fail to shutdown.
|
||
(Zhihai Xu via kasha)
|
||
|
||
YARN-4158. Remove duplicate close for LogWriter in
|
||
AppLogAggregatorImpl#uploadLogsForContainers (Zhihai Xu via jlowe)
|
||
|
||
YARN-3975. WebAppProxyServlet should not redirect to RM page if AHS is
|
||
enabled (Mit Desai via jlowe)
|
||
|
||
YARN-3624. ApplicationHistoryServer should not reverse the order of the
|
||
filters it gets. (Mit Desai via xgong)
|
||
|
||
YARN-4180. AMLauncher does not retry on failures when talking to NM.
|
||
(adhoot)
|
||
|
||
YARN-3619. ContainerMetrics unregisters during getMetrics and leads to
|
||
ConcurrentModificationException (Zhihai Xu via jlowe)
|
||
|
||
YARN-4209. RMStateStore FENCED state doesn’t work due to updateFencedState called
|
||
by stateMachine.doTransition. (Zhihai Xu via rohithsharmaks)
|
||
|
||
YARN-4281. 2.7 RM app page is broken (Chang Li via jlowe)
|
||
|
||
YARN-4000. RM crashes with NPE if leaf queue becomes parent queue during restart.
|
||
(Varun Saxena via jianhe)
|
||
|
||
YARN-4041. Slow delegation token renewal can severely prolong RM recovery
|
||
(Sunil G via jlowe)
|
||
|
||
YARN-2902. Killing a container that is localizing can orphan resources in
|
||
the DOWNLOADING state (Varun Saxena via jlowe)
|
||
|
||
YARN-4313. Race condition in MiniMRYarnCluster when getting history server
|
||
address. (Jian He via xgong)
|
||
|
||
YARN-3580. [JDK8] TestClientRMService.testGetLabelsToNodes fails. (Robert Kanter
|
||
via junping_du)
|
||
|
||
YARN-4312. TestSubmitApplicationWithRMHA fails on branch-2.7 and branch-2.6
|
||
as some of the test cases time out. (Varun Saxena via ozawa)
|
||
|
||
YARN-4320. TestJobHistoryEventHandler fails as AHS in MiniYarnCluster no longer
|
||
binds to default port 8188. (Varun Saxena via ozawa)
|
||
|
||
YARN-4321. Incessant retries if NoAuthException is thrown by Zookeeper in non
|
||
HA mode. (Varun Saxena via jianhe)
|
||
|
||
YARN-3136. getTransferredContainers can be a bottleneck during AM registration.
|
||
(Sunil G via jianhe)
|
||
|
||
YARN-4127. RM fails with noAuth error if switched from failover mode to non-failover
|
||
mode. (Varun Saxena via jianhe)
|
||
|
||
YARN-4354. Public resource localization fails with NPE. (Jason Lowe via
|
||
junping_du)
|
||
|
||
Release 2.7.1 - 2015-07-06
|
||
|
||
INCOMPATIBLE CHANGES
|
||
|
||
NEW FEATURES
|
||
|
||
IMPROVEMENTS
|
||
|
||
YARN-3243. CapacityScheduler should pass headroom from parent to children
|
||
to make sure ParentQueue obey its capacity limits. (Wangda Tan via jianhe)
|
||
|
||
YARN-3539. Updated timeline server documentation and marked REST APIs evolving.
|
||
(Steve Loughran via zjshen)
|
||
|
||
YARN-3723. Need to clearly document primaryFilter and otherInfo value type.
|
||
(Zhijie Shen via xgong)
|
||
|
||
YARN-3489. RMServerUtils.validateResourceRequests should only obtain queue
|
||
info once. (Varun Saxena via wangda)
|
||
|
||
YARN-3711. Documentation of ResourceManager HA should explain configurations
|
||
about listen addresses. (Masatake Iwasaki via ozawa)
|
||
|
||
OPTIMIZATIONS
|
||
|
||
YARN-3006. Improve the error message when attempting manual failover with
|
||
auto-failover enabled. (Akira AJISAKA via wangda)
|
||
|
||
YARN-3469. ZKRMStateStore: Avoid setting watches that are not required.
|
||
(Jun Gong via kasha)
|
||
|
||
BUG FIXES
|
||
|
||
YARN-3462. Patches applied for YARN-2424 are inconsistent between
|
||
trunk and branch-2. (Naganarasimha G R via harsh)
|
||
|
||
YARN-3497. ContainerManagementProtocolProxy modifies IPC timeout conf
|
||
without making a copy. (Jason Lowe via jianhe)
|
||
|
||
YARN-2605. [RM HA] Rest api endpoints doing redirect incorrectly.
|
||
(Xuan Gong via stevel)
|
||
|
||
YARN-3522. Fixed DistributedShell to instantiate TimeLineClient as the
|
||
correct user. (Zhijie Shen via jianhe)
|
||
|
||
YARN-3351. AppMaster tracking URL is broken in HA. (Anubhav Dhoot via kasha)
|
||
|
||
YARN-3382. Some of UserMetricsInfo metrics are incorrectly set to root
|
||
queue metrics. (Rohit Agarwal via jianhe)
|
||
|
||
YARN-3472. Fixed possible leak in DelegationTokenRenewer#allTokens.
|
||
(Rohith Sharmaks via jianhe)
|
||
|
||
YARN-3465. Use LinkedHashMap to preserve order of resource requests.
|
||
(Zhihai Xu via kasha)
|
||
|
||
YARN-3516. killing ContainerLocalizer action doesn't take effect when
|
||
private localizer receives FETCH_FAILURE status. (zhihai xu via xgong)
|
||
|
||
YARN-3485. FairScheduler headroom calculation doesn't consider
|
||
maxResources for Fifo and FairShare policies. (kasha)
|
||
|
||
YARN-3301. Fixed the format issue of the new RM attempt web page.
|
||
(Xuan Gong via jianhe)
|
||
|
||
YARN-3385. Fixed a race-condition in ResourceManager's ZooKeeper based
|
||
state-store to avoid crashing on duplicate deletes. (Zhihai Xu via vinodkv)
|
||
|
||
YARN-3358. Audit log not present while refreshing Service ACLs.
|
||
(Varun Saxena via devaraj)
|
||
|
||
YARN-3476. Nodemanager can fail to delete local logs if log aggregation
|
||
fails (Rohith via jlowe)
|
||
|
||
YARN-3434. Interaction between reservations and userlimit can result in
|
||
significant ULF violation. (Thomas Graves via wangda)
|
||
|
||
YARN-3626. On Windows localized resources are not moved to the front
|
||
of the classpath when they should be. (Craig Welch via xgong)
|
||
|
||
YARN-3457. NPE when NodeManager.serviceInit fails and stopRecoveryStore called.
|
||
(Bibin A Chundatt via ozawa)
|
||
|
||
YARN-3537. NPE when NodeManager.serviceInit fails and stopRecoveryStore
|
||
invoked (Brahma Reddy Battula via jlowe)
|
||
|
||
YARN-3601. Fix UT TestRMFailover.testRMWebAppRedirect. (Weiwei Yang via xgong)
|
||
|
||
YARN-3677. Fix findbugs warnings in yarn-server-resourcemanager.
|
||
(Vinod Kumar Vavilapalli via ozawa)
|
||
|
||
YARN-3681. yarn cmd says "could not find main class 'queue'" in windows.
|
||
(Craig Welch and Varun Saxena via xgong)
|
||
|
||
YARN-3609. Load node labels from storage inside RM serviceStart. (Wangda
|
||
Tan via jianhe)
|
||
|
||
YARN-3694. Fix dead link for TimelineServer REST API.
|
||
(Jagadesh Kiran N via aajisaka)
|
||
|
||
YARN-3646. Applications are getting stuck sometimes in case of retry
|
||
policy forever. (Raju Bairishetti via devaraj)
|
||
|
||
YARN-3675. FairScheduler: RM quits when node removal races with
|
||
continuous-scheduling on the same node. (Anubhav Dhoot via kasha)
|
||
|
||
YARN-3701. Isolating the error of generating a single app report when
|
||
getting all apps from generic history service. (Zhijie Shen via xgong)
|
||
|
||
YARN-2238. filtering on UI sticks even if I move away from the page.
|
||
(Jian He via xgong)
|
||
|
||
YARN-3686. CapacityScheduler should trim default_node_label_expression.
|
||
(Sunil G via wangda)
|
||
|
||
YARN-3753. RM failed to come up with "java.io.IOException: Wait for
|
||
ZKClient creation timed out". (Jian He via xgong)
|
||
|
||
YARN-3764. CapacityScheduler should forbid moving LeafQueue from one parent
|
||
to another. (Wangda Tan via jianhe)
|
||
|
||
YARN-3804. Both RM are on standBy state when kerberos user not in yarn.admin.acl
|
||
(Varun Saxena via xgong)
|
||
|
||
YARN-3842. NMProxy should retry on NMNotYetReadyException.
|
||
(Robert Kanter via kasha)
|
||
|
||
YARN-3809. Failed to launch new attempts because
|
||
ApplicationMasterLauncher's threads all hang (Jun Gong via jlowe)
|
||
|
||
Release 2.7.0 - 2015-04-20
|
||
|
||
INCOMPATIBLE CHANGES
|
||
|
||
NEW FEATURES
|
||
|
||
YARN-2179. [YARN-1492] Initial cache manager structure and context.
|
||
(Chris Trezzo via kasha)
|
||
|
||
YARN-2180. [YARN-1492] In-memory backing store for cache manager.
|
||
(Chris Trezzo via kasha)
|
||
|
||
YARN-2183. [YARN-1492] Cleaner service for cache manager.
|
||
(Chris Trezzo and Sangjin Lee via kasha)
|
||
|
||
YARN-2186. [YARN-1492] Node Manager uploader service for cache manager.
|
||
(Chris Trezzo and Sangjin Lee via kasha)
|
||
|
||
YARN-2236. [YARN-1492] Shared Cache uploader service on the Node
|
||
Manager. (Chris Trezzo and Sangjin Lee via kasha)
|
||
|
||
YARN-2188. [YARN-1492] Client service for cache manager.
|
||
(Chris Trezzo and Sangjin Lee via kasha)
|
||
|
||
YARN-2189. [YARN-1492] Admin service for cache manager.
|
||
(Chris Trezzo via kasha)
|
||
|
||
YARN-2765. Added leveldb-based implementation for RMStateStore. (Jason Lowe
|
||
via jianhe)
|
||
|
||
YARN-2203. [YARN-1492] Web UI for cache manager. (Chris Trezzo via kasha)
|
||
|
||
YARN-2738. [YARN-2574] Add FairReservationSystem for FairScheduler.
|
||
(Anubhav Dhoot via kasha)
|
||
|
||
YARN-2881. [YARN-2574] Implement PlanFollower for FairScheduler.
|
||
(Anubhav Dhoot via kasha)
|
||
|
||
YARN-2427. Added the API of moving apps between queues in RM web services.
|
||
(Varun Vasudev via zjshen)
|
||
|
||
YARN-2217. [YARN-1492] Shared cache client side changes.
|
||
(Chris Trezzo via kasha)
|
||
|
||
YARN-2616. [YARN-913] Add CLI client to the registry to list, view
|
||
and manipulate entries. (Akshay Radia via stevel)
|
||
|
||
YARN-2994. Document work-preserving RM restart. (Jian He via ozawa)
|
||
|
||
YARN-2786. Created a yarn cluster CLI and seeded with one command for listing
|
||
node-labels collection. (Wangda Tan via vinodkv)
|
||
|
||
IMPROVEMENTS
|
||
|
||
YARN-3005. [JDK7] Use switch statement for String instead of if-else
|
||
statement in RegistrySecurity.java (Kengo Seki via aajisaka)
|
||
|
||
YARN-2950. Change message to mandate, not suggest JS requirement on UI.
|
||
(Dustin Cote via harsh)
|
||
|
||
YARN-2891. Failed Container Executor does not provide a clear error
|
||
message. (Dustin Cote via harsh)
|
||
|
||
YARN-1979. TestDirectoryCollection fails when the umask is unusual.
|
||
(Vinod Kumar Vavilapalli and Tsuyoshi OZAWA via junping_du)
|
||
|
||
YARN-2641. Decommission nodes on -refreshNodes instead of next
|
||
NM-RM heartbeat. (Zhihai Xu via kasha)
|
||
|
||
YARN-2742. FairSchedulerConfiguration should allow extra spaces
|
||
between value and unit. (Wei Yan via kasha)
|
||
|
||
YARN-2712. TestWorkPreservingRMRestart: Augment FS tests with
|
||
queue and headroom checks. (Tsuyoshi Ozawa via kasha)
|
||
|
||
YARN-2735. diskUtilizationPercentageCutoff and diskUtilizationSpaceCutoff
|
||
are initialized twice in DirectoryCollection. (Zhihai Xu via kasha)
|
||
|
||
YARN-570. Time strings are formatted in different timezones.
|
||
(Akira Ajisaka and Peng Zhang via kasha)
|
||
|
||
YARN-2780. Log aggregated resource allocation in rm-appsummary.log (Eric
|
||
Payne via jlowe)
|
||
|
||
YARN-2690. [YARN-2574] Make ReservationSystem and its dependent classes
|
||
independent of Scheduler type. (Anubhav Dhoot via kasha)
|
||
|
||
YARN-2157. Added YARN metrics in the documentation. (Akira AJISAKA via
|
||
jianhe)
|
||
|
||
YARN-2802. ClusterMetrics to include AM launch and register delays.
|
||
(Zhihai Xu via kasha)
|
||
|
||
YARN-2375. Allow enabling/disabling timeline server per framework.
|
||
(Mit Desai via jeagles)
|
||
|
||
YARN-2604. Scheduler should consider max-allocation-* in conjunction
|
||
with the largest node. (Robert Kanter via kasha)
|
||
|
||
YARN-2679. Add metric for container launch duration. (Zhihai Xu via kasha)
|
||
|
||
YARN-2669. FairScheduler: queue names shouldn't allow periods
|
||
(Wei Yan via Sandy Ryza)
|
||
|
||
YARN-2404. Removed ApplicationAttemptState and ApplicationState class in
|
||
RMStateStore. (Tsuyoshi OZAWA via jianhe)
|
||
|
||
YARN-2165. Added the sanity check for the numeric configuration values of
|
||
the timeline service. (Vasanth kumar RJ via zjshen)
|
||
|
||
YARN-2907. SchedulerNode#toString should print all resource detail instead
|
||
of only memory. (Rohith via junping_du)
|
||
|
||
YARN-2136. Changed RMStateStore to ignore store operations when fenced.
|
||
(Varun Saxena via jianhe)
|
||
|
||
YARN-1156. Enhance NodeManager AllocatedGB and AvailableGB metrics
|
||
for aggregation of decimal values. (Tsuyoshi OZAWA via junping_du)
|
||
|
||
YARN-2056. Disable preemption at Queue level (Eric Payne via jlowe)
|
||
|
||
YARN-2762. Fixed RMAdminCLI to trim and check node-label related arguments
|
||
before sending to RM. (Rohith Sharmaks via jianhe)
|
||
|
||
YARN-2972. DelegationTokenRenewer thread pool never expands. (Jason Lowe
|
||
via junping_du)
|
||
|
||
YARN-2949. Add documentation for CGroups (Varun Vasudev via junping_du)
|
||
|
||
YARN-2970. NodeLabel operations in RMAdmin CLI get missing in help command.
|
||
(Varun Saxena via junping_du)
|
||
|
||
YARN-2837. Support TimeLine server to recover delegation token when
|
||
restarting. (Zhijie Shen via jianhe)
|
||
|
||
YARN-2993. Several fixes (missing acl check, error log msg ...) and some
|
||
refinement in AdminService. (Yi Liu via junping_du)
|
||
|
||
YARN-2943. Added node-labels page on RM web UI. (Wangda Tan via jianhe)
|
||
|
||
YARN-2998. Abstract out scheduler independent PlanFollower components.
|
||
(Anubhav Dhoot via kasha)
|
||
|
||
YARN-2360. Fair Scheduler: Display dynamic fair share for queues on the
|
||
scheduler page. (Ashwin Shankar and Wei Yan via kasha)
|
||
|
||
YARN-2880. Added a test to make sure node labels will be recovered
|
||
if RM restart is enabled. (Rohith Sharmaks via jianhe)
|
||
|
||
YARN-2996. Improved synchronization and I/O operations of FS- and Mem-
|
||
RMStateStore. (Yi Liu via zjshen)
|
||
|
||
YARN-2956. Added missing links in YARN documentation. (Masatake Iwasaki via
|
||
jianhe)
|
||
|
||
YARN-2957. Create unit test to automatically compare YarnConfiguration
|
||
and yarn-default.xml. (rchiang via rkanter)
|
||
|
||
YARN-2643. Don't create a new DominantResourceCalculator on every
|
||
FairScheduler.allocate call. (kasha via rkanter)
|
||
|
||
YARN-3019. Make work-preserving-recovery the default mechanism for RM
|
||
recovery. (Jian He via junping_du)
|
||
|
||
YARN-2807. Option "--forceactive" does not work as described in usage of
|
||
"yarn rmadmin -transitionToActive". (Masatake Iwasaki via xgong)
|
||
|
||
YARN-2984. Metrics for container's actual memory usage. (kasha)
|
||
|
||
YARN-2800. Remove MemoryNodeLabelsStore and add a way to enable/disable
|
||
node labels feature. (Wangda Tan via ozawa)
|
||
|
||
YARN-3086. Make NodeManager memory configurable in MiniYARNCluster.
|
||
(Robert Metzger via ozawa)
|
||
|
||
YARN-2897. CrossOriginFilter needs more log statements (Mit Desai via
|
||
jeagles)
|
||
|
||
YARN-3028. Better syntax for replaceLabelsOnNode in RMAdmin CLI
|
||
(Rohith Sharmaks via wangda)
|
||
|
||
YARN-2932. Add entry for "preemptable" status (enabled/disabled) to
|
||
scheduler web UI and queue initialize/refresh logging.
|
||
(Eric Payne via wangda)
|
||
|
||
YARN-3108. ApplicationHistoryServer doesn't process -D arguments (Chang Li
|
||
via jeagles)
|
||
|
||
YARN-2808. Made YARN CLI list attempt’s finished containers of a running
|
||
application. (Naganarasimha G R via zjshen)
|
||
|
||
YARN-3085. Application summary should include the application type (Rohith
|
||
via jlowe)
|
||
|
||
YARN-3022. Expose Container resource information from NodeManager for
|
||
monitoring (adhoot via rkanter)
|
||
|
||
YARN-3075. NodeLabelsManager implementation to retrieve label to node
|
||
mapping (Varun Saxena via wangda)
|
||
|
||
YARN-1393. SLS: Add how-to-use instructions. (Wei Yan via kasha)
|
||
|
||
YARN-1723. AMRMClientAsync missing blacklist addition and removal
|
||
functionality. (Bartosz Ługowski via sseth)
|
||
|
||
YARN-3123. Made YARN CLI show a single completed container even if the app
|
||
is running. (Naganarasimha G R via zjshen)
|
||
|
||
YARN-1582. Capacity Scheduler: add a maximum-allocation-mb setting per
|
||
queue (Thomas Graves via jlowe)
|
||
|
||
YARN-1904. Ensure exceptions thrown in ClientRMService &
|
||
ApplicationHistoryClientService are uniform when application-attempt is
|
||
not found. (zjshen via acmurthy)
|
||
|
||
YARN-3144. Configuration for making delegation token failures to timeline
|
||
server not-fatal (Jonathan Eagles via jlowe)
|
||
|
||
YARN-3155. Refactor the exception handling code for TimelineClientImpl's
|
||
retryOn method (Li Lu via wangda)
|
||
|
||
YARN-3100. Made YARN authorization pluggable. (Jian He via zjshen)
|
||
|
||
YARN-2683. [YARN-913] registry config options: document and move to
|
||
core-default. (stevel)
|
||
|
||
YARN-1237. Description for yarn.nodemanager.aux-services in
|
||
yarn-default.xml is misleading. (Brahma Reddy Battula via ozawa)
|
||
|
||
YARN-3157. Refactor the exception handling in ConverterUtils#to*Id.
|
||
(Bibin A Chundatt via ozawa)
|
||
|
||
YARN-3147. Clean up RM web proxy code. (Steve Loughran via xgong)
|
||
|
||
YARN-2079. Recover NonAggregatingLogHandler state upon nodemanager
|
||
restart. (Jason Lowe via junping_du)
|
||
|
||
YARN-3158. Correct log messages in ResourceTrackerService.
|
||
(Varun Saxena via xgong)
|
||
|
||
YARN-3179. Update use of Iterator to Iterable in RMAdminCLI and
|
||
CommonNodeLabelsManager. (Ray Chiang via xgong)
|
||
|
||
YARN-3182. Cleanup switch statement in ApplicationMasterLauncher#handle().
|
||
(Ray Chiang via ozawa)
|
||
|
||
YARN-3203. Correct a log message in AuxServices. (Brahma Reddy Battula
|
||
via ozawa)
|
||
|
||
YARN-1299. Improve a log message in AppSchedulingInfo by adding application
|
||
id. (Ashutosh Jindal and Devaraj K via ozawa)
|
||
|
||
YARN-1514. Utility to benchmark ZKRMStateStore#loadState for RM HA.
|
||
(Tsuyoshi OZAWA via jianhe)
|
||
|
||
YARN-3076. Add API/Implementation to YarnClient to retrieve label-to-node
|
||
mapping. (Varun Saxena via wangda)
|
||
|
||
YARN-2799. Cleanup TestLogAggregationService based on the change in YARN-90.
|
||
(Zhihai Xu via junping_du)
|
||
|
||
YARN-3237. AppLogAggregatorImpl fails to log error cause.
|
||
(Rushabh S Shah via xgong)
|
||
|
||
YARN-3236. Cleanup RMAuthenticationFilter#AUTH_HANDLER_PROPERTY.
|
||
(zhihai xu via xgong)
|
||
|
||
YARN-2797. TestWorkPreservingRMRestart should use ParametrizedSchedulerTestBase
|
||
(Karthik Kambatla via xgong)
|
||
|
||
YARN-2797. Add -help to yarn logs and nodes CLI command.
|
||
(Jagadesh Kiran N via devaraj)
|
||
|
||
YARN-3217. Remove httpclient dependency from hadoop-yarn-server-web-proxy.
|
||
(Brahma Reddy Battula via ozawa)
|
||
|
||
YARN-3255. RM, NM, JobHistoryServer, and WebAppProxyServer's main()
|
||
should support generic options. (shv)
|
||
|
||
YARN-2820. Retry in FileSystemRMStateStore when FS's operations fail
|
||
due to IOException. (Zhihai Xu via ozawa)
|
||
|
||
YARN-3262. Surface application outstanding resource requests table
|
||
in RM web UI. (Jian He via wangda)
|
||
|
||
YARN-3281. Added RMStateStore to StateMachine visualization list.
|
||
(Chengbing Liu via jianhe)
|
||
|
||
YARN-3272. Surface container locality info in RM web UI.
|
||
(Jian He via wangda)
|
||
|
||
YARN-3285. (Backport YARN-3168) Convert branch-2 .apt.vm files of YARN to
|
||
markdown. (Masatake Iwasaki via jianhe)
|
||
|
||
YARN-3122. Metrics for container's actual CPU usage.
|
||
(Anubhav Dhoot via kasha)
|
||
|
||
YARN-2190. Added CPU and memory limit options to the default container
|
||
executor for Windows containers. (Chuan Liu via jianhe)
|
||
|
||
YARN-3296. Mark ResourceCalculatorProcessTree class as Public for configurable
|
||
resource monitoring. (Hitesh Shah via junping_du)
|
||
|
||
YARN-3187. Documentation of Capacity Scheduler Queue mapping based on user
|
||
or group. (Gururaj Shetty via jianhe)
|
||
|
||
YARN-2854. Updated the documentation of the timeline service and the generic
|
||
history service. (Naganarasimha G R via zjshen)
|
||
|
||
Backport part of YARN-3273 to rename
|
||
CapacitySchedulerLeafQueueInfo#aMResourceLimit to AMResourceLimit.
|
||
(Rohith via jianhe)
|
||
|
||
YARN-2777. Mark the end of individual log in aggregated log.
|
||
(Varun Saxena via xgong)
|
||
|
||
YARN-3273. Improve scheduler UI to facilitate scheduling analysis and
|
||
debugging. (Rohith Sharmaks via jianhe)
|
||
|
||
OPTIMIZATIONS
|
||
|
||
YARN-2990. FairScheduler's delay-scheduling always waits for node-local and
|
||
rack-local delays, even for off-rack-only requests. (kasha)
|
||
|
||
BUG FIXES
|
||
|
||
YARN-3071. Remove invalid char from sample conf in doc of FairScheduler.
|
||
(Masatake Iwasaki via aajisaka)
|
||
|
||
YARN-2254. TestRMWebServicesAppsModification should run against both
|
||
CS and FS. (Zhihai Xu via kasha)
|
||
|
||
YARN-2713. "RM Home" link in NM should point to one of the RMs in an
|
||
HA setup. (kasha)
|
||
|
||
YARN-2857. ConcurrentModificationException in ContainerLogAppender
|
||
(Mohammad Kamrul Islam via jlowe)
|
||
|
||
YARN-2432. RMStateStore should process the pending events before close.
|
||
(Varun Saxena via jianhe)
|
||
|
||
YARN-1703. Fixed ResourceManager web-proxy to close connections correctly.
|
||
(Rohith Sharma via vinodkv)
|
||
|
||
YARN-2870. Updated the command to run the timeline server in the document.
|
||
(Masatake Iwasaki via zjshen)
|
||
|
||
YARN-2878. Fix DockerContainerExecutor.apt.vm formatting. (Abin Shahab via
|
||
jianhe)
|
||
|
||
YARN-2315. FairScheduler: Set current capacity in addition to capacity.
|
||
(Zhihai Xu via kasha)
|
||
|
||
YARN-2697. Remove useless RMAuthenticationHandler. (Haosong Huang via zjshen)
|
||
|
||
YARN-2461. Fix PROCFS_USE_SMAPS_BASED_RSS_ENABLED property in
|
||
YarnConfiguration. (rchiang via rkanter)
|
||
|
||
YARN-2869. CapacityScheduler should trim sub queue names when parse
|
||
configuration. (Wangda Tan via jianhe)
|
||
|
||
YARN-2927. [YARN-1492] InMemorySCMStore properties are inconsistent.
|
||
(Ray Chiang via kasha)
|
||
|
||
YARN-2931. PublicLocalizer may fail until directory is initialized by
|
||
LocalizeRunner. (Anubhav Dhoot via kasha)
|
||
|
||
YARN-2930. Fixed TestRMRestart#testRMRestartRecoveringNodeLabelManager
|
||
intermittent failure. (Wangda Tan via jianhe)
|
||
|
||
YARN-2924. Fixed RMAdminCLI to not convert node labels to lower case.
|
||
(Wangda Tan via jianhe)
|
||
|
||
YARN-2243. Order of arguments for Preconditions.checkNotNull() is wrong in
|
||
SchedulerApplicationAttempt ctor. (devaraj)
|
||
|
||
YARN-2912. Jersey Tests failing with port in use. (varun saxena via stevel)
|
||
|
||
YARN-2356. yarn status command for non-existent application/application
|
||
attempt/container is too verbose. (Sunil G via devaraj)
|
||
|
||
YARN-2914. [YARN-1492] Potential race condition in Singleton implementation of
|
||
SharedCacheUploaderMetrics, CleanerMetrics, ClientSCMMetrics. (Varun Saxena via kasha)
|
||
|
||
YARN-2945. FSLeafQueue#assignContainer - document the reason for using both write and
|
||
read locks. (Tsuyoshi Ozawa via kasha)
|
||
|
||
YARN-2944. InMemorySCMStore can not be instantiated with ReflectionUtils#newInstance.
|
||
(Chris Trezzo via kasha)
|
||
|
||
YARN-2675. containersKilled metrics is not updated when the container is killed
|
||
during localization. (Zhihai Xu via kasha)
|
||
|
||
YARN-2975. FSLeafQueue app lists are accessed without required locks. (kasha)
|
||
|
||
YARN-2977. Fixed intermittent TestNMClient failure.
|
||
(Junping Du via ozawa)
|
||
|
||
YARN-2939. Fix new findbugs warnings in hadoop-yarn-common. (Li Lu via junping_du)
|
||
|
||
YARN-2940. Fix new findbugs warnings in rest of the hadoop-yarn components. (Li Lu
|
||
via junping_du)
|
||
|
||
YARN-2937. Fixed new findbugs warnings in hadoop-yarn-nodemanager. (Varun Saxena
|
||
via zjshen)
|
||
|
||
YARN-2946. Fixed potential deadlock in RMStateStore. (Rohith Sharmaks via
|
||
jianhe)
|
||
|
||
YARN-2988. Graph#save() may leak file descriptors. (Ted Yu via ozawa)
|
||
|
||
YARN-2938. Fixed new findbugs warnings in hadoop-yarn-resourcemanager and
|
||
hadoop-yarn-applicationhistoryservice. (Varun Saxena via zjshen)
|
||
|
||
YARN-2987. Fixed ClientRMService#getQueueInfo to check against queue and
|
||
app ACLs. (Varun Saxena via jianhe)
|
||
|
||
YARN-2991. Fixed DrainDispatcher to reuse the draining code path in
|
||
AsyncDispatcher. (Rohith Sharmaks via zjshen)
|
||
|
||
YARN-2958. Made RMStateStore not update the last sequence number when updating the
|
||
delegation token. (Varun Saxena via zjshen)
|
||
|
||
YARN-2230. Fixed a few config descriptions in yarn-default.xml. (Vijay Bhat
|
||
via jianhe)
|
||
|
||
YARN-3010. Fixed findbugs warning in AbstractYarnScheduler. (Yi Liu via
|
||
jianhe)
|
||
|
||
YARN-2936. Changed YARNDelegationTokenIdentifier to set proto fields on
|
||
getProto method. (Varun Saxena via jianhe)
|
||
|
||
YARN-3014. Replaces labels on a host should update all NM's labels on that
|
||
host. (Wangda Tan via jianhe)
|
||
|
||
YARN-3027. Scheduler should use totalAvailable resource from node instead of
|
||
availableResource for maxAllocation. (adhoot via rkanter)
|
||
|
||
YARN-2861. Fixed Timeline DT secret manager to not reuse RM's configs.
|
||
(Zhijie Shen via jianhe)
|
||
|
||
YARN-3064. TestRMRestart/TestContainerResourceUsage/TestNodeManagerResync
|
||
failure with allocation timeout. (Jian He via junping_du)
|
||
|
||
YARN-2815. Excluded transitive dependency of JLine in hadoop-yarn-server-common.
|
||
(Ferdinand Xu via zjshen)
|
||
|
||
YARN-3070. TestRMAdminCLI#testHelp fails for transitionToActive command.
|
||
(Contributed by Junping Du)
|
||
|
||
YARN-3015. yarn classpath command should support same options as hadoop
|
||
classpath. (Contributed by Varun Saxena)
|
||
|
||
YARN-2933. Capacity Scheduler preemption policy should only consider capacity
|
||
without labels temporarily. (Mayank Bansal via wangda)
|
||
|
||
YARN-2731. Fixed RegisterApplicationMasterResponsePBImpl to properly invoke
|
||
maybeInitBuilder. (Carlo Curino via wangda)
|
||
|
||
YARN-3078. LogCLIHelpers lacks a blank space before string 'does not exist'.
|
||
(Sam Liu via ozawa)
|
||
|
||
YARN-3082. Non thread safe access to systemCredentials in NodeHeartbeatResponse
|
||
processing. (Anubhav Dhoot via ozawa)
|
||
|
||
YARN-3088. LinuxContainerExecutor.deleteAsUser can throw NPE if native
|
||
executor returns an error (Eric Payne via jlowe)
|
||
|
||
YARN-3079. Scheduler should also update maximumAllocation when updateNodeResource.
|
||
(Zhihai Xu via wangda)
|
||
|
||
YARN-3029. FSDownload.unpack() uses local locale for FS case conversion, may not
|
||
work everywhere. (Varun Saxena via ozawa)
|
||
|
||
YARN-3077. Fixed RM to create zk root path recursively. (Chun Chen via jianhe)
|
||
|
||
YARN-3113. Release audit warning for Sorting icons.psd. (stevel via kihwal)
|
||
|
||
YARN-3056. Add verification for containerLaunchDuration
|
||
in TestNodeManagerMetrics. (zhihai xu via xgong)
|
||
|
||
YARN-2543. Made resource usage be published to the timeline server too.
|
||
(Naganarasimha G R via zjshen)
|
||
|
||
YARN-3058. Fix error message of tokens' activation delay configuration.
|
||
(Yi Liu via ozawa)
|
||
|
||
YARN-3101. In Fair Scheduler, fix canceling of reservations for exceeding
|
||
max share (Anubhav Dhoot via Sandy Ryza)
|
||
|
||
YARN-3149. Fix typo in message for invalid application id.
|
||
(Bibin A Chundatt via xgong)
|
||
|
||
YARN-3145. Fixed ConcurrentModificationException on CapacityScheduler
|
||
ParentQueue#getQueueUserAclInfo. (Tsuyoshi OZAWA via jianhe)
|
||
|
||
YARN-1537. Fix race condition in
|
||
TestLocalResourcesTrackerImpl.testLocalResourceCache. (xgong via acmurthy)
|
||
|
||
YARN-3089. LinuxContainerExecutor does not handle file arguments to
|
||
deleteAsUser (Eric Payne via jlowe)
|
||
|
||
YARN-3143. RM Apps REST API can return NPE or entries missing id and other
fields (jlowe)

YARN-2971. RM uses conf instead of token service address to renew timeline
delegation tokens (jeagles)

YARN-3090. DeletionService can silently ignore deletion task failures
(Varun Saxena via jlowe)

YARN-2809. Implement workaround for linux kernel panic when removing
cgroup (Nathan Roberts via jlowe)

YARN-3160. Fix non-atomic operation on nodeUpdateQueue in RMNodeImpl.
(Chengbing Liu via junping_du)

YARN-3074. Nodemanager dies when localizer runner tries to write to a full
disk (Varun Saxena via jlowe)

YARN-3151. On Failover tracking url wrong in application cli for
KILLED application (Rohith via xgong)

YARN-1580. Documentation error regarding "container-allocation.expiry-interval-ms"
(Brahma Reddy Battula via junping_du)

YARN-3104. Fixed RM to not generate new AMRM tokens on every heartbeat
between rolling and activation. (Jason Lowe via jianhe)

YARN-3191. Log object should be initialized with its own class. (Rohith via
aajisaka)

YARN-3164. RMAdmin command usage prints incorrect command name.
(Bibin A Chundatt via xgong)

YARN-2847. Linux native container executor segfaults if default banned
user detected (Olaf Flebbe via jlowe)

YARN-2899. Run TestDockerContainerExecutorWithMocks on Linux only.
(Ming Ma via cnauroth)

YARN-2749. Fix some test cases from TestLogAggregationService that fail in trunk.
(Xuan Gong via junping_du)

YARN-3132. RMNodeLabelsManager should remove node from node-to-label mapping
when node becomes deactivated. (Wangda Tan via jianhe)

YARN-1615. Fix typos in description about delay scheduling. (Akira Ajisaka via
ozawa)

YARN-933. Fixed InvalidStateTransitonException at FINAL_SAVING state in
RMApp. (Rohith Sharmaks via jianhe)

YARN-3247. TestQueueMappings should use CapacityScheduler explicitly.
(Zhihai Xu via ozawa)

YARN-3256. TestClientToAMTokens#testClientTokenRace is not running against
all Schedulers even when using ParameterizedSchedulerTestBase.
(Anubhav Dhoot via devaraj)

YARN-3270. Fix node label expression not getting set in
ApplicationSubmissionContext (Rohit Agarwal via wangda)

YARN-3265. Fixed a deadlock in CapacityScheduler by always passing a queue's
available resource-limit from the parent queue. (Wangda Tan via vinodkv)

YARN-3131. YarnClientImpl should check FAILED and KILLED state in
submitApplication (Chang Li via jlowe)

YARN-3275. CapacityScheduler: Preemption happening on non-preemptable
queues (Eric Payne via jlowe)

YARN-3300. Outstanding_resource_requests table should not be shown in AHS.
(Xuan Gong via jianhe)

YARN-3295. Fix documentation nits found in markdown conversion.
(Masatake Iwasaki via ozawa)

YARN-3338. Exclude jline dependency from YARN. (Zhijie Shen via xgong)

YARN-3154. Added additional APIs in LogAggregationContext to avoid aggregating
running logs of application when rolling is enabled. (Xuan Gong via vinodkv)

YARN-1453. [JDK8] Fix Javadoc errors caused by incorrect or illegal tags in
doc comments. (Akira AJISAKA, Andrew Purtell, and Allen Wittenauer via ozawa)

YARN-3349. Treat all exceptions as failure in
TestFSRMStateStore#testFSRMStateStoreClientRetry. (Zhihai Xu via ozawa)

YARN-3379. Fixed missing data in localityTable and ResourceRequests table
in RM WebUI. (Xuan Gong via jianhe)

YARN-3384. TestLogAggregationService.verifyContainerLogs fails after
YARN-2777. (Naganarasimha G R via ozawa)

YARN-3336. FileSystem memory leak in DelegationTokenRenewer.
(Zhihai Xu via cnauroth)

YARN-2213. Change proxy-user cookie log in AmIpFilter to DEBUG.
(Varun Saxena via xgong)

YARN-3304. Cleaning up ResourceCalculatorProcessTree APIs for public use and
removing inconsistencies in the default values. (Junping Du and Karthik
Kambatla via vinodkv)

YARN-3430. Made headroom data available on app attempt page of RM WebUI.
(Xuan Gong via zjshen)

YARN-3466. Fix RM nodes web page to sort by node HTTP-address, #containers
and node-label column (Jason Lowe via wangda)

Release 2.6.4 - UNRELEASED

INCOMPATIBLE CHANGES

NEW FEATURES

IMPROVEMENTS

OPTIMIZATIONS

BUG FIXES

YARN-4452. NPE when submit Unmanaged application. (Naganarasimha G R
via junping_du)

YARN-4414. Nodemanager connection errors are retried at multiple levels
(Chang Li via jlowe)

YARN-4598. Invalid event: RESOURCE_FAILED at
CONTAINER_CLEANEDUP_AFTER_KILL (tangshangwen via jlowe)

Release 2.6.3 - 2015-12-17

INCOMPATIBLE CHANGES

NEW FEATURES

IMPROVEMENTS

OPTIMIZATIONS

BUG FIXES

YARN-4241. Fix typo of property name in yarn-default.xml.
(Anthony Rojas via aajisaka)

YARN-2859. ApplicationHistoryServer binds to default port 8188 in MiniYARNCluster.
(Vinod Kumar Vavilapalli via xgong)

YARN-4344. NMs reconnecting with changed capabilities can lead to wrong
cluster resource calculations (Varun Vasudev via jlowe)

YARN-3925. ContainerLogsUtils#getContainerLogFile fails to read container
log files from full disks. (zhihai xu via jlowe)

YARN-4365. FileSystemNodeLabelStore should check for root dir existence on
startup (Kuhu Shukla via jlowe)

YARN-4348. ZKRMStateStore.syncInternal shouldn't wait for sync completion for
avoiding blocking ZK's event thread. (ozawa)

YARN-4424. Fix deadlock in RMAppImpl. (Jian He via wangda)

YARN-4434. NodeManager Disk Checker parameter documentation is not correct.
(Weiwei Yang via aajisaka)

Release 2.6.2 - 2015-10-28

INCOMPATIBLE CHANGES

NEW FEATURES

YARN-2019. Retrospect on decision of making RM crash if any exception is
thrown in ZKRMStateStore. (Jian He via junping_du)

IMPROVEMENTS

YARN-4092. Fixed UI redirection to print useful messages when both RMs are
in standby mode. (Xuan Gong via jianhe)

YARN-4101. RM should print alert messages if Zookeeper and Resourcemanager
get connection issues. (Xuan Gong via jianhe)

OPTIMIZATIONS

BUG FIXES

YARN-4087. Followup fixes after YARN-2019 regarding RM behavior when
state-store error occurs. (Jian He via xgong)

YARN-3554. Default value for maximum nodemanager connect wait time is too
high (Naganarasimha G R via jlowe)

YARN-3727. For better error recovery, check if the directory exists before
using it for localization. (Zhihai Xu via jlowe)

YARN-4005. Completed container whose app is finished is possibly not
removed from NMStateStore. (Jun Gong via jianhe)

YARN-3780. Should use equals when comparing Resource in RMNodeImpl#ReconnectNodeTransition.
(zhihai xu via devaraj)

YARN-3802. Two RMNodes for the same NodeId are used in RM sometimes
after NM is reconnected. (zhihai xu via xgong)

YARN-3194. RM should handle NMContainerStatuses sent by NM while
registering if NM is Reconnected node (Rohith via jlowe)

YARN-3896. RMNode transitioned from RUNNING to REBOOTED because its response id
has not been reset synchronously. (Jun Gong via rohithsharmaks)

YARN-3798. ZKRMStateStore shouldn't create new session without occurrence of
SESSIONEXPIRED. (ozawa and Varun Saxena)

Release 2.6.1 - 2015-09-23

INCOMPATIBLE CHANGES

NEW FEATURES

YARN-3249. Add a 'kill application' button to Resource Manager's Web UI.
(Ryu Kobayashi via ozawa)

IMPROVEMENTS

YARN-3230. Clarify application states on the web UI. (Jian He via wangda)

YARN-1809. Synchronize RM and TimeLineServer Web-UIs. (Zhijie Shen and
Xuan Gong via jianhe)

YARN-3092. Created a common ResourceUsage class to track labeled resource
usages in Capacity Scheduler. (Wangda Tan via jianhe)

YARN-3098. Created common QueueCapacities class in Capacity Scheduler to
track capacities-by-labels of queues. (Wangda Tan via jianhe)

YARN-2301. Improved yarn container command. (Naganarasimha G R via jianhe)

YARN-3978. Configurably turn off the saving of container info in Generic AHS
(Eric Payne via jeagles)

YARN-3248. Display count of nodes blacklisted by apps in the web UI.
(Varun Vasudev via xgong)

OPTIMIZATIONS

BUG FIXES

YARN-2856. Fixed RMAppImpl to handle ATTEMPT_KILLED event at ACCEPTED state
on app recovery. (Rohith Sharmaks via jianhe)

YARN-2816. NM fails to start with NPE during container recovery (Zhihai Xu
via jlowe)

YARN-2414. RM web UI: app page will crash if app is failed before any
attempt has been created (Wangda Tan via jlowe)

YARN-2865. Fixed RM to always create a new RMContext when transitioning from
StandBy to Active. (Rohith Sharmaks via jianhe)

YARN-2906. CapacitySchedulerPage shows HTML tags for a queue's Active Users.
(Jason Lowe via jianhe)

YARN-2905. AggregatedLogsBlock page can infinitely loop if the aggregated
log file is corrupted (Varun Saxena via jlowe)

YARN-2890. MiniYARNCluster should start the timeline server based on the
configuration. (Mit Desai via zjshen)

YARN-2894. Fixed a bug regarding application view acl when RM fails over.
(Rohith Sharmaks via jianhe)

YARN-2874. Deadlock in "DelegationTokenRenewer" which blocks RM from executing
any further apps. (Naganarasimha G R via kasha)

YARN-2910. FSLeafQueue can throw ConcurrentModificationException.
(Wilfred Spiegelenburg via kasha)

YARN-2917. Fixed potential deadlock when system.exit is called in AsyncDispatcher
(Rohith Sharmaks via jianhe)

YARN-2964. RM prematurely cancels tokens for jobs that submit jobs (oozie)
(Jian He via jlowe)

YARN-1984. LeveldbTimelineStore does not handle db exceptions properly
(Varun Saxena via jlowe)

YARN-2952. Fixed incorrect version check in StateStore. (Rohith Sharmaks
via jianhe)

YARN-2340. Fixed NPE when queue is stopped during RM restart.
(Rohith Sharmaks via jianhe)

YARN-2992. ZKRMStateStore crashes due to session expiry. (Karthik Kambatla
via jianhe)

YARN-2922. ConcurrentModificationException in CapacityScheduler's LeafQueue.
(Rohith Sharmaks via ozawa)

YARN-2997. Fixed NodeStatusUpdater to not send already-sent completed
container statuses on heartbeat. (Chengbing Liu via jianhe)

YARN-3011. Possible IllegalArgumentException in ResourceLocalizationService
might lead NM to crash. (Varun Saxena via jianhe)

YARN-3103. AMRMClientImpl does not update AMRM token properly. (Jason Lowe
via jianhe)

YARN-3094. Reset timer for liveness monitors after RM recovery. (Jun Gong
via jianhe)

YARN-2246. Made the proxy tracking URL always be
http(s)://proxy addr:port/proxy/<appId> to avoid duplicate sections. (Devaraj
K via zjshen)

YARN-3207. Secondary filter matches entities which do not have the key being
filtered for. (Zhijie Shen via xgong)

YARN-3238. Connection timeouts to nodemanagers are retried at
multiple levels (Jason Lowe via xgong)

YARN-3239. WebAppProxy does not support a final tracking url which has
query fragments and params (Jian He via jlowe)

YARN-3222. Fixed RMNode to send scheduler events in sequential order when a
node reconnects. (Rohith Sharma K S via jianhe)

YARN-3231. FairScheduler: Changing queueMaxRunningApps interferes with pending
jobs. (Siqi Li via kasha)

YARN-3242. Asynchrony in ZK-close can lead to ZKRMStateStore watcher receiving
events for old client. (Zhihai Xu via kasha)

YARN-3227. Timeline renew delegation token fails when RM user's TGT is expired
(Zhijie Shen via xgong)

YARN-3287. Made TimelineClient put methods do as the correct login context.
(Daryn Sharp and Jonathan Eagles via zjshen)

YARN-3267. Timelineserver applies the ACL rules after applying the limit on
the number of records (Chang Li via jeagles)

YARN-3369. Missing NullPointer check in AppSchedulingInfo causes RM to die.
(Brahma Reddy Battula via wangda)

YARN-3393. Getting application(s) goes wrong when app finishes before
starting the attempt. (Zhijie Shen via xgong)

YARN-3055. Fixed ResourceManager's DelegationTokenRenewer to not stop token
renewal of applications part of a bigger workflow. (Daryn Sharp via vinodkv)

YARN-3493. RM fails to come up with error "Failed to load/recover state"
when mem settings are changed. (Jian He via wangda)

YARN-3487. CapacityScheduler scheduler lock obtained unnecessarily when
calling getQueue (Jason Lowe via wangda)

YARN-3024. LocalizerRunner should give DIE action when all resources are
localized. (Chengbing Liu via xgong)

YARN-3464. Race condition in LocalizerRunner kills localizer before
localizing all resources. (Zhihai Xu via kasha)

YARN-3641. NodeManager: stopRecoveryStore() shouldn't be skipped when
exceptions happen in stopping NM's sub-services. (Junping Du via jlowe)

YARN-3526. ApplicationMaster tracking URL is incorrectly redirected
on a QJM cluster. (Weiwei Yang via xgong)

YARN-2766. Made ApplicationHistoryManager return a sorted list of apps,
attempts and containers. (Robert Kanter via zjshen)

YARN-3700. Made generic history service load a number of latest applications
according to the parameter or the configuration. (Xuan Gong via zjshen)

YARN-2900. Application (Attempt and Container) Not Found in AHS results
in InternalServer Error (500). (Zhijie Shen and Mit Desai via xgong)

YARN-3725. App submission via REST API is broken in secure mode due to
Timeline DT service address is empty. (Zhijie Shen via wangda)

YARN-3585. NodeManager cannot exit on SHUTDOWN event triggered and NM
recovery is enabled (Rohith Sharmaks via jlowe)

YARN-3832. Resource Localization fails on a cluster due to existing cache
directories (Brahma Reddy Battula via jlowe)

YARN-3850. NM fails to read files from full disks which can lead to
container logs being lost and other issues (Varun Saxena via jlowe)

YARN-3990. AsyncDispatcher may be overloaded with RMAppNodeUpdateEvent when
Node is connected/disconnected (Bibin A Chundatt via jlowe)

YARN-2637. Fixed max-am-resource-percent calculation in CapacityScheduler
when activating applications. (Craig Welch via jianhe)

YARN-3733. Fix DominantRC#compare() not working as expected if
cluster resource is empty. (Rohith Sharmaks via wangda)

YARN-2920. Changed CapacityScheduler to kill containers on nodes where
node labels are changed. (Wangda Tan via jianhe)

YARN-2978. Fixed potential NPE while getting queue info. (Varun Saxena via
jianhe)

YARN-3099. Capacity Scheduler LeafQueue/ParentQueue should use ResourceUsage
to track used-resources-by-label. (Wangda Tan via jianhe)

YARN-2694. Ensure only single node label specified in ResourceRequest.
(Wangda Tan via jianhe)

YARN-3124. Fixed CS LeafQueue/ParentQueue to use QueueCapacities to track
capacities-by-label. (Wangda Tan via jianhe)

YARN-2918. RM should not fail on startup if queue's configured labels do
not exist in cluster-node-labels. (Wangda Tan via jianhe)

YARN-3999. RM hangs on draining events. (Jian He via xgong)

YARN-4047. ClientRMService getApplications has high scheduler lock contention.
(Jason Lowe via jianhe)

YARN-1884. Added nodeHttpAddress into ContainerReport and fixed the link to NM
web page. (Xuan Gong via zjshen)

YARN-3171. Sort by Application id, AppAttempt and ContainerID doesn't work
in ATS / RM web ui. (Naganarasimha G R via xgong)

YARN-3740. Fixed the typo in the configuration name:
APPLICATION_HISTORY_PREFIX_MAX_APPS. (Xuan Gong via zjshen)

YARN-3544. Got back AM logs link on the RM web UI for a completed app.
(Xuan Gong via zjshen)

Release 2.6.0 - 2014-11-18

INCOMPATIBLE CHANGES

NEW FEATURES

YARN-1964. Create Docker analog of the LinuxContainerExecutor in YARN. (Abin
Shahab via raviprak)

YARN-2131. Add a way to format the RMStateStore. (Robert Kanter via kasha)

YARN-1367. Changed NM to not kill containers on NM resync if RM work-preserving
restart is enabled. (Anubhav Dhoot via jianhe)

YARN-1366. Changed AMRMClient to re-register with RM and send outstanding requests
back to RM on work-preserving RM restart. (Rohith via jianhe)

YARN-2181. Added preemption info to logs and RM web UI. (Wangda Tan via
jianhe)

YARN-1354. Recover applications upon nodemanager restart. (Jason Lowe via
junping_du)

YARN-1337. Recover containers upon nodemanager restart. (Jason Lowe via
junping_du)

YARN-2277. Added cross-origin support for the timeline server web services.
(Jonathan Eagles via zjshen)

YARN-2378. Added support for moving applications across queues in
CapacityScheduler. (Subramaniam Venkatraman Krishnan via jianhe)

YARN-2411. Support simple user and group mappings to queues. (Ram Venkatesh
via jianhe)

YARN-2174. Enable HTTPs for the writer REST API of TimelineServer.
(Zhijie Shen via jianhe)

YARN-2393. FairScheduler: Add the notion of steady fair share.
(Wei Yan via kasha)

YARN-2395. FairScheduler: Preemption timeout should be configurable per
queue. (Wei Yan via kasha)

YARN-2394. FairScheduler: Configure fairSharePreemptionThreshold per queue.
(Wei Yan via kasha)

YARN-415. Capture aggregate memory allocation at the app-level for chargeback.
(Eric Payne & Andrey Klochkov via jianhe)

YARN-2440. Enabled Nodemanagers to limit the aggregate cpu usage across all
containers to a preconfigured limit. (Varun Vasudev via vinodkv)

YARN-2033. Merging generic-history into the Timeline Store
(Zhijie Shen via junping_du)

YARN-611. Added an API to let apps specify an interval beyond which AM
failures should be ignored towards counting max-attempts. (Xuan Gong via
vinodkv)

YARN-2531. Added a configuration for admins to be able to override app-configs
and enforce/not-enforce strict control of per-container cpu usage. (Varun
Vasudev via vinodkv)

YARN-1250. Generic history service should support application-acls. (Zhijie Shen
via junping_du)

YARN-2569. Added the log handling APIs for the long running services. (Xuan
Gong via zjshen)

YARN-2102. Added the concept of a Timeline Domain to handle read/write ACLs
on Timeline service event data. (Zhijie Shen via vinodkv)

YARN-2581. Passed LogAggregationContext to NM via ContainerTokenIdentifier.
(Xuan Gong via zjshen)

YARN-1063. Augmented Hadoop common winutils to have the ability to create
containers as domain users. (Remus Rusanu via vinodkv)

YARN-2613. Support retry in NMClient for rolling-upgrades. (Jian He via
junping_du)

YARN-1972. Added a secure container-executor for Windows. (Remus Rusanu via
vinodkv)

YARN-2446. Augmented Timeline service APIs to start taking in domains as a
parameter while posting entities and events. (Zhijie Shen via vinodkv)

YARN-2468. Enhanced NodeManager to support log handling APIs (YARN-2569) for
use by long running services. (Xuan Gong via vinodkv)

YARN-1051. Add a system for creating reservations of cluster capacity.
(see breakdown below)

YARN-913. Add a way to register long-lived services in a YARN cluster.
(stevel)

YARN-2493. Added user-APIs for using node-labels. (Wangda Tan via vinodkv)

YARN-2544. Added admin-API objects for using node-labels. (Wangda Tan via
vinodkv)

YARN-2494. Added NodeLabels Manager internal API and implementation. (Wangda
Tan via vinodkv)

YARN-2501. Enhanced AMRMClient library to support requests against node
labels. (Wangda Tan via vinodkv)

YARN-2656. Made RM web services authentication filter support proxy user.
(Varun Vasudev and Zhijie Shen via zjshen)

YARN-2496. Enhanced Capacity Scheduler to have basic support for allocating
resources based on node-labels. (Wangda Tan via vinodkv)

YARN-2500. Enhanced ResourceManager to support schedulers allocating resources
based on node-labels. (Wangda Tan via vinodkv)

YARN-2504. Enhanced RM Admin CLI to support management of node-labels.
(Wangda Tan via vinodkv)

YARN-2198. Remove the need to run NodeManager as privileged account for
Windows Secure Container Executor. (Remus Rusanu via jianhe)

YARN-2647. Added a queue CLI for getting queue information. (Sunil Govind via
vinodkv)

YARN-2632. Document NM Restart feature. (Junping Du and Vinod Kumar
Vavilapalli via jlowe)

YARN-2505. Supported get/add/remove/change labels in RM REST API. (Craig Welch
via zjshen)

YARN-2811. In Fair Scheduler, reservation fulfillments shouldn't ignore max
share (Siqi Li via Sandy Ryza)

IMPROVEMENTS

YARN-2242. Improve exception information on AM launch crashes. (Li Lu
via junping_du)

YARN-2274. FairScheduler: Add debug information about cluster capacity,
availability and reservations. (kasha)

YARN-2228. Augmented TimelineServer to load pseudo authentication filter when
authentication = simple. (Zhijie Shen via vinodkv)

YARN-1341. Recover NMTokens upon nodemanager restart. (Jason Lowe via
junping_du)

YARN-2208. AMRMTokenManager needs to have a way to roll over AMRMToken. (xgong)

YARN-2323. FairShareComparator creates too many Resource objects (Hong Zhiguo
via Sandy Ryza)

YARN-2045. Data persisted in NM should be versioned (Junping Du via jlowe)

YARN-2013. The diagnostics is always the ExitCodeException stack when the container
crashes. (Tsuyoshi OZAWA via junping_du)

YARN-2295. Refactored DistributedShell to use public APIs of protocol records.
(Li Lu via jianhe)

YARN-1342. Recover container tokens upon nodemanager restart. (Jason Lowe via
devaraj)

YARN-2214. FairScheduler: preemptContainerPreCheck() in FSParentQueue delays
convergence towards fairness. (Ashwin Shankar via kasha)

YARN-2211. Persist AMRMToken master key in RMStateStore for RM recovery.
(Xuan Gong via jianhe)

YARN-2328. FairScheduler: Verify update and continuous scheduling threads are
stopped when the scheduler is stopped. (kasha)

YARN-2347. Consolidated RMStateVersion and NMDBSchemaVersion into Version in
yarn-server-common. (Junping Du via zjshen)

YARN-1994. Expose YARN/MR endpoints on multiple interfaces. (Craig Welch,
Milan Potocnik, Arpit Agarwal via xgong)

YARN-2343. Improve NMToken expire exception message. (Li Lu via jianhe)

YARN-2370. Fix comment in o.a.h.y.server.resourcemanager.schedulerAppSchedulingInfo
(Wenwu Peng via junping_du)

YARN-2298. Move TimelineClient to yarn-common project (Zhijie Shen via
junping_du)

YARN-2288. Made persisted data in LevelDB timeline store be versioned. (Junping Du
via zjshen)

YARN-2352. FairScheduler: Collect metrics on duration of critical methods that
affect performance. (kasha)

YARN-2212. ApplicationMaster needs to find a way to update the AMRMToken
periodically. (xgong)

YARN-2026. Fair scheduler: Consider only active queues for computing fairshare.
(Ashwin Shankar via kasha)

YARN-1954. Added waitFor to AMRMClient(Async). (Tsuyoshi Ozawa via zjshen)

YARN-2302. Refactor TimelineWebServices. (Zhijie Shen via junping_du)

YARN-2337. ResourceManager sets ClientRMService in RMContext multiple times.
(Zhihai Xu via kasha)

YARN-2138. Cleaned up notifyDone* APIs in RMStateStore. (Varun Saxena via
jianhe)

YARN-2373. Changed WebAppUtils to use Configuration#getPassword for
accessing SSL passwords. (Larry McCay via jianhe)

YARN-2317. Updated the document about how to write YARN applications. (Li Lu via
zjshen)

YARN-2399. FairScheduler: Merge AppSchedulable and FSSchedulerApp into
FSAppAttempt. (kasha)

YARN-1370. Fair scheduler to re-populate container allocation state.
(Anubhav Dhoot via kasha)

YARN-2197. Add a link to YARN CHANGES.txt in the left side of doc
(Akira AJISAKA via aw)

YARN-1918. Typo in description and error message for
'yarn.resourcemanager.cluster-id' (Anandha L Ranganathan via aw)

YARN-2389. Added functionality for schedulers to kill all applications in a
queue. (Subramaniam Venkatraman Krishnan via jianhe)

YARN-1326. RM should log using RMStore at startup time.
(Tsuyoshi Ozawa via kasha)

YARN-2182. Updated ContainerId#toString() to append RM Epoch number.
(Tsuyoshi OZAWA via jianhe)

YARN-2406. Move RM recovery related proto to
yarn_server_resourcemanager_recovery.proto. (Tsuyoshi Ozawa via jianhe)

YARN-1506. Changed RMNode/SchedulerNode to update resource with event
notification. (Junping Du via jianhe)

YARN-2509. Enable Cross Origin Filter for timeline server only and not all
Yarn servers (Mit Desai via jeagles)

YARN-2511. Allowed all origins by default when CrossOriginFilter is
enabled. (Jonathan Eagles via zjshen)

YARN-2508. Cross Origin configuration parameters prefix are not honored
(Mit Desai via jeagles)

YARN-2512. Allowed pattern matching for origins in CrossOriginFilter.
(Jonathan Eagles via zjshen)

YARN-2507. Documented CrossOriginFilter configurations for the timeline
server. (Jonathan Eagles via zjshen)

YARN-2515. Updated ConverterUtils#toContainerId to parse epoch.
(Tsuyoshi OZAWA via jianhe)

YARN-2448. Changed ApplicationMasterProtocol to expose RM-recognized resource
types to the AMs. (Varun Vasudev via vinodkv)

YARN-2538. Added logs when RM sends roll-overed AMRMToken to AM. (Xuan Gong
via zjshen)

YARN-2229. Changed the integer field of ContainerId to be long type.
(Tsuyoshi OZAWA via jianhe)

YARN-2547. Cross Origin Filter throws UnsupportedOperationException upon
destroy (Mit Desai via jeagles)

YARN-2557. Add a parameter "attempt_Failures_Validity_Interval" into
DistributedShell. (xgong)

YARN-2001. Added a time threshold for RM to wait before starting container
allocations after restart/failover. (Jian He via vinodkv)

YARN-1372. Ensure all completed containers are reported to the AMs across
RM restart. (Anubhav Dhoot via jianhe)

YARN-2539. FairScheduler: Set the default value for maxAMShare to 0.5.
(Wei Yan via kasha)

YARN-1959. Fix headroom calculation in FairScheduler.
(Anubhav Dhoot via kasha)

YARN-2577. Clarify ACL delimiter and how to configure ACL groups only
(Miklos Christine via aw)

YARN-2372. There are Chinese Characters in the FairScheduler's document
(Fengdong Yu via aw)

YARN-668. Changed NMTokenIdentifier/AMRMTokenIdentifier/ContainerTokenIdentifier
to use protobuf object as the payload. (Junping Du via jianhe)

YARN-1769. CapacityScheduler: Improve reservations (Thomas Graves via
|
||
jlowe)
|
||
|
||
YARN-2627. Added the info logs of attemptFailuresValidityInterval and number
|
||
of previous failed attempts. (Xuan Gong via zjshen)
|
||
|
||
YARN-2562. Changed ContainerId#toString() to be more readable. (Tsuyoshi
|
||
OZAWA via jianhe)
|
||
|
||
YARN-2615. Changed ClientToAMTokenIdentifier/RM(Timeline)DelegationTokenIdentifier
|
||
to use protobuf as payload. (Junping Du via jianhe)
|
||
|
||
YARN-2629. Made the distributed shell use the domain-based timeline ACLs.
|
||
(zjshen)
|
||
|
||
YARN-2583. Modified AggregatedLogDeletionService to be able to delete rolling
|
||
aggregated logs. (Xuan Gong via zjshen)
|
||
|
||
YARN-2312. Deprecated old ContainerId#getId API and updated MapReduce to
|
||
use ContainerId#getContainerId instead. (Tsuyoshi OZAWA via jianhe)
|
||
|
||
YARN-2621. Simplify the output when the user doesn't have the access for
|
||
getDomain(s). (Zhijie Shen via jianhe)
|
||
|
||
YARN-1879. Marked Idempotent/AtMostOnce annotations to ApplicationMasterProtocol
|
||
for RM fail over. (Tsuyoshi OZAWA via jianhe)
|
||
|
||
YARN-2676. Enhanced Timeline auth-filter to support proxy users. (Zhijie Shen
|
||
via vinodkv)
|
||
|
||
YARN-2673. Made timeline client put APIs retry if ConnectException happens.
|
||
(Li Lu via zjshen)
|
||
|
||
YARN-2582. Fixed Log CLI and Web UI for showing aggregated logs of LRS. (Xuan
|
||
Gong via zjshen)
|
||
|
||
YARN-90. NodeManager should identify failed disks becoming good again
|
||
(Varun Vasudev via jlowe)
|
||
|
||
YARN-2709. Made timeline client getDelegationToken API retry if ConnectException
|
||
happens. (Li Lu via zjshen)
|
||
|
||
YARN-2682. Updated WindowsSecureContainerExecutor to not use
|
||
DefaultContainerExecutor#getFirstApplicationDir and use getWorkingDir()
|
||
instead. (Zhihai Xu via jianhe)
|
||
|
||
YARN-2209. Replaced AM resync/shutdown command with corresponding exceptions and
|
||
made related MR changes. (Jian He via zjshen)
|
||
|
||
YARN-2703. Added logUploadedTime into LogValue for better display. (Xuan Gong
|
||
via zjshen)
|
||
|
||
YARN-2704. Changed ResourceManager to optionally obtain tokens itself for the
|
||
sake of localization and log-aggregation for long-running services. (Jian He
|
||
via vinodkv)
|
||
|
||
YARN-2502. Changed DistributedShell to support node labels. (Wangda Tan via
|
||
jianhe)
|
||
|
||
YARN-2760. Remove 'experimental' from FairScheduler docs. (Harsh J via kasha)
|
||
|
||
YARN-2503. Added node lablels in web UI. (Wangda Tan via jianhe)
|
||
|
||
YARN-2779. Fixed ResourceManager to not require delegation tokens for
|
||
communicating with Timeline Service. (Zhijie Shen via vinodkv)
|
||
|
||
YARN-2778. Moved node-labels' reports to the yarn nodes CLI from the admin
|
||
CLI. (Wangda Tan via vinodkv)
|
||
|
||
YARN-2770. Added functionality to renew/cancel TimeLineDelegationToken.
|
||
(Zhijie Shen via jianhe)
|
||
|
||
YARN-2818. Removed the now unnecessary user entity injection from Timeline
|
||
service given we now have domains. (Zhijie Shen via vinodkv)
|
||
|
||
YARN-2635. TestRM, TestRMRestart, TestClientToAMTokens should run
|
||
with both CS and FS. (Wei Yan and kasha via kasha)
|
||
|
||
|
||
OPTIMIZATIONS
|
||
|
||
BUG FIXES
|
||
|
||
YARN-2251. Avoid negative elapsed time in JHS/MRAM web UI and services.
|
||
(Zhijie Shen via junping_du)
|
||
|
||
YARN-2088. Fixed a bug in GetApplicationsRequestPBImpl#mergeLocalToBuilder.
|
||
(Binglin Chang via jianhe)
|
||
|
||
YARN-2260. Fixed ResourceManager's RMNode to correctly remember containers
|
||
when nodes resync during work-preserving RM restart. (Jian He via vinodkv)
|
||
|
||
YARN-2264. Fixed a race condition in DrainDispatcher which may cause random
|
||
test failures. (Li Lu via jianhe)
|
||
|
||
YARN-2219. Changed ResourceManager to avoid AMs and NMs getting exceptions
|
||
after RM recovery but before scheduler learns about apps and app-attempts.
|
||
(Jian He via vinodkv)
|
||
|
||
YARN-2244. FairScheduler missing handling of containers for unknown
|
||
application attempts. (Anubhav Dhoot via kasha)
|
||
|
||
YARN-2321. NodeManager web UI can incorrectly report Pmem enforcement
|
||
(Leitao Guo via jlowe)
|
||
|
||
YARN-2273. NPE in ContinuousScheduling thread when we lose a node.
|
||
(Wei Yan via kasha)
|
||
|
||
YARN-2313. Livelock can occur in FairScheduler when there are lots of
|
||
running apps (Tsuyoshi Ozawa via Sandy Ryza)
|
||
|
||
YARN-2147. client lacks delegation token exception details when
|
||
application submit fails (Chen He via jlowe)
|
||
|
||
YARN-1796. container-executor shouldn't require o-r permissions (atm)
|
||
|
||
YARN-2354. DistributedShell may allocate more containers than client
|
||
specified after AM restarts. (Li Lu via jianhe)
|
||
|
||
YARN-2051. Fix bug in PBimpls and add more unit tests with reflection.
|
||
(Binglin Chang via junping_du)
|
||
|
||
YARN-2374. Fixed TestDistributedShell#testDSShell failure due to hostname
|
||
mismatch. (Varun Vasudev via jianhe)
|
||
|
||
YARN-2359. Application hangs when it fails to launch AM container.
|
||
(Zhihai Xu via kasha)
|
||
|
||
YARN-2388. Fixed TestTimelineWebServices failure due to HADOOP-10791. (zjshen)
|
||
|
||
YARN-2008. Fixed CapacityScheduler to calculate headroom based on max available
|
||
capacity instead of configured max capacity. (Craig Welch via jianhe)
|
||
|
||
YARN-2400. Fixed TestAMRestart fails intermittently. (Jian He via xgong)
|
||
|
||
YARN-2361. RMAppAttempt state machine entries for KILLED state have duplicate
|
||
event entries. (Zhihai Xu via kasha)
|
||
|
||
YARN-2070. Made DistributedShell publish the short user name to the timeline
|
||
server. (Robert Kanter via zjshen)
|
||
|
||
YARN-2397. Avoided loading two authentication filters for RM and TS web
|
||
interfaces. (Varun Vasudev via zjshen)
|
||
|
||
YARN-2409. RM ActiveToStandBy transition missing stopping previous rmDispatcher.
|
||
(Rohith via jianhe)
|
||
|
||
YARN-2249. Avoided AM release requests being lost on work preserving RM
|
||
restart. (Jian He via zjshen)
|
||
|
||
YARN-2034. Description for yarn.nodemanager.localizer.cache.target-size-mb
|
||
is incorrect (Chen He via jlowe)
|
||
|
||
YARN-1919. Potential NPE in EmbeddedElectorService#stop.
|
||
(Tsuyoshi Ozawa via kasha)
|
||
|
||
YARN-2424. LCE should support non-cgroups, non-secure mode (Chris Douglas
|
||
via aw)
|
||
|
||
YARN-2434. RM should not recover containers from previously failed attempt
|
||
when AM restart is not enabled (Jian He via jlowe)
|
||
|
||
YARN-2035. FileSystemApplicationHistoryStore should not make working dir
|
||
when it already exists. (Jonathan Eagles via zjshen)
|
||
|
||
YARN-2405. NPE in FairSchedulerAppsBlock. (Tsuyoshi Ozawa via kasha)
|
||
|
||
YARN-2449. Fixed the bug that TimelineAuthenticationFilterInitializer
|
||
is not automatically added when hadoop.http.filter.initializers is not
|
||
configured. (Varun Vasudev via zjshen)
|
||
|
||
YARN-2450. Fix typos in log messages. (Ray Chiang via hitesh)
|
||
|
||
YARN-2447. RM web service app submission doesn't pass secrets correctly.
|
||
(Varun Vasudev via jianhe)
|
||
|
||
YARN-2462. TestNodeManagerResync#testBlockNewContainerRequestsOnStartAndResync
|
||
should have a test timeout (Eric Payne via jlowe)
|
||
|
||
YARN-2431. NM restart: cgroup is not removed for reacquired containers
|
||
(jlowe)
|
||
|
||
YARN-2519. Credential Provider related unit tests failed on Windows.
|
||
(Xiaoyu Yao via cnauroth)
|
||
|
||
YARN-2526. SLS can deadlock when all the threads are taken by AMSimulators.
|
||
(Wei Yan via kasha)
|
||
|
||
YARN-1458. FairScheduler: Zero weight can lead to livelock.
|
||
(Zhihai Xu via kasha)
|
||
|
||
YARN-2459. RM crashes if App gets rejected for any reason
|
||
and HA is enabled. (Jian He and Mayank Bansal via xgong)
|
||
|
||
YARN-2158. Fixed TestRMWebServicesAppsModification#testSingleAppKill test
|
||
failure. (Varun Vasudev via jianhe)
|
||
|
||
YARN-2534. FairScheduler: Potential integer overflow calculating totalMaxShare.
|
||
(Zhihai Xu via kasha)
|
||
|
||
YARN-2541. Fixed ResourceManagerRest.apt.vm table syntax error. (jianhe)
|
||
|
||
YARN-2484. FileSystemRMStateStore#readFile/writeFile should close
|
||
FSData(In|Out)putStream in finally block (Tsuyoshi OZAWA via jlowe)
|
||
|
||
YARN-2456. Possible livelock in CapacityScheduler when RM is recovering apps.
|
||
(Jian He via xgong)
|
||
|
||
YARN-2542. Fixed NPE when retrieving ApplicationReport from TimeLineServer.
|
||
(Zhijie Shen via jianhe)
|
||
|
||
YARN-2528. Relaxed http response split vulnerability protection for the origins
|
||
header and made it accept multiple origins in CrossOriginFilter. (Jonathan
|
||
Eagles via zjshen)
|
||
|
||
YARN-2549. TestContainerLaunch fails due to classpath problem with hamcrest
|
||
classes. (cnauroth)
|
||
|
||
YARN-2529. Generic history service RPC interface doesn't work when service
|
||
authorization is enabled. (Zhijie Shen via jianhe)
|
||
|
||
YARN-2558. Updated ContainerTokenIdentifier#read/write to use
|
||
ContainerId#getContainerId. (Tsuyoshi OZAWA via jianhe)
|
||
|
||
YARN-2559. Fixed NPE in SystemMetricsPublisher when retrieving
|
||
FinalApplicationStatus. (Zhijie Shen via jianhe)
|
||
|
||
YARN-1779. Fixed AMRMClient to handle AMRMTokens correctly across
|
||
ResourceManager work-preserving-restart or failover. (Jian He via vinodkv)
|
||
|
||
YARN-2363. Submitted applications occasionally lack a tracking URL (jlowe)
|
||
|
||
YARN-2561. MR job client cannot reconnect to AM after NM restart. (Junping
|
||
Du via jlowe)
|
||
|
||
YARN-2563. Fixed YarnClient to call getTimeLineDelegationToken only if the
|
||
Token is not present. (Zhijie Shen via jianhe)
|
||
|
||
YARN-2568. Fixed the potential test failures due to race conditions when RM
|
||
work-preserving recovery is enabled. (Jian He via zjshen)
|
||
|
||
YARN-2565. Fixed RM to not use FileSystemApplicationHistoryStore unless
|
||
explicitly set. (Zhijie Shen via jianhe)
|
||
|
||
YARN-2460. Remove obsolete entries from yarn-default.xml (Ray Chiang via
|
||
aw)
|
||
|
||
YARN-2452. TestRMApplicationHistoryWriter fails with FairScheduler.
|
||
(Zhihai Xu via kasha)
|
||
|
||
YARN-2453. TestProportionalCapacityPreemptionPolicy fails with
|
||
FairScheduler. (Zhihai Xu via kasha)
|
||
|
||
YARN-2540. FairScheduler: Queue filters not working on scheduler page in
|
||
RM UI. (Ashwin Shankar via kasha)
|
||
|
||
YARN-2584. TestContainerManagerSecurity fails on trunk. (Jian He via
|
||
junping_du)
|
||
|
||
YARN-2252. Intermittent failure of
|
||
TestFairScheduler.testContinuousScheduling.
|
||
(Ratandeep Ratti and kasha via kasha)
|
||
|
||
YARN-2161. Fix build on macosx: YARN parts (Binglin Chang via aw)
|
||
|
||
YARN-2596. TestWorkPreservingRMRestart fails with FairScheduler. (kasha)
|
||
|
||
YARN-2546. Made REST API for application creation/submission use numeric and
|
||
boolean types instead of the string of them. (Varun Vasudev via zjshen)
|
||
|
||
YARN-2523. ResourceManager UI showing negative value for "Decommissioned
|
||
Nodes" field (Rohith via jlowe)
|
||
|
||
YARN-2608. FairScheduler: Potential deadlocks in loading alloc files and
|
||
clock access. (Wei Yan via kasha)
|
||
|
||
YARN-2606. Application History Server tries to access hdfs before doing
|
||
secure login (Mit Desai via jeagles)
|
||
|
||
YARN-2610. Hamlet should close table tags. (Ray Chiang via kasha)
|
||
|
||
YARN-2387. Resource Manager crashes with NPE due to lack of
|
||
synchronization (Mit Desai via jlowe)
|
||
|
||
YARN-2594. Potential deadlock in RM when querying
|
||
ApplicationResourceUsageReport. (Wangda Tan via kasha)
|
||
|
||
YARN-2602. Fixed possible NPE in ApplicationHistoryManagerOnTimelineStore.
|
||
(Zhijie Shen via jianhe)
|
||
|
||
YARN-2630. Prevented previous AM container status from being acquired by the
|
||
current restarted AM. (Jian He via zjshen)
|
||
|
||
YARN-2617. Fixed NM to not send duplicate container status whose app is not
|
||
running. (Jun Gong via jianhe)
|
||
|
||
YARN-2624. Resource Localization fails on a cluster due to existing cache
|
||
directories (Anubhav Dhoot via jlowe)
|
||
|
||
YARN-2527. Fixed the potential NPE in ApplicationACLsManager and added test
|
||
cases for it. (Benoy Antony via zjshen)
|
||
|
||
YARN-2628. Capacity scheduler with DominantResourceCalculator carries out
|
||
reservation even though slots are free. (Varun Vasudev via jianhe)
|
||
|
||
YARN-2685. Fixed a bug in CommonNodeLabelsManager that caused wrong resource
|
||
tracking per label when a host runs multiple node-managers. (Wangda Tan via
|
||
vinodkv)
|
||
|
||
YARN-2699. Fixed a bug in CommonNodeLabelsManager that caused tests to fail
|
||
when using ephemeral ports on NodeIDs. (Wangda Tan via vinodkv)
|
||
|
||
YARN-2705. Fixed bugs in ResourceManager node-label manager that were causing
|
||
test-failures: added a dummy in-memory labels-manager. (Wangda Tan via
|
||
vinodkv)
|
||
|
||
YARN-2715. Fixed ResourceManager to respect common configurations for proxy
|
||
users/groups beyond just the YARN level config. (Zhijie Shen via vinodkv)
|
||
|
||
YARN-2743. Fixed a bug in ResourceManager that was causing RMDelegationToken
|
||
identifiers to be tampered and thus causing app submission failures in
|
||
secure mode. (Jian He via vinodkv)
|
||
|
||
BREAKDOWN OF YARN-1051 SUBTASKS AND RELATED JIRAS
|
||
|
||
YARN-1707. Introduce APIs to add/remove/resize queues in the
|
||
CapacityScheduler. (Carlo Curino and Subru Krishnan via curino)
|
||
|
||
YARN-2475. Logic for responding to capacity drops for the
|
||
ReservationSystem. (Carlo Curino and Subru Krishnan via curino)
|
||
|
||
YARN-1708. Public YARN APIs for creating/updating/deleting
|
||
reservations. (Subru Krishnan and Carlo Curino via subru)
|
||
|
||
YARN-1709. In-memory data structures used to track resources over
|
||
time to enable reservations. (Subru Krishnan and Carlo Curino via
|
||
subru)
|
||
|
||
YARN-1710. Logic to find allocations within a Plan that satisfy
|
||
user ReservationRequest(s). (Carlo Curino and Subru Krishnan via
|
||
curino)
|
||
|
||
YARN-1711. Policy to enforce instantaneous and over-time quotas
|
||
on user reservations. (Carlo Curino and Subru Krishnan via curino)
|
||
|
||
YARN-1712. Plan follower that synchronizes the current state of reservation
|
||
subsystem with the scheduler. (Subru Krishnan and Carlo Curino via subru)
|
||
|
||
YARN-2080. Integrating reservation system with ResourceManager and
|
||
client-RM protocol. (Subru Krishnan and Carlo Curino via subru)
|
||
|
||
MAPREDUCE-6103. Adding reservation APIs to MR resource manager
|
||
delegate. (Subru Krishnan and Carlo Curino via subru)
|
||
|
||
YARN-2576. Fixing compilation, javadocs and audit issues to pass
|
||
test patch in branch. (Subru Krishnan and Carlo Curino via subru)
|
||
|
||
YARN-2611. Fixing jenkins findbugs warning and TestRMWebServicesCapacitySched
|
||
for branch YARN-1051. (Subru Krishnan and Carlo Curino via subru)
|
||
|
||
YARN-2644. Fixed CapacityScheduler to return up-to-date headroom when
|
||
AM allocates. (Craig Welch via jianhe)
|
||
|
||
YARN-1857. CapacityScheduler headroom doesn't account for other AM's running.
|
||
(Chen He and Craig Welch via jianhe)
|
||
|
||
YARN-2649. Fixed TestAMRMRPCNodeUpdates test failure. (Ming Ma via jianhe)
|
||
|
||
YARN-2662. TestCgroupsLCEResourcesHandler leaks file descriptors. (cnauroth)
|
||
|
||
BREAKDOWN OF YARN-913 SUBTASKS AND RELATED JIRAS
|
||
|
||
YARN-2652. Add hadoop-yarn-registry package under hadoop-yarn. (stevel)
|
||
|
||
YARN-2668. yarn-registry JAR won't link against ZK 3.4.5. (stevel)
|
||
|
||
YARN-2689. TestSecureRMRegistryOperations failing on windows:
|
||
secure ZK won't start (stevel)
|
||
|
||
YARN-2692. ktutil test hanging on some machines/ktutil versions (stevel)
|
||
|
||
YARN-2700. TestSecureRMRegistryOperations failing on windows: auth problems
|
||
(stevel)
|
||
|
||
YARN-2677. registry punycoding of usernames doesn't fix all usernames to be
|
||
DNS-valid (stevel)
|
||
|
||
YARN-2768. Improved Yarn Registry service record structure (stevel)
|
||
|
||
---
|
||
|
||
YARN-2598. GHS should show N/A instead of null for the inaccessible information
|
||
(Zhijie Shen via mayank)
|
||
|
||
YARN-2671. Fixed ApplicationSubmissionContext to still set resource for
|
||
backward compatibility. (Wangda Tan via zjshen)
|
||
|
||
YARN-2667. Fix the release audit warning caused by hadoop-yarn-registry
|
||
(Yi Liu via jlowe)
|
||
|
||
YARN-2651. Spun off LogRollingInterval from LogAggregationContext. (Xuan Gong
|
||
via zjshen)
|
||
|
||
YARN-2377. Localization exception stack traces are not passed as
|
||
diagnostic info (Gera Shegalov via jlowe)
|
||
|
||
YARN-2308. Changed CapacityScheduler to explicitly throw exception if the
|
||
queue to which the apps were submitted is changed across RM restart.
|
||
(Craig Welch & Chang Li via jianhe)
|
||
|
||
YARN-2566. DefaultContainerExecutor should pick a working directory randomly.
|
||
(Zhihai Xu via kasha)
|
||
|
||
YARN-2588. Standby RM fails to transitionToActive if previous
|
||
transitionToActive failed with ZK exception. (Rohith Sharmaks via jianhe)
|
||
|
||
YARN-2701. Potential race condition in startLocalizer when using
|
||
LinuxContainerExecutor. (Xuan Gong via jianhe)
|
||
|
||
YARN-2717. Avoided duplicate logging when container logs are not found. (Xuan
|
||
Gong via zjshen)
|
||
|
||
YARN-2720. Windows: Wildcard classpath variables not expanded against
|
||
resources contained in archives. (Craig Welch via cnauroth)
|
||
|
||
YARN-2721. Suppress NodeExist exception thrown by ZKRMStateStore when it
|
||
retries creating znode. (Jian He via zjshen)
|
||
|
||
YARN-2732. Fixed syntax error in SecureContainer.apt.vm. (Jian He via zjshen)
|
||
|
||
YARN-2724. Skipped uploading a local log file to HDFS if exception is raised
|
||
when opening it. (Xuan Gong via zjshen)
|
||
|
||
YARN-1915. Fixed a race condition that client could use the ClientToAMToken
|
||
to contact with AM before AM actually receives the ClientToAMTokenMasterKey.
|
||
(Jason Lowe via jianhe)
|
||
|
||
YARN-2314. Disable ContainerManagementProtocolProxy cache by default to
|
||
prevent creating thousands of threads in a large cluster. (Jason Lowe via
|
||
jianhe)
|
||
|
||
YARN-2723. Fix rmadmin -replaceLabelsOnNode does not correctly parse port.
|
||
(Naganarasimha G R via xgong)
|
||
|
||
YARN-2734. Skipped sub-folders in the local log dir when aggregating logs.
|
||
(Xuan Gong via zjshen)
|
||
|
||
YARN-2726. CapacityScheduler should explicitly log when an accessible
|
||
label has no capacity. (Wangda Tan via xgong)
|
||
|
||
YARN-2591. Fixed AHSWebServices to return FORBIDDEN(403) if the request user
|
||
doesn't have access to the history data. (Zhijie Shen via jianhe)
|
||
|
||
YARN-2279. Add UTs to cover timeline server authentication.
|
||
(Zhijie Shen via xgong)
|
||
|
||
YARN-2758. Update TestApplicationHistoryClientService to use the new generic
|
||
history store. (Zhijie Shen via xgong)
|
||
|
||
YARN-2741. Made NM web UI serve logs on the drive other than C: on Windows. (Craig
|
||
Welch via zjshen)
|
||
|
||
YARN-2747. Fixed the test failure of TestAggregatedLogFormat when native I/O is
|
||
enabled. (Xuan Gong via zjshen)
|
||
|
||
YARN-2769. Fixed the problem that timeline domain is not set in distributed shell
|
||
AM when using shell_command on Windows. (Varun Vasudev via zjshen)
|
||
|
||
YARN-2755. NM fails to clean up usercache_DEL_<timestamp> dirs after
|
||
YARN-661 (Siqi Li via jlowe)
|
||
|
||
YARN-2698. Moved some node label APIs to be correctly placed in client
|
||
protocol. (Wangda Tan via vinodkv)
|
||
|
||
YARN-2789. Re-instated the NodeReport.newInstance private unstable API
|
||
modified in YARN-2698 so that tests in YARN frameworks don't break. (Wangda
|
||
Tan via vinodkv)
|
||
|
||
YARN-2707. Potential null dereference in FSDownload (Gera Shegalov via
|
||
jlowe)
|
||
|
||
YARN-2711. Fixed TestDefaultContainerExecutor#testContainerLaunchError failure on
|
||
Windows. (Varun Vasudev via zjshen)
|
||
|
||
YARN-2790. Fixed a NodeManager bug that was causing log-aggregation to fail
|
||
beyond HDFS delegation-token expiry even when RM is a proxy-user (YARN-2704).
|
||
(Jian He via vinodkv)
|
||
|
||
YARN-2785. Fixed intermittent TestContainerResourceUsage failure. (Varun Vasudev
|
||
via zjshen)
|
||
|
||
YARN-2730. DefaultContainerExecutor runs only one localizer at a time
|
||
(Siqi Li via jlowe)
|
||
|
||
YARN-2798. Fixed YarnClient to populate the renewer correctly for Timeline
|
||
delegation tokens. (Zhijie Shen via vinodkv)
|
||
|
||
YARN-2788. Fixed backwards compatibility issues with log-aggregation feature
|
||
that were caused when adding log-upload-time via YARN-2703. (Xuan Gong via
|
||
vinodkv)
|
||
|
||
YARN-2795. Fixed ResourceManager to not crash loading node-label data from
|
||
HDFS in secure mode. (Wangda Tan via vinodkv)
|
||
|
||
YARN-1922. Fixed NodeManager to kill process-trees correctly in the presence
|
||
of races between the launch and the stop-container call and when root
|
||
processes crash. (Billie Rinaldi via vinodkv)
|
||
|
||
YARN-2752. Made ContainerExecutor append "nice -n" arg only when priority
|
||
adjustment flag is set. (Xuan Gong via zjshen)
|
||
|
||
YARN-2010. Handle app-recovery failures gracefully.
|
||
(Jian He and Karthik Kambatla via kasha)
|
||
|
||
YARN-2804. Fixed Timeline service to not fill the logs with JAXB bindings
|
||
exceptions. (Zhijie Shen via vinodkv)
|
||
|
||
YARN-2767. Added a test case to verify that http static user cannot kill or submit
|
||
apps in the secure mode. (Varun Vasudev via zjshen)
|
||
|
||
YARN-2805. Fixed ResourceManager to load HA configs correctly before kerberos
|
||
login. (Wangda Tan via vinodkv)
|
||
|
||
YARN-2579. Fixed a deadlock issue when EmbeddedElectorService and
|
||
FatalEventDispatcher try to transition RM to StandBy at the same time.
|
||
(Rohith Sharmaks via jianhe)
|
||
|
||
YARN-2813. Fixed NPE from MemoryTimelineStore.getDomains. (Zhijie Shen via xgong)
|
||
|
||
YARN-2812. TestApplicationHistoryServer is likely to fail on a less powerful machine.
|
||
(Zhijie Shen via xgong)
|
||
|
||
YARN-2744. Fixed CapacityScheduler to validate node-labels correctly against
|
||
queues. (Wangda Tan via vinodkv)
|
||
|
||
YARN-2823. Fixed ResourceManager app-attempt state machine to inform
|
||
schedulers about previous finished attempts of a running application to avoid
|
||
expectation mismatch w.r.t transferred containers. (Jian He via vinodkv)
|
||
|
||
YARN-2810. TestRMProxyUsersConf fails on Windows VMs. (Varun Vasudev via xgong)
|
||
|
||
YARN-2824. Fixed Capacity Scheduler to not crash when some node-labels are
|
||
not mapped to queues by making default capacities per label to be zero.
|
||
(Wangda Tan via vinodkv)
|
||
|
||
YARN-2827. Fixed bugs in "yarn queue" CLI. (Wangda Tan via vinodkv)
|
||
|
||
YARN-2803. MR distributed cache not working correctly on Windows after
|
||
NodeManager privileged account changes. (Craig Welch via cnauroth)
|
||
|
||
YARN-2753. Fixed a bunch of bugs in the NodeLabelsManager classes. (Zhihai Xu
|
||
via vinodkv)
|
||
|
||
YARN-2825. Container leak on NM (Jian He via jlowe)
|
||
|
||
YARN-2819. NPE in ATS Timeline Domains when upgrading from 2.4 to 2.6.
|
||
(Zhijie Shen via xgong)
|
||
|
||
YARN-2826. Fixed user-groups mappings' refresh bug caused by YARN-2826.
|
||
(Wangda Tan via vinodkv)
|
||
|
||
YARN-2607. Fixed issues in TestDistributedShell. (Wangda Tan via vinodkv)
|
||
|
||
YARN-2830. Add backwards compatible ContainerId.newInstance constructor.
|
||
(jeagles via acmurthy)
|
||
|
||
YARN-2834. Fixed ResourceManager to ignore token-renewal failures on recovery
|
||
consistent with the (somewhat incorrect) behaviour in the non-recovery case.
|
||
(Jian He via vinodkv)
|
||
|
||
YARN-2841. RMProxy should retry EOFException. (Jian He via xgong)
|
||
|
||
YARN-2843. Fixed NodeLabelsManager to trim inputs for hosts and labels so
|
||
as to make them work correctly. (Wangda Tan via vinodkv)
|
||
|
||
YARN-2794. Fixed log messages about distributing system-credentials. (Jian He via
|
||
zjshen)
|
||
|
||
YARN-2846. Incorrect persist exit code for running containers in
|
||
reacquireContainer() that is interrupted by NodeManager restart. (Junping Du
|
||
via jlowe)
|
||
|
||
YARN-2853. Fixed a bug in ResourceManager causing apps to hang when the user
|
||
kill request races with ApplicationMaster finish. (Jian He via vinodkv)
|
||
|
||
Release 2.5.2 - 2014-11-19
|
||
|
||
INCOMPATIBLE CHANGES
|
||
|
||
NEW FEATURES
|
||
|
||
IMPROVEMENTS
|
||
|
||
OPTIMIZATIONS
|
||
|
||
BUG FIXES
|
||
|
||
|
||
Release 2.5.1 - 2014-09-05
|
||
|
||
INCOMPATIBLE CHANGES
|
||
|
||
NEW FEATURES
|
||
|
||
IMPROVEMENTS
|
||
|
||
OPTIMIZATIONS
|
||
|
||
BUG FIXES
|
||
|
||
Release 2.5.0 - 2014-08-11
|
||
|
||
INCOMPATIBLE CHANGES
|
||
|
||
NEW FEATURES
|
||
|
||
YARN-1757. NM Recovery. Auxiliary service support. (Jason Lowe via kasha)
|
||
|
||
YARN-1864. Fair Scheduler Dynamic Hierarchical User Queues (Ashwin Shankar
|
||
via Sandy Ryza)
|
||
|
||
YARN-1362. Distinguish between nodemanager shutdown for decommission vs shutdown
|
||
for restart. (Jason Lowe via junping_du)
|
||
|
||
YARN-1338. Recover localized resource cache state upon nodemanager restart
|
||
(Jason Lowe via junping_du)
|
||
|
||
YARN-1368. Added core functionality of recovering container state into
|
||
schedulers after ResourceManager Restart so as to preserve running work in
|
||
the cluster. (Jian He via vinodkv)
|
||
|
||
YARN-1702. Added kill app functionality to RM web services. (Varun Vasudev
|
||
via vinodkv)
|
||
|
||
YARN-1339. Recover DeletionService state upon nodemanager restart. (Jason Lowe
|
||
via junping_du)
|
||
|
||
YARN-1365. Changed ApplicationMasterService to allow an app to re-register
|
||
after RM restart. (Anubhav Dhoot via jianhe)
|
||
|
||
YARN-2052. Embedded an epoch number in container id to ensure the uniqueness
|
||
of container id after RM restarts. (Tsuyoshi OZAWA via jianhe)
|
||
|
||
YARN-1713. Added get-new-app and submit-app functionality to RM web services.
|
||
(Varun Vasudev via vinodkv)
|
||
|
||
YARN-2233. Implemented ResourceManager web-services to create, renew and
|
||
cancel delegation tokens. (Varun Vasudev via vinodkv)
|
||
|
||
YARN-2247. Made RM web services authenticate users via kerberos and delegation
|
||
token. (Varun Vasudev via zjshen)
|
||
|
||
IMPROVEMENTS
|
||
|
||
YARN-1479. Invalid NaN values in Hadoop REST API JSON response (Chen He via
|
||
jeagles)
|
||
|
||
YARN-1736. FS: AppSchedulable.assignContainer's priority argument is
|
||
redundant. (Naren Koneru via kasha)
|
||
|
||
YARN-1678. Fair scheduler gabs incessantly about reservations (Sandy Ryza)
|
||
|
||
YARN-1561. Fix a generic type warning in FairScheduler. (Chen He via junping_du)
|
||
|
||
YARN-1429. *nix: Allow a way for users to augment classpath of YARN daemons.
|
||
(Jarek Jarcec Cecho via kasha)
|
||
|
||
YARN-1520. update capacity scheduler docs to include necessary parameters
|
||
(Chen He via jeagles)
|
||
|
||
YARN-1845. Elapsed time for failed tasks that never started is wrong
|
||
(Rushabh S Shah via jeagles)
|
||
|
||
YARN-1136. Replace junit.framework.Assert with org.junit.Assert (Chen He
|
||
via jeagles)
|
||
|
||
YARN-1889. In Fair Scheduler, avoid creating objects on each call to
|
||
AppSchedulable comparator (Hong Zhiguo via Sandy Ryza)
|
||
|
||
YARN-1923. Make Fair Scheduler resource ratio calculations terminate faster
|
||
(Anubhav Dhoot via Sandy Ryza)
|
||
|
||
YARN-1870. FileInputStream is not closed in ProcfsBasedProcessTree#constructProcessSMAPInfo.
|
||
(Fengdong Yu via junping_du)
|
||
|
||
YARN-1970. Prepare YARN codebase for JUnit 4.11. (cnauroth)
|
||
|
||
YARN-483. Improve documentation on log aggregation in yarn-default.xml
|
||
(Akira Ajisaka via Sandy Ryza)
|
||
|
||
YARN-2036. Document yarn.resourcemanager.hostname in ClusterSetup (Ray
|
||
Chiang via Sandy Ryza)
|
||
|
||
YARN-766. TestNodeManagerShutdown in branch-2 should use Shell to form the output path and a format
|
||
issue in trunk. (Contributed by Siddharth Seth)
|
||
|
||
YARN-1982. Renamed the daemon name to be TimelineServer instead of History
|
||
Server and deprecated the old usage. (Zhijie Shen via vinodkv)
|
||
|
||
YARN-1987. Wrapper for leveldb DBIterator to aid in handling database exceptions.
|
||
(Jason Lowe via kasha)
|
||
|
||
YARN-1751. Improve MiniYarnCluster for log aggregation testing (Ming Ma
|
||
via jlowe)
|
||
|
||
YARN-1981. Nodemanager version is not updated when a node reconnects (Jason
|
||
Lowe via jeagles)
|
||
|
||
YARN-1938. Added kerberos login for the Timeline Server. (Zhijie Shen via
|
||
vinodkv)
|
||
|
||
YARN-2017. Merged some of the common scheduler code. (Jian He via vinodkv)
|
||
|
||
YARN-2049. Added delegation-token support for the Timeline Server. (Zhijie
|
||
Shen via vinodkv)
|
||
|
||
YARN-1936. Added security support for the Timeline Client. (Zhijie Shen via
|
||
vinodkv)
|
||
|
||
YARN-1937. Added owner-only ACLs support for Timeline Client and server.
|
||
(Zhijie Shen via vinodkv)
|
||
|
||
YARN-2012. Fair Scheduler: allow default queue placement rule to take an
|
||
arbitrary queue (Ashwin Shankar via Sandy Ryza)
|
||
|
||
YARN-2059. Added admin ACLs support to Timeline Server. (Zhijie Shen via
|
||
vinodkv)
|
||
|
||
YARN-2073. Fair Scheduler: Add a utilization threshold to prevent preempting
|
||
resources when cluster is free (Karthik Kambatla via Sandy Ryza)
|
||
|
||
YARN-2071. Modified levelDB store permissions to be readable only by the
|
||
server user. (Zhijie Shen via vinodkv)
|
||
|
||
YARN-2107. Refactored timeline classes into o.a.h.y.s.timeline package. (Vinod
|
||
Kumar Vavilapalli via zjshen)
|
||
|
||
YARN-596. Use scheduling policies throughout the queue hierarchy to decide
|
||
which containers to preempt (Wei Yan via Sandy Ryza)
|
||
|
||
YARN-2054. Better defaults for YARN ZK configs for retries and retry-interval
|
||
when HA is enabled. (kasha)
|
||
|
||
YARN-1877. Document yarn.resourcemanager.zk-auth and its scope.
|
||
(Robert Kanter via kasha)
|
||
|
||
YARN-2115. Replaced RegisterNodeManagerRequest's ContainerStatus with a new
|
||
NMContainerStatus which has more information that is needed for
|
||
work-preserving RM-restart. (Jian He via vinodkv)
|
||
|
||
YARN-1474. Make schedulers services. (Tsuyoshi Ozawa via kasha)
|
||
|
||
YARN-1913. With Fair Scheduler, cluster can logjam when all resources are
|
||
consumed by AMs (Wei Yan via Sandy Ryza)
|
||
|
||
YARN-2061. Revisit logging levels in ZKRMStateStore. (Ray Chiang via kasha)
|
||
|
||
YARN-1977. Add tests on getApplicationRequest with filtering start time range. (junping_du)
|
||
|
||
YARN-2122. In AllocationFileLoaderService, the reloadThread should be created
|
||
in init() and started in start(). (Robert Kanter via kasha)
|
||
|
||
YARN-2132. ZKRMStateStore.ZKAction#runWithRetries doesn't log the exception
|
||
it encounters. (Vamsee Yarlagadda via kasha)
|
||
|
||
YARN-2030. Augmented RMStateStore with state machine. (Binglin Chang via jianhe)
|
||
|
||
YARN-1424. RMAppAttemptImpl should return the
|
||
DummyApplicationResourceUsageReport for all invalid accesses.
|
||
(Ray Chiang via kasha)
|
||
|
||
YARN-2091. Add more values to ContainerExitStatus and pass it from NM to
|
||
RM and then to app masters (Tsuyoshi OZAWA via bikas)
|
||
|
||
YARN-2125. Changed ProportionalCapacityPreemptionPolicy to log CSV in debug
|
||
level. (Wangda Tan via jianhe)
|
||
|
||
YARN-2159. Better logging in SchedulerNode#allocateContainer.
|
||
(Ray Chiang via kasha)
|
||
|
||
YARN-2191. Added a new test to ensure NM will clean up completed applications
|
||
in the case of RM restart. (Wangda Tan via jianhe)
|
||
|
||
YARN-2195. Clean a piece of code in ResourceRequest. (Wei Yan via devaraj)
|
||
|
||
YARN-2074. Changed ResourceManager to not count AM preemptions towards app
|
||
failures. (Jian He via vinodkv)
|
||
|
||
YARN-2192. TestRMHA fails when run with a mix of Schedulers.
|
||
(Anubhav Dhoot via kasha)
|
||
|
||
YARN-2109. Fix TestRM to work with both schedulers. (Anubhav Dhoot via kasha)
|
||
|
||
YARN-2072. RM/NM UIs and webservices are missing vcore information.
|
||
(Nathan Roberts via tgraves)
|
||
|
||
YARN-2152. Added missing information into ContainerTokenIdentifier so that
|
||
NodeManagers can report the same to RM when RM restarts. (Jian He via vinodkv)
|
||
|
||
YARN-2171. Improved CapacityScheduling to not lock on nodemanager-count when
|
||
AMs heartbeat in. (Jason Lowe via vinodkv)
|
||
|
||
YARN-614. Changed ResourceManager to not count disk failure, node loss and
|
||
RM restart towards app failures. (Xuan Gong via jianhe)
|
||
|
||
YARN-2224. Explicitly enable vmem check in
|
||
TestContainersMonitor#testContainerKillOnMemoryOverflow.
|
||
(Anubhav Dhoot via kasha)
|
||
|
||
YARN-2022. Preempting an Application Master container can be kept as least priority
|
||
when multiple applications are marked for preemption by
|
||
ProportionalCapacityPreemptionPolicy (Sunil G via mayank)
|
||
|
||
YARN-2241. ZKRMStateStore: On startup, show nicer messages if znodes already
|
||
exist. (Robert Kanter via kasha)
|
||
|
||
YARN-1408. Preemption caused Invalid State Event: ACQUIRED at KILLED and caused
|
||
a task timeout for 30mins. (Sunil G via mayank)
|
||
|
||
YARN-2300. Improved the documentation of the sample requests for RM REST API -
|
||
submitting an app. (Varun Vasudev via zjshen)
|
||
|
||
OPTIMIZATIONS
|
||
|
||
BUG FIXES
|
||
|
||
YARN-1718. Fix a couple isTerminals in Fair Scheduler queue placement rules
|
||
(Sandy Ryza)
|
||
|
||
YARN-1790. Fair Scheduler UI not showing apps table (bc Wong via Sandy Ryza)
|
||
|
||
YARN-1784. TestContainerAllocation assumes CapacityScheduler.
|
||
(Robert Kanter via kasha)
|
||
|
||
YARN-1940. deleteAsUser() terminates early without deleting more files on
|
||
error (Rushabh S Shah via jlowe)
|
||
|
||
YARN-1865. ShellScriptBuilder does not check for some error conditions.
|
||
(Remus Rusanu via ivanmi)
|
||
|
||
YARN-738. TestClientRMTokens is failing irregularly while running all yarn
|
||
tests (Ming Ma via jlowe)
|
||
|
||
YARN-2018. TestClientRMService.testTokenRenewalWrongUser fails after
|
||
HADOOP-10562 (Ming Ma via Arpit Agarwal)
|
||
|
||
YARN-2011. Fix typo and warning in TestLeafQueue (Chen He via junping_du)
|
||
|
||
|
||
YARN-2042. String shouldn't be compared using == in
|
||
QueuePlacementRule#NestedUserQueue#getQueueForApp (Chen He via Sandy Ryza)
|
||
|
||
YARN-2050. Fix LogCLIHelpers to create the correct FileContext (Ming Ma
|
||
via jlowe)
|
||
|
||
YARN-2089. FairScheduler: QueuePlacementPolicy and QueuePlacementRule
|
||
are missing audience annotations. (Zhihai Xu via kasha)
|
||
|
||
YARN-2096. Race in TestRMRestart#testQueueMetricsOnRMRestart.
|
||
(Anubhav Dhoot via kasha)
|
||
|
||
YARN-2105. Fix TestFairScheduler after YARN-2012. (Ashwin Shankar via
|
||
Sandy Ryza)
|
||
|
||
YARN-2112. Fixed yarn-common's pom.xml to include jackson dependencies so
|
||
that both Timeline Server and client can access them. (Zhijie Shen via
|
||
vinodkv)
|
||
|
||
YARN-1868. YARN status web ui does not show correctly in IE 11.
|
||
(Chuan Liu via cnauroth)
|
||
|
||
YARN-2103. Inconsistency between viaProto flag and initial value of
|
||
SerializedExceptionProto.Builder (Binglin Chang via junping_du)
|
||
|
||
YARN-1550. NPE in FairSchedulerAppsBlock#render. (Anubhav Dhoot via kasha)
|
||
|
||
YARN-2119. DEFAULT_PROXY_ADDRESS should use DEFAULT_PROXY_PORT.
|
||
(Anubhav Dhoot via kasha)
|
||
|
||
YARN-2118. Fixed the type mismatch in Map#containsKey check of
|
||
TimelineWebServices#injectOwnerInfo. (Ted Yu via zjshen)
|
||
|
||
YARN-2117. Fixed the issue that secret file reader is potentially not
|
||
closed in TimelineAuthenticationFilterInitializer. (Chen He via zjshen)
|
||
|
||
YARN-2121. Fixed NPE handling in Timeline Server's TimelineAuthenticator.
|
||
(Zhijie Shen via vinodkv)
|
||
|
||
YARN-2128. FairScheduler: Incorrect calculation of amResource usage.
|
||
(Wei Yan via kasha)
|
||
|
||
YARN-2124. Fixed NPE in ProportionalCapacityPreemptionPolicy. (Wangda Tan
|
||
via jianhe)
|
||
|
||
YARN-2148. TestNMClient failed due to more exit code values added and passed
|
||
to AM (Wangda Tan via bikas)
|
||
|
||
YARN-2075. Fixed the test failure of TestRMAdminCLI. (Kenji Kikushima via
|
||
zjshen)
|
||
|
||
YARN-2155. FairScheduler: Incorrect threshold check for preemption.
|
||
(Wei Yan via kasha)
|
||
|
||
YARN-1885. Fixed a bug that RM may not send application-clean-up signal
|
||
to NMs where the completed applications previously ran in case of RM restart.
|
||
(Wangda Tan via jianhe)
|
||
|
||
YARN-2167. LeveldbIterator should get closed in
|
||
NMLeveldbStateStoreService#loadLocalizationState() within finally block
|
||
(Junping Du via jlowe)
|
||
|
||
YARN-2187. FairScheduler: Disable max-AM-share check by default.
|
||
(Robert Kanter via kasha)
|
||
|
||
YARN-2111. In FairScheduler.attemptScheduling, we don't count containers
|
||
as assigned if they have 0 memory but non-zero cores (Sandy Ryza)
|
||
|
||
YARN-2204. TestAMRestart#testAMRestartWithExistingContainers assumes
|
||
CapacityScheduler. (Robert Kanter via kasha)
|
||
|
||
YARN-2163. WebUI: Order of AppId in apps table should be consistent with
|
||
ApplicationId.compareTo(). (Wangda Tan via raviprak)
|
||
|
||
YARN-2104. Scheduler queue filter failed to work because index of queue
|
||
column changed. (Wangda Tan via jlowe)
|
||
|
||
YARN-2201. Made TestRMWebServicesAppsModification be independent of the
|
||
changes on yarn-default.xml. (Varun Vasudev via zjshen)
|
||
|
||
YARN-2065. AM cannot create new containers after restart
|
||
(Jian He via stevel)
|
||
|
||
YARN-2232. Fixed ResourceManager to allow DelegationToken owners to be able
|
||
to cancel their own tokens in secure mode. (Varun Vasudev via vinodkv)
|
||
|
||
YARN-2250. FairScheduler.findLowestCommonAncestorQueue returns null when
|
||
queues not identical (Krisztian Horvath via Sandy Ryza)
|
||
|
||
YARN-2158. Improved assertion messages of TestRMWebServicesAppsModification.
|
||
(Varun Vasudev via zjshen)
|
||
|
||
YARN-2269. Remove External links from YARN UI. (Craig Welch via xgong)
|
||
|
||
YARN-2270. Made TestFSDownload#testDownloadPublicWithStatCache be skipped
|
||
when there are no ancestor permissions. (Akira Ajisaka via zjshen)
|
||
|
||
YARN-2319. Made the MiniKdc instance start/close before/after the class of
|
||
TestRMWebServicesDelegationTokens. (Wenwu Peng via zjshen)
|
||
|
||
YARN-2335. Annotate all hadoop-sls APIs as @Private. (Wei Yan via kasha)
|
||
|
||
YARN-1726. ResourceSchedulerWrapper broken due to AbstractYarnScheduler.
|
||
(Wei Yan via kasha)
|
||
|
||
YARN-2216. TestRMApplicationHistoryWriter sometimes fails in trunk.
|
||
(Zhijie Shen via xgong)
|
||
|
||
Release 2.4.1 - 2014-06-23
|
||
|
||
INCOMPATIBLE CHANGES
|
||
|
||
NEW FEATURES
|
||
|
||
IMPROVEMENTS
|
||
|
||
YARN-1892. Improved some logs in the scheduler. (Jian He via zjshen)
|
||
|
||
YARN-1696. Added documentation for ResourceManager fail-over. (Karthik
|
||
Kambatla, Masatake Iwasaki, Tsuyoshi OZAWA via vinodkv)
|
||
|
||
YARN-1701. Improved default paths of the timeline store and the generic
|
||
history store. (Tsuyoshi Ozawa via zjshen)
|
||
|
||
YARN-1962. Changed Timeline Service client configuration to be off by default
|
||
given the non-readiness of the feature yet. (Mohammad Kamrul Islam via
|
||
vinodkv)
|
||
|
||
OPTIMIZATIONS
|
||
|
||
BUG FIXES
|
||
|
||
YARN-1898. Made Standby RM links conf, stacks, logLevel, metrics, jmx, logs
|
||
and static not be redirected to Active RM. (Xuan Gong via zjshen)
|
||
|
||
YARN-1837. Fixed TestMoveApplication#testMoveRejectedByScheduler failure.
|
||
(Hong Zhiguo via jianhe)
|
||
|
||
YARN-1905. TestProcfsBasedProcessTree must only run on Linux. (cnauroth)
|
||
|
||
YARN-1883. TestRMAdminService fails due to inconsistent entries in
|
||
UserGroups (Mit Desai via jeagles)
|
||
|
||
YARN-1908. Fixed DistributedShell to not fail in secure clusters. (Vinod
|
||
Kumar Vavilapalli and Jian He via vinodkv)
|
||
|
||
YARN-1910. Fixed a race condition in TestAMRMTokens that causes the test to
|
||
fail more often on Windows. (Xuan Gong via vinodkv)
|
||
|
||
YARN-1920. Fixed TestFileSystemApplicationHistoryStore failure on Windows.
|
||
(Vinod Kumar Vavilapalli via zjshen)
|
||
|
||
YARN-1914. Fixed resource-download on NodeManagers to skip permission
|
||
verification of public cache files in Windows+local file-system environment.
|
||
(Varun Vasudev via vinodkv)
|
||
|
||
YARN-1903. Set exit code and diagnostics when container is killed at
|
||
NEW/LOCALIZING state. (Zhijie Shen via jianhe)
|
||
|
||
YARN-1924. Made ZKRMStateStore updateApplication(Attempt)StateInternal work
|
||
when Application(Attempt) state hasn't been stored before. (Jian He via
|
||
zjshen)
|
||
|
||
YARN-1926. Changed DistributedShell to use appIDs as unique identifiers for
|
||
HDFS paths and thus avoid test failures on Windows. (Varun Vasudev via
|
||
vinodkv)
|
||
|
||
YARN-1833. TestRMAdminService Fails in trunk and branch-2 (Mit Desai via
|
||
jeagles)
|
||
|
||
YARN-1907. TestRMApplicationHistoryWriter#testRMWritingMassiveHistory
|
||
intermittently fails. (Mit Desai via kihwal)
|
||
|
||
YARN-1933. Fixed test issues with TestAMRestart and TestNodeHealthService.
|
||
(Jian He via vinodkv)
|
||
|
||
YARN-1928. Fixed a race condition in TestAMRMRPCNodeUpdates which caused it
|
||
to fail occasionally. (Zhijie Shen via vinodkv)
|
||
|
||
YARN-1934. Fixed a potential NPE in ZKRMStateStore caused by handling
|
||
Disconnected event from ZK. (Karthik Kambatla via jianhe)
|
||
|
||
YARN-1931. Private API change in YARN-1824 in 2.4 broke compatibility
|
||
with previous releases (Sandy Ryza via tgraves)
|
||
|
||
YARN-1750. TestNodeStatusUpdater#testNMRegistration is incorrect in test
|
||
case. (Wangda Tan via junping_du)
|
||
|
||
YARN-1947. TestRMDelegationTokens#testRMDTMasterKeyStateOnRollingMasterKey
|
||
is failing intermittently. (Jian He via junping_du)
|
||
|
||
YARN-1281. Fixed TestZKRMStateStoreZKClientConnections to not fail
|
||
intermittently due to ZK-client timeouts. (Tsuyoshi Ozawa via vinodkv)
|
||
|
||
YARN-1932. Javascript injection on the job status page (Mit Desai via
|
||
jlowe)
|
||
|
||
YARN-1975. Used resources shows escaped html in CapacityScheduler and
|
||
FairScheduler page (Mit Desai via jlowe)
|
||
|
||
YARN-1929. Fixed a deadlock in ResourceManager that occurs when failover
|
||
happens right at the time of shutdown. (Karthik Kambatla via vinodkv)
|
||
|
||
YARN-1201. TestAMAuthorization fails with local hostname cannot be resolved.
|
||
(Wangda Tan via junping_du)
|
||
|
||
YARN-1861. Fixed a bug in RM to reset leader-election on fencing that was
|
||
causing both RMs to be stuck in standby mode when automatic failover is
|
||
enabled. (Karthik Kambatla and Xuan Gong via vinodkv)
|
||
|
||
YARN-1957. Consider the max capacity of the queue when computing the ideal
|
||
capacity for preemption. (Carlo Curino via cdouglas)
|
||
|
||
YARN-1986. In Fifo Scheduler, node heartbeat in between creating app and
|
||
attempt causes NPE (Hong Zhiguo via Sandy Ryza)
|
||
|
||
YARN-1976. Fix yarn application CLI to print the scheme of the tracking url
|
||
of failed/killed applications. (Junping Du via jianhe)
|
||
|
||
YARN-2016. Fix a bug in GetApplicationsRequestPBImpl to add the missed fields
|
||
to proto. (Junping Du via jianhe)
|
||
|
||
YARN-2053. Fixed a bug in AMS to not add null NMToken into NMTokens list from
|
||
previous attempts for work-preserving AM restart. (Wangda Tan via jianhe)
|
||
|
||
YARN-2066. Wrong field is referenced in GetApplicationsRequestPBImpl#mergeLocalToBuilder()
|
||
(Hong Zhiguo via junping_du)
|
||
|
||
YARN-2081. Fixed TestDistributedShell failure after YARN-1962. (Zhiguo Hong
|
||
via zjshen)
|
||
|
||
Release 2.4.0 - 2014-04-07
|
||
|
||
INCOMPATIBLE CHANGES
|
||
|
||
NEW FEATURES
|
||
|
||
YARN-930. Bootstrapping ApplicationHistoryService module. (vinodkv)
|
||
|
||
YARN-947. Implementing the data objects to be used by the History reader
|
||
and writer interfaces. (Zhijie Shen via vinodkv)
|
||
|
||
YARN-934. Defined a Writer Interface for HistoryStorage. (Zhijie Shen via
|
||
vinodkv)
|
||
|
||
YARN-925. Defined a Reader Interface for HistoryStorage. (Mayank Bansal via
|
||
vinodkv)
|
||
|
||
YARN-978. Created ApplicationAttemptReport. (Mayank Bansal via vinodkv)
|
||
|
||
YARN-956. Added a testable in-memory HistoryStorage. (Mayank Bansal via
|
||
vinodkv)
|
||
|
||
YARN-975. Added a file-system implementation for HistoryStorage. (Zhijie Shen
|
||
via vinodkv)
|
||
|
||
YARN-1123. Added a new ContainerReport and its Protobuf implementation. (Mayank
|
||
Bansal via vinodkv)
|
||
|
||
YARN-979. Added more APIs for getting information about ApplicationAttempts
|
||
and Containers from ApplicationHistoryProtocol. (Mayank Bansal and Zhijie Shen
|
||
via vinodkv)
|
||
|
||
YARN-953. Changed ResourceManager to start writing history data. (Zhijie Shen
|
||
via vinodkv)
|
||
|
||
YARN-1266. Implemented PB service and client wrappers for
|
||
ApplicationHistoryProtocol. (Mayank Bansal via vinodkv)
|
||
|
||
YARN-955. Implemented ApplicationHistoryProtocol handler. (Mayank Bansal via
|
||
vinodkv)
|
||
|
||
YARN-1242. Changed yarn scripts to be able to start ApplicationHistoryServer
|
||
as an individual process. (Mayank Bansal via vinodkv)
|
||
|
||
YARN-954. Implemented web UI for the ApplicationHistoryServer and wired it into
|
||
the HistoryStorage. (Zhijie Shen via vinodkv)
|
||
|
||
YARN-967. Added the client and CLI interfaces for obtaining ApplicationHistory
|
||
data. (Mayank Bansal via vinodkv)
|
||
|
||
YARN-1023. Added Webservices REST APIs support for Application History. (Zhijie
|
||
Shen via vinodkv)
|
||
|
||
YARN-1413. Implemented serving of aggregated-logs in the ApplicationHistory
|
||
server. (Mayank Bansal via vinodkv)
|
||
|
||
YARN-1633. Defined user-facing entity, entity-info and event objects related
|
||
to Application Timeline feature. (Zhijie Shen via vinodkv)
|
||
|
||
YARN-1611. Introduced the concept of a configuration provider which can be
|
||
used by ResourceManager to read configuration locally or from remote systems
|
||
so as to help RM failover. (Xuan Gong via vinodkv)
|
||
|
||
YARN-1659. Defined the ApplicationTimelineStore store as an abstraction for
|
||
implementing different storage impls for storing timeline information.
|
||
(Billie Rinaldi via vinodkv)
|
||
|
||
YARN-1634. Added a testable in-memory implementation of
|
||
ApplicationTimelineStore. (Zhijie Shen via vinodkv)
|
||
|
||
YARN-1461. Added tags for YARN applications and changed RM to handle them.
|
||
(Karthik Kambatla via zjshen)
|
||
|
||
YARN-1636. Augmented Application-history server's web-services to also expose
|
||
new APIs for retrieving and storing timeline information. (Zhijie Shen via
|
||
vinodkv)
|
||
|
||
YARN-1490. Introduced the ability to make ResourceManager optionally not kill
|
||
all containers when an ApplicationMaster exits. (Jian He via vinodkv)
|
||
|
||
YARN-1041. Added the ApplicationMasterProtocol API for applications to use the
|
||
ability in ResourceManager to optionally not kill containers when the
|
||
ApplicationMaster exits. (Jian He via vinodkv)
|
||
|
||
YARN-1566. Changed Distributed Shell to retain containers across application
|
||
attempts. (Jian He via vinodkv)
|
||
|
||
YARN-1635. Implemented a Leveldb based ApplicationTimelineStore. (Billie
|
||
Rinaldi via zjshen)
|
||
|
||
YARN-1637. Implemented a client library for Java users to post timeline
|
||
entities and events. (zjshen)
|
||
|
||
YARN-1496. Protocol additions to allow moving apps between queues (Sandy
|
||
Ryza)
|
||
|
||
YARN-1498. Common scheduler changes for moving apps between queues (Sandy
|
||
Ryza)
|
||
|
||
YARN-1504. RM changes for moving apps between queues (Sandy Ryza)
|
||
|
||
YARN-1499. Fair Scheduler changes for moving apps between queues (Sandy
|
||
Ryza)
|
||
|
||
YARN-1497. Command line additions for moving apps between queues (Sandy
|
||
Ryza)
|
||
|
||
YARN-1588. Enhanced RM and the scheduling protocol to also send NMTokens of
|
||
transferred containers from previous app-attempts to new AMs after YARN-1490.
|
||
(Jian He via vinodkv)
|
||
|
||
YARN-1717. Enabled periodically discarding old data in LeveldbTimelineStore.
|
||
(Billie Rinaldi via zjshen)
|
||
|
||
YARN-1690. Made DistributedShell send timeline entities+events. (Mayank Bansal
|
||
via zjshen)
|
||
|
||
YARN-1775. Enhanced ProcfsBasedProcessTree to optionally add the ability to
|
||
use smaps for obtaining used memory information. (Rajesh Balamohan via
|
||
vinodkv)
|
||
|
||
YARN-1838. Enhanced timeline service getEntities API to get entities from a
|
||
given entity ID or insertion timestamp. (Billie Rinaldi via zjshen)
|
||
|
||
IMPROVEMENTS
|
||
|
||
YARN-1007. Enhance History Reader interface for Containers. (Mayank Bansal via
|
||
devaraj)
|
||
|
||
YARN-974. Added more information to RMContainer to be collected and recorded in
|
||
Application-History. (Zhijie Shen via vinodkv)
|
||
|
||
YARN-987. Added ApplicationHistoryManager responsible for exposing reports to
|
||
all clients. (Mayank Bansal via vinodkv)
|
||
|
||
YARN-1630. Introduce timeout for async polling operations in YarnClientImpl
|
||
(Aditya Acharya via Sandy Ryza)
|
||
|
||
YARN-1617. Remove ancient comment and surround LOG.debug in
|
||
AppSchedulingInfo.allocate (Sandy Ryza)
|
||
|
||
YARN-1639. Modified RM HA configuration handling to have a way of not
|
||
requiring separate configuration files for each RM. (Xuan Gong via vinodkv)
|
||
|
||
YARN-1668. Modified RM HA handling of admin-acls to be available across RM
|
||
failover by making use of a remote configuration-provider. (Xuan Gong via
|
||
vinodkv)
|
||
|
||
YARN-1667. Modified RM HA handling of super users (with proxying ability) to
|
||
be available across RM failover by making use of a remote
|
||
configuration-provider. (Xuan Gong via vinodkv)
|
||
|
||
YARN-1285. Changed the default value of yarn.acl.enable in yarn-default.xml
|
||
to be consistent with what exists (false) in the code and documentation.
|
||
(Kenji Kikushima via vinodkv)
|
||
|
||
YARN-1669. Modified RM HA handling of protocol level service-ACLS to
|
||
be available across RM failover by making use of a remote
|
||
configuration-provider. (Xuan Gong via vinodkv)
|
||
|
||
YARN-1665. Simplify the configuration of RM HA by having better default
|
||
values. (Xuan Gong via vinodkv)
|
||
|
||
YARN-1660. Simplified the RM HA configuration to accept and be able to simply
|
||
depend just on configuration properties of the form
|
||
yarn.resourcemanager.hostname.RMID and use the default ports for all service
|
||
addresses. (Xuan Gong via vinodkv)
|
||
|
||
YARN-1493. Changed ResourceManager and Scheduler interfacing to recognize
|
||
app-attempts separately from apps. (Jian He via vinodkv)
|
||
|
||
YARN-1459. Changed ResourceManager to depend its service initialization
|
||
on the configuration-provider mechanism during startup too. (Xuan Gong via
|
||
vinodkv)
|
||
|
||
YARN-1706. Created a utility method to dump timeline records to JSON
|
||
strings. (zjshen)
|
||
|
||
YARN-1641. ZK store should attempt a write periodically to ensure it is
|
||
still Active. (kasha)
|
||
|
||
YARN-1531. True up yarn command documentation (Akira Ajisaka via kasha)
|
||
|
||
YARN-1345. Remove FINAL_SAVING state from YarnApplicationAttemptState
|
||
(Zhijie Shen via jianhe)
|
||
|
||
YARN-1676. Modified RM HA handling of user-to-group mappings to
|
||
be available across RM failover by making use of a remote
|
||
configuration-provider. (Xuan Gong via vinodkv)
|
||
|
||
YARN-1666. Modified RM HA handling of include/exclude node-lists to be
|
||
available across RM failover by making use of a remote
|
||
configuration-provider. (Xuan Gong via vinodkv)
|
||
|
||
YARN-1171. Add default queue properties to Fair Scheduler documentation
|
||
(Naren Koneru via Sandy Ryza)
|
||
|
||
YARN-1470. Add audience annotations to MiniYARNCluster. (Anubhav Dhoot
|
||
via kasha)
|
||
|
||
YARN-1732. Changed types of related-entities and primary-filters in the
|
||
timeline-service to be sets instead of maps. (Billie Rinaldi via vinodkv)
|
||
|
||
YARN-1687. Renamed user-facing records for the timeline-service to be simply
|
||
named after 'timeline' instead of 'apptimeline'. (Zhijie Shen via vinodkv)
|
||
|
||
YARN-1749. Updated application-history related configs to reflect the latest
|
||
reality and to be consistently named. (Zhijie Shen via vinodkv)
|
||
|
||
YARN-1301. Added the INFO level log of the non-empty blacklist additions
|
||
and removals inside ApplicationMasterService. (Tsuyoshi Ozawa via zjshen)
|
||
|
||
YARN-1528. Allow setting auth for ZK connections. (kasha)
|
||
|
||
YARN-1704. Modified LICENSE and NOTICE files to reflect newly used levelDB
|
||
related libraries. (Billie Rinaldi via vinodkv)
|
||
|
||
YARN-1765. Added test cases to verify that killApplication API works across
|
||
ResourceManager failover. (Xuan Gong via vinodkv)
|
||
|
||
YARN-1730. Implemented simple write-locking in the LevelDB based timeline-
|
||
store. (Billie Rinaldi via vinodkv)
|
||
|
||
YARN-986. Changed client side to be able to figure out the right RM Delegation
|
||
token for the right ResourceManager when HA is enabled. (Karthik Kambatla via
|
||
vinodkv)
|
||
|
||
YARN-1761. Modified RMAdmin CLI to check whether HA is enabled or not before
|
||
it executes any of the HA admin related commands. (Xuan Gong via vinodkv)
|
||
|
||
YARN-1780. Improved logging in the Timeline client and server. (Zhijie Shen
|
||
via vinodkv)
|
||
|
||
YARN-1525. Web UI should redirect to active RM when HA is enabled. (Cindy Li
|
||
via kasha)
|
||
|
||
YARN-1781. Modified NodeManagers to allow admins to specify max disk
|
||
utilization for local disks so as to be able to offline full disks. (Varun
|
||
Vasudev via vinodkv)
|
||
|
||
YARN-1410. Added tests to validate that clients can fail-over to a new RM
|
||
after getting an application-ID but before submission and can still submit to
|
||
the newly active RM with no issues. (Xuan Gong via vinodkv)
|
||
|
||
YARN-1764. Modified YarnClient to correctly handle failover of ResourceManager
|
||
after the submitApplication call goes through. (Xuan Gong via vinodkv)
|
||
|
||
YARN-1389. Made ApplicationClientProtocol and ApplicationHistoryProtocol
|
||
expose analogous getApplication(s)/Attempt(s)/Container(s) APIs. (Mayank
|
||
Bansal via zjshen)
|
||
|
||
YARN-1658. Modified web-app framework to let standby RMs redirect
|
||
web-service calls to the active RM. (Cindy Li via vinodkv)
|
||
|
||
YARN-1824. Improved NodeManager and clients to be able to handle cross
|
||
platform application submissions. (Jian He via vinodkv)
|
||
|
||
YARN-1512. Enhanced CapacityScheduler to be able to decouple scheduling from
|
||
node-heartbeats. (Arun C Murthy via vinodkv)
|
||
|
||
YARN-1570. Fixed formatting of the lines in YarnCommands.apt.vm docs source.
|
||
(Akira Ajisaka via vinodkv)
|
||
|
||
YARN-1536. Cleanup: Get rid of ResourceManager#get*SecretManager() methods
|
||
and use the RMContext methods instead. (Anubhav Dhoot via kasha)
|
||
|
||
YARN-1850. Introduced the ability to optionally disable sending out timeline-
|
||
events in the TimelineClient. (Zhijie Shen via vinodkv)
|
||
|
||
YARN-1452. Added documentation about the configuration and usage of generic
|
||
application history and the timeline data service. (Zhijie Shen via vinodkv)
|
||
|
||
YARN-1891. Added documentation for NodeManager health-monitoring. (Varun
|
||
Vasudev via vinodkv)
|
||
|
||
YARN-1017. Added documentation for ResourceManager Restart. (jianhe)
|
||
|
||
OPTIMIZATIONS
|
||
|
||
YARN-1771. Reduce the number of NameNode operations during localization of
|
||
public resources using a cache. (Sangjin Lee via cdouglas)
|
||
|
||
BUG FIXES
|
||
|
||
YARN-935. Correcting pom.xml to build applicationhistoryserver module
|
||
successfully. (Zhijie Shen via vinodkv)
|
||
|
||
YARN-962. Fixed bug in application-history proto file and renamed it to be just
|
||
a client proto file. (Zhijie Shen via vinodkv)
|
||
|
||
YARN-984. Renamed the incorrectly named applicationhistoryservice.records.pb.impl
|
||
package to be applicationhistoryservice.records.impl.pb. (Devaraj K via vinodkv)
|
||
|
||
YARN-1534. Fixed failure of test TestAHSWebApp. (Shinichi Yamashita via vinodkv)
|
||
|
||
YARN-1555. Fixed test failures in applicationhistoryservice.* (Vinod Kumar
|
||
Vavilapalli via mayank)
|
||
|
||
YARN-1594. Updated pom.xml of applicationhistoryservice sub-project according to
|
||
YARN-888. (Vinod Kumar Vavilapalli via zjshen)
|
||
|
||
YARN-1596. Fixed Javadoc warnings on branch YARN-321. (Vinod Kumar Vavilapalli
|
||
via zjshen)
|
||
|
||
YARN-1597. Fixed Findbugs warnings on branch YARN-321. (Vinod Kumar Vavilapalli
|
||
via zjshen)
|
||
|
||
YARN-1595. Made enabling history service configurable and fixed test failures on
|
||
branch YARN-321. (Vinod Kumar Vavilapalli via zjshen)
|
||
|
||
YARN-1605. Fixed formatting issues in the new module on branch YARN-321. (Vinod
|
||
Kumar Vavilapalli via zjshen)
|
||
|
||
YARN-1625. Fixed RAT warnings after YARN-321 merge. (Shinichi Yamashita via
|
||
vinodkv)
|
||
|
||
YARN-1613. Fixed the typo with the configuration name
|
||
YARN_HISTORY_SERVICE_ENABLED. (Akira Ajisaka via vinodkv)
|
||
|
||
YARN-1618. Fix invalid RMApp transition from NEW to FINAL_SAVING (kasha)
|
||
|
||
YARN-1600. RM does not startup when security is enabled without spnego
|
||
configured (Haohui Mai via jlowe)
|
||
|
||
YARN-1642. RMDTRenewer#getRMClient should use ClientRMProxy (kasha)
|
||
|
||
YARN-1632. TestApplicationMasterServices should be under
|
||
org.apache.hadoop.yarn.server.resourcemanager package (Chen He via jeagles)
|
||
|
||
YARN-1673. Fix option parsing in YARN's application CLI after it is broken
|
||
by YARN-967. (Mayank Bansal via vinodkv)
|
||
|
||
YARN-1684. Fixed history server heap size in yarn script. (Billie Rinaldi
|
||
via zjshen)
|
||
|
||
YARN-1166. Fixed app-specific and attempt-specific QueueMetrics to be
|
||
triggered by the app event and the attempt event, respectively.
|
||
|
||
YARN-1689. Made RMAppAttempt get killed when RMApp is at ACCEPTED. (Vinod
|
||
Kumar Vavilapalli via zjshen)
|
||
|
||
YARN-1661. Fixed DS ApplicationMaster to write the correct exit log. (Vinod
|
||
Kumar Vavilapalli via zjshen)
|
||
|
||
YARN-1672. YarnConfiguration is missing a default for
|
||
yarn.nodemanager.log.retain-seconds (Naren Koneru via kasha)
|
||
|
||
YARN-1698. Fixed default TimelineStore in code to match what is documented
|
||
in yarn-default.xml (Zhijie Shen via vinodkv)
|
||
|
||
YARN-1697. NodeManager reports negative running containers (Sandy Ryza)
|
||
|
||
YARN-1719. Fixed the root path related Jersey warnings produced in
|
||
ATSWebServices. (Billie Rinaldi via zjshen)
|
||
|
||
YARN-1692. ConcurrentModificationException in fair scheduler AppSchedulable
|
||
(Sangjin Lee via Sandy Ryza)
|
||
|
||
YARN-1578. Fixed reading incomplete application attempt and container data
|
||
in FileSystemApplicationHistoryStore. (Shinichi Yamashita via zjshen)
|
||
|
||
YARN-1417. Modified RM to generate container-tokens not at creation time, but
|
||
at allocation time so as to prevent RM from shelling out containers with
|
||
expired tokens. (Omkar Vinit Joshi and Jian He via vinodkv)
|
||
|
||
YARN-1553. Modified YARN and MR to stop using HttpConfig.isSecure() and
|
||
instead rely on the http policy framework. And also fix some bugs related
|
||
to https handling in YARN web-apps. (Haohui Mai via vinodkv)
|
||
|
||
YARN-1721. When moving app between queues in Fair Scheduler, grab lock on
|
||
FSSchedulerApp (Sandy Ryza)
|
||
|
||
YARN-1724. Race condition in Fair Scheduler when continuous scheduling is
|
||
turned on (Sandy Ryza)
|
||
|
||
YARN-1590. Fixed ResourceManager, web-app proxy and MR JobHistoryServer to
|
||
expand _HOST properly in their kerberos principals. (Mohammad Kamrul Islam
|
||
via vinodkv)
|
||
|
||
YARN-1428. Fixed RM to write the final state of RMApp/RMAppAttempt to the
|
||
application history store in the transition to the final state. (Contributed
|
||
by Zhijie Shen)
|
||
|
||
YARN-713. Fixed ResourceManager to not crash while building tokens when DNS
|
||
issues happen intermittently. (Jian He via vinodkv)
|
||
|
||
YARN-1398. Fixed a deadlock in ResourceManager between users requesting
|
||
queue-acls and completing containers. (vinodkv)
|
||
|
||
YARN-1071. Enabled ResourceManager to recover cluster metrics
|
||
numDecommissionedNMs after restarting. (Jian He via zjshen)
|
||
|
||
YARN-1742. Fixed javadoc of configuration parameter
|
||
DEFAULT_NM_MIN_HEALTHY_DISKS_FRACTION. (Akira Ajisaka via vinodkv)
|
||
|
||
YARN-1686. Fixed NodeManager to properly handle any errors during
|
||
re-registration after a RESYNC and thus avoid hanging. (Rohith Sharma via
|
||
vinodkv)
|
||
|
||
YARN-1734. Fixed ResourceManager to update the configurations when it
|
||
transits from standby to active mode so as to assimilate any changes that
|
||
happened while it was in standby mode. (Xuan Gong via vinodkv)
|
||
|
||
YARN-1760. TestRMAdminService assumes CapacityScheduler. (kasha)
|
||
|
||
YARN-1758. Fixed ResourceManager to not mandate the presence of site specific
|
||
configuration files and thus fix failures in downstream tests. (Xuan Gong via
|
||
vinodkv)
|
||
|
||
YARN-1748. Excluded core-site.xml from hadoop-yarn-server-tests package's jar
|
||
and thus avoid breaking downstream tests. (Sravya Tirukkovalur via vinodkv)
|
||
|
||
YARN-1729. Made TimelineWebServices deserialize the string primary- and
|
||
secondary-filters param into the JSON-compatible object. (Billie Rinaldi via
|
||
zjshen)
|
||
|
||
YARN-1766. Fixed a bug in ResourceManager to use configuration loaded from the
|
||
configuration-provider when booting up. (Xuan Gong via vinodkv)
|
||
|
||
YARN-1768. Fixed error message being too verbose when killing a non-existent
|
||
application. (Tsuyoshi OZAWA via raviprak)
|
||
|
||
YARN-1774. FS: Submitting to non-leaf queue throws NPE. (Anubhav Dhoot and
|
||
Karthik Kambatla via kasha)
|
||
|
||
YARN-1783. Fixed a bug in NodeManager's status-updater that was losing
|
||
completed container statuses when NodeManager is forced to resync by the
|
||
ResourceManager. (Jian He via vinodkv)
|
||
|
||
YARN-1787. Fixed help messages for applicationattempt and container
|
||
sub-commands in bin/yarn. (Zhijie Shen via vinodkv)
|
||
|
||
YARN-1793. Fixed ClientRMService#forceKillApplication not killing unmanaged
|
||
application. (Karthik Kambatla via jianhe)
|
||
|
||
YARN-1788. Fixed a bug in ResourceManager to set the apps-completed and
|
||
apps-killed metrics correctly for killed applications. (Varun Vasudev via
|
||
vinodkv)
|
||
|
||
YARN-1821. NPE on registerNodeManager if the request has containers for
|
||
UnmanagedAMs. (kasha)
|
||
|
||
YARN-1800. Fixed NodeManager to gracefully handle RejectedExecutionException
|
||
in the public-localizer thread-pool. (Varun Vasudev via vinodkv)
|
||
|
||
YARN-1444. Fix CapacityScheduler to deal with cases where applications
|
||
specify host/rack requests without off-switch request. (Wangda Tan via
|
||
acmurthy)
|
||
|
||
YARN-1812. Fixed ResourceManager to synchronously renew tokens after recovery
|
||
and thus recover app itself synchronously and avoid races with resyncing
|
||
NodeManagers. (Jian He via vinodkv)
|
||
|
||
YARN-1816. Fixed ResourceManager to make RMApp correctly handle
|
||
ATTEMPT_FINISHED event at ACCEPTED state that can happen after RM restarts.
|
||
(Jian He via vinodkv)
|
||
|
||
YARN-1789. ApplicationSummary does not escape newlines in the app name
|
||
(Tsuyoshi OZAWA via jlowe)
|
||
|
||
YARN-1830. Fixed TestRMRestart#testQueueMetricsOnRMRestart failure due to
|
||
race condition when app is submitted. (Zhijie Shen via jianhe)
|
||
|
||
YARN-1685. Fixed few bugs related to handling of containers' log-URLs on
|
||
ResourceManager and history-service. (Zhijie Shen via vinodkv)
|
||
|
||
YARN-1206. Fixed AM container log to show on NM web page after application
|
||
finishes if log-aggregation is disabled. (Rohith Sharmaks via jianhe)
|
||
|
||
YARN-1591. Fixed AsyncDispatcher to handle interrupts on shutdown in a sane
|
||
manner and thus fix failure of TestResourceTrackerService. (Tsuyoshi Ozawa
|
||
via vinodkv)
|
||
|
||
YARN-1839. Fixed handling of NMTokens in ResourceManager such that containers
|
||
launched by AMs running on the same machine as the AM are correctly
|
||
propagated. (Jian He via vinodkv)
|
||
|
||
YARN-1640. Fixed manual failover of ResourceManagers to work correctly in
|
||
secure clusters. (Xuan Gong via vinodkv)
|
||
|
||
YARN-1855. Made Application-history server to be optional in MiniYARNCluster
|
||
and thus avoid the failure of TestRMFailover#testRMWebAppRedirect. (Zhijie
|
||
Shen via vinodkv)
|
||
|
||
YARN-1859. Fixed WebAppProxyServlet to correctly handle applications absent
|
||
on the ResourceManager. (Zhijie Shen via vinodkv)
|
||
|
||
YARN-1811. Fixed AMFilters in YARN to correctly accept requests from either
|
||
web-app proxy or the RMs when HA is enabled. (Robert Kanter via vinodkv)
|
||
|
||
YARN-1670. Fixed a bug in log-aggregation that can cause the writer to write
|
||
more log-data than the log-length that it records. (Mit Desai via vinodkv)
|
||
|
||
YARN-1849. Fixed NPE in ResourceTrackerService#registerNodeManager for UAM
|
||
(Karthik Kambatla via jianhe)
|
||
|
||
YARN-1863. Fixed test failure in TestRMFailover after YARN-1859. (Xuan Gong
|
||
via vinodkv)
|
||
|
||
YARN-1854. Fixed test failure in TestRMHA#testStartAndTransitions. (Rohith
|
||
Sharma KS via vinodkv)
|
||
|
||
YARN-1776. Fixed DelegationToken renewal to survive RM failover. (Zhijie
|
||
Shen via jianhe)
|
||
|
||
YARN-1577. Made UnmanagedAMLauncher do launchAM after the attempt reaches
|
||
the LAUNCHED state. (Jian He via zjshen)
|
||
|
||
YARN-1785. FairScheduler treats app lookup failures as ERRORs.
|
||
(bc Wong via kasha)
|
||
|
||
YARN-1752. Fixed ApplicationMasterService to reject unregister request if
|
||
AM did not register before. (Rohith Sharma via jianhe)
|
||
|
||
YARN-1846. TestRM#testNMTokenSentForNormalContainer assumes CapacityScheduler.
|
||
(Robert Kanter via kasha)
|
||
|
||
YARN-1705. Reset cluster-metrics on transition to standby. (Rohith via kasha)
|
||
|
||
YARN-1852. Fixed RMAppAttempt to not resend AttemptFailed/AttemptKilled
|
||
events to already recovered Failed/Killed RMApps. (Rohith via jianhe)
|
||
|
||
YARN-1866. Fixed an issue with renewal of RM-delegation tokens on restart or
|
||
fail-over. (Jian He via vinodkv)
|
||
|
||
YARN-1867. Fixed a bug in ResourceManager that was causing invalid ACL checks
|
||
in the web-services after fail-over. (Vinod Kumar Vavilapalli)
|
||
|
||
YARN-1521. Mark Idempotent/AtMostOnce annotations to the APIs in
|
||
ApplicationClientProtcol, ResourceManagerAdministrationProtocol and
|
||
ResourceTrackerProtocol so that they work in HA scenario. (Xuan Gong
|
||
via jianhe)
|
||
|
||
YARN-1873. Fixed TestDistributedShell failure when the test cases are out of
|
||
order. (Mit Desai via zjshen)
|
||
|
||
YARN-1893. Mark AtMostOnce annotation to ApplicationMasterProtocol#allocate.
|
||
(Xuan Gong via jianhe)
|
||
|
||
Release 2.3.1 - UNRELEASED

INCOMPATIBLE CHANGES

NEW FEATURES

IMPROVEMENTS

OPTIMIZATIONS

BUG FIXES

Release 2.3.0 - 2014-02-18

INCOMPATIBLE CHANGES

NEW FEATURES

YARN-649. Added a new NM web-service to serve container logs in plain text
over HTTP. (Sandy Ryza via vinodkv)

YARN-1021. Yarn Scheduler Load Simulator. (ywskycn via tucu)

YARN-1010. FairScheduler: decouple container scheduling from nodemanager
heartbeats. (Wei Yan via Sandy Ryza)

YARN-1253. Changes to LinuxContainerExecutor to run containers as a single
dedicated user in non-secure mode. (rvs via tucu)

YARN-1027. Implement RMHAProtocolService (Karthik Kambatla via bikas)

YARN-1068. Add admin support for HA operations (Karthik Kambatla via
bikas)

YARN-311. RM/scheduler support for dynamic resource configuration.
(Junping Du via llu)

YARN-1392. Allow sophisticated app-to-queue placement policies in the Fair
Scheduler (Sandy Ryza)

YARN-1447. Common PB type definitions for container resizing. (Wangda Tan
via Sandy Ryza)

YARN-1448. AM-RM protocol changes to support container resizing (Wangda Tan
via Sandy Ryza)

YARN-312. Introduced ResourceManagerAdministrationProtocol changes to support
changing resources on node. (Junping Du via vinodkv)

YARN-1028. Added FailoverProxyProvider capability to ResourceManager to help
with RM failover. (Karthik Kambatla via vinodkv)

YARN-1029. Added embedded leader election in the ResourceManager. (Karthik
Kambatla via vinodkv)

YARN-1033. Expose RM active/standby state to Web UI and REST API (kasha)

IMPROVEMENTS

YARN-305. Fair scheduler logs too many "Node offered to app" messages.
(Lohit Vijayarenu via Sandy Ryza)

YARN-1258. Allow configuring the Fair Scheduler root queue (Sandy Ryza)

YARN-1288. Make Fair Scheduler ACLs more user friendly (Sandy Ryza)

YARN-1315. TestQueueACLs should also test FairScheduler (Sandy Ryza)

YARN-1335. Move duplicate code from FSSchedulerApp and FiCaSchedulerApp
into SchedulerApplication (Sandy Ryza)

YARN-1333. Support blacklisting in the Fair Scheduler (Tsuyoshi Ozawa via
Sandy Ryza)

YARN-1109. Demote NodeManager "Sending out status for container" logs to
debug (haosdent via Sandy Ryza)

YARN-1321. Changed NMTokenCache to support both singleton and instance
usage. (Alejandro Abdelnur via vinodkv)

YARN-1388. Fair Scheduler page always displays blank fair share (Liyin Liang
via Sandy Ryza)

YARN-7. Support CPU resource for DistributedShell. (Junping Du via llu)

YARN-905. Add state filters to nodes CLI (Wei Yan via Sandy Ryza)

YARN-1098. Separate out RM services into Always On and Active (Karthik
Kambatla via bikas)

YARN-353. Add Zookeeper-based store implementation for RMStateStore.
(Bikas Saha, Jian He and Karthik Kambatla via hitesh)

YARN-819. ResourceManager and NodeManager should check for a minimum allowed
version (Robert Parker via jeagles)

YARN-425. coverage fix for yarn api (Aleksey Gorshkov via jeagles)

YARN-1199. Make NM/RM Versions Available (Mit Desai via jeagles)

YARN-1232. Configuration to support multiple RMs (Karthik Kambatla via
bikas)

YARN-465. fix coverage org.apache.hadoop.yarn.server.webproxy (Aleksey
Gorshkov and Andrey Klochkov via jlowe)

YARN-976. Document the meaning of a virtual core. (Sandy Ryza)

YARN-1182. MiniYARNCluster creates and inits the RM/NM only on start()
(Karthik Kambatla via Sandy Ryza)

HADOOP-9598. Improve code coverage of RMAdminCLI (Aleksey Gorshkov and
Andrey Klochkov via jeagles)

YARN-1306. Clean up hadoop-sls sample-conf according to YARN-1228 (Wei Yan
via Sandy Ryza)

YARN-891. Modified ResourceManager state-store to remember completed
applications so that clients can get information about them post RM-restart.
(Jian He via vinodkv)

YARN-1290. Let continuous scheduling achieve more balanced task assignment
(Wei Yan via Sandy Ryza)

YARN-786. Expose application resource usage in RM REST API (Sandy Ryza)

YARN-1323. Set HTTPS webapp address along with other RPC addresses in HAUtil
(Karthik Kambatla via Sandy Ryza)

YARN-1121. Changed ResourceManager's state-store to drain all events on
shut-down. (Jian He via vinodkv)

YARN-1387. RMWebServices should use ClientRMService for filtering
applications (Karthik Kambatla via Sandy Ryza)

YARN-1222. Make improvements in ZKRMStateStore for fencing (Karthik
Kambatla via bikas)

YARN-709. Added tests to verify validity of delegation tokens and logging of
appsummary after RM restart. (Jian He via vinodkv)

YARN-1210. Changed RM to start new app-attempts on RM restart only after
ensuring that previous AM exited or after expiry time. (Omkar Vinit Joshi via
vinodkv)

YARN-674. Fixed ResourceManager to renew DelegationTokens on submission
asynchronously to work around potential slowness in state-store. (Omkar Vinit
Joshi via vinodkv)

YARN-584. In scheduler web UIs, queues unexpand on refresh. (Harshit
Daga via Sandy Ryza)

YARN-1303. Fixed DistributedShell to not fail with multiple commands separated
by a semi-colon as shell-command. (Xuan Gong via vinodkv)

YARN-1423. Support queue placement by secondary group in the Fair Scheduler
(Ted Malaska via Sandy Ryza)

YARN-1314. Fixed DistributedShell to not fail with multiple arguments for a
shell command separated by spaces. (Xuan Gong via vinodkv)

YARN-1239. Modified ResourceManager state-store implementations to start
storing version numbers. (Jian He via vinodkv)

YARN-1241. In Fair Scheduler, maxRunningApps does not work for non-leaf
queues. (Sandy Ryza)

YARN-1318. Promoted AdminService to an Always-On service and merged it into
RMHAProtocolService. (Karthik Kambatla via vinodkv)

YARN-1332. In TestAMRMClient, replace assertTrue with assertEquals where
possible (Sebastian Wong via Sandy Ryza)

YARN-1403. Separate out configuration loading from QueueManager in the Fair
Scheduler (Sandy Ryza)

YARN-1181. Augment MiniYARNCluster to support HA mode (Karthik Kambatla)

YARN-546. Allow disabling the Fair Scheduler event log (Sandy Ryza)

YARN-807. When querying apps by queue, iterating over all apps is
inefficient and limiting (Sandy Ryza)

YARN-1378. Implemented a cleaner of old finished applications from the RM
state-store. (Jian He via vinodkv)

YARN-1481. Move internal services logic from AdminService to ResourceManager.
(vinodkv via kasha)

YARN-1491. Upgrade JUnit3 TestCase to JUnit 4 (Chen He via jeagles)

YARN-408. Change CapacityScheduler to not disable delay-scheduling by default.
(Mayank Bansal via vinodkv)

YARN-1325. Modified RM HA configuration validation to also ensure that
multiple RMs are configured. (Xuan Gong via vinodkv)

YARN-1311. Fixed app specific scheduler-events' names to be app-attempt
based. (vinodkv via jianhe)

YARN-1485. Modified RM HA configuration validation to also ensure that
service-address configurations are present for every RM. (Xuan Gong via
vinodkv)

YARN-1435. Modified Distributed Shell to accept either the command or the
custom script. (Xuan Gong via zjshen)

YARN-1446. Changed client API to retry killing application till RM
acknowledges so as to account for RM crashes/failover. (Jian He via vinodkv)

YARN-1307. Redesign znode structure for Zookeeper based RM state-store for
better organization and scalability. (Tsuyoshi OZAWA via vinodkv)

YARN-1172. Convert SecretManagers in RM to services (Tsuyoshi OZAWA via kasha)

YARN-1523. Use StandbyException instead of RMNotYetReadyException (kasha)

YARN-1541. Changed ResourceManager to invalidate ApplicationMaster host/port
information once an AM crashes. (Jian He via vinodkv)

YARN-1482. Modified WebApplicationProxy to make it work across ResourceManager
fail-over. (Xuan Gong via vinodkv)

YARN-1568. Rename clusterid to clusterId in ActiveRMInfoProto (kasha)

YARN-1579. ActiveRMInfoProto fields should be optional (kasha)

YARN-888. Cleaned up POM files so that non-leaf modules don't include any
dependencies and thus compact the dependency list for leaf modules.
(Alejandro Abdelnur via vinodkv)

YARN-1567. In Fair Scheduler, allow empty queues to change between leaf and
parent on allocation file reload (Sandy Ryza)

YARN-1616. RMFatalEventDispatcher should log the cause of the event (kasha)

YARN-1624. QueuePlacementPolicy format is not easily readable via a JAXB
parser (Aditya Acharya via Sandy Ryza)

YARN-1623. Include queue name in RegisterApplicationMasterResponse (Sandy
Ryza)

YARN-1573. ZK store should use a private password for root-node-acls.
(kasha).

OPTIMIZATIONS

BUG FIXES

YARN-1284. LCE: Race condition leaves dangling cgroups entries for killed
containers. (Alejandro Abdelnur via Sandy Ryza)

YARN-1283. Fixed RM to give a fully-qualified proxy URL for an application
so that clients don't need to do scheme-mangling. (Omkar Vinit Joshi via
vinodkv)

YARN-879. Fixed tests w.r.t o.a.h.y.server.resourcemanager.Application.
(Junping Du via devaraj)

YARN-1265. Fair Scheduler chokes on unhealthy node reconnect (Sandy Ryza)

YARN-1044. used/min/max resources do not display info in the scheduler page
(Sangjin Lee via Sandy Ryza)

YARN-1259. In Fair Scheduler web UI, queue num pending and num active apps
switched. (Robert Kanter via Sandy Ryza)

YARN-1295. In UnixLocalWrapperScriptBuilder, using bash -c can cause Text
file busy errors (Sandy Ryza)

YARN-1185. Fixed FileSystemRMStateStore to not leave partial files that
prevent subsequent ResourceManager recovery. (Omkar Vinit Joshi via vinodkv)

YARN-1331. yarn.cmd exits with NoClassDefFoundError trying to run rmadmin or
logs. (cnauroth)

YARN-1330. Fair Scheduler: defaultQueueSchedulingPolicy does not take effect
(Sandy Ryza)

YARN-1022. Unnecessary INFO logs in AMRMClientAsync (haosdent via bikas)

YARN-1349. yarn.cmd does not support passthrough to any arbitrary class.
(cnauroth)

YARN-1357. TestContainerLaunch.testContainerEnvVariables fails on Windows.
(Chuan Liu via cnauroth)

YARN-1358. TestYarnCLI fails on Windows due to line endings. (Chuan Liu via
cnauroth)

YARN-1343. NodeManagers additions/restarts are not reported as node updates
in AllocateResponse responses to AMs. (tucu)

YARN-1381. Same relaxLocality appears twice in exception message of
AMRMClientImpl#checkLocalityRelaxationConflict() (Ted Yu via Sandy Ryza)

YARN-1407. RM Web UI and REST APIs should uniformly use
YarnApplicationState (Sandy Ryza)

YARN-1438. Ensure container diagnostics includes exception from container
launch. (stevel via acmurthy)

YARN-1138. yarn.application.classpath is set to point to $HADOOP_CONF_DIR
etc., which does not work on Windows. (Chuan Liu via cnauroth)

YARN-461. Fair scheduler should not accept apps with empty string queue name.
(ywskycn via tucu)

YARN-1060. Two tests in TestFairScheduler are missing @Test annotation
(Niranjan Singh via Sandy Ryza)

YARN-1188. The context of QueueMetrics becomes default when using
FairScheduler (Tsuyoshi Ozawa via Sandy Ryza)

YARN-1268. TestFairScheduler.testContinuousScheduling is flaky (Sandy Ryza)

YARN-1300. SLS tests fail because conf puts YARN properties in
fair-scheduler.xml (Ted Yu via Sandy Ryza)

YARN-1183. MiniYARNCluster shutdown takes several minutes intermittently
(Andrey Klochkov via jeagles)

YARN-1305. RMHAProtocolService#serviceInit should handle HAUtil's
IllegalArgumentException (Tsuyoshi Ozawa via bikas)

YARN-1374. Changed ResourceManager to start the preemption policy monitors
as active services. (Karthik Kambatla via vinodkv)

YARN-1395. Distributed shell application master launched with debug flag can
hang waiting for external ls process. (cnauroth)

YARN-1400. yarn.cmd uses HADOOP_RESOURCEMANAGER_OPTS. Should be
YARN_RESOURCEMANAGER_OPTS. (Raja Aluri via cnauroth)

YARN-1401. With zero sleep-delay-before-sigkill.ms, no signal is ever sent
(Gera Shegalov via Sandy Ryza)

YARN-1411. HA config shouldn't affect NodeManager RPC addresses (Karthik
Kambatla via bikas)

YARN-1419. TestFifoScheduler.testAppAttemptMetrics fails intermittently
under jdk7 (Jonathan Eagles via jlowe)

YARN-744. Race condition in ApplicationMasterService.allocate: it might
process the same allocate request twice, resulting in additional containers
getting allocated. (Omkar Vinit Joshi via bikas)

YARN-1425. TestRMRestart fails because MockRM.waitForState(AttemptId) uses
current attempt instead of the attempt passed as argument (Omkar Vinit
Joshi via bikas)

YARN-1053. Diagnostic message from ContainerExitEvent is ignored in
ContainerImpl (Omkar Vinit Joshi via bikas)

YARN-1320. Fixed Distributed Shell application to respect custom log4j
properties file. (Xuan Gong via vinodkv)

YARN-1416. Fixed a few invalid transitions in RMApp, RMAppAttempt and in some
tests. (Jian He via vinodkv)

YARN-895. Changed RM state-store to not crash immediately if RM restarts while
the state-store is down. (Jian He via vinodkv)

YARN-1454. Fixed test failure issue with TestRMRestart. (Karthik Kambatla
via vinodkv)

YARN-1450. Fixed test failure in TestUnmanagedAMLauncher by removing its
dependency on distributed-shell. (Binglin Chang via vinodkv)

YARN-1405. Fixed ResourceManager to not hang when init/start fails with an
exception w.r.t state-store. (Jian He via vinodkv)

YARN-1505. Fixed Webapplication proxy server to not hardcode its bind
address. (Xuan Gong via vinodkv)

YARN-1145. Fixed a potential file-handle leak in the web interface for
displaying aggregated logs. (Rohith Sharma via vinodkv)

YARN-1451. TestResourceManager relies on the scheduler assigning multiple
containers in a single node update. (Sandy Ryza via kasha)

YARN-1527. Fix yarn rmadmin command to print the correct usage info.
(Akira AJISAKA via jianhe)

YARN-1522. Fixed a race condition in the test TestApplicationCleanup that was
causing it to randomly fail. (Liyin Liang via vinodkv)

YARN-1549. Fixed a bug in ResourceManager's ApplicationMasterService that
was causing unmanaged AMs to not finish correctly. (haosdent via vinodkv)

YARN-1559. Race between ServerRMProxy and ClientRMProxy setting
RMProxy#INSTANCE. (kasha and vinodkv via kasha)

YARN-1560. Fixed TestYarnClient#testAMMRTokens failure with null AMRM token.
(Ted Yu via jianhe)

YARN-1409. NonAggregatingLogHandler can throw RejectedExecutionException
(Tsuyoshi OZAWA via jlowe)

YARN-1293. Fixed TestContainerLaunch#testInvalidEnvSyntaxDiagnostics failure
caused by non-English system locale. (Tsuyoshi OZAWA via jianhe)

YARN-1574. RMDispatcher should be reset on transition to standby. (Xuan Gong
via kasha)

YARN-1598. HA-related rmadmin commands don't work on a secure cluster (kasha)

YARN-1603. Remove two *.orig files which were unexpectedly committed.
(Zhijie Shen via junping_du)

YARN-1601. 3rd party JARs are missing from hadoop-dist output. (tucu)

YARN-1351. Invalid string format in Fair Scheduler log warn message
(Konstantin Weitz via Sandy Ryza)

YARN-1608. LinuxContainerExecutor has a few DEBUG messages at INFO level
(kasha)

YARN-1606. Fix the default value of yarn.resourcemanager.zk-timeout-ms
in yarn-default.xml (kasha)

YARN-1607. TestRM relies on the scheduler assigning multiple containers in
a single node update (Sandy Ryza)

YARN-1575. Public localizer crashes with "Localized unkown resource"
(jlowe)

YARN-1629. IndexOutOfBoundsException in MaxRunningAppsEnforcer (Sandy Ryza)

YARN-1628. Fixed the test failure in TestContainerManagerSecurity. (Vinod
Kumar Vavilapalli via zjshen)

Release 2.2.0 - 2013-10-13

INCOMPATIBLE CHANGES

YARN-1229. Define constraints on Auxiliary Service names. Change
ShuffleHandler service name from mapreduce.shuffle to
mapreduce_shuffle (Xuan Gong via sseth)

NEW FEATURES

IMPROVEMENTS

YARN-1246. Added application finish-status to ApplicationSummary for the sake
of testing given ApplicationHistoryServer is not yet ready. (Arpit Gupta via
vinodkv)

YARN-899. Added back queue level administrator-acls so that there is no
regression w.r.t 1.x. (Xuan Gong via vinodkv)

YARN-1228. Clean up Fair Scheduler configuration loading. (Sandy Ryza)

YARN-1213. Restore config to ban submitting to undeclared pools in the
Fair Scheduler. (Sandy Ryza)

YARN-1277. Added a policy based configuration for http/https in common
HttpServer and using the same in YARN - related to per project https config
support via HADOOP-10022. (Suresh Srinivas and Omkar Vinit Joshi via vinodkv)

OPTIMIZATIONS

BUG FIXES

YARN-1128. FifoPolicy.computeShares throws NPE on empty list of Schedulables
(Karthik Kambatla via Sandy Ryza)

YARN-1214. Register ClientToken MasterKey in SecretManager after it is
saved (Jian He via bikas)

YARN-49. Improve distributed shell application to work on a secure cluster.
(Vinod Kumar Vavilapalli via hitesh)

YARN-1157. Fixed ResourceManager UI to behave correctly when apps like
distributed-shell do not set tracking urls. (Xuan Gong via vinodkv)

YARN-1221. With Fair Scheduler, reserved MB reported in RM web UI increases
indefinitely (Siqi Li via Sandy Ryza)

YARN-1247. test-container-executor has gotten out of sync with the changes to
container-executor. (rvs via tucu)

YARN-1070. Fixed race conditions in NodeManager during container-kill.
(Zhijie Shen via vinodkv)

YARN-1215. Yarn URL should include userinfo. (Chuan Liu via cnauroth)

YARN-1262. TestApplicationCleanup relies on all containers assigned in a
single heartbeat (Karthik Kambatla via Sandy Ryza)

YARN-1260. Added webapp.http.address to yarn-default.xml so that default
install with https enabled doesn't have broken link on NM UI. (Omkar Vinit
Joshi via vinodkv)

YARN-1141. Updating resource requests should be decoupled from updating
blacklist (Zhijie Shen via bikas)

YARN-876. Node resource is added twice when node comes back from unhealthy
to healthy. (Peng Zhang via Sandy Ryza)

YARN-890. Ensure CapacityScheduler doesn't round-up metric for available
resources. (Xuan Gong & Hitesh Shah via acmurthy)

YARN-621. Changed YARN web app to not add paths that can cause duplicate
additions of authenticated filters, thereby causing Kerberos replay errors.
(Omkar Vinit Joshi via vinodkv)

YARN-1236. FairScheduler setting queue name in RMApp is not working.
(Sandy Ryza)

YARN-1256. NM silently ignores non-existent service in
StartContainerRequest (Xuan Gong via bikas)

YARN-1149. NM throws InvalidStateTransitonException: Invalid event:
APPLICATION_LOG_HANDLING_FINISHED at RUNNING (Xuan Gong via hitesh)

YARN-1271. "Text file busy" errors launching containers again
(Sandy Ryza)

YARN-1131. $yarn logs command should return an appropriate error message if
YARN application is still running. (Siddharth Seth via hitesh)

YARN-1219. FSDownload changes file suffix making FileUtil.unTar() throw
exception. (Shanyu Zhao via cnauroth)

YARN-1251. TestDistributedShell#TestDSShell failed with timeout. (Xuan Gong
via hitesh)

YARN-1167. Fixed Distributed Shell to not incorrectly show empty hostname
on RM UI. (Xuan Gong via vinodkv)

YARN-1254. Fixed NodeManager to not pollute container's credentials. (Omkar
Vinit Joshi via vinodkv)

YARN-1273. Fixed Distributed-shell to account for containers that failed
to start. (Hitesh Shah via vinodkv)

YARN-1032. Fixed NPE in RackResolver. (Lohit Vijayarenu via acmurthy)

YARN-1090. Fixed CS UI to better reflect applications as non-schedulable
and not as pending. (Jian He via acmurthy)

YARN-1274. Fixed NodeManager's LinuxContainerExecutor to create user, app-dir
and log-dirs correctly even when there are no resources to localize for the
container. (Siddharth Seth via vinodkv)

YARN-1278. Fixed NodeManager to not delete local resources for apps on resync
command from RM - a bug caused by YARN-1149. (Hitesh Shah via vinodkv)

YARN-1463. Tests should avoid starting http-server where possible or create
spnego keytab/principals (vinodkv via kasha)

Release 2.1.1-beta - 2013-09-23

INCOMPATIBLE CHANGES

YARN-707. Added user information also in the YARN ClientToken so that AMs
can implement authorization based on incoming users. (Jason Lowe via vinodkv)

YARN-1170. YARN & MapReduce proto definitions fixed to specify protobuf
package as hadoop.yarn and hadoop.mapreduce respectively. (Binglin Chang
via acmurthy)

NEW FEATURES

IMPROVEMENTS

YARN-589. Expose a REST API for monitoring the fair scheduler (Sandy Ryza).

YARN-1074. Cleaned up YARN CLI application list to only display running
applications by default. (Xuan Gong via vinodkv)

YARN-1093. Corrections to Fair Scheduler documentation (Wing Yew Poon via
Sandy Ryza)

YARN-942. In Fair Scheduler documentation, inconsistency on which
properties have prefix (Akira Ajisaka via Sandy Ryza)

YARN-1083. Changed ResourceManager to fail when the expiry interval is less
than the configured node-heartbeat interval. (Zhijie Shen via vinodkv)

YARN-1081. Made a trivial change to YARN node CLI header to avoid potential
confusion. (Akira AJISAKA via vinodkv)

YARN-1034. Remove "experimental" in the Fair Scheduler documentation.
(Karthik Kambatla via Sandy Ryza)

YARN-1080. Improved help message for "yarn logs" command. (Xuan Gong via
vinodkv)

YARN-771. AMRMClient support for resource blacklisting (Junping Du via
bikas)

YARN-1117. Improved help messages for "yarn application" and "yarn node"
commands. (Xuan Gong via vinodkv)

YARN-1120. Made ApplicationConstants.Environment.USER definition OS neutral
as the corresponding value is now set correctly end-to-end. (Chuan Liu via
vinodkv)

YARN-1124. Modified YARN CLI application list to display new and submitted
applications together with running apps by default, following up YARN-1074.
(Xuan Gong via vinodkv)

YARN-1065. NM should provide AuxiliaryService data to the container (Xuan
Gong via bikas)

YARN-758. Augment MockNM to use multiple cores (Karthik Kambatla via
Sandy Ryza)

YARN-696. Changed RMWebservice apps call to take in multiple application
states. (Trevor Lorimer via vinodkv)

YARN-910. Augmented auxiliary services to listen for container starts and
completions in addition to application events. (Alejandro Abdelnur via
vinodkv)

YARN-1137. Add whitelist support for system users to YARN
container-executor.c. (rvs via tucu)

YARN-1001. Added a web-service to get statistics about per application-type
per state for consumption by downstream projects. (Zhijie Shen via vinodkv)

YARN-1203. Changed YARN web-app proxy to handle http and https URLs from
AM registration and finish correctly. (Omkar Vinit Joshi via vinodkv)

YARN-1204. Added separate configuration properties for https for RM and NM
without which servers enabled with https will also start on http ports.
(Omkar Vinit Joshi via vinodkv)

OPTIMIZATIONS

BUG FIXES

YARN-948. Changed ResourceManager to validate the release container list
before actually releasing them. (Omkar Vinit Joshi via vinodkv)

YARN-966. Fixed ContainerLaunch to not fail quietly when there are no
localized resources due to some other failure. (Zhijie Shen via vinodkv)

YARN-502. Fixed a state machine issue with RMNode inside ResourceManager
which was crashing scheduler. (Mayank Bansal via vinodkv)

YARN-573. Shared data structures in Public Localizer and Private Localizer
are not thread safe. (Omkar Vinit Joshi via jlowe)

YARN-903. Changed ContainerManager to suppress unnecessary warnings when
stopping already stopped containers. (Omkar Vinit Joshi via vinodkv)

YARN-906. Fixed a bug in NodeManager where cancelling ContainerLaunch at
KILLING state causes the container to hang. (Zhijie Shen via vinodkv)

YARN-994. HeartBeat thread in AMRMClientAsync does not handle runtime
exception correctly (Xuan Gong via bikas)

YARN-337. RM handles killed application tracking URL poorly (jlowe)

YARN-107. Fixed ResourceManager and clients to better handle
forceKillApplication on non-running and finished applications. (Xuan Gong
via vinodkv)

YARN-643. Fixed ResourceManager to remove all tokens consistently on app
finish. (Xuan Gong via vinodkv)

YARN-1006. Fixed broken rendering in the Nodes list web page on the RM web
UI. (Xuan Gong via vinodkv)

YARN-881. Priority#compareTo method seems to be wrong. (Jian He via bikas)

YARN-1082. Create base directories on HDFS after RM login to ensure RM
recovery doesn't fail in secure mode. (vinodkv via acmurthy)

YARN-1085. Modified YARN and MR2 web-apps to do HTTP authentication in
secure setup with kerberos. (Omkar Vinit Joshi via vinodkv)

YARN-1094. Fixed a blocker with RM restart code because of which RM crashes
when trying to recover an existing app. (vinodkv)

YARN-1008. MiniYARNCluster with multiple nodemanagers, all nodes have same
key for allocations. (tucu)

YARN-981. Fixed YARN webapp so that /logs servlet works like before. (Jian He
via vinodkv)

YARN-602. Fixed NodeManager to not let users override some mandatory
environmental variables. (Kenji Kikushima via vinodkv)

YARN-1101. Active nodes can be decremented below 0 (Robert Parker
via tgraves)

YARN-1077. Fixed TestContainerLaunch test failure on Windows. (Chuan Liu via
vinodkv)

YARN-957. Fixed a bug in CapacityScheduler because of which requests that
need more than a node's total capability were incorrectly allocated on that
node causing apps to hang. (Omkar Vinit Joshi via vinodkv)

YARN-1107. Fixed a bug in ResourceManager because of which RM in secure mode
fails to restart. (Omkar Vinit Joshi via vinodkv)

YARN-1049. ContainerExitStatus should define a status for preempted
containers. (tucu)

YARN-1144. Unmanaged AMs registering a tracking URI should not be
proxy-fied. (tucu)

YARN-1152. Fixed a bug in ResourceManager that was causing clients to get
invalid client token key errors when an application is about to finish.
(Jason Lowe via vinodkv)

YARN-292. Fixed FifoScheduler and FairScheduler to make their applications
data structures thread safe to avoid RM crashing with
ArrayIndexOutOfBoundsException. (Zhijie Shen via vinodkv)

YARN-1025. ResourceManager and NodeManager do not load native libraries on
Windows. (cnauroth)

YARN-1176. RM web services ClusterMetricsInfo total nodes doesn't include
unhealthy nodes (Jonathan Eagles via tgraves)

YARN-1078. TestNodeManagerResync, TestNodeManagerShutdown, and
TestNodeStatusUpdater fail on Windows. (Chuan Liu via cnauroth)

YARN-1194. TestContainerLogsPage fails with native builds (Roman Shaposhnik
via jlowe)

YARN-1116. Populate AMRMTokens back to AMRMTokenSecretManager after RM
restarts (Jian He via bikas)

YARN-1189. NMTokenSecretManagerInNM is not being told when applications
have finished (Omkar Vinit Joshi via jlowe)

YARN-540. Race condition causing RM to potentially relaunch already
unregistered AMs on RM restart (Jian He via bikas)

YARN-1184. ClassCastException during preemption enforcement. (cdouglas)

Release 2.1.0-beta - 2013-08-22
|
||
|
||
INCOMPATIBLE CHANGES
|
||
|
||
YARN-396. Rationalize AllocateResponse in RM Scheduler API. (Zhijie Shen
|
||
via hitesh)
|
||
|
||
YARN-439. Flatten NodeHeartbeatResponse. (Xuan Gong via sseth)
|
||
|
||
YARN-440. Flatten RegisterNodeManagerResponse. (Xuan Gong via sseth)
|
||
|
||
YARN-536. Removed the unused objects ContainerStatus and ContainerStatus from
|
||
Container which also don't belong to the container. (Xuan Gong via vinodkv)
|
||
|
||
YARN-486. Changed NM's startContainer API to accept Container record given by
|
||
RM as a direct parameter instead of as part of the ContainerLaunchContext
|
||
record. (Xuan Gong via vinodkv)
|
||
|
||
YARN-444. Moved special container exit codes from YarnConfiguration to API
|
||
where they belong. (Sandy Ryza via vinodkv)
|
||
|
||
YARN-441. Removed unused utility methods for collections from two API
|
||
records. (Xuan Gong via vinodkv)
|
||
|
||
YARN-561. Modified NodeManager to set key information into the environment
|
||
of every container that it launches. (Xuan Gong via vinodkv)
|
||
|
||
YARN-579. Stop setting the Application Token in the AppMaster env, in
|
||
favour of the copy present in the container token field.
|
||
(Vinod Kumar Vavilapalli via sseth)
|
||
|
||
YARN-629. Make YarnRemoteException not be rooted at IOException. (Xuan Gong
|
||
via vinodkv)
|
||
|
||
YARN-633. Changed RMAdminProtocol api to throw IOException and
|
||
YarnRemoteException. (Xuan Gong via vinodkv)
|
||
|
||
YARN-632. Changed ContainerManager api to throw IOException and
|
||
YarnRemoteException. (Xuan Gong via vinodkv)
|
||
|
||
YARN-631. Changed ClientRMProtocol api to throw IOException and
|
||
YarnRemoteException. (Xuan Gong via vinodkv)
|
||
|
||
YARN-630. Changed AMRMProtocol api to throw IOException and
|
||
YarnRemoteException. (Xuan Gong via vinodkv)
|
||
|
||
YARN-615. Rename ContainerLaunchContext.containerTokens to tokens.
|
||
(Vinod Kumar Vavilapalli via sseth)
|
||
|
||
YARN-571. Remove user from ContainerLaunchContext. (Omkar Vinit Joshi via
|
||
vinodkv)
|
||
|
||
YARN-716. Making ApplicationID immutable. (Siddharth Seth via vinodkv)
|
||
|
||
YARN-684. ContainerManager.startContainer should use
|
||
ContainerTokenIdentifier instead of the entire Container.
|
||
(Vinod Kumar Vavilapalli via sseth)
|
||
|
||
YARN-735. Make ApplicationAttemptId, ContainerId and NodeId immutable.
|
||
(Jian He via sseth)
|
||
|
||
YARN-749. Rename ResourceRequest.(get,set)HostName to
|
||
ResourceRequest.(get,set)ResourceName. (acmurthy)
|
||
|
||
YARN-720. container-log4j.properties should not refer to mapreduce
|
||
property names. (Zhijie Shen via sseth)
|
||
|
||
YARN-748. Moved BuilderUtils from yarn-common to yarn-server-common for
|
||
eventual retirement. (Jian He via vinodkv)
|
||
|
||
YARN-635. Renamed YarnRemoteException to YarnException. (Siddharth Seth via
|
||
vinodkv)
|
||
|
||
YARN-755. Renamed AllocateResponse.reboot to AllocateResponse.resync. (Bikas
|
||
Saha via vinodkv)
|
||
|
||
YARN-753. Added individual factory methods for all api protocol records and
|
||
converted the records to be abstract classes. (Jian He via vinodkv)
|
||
|
||
YARN-724. Moved ProtoBase from api.records to api.records.impl.pb. (Jian He
|
||
via vinodkv)
|
||
|
||
YARN-759. Create Command enum in AllocateResponse (bikas)
|
||
|
||
YARN-777. Removed unreferenced objects from .proto files. (Jian He via
|
||
vinodkv)
|
||
|
||
YARN-642. Removed health parameter from ResourceManager /nodes web-service
|
||
and cleaned the behaviour of the status parameter. (Sandy Ryza via vinodkv)
|
||
|
||
YARN-530. Defined Service model strictly, implemented AbstractService for
|
||
robust subclassing and migrated yarn-common services. (Steve Loughran via
|
||
vinodkv)
|
||
|
||
YARN-746. Renamed Service.register() and Service.unregister() to
|
||
registerServiceListener() & unregisterServiceListener() respectively.
|
||
(Steve Loughran via vinodkv)
|
||
|
||
YARN-792. Moved NodeHealthStatus from yarn.api.record to
|
||
yarn.server.api.record. (Jian He via vinodkv)
|
||
|
||
YARN-806. Moved ContainerExitStatus from yarn.api to yarn.api.records. (Jian
|
||
He via vinodkv)
|
||
|
||
YARN-821. Renamed setFinishApplicationStatus to setFinalApplicationStatus in
|
||
FinishApplicationMasterRequest for consistency. (Jian He via vinodkv)
|
||
|
||
YARN-787. Removed minimum resource from RegisterApplicationMasterResponse.
|
||
(tucu via acmurthy)
|
||
|
||
YARN-829. Renamed RMTokenSelector to be RMDelegationTokenSelector. (Zhijie
|
||
Shen via vinodkv)
|
||
|
||
YARN-828. Removed the unused YarnVersionAnnotation. (Zhijie Shen via vinodkv)
|
||
|
||
YARN-823. Moved RMAdmin from yarn.client to yarn.client.cli and renamed it to
|
||
be RMAdminCLI. (Jian He via vinodkv)
|
||
|
||
YARN-387. Renamed YARN protocols for consistency.
|
||
ClientRMProtocol -> ApplicationClientProtocol
|
||
AMRMProtocol -> ApplicationMasterProtocol
|
||
ContainerManager -> ContainerManagementProtocol
|
||
(vinodkv via acmurthy)
|
||
|
||
YARN-831. Removed minimum resource from GetNewApplicationResponse as a
|
||
follow-up to YARN-787. (Jian He via acmurthy)
|
||
|
||
YARN-824. Added static factory methods to hadoop-yarn-client interfaces.
|
||
(Jian He via acmurthy)
|
||
|
||
YARN-826. Moved Clock and SystemClock into yarn.util package. (Zhijie Shen
|
||
via vinodkv)
|
||
|
||
YARN-837. Moved yarn.ClusterInfo into MapReduce project as it doesn't belong
|
||
to YARN. (Zhijie Shen via vinodkv)
|
||
|
||
YARN-822. Renamed ApplicationToken to be AMRMToken, and similarly the
|
||
corresponding TokenSelector and SecretManager. (Omkar Vinit Joshi via vinodkv)
|
||
|
||
YARN-610. ClientToken is no longer set in the environment of the Containers.
|
||
(Omkar Vinit Joshi via vinodkv)
|
||
|
||
YARN-834. Fixed annotations for yarn-client module, reorganized packages and
|
||
clearly differentiated *Async apis. (Arun C Murthy and Zhijie Shen via
|
||
vinodkv)
|
||
|
||
YARN-840. Moved ProtoUtils to yarn.api.records.impl.pb. (Jian He via
|
||
acmurthy)
|
||
|
||
YARN-841. Move Auxiliary service to yarn-api, annotate and document it.
|
||
(vinodkv)
|
||
|
||
YARN-850. Rename getClusterAvailableResources to getAvailableResources in
|
||
AMRMClients (Jian He via bikas)
|
||
|
||
YARN-694. Starting to use NMTokens to authenticate all communication with
|
||
NodeManagers. (Omkar Vinit Joshi via vinodkv)
|
||
|
||
YARN-553. Replaced YarnClient.getNewApplication with
|
||
YarnClient.createApplication which provides a directly usable
|
||
ApplicationSubmissionContext to simplify the api. (Karthik Kambatla via
|
||
acmurthy)
|
||
|
||
YARN-851. Share NMTokens using NMTokenCache (api-based) between AMRMClient
|
||
and NMClient instead of memory based approach which is used currently. (Omkar
|
||
Vinit Joshi via vinodkv)
|
||
|
||
YARN-869. Move ResourceManagerAdministrationProtocol out of main YARN api.
|
||
(vinodkv via acmurthy)
|
||
|
||
YARN-791. Changed RM APIs and web-services related to nodes to ensure that
|
||
both are consistent with each other. (Sandy Ryza via vinodkv)
|
||
|
||
YARN-727. ClientRMProtocol.getAllApplications should accept ApplicationType as
|
||
a parameter. (Xuan Gong via hitesh)
|
||
|
||
YARN-701. Use application tokens irrespective of secure or non-secure
|
||
mode. (vinodkv via acmurthy)
|
||
|
||
YARN-918. Remove ApplicationAttemptId from
|
||
RegisterApplicationMasterRequestProto. (vinodkv via acmurthy)
|
||
|
||
YARN-926. Modified ContainerManagementProtocol APIs to take in requests for
|
||
multiple containers. (Jian He via vinodkv)
|
||
|
||
NEW FEATURES
|
||
|
||
YARN-482. FS: Extend SchedulingMode to intermediate queues.
|
||
(kkambatl via tucu)
|
||
|
||
YARN-45. Add protocol for schedulers to request containers back from
|
||
ApplicationMasters. (Carlo Curino, cdouglas)
|
||
|
||
YARN-563. Add the concept of an application-type for each application.
|
||
(Mayank Bansal via vinodkv)
|
||
|
||
HADOOP-8562. Enhancements to support Hadoop on Windows Server and Windows
|
||
Azure environments. (See breakdown of tasks below for subtasks and
|
||
contributors)
|
||
|
||
YARN-422. Add a NM Client library to help application-writers. (Zhijie Shen
|
||
via vinodkv)
|
||
|
||
YARN-392. Make it possible to specify hard locality constraints in resource
|
||
requests. (sandyr via tucu)
|
||
|
||
YARN-326. Add multi-resource scheduling to the fair scheduler.
|
||
(sandyr via tucu)
|
||
|
||
YARN-398. Make it possible to specify hard locality constraints in resource
|
||
requests for CapacityScheduler. (acmurthy)
|
||
|
||
YARN-781. Exposing LOGDIR in all containers' environment which should be used
|
||
by containers for logging purposes. (Jian He via vinodkv)
|
||
|
||
IMPROVEMENTS
|
||
|
||
YARN-347. Node CLI should show CPU info besides memory in node status.
|
||
(Junping Du via llu)
|
||
|
||
YARN-365. Change NM heartbeat handling to not generate a scheduler event
|
||
on each heartbeat. (Xuan Gong via sseth)
|
||
|
||
YARN-380. Fix yarn node -status output to be more readable. (Omkar Vinit
|
||
Joshi via vinodkv)
|
||
|
||
YARN-410. Fixed RM UI so that new lines in the diagnostics for a failed app on
|
||
the per-application page are translated to html line breaks. (Omkar Vinit
|
||
Joshi via vinodkv)
|
||
|
||
YARN-198. Added a link to RM pages from the NodeManager web app. (Jian He
|
||
via vinodkv)
|
||
|
||
YARN-237. Refreshing the RM page forgets how many rows I had in my
|
||
Datatables (Jian He via bobby)
|
||
|
||
YARN-481. Add AM Host and RPC Port to ApplicationCLI Status Output
|
||
(Chris Riccomini via bikas)
|
||
|
||
YARN-297. Improve hashCode implementations for PB records. (Xuan Gong via
|
||
hitesh)
|
||
|
||
YARN-417. Create AMRMClient wrapper that provides asynchronous callbacks.
|
||
(Sandy Ryza via bikas)
|
||
|
||
YARN-497. Yarn unmanaged-am launcher jar does not define a main class in
|
||
its manifest (Hitesh Shah via bikas)
|
||
|
||
YARN-469. Make scheduling mode in FS pluggable. (kkambatl via tucu)
|
||
|
||
YARN-450. Define value for * in the scheduling protocol (Zhijie Shen via
|
||
bikas)
|
||
|
||
YARN-475. Remove an unused constant in the public API -
|
||
ApplicationConstants.AM_APP_ATTEMPT_ID_ENV. (Hitesh Shah via vinodkv)
|
||
|
||
YARN-309. Changed NodeManager to obtain heart-beat interval from the
|
||
ResourceManager. (Xuan Gong via vinodkv)
|
||
|
||
YARN-447. Move ApplicationComparator in CapacityScheduler to use comparator
|
||
in ApplicationId. (Nemon Lou via vinodkv)
|
||
|
||
YARN-381. Improve fair scheduler docs. (Sandy Ryza via tomwhite)
|
||
|
||
YARN-458. YARN daemon addresses must be placed in many different configs.
|
||
(sandyr via tucu)
|
||
|
||
YARN-193. Scheduler.normalizeRequest does not account for allocation
|
||
requests that exceed maximumAllocation limits (Zhijie Shen via bikas)
|
||
|
||
YARN-479. NM retry behavior for connection to RM should be similar for
|
||
lost heartbeats (Jian He via bikas)
|
||
|
||
YARN-495. Changed NM reboot behaviour to be a simple resync - kill all
|
||
containers and re-register with RM. (Jian He via vinodkv)
|
||
|
||
YARN-514. Delayed store operations should not result in RM unavailability
|
||
for app submission (Zhijie Shen via bikas)
|
||
|
||
YARN-586. Fixed a typo in ApplicationSubmissionContext#setApplicationId.
|
||
(Zhijie Shen via vinodkv)
|
||
|
||
YARN-542. Changed the default global AM max-attempts value to be not one.
|
||
(Zhijie Shen via vinodkv)
|
||
|
||
YARN-583. Moved application level local resources to be localized under the
|
||
filecache sub-directory under application directory. (Omkar Vinit Joshi via
|
||
vinodkv)
|
||
|
||
YARN-581. Added a test to verify that app delegation tokens are restored
|
||
after RM restart. (Jian He via vinodkv)
|
||
|
||
YARN-577. Add application-progress also to ApplicationReport. (Hitesh Shah
|
||
via vinodkv)
|
||
|
||
YARN-595. Refactor fair scheduler to use common Resources. (Sandy Ryza
|
||
via tomwhite)
|
||
|
||
YARN-562. Modified NM to reject any containers allocated by a previous
|
||
ResourceManager. (Jian He via vinodkv)
|
||
|
||
YARN-591. Moved RM recovery related records out of public API as they do not
|
||
belong there. (vinodkv)
|
||
|
||
YARN-599. Refactoring submitApplication in ClientRMService and RMAppManager
|
||
to separate out various validation checks depending on whether they rely on
|
||
RM configuration or not. (Zhijie Shen via vinodkv)
|
||
|
||
YARN-618. Modified RM_INVALID_IDENTIFIER to be -1 instead of zero. (Jian He
|
||
via vinodkv)
|
||
|
||
YARN-625. Move the utility method unwrapAndThrowException from
|
||
YarnRemoteExceptionPBImpl to RPCUtil. (Siddharth Seth via vinodkv)
|
||
|
||
YARN-645. Moved RMDelegationTokenSecretManager from yarn-server-common to
|
||
yarn-server-resourcemanager where it really belongs. (Jian He via vinodkv)
|
||
|
||
YARN-651. Changed PBClientImpls of ContainerManager and RMAdmin to throw
|
||
IOExceptions also. (Xuan Gong via vinodkv)
|
||
|
||
YARN-582. Changed ResourceManager to recover Application token and client
|
||
tokens for app attempt so that RM can be restarted while preserving current
|
||
applications. (Jian He via vinodkv)
|
||
|
||
YARN-568. Add support for work preserving preemption to the FairScheduler.
|
||
(Carlo Curino and Sandy Ryza via cdouglas)
|
||
|
||
YARN-598. Add virtual cores to queue metrics. (sandyr via tucu)
|
||
|
||
YARN-634. Modified YarnRemoteException to be not backed by PB and introduced
|
||
a separate SerializedException record. (Siddharth Seth via vinodkv)
|
||
|
||
YARN-663. Changed ResourceTracker API and LocalizationProtocol API to throw
|
||
YarnRemoteException and IOException. (Xuan Gong via vinodkv)
|
||
|
||
YARN-590. Added an optional message to be returned by ResourceManager when RM
|
||
asks an NM to shutdown/resync etc so that NMs can log this message locally
|
||
for better debuggability. (Mayank Bansal via vinodkv)
|
||
|
||
YARN-617. Made ContainerTokens to be used for validation at NodeManager
|
||
also in unsecure mode to prevent AMs from faking resource requirements in
|
||
unsecure mode. (Omkar Vinit Joshi via vinodkv)
|
||
|
||
YARN-708. Moved RecordFactory classes to hadoop-yarn-api, and put some
|
||
miscellaneous fixes to the interfaces. (Siddharth Seth via vinodkv)
|
||
|
||
YARN-711. Copied BuilderUtil methods in individual API records as
|
||
BuilderUtils is going to be dismantled. (Jian He via vinodkv)
|
||
|
||
YARN-714. Added NMTokens to be sent to AMs as part of heart-beat response.
|
||
(Omkar Vinit Joshi via vinodkv)
|
||
|
||
YARN-638. Modified ResourceManager to restore RMDelegationTokens after
|
||
restarting. (Jian He via vinodkv)
|
||
|
||
YARN-660. Improve AMRMClient with matching requests (bikas)
|
||
|
||
YARN-717. Put object creation factories for Token in the class itself and
|
||
remove useless derivations for specific tokens. (Jian He via vinodkv)
|
||
|
||
YARN-756. Move Preemption* records to yarn.api where they really belong.
|
||
(Jian He via vinodkv)
|
||
|
||
YARN-750. Allow for black-listing resources in YARN API and Impl in CS
|
||
(acmurthy via bikas)
|
||
|
||
YARN-877. Support resource blacklisting for FifoScheduler.
|
||
(Junping Du via llu)
|
||
|
||
YARN-686. Flatten NodeReport. (sandyr via tucu)
|
||
|
||
YARN-737. Throw some specific exceptions directly instead of wrapping them
|
||
in YarnException. (Jian He via sseth)
|
||
|
||
YARN-731. RPCUtil.unwrapAndThrowException should unwrap remote
|
||
RuntimeExceptions. (Zhijie Shen via sseth)
|
||
|
||
YARN-600. Hook up cgroups CPU settings to the number of virtual cores
|
||
allocated. (sandyr via tucu)
|
||
|
||
YARN-648. FS: Add documentation for pluggable policy. (kkambatl via tucu)
|
||
|
||
YARN-773. Moved YarnRuntimeException from package api.yarn to
|
||
api.yarn.exceptions. (Jian He via vinodkv)
|
||
|
||
YARN-692. Creating NMToken master key on RM and sharing it with NM as a part
|
||
of RM-NM heartbeat. (Omkar Vinit Joshi via vinodkv)
|
||
|
||
YARN-782. vcores-pcores ratio functions differently from vmem-pmem ratio in
|
||
misleading way. (sandyr via tucu)
|
||
|
||
YARN-803. factor out scheduler config validation from the ResourceManager
|
||
to each scheduler implementation. (tucu)
|
||
|
||
YARN-789. Enable zero capabilities resource requests in fair scheduler.
|
||
(tucu)
|
||
|
||
YARN-639. Modified Distributed Shell application to start using the new
|
||
NMClient library. (Zhijie Shen via vinodkv)
|
||
|
||
YARN-693. Modified RM to send NMTokens on allocate call so that AMs can then
|
||
use them for authentication with NMs. (Omkar Vinit Joshi via vinodkv)
|
||
|
||
YARN-752. In AMRMClient, automatically add corresponding rack requests for
|
||
requested nodes. (sandyr via tucu)
|
||
|
||
YARN-825. Fixed javadoc and annotations for yarn-common module. (vinodkv)
|
||
|
||
YARN-833. Moved Graph and VisualizeStateMachine into yarn.state package.
|
||
(Zhijie Shen via vinodkv)
|
||
|
||
YARN-805. Fix javadoc and annotations on classes in the yarn-api
|
||
package. (Jian He via sseth)
|
||
|
||
YARN-846. Move pb Impl classes from yarn-api to yarn-common. (Jian He via
|
||
vinodkv)
|
||
|
||
YARN-827. Need to make Resource arithmetic methods accessible (Jian He via
|
||
bikas)
|
||
|
||
YARN-866. Add test for class ResourceWeights. (ywskycn via tucu)
|
||
|
||
YARN-736. Add a multi-resource fair sharing metric. (sandyr via tucu)
|
||
|
||
YARN-883. Expose Fair Scheduler-specific queue metrics. (sandyr via tucu)
|
||
|
||
YARN-569. Add support for requesting and enforcing preemption requests via
|
||
a capacity monitor. (Carlo Curino, cdouglas)
|
||
|
||
YARN-521. Augment AM - RM client module to be able to request containers
|
||
only at specific locations (Sandy Ryza via bikas)
|
||
|
||
YARN-513. Create common proxy client for communicating with RM. (Xuan Gong
|
||
& Jian He via bikas)
|
||
|
||
YARN-927. Change ContainerRequest to not have more than 1 container count
|
||
and remove StoreContainerRequest (bikas)
|
||
|
||
YARN-922. Change FileSystemRMStateStore to use directories (Jian He via
|
||
bikas)
|
||
|
||
YARN-865. RM webservices can't query based on application Types. (Xuan Gong
|
||
via hitesh)
|
||
|
||
YARN-912. Move client facing exceptions to yarn-api module. (Mayank Bansal
|
||
via vinodkv)
|
||
|
||
YARN-84. Use Builder to build RPC server. (Brandon Li via szetszwo)
|
||
|
||
YARN-1046. Disable mem monitoring by default in MiniYARNCluster. (Karthik
|
||
Kambatla via Sandy Ryza)
|
||
|
||
YARN-1045. Improve toString implementation for PBImpls. (Jian He via sseth)
|
||
|
||
OPTIMIZATIONS
|
||
|
||
YARN-512. Log aggregation root directory check is more expensive than it
|
||
needs to be. (Maysam Yabandeh via jlowe)
|
||
|
||
YARN-719. Move RMIdentifier from Container to ContainerTokenIdentifier.
|
||
(Vinod Kumar Vavilapalli via sseth)
|
||
|
||
BUG FIXES
|
||
|
||
YARN-383. AMRMClientImpl should handle null rmClient in stop()
|
||
(Hitesh Shah via sseth)
|
||
|
||
YARN-385. Add missing fields - location and #containers to
|
||
ResourceRequestPBImpl's toString(). (Sandy Ryza via sseth)
|
||
|
||
YARN-377. Use the new StringUtils methods added by HADOOP-9252 and fix
|
||
TestContainersMonitor. (Chris Nauroth via szetszwo)
|
||
|
||
YARN-391. Formatting fixes for LCEResourceHandler classes.
|
||
(Steve Loughran via sseth)
|
||
|
||
YARN-390. ApplicationCLI and NodeCLI hard-coded platform-specific line
|
||
separator causes test failures on Windows. (Chris Nauroth via suresh)
|
||
|
||
YARN-406. Fix TestRackResolver to function in networks where "host1"
|
||
resolves to a valid host. (Hitesh Shah via sseth)
|
||
|
||
YARN-376. Fixes a bug which would prevent the NM knowing about completed
|
||
containers and applications. (Jason Lowe via sseth)
|
||
|
||
YARN-196. Nodemanager should be more robust in handling connection failure
|
||
to ResourceManager when a cluster is started (Xuan Gong via hitesh)
|
||
|
||
YARN-485. TestProcfsProcessTree#testProcessTree() doesn't wait long enough
|
||
for the process to die. (kkambatl via tucu)
|
||
|
||
YARN-71. Fix the NodeManager to clean up local-dirs on restart.
|
||
(Xuan Gong via sseth)
|
||
|
||
YARN-378. Fix RM to make the AM max attempts/retries to be configurable
|
||
per application by clients. (Zhijie Shen via vinodkv)
|
||
|
||
YARN-498. Unmanaged AM launcher does not set various constants in env for
|
||
an AM, also does not handle failed AMs properly. (Hitesh Shah via bikas)
|
||
|
||
YARN-496. Fair scheduler configs are refreshed inconsistently in
|
||
reinitialize. (Sandy Ryza via tomwhite)
|
||
|
||
YARN-474. Fix CapacityScheduler to trigger application-activation when
|
||
am-resource-percent configuration is refreshed. (Zhijie Shen via vinodkv)
|
||
|
||
YARN-209. Fix CapacityScheduler to trigger application-activation when
|
||
the cluster capacity changes. (Zhijie Shen via vinodkv)
|
||
|
||
YARN-24. Nodemanager fails to start if log aggregation enabled and
|
||
namenode unavailable. (sandyr via tucu)
|
||
|
||
YARN-515. Node Manager not getting the master key. (Robert Joseph Evans
|
||
via jlowe)
|
||
|
||
YARN-382. SchedulerUtils: improve the way normalizeRequest sets the resource
|
||
capabilities. (Zhijie Shen via bikas)
|
||
|
||
YARN-467. Modify public distributed cache to localize files such that no
|
||
local directory hits unix file count limits and thus prevent job failures.
|
||
(Omkar Vinit Joshi via vinodkv)
|
||
|
||
YARN-101. Fix NodeManager heartbeat processing to not lose track of completed
|
||
containers in case of dropped heartbeats. (Xuan Gong via vinodkv)
|
||
|
||
YARN-538. RM address DNS lookup can cause unnecessary slowness on every JHS
|
||
page load. (sandyr via tucu)
|
||
|
||
YARN-532. Change RMAdmin and Localization client protocol PB implementations
|
||
to implement closeable so that they can be stopped when needed via
|
||
RPC.stopProxy(). (Siddharth Seth via vinodkv)
|
||
|
||
YARN-99. Modify private distributed cache to localize files such that no
|
||
local directory hits unix file count limits and thus prevent job failures.
|
||
(Omkar Vinit Joshi via vinodkv)
|
||
|
||
YARN-112. Fixed a race condition during localization that fails containers.
|
||
(Omkar Vinit Joshi via vinodkv)
|
||
|
||
YARN-534. Change RM restart recovery to also account for AM max-attempts
|
||
configuration after the restart. (Jian He via vinodkv)
|
||
|
||
YARN-539. Addressed memory leak of LocalResource objects in NM when a resource
|
||
localization fails. (Omkar Vinit Joshi via vinodkv)
|
||
|
||
YARN-319. Submitting a job to a fair scheduler queue for which the user
|
||
does not have permission causes the client to wait forever.
|
||
(shenhong via tomwhite)
|
||
|
||
YARN-412. Fixed FifoScheduler to check hostname of a NodeManager rather
|
||
than its host:port during scheduling which caused incorrect locality for
|
||
containers. (Roger Hoover via acmurthy)
|
||
|
||
YARN-500. Fixed YARN webapps to not roll-over ports when explicitly asked
|
||
to use non-ephemeral ports. (Kenji Kikushima via vinodkv)
|
||
|
||
YARN-518. Fair Scheduler's document link could be added to the hadoop 2.x
|
||
main doc page. (sandyr via tucu)
|
||
|
||
YARN-476. ProcfsBasedProcessTree info message confuses users.
|
||
(sandyr via tucu)
|
||
|
||
YARN-585. Fix failure in TestFairScheduler#testNotAllowSubmitApplication
|
||
caused by YARN-514. (Zhijie Shen via vinodkv)
|
||
|
||
YARN-547. Fixed race conditions in public and private resource localization
|
||
which used to cause duplicate downloads. (Omkar Vinit Joshi via vinodkv)
|
||
|
||
YARN-594. Update test and add comments in YARN-534 (Jian He via bikas)
|
||
|
||
YARN-549. YarnClient.submitApplication should wait for application to be
|
||
accepted by the RM (Zhijie Shen via bikas)
|
||
|
||
YARN-605. Fix failing unit test in TestNMWebServices when versionInfo has
|
||
parentheses like when running on a git checkout. (Hitesh Shah via vinodkv)
|
||
|
||
YARN-289. Fair scheduler allows reservations that won't fit on node.
|
||
(Sandy Ryza via tomwhite)
|
||
|
||
YARN-576. Modified ResourceManager to reject NodeManagers that don't satisfy
|
||
minimum resource requirements. (Kenji Kikushima via vinodkv)
|
||
|
||
YARN-646. Fix two typos in Fair Scheduler user guide. (Dapeng Sun via atm)
|
||
|
||
YARN-507. Add interface visibility and stability annotations to FS
|
||
interfaces/classes. (kkambatl via tucu)
|
||
|
||
YARN-637. FS: maxAssign is not honored. (kkambatl via tucu)
|
||
|
||
YARN-655. Fair scheduler metrics should subtract allocated memory from
|
||
available memory. (sandyr via tucu)
|
||
|
||
YARN-628. Fix the way YarnRemoteException is being unrolled to extract out
|
||
the underlying exception. (Siddharth Seth via vinodkv)
|
||
|
||
YARN-695. Remove masterContainer and status unused fields from
|
||
ApplicationReportProto and fix bugs in ApplicationReportPBImpl. (Zhijie Shen
|
||
via vinodkv)
|
||
|
||
YARN-706. Fixed race conditions in TestFSDownload. (Zhijie Shen via vinodkv)
|
||
|
||
YARN-715. Fixed unit test failures - TestDistributedShell and
|
||
TestUnmanagedAMLauncher. (Vinod Kumar Vavilapalli via sseth)
|
||
|
||
YARN-578. Fixed NM to use SecureIOUtils for reading and aggregating logs.
|
||
(Omkar Vinit Joshi via vinodkv)
|
||
|
||
YARN-733. Fixed TestNMClient from failing occasionally. (Zhijie Shen via
|
||
vinodkv)
|
||
|
||
YARN-730. Fix NMClientAsync to remove completed containers. (Zhijie Shen
|
||
via acmurthy)
|
||
|
||
YARN-726. Fix queue & finish time fields in web-ui for ResourceManager.
|
||
(Mayank Bansal via acmurthy)
|
||
|
||
YARN-757. Changed TestRMRestart to use the default scheduler to avoid test
|
||
failures. (Bikas Saha via vinodkv)
|
||
|
||
YARN-742. Log aggregation causes a lot of redundant setPermission calls.
|
||
(jlowe via kihwal)
|
||
|
||
YARN-764. blank Used Resources on Capacity Scheduler page (Nemon Lou via
|
||
tgraves)
|
||
|
||
YARN-761. TestNMClientAsync fails sometimes (Zhijie Shen via bikas)
|
||
|
||
YARN-760. NodeManager throws AvroRuntimeException on failed start.
|
||
(Niranjan Singh via jlowe)
|
||
|
||
YARN-767. Initialize application metrics at RM bootup. (Jian He via
|
||
acmurthy)
|
||
|
||
YARN-700. TestInfoBlock fails on Windows because of line ending mismatch.
|
||
(Ivan Mitic via cnauroth)
|
||
|
||
YARN-117. Migrated rest of YARN to the new service model. (Steve Loughran via
|
||
vinodkv)
|
||
|
||
YARN-812. Set default logger for application summary logger to
|
||
hadoop.root.logger. (sseth via acmurthy)
|
||
|
||
YARN-848. Nodemanager does not register with RM using the fully qualified
|
||
hostname. (Hitesh Shah via sseth)
|
||
|
||
YARN-854. Fixing YARN bugs that are failing applications in secure
|
||
environment. (Omkar Vinit Joshi via vinodkv)
|
||
|
||
YARN-861. TestContainerManager is failing. (Vinod Kumar Vavilapalli via
|
||
hitesh)
|
||
|
||
YARN-874. Making common RPC not switch to simple when other
|
||
mechanisms are enabled and thus fix YARN/MR test failures after HADOOP-9421.
|
||
(Daryn Sharp and Vinod Kumar Vavilapalli via vinodkv)
|
||
|
||
YARN-845. RM crash with NPE on NODE_UPDATE (Mayank Bansal via bikas)
|
||
|
||
YARN-369. Handle (or throw a proper error when receiving) status updates
|
||
from application masters that have not registered (Mayank Bansal &
|
||
Abhishek Kapoor via bikas)
|
||
|
||
YARN-541. getAllocatedContainers() is not returning all the allocated
|
||
containers (bikas)
|
||
|
||
YARN-763. AMRMClientAsync should stop heartbeating after receiving
|
||
shutdown from RM (Xuan Gong via bikas)
|
||
|
||
YARN-654. AMRMClient: Perform sanity checks for parameters of public
|
||
methods (Xuan Gong via bikas)
|
||
|
||
YARN-919. Document setting default heap sizes in yarn-env.sh (Mayank
|
||
Bansal via hitesh)
|
||
|
||
YARN-795. Fair scheduler queue metrics should subtract allocated vCores from
|
||
available vCores. (ywskycn via tucu)
|
||
|
||
YARN-799. Fix CgroupsLCEResourcesHandler to use /tasks instead of
|
||
/cgroup.procs. (Chris Riccomini via acmurthy)
|
||
|
||
YARN-333. Schedulers cannot control the queue-name of an
|
||
application. (sandyr via tucu)
|
||
|
||
YARN-368. Fixed a typo in error message in Auxiliary services. (Albert Chu
|
||
via vinodkv)
|
||
|
||
YARN-295. Fixed a race condition in ResourceManager RMAppAttempt state
|
||
machine. (Mayank Bansal via vinodkv)
|
||
|
||
YARN-523. Modified a test-case to validate container diagnostics on
|
||
localization failures. (Jian He via vinodkv)
|
||
|
||
YARN-661. Fixed NM to cleanup users' local directories correctly when
|
||
starting up. (Omkar Vinit Joshi via vinodkv)
|
||
|
||
YARN-820. Fixed an invalid state transition in NodeManager caused by failing
|
||
resource localization. (Mayank Bansal via vinodkv)
|
||
|
||
YARN-62. Modified NodeManagers to prevent AMs from abusing container tokens for
|
||
repetitive container launches. (Omkar Vinit Joshi via vinodkv)
|
||
|
||
YARN-814. Improving diagnostics when containers fail during launch due to
|
||
various reasons like invalid env etc. (Jian He via vinodkv)
|
||
|
||
YARN-897. Ensure child queues are ordered correctly to account for
|
||
completed containers. (Djellel Eddine Difallah via acmurthy)
|
||
|
||
YARN-853. Fixed CapacityScheduler's maximum-am-resource-percent to properly
|
||
work beyond refreshing queues. (Devaraj K via vinodkv)
|
||
|
||
YARN-873. YARNClient.getApplicationReport(unknownAppId) returns a null
|
||
report (Xuan Gong via bikas)
|
||
|
||
YARN-875. Application can hang if AMRMClientAsync callback thread has
|
||
exception (Xuan Gong via bikas)
|
||
|
||
YARN-968. RM admin commands don't work. (vinodkv via kihwal)
|
||
|
||
YARN-688. Fixed NodeManager to properly cleanup containers when it is shut
|
||
down. (Jian He via vinodkv)
|
||
|
||
YARN-960. Fixed ResourceManager to propagate client-submitted credentials
|
||
irrespective of security. (Daryn Sharp via vinodkv)
|
||
|
||
YARN-937. Fix unmanaged AM in non-secure/secure setup post YARN-701. (tucu)
|
||
|
||
YARN-932. TestResourceLocalizationService.testLocalizationInit can fail on
|
||
JDK7. (Karthik Kambatla via Sandy Ryza)
|
||
|
||
YARN-961. Changed ContainerManager to enforce Token auth irrespective of
|
||
security. (Omkar Vinit Joshi via vinodkv)
|
||
|
||
YARN-945. Removed setting of AMRMToken's service from ResourceManager
|
||
and changed client libraries to do it all the time and correctly. (vinodkv)
|
||
|
||
YARN-656. In scheduler UI, including reserved memory in Memory Total can
|
||
make it exceed cluster capacity. (Sandy Ryza)
|
||
|
||
BREAKDOWN OF HADOOP-8562/YARN-191 SUBTASKS AND RELATED JIRAS
|
||
|
||
YARN-158. Yarn creating package-info.java must not depend on sh.
|
||
(Chris Nauroth via suresh)
|
||
|
||
YARN-176. Some YARN tests fail to find winutils. (Chris Nauroth via suresh)
|
||
|
||
YARN-207. YARN distribution build fails on Windows. (Chris Nauroth via
|
||
suresh)
|
||
|
||
YARN-199. Yarn cmd line scripts for windows. (Ivan Mitic via suresh)
|
||
|
||
YARN-213. YARN build script would be more readable using abspath.
(Chris Nauroth via suresh)

YARN-233. Added support for running containers in MS Windows to YARN. (Chris
Nauroth via acmurthy)

YARN-234. Added support for process tree and resource calculator in MS Windows
to YARN. (Chris Nauroth via acmurthy)

YARN-259. Fix LocalDirsHandlerService to use Path rather than URIs. (Xuan
Gong via acmurthy)

YARN-316. YARN container launch may exceed maximum Windows command line
length due to long classpath. (Chris Nauroth via suresh)

YARN-359. Fixing commands for container signalling in Windows. (Chris Nauroth
via vinodkv)

YARN-506. Move to common utils FileUtil#setReadable/Writable/Executable and
FileUtil#canRead/Write/Execute. (Ivan Mitic via suresh)

YARN-488. TestContainerManagerSecurity fails on Windows. (Chris Nauroth
via hitesh)

YARN-490. TestDistributedShell fails on Windows. (Chris Nauroth via hitesh)

YARN-491. TestContainerLogsPage fails on Windows. (Chris Nauroth via hitesh)

YARN-487. Modify path manipulation in LocalDirsHandlerService to let
TestDiskFailures pass on Windows. (Chris Nauroth via vinodkv)

YARN-593. container launch on Windows does not correctly populate
classpath with new process's environment variables and localized resources
(Chris Nauroth via bikas)

YARN-493. Fixed some shell related flaws in YARN on Windows. (Chris Nauroth
via vinodkv)

YARN-839. TestContainerLaunch.testContainerEnvVariables fails on Windows.
(Chuan Liu via cnauroth)

YARN-597. TestFSDownload fails on Windows due to dependencies on
tar/gzip/jar tools. (Ivan Mitic via acmurthy)

YARN-852. TestAggregatedLogFormat.testContainerLogsFileAccess fails on
Windows. (Chuan Liu via cnauroth)

YARN-894. NodeHealthScriptRunner timeout checking is inaccurate on Windows.
(Chuan Liu via cnauroth)

YARN-909. Disable TestLinuxContainerExecutorWithMocks on Windows. (Chuan Liu
via cnauroth)

YARN-1043. Push all metrics consistently. (Jian He via acmurthy)

YARN-1056. Remove dual use of string 'resourcemanager' in
yarn.resourcemanager.connect.{max.wait.secs|retry_interval.secs}
(Karthik Kambatla via acmurthy)

Release 2.0.6-alpha - 2013-08-22

INCOMPATIBLE CHANGES

NEW FEATURES

IMPROVEMENTS

OPTIMIZATIONS

BUG FIXES

YARN-854. Fixing YARN bugs that are failing applications in secure
environment. (Omkar Vinit Joshi and shv)

Release 2.0.5-alpha - 2013-06-06

INCOMPATIBLE CHANGES

NEW FEATURES

IMPROVEMENTS

OPTIMIZATIONS

BUG FIXES

Release 2.0.4-alpha - 2013-04-25

INCOMPATIBLE CHANGES

NEW FEATURES

IMPROVEMENTS

OPTIMIZATIONS

BUG FIXES

YARN-429. capacity-scheduler config missing from yarn-test artifact.
(sseth via hitesh)

YARN-470. Support a way to disable resource monitoring on the NodeManager.
(Siddharth Seth via hitesh)

Release 2.0.3-alpha - 2013-02-06

INCOMPATIBLE CHANGES

NEW FEATURES

YARN-145. Add a Web UI to the fair share scheduler. (Sandy Ryza via tomwhite)

YARN-3. Add support for CPU isolation/monitoring of containers.
(adferguson via tucu)

YARN-230. RM Restart phase 1 - includes support for saving/restarting all
applications on an RM bounce. (Bikas Saha via acmurthy)

YARN-103. Add a yarn AM-RM client module. (Bikas Saha via sseth)

YARN-286. Add a YARN ApplicationClassLoader. (tomwhite)

YARN-2. Enhanced CapacityScheduler to account for CPU along with memory for
multi-dimensional resource scheduling. (acmurthy)

YARN-328. Use token request messages defined in hadoop common. (suresh)

YARN-231. RM Restart - Add FS-based persistent store implementation for
RMStateStore (Bikas Saha via hitesh)

IMPROVEMENTS

YARN-223. Update process tree instead of getting new process trees.
(Radim Kolar via llu)

YARN-57. Allow process-tree based resource calculation et al. to be
pluggable to support it on multiple platforms. (Radim Kolar via acmurthy)

YARN-78. Changed UnManagedAM application to use YarnClient. (Bikas Saha via
vinodkv)

YARN-93. Fixed RM to propagate diagnostics from applications that have
finished but failed (Jason Lowe via vinodkv).

YARN-28. Fixed TestCompositeService to not depend on test-order and thus
made it pass on JDK7 (Thomas Graves via vinodkv).

YARN-82. Change the default local and log dirs to be based on
hadoop.tmp.dir and yarn.log.dir. (Hemanth Yamijala via sseth)

YARN-53. Added the missing getGroups API to ResourceManager. (Bo Wang via
vinodkv)

YARN-116. Add the ability to change the RM include/exclude file without
a restart. (xieguiming and Harsh J via sseth)

YARN-23. FairScheduler: FSQueueSchedulable#updateDemand() - potential
redundant aggregation. (kkambatl via tucu)

YARN-127. Move RMAdmin tool to its correct location - the client module.
(vinodkv)

YARN-40. Provided support for missing YARN commands (Devaraj K and Vinod
Kumar Vavilapalli via vinodkv)

YARN-33. Change LocalDirsHandlerService to validate the configured local and
log dirs. (Mayank Bansal via sseth)

YARN-94. Modify DistributedShell to point to main-class by default, clean up
the help message, and hard-code the AM class. (Hitesh Shah via vinodkv)

YARN-146. Add unit tests for computing fair share in the fair scheduler.
(Sandy Ryza via tomwhite)

HADOOP-8911. CRLF characters in source and text files.
(Raja Aluri via suresh)

YARN-136. Make ClientToAMTokenSecretManager part of RMContext (Vinod Kumar
Vavilapalli via sseth)

YARN-183. Clean up fair scheduler code. (Sandy Ryza via tomwhite)

YARN-129. Simplify classpath construction for mini YARN tests. (tomwhite)

YARN-254. Update fair scheduler web UI for hierarchical queues.
(sandyr via tucu)

YARN-315. Using the common security token protobuf definition from hadoop
common. (Suresh Srinivas via vinodkv)

YARN-170. Change NodeManager stop to be reentrant. (Sandy Ryza via vinodkv)

YARN-331. Fill in missing fair scheduler documentation. (sandyr via tucu)

YARN-277. Use AMRMClient in DistributedShell to exemplify the approach.
(Bikas Saha via hitesh)

YARN-360. Allow apps to concurrently register tokens for renewal.
(Daryn Sharp via sseth)

OPTIMIZATIONS

BUG FIXES

YARN-131. Fix incorrect ACL properties in capacity scheduler documentation.
(Ahmed Radwan via sseth)

YARN-102. Move the apache header to the top of the file in MemStore.java.
(Devaraj K via sseth)

YARN-134. ClientToAMSecretManager creates keys without checking for
validity of the appID. (Vinod Kumar Vavilapalli via sseth)

YARN-30. Fixed tests verifying web-services to work on JDK7. (Thomas Graves
via vinodkv)

YARN-150. Fixes AppRejectedTransition does not unregister a rejected
app-attempt from the ApplicationMasterService (Bikas Saha via sseth)

YARN-140. Add capacity-scheduler-default.xml to provide a default set of
configurations for the capacity scheduler. (ahmed via tucu)

YARN-179. Fix some unit test failures. (Vinod Kumar Vavilapalli via sseth)

YARN-181. Fixed eclipse settings broken by capacity-scheduler.xml move via
YARN-140. (Siddharth Seth via vinodkv)

YARN-169. Update log4j.appender.EventCounter to use
org.apache.hadoop.log.metrics.EventCounter (Anthony Rojas via tomwhite)

YARN-184. Remove unnecessary locking in fair scheduler, and address
findbugs excludes. (sandyr via tucu)

YARN-224. Fair scheduler logs too many nodeUpdate INFO messages.
(Sandy Ryza via tomwhite)

YARN-222. Fair scheduler should create queue for each user by default.
(Sandy Ryza via tomwhite)

MAPREDUCE-4778. Fair scheduler event log is only written if directory
exists on HDFS. (Sandy Ryza via tomwhite)

YARN-229. Remove old unused RM recovery code. (Bikas Saha via acmurthy)

YARN-187. Add hierarchical queues to the fair scheduler.
(Sandy Ryza via tomwhite)

YARN-72. NM should handle cleaning up containers when it shuts down.
(Sandy Ryza via tomwhite)

YARN-267. Fix fair scheduler web UI. (Sandy Ryza via tomwhite)

YARN-264. y.s.rm.DelegationTokenRenewer attempts to renew token even
after removing an app. (kkambatl via tucu)

YARN-271. Fair scheduler hits IllegalStateException trying to reserve
different apps on same node. (Sandy Ryza via tomwhite)

YARN-272. Fair scheduler log messages try to print objects without
overridden toString methods. (sandyr via tucu)

YARN-278. Fair scheduler maxRunningApps config causes no apps to make
progress. (sandyr via tucu)

YARN-282. Fair scheduler web UI double counts Apps Submitted.
(sandyr via tucu)

YARN-283. Fair scheduler fails to get queue info without root prefix.
(sandyr via tucu)

YARN-192. Node update causes NPE in the fair scheduler.
(Sandy Ryza via tomwhite)

YARN-288. Fair scheduler queue doesn't accept any jobs when ACLs are
configured. (Sandy Ryza via tomwhite)

YARN-300. After YARN-271, fair scheduler can infinite loop and not
schedule any application. (Sandy Ryza via tomwhite)

YARN-301. Fair scheduler throws ConcurrentModificationException when
iterating over app's priorities. (Sandy Ryza via tomwhite)

YARN-217. Fix RMAdmin protocol description to make it work in secure mode
also. (Devaraj K via vinodkv)

YARN-253. Fixed container-launch to not fail when there are no local
resources to localize. (Tom White via vinodkv)

YARN-330. Fix flakey test: TestNodeManagerShutdown#testKillContainersOnShutdown.
(Sandy Ryza via hitesh)

YARN-335. Fair scheduler doesn't check whether rack needs containers
before assigning to node. (Sandy Ryza via tomwhite)

YARN-336. Fair scheduler FIFO scheduling within a queue only allows 1
app at a time. (Sandy Ryza via tomwhite)

YARN-135. Client tokens should be per app-attempt, and should be
unregistered on App-finish. (vinodkv via sseth)

YARN-302. Fair scheduler assignmultiple should default to false. (sandyr via tucu)

YARN-372. Move InlineDispatcher from hadoop-yarn-server-resourcemanager to
hadoop-yarn-common (sseth via hitesh)

YARN-370. Fix SchedulerUtils to correctly round up the resource for
containers. (Zhijie Shen via acmurthy)

YARN-355. Fixes a bug where RM app submission could jam under load.
(Daryn Sharp via sseth)

Release 2.0.2-alpha - 2012-09-07

YARN-9. Rename YARN_HOME to HADOOP_YARN_HOME. (vinodkv via acmurthy)

NEW FEATURES

YARN-1. Promote YARN to be a sub-project of Apache Hadoop. (acmurthy)

IMPROVEMENTS

YARN-29. Add a yarn-client module. (Vinod Kumar Vavilapalli via sseth)

YARN-10. Fix DistributedShell module to not have a dependency on
hadoop-mapreduce-client-core. (Hitesh Shah via vinodkv)

YARN-80. Add support for delaying rack-local containers in
CapacityScheduler. (acmurthy)

YARN-137. Change the default YARN scheduler to be the CapacityScheduler.
(sseth via acmurthy)

OPTIMIZATIONS

BUG FIXES

YARN-13. Fix pom versions for YARN in branch-2 (todd)

MAPREDUCE-2374. "Text File Busy" errors launching MR tasks. (Andy Isaacson
via atm)

YARN-12. Fix findbugs warnings in FairScheduler. (Junping Du via acmurthy)

YARN-22. Fix ContainerLogs to work if the log-dir is specified as a URI.
(Mayank Bansal via sseth)

YARN-37. Change TestRMAppTransitions to use the DrainDispatcher.
(Mayank Bansal via sseth)

YARN-79. Implement close on all clients to YARN so that RPC clients don't
throw exceptions on shut-down. (Vinod Kumar Vavilapalli)

YARN-42. Modify NM's non-aggregating logs' handler to stop properly so that
NMs don't get NPEs on startup errors. (Devaraj K via vinodkv)

YARN-15. Updated default classpath for YARN applications to reflect split of
YARN into a sub-project. (Arun C Murthy via vinodkv)

YARN-75. Modified ResourceManager's RMContainer to handle a valid RELEASE
event at RUNNING state. (Siddharth Seth via vinodkv)

YARN-138. Ensure default values for minimum/maximum container sizes are
sane. (harsh & sseth via acmurthy)

Release 0.23.11 - UNRELEASED

INCOMPATIBLE CHANGES

NEW FEATURES

IMPROVEMENTS

OPTIMIZATIONS

BUG FIXES

YARN-1180. Update capacity scheduler docs to include types on the configs
(Chen He via jeagles)

Release 0.23.10 - 2013-12-09

INCOMPATIBLE CHANGES

NEW FEATURES

IMPROVEMENTS

YARN-985. Nodemanager should log where a resource was localized (Ravi
Prakash via jeagles)

YARN-1119. Add ClusterMetrics checks to the TestRMNodeTransitions tests
(Mit Desai via jeagles)

OPTIMIZATIONS

BUG FIXES

YARN-337. RM handles killed application tracking URL poorly (jlowe)

YARN-1101. Active nodes can be decremented below 0 (Robert Parker
via tgraves)

YARN-1176. RM web services ClusterMetricsInfo total nodes doesn't include
unhealthy nodes (Jonathan Eagles via tgraves)

YARN-1386. NodeManager mistakenly loses resources and relocalizes them
(Jason Lowe via jeagles)

Release 0.23.9 - 2013-07-08

INCOMPATIBLE CHANGES

NEW FEATURES

IMPROVEMENTS

YARN-427. Coverage fix for org.apache.hadoop.yarn.server.api.* (Aleksey
Gorshkov via jeagles)

YARN-478. fix coverage org.apache.hadoop.yarn.webapp.log (Aleksey Gorshkov
via jeagles)

OPTIMIZATIONS

BUG FIXES

Release 0.23.8 - 2013-06-05

INCOMPATIBLE CHANGES

NEW FEATURES

IMPROVEMENTS

OPTIMIZATIONS

YARN-548. Add tests for YarnUncaughtExceptionHandler (Vadim Bondarev via
jeagles)

BUG FIXES

YARN-363. Add webapps/proxy directory without which YARN proxy-server fails
when started in stand-alone mode. (Kenji Kikushima via vinodkv)

YARN-690. RM exits on token cancel/renew problems (daryn via bobby)

Release 0.23.7 - 2013-04-18

INCOMPATIBLE CHANGES

NEW FEATURES

IMPROVEMENTS

YARN-133. Update web services docs for RM clusterMetrics (Ravi Prakash via
kihwal)

YARN-249. Capacity Scheduler web page should show list of active users per
queue like it used to (in 1.x) (Ravi Prakash via tgraves)

YARN-236. RM should point tracking URL to RM web page when app fails to
start (Jason Lowe via jeagles)

YARN-269. Resource Manager not logging the health_check_script result when
taking it out (Jason Lowe via kihwal)

YARN-227. Application expiration difficult to debug for end-users
(Jason Lowe via jeagles)

YARN-443. allow OS scheduling priority of NM to be different than the
containers it launches (tgraves)

YARN-468. coverage fix for org.apache.hadoop.yarn.server.webproxy.amfilter
(Aleksey Gorshkov via bobby)

YARN-200. yarn log does not output all needed information, and is in a
binary format (Ravi Prakash via jlowe)

YARN-525. make CS node-locality-delay refreshable (Thomas Graves via jlowe)

OPTIMIZATIONS

YARN-357. App submission should not be synchronized (daryn)

BUG FIXES

YARN-343. Capacity Scheduler maximum-capacity value -1 is invalid (Xuan
Gong via tgraves)

YARN-364. AggregatedLogDeletionService can take too long to delete logs
(jlowe)

YARN-362. Unexpected extra results when using webUI table search (Ravi
Prakash via jlowe)

YARN-400. RM can return null application resource usage report leading to
NPE in client (Jason Lowe via tgraves)

YARN-426. Failure to download a public resource prevents further downloads
(Jason Lowe via bobby)

YARN-448. Remove unnecessary hflush from log aggregation (Kihwal Lee via
bobby)

YARN-345. Many InvalidStateTransitonException errors for ApplicationImpl
in Node Manager (Robert Parker via jlowe)

YARN-109. .tmp file is not deleted for localized archives (Mayank Bansal
via bobby)

YARN-460. CS user left in list of active users for the queue even when
application finished (tgraves)

Release 0.23.6 - 2013-02-06

INCOMPATIBLE CHANGES

NEW FEATURES

IMPROVEMENTS

YARN-285. Added a temporary plugin interface for RM to be able to redirect
to JobHistory server for apps that it no longer tracks. (Derek Dagit via
vinodkv)

OPTIMIZATIONS

BUG FIXES

YARN-188. Coverage fixing for CapacityScheduler (Aleksey Gorshkov via
bobby)

YARN-214. RMContainerImpl does not handle event EXPIRE at state RUNNING
(jeagles via bobby)

YARN-151. Browser thinks RM main page JS is taking too long
(Ravi Prakash via bobby)

YARN-204. test coverage for org.apache.hadoop.tools (Aleksey Gorshkov via
bobby)

YARN-251. Proxy URI generation fails for blank tracking URIs (Tom White
via jlowe)

YARN-258. RM web page UI shows Invalid Date for start and finish times
(Ravi Prakash via jlowe)

YARN-266. RM and JHS Web UIs are blank because AppsBlock is not escaping
string properly (Ravi Prakash via jlowe)

YARN-280. RM does not reject app submission with invalid tokens
(Daryn Sharp via tgraves)

YARN-225. Proxy Link in RM UI throws NPE in Secure mode
(Devaraj K via bobby)

YARN-293. Node Manager leaks LocalizerRunner object for every Container
(Robert Joseph Evans via jlowe)

YARN-50. Implement renewal / cancellation of Delegation Tokens
(Siddharth Seth via tgraves)

YARN-320. RM should always be able to renew its own tokens.
(Daryn Sharp via sseth)

YARN-325. RM CapacityScheduler can deadlock when getQueueInfo() is
called and a container is completing (Arun C Murthy via tgraves)

YARN-334. Maven RAT plugin is not checking all source files (tgraves)

YARN-354. WebAppProxyServer exits immediately after startup (Liang Xie via
jlowe)

Release 0.23.5 - 2012-11-28

INCOMPATIBLE CHANGES

NEW FEATURES

IMPROVEMENTS

YARN-161. Fix multiple compiler warnings for unchecked operations in YARN
common. (Chris Nauroth via vinodkv)

YARN-43. Fix TestResourceTrackerService to not depend on test order and thus
pass on JDK7. (Thomas Graves via vinodkv)

YARN-32. Fix TestApplicationTokens to not depend on test order and thus pass
on JDK7. (vinodkv)

YARN-186. Coverage fixing LinuxContainerExecutor (Aleksey Gorshkov via
bobby)

YARN-216. Remove jquery theming support. (Robert Joseph Evans via jlowe)

OPTIMIZATIONS

BUG FIXES

YARN-163. Retrieving container log via NM webapp can hang with multibyte
characters in log (jlowe via bobby)

YARN-174. Modify NodeManager to pass the user's configuration even when
rebooting. (vinodkv)

YARN-177. CapacityScheduler - adding a queue while the RM is running has
wacky results (acmurthy via tgraves)

YARN-178. Fix custom ProcessTree instance creation (Radim Kolar via bobby)

YARN-180. Capacity scheduler - containers that get reserved create
container token too early (acmurthy and bobby)

YARN-139. Interrupted Exception within AsyncDispatcher leads to user
confusion. (Vinod Kumar Vavilapalli via jlowe)

YARN-165. RM should point tracking URL to RM web page for app when AM fails
(jlowe via bobby)

YARN-159. RM web ui applications page should be sorted to display last app
first (tgraves via bobby)

YARN-166. capacity scheduler doesn't allow capacity < 1.0 (tgraves via
bobby)

YARN-189. Fixed a deadlock between RM's ApplicationMasterService and the
dispatcher. (Thomas Graves via vinodkv)

YARN-202. Log Aggregation generates a storm of fsync() for namenode
(Kihwal Lee via bobby)

YARN-201. Fix CapacityScheduler to be less conservative for starved
off-switch requests. (jlowe via acmurthy)

YARN-206. TestApplicationCleanup.testContainerCleanup occasionally fails.
(jlowe via jeagles)

YARN-212. NM state machine ignores an APPLICATION_CONTAINER_FINISHED event
when it shouldn't (Nathan Roberts via jlowe)

YARN-219. NM should aggregate logs when application finishes. (bobby)

Release 0.23.4

INCOMPATIBLE CHANGES

NEW FEATURES

IMPROVEMENTS

Change package of YarnClient to org.apache.hadoop. (Bikas Saha via vinodkv)

YARN-108. FSDownload can create cache directories with the wrong
permissions (Jason Lowe via bobby)

YARN-57. Allow process-tree based resource calculation et al. to be
pluggable to support it on multiple platforms. (Radim Kolar via acmurthy)

OPTIMIZATIONS

BUG FIXES

YARN-88. DefaultContainerExecutor can fail to set proper permissions.
(Jason Lowe via sseth)

YARN-106. Nodemanager needs to set permissions of local directories (jlowe
via bobby)

Release 0.23.3

INCOMPATIBLE CHANGES

NEW FEATURES

IMPROVEMENTS

OPTIMIZATIONS

BUG FIXES

YARN-14. Symlinks to peer distributed cache files no longer work
(Jason Lowe via bobby)

YARN-25. remove old aggregated logs (Robert Evans via tgraves)

YARN-27. Failed refreshQueues due to misconfiguration prevents further
refreshing of queues (Arun Murthy via tgraves)

MAPREDUCE-4323. NM leaks filesystems (Jason Lowe via jeagles)

YARN-39. RM-NM secret-keys should be randomly generated and rolled every
so often. (vinodkv and sseth via sseth)

YARN-31. Fix TestDelegationTokenRenewer to not depend on test order so as to
pass tests on jdk7. (Thomas Graves via vinodkv)

YARN-63. RMNodeImpl is missing valid transitions from the UNHEALTHY state
(Jason Lowe via bobby)

YARN-60. Fixed a bug in ResourceManager which causes all NMs to get NPEs and
thus causes all containers to be rejected. (vinodkv)

YARN-66. aggregated logs permissions not set properly (tgraves via bobby)

YARN-68. NodeManager will refuse to shutdown indefinitely due to container
log aggregation (daryn via bobby)

YARN-87. NM ResourceLocalizationService does not set permissions of local
cache directories (Jason Lowe via tgraves)
|