Hadoop Change Log

Release 2.5.0 - UNRELEASED

INCOMPATIBLE CHANGES

NEW FEATURES

IMPROVEMENTS

HADOOP-10451. Remove unused field and imports from SaslRpcServer.
(Benoy Antony via jing9)

HADOOP-10345. Sanitize the inputs (groups and hosts) for the proxyuser
configuration. (Benoy Antony via jing9)

HADOOP-10454. Provide FileContext version of har file system. (Kihwal Lee
via jeagles)

OPTIMIZATIONS

BUG FIXES

HADOOP-10378. Typo in help printed by hdfs dfs -help.
(Mit Desai via suresh)

HADOOP-10418. SaslRpcClient should not assume that remote principals are in
the default_realm. (atm)

HADOOP-10426. Declare CreateOpts.getOpt(..) with generic type argument,
remove unused FileContext.getFileStatus(..) and fix various javac
warnings. (szetszwo)

HADOOP-10439. Fix compilation error in branch-2 after HADOOP-10426. (wheat9)

HADOOP-10414. Incorrect property name for RefreshUserMappingProtocol in
hadoop-policy.xml. (Joey Echeverria via atm)

HADOOP-10459. distcp V2 doesn't preserve root dir's attributes when -p is
specified. (Yongjun Zhang via atm)

HADOOP-10462. DF#getFilesystem is not parsing the command output.
(Akira AJISAKA via umamahesh)

Release 2.4.1 - UNRELEASED

INCOMPATIBLE CHANGES

NEW FEATURES

IMPROVEMENTS

OPTIMIZATIONS

BUG FIXES

HADOOP-10455. When there is an exception, ipc.Server should first check
whether it is a terse exception. (szetszwo)

HADOOP-10456. Bug in Configuration.java exposed by Spark
(ConcurrentModificationException). (Nishkam Ravi via cnauroth)

Release 2.4.0 - 2014-04-07

INCOMPATIBLE CHANGES

HADOOP-8691. FsShell can print "Found xxx items" unnecessarily often.
(Daryn Sharp via wheat9)

NEW FEATURES

HADOOP-10184. Hadoop Common changes required to support HDFS ACLs. (See
breakdown of tasks below for features and contributors)

IMPROVEMENTS

HADOOP-10139. Update and improve the Single Cluster Setup document.
(Akira Ajisaka via Arpit Agarwal)

HADOOP-10295. Allow distcp to automatically identify the checksum type of
source files and use it for the target. (jing9 and Laurent Goujon)

HADOOP-10333. Fix grammatical error in overview.html document.
(René Nyffenegger via suresh)

HADOOP-10343. Change info to debug log in LossyRetryInvocationHandler. (arpit)

HADOOP-10348. Deprecate hadoop.ssl.configuration in branch-2, and remove
it in trunk. (Haohui Mai via jing9)

HADOOP-9454. Support multipart uploads for s3native. (Jordan Mendelson and
Akira AJISAKA via atm)

HADOOP-10374. InterfaceAudience annotations should have
RetentionPolicy.RUNTIME (Enis Soztutar via Arpit Agarwal)

HADOOP-10278. Refactor to make CallQueue pluggable. (Chris Li via
Arpit Agarwal)

HADOOP-10285. Admin interface to swap callqueue at runtime. (Chris Li via
Arpit Agarwal)

HADOOP-10314. The ls command help still shows outdated 0.16 format.
(Rushabh S Shah via kihwal)

HADOOP-10379. Protect authentication cookies with the HttpOnly and Secure
flags. (wheat9)

HADOOP-10211. Enable RPC protocol to negotiate SASL-QOP values between
clients and servers. (Benoy Antony via Arpit Agarwal)

HADOOP-10386. Log proxy hostname in various exceptions being thrown in a HA
setup. (wheat9)

HADOOP-3679. Fixup assert ordering in unit tests to yield meaningful error
messages. (Jay Vyas via cdouglas)

HADOOP-10221. Add a plugin to specify SaslProperties for RPC protocol
based on connection properties. (Benoy Antony and Daryn Sharp via
Arpit Agarwal)

HADOOP-10015. UserGroupInformation prints out excessive warnings.
(Nicolas Liochon via szetszwo)

HADOOP-10423. Clarify compatibility policy document for combination of
new client and old server. (Chris Nauroth via kasha)

HADOOP-10280. Make Schedulables return a configurable identity of user
or group. (Chris Li via Arpit Agarwal)

OPTIMIZATIONS

BUG FIXES

HADOOP-10320. Javadoc in InterfaceStability.java lacks final </ul>.
(René Nyffenegger via cnauroth)

HADOOP-10085. CompositeService should allow adding services while being
inited. (Steve Loughran via kasha)

HADOOP-10327. Trunk windows build broken after HDFS-5746.
(Vinay via cnauroth)

HADOOP-10330. TestFrameDecoder fails if it cannot bind port 12345.
(Arpit Agarwal)

HADOOP-10326. M/R jobs can not access S3 if Kerberos is enabled. (bc Wong
via atm)

HADOOP-10338. Cannot get the FileStatus of the root inode from the new
Globber (cmccabe)

HADOOP-10249. LdapGroupsMapping should trim ldap password read from file.
(Dilli Armugam via suresh)

HADOOP-10346. Deadlock while logging tokens (jlowe)

HADOOP-10328. loadGenerator exit code is not reliable.
(Haohui Mai via cnauroth)

HADOOP-10355. Fix TestLoadGenerator#testLoadGenerator. (Haohui Mai via jing9)

HADOOP-10070. RPC client doesn't use per-connection conf to determine
server's expected Kerberos principal name. (atm)

HADOOP-10368. InputStream is not closed in VersionInfo ctor.
(Tsuyoshi OZAWA via szetszwo)

HADOOP-10353. FsUrlStreamHandlerFactory is not thread safe.
(Tudor Scurtu via cnauroth)

HADOOP-10393. Fix the javac warnings in hadoop-auth. (szetszwo)

HADOOP-10395. TestCallQueueManager is flaky. (Arpit Agarwal)

HADOOP-10394. TestAuthenticationFilter is flaky. (Arpit Agarwal)

HADOOP-10337. ConcurrentModificationException from
MetricsDynamicMBeanBase.createMBeanInfo() (Liang Xie via stack)

HADOOP-10407. Fix the javac warnings in org.apache.hadoop.ipc package.
(szetszwo)

HADOOP-10191. Missing executable permission on viewfs internal dirs.
(Gera Shegalov via cnauroth)

HADOOP-10422. Remove redundant logging of RPC retry attempts. (cnauroth)

HADOOP-10425. LocalFileSystem.getContentSummary should not count crc files.
(szetszwo)

HADOOP-10440. HarFsInputStream.read(byte[]) updates position incorrectly.
(guodongdong via szetszwo)

HADOOP-10437. Fix the javac warnings in the conf and the util package.
(szetszwo)

HADOOP-10441. Namenode metric "rpc.RetryCache/NameNodeRetryCache.CacheHit"
can't be correctly processed by Ganglia. (jing9)

HADOOP-10449. Fix the javac warnings in the security package. (szetszwo)

HADOOP-10450. Build zlib native code bindings in hadoop.dll for Windows.
(cnauroth)

HADOOP-10301. AuthenticationFilter should return Forbidden for failed
authentication. (Daryn Sharp via jing9)

HADOOP-9525. Add tests that validate winutils chmod behavior on folders
(ivanmi)

BREAKDOWN OF HADOOP-10184 SUBTASKS AND RELATED JIRAS

HADOOP-10185. FileSystem API for ACLs. (cnauroth)

HADOOP-10186. Remove AclReadFlag and AclWriteFlag in FileSystem API.
(Haohui Mai via cnauroth)

HADOOP-10187. FsShell CLI: add getfacl and setfacl with minimal support for
getting and setting ACLs. (Vinay via cnauroth)

HADOOP-10192. FileSystem#getAclStatus has incorrect JavaDocs. (cnauroth)

HADOOP-10220. Add ACL indicator bit to FsPermission. (cnauroth)

HADOOP-10241. Clean up output of FsShell getfacl. (Chris Nauroth via wheat9)

HADOOP-10213. Fix bugs parsing ACL spec in FsShell setfacl.
(Vinay via cnauroth)

HADOOP-10277. setfacl -x fails to parse ACL spec if trying to remove the
mask entry. (Vinay via cnauroth)

HADOOP-10270. getfacl does not display effective permissions of masked
entries. (cnauroth)

HADOOP-10344. Fix TestAclCommands after merging HADOOP-10338 patch.
(cnauroth)

HADOOP-10352. Recursive setfacl erroneously attempts to apply default ACL to
files. (cnauroth)

HADOOP-10354. TestWebHDFS fails after merge of HDFS-4685 to trunk. (cnauroth)

HADOOP-10361. Correct alignment in CLI output for ACLs. (cnauroth)

HADOOP-10399. FileContext API for ACLs. (Vinayakumar B via cnauroth)

HADOOP-10442. Group look-up can cause segmentation fault when certain
JNI-based mapping module is used. (Kihwal Lee via jeagles)

Release 2.3.1 - UNRELEASED

INCOMPATIBLE CHANGES

NEW FEATURES

IMPROVEMENTS

OPTIMIZATIONS

BUG FIXES

Release 2.3.0 - 2014-02-18

INCOMPATIBLE CHANGES

NEW FEATURES

HADOOP-8545. Filesystem Implementation for OpenStack Swift
(Dmitry Mezhensky, David Dobbins, Stevel via stevel)

IMPROVEMENTS

HADOOP-10046. Print a log message when SSL is enabled.
(David S. Wang via wang)

HADOOP-10079. log a warning message if group resolution takes too long.
(cmccabe)

HADOOP-9623. Update jets3t dependency to 0.9.0 (Amandeep Khurana via Colin
Patrick McCabe)

HADOOP-10132. RPC#stopProxy() should log the class of proxy when IllegalArgumentException
is encountered (Ted Yu via umamahesh)

HADOOP-10248. Property name should be included in the exception where property value
is null (Akira AJISAKA via umamahesh)

HADOOP-10086. User document for authentication in secure cluster.
(Masatake Iwasaki via Arpit Agarwal)

HADOOP-10274. Lower the logging level from ERROR to WARN for UGI.doAs method
(Takeshi Miao via stack)

HADOOP-9784. Add a builder for HttpServer. (Junping Du via llu)

HADOOP-9871. Fix intermittent findbugs warnings in DefaultMetricsSystem.
(Junping Du via llu)

HADOOP-9319. Update bundled LZ4 source to r99. (Binglin Chang via llu)

HADOOP-9432. Add support for markdown .md files in site documentation (stevel)

HADOOP-9241. DU refresh interval is not configurable (harsh)

HADOOP-9417. Support for symlink resolution in LocalFileSystem /
RawLocalFileSystem. (Andrew Wang via Colin Patrick McCabe)

HADOOP-9703. org.apache.hadoop.ipc.Client leaks threads on stop.
(Tsuyoshi OZAWA via Colin Patrick McCabe)

HADOOP-9618. Add thread which detects GC pauses.
(Todd Lipcon via Colin Patrick McCabe)

HADOOP-9848. Create a MiniKDC for use with security testing.
(ywskycn via tucu)

HADOOP-9860. Remove class HackedKeytab and HackedKeytabEncoder from
hadoop-minikdc once jira DIRSERVER-1882 solved. (ywskycn via tucu)

HADOOP-9866. convert hadoop-auth testcases requiring kerberos to
use minikdc. (ywskycn via tucu)

HADOOP-9487. Deprecation warnings in Configuration should go to their
own log or otherwise be suppressible (Chu Tong via stevel)

HADOOP-9889. Refresh the Krb5 configuration when creating a new kdc in
Hadoop-MiniKDC (Wei Yan via Sandy Ryza)

HADOOP-9915. o.a.h.fs.Stat support on Mac OS X (Binglin Chang via Colin
Patrick McCabe)

HADOOP-9998. Provide methods to clear only part of the DNSToSwitchMapping.
(Junping Du via Colin Patrick McCabe)

HADOOP-9063. enhance unit-test coverage of class
org.apache.hadoop.fs.FileUtil (Ivan A. Veselovsky via jlowe)

HADOOP-9254. Cover packages org.apache.hadoop.util.bloom,
org.apache.hadoop.util.hash (Vadim Bondarev via jlowe)

HADOOP-9225. Cover package org.apache.hadoop.compress.Snappy (Vadim
Bondarev, Andrey Klochkov and Nathan Roberts via jlowe)

HADOOP-9199. Cover package org.apache.hadoop.io with unit tests (Andrey
Klochkov via jeagles)

HADOOP-9470. eliminate duplicate FQN tests in different Hadoop modules
(Ivan A. Veselovsky via daryn)

HADOOP-9494. Excluded auto-generated and examples code from clover reports
(Andrey Klochkov via jeagles)

HADOOP-9897. Add method to get path start position without drive specifier in
o.a.h.fs.Path. (Binglin Chang via cnauroth)

HADOOP-9078. enhance unit-test coverage of class
org.apache.hadoop.fs.FileContext (Ivan A. Veselovsky via jeagles)

HDFS-5276. FileSystem.Statistics should use thread-local counters to avoid
multi-threaded performance issues on read/write. (Colin Patrick McCabe)

HADOOP-9291. enhance unit-test coverage of package o.a.h.metrics2 (Ivan A.
Veselovsky via jeagles)

HADOOP-10064. Upgrade to maven antrun plugin version 1.7 (Arpit Agarwal via
jeagles)

HADOOP-9594. Update apache commons math dependency (Timothy St. Clair via
stevel)

HADOOP-10095. In CodecPool, synchronize pool and codecList separately in
order to reduce lock contention. (Nicolas Liochon via szetszwo)

HADOOP-10067. Missing POM dependency on jsr305 (Robert Rati via stevel)

HADOOP-10103. update commons-lang to 2.6 (Akira AJISAKA via stevel)

HADOOP-10111. Allow DU to be initialized with an initial value (Kihwal Lee
via jeagles)

HADOOP-10126. LightWeightGSet log message is confusing. (Vinay via suresh)

HADOOP-10127. Add ipc.client.connect.retry.interval to control the frequency
of connection retries (Karthik Kambatla via Sandy Ryza)

HADOOP-10102. Update commons IO from 2.1 to 2.4 (Akira Ajisaka via stevel)

HADOOP-10168. fix javadoc of ReflectionUtils#copy. (Thejas Nair via suresh)

HADOOP-10164. Allow UGI to login with a known Subject (bobby)

HADOOP-10169. Remove the unnecessary synchronized in JvmMetrics class.
(Liang Xie via jing9)

HADOOP-10198. DomainSocket: add support for socketpair.
(Colin Patrick McCabe via wang)

HADOOP-10208. Remove duplicate initialization in StringUtils.getStringCollection.
(Benoy Antony via jing9)

HADOOP-9420. Add percentile or max metric for rpcQueueTime, processing time.
(Liang Xie via wang)

HADOOP-9652. Allow RawLocalFs#getFileLinkStatus to fill in the link owner
and mode if requested. (Andrew Wang via Colin Patrick McCabe)

HADOOP-10305. Add "rpc.metrics.quantile.enable" and
"rpc.metrics.percentiles.intervals" to core-default.xml.
(Akira Ajisaka via wang)

HADOOP-10317. Rename branch-2.3 release version from 2.4.0-SNAPSHOT
to 2.3.0-SNAPSHOT. (wang)

HADOOP-10313. Script and jenkins job to produce Hadoop release artifacts.
(tucu)

OPTIMIZATIONS

HADOOP-10142. Avoid groups lookup for unprivileged users such as "dr.who"
(vinay via cmccabe)

HADOOP-9748. Reduce blocking on UGI.ensureInitialized (daryn)

HADOOP-10047. Add direct-buffer based APIs for compression. (Gopal V
via acmurthy)

HADOOP-10172. Cache SASL server factories (daryn)

HADOOP-10173. Remove UGI from DIGEST-MD5 SASL server creation (daryn via
kihwal)

HADOOP-10228. FsPermission#fromShort() should cache FsAction.values().
(Haohui Mai via cnauroth)

HADOOP-10143. replace WritableFactories's hashmap with ConcurrentHashMap
(Liang Xie via stack)

BUG FIXES

HADOOP-10028. Malformed ssl-server.xml.example. (Haohui Mai via jing9)

HADOOP-10030. FsShell -put/copyFromLocal should support Windows local path.
(Chuan Liu via cnauroth)

HADOOP-10031. FsShell -get/copyToLocal/moveFromLocal should support Windows
local path. (Chuan Liu via cnauroth)

HADOOP-10039. Add Hive to the list of projects using
AbstractDelegationTokenSecretManager. (Haohui Mai via jing9)

HADOOP-10040. hadoop.cmd is in UNIX format and would not run by default on
Windows. (cnauroth)

HADOOP-10055. FileSystemShell.apt.vm doc has typo "numRepicas".
(Akira Ajisaka via cnauroth)

HADOOP-10072. TestNfsExports#testMultiMatchers fails due to non-deterministic
timing around cache expiry check. (cnauroth)

HADOOP-9898. Set SO_KEEPALIVE on all our sockets. (todd via wang)

HADOOP-9478. Fix race conditions during the initialization of Configuration
related to deprecatedKeyMap (cmccabe)

HADOOP-9660. [WINDOWS] Powershell / cmd parses -Dkey=value from command line
as [-Dkey, value] which breaks GenericOptionsParser.
(Enis Soztutar via cnauroth)

HADOOP-10078. KerberosAuthenticator always does SPNEGO. (rkanter via tucu)

HADOOP-10110. hadoop-auth has a build break due to missing dependency.
(Chuan Liu via arp)

HADOOP-9114. After defining dfs.checksum.type as NULL, write file and hflush will
throw java.lang.ArrayIndexOutOfBoundsException (Sathish via umamahesh)

HADOOP-10130. RawLocalFS::LocalFSFileInputStream.pread does not track
FS::Statistics (Binglin Chang via Colin Patrick McCabe)

HDFS-5560. Trash configuration log statements print incorrect units.
(Josh Elser via Andrew Wang)

HADOOP-10081. Client.setupIOStreams can leak socket resources on exception
or error (Tsuyoshi OZAWA via jlowe)

HADOOP-10087. UserGroupInformation.getGroupNames() fails to return primary
group first when JniBasedUnixGroupsMappingWithFallback is used (cmccabe)

HADOOP-10175. Har file system authority should preserve userinfo.
(Chuan Liu via cnauroth)

HADOOP-10090. Jobtracker metrics not updated properly after execution
of a mapreduce job. (ivanmi)

HADOOP-10193. hadoop-auth's PseudoAuthenticationHandler can consume getInputStream.
(gchanan via tucu)

HADOOP-10178. Configuration deprecation always emits "deprecated" warnings
when a new key is used. (Shanyu Zhao via cnauroth)

HADOOP-10234. "hadoop.cmd jar" does not propagate exit code. (cnauroth)

HADOOP-10240. Windows build instructions incorrectly state requirement of
protoc 2.4.1 instead of 2.5.0. (cnauroth)

HADOOP-10110. hadoop-auth has a build break due to missing dependency
(Chuan Liu via cos)

HADOOP-10167. Mark hadoop-common source as UTF-8 in Maven pom files / refactoring
(Mikhail Antonov via cos)

HADOOP-9982. Fix dead links in hadoop site docs. (Akira Ajisaka via Arpit
Agarwal)

HADOOP-10212. Incorrect compile command in Native Library document.
(Akira Ajisaka via Arpit Agarwal)

HADOOP-9830. Fix typo at http://hadoop.apache.org/docs/current/
(Kousuke Saruta via Arpit Agarwal)

HADOOP-10255. Rename HttpServer to HttpServer2 to retain older
HttpServer in branch-2 for compatibility. (Haohui Mai via suresh)

HADOOP-10292. Restore HttpServer from branch-2.2 in branch-2.
(Haohui Mai via suresh)

HADOOP-10291. TestSecurityUtil#testSocketAddrWithIP fails due to test
order dependency. (Mit Desai via Arpit Agarwal)

HADOOP-9964. Fix deadlocks in TestHttpServer by synchronizing
ReflectionUtils.printThreadInfo. (Junping Du via llu)

HADOOP-9582. Non-existent file to "hadoop fs -conf" doesn't throw error
(Ashwin Shankar via jlowe)

HADOOP-9817. FileSystem#globStatus and FileContext#globStatus need to work
with symlinks. (Colin Patrick McCabe via Andrew Wang)

HADOOP-9875. TestDoAsEffectiveUser can fail on JDK 7. (Aaron T. Myers via
Colin Patrick McCabe)

HADOOP-9865. FileContext#globStatus has a regression with respect to
relative path. (Chuan Lin via Colin Patrick McCabe)

HADOOP-9909. org.apache.hadoop.fs.Stat should permit other LANG.
(Shinichi Yamashita via Andrew Wang)

HADOOP-9908. Fix NPE when versioninfo properties file is missing (todd)

HADOOP-9929. Insufficient permissions for a path reported as File Not
Found. (Contributed by Colin Patrick McCabe)

HADOOP-9791. Add a test case covering long paths for new FileUtil access
check methods (ivanmi)

HADOOP-9981. globStatus should minimize its listStatus and getFileStatus
calls. (Contributed by Colin Patrick McCabe)

HADOOP-10006. Compilation failure in trunk for
o.a.h.fs.swift.util.JSONUtil (Junping Du via stevel)

HADOOP-9016. HarFsInputStream.skip(long) must never return negative value.
(Ivan A. Veselovsky via jeagles)

HADOOP-10088. copy-nativedistlibs.sh needs to quote snappy lib dir.
(Raja Aluri via cnauroth)

HADOOP-10093. hadoop-env.cmd sets HADOOP_CLIENT_OPTS with a max heap size
that is too small. (Shanyu Zhao via cnauroth)

HADOOP-10094. NPE in GenericOptionsParser#preProcessForWindows().
(Enis Soztutar via cnauroth)

HADOOP-10100. MiniKDC shouldn't use apacheds-all artifact. (rkanter via tucu)

HADOOP-10107. Server.getNumOpenConnections may throw NPE. (Kihwal Lee via
jing9)

HADOOP-10135. writes to swift fs over partition size leave temp files and
empty output file (David Dobbins via stevel)

HADOOP-10129. Distcp may succeed when it fails (daryn)

HADOOP-10058. TestMetricsSystemImpl#testInitFirstVerifyStopInvokedImmediately
fails on trunk (Chen He via jeagles)

HADOOP-10162. Fix symlink-related test failures in
TestFileContextResolveAfs and TestStat in branch-2 (Mit Desai via Colin
Patrick McCabe)

HADOOP-8753. LocalDirAllocator throws "ArithmeticException: / by zero" when
there is no available space on configured local dir. (Benoy Antony via hitesh)

HADOOP-10106. Incorrect thread name in RPC log messages. (Ming Ma via jing9)

HADOOP-9611. mvn-rpmbuild against google-guice > 3.0 yields missing cglib
dependency (Timothy St. Clair via stevel)

HADOOP-10171. TestRPC fails intermittently on jdk7 (Mit Desai via jeagles)

HADOOP-10147 HDFS-5678 Upgrade to commons-logging 1.1.3 to avoid potential
deadlock in MiniDFSCluster (stevel)

HADOOP-10207. TestUserGroupInformation#testLogin is flaky (jxiang via
cmccabe)

HADOOP-10214. Fix multithreaded correctness warnings in ActiveStandbyElector
(Liang Xie via kasha)

HADOOP-10223. MiniKdc#main() should close the FileReader it creates.
(Ted Yu via tucu)

HADOOP-10236. Fix typo in o.a.h.ipc.Client#checkResponse. (Akira Ajisaka
via suresh)

HADOOP-10146. Workaround JDK7 Process fd close bug (daryn)

HADOOP-10125. no need to process RPC request if the client connection
has been dropped (Ming Ma via brandonli)

HADOOP-10235. Hadoop tarball has 2 versions of stax-api JARs. (tucu)

HADOOP-10252. HttpServer can't start if hostname is not specified. (Jimmy
Xiang via atm)

HADOOP-10203. Connection leak in
Jets3tNativeFileSystemStore#retrieveMetadata. (Andrei Savu via atm)

HADOOP-10250. VersionUtil returns wrong value when comparing two versions.
(Yongjun Zhang via atm)

HADOOP-10288. Explicit reference to Log4JLogger breaks non-log4j users
(todd)

HADOOP-10310. SaslRpcServer should be initialized even when no secret
manager present. (atm)

HADOOP-10311. Cleanup vendor names from the code base. (tucu)

HADOOP-10273. Fix 'mvn site'. (Arpit Agarwal)

Release 2.2.0 - 2013-10-13

INCOMPATIBLE CHANGES

HADOOP-10020. Disable symlinks temporarily (cmccabe)

NEW FEATURES

IMPROVEMENTS

HADOOP-9948. Add a config value to CLITestHelper to skip tests on Windows.
(Chuan Liu via cnauroth)

HADOOP-9976. Different versions of avro and avro-maven-plugin (Karthik
Kambatla via Sandy Ryza)

HADOOP-9758. Provide configuration option for FileSystem/FileContext
symlink resolution. (Andrew Wang via Colin Patrick McCabe)

HADOOP-8315. Support SASL-authenticated ZooKeeper in ActiveStandbyElector
(todd)

OPTIMIZATIONS

BUG FIXES

HADOOP-9776. HarFileSystem.listStatus() returns invalid authority if port
number is empty. (Shanyu Zhao via ivanmi)

HADOOP-9761. ViewFileSystem#rename fails when using DistributedFileSystem.
(Andrew Wang via Colin Patrick McCabe)

HADOOP-10003. HarFileSystem.listLocatedStatus() fails.
(Jason Dere and suresh via suresh)

HADOOP-10017. Fix NPE in DFSClient#getDelegationToken when doing Distcp
from a secured cluster to an insecure cluster. (Haohui Mai via jing9)

Release 2.1.1-beta - 2013-09-23

INCOMPATIBLE CHANGES

HADOOP-9944. Fix RpcRequestHeaderProto.callId to be sint32 rather than
uint32 since ipc.Client.CONNECTION_CONTEXT_CALL_ID is signed (i.e. -3)
(acmurthy)

NEW FEATURES

IMPROVEMENTS

HADOOP-9910. proxy server start and stop documentation wrong
(Andre Kelpe via harsh)

HADOOP-9446. Support Kerberos SPNEGO for IBM JDK. (Yu Gao via llu)

HADOOP-9787. ShutdownHelper util to shutdown threads and threadpools.
(Karthik Kambatla via Sandy Ryza)

HADOOP-9803. Add a generic type parameter to RetryInvocationHandler.
(szetszwo)

HADOOP-9821. ClientId should have getMsb/getLsb methods.
(Tsuyoshi OZAWA via jing9)

HADOOP-9435. Support building the JNI code against the IBM JVM.
(Tian Hong Wang via Colin Patrick McCabe)

HADOOP-9833. move slf4j to version 1.7.5 (Kousuke Saruta via stevel)

HADOOP-9672. Upgrade Avro dependency to 1.7.4. (sandy via kihwal)

HADOOP-9789. Support server advertised kerberos principals (daryn)

HADOOP-8814. Replace string equals "" by String#isEmpty().
(Brandon Li via suresh)

HADOOP-9802. Support Snappy codec on Windows. (cnauroth)

HADOOP-9879. Move the version info of zookeeper dependencies to
hadoop-project/pom (Karthik Kambatla via Sandy Ryza)

HADOOP-9886. Turn warning message in RetryInvocationHandler to debug (arpit)

HADOOP-9906. Move HAZKUtil to o.a.h.util.ZKUtil and make inner-classes
public (Karthik Kambatla via Sandy Ryza)

HADOOP-9918. Add addIfService to CompositeService (Karthik Kambatla via
Sandy Ryza)

HADOOP-9945. HAServiceState should have a state for stopped services.
(Karthik Kambatla via atm)

HADOOP-9962. in order to avoid dependency divergence within Hadoop itself
let's enable DependencyConvergence. (rvs via tucu)

HADOOP-9669. Reduce the number of byte array creations and copies in
XDR data manipulation. (Haohui Mai via brandonli)

OPTIMIZATIONS

BUG FIXES

HADOOP-9916. Fix race in ipc.Client retry. (Binglin Chang via llu)

HADOOP-9768. chown and chgrp reject users and groups with spaces on platforms
where spaces are otherwise acceptable. (cnauroth)

HADOOP-9801. Configuration#writeXml uses platform default encoding, which
may mishandle multi-byte characters. (cnauroth)

HADOOP-9806. PortmapInterface should check if the procedure is out-of-range
(brandonli)

HADOOP-9527. Add symlink support to LocalFileSystem on Windows.
(Arpit Agarwal via cnauroth)

HADOOP-9315. Port HADOOP-9249 hadoop-maven-plugins Clover fix to branch-2 to
fix build failures. (Dennis Y via cnauroth)

HADOOP-9831. Make checknative shell command accessible on Windows. (cnauroth)

HADOOP-9675. use svn:eol-style native for html to prevent line ending
issues (Colin Patrick McCabe)

HADOOP-9757. Har metadata cache can grow without limit (Cristina Abad via daryn)

HADOOP-9858. Remove unused private RawLocalFileSystem#execCommand method from
branch-2. (cnauroth)

HADOOP-9857. Tests block and sometimes timeout on Windows due to invalid
entropy source. (cnauroth)

HADOOP-9381. Document dfs cp -f option. (Keegan Witt, suresh via suresh)

HADOOP-9868. Server must not advertise kerberos realm. (daryn via kihwal)

HADOOP-9880. SASL changes from HADOOP-9421 break Secure HA NN. (daryn via
jing9)

HADOOP-9887. globStatus does not correctly handle paths starting with a drive
spec on Windows. (Chuan Liu via cnauroth)

HADOOP-9899. Remove the debug message, added by HADOOP-8855, from
KerberosAuthenticator. (szetszwo)

HADOOP-9894. Race condition in Shell leads to logged error stream handling
exceptions (Arpit Agarwal)

HADOOP-9774. RawLocalFileSystem.listStatus() returns absolute paths when
input path is relative on Windows. (Shanyu Zhao via ivanmi)

HADOOP-9924. FileUtil.createJarWithClassPath() does not generate relative
classpath correctly. (Shanyu Zhao via ivanmi)

HADOOP-9932. Improper synchronization in RetryCache. (kihwal)

HADOOP-9958. Add old constructor back to DelegationTokenInformation to
unbreak downstream builds. (Andrew Wang)

HADOOP-9960. Upgrade Jersey version to 1.9. (Karthik Kambatla via atm)

HADOOP-9557. hadoop-client excludes commons-httpclient. (Lohit Vijayarenu via
cnauroth)

HADOOP-9350. Hadoop not building against Java7 on OSX
(Robert Kanter via stevel)

HADOOP-9961. versions of a few transitive dependencies diverged between hadoop
subprojects. (rvs via tucu)

HADOOP-9977. Hadoop services won't start with different keypass and
keystorepass when https is enabled. (cnauroth)

HADOOP-10005. No need to check INFO severity level is enabled or not.
(Jackie Chang via suresh)

Release 2.1.0-beta - 2013-08-22

INCOMPATIBLE CHANGES

HADOOP-8886. Remove KFS support. (eli)

HADOOP-9163. [RPC v9] The rpc msg in ProtobufRpcEngine.proto should be moved out to
avoid an extra copy (Sanjay Radia)

HADOOP-9151. [RPC v9] Include RPC error info in RpcResponseHeader instead of sending
it separately (Sanjay Radia)

HADOOP-9380. [RPC v9] Add totalLength to rpc response (Sanjay Radia)

HADOOP-9425. [RPC v9] Add error codes to rpc-response (Sanjay Radia)

HADOOP-9194. [RPC v9] RPC support for QoS. (Junping Du via llu)

HADOOP-9630. [RPC v9] Remove IpcSerializationType. (Junping Du via llu)

HADOOP-9421. [RPC v9] Convert SASL to use ProtoBuf and provide
negotiation capabilities (daryn)

HADOOP-9688. Add globally unique Client ID to RPC requests. (suresh)

HADOOP-9683. [RPC v9] Wrap IpcConnectionContext in RPC headers (daryn)

HADOOP-9698. [RPC v9] Client must honor server's SASL negotiate response (daryn)

HADOOP-9832. [RPC v9] Add RPC header to client ping (daryn)

HADOOP-9820. [RPC v9] Wire protocol is insufficient to support multiplexing. (daryn via jitendra)

NEW FEATURES
|
||
|
||
HADOOP-9283. Add support for running the Hadoop client on AIX. (atm)
|
||
|
||
HADOOP-8415. Add getDouble() and setDouble() in
|
||
org.apache.hadoop.conf.Configuration (Jan van der Lugt via harsh)
|
||
|
||
HADOOP-9338. FsShell Copy Commands Should Optionally Preserve File
|
||
Attributes. (Nick White via atm)
|
||
|
||
HADOOP-8562. Enhancements to support Hadoop on Windows Server and Windows
|
||
Azure environments. (See breakdown of tasks below for subtasks and
|
||
contributors)
|
||
|
||
HADOOP-8469. Make NetworkTopology class pluggable. (Junping Du via
|
||
szetszwo)
|
||
|
||
HADOOP-8470. Add NetworkTopologyWithNodeGroup, a 4-layer implementation
|
||
of NetworkTopology. (Junping Du via szetszwo)
|
||
|
||
HADOOP-9763. Extend LightWeightGSet to support eviction of expired
|
||
elements. (Tsz Wo (Nicholas) SZE via jing9)
|
||
|
||
HADOOP-9762. RetryCache utility for implementing RPC retries.
|
||
(Suresh Srinivas via jing9)
|
||
|
||
HADOOP-9792. Retry the methods that are tagged @AtMostOnce along
|
||
with @Idempotent. (suresh)
|
||
|
||
HADOOP-9509. Implement ONCRPC and XDR. (brandonli)
|
||
|
||
HADOOP-9515. Add general interface for NFS and Mount. (brandonli)
|
||
|
||
IMPROVEMENTS
|
||
|
||
HADOOP-9164. Print paths of loaded native libraries in
|
||
NativeLibraryChecker. (Binglin Chang via llu)
|
||
|
||
HADOOP-9253. Capture ulimit info in the logs at service start time.
|
||
(Arpit Gupta via suresh)
|
||
|
||
HADOOP-8924. Add maven plugin alternative to shell script to save
|
||
package-info.java. (Chris Nauroth via suresh)
|
||
|
||
HADOOP-9117. replace protoc ant plugin exec with a maven plugin. (tucu)
|
||
|
||
HADOOP-9279. Document the need to build hadoop-maven-plugins for
|
||
eclipse and separate project builds. (Tsuyoshi Ozawa via suresh)
|
||
|
||
HADOOP-9334. Upgrade netty version. (Nicolas Liochon via suresh)
|
||
|
||
HADOOP-9343. Allow additional exceptions through the RPC layer. (sseth)
|
||
|
||
HADOOP-9318. When exiting on a signal, print the signal name first. (Colin
|
||
Patrick McCabe via atm)
|
||
|
||
HADOOP-9358. "Auth failed" log should include exception string (todd)
|
||
|
||
HADOOP-9401. CodecPool: Add counters for number of (de)compressors
|
||
leased out. (kkambatl via tucu)
|
||
|
||
HADOOP-9450. HADOOP_USER_CLASSPATH_FIRST is not honored; CLASSPATH
|
||
is PREpended instead of APpended. (Chris Nauroth and harsh via harsh)
|
||
|
||
HADOOP-9496. Bad merge of HADOOP-9450 on branch-2 breaks all bin/hadoop
|
||
calls that need HADOOP_CLASSPATH. (harsh)
|
||
|
||
HADOOP-9503. Remove sleep between IPC client connect timeouts.
|
||
(Varun Sharma via szetszwo)
|
||
|
||
HADOOP-9322. LdapGroupsMapping doesn't seem to set a timeout for
|
||
its directory search. (harsh)
|
||
|
||
HADOOP-9523. Provide a generic IBM java vendor flag in PlatformName.java
|
||
to support non-Sun JREs. (Tian Hong Wang via suresh)
|
||
|
||
HADOOP-9511. Adding support for additional input streams (FSDataInputStream
|
||
and RandomAccessFile) in SecureIOUtils so as to help YARN-578. (Omkar Vinit
|
||
Joshi via vinodkv)
|
||
|
||
HADOOP-9560. metrics2#JvmMetrics should have max memory size of JVM.
|
||
(Tsuyoshi Ozawa via suresh)
|
||
|
||
HADOOP-9140. Clean up rpc PB protos (Sanjay Radia)
|
||
|
||
HADOOP-9218. Document the Rpc-wrappers used internally (Sanjay Radia)
|
||
|
||
HADOOP-9574. Added new methods in AbstractDelegationTokenSecretManager for
|
||
helping YARN ResourceManager to reuse code for RM restart. (Jian He via
|
||
vinodkv)
|
||
|
||
HADOOP-7391. Document Interface Classification from HADOOP-5073 (Sanjay Radia)
|
||
|
||
HADOOP-9287. Parallel-testing hadoop-common (Andrey Klochkov via jlowe)
|
||
|
||
HADOOP-9604. Javadoc of FSDataOutputStream is slightly inaccurate. (Jingguo
|
||
Yao via atm)
|
||
|
||
HADOOP-9625. HADOOP_OPTS not picked up by hadoop command.
|
||
(Paul Han via arpit)
|
||
|
||
HADOOP-9649. Promoted YARN service life-cycle libraries into Hadoop Common
|
||
for usage across all Hadoop projects. (Zhijie Shen via vinodkv)
|
||
|
||
HADOOP-9517. Documented various aspects of compatibility for Apache
|
||
Hadoop. (Karthik Kambatla via acmurthy)
|
||
|
||
HADOOP-8608. Add Configuration API for parsing time durations. (cdouglas)
|
||
|
||
HADOOP-9619. Mark stability of .proto files (Sanjay Radia)
|
||
|
||
HADOOP-9676. Make maximum RPC buffer size configurable (Colin Patrick
|
||
McCabe)
|
||
|
||
HADOOP-9691. RPC clients can generate call ID using AtomicInteger instead of
|
||
synchronizing on the Client instance. (cnauroth)
|
||
|
||
HADOOP-9661. Allow metrics sources to be extended. (sandyr via tucu)
|
||
|
||
HADOOP-9370. Write FSWrapper class to wrap FileSystem and FileContext for
|
||
better test coverage. (Andrew Wang via Colin Patrick McCabe)
|
||
|
||
HADOOP-9355. Abstract symlink tests to use either FileContext or
|
||
FileSystem. (Andrew Wang via Colin Patrick McCabe)
|
||
|
||
HADOOP-9673. NetworkTopology: when a node can't be added, print out its
|
||
location for diagnostic purposes. (Colin Patrick McCabe)
|
||
|
||
HADOOP-9414. Refactor out FSLinkResolver and relevant helper methods.
|
||
(Andrew Wang via Colin Patrick McCabe)
|
||
|
||
HADOOP-9416. Add new symlink resolution methods in FileSystem and
|
||
FileSystemLinkResolver. (Andrew Wang via Colin Patrick McCabe)
|
||
|
||
HADOOP-9720. Rename Client#uuid to Client#clientId. (Arpit Agarwal via
|
||
suresh)
|
||
|
||
HADOOP-9734. Common protobuf definitions for GetUserMappingsProtocol,
|
||
RefreshAuthorizationPolicyProtocol and RefreshUserMappingsProtocol (jlowe)
|
||
|
||
HADOOP-9716. Rpc retries should use the same call ID as the original call.
|
||
(szetszwo)
|
||
|
||
HADOOP-9717. Add retry attempt count to the RPC requests. (jing9)
|
||
|
||
HADOOP-9751. Add clientId and retryCount to RpcResponseHeaderProto.
|
||
(szetszwo)
|
||
|
||
HADOOP-9754. Remove unnecessary "throws IOException/InterruptedException",
|
||
and fix generic and other javac warnings. (szetszwo)
|
||
|
||
HADOOP-9760. Move GSet and related classes to common from HDFS.
|
||
(suresh)
|
||
|
||
HADOOP-9756. Remove the deprecated getServer(..) methods from RPC.
|
||
(Junping Du via szetszwo)
|
||
|
||
HADOOP-9770. Make RetryCache#state non volatile. (suresh)
|
||
|
||
HADOOP-9786. RetryInvocationHandler#isRpcInvocation should support
|
||
ProtocolTranslator. (suresh and jing9)
|
||
|
||
HADOOP-9847. TestGlobPath symlink tests fail to cleanup properly.
|
||
(cmccabe via wang)
|
||
|
||
OPTIMIZATIONS
|
||
|
||
HADOOP-9150. Avoid unnecessary DNS resolution attempts for logical URIs
|
||
(todd)
|
||
|
||
HADOOP-9845. Update protobuf to 2.5 from 2.4.x. (tucu)
|
||
|
||
HADOOP-9872. Improve protoc version handling and detection. (tucu)
|
||
|
||
BUG FIXES
|
||
|
||
HADOOP-9451. Fault single-layer config if node group topology is enabled.
|
||
(Junping Du via llu)
|
||
|
||
HADOOP-9294. GetGroupsTestBase fails on Windows. (Chris Nauroth via suresh)
|
||
|
||
HADOOP-9305. Add support for running the Hadoop client on 64-bit AIX. (atm)
|
||
|
||
HADOOP-9245. mvn clean without running mvn install before fails.
|
||
(Karthik Kambatla via suresh)
|
||
|
||
HADOOP-9246. Execution phase for hadoop-maven-plugin should be
|
||
process-resources (Karthik Kambatla and Chris Nauroth via jlowe)
|
||
|
||
HADOOP-9297. remove old record IO generation and tests. (tucu)
|
||
|
||
HADOOP-9154. SortedMapWritable#putAll() doesn't add key/value classes to
|
||
the map. (Karthik Kambatla via tomwhite)
|
||
|
||
HADOOP-9304. remove addition of avro generated-sources dirs to build. (tucu)
|
||
|
||
HADOOP-9267. hadoop -help, -h, --help should show usage instructions.
|
||
(Andrew Wang via atm)
|
||
|
||
HADOOP-8569. CMakeLists.txt: define _GNU_SOURCE and _LARGEFILE_SOURCE.
|
||
(Colin Patrick McCabe via atm)
|
||
|
||
HADOOP-9323. Fix typos in API documentation. (suresh)
|
||
|
||
HADOOP-7487. DF should throw a more reasonable exception when mount cannot
|
||
be determined. (Andrew Wang via atm)
|
||
|
||
HADOOP-8917. add Locale.US to toLowerCase in SecurityUtil.replacePattern.
|
||
(Arpit Gupta via suresh)
|
||
|
||
HADOOP-9342. Remove jline from distribution. (thw via tucu)
|
||
|
||
HADOOP-9230. TestUniformSizeInputFormat fails intermittently.
|
||
(kkambatl via tucu)
|
||
|
||
HADOOP-9349. Confusing output when running hadoop version from one hadoop
|
||
installation when HADOOP_HOME points to another. (sandyr via tucu)
|
||
|
||
HADOOP-9337. org.apache.hadoop.fs.DF.getMount() does not work on Mac OS.
|
||
(Ivan A. Veselovsky via atm)
|
||
|
||
HADOOP-9369. DNS#reverseDns() can return hostname with . appended at the
|
||
end. (Karthik Kambatla via atm)
|
||
|
||
HADOOP-9379. capture the ulimit info after printing the log to the
|
||
console. (Arpit Gupta via suresh)
|
||
|
||
HADOOP-9399. protoc maven plugin doesn't work on mvn 3.0.2 (todd)
|
||
|
||
HADOOP-9407. commons-daemon 1.0.3 dependency has bad group id causing
|
||
build issues. (Sangjin Lee via suresh)
|
||
|
||
HADOOP-9405. TestGridmixSummary#testExecutionSummarizer is broken. (Andrew
|
||
Wang via atm)
|
||
|
||
HADOOP-9430. TestSSLFactory fails on IBM JVM. (Amir Sanjar via suresh)
|
||
|
||
HADOOP-9125. LdapGroupsMapping threw CommunicationException after some
|
||
idle time. (Kai Zheng via atm)
|
||
|
||
HADOOP-9429. TestConfiguration fails with IBM JAVA. (Amir Sanjar via
|
||
suresh)
|
||
|
||
HADOOP-9222. Cover package with org.apache.hadoop.io.lz4 unit tests (Vadim
|
||
Bondarev via jlowe)
|
||
|
||
HADOOP-9233. Cover package org.apache.hadoop.io.compress.zlib with unit
|
||
tests (Vadim Bondarev via jlowe)
|
||
|
||
HADOOP-9211. Set default max heap size in HADOOP_CLIENT_OPTS to 512m
|
||
in order to avoid OOME. (Plamen Jeliazkov via shv)
|
||
|
||
HADOOP-9473. Typo in FileUtil copy() method. (Glen Mazza via suresh)
|
||
|
||
HADOOP-9504. MetricsDynamicMBeanBase has concurrency issues in
|
||
createMBeanInfo (Liang Xie via jlowe)
|
||
|
||
HADOOP-9455. HADOOP_CLIENT_OPTS appended twice causes JVM failures.
|
||
(Chris Nauroth via suresh)
|
||
|
||
HADOOP-9550. Remove aspectj dependency. (kkambatl via tucu)
|
||
|
||
HADOOP-9549. WebHdfsFileSystem hangs on close(). (daryn via kihwal)
|
||
|
||
HADOOP-9485. No default value in the code for
|
||
hadoop.rpc.socket.factory.class.default. (Colin Patrick McCabe via atm)
|
||
|
||
HADOOP-9459. ActiveStandbyElector can join election even before
|
||
Service HEALTHY, and results in null data at ActiveBreadCrumb.
|
||
(Vinay and todd via todd)
|
||
|
||
HADOOP-9307. BufferedFSInputStream.read returns wrong results
|
||
after certain seeks. (todd)
|
||
|
||
HADOOP-9220. Unnecessary transition to standby in ActiveStandbyElector.
|
||
(tom and todd via todd)
|
||
|
||
HADOOP-9563. Fix incompatibility introduced by HADOOP-9523.
|
||
(Tian Hong Wang via suresh)
|
||
|
||
HADOOP-9566. Performing direct read using libhdfs sometimes raises SIGPIPE
|
||
(which in turn throws SIGABRT) causing client crashes. (Colin Patrick
|
||
McCabe via atm)
|
||
|
||
HADOOP-9481. Broken conditional logic with HADOOP_SNAPPY_LIBRARY. (Vadim
|
||
Bondarev via atm)
|
||
|
||
HADOOP-9593. stack trace printed at ERROR for all yarn clients without
|
||
hadoop.home set (stevel)
|
||
|
||
HADOOP-8957. AbstractFileSystem#IsValidName should be overridden for
|
||
embedded file systems like ViewFs (Chris Nauroth via Sanjay Radia)
|
||
|
||
HADOOP-9607. Fixes in Javadoc build (Timothy St. Clair via cos)
|
||
|
||
HADOOP-9605. Update junit dependency. (Timothy St. Clair via cos)
|
||
|
||
HADOOP-9581. hadoop --config non-existent directory should result in error
|
||
(Ashwin Shankar via jlowe)
|
||
|
||
HADOOP-9638. Parallel test changes caused invalid test path for several HDFS
|
||
tests on Windows (Andrey Klochkov via cnauroth)
|
||
|
||
HADOOP-9632. TestShellCommandFencer will fail if there is a 'host' machine in
|
||
the network. (Chuan Liu via cnauroth)
|
||
|
||
HADOOP-9624. TestFSMainOperationsLocalFileSystem failed when the Hadoop test
|
||
root path has "X" in its name. (Xi Fang via cnauroth)
|
||
|
||
HADOOP-9439. JniBasedUnixGroupsMapping: fix some crash bugs (Colin Patrick
|
||
McCabe)
|
||
|
||
HADOOP-9656. Gridmix unit tests fail on Windows and Linux. (Chuan Liu via
|
||
cnauroth)
|
||
|
||
HADOOP-9707. Fix register lists for crc32c inline assembly. (todd via
|
||
kihwal)
|
||
|
||
HADOOP-9738. TestDistCh fails. (jing9 via kihwal)
|
||
|
||
HADOOP-9759. Add support for NativeCodeLoader#getLibraryName on Windows.
|
||
(Chuan Liu via cnauroth)
|
||
|
||
HADOOP-9773. TestLightWeightCache should not set size limit to zero when
|
||
testing it. (szetszwo)
|
||
|
||
HADOOP-9507. LocalFileSystem rename() is broken in some cases when
|
||
destination exists. (cnauroth)
|
||
|
||
HADOOP-9816. RPC Sasl QOP is broken (daryn)
|
||
|
||
HADOOP-9850. RPC kerberos errors don't trigger relogin. (daryn via kihwal)
|
||
|
||
BREAKDOWN OF HADOOP-8562 SUBTASKS AND RELATED JIRAS
|
||
|
||
HADOOP-8924. Hadoop Common creating package-info.java must not depend on
|
||
sh. (Chris Nauroth via suresh)
|
||
|
||
HADOOP-8945. Merge winutils from branch-1-win to branch-trunk-win.
|
||
(Bikas Saha, Chuan Liu, Giridharan Kesavan, Ivan Mitic, and Steve Maine
|
||
ported by Chris Nauroth via suresh)
|
||
|
||
HADOOP-8946. winutils: compile codebase during Maven build on
|
||
branch-trunk-win. (Chris Nauroth via suresh)
|
||
|
||
HADOOP-8947. Merge FileUtil and Shell changes from branch-1-win to
|
||
branch-trunk-win to enable initial test pass. (Raja Aluri, David Lao,
|
||
Sumadhur Reddy Bolli, Ahmed El Baz, Kanna Karanam, Chuan Liu,
|
||
Ivan Mitic, Chris Nauroth, and Bikas Saha via suresh)
|
||
|
||
HADOOP-8954. "stat" executable not found on Windows. (Bikas Saha, Ivan Mitic
|
||
ported by Chris Nauroth via suresh)
|
||
|
||
HADOOP-8959. TestUserGroupInformation fails on Windows due to "id" executable
|
||
not found. (Bikas Saha, Ivan Mitic, ported by Chris Nauroth via suresh)
|
||
|
||
HADOOP-8955. "chmod" executable not found on Windows.
|
||
(Chris Nauroth via suresh)
|
||
|
||
HADOOP-8960. TestMetricsServlet fails on Windows. (Ivan Mitic via suresh)
|
||
|
||
HADOOP-8961. GenericOptionsParser URI parsing failure on Windows.
|
||
(Ivan Mitic via suresh)
|
||
|
||
HADOOP-8949. Remove FileUtil.CygPathCommand dead code. (Chris Nauroth via
|
||
suresh)
|
||
|
||
HADOOP-8956. FileSystem.primitiveMkdir failures on Windows cause multiple
|
||
test suites to fail. (Chris Nauroth via suresh)
|
||
|
||
HADOOP-8978. TestTrash fails on Windows. (Chris Nauroth via suresh)
|
||
|
||
HADOOP-8979. TestHttpServer fails on Windows. (Chris Nauroth via suresh)
|
||
|
||
HADOOP-8953. Shell PathData parsing failures on Windows. (Arpit Agarwal via
|
||
suresh)
|
||
|
||
HADOOP-8975. TestFileContextResolveAfs fails on Windows. (Chris Nauroth via
|
||
suresh)
|
||
|
||
HADOOP-8977. Multiple FsShell test failures on Windows. (Chris Nauroth via
|
||
suresh)
|
||
|
||
HADOOP-9005. Merge hadoop cmd line scripts from branch-1-win. (David Lao,
|
||
Bikas Saha, Lauren Yang, Chuan Liu, Thejas M Nair and Ivan Mitic via suresh)
|
||
|
||
HADOOP-9008. Building hadoop tarball fails on Windows. (Chris Nauroth via
|
||
suresh)
|
||
|
||
HADOOP-9011. saveVersion.py does not include branch in version annotation.
|
||
(Chris Nauroth via suresh)
|
||
|
||
HADOOP-9110. winutils ls off-by-one error indexing MONTHS array can cause
|
||
access violation. (Chris Nauroth via suresh)
|
||
|
||
HADOOP-9056. Build native library on Windows. (Chuan Liu, Arpit Agarwal via
|
||
suresh)
|
||
|
||
HADOOP-9144. Fix findbugs warnings. (Chris Nauroth via suresh)
|
||
|
||
HADOOP-9081. Add TestWinUtils. (Chuan Liu, Ivan Mitic, Chris Nauroth,
|
||
and Bikas Saha via suresh)
|
||
|
||
HADOOP-9146. Fix sticky bit regression on branch-trunk-win.
|
||
(Chris Nauroth via suresh)
|
||
|
||
HADOOP-9266. Fix javac, findbugs, and release audit warnings on
|
||
branch-trunk-win. (Chris Nauroth via suresh)
|
||
|
||
HADOOP-9270. Remove a stale java comment from FileUtil. (Chris Nauroth via
|
||
szetszwo)
|
||
|
||
HADOOP-9271. Revert Python build scripts from branch-trunk-win.
|
||
(Chris Nauroth via suresh)
|
||
|
||
HADOOP-9313. Remove spurious mkdir from hadoop-config.cmd.
|
||
(Ivan Mitic via suresh)
|
||
|
||
HADOOP-9309. Test failures on Windows due to UnsatisfiedLinkError
|
||
in NativeCodeLoader#buildSupportsSnappy. (Arpit Agarwal via suresh)
|
||
|
||
HADOOP-9347. Add instructions to BUILDING.txt describing how to
|
||
build on Windows. (Chris Nauroth via suresh)
|
||
|
||
HADOOP-9348. Address TODO in winutils to add more command line usage
|
||
and examples. (Chris Nauroth via suresh)
|
||
|
||
HADOOP-9354. Windows native project files missing license headers.
|
||
(Chris Nauroth via suresh)
|
||
|
||
HADOOP-9356. Remove remaining references to cygwin/cygpath from scripts.
|
||
(Chris Nauroth via suresh)
|
||
|
||
HADOOP-9232. JniBasedUnixGroupsMappingWithFallback fails on Windows
|
||
with UnsatisfiedLinkError. (Ivan Mitic via suresh)
|
||
|
||
HADOOP-9368. Add timeouts to new tests in branch-trunk-win.
|
||
(Arpit Agarwal via suresh)
|
||
|
||
HADOOP-9373. Merge CHANGES.branch-trunk-win.txt to CHANGES.txt.
|
||
(suresh)
|
||
|
||
HADOOP-9372. Fix bad timeout annotations on tests.
|
||
(Arpit Agarwal via suresh)
|
||
|
||
HADOOP-9376. TestProxyUserFromEnv fails on a Windows domain joined machine.
|
||
(Ivan Mitic via suresh)
|
||
|
||
HADOOP-9365. TestHAZKUtil fails on Windows. (Ivan Mitic via suresh)
|
||
|
||
HADOOP-9364. PathData#expandAsGlob does not return correct results for
|
||
absolute paths on Windows. (Ivan Mitic via suresh)
|
||
|
||
HADOOP-8973. DiskChecker cannot reliably detect an inaccessible disk on
|
||
Windows with NTFS ACLs. (Chris Nauroth via suresh)
|
||
|
||
HADOOP-9388. TestFsShellCopy fails on Windows. (Ivan Mitic via suresh)
|
||
|
||
HADOOP-9387. Fix DF so that it won't execute a shell command on Windows
|
||
to compute the file system/mount point. (Ivan Mitic via szetszwo)
|
||
|
||
HADOOP-9353. Activate native-win maven profile by default on Windows.
|
||
(Arpit Agarwal via szetszwo)
|
||
|
||
HADOOP-9437. TestNativeIO#testRenameTo fails on Windows due to assumption
|
||
that POSIX errno is embedded in NativeIOException. (Chris Nauroth via
|
||
suresh)
|
||
|
||
HADOOP-9443. Port winutils static code analysis change to trunk.
|
||
(Chuan Liu via suresh)
|
||
|
||
HADOOP-9290. Some tests cannot load native library on Windows.
|
||
(Chris Nauroth via suresh)
|
||
|
||
HADOOP-9500. TestUserGroupInformation#testGetServerSideGroups fails on
|
||
Windows due to failure to find winutils.exe. (Chris Nauroth via suresh)
|
||
|
||
HADOOP-9490. LocalFileSystem#reportChecksumFailure not closing the
|
||
checksum file handle before rename. (Ivan Mitic via suresh)
|
||
|
||
HADOOP-9524. Fix ShellCommandFencer to work on Windows.
|
||
(Arpit Agarwal via suresh)
|
||
|
||
HADOOP-9413. Add common utils for File#setReadable/Writable/Executable &
|
||
File#canRead/Write/Execute that work cross-platform. (Ivan Mitic via suresh)
|
||
|
||
HADOOP-9532. HADOOP_CLIENT_OPTS is appended twice by Windows cmd scripts.
|
||
(Chris Nauroth via suresh)
|
||
|
||
HADOOP-9043. Disallow creating symlinks with forward slashes in winutils.
|
||
(Chris Nauroth and Arpit Agarwal via suresh)
|
||
|
||
HADOOP-9483. winutils support for readlink command.
|
||
(Arpit Agarwal via suresh)
|
||
|
||
HADOOP-9488. FileUtil#createJarWithClassPath only substitutes environment
|
||
variables from current process environment/does not support overriding
|
||
when launching new process (Chris Nauroth via bikas)
|
||
|
||
HADOOP-9556. disable HA tests on Windows that fail due to ZooKeeper client
|
||
connection management bug. (Chris Nauroth via suresh)
|
||
|
||
HADOOP-9553. TestAuthenticationToken fails on Windows.
|
||
(Arpit Agarwal via suresh)
|
||
|
||
HADOOP-9397. Incremental dist tar build fails. (Chris Nauroth via jlowe)
|
||
|
||
HADOOP-9131. Turn off TestLocalFileSystem#testListStatusWithColons on
|
||
Windows. (Chris Nauroth via suresh)
|
||
|
||
HADOOP-9526. TestShellCommandFencer and TestShell fail on Windows.
|
||
(Arpit Agarwal via suresh)
|
||
|
||
HADOOP-8982. TestSocketIOWithTimeout fails on Windows.
|
||
(Chris Nauroth via suresh)
|
||
|
||
HADOOP-8958. ViewFs: Non-absolute mount name failures when running
|
||
multiple tests on Windows. (Chris Nauroth via suresh)
|
||
|
||
HADOOP-9599. hadoop-config.cmd doesn't set JAVA_LIBRARY_PATH correctly.
|
||
(Mostafa Elhemali via ivanmi)
|
||
|
||
HADOOP-9637. Adding Native Fstat for Windows as needed by YARN. (Chuan Liu
|
||
via cnauroth)
|
||
|
||
HADOOP-9264. Port change to use Java untar API on Windows from
|
||
branch-1-win to trunk. (Chris Nauroth via suresh)
|
||
|
||
HADOOP-9678. TestRPC#testStopsAllThreads intermittently fails on Windows.
|
||
(Ivan Mitic via cnauroth)
|
||
|
||
HADOOP-9681. FileUtil.unTarUsingJava() should close the InputStream upon
|
||
finishing. (Chuan Liu via cnauroth)
|
||
|
||
HADOOP-9665. Fixed BlockDecompressorStream#decompress to return -1 rather
|
||
than throw EOF at end of file. (Zhijie Shen via acmurthy)
|
||
|
||
HADOOP-8440. HarFileSystem.decodeHarURI fails for URIs whose host contains
|
||
numbers. (Ivan Mitic via cnauroth)
|
||
|
||
HADOOP-9643. org.apache.hadoop.security.SecurityUtil calls
|
||
toUpperCase(Locale.getDefault()) as well as toLowerCase(Locale.getDefault())
|
||
on hadoop.security.authentication value. (markrmiller@gmail.com via tucu)
|
||
|
||
HADOOP-9701. mvn site ambiguous links in hadoop-common. (kkambatl via tucu)
|
||
|
||
Release 2.0.6-alpha - 2013-08-22
|
||
|
||
INCOMPATIBLE CHANGES
|
||
|
||
NEW FEATURES
|
||
|
||
IMPROVEMENTS
|
||
|
||
OPTIMIZATIONS
|
||
|
||
BUG FIXES
|
||
|
||
Release 2.0.5-alpha - 2013-06-06
|
||
|
||
INCOMPATIBLE CHANGES
|
||
|
||
NEW FEATURES
|
||
|
||
IMPROVEMENTS
|
||
|
||
OPTIMIZATIONS
|
||
|
||
BUG FIXES
|
||
|
||
HADOOP-9407. commons-daemon 1.0.3 dependency has bad group id causing
|
||
build issues. (Sangjin Lee via suresh)
|
||
|
||
Release 2.0.4-alpha - 2013-04-25
|
||
|
||
INCOMPATIBLE CHANGES
|
||
|
||
NEW FEATURES
|
||
|
||
IMPROVEMENTS
|
||
|
||
OPTIMIZATIONS
|
||
|
||
BUG FIXES
|
||
|
||
HADOOP-9467. Metrics2 record filter should check name as well as tags.
|
||
(Chris Nauroth and Ganeshan Iyer via llu)
|
||
|
||
HADOOP-9406. hadoop-client leaks dependency on JDK tools jar. (tucu)
|
||
|
||
HADOOP-9301. hadoop client servlet/jsp/jetty/tomcat JARs creating
|
||
conflicts in Oozie & HttpFS. (tucu)
|
||
|
||
HADOOP-9299. kerberos name resolution is kicking in even when kerberos
|
||
is not configured (daryn)
|
||
|
||
HADOOP-9408. misleading description for net.topology.table.file.name
|
||
property in core-default.xml. (rajeshbabu via suresh)
|
||
|
||
HADOOP-9444. Modify hadoop-policy.xml to replace unexpanded variables to a
|
||
default value of '*'. (Roman Shaposhnik via vinodkv)
|
||
|
||
HADOOP-9471. hadoop-client wrongfully excludes jetty-util JAR,
|
||
breaking webhdfs. (tucu)
|
||
|
||
Release 2.0.3-alpha - 2013-02-06
|
||
|
||
INCOMPATIBLE CHANGES
|
||
|
||
HADOOP-8999. SASL negotiation is flawed (daryn)
|
||
|
||
NEW FEATURES
|
||
|
||
HADOOP-8561. Introduce HADOOP_PROXY_USER for secure impersonation in child
|
||
hadoop client processes. (Yu Gao via llu)
|
||
|
||
HADOOP-8597. Permit FsShell's text command to read Avro files.
|
||
(Ivan Vladimirov Ivanov via cutting)
|
||
|
||
HADOOP-9020. Add a SASL PLAIN server (daryn via bobby)
|
||
|
||
HADOOP-9090. Support on-demand publish of metrics. (Mostafa Elhemali via
|
||
suresh)
|
||
|
||
HADOOP-9054. Add AuthenticationHandler that uses Kerberos but allows for
|
||
an alternate form of authentication for browsers. (rkanter via tucu)
|
||
|
||
IMPROVEMENTS
|
||
|
||
HADOOP-8789. Tests setLevel(Level.OFF) should be Level.ERROR.
|
||
(Andy Isaacson via eli)
|
||
|
||
HADOOP-8755. Print thread dump when tests fail due to timeout. (Andrey
|
||
Klochkov via atm)
|
||
|
||
HADOOP-8806. libhadoop.so: dlopen should be better at locating
|
||
libsnappy.so, etc. (Colin Patrick McCabe via eli)
|
||
|
||
HADOOP-8812. ExitUtil#terminate should print Exception#toString. (eli)
|
||
|
||
HADOOP-8736. Add Builder for building RPC server. (Brandon Li via Suresh)
|
||
|
||
HDFS-3957. Change MutableQuantiles to use a shared thread for rolling
|
||
over metrics. (Andrew Wang via todd)
|
||
|
||
HADOOP-8851. Use -XX:+HeapDumpOnOutOfMemoryError JVM option in the forked
|
||
tests. (Ivan A. Veselovsky via atm)
|
||
|
||
HADOOP-8783. Improve RPC.Server's digest auth (daryn)
|
||
|
||
HADOOP-8889. Upgrade to Surefire 2.12.3 (todd)
|
||
|
||
HADOOP-8804. Improve Web UIs when the wildcard address is used.
|
||
(Senthil Kumar via eli)
|
||
|
||
HADOOP-8894. GenericTestUtils.waitFor should dump thread stacks on timeout
|
||
(todd)
|
||
|
||
HADOOP-8909. Hadoop Common Maven protoc calls must not depend on external
|
||
sh script. (Chris Nauroth via suresh)
|
||
|
||
HADOOP-8911. CRLF characters in source and text files.
|
||
(Raja Aluri via suresh)
|
||
|
||
HADOOP-8912. Add .gitattributes file to prevent CRLF and LF mismatches
|
||
for source and text files. (Raja Aluri via suresh)
|
||
|
||
HADOOP-8784. Improve IPC.Client's token use (daryn)
|
||
|
||
HADOOP-8929. Add toString, other improvements for SampleQuantiles (todd)
|
||
|
||
HADOOP-8922. Provide alternate JSONP output for JMXJsonServlet to allow
|
||
javascript in browser (Damien Hardy via bobby)
|
||
|
||
HADOOP-8931. Add Java version to startup message. (eli)
|
||
|
||
HADOOP-8925. Remove the packaging. (eli)
|
||
|
||
HADOOP-8985. Add namespace declarations in .proto files for languages
|
||
other than java. (Binglin Chang via suresh)
|
||
|
||
HADOOP-9009. Add SecurityUtil methods to get/set authentication method
|
||
(daryn via bobby)
|
||
|
||
HADOOP-9010. Map UGI authenticationMethod to RPC authMethod (daryn via
|
||
bobby)
|
||
|
||
HADOOP-9013. UGI should not hardcode loginUser's authenticationType (daryn
|
||
via bobby)
|
||
|
||
HADOOP-9014. Standardize creation of SaslRpcClients (daryn via bobby)
|
||
|
||
HADOOP-9015. Standardize creation of SaslRpcServers (daryn via bobby)
|
||
|
||
HADOOP-8860. Split MapReduce and YARN sections in documentation navigation.
|
||
(tomwhite via tucu)
|
||
|
||
HADOOP-9021. Enforce configured SASL method on the server (daryn via
|
||
bobby)
|
||
|
||
HADOOP-8998. set Cache-Control no-cache header on all dynamic content. (tucu)
|
||
|
||
HADOOP-9035. Generalize setup of LoginContext (daryn via bobby)
|
||
|
||
HADOOP-9093. Move all the Exception in PathExceptions to o.a.h.fs package.
|
||
(suresh)
|
||
|
||
HADOOP-9042. Add a test for umask in FileSystemContractBaseTest.
|
||
(Colin McCabe via eli)
|
||
|
||
HADOOP-9127. Update documentation for ZooKeeper Failover Controller.
|
||
(Daisuke Kobayashi via atm)
|
||
|
||
HADOOP-9004. Allow security unit tests to use external KDC. (Stephen Chu
|
||
via suresh)
|
||
|
||
HADOOP-9147. Add missing fields to FileStatus.toString.
|
||
(Jonathan Allen via suresh)
|
||
|
||
HADOOP-8427. Convert Forrest docs to APT, incremental. (adi2 via tucu)
|
||
|
||
HADOOP-9162. Add utility to check native library availability.
|
||
(Binglin Chang via suresh)
|
||
|
||
HADOOP-9173. Add security token protobuf definition to common and
|
||
use it in hdfs. (suresh)
|
||
|
||
HADOOP-9119. Add test to FileSystemContractBaseTest to verify integrity
|
||
of overwritten files. (Steve Loughran via suresh)
|
||
|
||
HADOOP-9192. Move token related request/response messages to common.
|
||
(suresh)
|
||
|
||
HADOOP-8712. Change default hadoop.security.group.mapping to
|
||
JniBasedUnixGroupsNetgroupMappingWithFallback (Robert Parker via todd)
|
||
|
||
HADOOP-9106. Allow configuration of IPC connect timeout.
|
||
(Robert Parker via suresh)
|
||
|
||
HADOOP-9216. CompressionCodecFactory#getCodecClasses should trim the
|
||
result of parsing by Configuration. (Tsuyoshi Ozawa via todd)
|
||
|
||
HADOOP-9231. Parametrize staging URL for the uniformity of
|
||
distributionManagement. (Konstantin Boudnik via suresh)
|
||
|
||
HADOOP-9276. Allow BoundedByteArrayOutputStream to be resettable.
|
||
(Arun Murthy via hitesh)
|
||
|
||
HADOOP-7688. Add servlet handler check in HttpServer.start().
|
||
(Uma Maheswara Rao G via szetszwo)
|
||
|
||
HADOOP-7886. Add toString to FileStatus. (SreeHari via jghoman)
|
||
|
||
OPTIMIZATIONS
|
||
|
||
HADOOP-8866. SampleQuantiles#query is O(N^2) instead of O(N). (Andrew Wang
|
||
via atm)
|
||
|
||
HADOOP-8926. hadoop.util.PureJavaCrc32 cache hit-ratio is low for static
|
||
data (Gopal V via bobby)
|
||
|
||
BUG FIXES
|
||
|
||
HADOOP-9041. FsUrlStreamHandlerFactory could cause an infinite loop in
|
||
FileSystem initialization. (Yanbo Liang and Radim Kolar via llu)
|
||
|
||
HADOOP-8418. Update UGI Principal classes name for running with
|
||
IBM JDK on 64-bit Windows. (Yu Gao via eyang)
|
||
|
||
HADOOP-8795. BASH tab completion doesn't look in PATH, assumes path to
|
||
executable is specified. (Sean Mackrory via atm)
|
||
|
||
HADOOP-8780. Update DeprecatedProperties apt file. (Ahmed Radwan via
|
||
tomwhite)
|
||
|
||
HADOOP-8833. fs -text should make sure to call inputstream.seek(0)
|
||
before using input stream. (tomwhite and harsh)
|
||
|
||
HADOOP-8791. Fix rm command documentation to indicate it deletes
|
||
files and not directories. (Jing Zhao via suresh)
|
||
|
||
HADOOP-8855. SSL-based image transfer does not work when Kerberos
|
||
is disabled. (todd via eli)
|
||
|
||
HADOOP-8616. ViewFS configuration requires a trailing slash. (Sandy Ryza
|
||
via atm)
|
||
|
||
HADOOP-8756. Fix SEGV when libsnappy is in java.library.path but
|
||
not LD_LIBRARY_PATH. (Colin Patrick McCabe via eli)
|
||
|
||
HADOOP-8881. FileBasedKeyStoresFactory initialization logging should
|
||
be debug not info. (tucu)
|
||
|
||
HADOOP-8913. hadoop-metrics2.properties should give units in comment
|
||
for sampling period. (Sandy Ryza via suresh)
|
||
|
||
HADOOP-8878. Uppercase namenode hostname causes hadoop dfs calls with
|
||
webhdfs filesystem and fsck to fail when security is on.
|
||
(Arpit Gupta via suresh)
|
||
|
||
HADOOP-8901. GZip and Snappy support may not work without unversioned
|
||
libraries (Colin Patrick McCabe via todd)
|
||
|
||
HADOOP-8883. Anonymous fallback in KerberosAuthenticator is broken.
|
||
(rkanter via tucu)
|
||
|
||
HADOOP-8900. BuiltInGzipDecompressor throws IOException - stored gzip size
|
||
doesn't match decompressed size. (Andy Isaacson via suresh)
|
||
|
||
HADOOP-8948. TestFileUtil.testGetDU fails on Windows due to incorrect
|
||
assumption of line separator. (Chris Nauroth via suresh)

HADOOP-8951. RunJar to fail with user-comprehensible error
message if jar missing. (stevel via suresh)

HADOOP-8713. TestRPCCompatibility fails intermittently with JDK7
(Trevor Robinson via tgraves)

HADOOP-9012. IPC Client sends wrong connection context (daryn via bobby)

HADOOP-7115. Add a cache for getpwuid_r and getpwgid_r calls (tucu)

HADOOP-6607. Add different variants of non caching HTTP headers. (tucu)

HADOOP-9049. DelegationTokenRenewer needs to be Singleton and FileSystems
should register/deregister to/from. (Karthik Kambatla via tomwhite)

HADOOP-9064. Augment DelegationTokenRenewer API to cancel the tokens on
calls to removeRenewAction. (kkambatl via tucu)

HADOOP-9103. UTF8 class does not properly decode Unicode characters
outside the basic multilingual plane. (todd)

HADOOP-9070. Kerberos SASL server cannot find kerberos key. (daryn via atm)

HADOOP-6762. Exception while doing RPC I/O closes channel
(Sam Rash and todd via todd)

HADOOP-9126. FormatZK and ZKFC startup can fail due to zkclient connection
establishment delay. (Rakesh R and todd via todd)

HADOOP-9113. o.a.h.fs.TestDelegationTokenRenewer is failing intermittently.
(Karthik Kambatla via eli)

HADOOP-9135. JniBasedUnixGroupsMappingWithFallback should log at debug
rather than info during fallback. (Colin Patrick McCabe via todd)

HADOOP-9152. HDFS can report negative DFS Used on clusters with very small
amounts of data. (Brock Noland via atm)

HADOOP-9153. Support createNonRecursive in ViewFileSystem.
(Sandy Ryza via tomwhite)

HADOOP-9181. Set daemon flag for HttpServer's QueuedThreadPool.
(Liang Xie via suresh)

HADOOP-9155. FsPermission should have different default value, 777 for
directory and 666 for file. (Binglin Chang via atm)

HADOOP-9183. Potential deadlock in ActiveStandbyElector. (tomwhite)

HADOOP-9203. RPCCallBenchmark should find a random available port.
(Andrew Purtell via suresh)

HADOOP-9178. src/main/conf is missing hadoop-policy.xml.
(Sandy Ryza via eli)

HADOOP-8816. HTTP Error 413 full HEAD if using kerberos authentication.
(moritzmoeller via tucu)

HADOOP-9212. Potential deadlock in FileSystem.Cache/IPC/UGI. (tomwhite)

HADOOP-8589. ViewFs tests fail when tests and home dirs are nested.
(Sanjay Radia)

HADOOP-9193. hadoop script can inadvertently expand wildcard arguments
when delegating to hdfs script. (Andy Isaacson via todd)

HADOOP-9215. when using cmake-2.6, libhadoop.so doesn't get created
(only libhadoop.so.1.0.0) (Colin Patrick McCabe via todd)

HADOOP-8857. hadoop.http.authentication.signature.secret.file docs
should not state that secret is randomly generated. (tucu)

HADOOP-9190. packaging docs is broken. (Andy Isaacson via tgraves)

HADOOP-9221. Convert remaining xdocs to APT. (Andy Isaacson via atm)

HADOOP-8981. TestMetricsSystemImpl fails on Windows. (Xuan Gong via suresh)

HADOOP-9124. SortedMapWritable violates contract of Map interface for
equals() and hashCode(). (Surenkumar Nihalani via tomwhite)

HADOOP-9278. Fix the file handle leak in HarMetaData.parseMetaData() in
HarFileSystem. (Chris Nauroth via szetszwo)

HADOOP-9252. In StringUtils, humanReadableInt(..) has a race condition and
the synchronization of limitDecimalTo2(double) can be avoided. (szetszwo)

HADOOP-9260. Hadoop version may not be correct when starting name node or
data node. (Chris Nauroth via jlowe)

HADOOP-9289. FsShell rm -f fails for non-matching globs. (Daryn Sharp via
suresh)

Release 2.0.2-alpha - 2012-09-07

INCOMPATIBLE CHANGES

HADOOP-8388. Remove unused BlockLocation serialization.
(Colin Patrick McCabe via eli)

HADOOP-8689. Make trash a server side configuration option. (eli)

HADOOP-8710. Remove ability for users to easily run the trash emptier. (eli)

HADOOP-8794. Rename YARN_HOME to HADOOP_YARN_HOME. (vinodkv via acmurthy)

NEW FEATURES

HDFS-3042. Automatic failover support for NameNode HA (todd)
(see dedicated section below for breakdown of subtasks)

HADOOP-8135. Add ByteBufferReadable interface to FSDataInputStream. (Henry
Robinson via atm)

HADOOP-8458. Add management hook to AuthenticationHandler to enable
delegation token operations support (tucu)

HADOOP-8465. hadoop-auth should support ephemeral authentication (tucu)

HADOOP-8644. AuthenticatedURL should be able to use SSLFactory. (tucu)

HADOOP-8581. add support for HTTPS to the web UIs. (tucu)

HADOOP-7754. Expose file descriptors from Hadoop-wrapped local
FileSystems (todd and ahmed via tucu)

HADOOP-8240. Add a new API to allow users to specify a checksum type
on FileSystem.create(..). (Kihwal Lee via szetszwo)

IMPROVEMENTS

HADOOP-8340. SNAPSHOT build versions should compare as less than their eventual
final release. (todd)

HADOOP-8361. Avoid out-of-memory problems when deserializing strings.
(Colin Patrick McCabe via eli)

HADOOP-8224. Don't hardcode hdfs.audit.logger in the scripts.
(Tomohiko Kinebuchi via eli)

HADOOP-8398. Cleanup BlockLocation. (eli)

HADOOP-8422. Deprecate FileSystem#getDefault* and getServerDefault
methods that don't take a Path argument. (eli)

HADOOP-8323. Add javadoc and tests for Text.clear() behavior (harsh)

HADOOP-8358. Config-related WARN for dfs.web.ugi can be avoided. (harsh)

HADOOP-8450. Remove src/test/system. (eli)

HADOOP-8244. Improve comments on ByteBufferReadable.read. (Henry Robinson
via atm)

HADOOP-8368. Use CMake rather than autotools to build native code (cmccabe via tucu)

HADOOP-8524. Allow users to get source of a Configuration
parameter (harsh)

HADOOP-8449. hadoop fs -text fails with compressed sequence files
with the codec file extension (harsh)

HADOOP-6802. Remove FS_CLIENT_BUFFER_DIR_KEY = "fs.client.buffer.dir"
from CommonConfigurationKeys.java (not used, deprecated)
(Sho Shimauchi via harsh)

HADOOP-3450. Add tests to Local Directory Allocator for
asserting their URI-returning capability (Sho Shimauchi via harsh)

HADOOP-8463. hadoop.security.auth_to_local needs a key definition and doc.
(Madhukara Phatak via eli)

HADOOP-8533. Remove unused parallel call capability in RPC.
(Brandon Li via suresh)

HADOOP-8423. MapFile.Reader.get() crashes jvm or throws
EOFException on Snappy or LZO block-compressed data
(todd via harsh)

HADOOP-8541. Better high-percentile latency metrics. (Andrew Wang via atm)

HADOOP-8362. Improve exception message when Configuration.set() is
called with a null key or value. (Madhukara Phatak
and Suresh Srinivas via harsh)

HADOOP-7818. DiskChecker#checkDir should fail if the directory is
not executable. (Madhukara Phatak via harsh)

HADOOP-8531. SequenceFile Writer can throw out a better error if a
serializer or deserializer isn't available
(Madhukara Phatak via harsh)

HADOOP-8609. IPC server logs a useless message when shutting down socket.
(Jon Zuanich via atm)

HADOOP-8620. Add -Drequire.fuse and -Drequire.snappy. (Colin
Patrick McCabe via eli)

HADOOP-8687. Upgrade log4j to 1.2.17. (eli)

HADOOP-8278. Make sure components declare correct set of dependencies.
(tomwhite)

HADOOP-8700. Use enum to define the checksum constants in DataChecksum.
(szetszwo)

HADOOP-8686. Fix warnings in native code. (Colin Patrick McCabe via eli)

HADOOP-8239. Add subclasses of MD5MD5CRC32FileChecksum to support file
checksum with CRC32C. (Kihwal Lee via szetszwo)

HADOOP-8619. WritableComparator must implement no-arg constructor.
(Chris Douglas via Suresh)

HADOOP-8075. Lower native-hadoop library log from info to debug.
(Hızır Sefa İrken via eli)

HADOOP-8748. Refactor DFSClient retry utility methods to a new class
in org.apache.hadoop.io.retry. (Arun C Murthy via szetszwo)

HADOOP-8754. Deprecate all the RPC.getServer() variants. (Brandon Li
via szetszwo)

HADOOP-8801. ExitUtil#terminate should capture the exception stack trace. (eli)

HADOOP-8819. & is incorrectly used instead of && in some file system
implementations. (Brandon Li via suresh)

HADOOP-7808. Port HADOOP-7510 - Add configurable option to use original
hostname in token instead of IP to allow server IP change.
(Daryn Sharp via suresh)

HADOOP-8367. Improve documentation of declaringClassProtocolName in
rpc headers. (Sanjay Radia)

HADOOP-8624. ProtobufRpcEngine should log all RPCs if TRACE logging is
enabled (todd)

BUG FIXES

HADOOP-8372. NetUtils.normalizeHostName() incorrectly handles hostname
starting with a numeric character. (Junping Du via suresh)

HADOOP-8393. hadoop-config.sh missing variable exports, causes Yarn jobs
to fail with ClassNotFoundException MRAppMaster. (phunt via tucu)

HADOOP-8316. Audit logging should be disabled by default. (eli)

HADOOP-8400. All commands warn "Kerberos krb5 configuration not found"
when security is not enabled. (tucu)

HADOOP-8406. CompressionCodecFactory.CODEC_PROVIDERS iteration is
thread-unsafe (todd)

HADOOP-8287. etc/hadoop is missing hadoop-env.sh (eli)

HADOOP-8408. MR doesn't work with a non-default ViewFS mount table
and security enabled. (atm via eli)

HADOOP-8329. Build fails with Java 7. (eli)

HADOOP-8268. A few pom.xml across Hadoop project
may fail XML validation. (Radim Kolar via harsh)

HADOOP-8444. Fix the tests FSMainOperationsBaseTest.java and
FileContextMainOperationsBaseTest.java to avoid potential
test failure (Madhukara Phatak via harsh)

HADOOP-8452. DN logs backtrace when running under jsvc and /jmx is loaded
(Andy Isaacson via bobby)

HADOOP-8460. Document proper setting of HADOOP_PID_DIR and
HADOOP_SECURE_DN_PID_DIR (bobby)

HADOOP-8466. hadoop-client POM incorrectly excludes avro. (bmahe via tucu)

HADOOP-8481. update BUILDING.txt to talk about cmake rather than autotools.
(Colin Patrick McCabe via eli)

HADOOP-8485. Don't hardcode "Apache Hadoop 0.23" in the docs. (eli)

HADOOP-8488. test-patch.sh gives +1 even if the native build fails.
(Colin Patrick McCabe via eli)

HADOOP-8507. Avoid OOM while deserializing DelegationTokenIdentifier.
(Colin Patrick McCabe via eli)

HADOOP-8433. Don't set HADOOP_LOG_DIR in hadoop-env.sh.
(Brahma Reddy Battula via eli)

HADOOP-8509. JarFinder duplicate entry: META-INF/MANIFEST.MF exception (tucu)

HADOOP-8512. AuthenticatedURL should reset the Token when the server returns
other than OK on authentication (tucu)

HADOOP-8168. empty-string owners or groups cause MissingFormatWidthException
in o.a.h.fs.shell.Ls.ProcessPath() (ekoontz via tucu)

HADOOP-8438. hadoop-validate-setup.sh refers to examples jar file which doesn't exist
(Devaraj K via umamahesh)

HADOOP-8538. CMake builds fail on ARM. (Trevor Robinson via eli)

HADOOP-8547. Package hadoop-pipes examples/bin directory (again).
(Colin Patrick McCabe via eli)

HADOOP-8563. don't package hadoop-pipes examples/bin
(Colin Patrick McCabe via tgraves)

HADOOP-8566. AvroReflectSerializer.accept(Class) throws a NPE if the class has no
package (primitive types and arrays). (tucu)

HADOOP-8586. Fixup a bunch of SPNEGO misspellings. (eli)

HADOOP-3886. Error in javadoc of Reporter, Mapper and Progressable
(Jingguo Yao via harsh)

HADOOP-8587. HarFileSystem access of harMetaCache isn't threadsafe. (eli)

HADOOP-8585. Fix initialization circularity between UserGroupInformation
and HadoopConfiguration. (Colin Patrick McCabe via atm)

HADOOP-8552. Conflict: Same security.log.file for multiple users.
(kkambatl via tucu)

HADOOP-8537. Fix TFile tests to pass even when native zlib support is not
compiled. (todd)

HADOOP-8626. Typo in default setting for
hadoop.security.group.mapping.ldap.search.filter.user. (Jonathan Natkins
via atm)

HADOOP-8480. The native build should honor -DskipTests.
(Colin Patrick McCabe via eli)

HADOOP-8659. Native libraries must build with soft-float ABI for Oracle JVM
on ARM. (Trevor Robinson via todd)

HADOOP-8654. TextInputFormat delimiter bug (Gelesh and Jason Lowe via
bobby)

HADOOP-8614. IOUtils#skipFully hangs forever on EOF.
(Colin Patrick McCabe via eli)

HADOOP-8720. TestLocalFileSystem should use test root subdirectory.
(Vlad Rozov via eli)

HADOOP-8721. ZKFC should not retry 45 times when attempting a graceful
fence during a failover. (Vinayakumar B via atm)

HADOOP-8632. Configuration leaking class-loaders (Costin Leau via bobby)

HADOOP-4572. Can not access user logs - Jetty is not configured by default
to serve aliases/symlinks (ahmed via tucu)

HADOOP-8660. TestPseudoAuthenticator failing with NPE. (tucu)

HADOOP-8699. some common testcases create core-site.xml in test-classes
causing other testcases to fail. (tucu)

HADOOP-8031. Configuration class fails to find embedded .jar resources;
should use URL.openStream() (genman via tucu)

HADOOP-8737. cmake: always use JAVA_HOME to find libjvm.so, jni.h, jni_md.h.
(Colin Patrick McCabe via eli)

HADOOP-8747. Syntax error on cmake version 2.6 patch 2 in JNIFlags.cmake. (cmccabe via tucu)

HADOOP-8722. Update BUILDING.txt with latest snappy info.
(Colin Patrick McCabe via eli)

HADOOP-8764. CMake: HADOOP-8737 broke ARM build. (Trevor Robinson via eli)

HADOOP-8770. NN should not RPC to self to find trash defaults. (eli)

HADOOP-8648. libhadoop: native CRC32 validation crashes when
io.bytes.per.checksum=1. (Colin Patrick McCabe via eli)

HADOOP-8766. FileContextMainOperationsBaseTest should randomize the root
dir. (Colin Patrick McCabe via atm)

HADOOP-8749. HADOOP-8031 changed the way in which relative xincludes are handled in
Configuration. (ahmed via tucu)

HADOOP-8431. Running distcp without args throws IllegalArgumentException.
(Sandy Ryza via eli)

HADOOP-8775. MR2 distcp permits non-positive value to -bandwidth option
which causes job never to complete. (Sandy Ryza via atm)

HADOOP-8781. hadoop-config.sh should add JAVA_LIBRARY_PATH to LD_LIBRARY_PATH. (tucu)

BREAKDOWN OF HDFS-3042 SUBTASKS

HADOOP-8220. ZKFailoverController doesn't handle failure to become active
correctly (todd)

HADOOP-8228. Auto HA: Refactor tests and add stress tests. (todd)

HADOOP-8215. Security support for ZK Failover controller (todd)

HADOOP-8245. Fix flakiness in TestZKFailoverController (todd)

HADOOP-8257. TestZKFailoverControllerStress occasionally fails with Mockito
error (todd)

HADOOP-8260. Replace ClientBaseWithFixes with our own modified copy of the
class (todd)

HADOOP-8246. Auto-HA: automatically scope znode by nameservice ID (todd)

HADOOP-8247. Add a config to enable auto-HA, which disables manual
FailoverController (todd)

HADOOP-8306. ZKFC: improve error message when ZK is not running. (todd)

HADOOP-8279. Allow manual failover to be invoked when auto-failover is
enabled. (todd)

HADOOP-8276. Auto-HA: add config for java options to pass to zkfc daemon
(todd via eli)

HADOOP-8405. ZKFC tests leak ZK instances. (todd)

Release 2.0.0-alpha - 2012-05-23

INCOMPATIBLE CHANGES

HADOOP-7920. Remove Avro Rpc. (suresh)

NEW FEATURES

HADOOP-7773. Add support for protocol buffer based RPC engine.
(suresh)

HADOOP-7875. Add helper class to unwrap protobuf ServiceException.
(suresh)

HADOOP-7454. Common side of High Availability Framework (HDFS-1623)
Contributed by Todd Lipcon, Aaron T. Myers, Eli Collins, Uma Maheswara Rao G,
Bikas Saha, Suresh Srinivas, Jitendra Nath Pandey, Hari Mankude, Brandon Li,
Sanjay Radia, Mingjie Lai, and Gregory Chanan

HADOOP-8121. Active Directory Group Mapping Service. (Jonathan Natkins via
atm)

HADOOP-7030. Add TableMapping topology implementation to read host to rack
mapping from a file. (Patrick Angeles and tomwhite via tomwhite)

HADOOP-8206. Common portion of a ZK-based failover controller (todd)

HADOOP-8210. Common side of HDFS-3148: The client should be able
to use multiple local interfaces for data transfer. (eli)

HADOOP-8343. Allow configuration of authorization for JmxJsonServlet and
MetricsServlet (tucu)

IMPROVEMENTS

HADOOP-7524. Change RPC to allow multiple protocols including multiple
versions of the same protocol (Sanjay Radia)

HADOOP-7607. Simplify the RPC proxy cleanup process. (atm)

HADOOP-7687. Make getProtocolSignature public (sanjay)

HADOOP-7693. Enhance AvroRpcEngine to support the new #addProtocol
interface introduced in HADOOP-7524. (cutting)

HADOOP-7716. RPC protocol registration on SS does not log the protocol name
(only the class which may be different) (sanjay)

HADOOP-7776. Make the Ipc-Header in a RPC-Payload an explicit header.
(sanjay)

HADOOP-7862. Move the support for multiple protocols to lower layer so
that Writable, PB and Avro can all use it (Sanjay)

HADOOP-7876. Provide access to encoded key in DelegationKey for
use in protobuf based RPCs. (suresh)

HADOOP-7899. Generate proto java files as part of the build. (tucu)

HADOOP-7957. Classes deriving GetGroupsBase should be able to override
proxy creation. (jitendra)

HADOOP-7965. Support for protocol version and signature in PB. (jitendra)

HADOOP-8070. Add a standalone benchmark for RPC call performance. (todd)

HADOOP-8084. Updates ProtoBufRpc engine to not do an unnecessary copy
for RPC request/response. (ddas)

HADOOP-8085. Add RPC metrics to ProtobufRpcEngine. (Hari Mankude via
suresh)

HADOOP-8098. KerberosAuthenticatorHandler should use _HOST replacement to
resolve principal name (tucu)

HADOOP-8118. In metrics2.util.MBeans, change log level to trace for the
stack trace of InstanceAlreadyExistsException. (szetszwo)

HADOOP-8125. make hadoop-client set of curated jars available in a
distribution tarball (rvs via tucu)

HADOOP-7717. Move handling of concurrent client fail-overs to
RetryInvocationHandler (atm)

HADOOP-7728. Enable task memory management to be configurable in hadoop
config setup script. (ramya)

HADOOP-7358. Improve log levels when exceptions caught in RPC handler
(Todd Lipcon via shv)

HADOOP-7557. Make IPC header be extensible (sanjay radia)

HADOOP-7806. Support binding to sub-interfaces (eli)

HADOOP-6941. Adds support for building Hadoop with IBM's JDK
(Stephen Watt, Eli and ddas)

HADOOP-8183. Stop using "mapred.used.genericoptions.parser" (harsh)

HADOOP-6924. Adds a directory to the list of directories to search
for the libjvm.so file. The new directory is found by running a 'find'
command and the first output is taken. This was done to handle the
build of Hadoop with IBM's JDK. (Stephen Watt, Guillermo Cabrera and ddas)

HADOOP-8200. Remove HADOOP_[JOBTRACKER|TASKTRACKER]_OPTS. (eli)

HADOOP-8184. ProtoBuf RPC engine uses the IPC layer reply packet.
(Sanjay Radia via szetszwo)

HADOOP-8163. Improve ActiveStandbyElector to provide hooks for
fencing old active. (todd)

HADOOP-8193. Refactor FailoverController/HAAdmin code to add an abstract
class for "target" services. (todd)

HADOOP-8212. Improve ActiveStandbyElector's behavior when session expires
(todd)

HADOOP-8216. Address log4j.properties inconsistencies between main and
template dirs. (Patrick Hunt via eli)

HADOOP-8149. Cap space usage of default log4j rolling policy.
(Patrick Hunt via eli)

HADOOP-8211. Update commons-net version to 3.1. (eli)

HADOOP-8236. haadmin should have configurable timeouts for failover
commands. (todd)

HADOOP-8242. AbstractDelegationTokenIdentifier: add getter methods
for owner and realuser. (Colin Patrick McCabe via eli)

HADOOP-8007. Use substitution tokens for fencing argument (todd)

HADOOP-8077. HA: fencing method should be able to be configured on
a per-NN or per-NS basis (todd)

HADOOP-8086. KerberosName silently sets defaultRealm to "" if the
Kerberos config is not found, it should log a WARN (tucu)

HADOOP-8280. Move VersionUtil/TestVersionUtil and GenericTestUtils from
HDFS into Common. (Ahmed Radwan via atm)

HADOOP-8117. Upgrade test build to Surefire 2.12 (todd)

HADOOP-8152. Expand public APIs for security library classes. (atm via eli)

HADOOP-7549. Use JDK ServiceLoader mechanism to find FileSystem implementations. (tucu)

HADOOP-8185. Update namenode -format documentation and add -nonInteractive
and -force. (Arpit Gupta via atm)

HADOOP-8214. make hadoop script recognize a full set of deprecated commands (rvs via tucu)

HADOOP-8347. Hadoop Common logs misspell 'successful'.
(Philip Zeyliger via eli)

HADOOP-8350. Improve NetUtils.getInputStream to return a stream which has
a tunable timeout. (todd)

HADOOP-8356. FileSystem service loading mechanism should print the FileSystem
impl it is failing to load (tucu)

HADOOP-8353. hadoop-daemon.sh and yarn-daemon.sh can be misleading on stop.
(Roman Shaposhnik via atm)

HADOOP-8113. Correction to BUILDING.txt: HDFS needs ProtocolBuffer, too
(not just MapReduce). Contributed by Eugene Koontz.

HADOOP-8285. Use ProtoBuf for RpcPayLoadHeader (sanjay radia)

HADOOP-8366. Use ProtoBuf for RpcResponseHeader (sanjay radia)

HADOOP-7729. Send back valid HTTP response if user hits IPC port with
HTTP GET. (todd)

HADOOP-7987. Support setting the run-as user in unsecure mode. (jitendra)

HADOOP-7994. Remove getProtocolVersion and getProtocolSignature from the
client side translator and server side implementation. (jitendra)

OPTIMIZATIONS

BUG FIXES

HADOOP-8199. Fix issues in start-all.sh and stop-all.sh (Devaraj K via umamahesh)

HADOOP-7635. RetryInvocationHandler should release underlying resources on
close. (atm)

HADOOP-7695. RPC.stopProxy can throw unintended exception while logging
error. (atm)

HADOOP-7833. Fix findbugs warnings in protobuf generated code.
(John Lee via suresh)

HADOOP-7897. ProtobufRpcEngine client side exception mechanism is not
consistent with WritableRpcEngine. (suresh)

HADOOP-7913. Fix bug in ProtoBufRpcEngine. (sanjay)

HADOOP-7892. IPC logs too verbose after "RpcKind" introduction. (todd)

HADOOP-7968. Errant println left in RPC.getHighestSupportedProtocol. (Sho
Shimauchi via harsh)

HADOOP-7931. o.a.h.ipc.WritableRpcEngine should have a way to force
initialization. (atm)

HADOOP-8104. Inconsistent Jackson versions (tucu)

HADOOP-8119. Fix javac warnings in TestAuthenticationFilter in hadoop-auth.
(szetszwo)

HADOOP-7888. TestFailoverProxy fails intermittently on trunk. (Jason Lowe
via atm)

HADOOP-8154. DNS#getIPs shouldn't silently return the local host
IP for bogus interface names. (eli)

HADOOP-8169. javadoc generation fails with java.lang.OutOfMemoryError:
Java heap space (tgraves via bobby)

HADOOP-8167. Configuration deprecation logic breaks backwards compatibility (tucu)

HADOOP-8189. LdapGroupsMapping shouldn't throw away IOException. (Jonathan Natkins via atm)

HADOOP-8191. SshFenceByTcpPort uses netcat incorrectly (todd)

HADOOP-8157. Fix race condition in Configuration that could cause spurious
ClassNotFoundExceptions after a GC. (todd)

HADOOP-8197. Configuration logs WARNs on every use of a deprecated key (tucu)

HADOOP-8159. NetworkTopology: getLeaf should check for invalid topologies.
(Colin Patrick McCabe via eli)

HADOOP-8204. TestHealthMonitor fails occasionally (todd)

HADOOP-8202. RPC stopProxy() does not close the proxy correctly.
(Hari Mankude via suresh)

HADOOP-8218. RPC.closeProxy shouldn't throw error when closing a mock
(todd)

HADOOP-8238. NetUtils#getHostNameOfIP blows up if given ip:port
string w/o port. (eli)

HADOOP-8243. Security support broken in CLI (manual) failover controller
(todd)

HADOOP-8251. Fix SecurityUtil.fetchServiceTicket after HADOOP-6941 (todd)

HADOOP-8249. invalid hadoop-auth cookies should trigger authentication
if info is avail before returning HTTP 401 (tucu)

HADOOP-8261. Har file system doesn't deal with FS URIs with a host but no
port. (atm)

HADOOP-8263. Stringification of IPC calls not useful (todd)

HADOOP-8264. Remove irritating double double quotes in front of hostname
(Bernd Fondermann via bobby)

HADOOP-8270. hadoop-daemon.sh stop action should return 0 for an
already stopped service. (Roman Shaposhnik via eli)

HADOOP-8144. pseudoSortByDistance in NetworkTopology doesn't work
properly if no local node and first node is local rack node.
(Junping Du)

HADOOP-8282. start-all.sh incorrectly refers to start-dfs.sh
existence for starting start-yarn.sh. (Devaraj K via eli)

HADOOP-7350. Use ServiceLoader to discover compression codec classes.
(tomwhite)

HADOOP-8284. clover integration broken, also mapreduce poms are pulling
in clover as a dependency. (phunt via tucu)

HADOOP-8309. Pseudo & Kerberos AuthenticationHandler should use
getType() to create token (tucu)

HADOOP-8314. HttpServer#hasAdminAccess should return false if
authorization is enabled but user is not authenticated. (tucu)

HADOOP-8296. hadoop/yarn daemonlog usage wrong (Devaraj K via tgraves)

HADOOP-8310. FileContext#checkPath should handle URIs with no port. (atm)

HADOOP-8321. TestUrlStreamHandler fails. (tucu)

HADOOP-8325. Add a ShutdownHookManager to be used by different
components instead of the JVM shutdownhook (tucu)

HADOOP-8275. Range check DelegationKey length.
(Colin Patrick McCabe via eli)

HADOOP-8342. HDFS command fails with exception following merge of
HADOOP-8325 (tucu)

HADOOP-8346. Makes oid changes to make SPNEGO work. Was broken due
to fixes introduced by the IBM JDK compatibility patch. (ddas)

HADOOP-8355. SPNEGO filter throws/logs exception when authentication fails (tucu)

HADOOP-8349. ViewFS doesn't work when the root of a file system is mounted. (atm)

HADOOP-8328. Duplicate FileSystem Statistics object for 'file' scheme.
(tomwhite)

HADOOP-8359. Fix javadoc warnings in Configuration. (Anupam Seth via
szetszwo)

HADOOP-7988. Upper case in hostname part of the principals doesn't work with
kerberos. (jitendra)

BREAKDOWN OF HADOOP-7454 SUBTASKS

HADOOP-7455. HA: Introduce HA Service Protocol Interface. (suresh)

HADOOP-7774. HA: Administrative CLI to control HA daemons. (todd)

HADOOP-7896. HA: if both NNs are in Standby mode, client needs to try failing
back and forth several times with sleeps. (atm)

HADOOP-7922. Improve some logging for client IPC failovers and
StandbyExceptions (todd)

HADOOP-7921. StandbyException should extend IOException (todd)

HADOOP-7928. HA: Client failover policy is incorrectly trying to fail over all
IOExceptions (atm)

HADOOP-7925. Add interface and update CLI to query current state to
HAServiceProtocol (eli via todd)

HADOOP-7932. Make client connection retries on socket time outs configurable.
(Uma Maheswara Rao G via todd)

HADOOP-7924. FailoverController for client-based configuration (eli)

HADOOP-7961. Move HA fencing to common. (eli)

HADOOP-7970. HAServiceProtocol methods must throw IOException. (Hari Mankude
via suresh).

HADOOP-7992. Add ZKClient library to facilitate leader election. (Bikas Saha
via suresh).

HADOOP-7983. HA: failover should be able to pass args to fencers. (eli)

HADOOP-7938. HA: the FailoverController should optionally fence the active
during failover. (eli)

HADOOP-7991. HA: the FailoverController should check the standby is ready
before failing over. (eli)

HADOOP-8038. Add 'ipc.client.connect.max.retries.on.timeouts' entry in
core-default.xml file. (Uma Maheswara Rao G via atm)

HADOOP-8041. Log a warning when a failover is first attempted (todd)

HADOOP-8068. void methods can swallow exceptions when going through failover
path (todd)

HADOOP-8116. RetriableCommand is using RetryPolicy incorrectly after
HADOOP-7896. (atm)

HADOOP-8317. Update maven-assembly-plugin to 2.3 - fix build on FreeBSD
(Radim Kolar via bobby)

HADOOP-8172. Configuration no longer sets all keys in a deprecated key
list. (Anupam Seth via bobby)

HADOOP-7868. Hadoop native fails to compile when default linker
option is -Wl,--as-needed. (Trevor Robinson via eli)

HADOOP-8655. Fix TextInputFormat for large delimiters. (Gelesh via
bobby)

HADOOP-7900. LocalDirAllocator confChanged() accesses conf.get() twice
(Ravi Gummadi via Uma Maheswara Rao G)

HADOOP-8146. FsShell commands cannot be interrupted
(Daryn Sharp via Uma Maheswara Rao G)

HADOOP-8018. Hudson auto test for HDFS has started throwing javadoc
(Jon Eagles via bobby)

HADOOP-8001. ChecksumFileSystem's rename doesn't correctly handle checksum
files. (Daryn Sharp via bobby)

HADOOP-8006. TestFSInputChecker is failing in trunk.
(Daryn Sharp via bobby)

HADOOP-7998. ChecksumFileSystem does not correctly honor setVerifyChecksum
(Daryn Sharp via bobby)

HADOOP-7606. Upgrade Jackson to version 1.7.1 to match the version required
by Jersey (Alejandro Abdelnur via atm)

Release 0.23.11 - UNRELEASED

INCOMPATIBLE CHANGES

NEW FEATURES

IMPROVEMENTS

OPTIMIZATIONS

BUG FIXES

HADOOP-10129. Distcp may succeed when it fails (daryn)

HADOOP-10332. HttpServer's jetty audit log always logs 200 OK (jeagles)

Release 0.23.10 - UNRELEASED

INCOMPATIBLE CHANGES

NEW FEATURES

IMPROVEMENTS

HADOOP-9686. Easy access to final parameters in Configuration (Jason Lowe
via jeagles)

HADOOP-8704. add request logging to jetty/httpserver (jeagles)

OPTIMIZATIONS

HADOOP-9956. RPC listener inefficiently assigns connections to readers (daryn)

HADOOP-9955. RPC idle connection closing is extremely inefficient (daryn)

BUG FIXES

Release 0.23.9 - 2013-07-08

INCOMPATIBLE CHANGES

NEW FEATURES

IMPROVEMENTS

OPTIMIZATIONS

BUG FIXES

HADOOP-9581. hadoop --config non-existent directory should result in error
(Ashwin Shankar via jlowe)

HADOOP-9582. Non-existent file to "hadoop fs -conf" doesn't throw error
(Ashwin Shankar via jlowe)

Release 0.23.8 - 2013-06-05

INCOMPATIBLE CHANGES

NEW FEATURES

IMPROVEMENTS

OPTIMIZATIONS

BUG FIXES

HADOOP-9222. Cover package with org.apache.hadoop.io.lz4 unit tests (Vadim
Bondarev via jlowe)

HADOOP-9233. Cover package org.apache.hadoop.io.compress.zlib with unit
tests (Vadim Bondarev via jlowe)

HADOOP-9469. mapreduce/yarn source jars not included in dist tarball
(Robert Parker via tgraves)

HADOOP-9504. MetricsDynamicMBeanBase has concurrency issues in
createMBeanInfo (Liang Xie via jlowe)

HADOOP-9614. smart-test-patch.sh hangs for new version of patch (2.7.1)
(Ravi Prakash via jeagles)

Release 0.23.7 - 2013-04-18

INCOMPATIBLE CHANGES

NEW FEATURES

IMPROVEMENTS

HADOOP-8849. FileUtil#fullyDelete should grant the target directories +rwx
permissions (Ivan A. Veselovsky via bobby)

HADOOP-9067. provide test for LocalFileSystem.reportChecksumFailure
(Ivan A. Veselovsky via bobby)

HADOOP-9336. Allow UGI of current connection to be queried. (Daryn Sharp
via kihwal)

HADOOP-9352. Expose UGI.setLoginUser for tests (daryn)

HADOOP-9209. Add shell command to dump file checksums (Todd Lipcon via
jeagles)

HADOOP-9374. Add tokens from -tokenCacheFile into UGI (daryn)

HADOOP-8711. IPC Server supports adding exceptions for which
the message is printed and the stack trace is not printed to avoid chatter.
(Brandon Li via Suresh)

OPTIMIZATIONS

HADOOP-8462. Native-code implementation of bzip2 codec. (Govind Kamat via
jlowe)

BUG FIXES

HADOOP-9302. HDFS docs not linked from top level (Andy Isaacson via
tgraves)

HADOOP-9303. command manual dfsadmin missing entry for restoreFailedStorage
option (Andy Isaacson via tgraves)

HADOOP-9339. IPC.Server incorrectly sets UGI auth type (Daryn Sharp via
kihwal)

Release 0.23.6 - 2013-02-06
|
||
|
||
INCOMPATIBLE CHANGES
|
||
|
||
NEW FEATURES
|
||
|
||
IMPROVEMENTS
|
||
HADOOP-9217. Print thread dumps when hadoop-common tests fail.
|
||
(Andrey Klochkov via suresh)
|
||
|
||
HADOOP-9247. Parametrize Clover "generateXxx" properties to make them
|
||
re-definable via -D in mvn calls. (Ivan A. Veselovsky via suresh)
|
||
|
||
|
||
OPTIMIZATIONS
|
||
|
||
BUG FIXES
|
||
|
||
HADOOP-9072. Hadoop-Common-0.23-Build Fails to build in Jenkins
|
||
(Robert Parker via tgraves)
|
||
|
||
HADOOP-8992. Enhance unit-test coverage of class HarFileSystem (Ivan A.
|
||
Veselovsky via bobby)
|
||
|
||
HADOOP-9038. unit-tests for AllocatorPerContext.PathIterator (Ivan A.
|
||
Veselovsky via bobby)
|
||
|
||
HADOOP-9105. FsShell -moveFromLocal erroneously fails (daryn via bobby)
|
||
|
||
HADOOP-9097. Maven RAT plugin is not checking all source files (tgraves)
|
||
|
||
HADOOP-9255. relnotes.py missing last jira (tgraves)
|
||
|
||
Release 0.23.5 - 2012-11-28
|
||
|
||
INCOMPATIBLE CHANGES
|
||
|
||
NEW FEATURES
|
||
|
||
IMPROVEMENTS
|
||
|
||
HADOOP-8932. JNI-based user-group mapping modules can be too chatty on
|
||
lookup failures. (Kihwal Lee via suresh)
|
||
|
||
HADOOP-8930. Cumulative code coverage calculation (Andrey Klochkov via
|
||
bobby)
|
||
|
||
OPTIMIZATIONS
|
||
|
||
BUG FIXES
|
||
|
||
HADOOP-8906. paths with multiple globs are unreliable. (Daryn Sharp via
|
||
jlowe)
|
||
|
||
HADOOP-8811. Compile hadoop native library in FreeBSD (Radim Kolar via
|
||
bobby)
|
||
|
||
HADOOP-8962. RawLocalFileSystem.listStatus fails when a child filename
|
||
contains a colon (jlowe via bobby)
|
||
|
||
HADOOP-8986. Server$Call object is never released after it is sent (bobby)
|
||
|
||
HADOOP-9022. Hadoop distcp tool fails to copy file if -m 0 specified
|
||
(Jonathan Eagles via bobby)
|
||
|
||
HADOOP-9025. org.apache.hadoop.tools.TestCopyListing failing (Jonathan
|
||
Eagles via jlowe)
|
||
|
||
Release 0.23.4
|
||
|
||
INCOMPATIBLE CHANGES
|
||
|
||
NEW FEATURES
|
||
|
||
IMPROVEMENTS
|
||
|
||
HADOOP-8822. relnotes.py was deleted post mavenization (bobby)
|
||
|
||
OPTIMIZATIONS
|
||
|
||
BUG FIXES
|
||
|
||
HADOOP-8843. Old trash directories are never deleted on upgrade
|
||
from 1.x (jlowe)
|
||
|
||
HADOOP-8684. Deadlock between WritableComparator and WritableComparable.
|
||
(Jing Zhao via suresh)
|
||
|
||
Release 0.23.3
|
||
|
||
INCOMPATIBLE CHANGES
|
||
|
||
HADOOP-7967. Need generalized multi-token filesystem support (daryn)
|
||
|
||
NEW FEATURES
|
||
|
||
IMPROVEMENTS
|
||
|
||
HADOOP-8108. Move method getHostPortString() from NameNode to NetUtils.
|
||
(Brandon Li via jitendra)
|
||
|
||
HADOOP-8288. Remove references of mapred.child.ulimit etc. since they are
|
||
not being used any more (Ravi Prakash via bobby)
|
||
|
||
HADOOP-8535. Cut hadoop build times in half (Jonathan Eagles via bobby)
|
||
|
||
HADOOP-8525. Provide Improved Traceability for Configuration (bobby)
|
||
|
||
OPTIMIZATIONS
|
||
|
||
BUG FIXES
|
||
|
||
HADOOP-8088. User-group mapping cache incorrectly does negative caching on
|
||
transient failures (Kihwal Lee via bobby)
|
||
|
||
HADOOP-8179. risk of NPE in CopyCommands processArguments() (Daryn Sharp
|
||
via bobby)
|
||
|
||
HADOOP-6963. In FileUtil.getDU(..), neither include the size of directories
|
||
nor follow symbolic links. (Ravi Prakash via szetszwo)
|
||
|
||
HADOOP-8180. Remove hsqldb since it's not needed from pom.xml (Ravi Prakash
|
||
via tgraves)
|
||
|
||
HADOOP-8014. ViewFileSystem does not correctly implement getDefaultBlockSize,
|
||
getDefaultReplication, getContentSummary (John George via bobby)
|
||
|
||
HADOOP-7510. Tokens should use original hostname provided instead of ip
|
||
(Daryn Sharp via bobby)
|
||
|
||
HADOOP-8283. Allow tests to control token service value (Daryn Sharp via
|
||
bobby)
|
||
|
||
HADOOP-8286. Simplify getting a socket address from conf (Daryn Sharp via
|
||
bobby)
|
||
|
||
HADOOP-8227. Allow RPC to limit ephemeral port range. (bobby)
|
||
|
||
HADOOP-8305. distcp over viewfs is broken (John George via bobby)
|
||
|
||
HADOOP-8334. HttpServer sometimes returns incorrect port (Daryn Sharp via
|
||
bobby)
|
||
|
||
HADOOP-8330. Update TestSequenceFile.testCreateUsesFsArg() for HADOOP-8305.
|
||
(John George via szetszwo)
|
||
|
||
HADOOP-8335. Improve Configuration's address handling (Daryn Sharp via
|
||
bobby)
|
||
|
||
HADOOP-8327. distcpv2 and distcpv1 jars should not coexist (Dave Thompson
|
||
via bobby)
|
||
|
||
HADOOP-8341. Fix or filter findbugs issues in hadoop-tools (bobby)
|
||
|
||
HADOOP-8373. Port RPC.getServerAddress to 0.23 (Daryn Sharp via bobby)
|
||
|
||
HADOOP-8495. Update Netty to avoid leaking file descriptors during shuffle
|
||
(Jason Lowe via tgraves)
|
||
|
||
HADOOP-8129. ViewFileSystemTestSetup setupForViewFileSystem is erring
|
||
(Ahmed Radwan and Ravi Prakash via bobby)
|
||
|
||
HADOOP-8573. Configuration tries to read from an inputstream resource
|
||
multiple times (Robert Evans via tgraves)
|
||
|
||
HADOOP-8599. Non empty response from FileSystem.getFileBlockLocations when
|
||
asking for data beyond the end of file. (Andrey Klochkov via todd)
|
||
|
||
HADOOP-8606. FileSystem.get may return the wrong filesystem (Daryn Sharp
|
||
via bobby)
|
||
|
||
HADOOP-8551. fs -mkdir creates parent directories without the -p option
|
||
(John George via bobby)
|
||
|
||
HADOOP-8613. AbstractDelegationTokenIdentifier#getUser() should set token
|
||
auth type. (daryn)
|
||
|
||
HADOOP-8627. FS deleteOnExit may delete the wrong path (daryn via bobby)
|
||
|
||
HADOOP-8634. Ensure FileSystem#close doesn't squawk for deleteOnExit paths
|
||
(daryn via bobby)
|
||
|
||
HADOOP-8550. hadoop fs -touchz automatically created parent directories
|
||
(John George via bobby)
|
||
|
||
HADOOP-8635. Cannot cancel paths registered deleteOnExit (daryn via bobby)
|
||
|
||
HADOOP-8637. FilterFileSystem#setWriteChecksum is broken (daryn via bobby)
|
||
|
||
HADOOP-8370. Native build failure: javah: class file for
|
||
org.apache.hadoop.classification.InterfaceAudience not found (Trevor
|
||
Robinson via tgraves)
|
||
|
||
HADOOP-8633. Interrupted FsShell copies may leave tmp files (Daryn Sharp
|
||
via tgraves)
|
||
|
||
HADOOP-8703. distcpV2: turn CRC checking off for 0 byte size (Dave
|
||
Thompson via bobby)
|
||
|
||
HADOOP-8390. TestFileSystemCanonicalization fails with JDK7 (Trevor
|
||
Robinson via tgraves)
|
||
|
||
HADOOP-8692. TestLocalDirAllocator fails intermittently with JDK7
|
||
(Trevor Robinson via tgraves)
|
||
|
||
HADOOP-8693. TestSecurityUtil fails intermittently with JDK7 (Trevor
|
||
Robinson via tgraves)
|
||
|
||
HADOOP-8697. TestWritableName fails intermittently with JDK7 (Trevor
|
||
Robinson via tgraves)
|
||
|
||
HADOOP-8695. TestPathData fails intermittently with JDK7 (Trevor
|
||
Robinson via tgraves)
|
||
|
||
HADOOP-8611. Allow fall-back to the shell-based implementation when
|
||
JNI-based users-group mapping fails (Robert Parker via bobby)
|
||
|
||
HADOOP-8225. DistCp fails when invoked by Oozie (daryn via bobby)
|
||
|
||
HADOOP-8709. globStatus changed behavior from 0.20/1.x (Jason Lowe via
|
||
bobby)
|
||
|
||
HADOOP-8725. MR is broken when security is off (daryn via bobby)
|
||
|
||
HADOOP-8726. The Secrets in Credentials are not available to MR tasks
|
||
(daryn and Benoy Antony via bobby)
|
||
|
||
HADOOP-8727. Gracefully deprecate dfs.umaskmode in 2.x onwards (Harsh J
|
||
via bobby)
|
||
|
||
Release 0.23.2 - UNRELEASED
|
||
|
||
INCOMPATIBLE CHANGES
|
||
|
||
NEW FEATURES
|
||
|
||
IMPROVEMENTS
|
||
|
||
HADOOP-8048. Allow merging of Credentials (Daryn Sharp via tgraves)
|
||
|
||
HADOOP-8032. mvn site:stage-deploy should be able to use the scp protocol
|
||
to stage documents (Ravi Prakash via tgraves)
|
||
|
||
HADOOP-7923. Automate the updating of version numbers in the doc system.
|
||
(szetszwo)
|
||
|
||
HADOOP-8137. Added links to CLI manuals to the site. (tgraves via
|
||
acmurthy)
|
||
|
||
OPTIMIZATIONS
|
||
HADOOP-8071. Avoid an extra packet in client code when nagling is
|
||
disabled. (todd)
|
||
|
||
HADOOP-6502. Improve the performance of Configuration.getClassByName when
|
||
the class is not found by caching negative results.
|
||
(sharad, todd via todd)
|
||
|
||
BUG FIXES
|
||
|
||
HADOOP-7660. Maven generated .classpath does not include
|
||
"target/generated-test-source/java" as source directory.
|
||
(Laxman via bobby)
|
||
|
||
HADOOP-8042. When copying a file out of HDFS, modifying it, and uploading
|
||
it back into HDFS, the put fails due to a CRC mismatch
|
||
(Daryn Sharp via bobby)
|
||
|
||
HADOOP-8035. Hadoop Maven site is inefficient and runs phases redundantly
|
||
(abayer via tucu)
|
||
|
||
HADOOP-8051. HttpFS documentation is not wired to the generated site (tucu)
|
||
|
||
HADOOP-8055. Hadoop tarball distribution lacks a core-site.xml (harsh)
|
||
|
||
HADOOP-8052. Hadoop Metrics2 should emit Float.MAX_VALUE (instead of
|
||
Double.MAX_VALUE) to avoid making Ganglia's gmetad core. (Varun Kapoor
|
||
via mattf)
|
||
|
||
HADOOP-8074. Small bug in hadoop error message for unknown commands.
|
||
(Colin Patrick McCabe via eli)
|
||
|
||
HADOOP-8082. add hadoop-client and hadoop-minicluster to the
|
||
dependency-management section. (tucu)
|
||
|
||
HADOOP-8066. The full docs build intermittently fails (abayer via tucu)
|
||
|
||
HADOOP-8083. javadoc generation for some modules is not done under target/ (tucu)
|
||
|
||
HADOOP-8036. TestViewFsTrash assumes the user's home directory is
|
||
2 levels deep. (Colin Patrick McCabe via eli)
|
||
|
||
HADOOP-8046. Revert StaticMapping semantics to the existing ones, add DNS
|
||
mapping diagnostics in progress (stevel)
|
||
|
||
HADOOP-8057. hadoop-setup-conf.sh not working because of some extra spaces.
|
||
(Vinayakumar B via stevel)
|
||
|
||
HADOOP-7680. TestHardLink fails on Mac OS X, when gnu stat is in path.
|
||
(Milind Bhandarkar via stevel)
|
||
|
||
HADOOP-8050. Deadlock in metrics. (Kihwal Lee via mattf)
|
||
|
||
HADOOP-8131. FsShell put doesn't correctly handle a non-existent dir
|
||
(Daryn Sharp via bobby)
|
||
|
||
HADOOP-8123. Use java.home rather than env.JAVA_HOME for java in the
|
||
project. (Jonathan Eagles via acmurthy)
|
||
|
||
HADOOP-8064. Remove unnecessary dependency on w3c.org in document processing
|
||
(Kihwal Lee via bobby)
|
||
|
||
HADOOP-8140. dfs -getmerge should process its arguments better (Daryn Sharp
|
||
via bobby)
|
||
|
||
HADOOP-8164. Back slash as path separator is handled for Windows only.
|
||
(Daryn Sharp via suresh)
|
||
|
||
HADOOP-8173. FsShell needs to handle quoted metachars. (Daryn Sharp via
|
||
szetszwo)
|
||
|
||
HADOOP-8175. Add -p option to mkdir in FsShell. (Daryn Sharp via szetszwo)
|
||
|
||
HADOOP-8176. Disambiguate the destination of FsShell copies (Daryn Sharp
|
||
via bobby)
|
||
|
||
HADOOP-8208. Disallow self failover. (eli)
|
||
|
||
Release 0.23.1 - 2012-02-17
|
||
|
||
INCOMPATIBLE CHANGES
|
||
|
||
NEW FEATURES
|
||
|
||
HADOOP-7777. Implement a base class for DNSToSwitchMapping implementations
|
||
that can offer extra topology information. (stevel)
|
||
|
||
HADOOP-7657. Add support for LZ4 compression. (Binglin Chang via todd)
|
||
|
||
HADOOP-7910. Add Configuration.getLongBytes to handle human readable byte size values. (Sho Shimauchi via harsh)
|
||
|
||
|
||
IMPROVEMENTS
|
||
|
||
HADOOP-7801. HADOOP_PREFIX cannot be overridden. (Bruno Mahé via tomwhite)
|
||
|
||
HADOOP-7802. Hadoop scripts unconditionally source
|
||
"$bin"/../libexec/hadoop-config.sh. (Bruno Mahé via tomwhite)
|
||
|
||
HADOOP-7858. Drop some info logging to DEBUG level in IPC,
|
||
metrics, and HTTP. (todd via eli)
|
||
|
||
HADOOP-7424. Log an error if the topology script doesn't handle multiple args.
|
||
(Uma Maheswara Rao G via eli)
|
||
|
||
HADOOP-7804. Enable hadoop config generator to set configurations to enable
|
||
short circuit read. (Arpit Gupta via jitendra)
|
||
|
||
HADOOP-7877. Update balancer CLI usage documentation to include the new
|
||
-policy option. (szetszwo)
|
||
|
||
HADOOP-6840. Support non-recursive create() in FileSystem and
|
||
SequenceFile.Writer. (jitendra and eli via eli)
|
||
|
||
HADOOP-6886. LocalFileSystem Needs createNonRecursive API.
|
||
(Nicolas Spiegelberg and eli via eli)
|
||
|
||
HADOOP-7912. test-patch should run eclipse:eclipse to verify that it does
|
||
not break again. (Robert Joseph Evans via tomwhite)
|
||
|
||
HADOOP-7890. Redirect hadoop script's deprecation message to stderr.
|
||
(Koji Noguchi via mahadev)
|
||
|
||
HADOOP-7504. Add the missing Ganglia31 opts to hadoop-metrics.properties as a comment. (harsh)
|
||
|
||
HADOOP-7933. Add a getDelegationTokens api to FileSystem which checks
|
||
for known tokens in the passed Credentials object. (sseth)
|
||
|
||
HADOOP-7737. normalize hadoop-mapreduce & hadoop-dist dist/tar build with
|
||
common/hdfs. (tucu)
|
||
|
||
HADOOP-7743. Add Maven profile to create a full source tarball. (tucu)
|
||
|
||
HADOOP-7758. Make GlobFilter class public. (tucu)
|
||
|
||
HADOOP-7590. Mavenize streaming and MR examples. (tucu)
|
||
|
||
HADOOP-7934. Normalize dependencies versions across all modules. (tucu)
|
||
|
||
HADOOP-7348. Change 'addnl' in getmerge util to be a flag '-nl' instead.
|
||
(XieXianshan via harsh)
|
||
|
||
HADOOP-7975. Add LZ4 as an entry in the default codec list, missed by HADOOP-7657 (harsh)
|
||
|
||
HADOOP-4515. Configuration#getBoolean must not be case sensitive. (Sho Shimauchi via harsh)
|
||
|
||
HADOOP-6490. Use StringUtils over String#replace in Path#normalizePath.
|
||
(Uma Maheswara Rao G via harsh)
|
||
|
||
HADOOP-7574. Improve FSShell -stat, add user/group elements.
|
||
(XieXianshan via harsh)
|
||
|
||
HADOOP-7736. Remove duplicate Path#normalizePath call. (harsh)
|
||
|
||
HADOOP-7919. Remove the unused hadoop.logfile.* properties from the
|
||
core-default.xml file. (harsh)
|
||
|
||
HADOOP-7939. Improve Hadoop subcomponent integration in Hadoop 0.23. (rvs via tucu)
|
||
|
||
HADOOP-8002. SecurityUtil acquired token message should be a debug rather than info.
|
||
(Arpit Gupta via mahadev)
|
||
|
||
HADOOP-8009. Create hadoop-client and hadoop-minicluster artifacts for downstream
|
||
projects. (tucu)
|
||
|
||
HADOOP-7470. Move up to Jackson 1.8.8. (Enis Soztutar via szetszwo)
|
||
|
||
HADOOP-8027. Visiting /jmx on the daemon web interfaces may print
|
||
unnecessary error in logs. (atm)
|
||
|
||
HADOOP-7792. Add verifyToken method to AbstractDelegationTokenSecretManager.
|
||
(jitendra)
|
||
|
||
OPTIMIZATIONS
|
||
|
||
BUG FIXES
|
||
|
||
HADOOP-7811. TestUserGroupInformation#testGetServerSideGroups test fails in chroot.
|
||
(Jonathan Eagles via mahadev)
|
||
|
||
HADOOP-7813. Fix test-patch to use proper numerical comparison when checking
|
||
javadoc and findbugs warning counts. (Jonathan Eagles via tlipcon)
|
||
|
||
HADOOP-7841. Run tests with non-secure random. (tlipcon)
|
||
|
||
HADOOP-7851. Configuration.getClasses() never returns the default value.
|
||
(Uma Maheswara Rao G via amarrk)
|
||
|
||
HADOOP-7787. Make source tarball use conventional name.
|
||
(Bruno Mahé via tomwhite)
|
||
|
||
HADOOP-6614. RunJar should provide more diags when it can't create
|
||
a temp file. (Jonathan Hsieh via eli)
|
||
|
||
HADOOP-7859. TestViewFsHdfs.testgetFileLinkStatus is failing an assert. (eli)
|
||
|
||
HADOOP-7864. Building mvn site with Maven < 3.0.2 causes OOM errors.
|
||
(Andrew Bayer via eli)
|
||
|
||
HADOOP-7854. UGI getCurrentUser is not synchronized. (Daryn Sharp via jitendra)
|
||
|
||
HADOOP-7870. fix SequenceFile#createWriter with boolean
|
||
createParent arg to respect createParent. (Jon Hsieh via eli)
|
||
|
||
HADOOP-7898. Fix javadoc warnings in AuthenticationToken.java. (suresh)
|
||
|
||
HADOOP-7878. Regression: HADOOP-7777 switch changes break HDFS tests when the
|
||
isSingleSwitch() predicate is used. (stevel)
|
||
|
||
HADOOP-7914. Remove the duplicated declaration of hadoop-hdfs test-jar in
|
||
hadoop-project/pom.xml. (szetszwo)
|
||
|
||
HADOOP-7837. no NullAppender in the log4j config. (eli)
|
||
|
||
HADOOP-7948. Shell scripts created by hadoop-dist/pom.xml to build tar do not
|
||
properly propagate failure. (cim_michajlomatijkiw via tucu)
|
||
|
||
HADOOP-7949. Updated maxIdleTime default in the code to match
|
||
core-default.xml (eli)
|
||
|
||
HADOOP-7853. multiple javax security configurations cause conflicts.
|
||
(daryn via tucu)
|
||
|
||
HDFS-2614. hadoop dist tarball is missing hdfs headers. (tucu)
|
||
|
||
HADOOP-7874. native libs should be under lib/native/ dir. (tucu)
|
||
|
||
HADOOP-7887. KerberosAuthenticatorHandler is not setting
|
||
KerberosName name rules from configuration. (tucu)
|
||
|
||
HADOOP-7902. skipping name rules setting (if already set) should be done
|
||
on UGI initialization only. (tucu)
|
||
|
||
HADOOP-7810. move hadoop archive to core from tools. (tucu)
|
||
|
||
HADOOP-7917. compilation of protobuf files fails in windows/cygwin. (tucu)
|
||
|
||
HADOOP-7907. hadoop-tools JARs are not part of the distro. (tucu)
|
||
|
||
HADOOP-7936. There's a Hoop README in the root dir of the tarball. (tucu)
|
||
|
||
HADOOP-7963. Fix ViewFS to catch a null canonical service-name and pass
|
||
tests TestViewFileSystem* (Siddharth Seth via vinodkv)
|
||
|
||
HADOOP-7964. Deadlock in NetUtils and SecurityUtil class initialization.
|
||
(Daryn Sharp via suresh)
|
||
|
||
HADOOP-7974. TestViewFsTrash incorrectly determines the user's home
|
||
directory. (harsh via eli)
|
||
|
||
HADOOP-7971. Adding back job/pipes/queue commands to bin/hadoop for
|
||
backward compatibility. (Prashath Sharma via acmurthy)
|
||
|
||
HADOOP-7982. UserGroupInformation fails to login if thread's context
|
||
classloader can't load HadoopLoginModule. (todd)
|
||
|
||
HADOOP-7986. Adding config for MapReduce History Server protocol in
|
||
hadoop-policy.xml for service level authorization. (Mahadev Konar via vinodkv)
|
||
|
||
HADOOP-7981. Improve documentation for org.apache.hadoop.io.compress.
|
||
Decompressor.getRemaining (Jonathan Eagles via mahadev)
|
||
|
||
HADOOP-7997. SequenceFile.createWriter(...createParent...) no
|
||
longer works on existing file. (Gregory Chanan via eli)
|
||
|
||
HADOOP-7993. Hadoop ignores old-style config options for enabling compressed
|
||
output. (Anupam Seth via mahadev)
|
||
|
||
HADOOP-8000. fetchdt command not available in bin/hadoop.
|
||
(Arpit Gupta via mahadev)
|
||
|
||
HADOOP-7999. "hadoop archive" fails with ClassNotFoundException.
|
||
(Jason Lowe via mahadev)
|
||
|
||
HADOOP-8012. hadoop-daemon.sh and yarn-daemon.sh are trying to mkdir
|
||
and chown log/pid dirs which can fail. (Roman Shaposhnik via eli)
|
||
|
||
HADOOP-8013. ViewFileSystem does not honor setVerifyChecksum
|
||
(Daryn Sharp via bobby)
|
||
|
||
HADOOP-8054. NPE with FilterFileSystem (Daryn Sharp via bobby)
|
||
|
||
Release 0.23.0 - 2011-11-01
|
||
|
||
INCOMPATIBLE CHANGES
|
||
|
||
HADOOP-6904. Support method based RPC compatibility. (hairong)
|
||
|
||
HADOOP-6432. Add Statistics support in FileContext. (jitendra)
|
||
|
||
HADOOP-7136. Remove failmon contrib component. (nigel)
|
||
|
||
NEW FEATURES
|
||
|
||
HADOOP-7324. Ganglia plugins for metrics v2. (Priyo Mustafi via llu)
|
||
|
||
HADOOP-7342. Add a utility API in FileUtil for JDK File.list
|
||
to avoid NPEs on File.list() (Bharath Mundlapudi via mattf)
|
||
|
||
HADOOP-7322. Adding a util method in FileUtil for directory listing,
|
||
avoid NPEs on File.listFiles() (Bharath Mundlapudi via mattf)
|
||
|
||
HADOOP-7023. Add listCorruptFileBlocks to FileSystem. (Patrick Kling
|
||
via hairong)
|
||
|
||
HADOOP-7096. Allow setting of end-of-record delimiter for TextInputFormat
|
||
(Ahmed Radwan via todd)
|
||
|
||
HADOOP-6994. Api to get delegation token in AbstractFileSystem. (jitendra)
|
||
|
||
HADOOP-7171. Support UGI in FileContext API. (jitendra)
|
||
|
||
HADOOP-7257. Client side mount tables (sanjay)
|
||
|
||
HADOOP-6919. New metrics2 framework. (Luke Lu via acmurthy)
|
||
|
||
HADOOP-6920. Metrics instrumentation to move new metrics2 framework.
|
||
(Luke Lu via suresh)
|
||
|
||
HADOOP-7214. Add Common functionality necessary to provide an equivalent
|
||
of /usr/bin/groups for Hadoop. (Aaron T. Myers via todd)
|
||
|
||
HADOOP-6832. Add an authentication plugin using a configurable static user
|
||
for the web UI. (Owen O'Malley and Todd Lipcon via cdouglas)
|
||
|
||
HADOOP-7144. Expose JMX metrics via JSON servlet. (Robert Joseph Evans via
|
||
cdouglas)
|
||
|
||
HADOOP-7379. Add the ability to serialize and deserialize protocol buffers
|
||
in ObjectWritable. (todd)
|
||
|
||
HADOOP-7206. Support Snappy compression. (Issei Yoshida and
|
||
Alejandro Abdelnur via eli)
|
||
|
||
HADOOP-7329. Add the capability of getting individual attribute of an mbean
|
||
using JMXProxyServlet. (tanping)
|
||
|
||
HADOOP-7380. Add client failover functionality to o.a.h.io.(ipc|retry).
|
||
(atm via eli)
|
||
|
||
HADOOP-7460. Support pluggable trash policies. (Usman Masood via suresh)
|
||
|
||
HADOOP-6385. dfs should support -rmdir (was HDFS-639). (Daryn Sharp
|
||
via mattf)
|
||
|
||
HADOOP-7119. add Kerberos HTTP SPNEGO authentication support to Hadoop
|
||
JT/NN/DN/TT web-consoles. (Alejandro Abdelnur via atm)
|
||
|
||
IMPROVEMENTS
|
||
|
||
HADOOP-7655. Provide a small validation script that smoke tests the installed
|
||
cluster. (Arpit Gupta via mattf)
|
||
|
||
HADOOP-7042. Updates to test-patch.sh to include failed test names and
|
||
improve other messaging. (nigel)
|
||
|
||
HADOOP-7001. Configuration changes can occur via the Reconfigurable
|
||
interface. (Patrick Kling via dhruba)
|
||
|
||
HADOOP-6764. Add number of reader threads and queue length as
|
||
configuration parameters in RPC.getServer. (Dmytro Molkov via hairong)
|
||
|
||
HADOOP-7049. TestReconfiguration should be junit v4.
|
||
(Patrick Kling via eli)
|
||
|
||
HADOOP-7054. Change NN LoadGenerator to use FileContext APIs
|
||
(Sanjay Radia)
|
||
|
||
HADOOP-7060. A more elegant FileSystem#listCorruptFileBlocks API.
|
||
(Patrick Kling via hairong)
|
||
|
||
HADOOP-7058. Expose number of bytes in FSOutputSummer buffer to
|
||
implementations. (Todd Lipcon via hairong)
|
||
|
||
HADOOP-7061. Imprecise javadoc for CompressionCodec. (Jingguo Yao via eli)
|
||
|
||
HADOOP-7059. Remove "unused" warning in native code. (Noah Watkins via eli)
|
||
|
||
HADOOP-6864. Provide a JNI-based implementation of
|
||
ShellBasedUnixGroupsNetgroupMapping
|
||
(implementation of GroupMappingServiceProvider) (Erik Steffl via boryas)
|
||
|
||
HADOOP-7078. Improve javadocs for RawComparator interface.
|
||
(Harsh J Chouraria via todd)
|
||
|
||
HADOOP-6995. Allow wildcards to be used in ProxyUsers configurations.
|
||
(todd)
|
||
|
||
HADOOP-6376. Add a comment header to conf/slaves that specifies the file
|
||
format. (Kay Kay via todd)
|
||
|
||
HADOOP-7151. Document need for stable hashCode() in WritableComparable.
|
||
(Dmitriy V. Ryaboy via todd)
|
||
|
||
HADOOP-7112. Issue a warning when GenericOptionsParser libjars are not on
|
||
local filesystem. (tomwhite)
|
||
|
||
HADOOP-7114. FsShell should dump all exceptions at DEBUG level.
|
||
(todd via tomwhite)
|
||
|
||
HADOOP-7159. RPC server should log the client hostname when read exception
|
||
happened. (Scott Chen via todd)
|
||
|
||
HADOOP-7167. Allow using a file to exclude certain tests from build. (todd)
|
||
|
||
HADOOP-7133. Batch the calls in DataStorage to FileUtil.createHardLink().
|
||
(Matt Foley via jghoman)
|
||
|
||
HADOOP-7166. Add DaemonFactory to common. (Erik Steffl & jitendra)
|
||
|
||
HADOOP-7175. Add isEnabled() to Trash. (Daryn Sharp via szetszwo)
|
||
|
||
HADOOP-7180. Better support on CommandFormat on the API and exceptions.
|
||
(Daryn Sharp via szetszwo)
|
||
|
||
HADOOP-7202. Improve shell Command base class. (Daryn Sharp via szetszwo)
|
||
|
||
HADOOP-7224. Add CommandFactory to shell. (Daryn Sharp via szetszwo)
|
||
|
||
HADOOP-7014. Generalize CLITest structure and interfaces to facilitate
|
||
upstream adoption (e.g. for web testing). (cos)
|
||
|
||
HADOOP-7230. Move "fs -help" shell command tests from HDFS to COMMON; see
|
||
also HDFS-1844. (Daryn Sharp via szetszwo)
|
||
|
||
HADOOP-7233. Refactor ls to conform to new FsCommand class. (Daryn Sharp
|
||
via szetszwo)
|
||
|
||
HADOOP-7235. Refactor the tail command to conform to new FsCommand class.
|
||
(Daryn Sharp via szetszwo)
|
||
|
||
HADOOP-7179. Federation: Improve HDFS startup scripts. (Erik Steffl
|
||
and Tanping Wang via suresh)
|
||
|
||
HADOOP-7227. Remove protocol version check at proxy creation in Hadoop
|
||
RPC. (jitendra)
|
||
|
||
HADOOP-7236. Refactor the mkdir command to conform to new FsCommand class.
|
||
(Daryn Sharp via szetszwo)
|
||
|
||
HADOOP-7250. Refactor the setrep command to conform to new FsCommand class.
|
||
(Daryn Sharp via szetszwo)
|
||
|
||
HADOOP-7249. Refactor the chmod/chown/chgrp command to conform to new
|
||
FsCommand class. (Daryn Sharp via szetszwo)
|
||
|
||
HADOOP-7251. Refactor the getmerge command to conform to new FsCommand
|
||
class. (Daryn Sharp via szetszwo)
|
||
|
||
HADOOP-7265. Keep track of relative paths in PathData. (Daryn Sharp
|
||
via szetszwo)
|
||
|
||
HADOOP-7238. Refactor the cat and text commands to conform to new FsCommand
|
||
class. (Daryn Sharp via szetszwo)
|
||
|
||
HADOOP-7271. Standardize shell command error messages. (Daryn Sharp
|
||
via szetszwo)
|
||
|
||
HADOOP-7272. Remove unnecessary security related info logs. (suresh)
|
||
|
||
HADOOP-7275. Refactor the stat command to conform to new FsCommand
|
||
class. (Daryn Sharp via szetszwo)
|
||
|
||
HADOOP-7237. Refactor the touchz command to conform to new FsCommand
|
||
class. (Daryn Sharp via szetszwo)
|
||
|
||
HADOOP-7267. Refactor the rm/rmr/expunge commands to conform to new
|
||
FsCommand class. (Daryn Sharp via szetszwo)
|
||
|
||
HADOOP-7285. Refactor the test command to conform to new FsCommand
|
||
class. (Daryn Sharp via todd)
|
||
|
||
HADOOP-7289. In ivy.xml, test conf should not extend common conf.
|
||
(Eric Yang via szetszwo)
|
||
|
||
HADOOP-7291. Update Hudson job not to run test-contrib. (Nigel Daley via eli)
|
||
|
||
HADOOP-7286. Refactor the du/dus/df commands to conform to new FsCommand
|
||
class. (Daryn Sharp via todd)
|
||
|
||
HADOOP-7301. FSDataInputStream should expose a getWrappedStream method.
|
||
(Jonathan Hsieh via eli)
|
||
|
||
HADOOP-7306. Start metrics system even if config files are missing
|
||
(Luke Lu via todd)
|
||
|
||
HADOOP-7302. webinterface.private.actions should be renamed and moved to
|
||
the MapReduce project. (Ari Rabkin via todd)
|
||
|
||
HADOOP-7329. Improve help message for "df" to include "-h" flag.
|
||
(Xie Xianshan via todd)
|
||
|
||
HADOOP-7320. Refactor the copy and move commands to conform to new
|
||
FsCommand class. (Daryn Sharp via todd)
|
||
|
||
HADOOP-7312. Update value of hadoop.common.configuration.version.
|
||
(Harsh J Chouraria via todd)
|
||
|
||
HADOOP-7337. Change PureJavaCrc32 annotations to public stable. (szetszwo)
|
||
|
||
HADOOP-7331. Make hadoop-daemon.sh return exit code 1 if daemon processes
|
||
did not get started. (Tanping Wang via todd)
|
||
|
||
HADOOP-7316. Add public javadocs to FSDataInputStream and
|
||
FSDataOutputStream. (eli)
|
||
|
||
HADOOP-7323. Add capability to resolve compression codec based on codec
|
||
name. (Alejandro Abdelnur via tomwhite)
|
||
|
||
HADOOP-1886. Undocumented parameters in FileSystem. (Frank Conrad via eli)
|
||
|
||
HADOOP-7375. Add resolvePath method to FileContext. (Sanjay Radia via eli)
|
||
|
||
HADOOP-7383. HDFS needs to export protobuf library dependency in pom.
|
||
(todd via eli)
|
||
|
||
HADOOP-7374. Don't add tools.jar to the classpath when running Hadoop.
|
||
(eli)
|
||
|
||
HADOOP-7106. Reorganize project SVN layout to "unsplit" the projects.
|
||
(todd, nigel)
|
||
|
||
HADOOP-6605. Add JAVA_HOME detection to hadoop-config. (eli)
|
||
|
||
HADOOP-7384. Allow test-patch to be more flexible about patch format. (todd)
|
||
|
||
HADOOP-6929. RPC should have a way to pass Security information other than
|
||
protocol annotations. (sharad and omalley via mahadev)
|
||
|
||
HADOOP-7385. Remove StringUtils.stringifyException(ie) in logger functions.
|
||
(Bharath Mundlapudi via Tanping Wang).
|
||
|
||
HADOOP-310. Additional constructor requested in BytesWritable. (Brock
|
||
Noland via atm)
|
||
|
||
HADOOP-7429. Add another IOUtils#copyBytes method. (eli)
|
||
|
||
HADOOP-7451. Generalize StringUtils#join. (Chris Douglas via mattf)
|
||
|
||
HADOOP-7449. Add Data(In,Out)putByteBuffer to work with ByteBuffer similar
|
||
to Data(In,Out)putBuffer for byte[]. Merge from yahoo-merge branch,
|
||
-r 1079163. Fix missing Apache license headers. (Chris Douglas via mattf)
|
||
|
||
HADOOP-7361. Provide an option, -overwrite/-f, in put and copyFromLocal
|
||
shell commands. (Uma Maheswara Rao G via szetszwo)
|
||
|
||
HADOOP-7430. Improve error message when moving to trash fails due to
|
||
quota issue. (Ravi Prakash via mattf)
|
||
|
||
HADOOP-7444. Add Checksum API to verify and calculate checksums "in bulk"
|
||
(todd)
|
||
|
||
HADOOP-7443. Add CRC32C as another DataChecksum implementation (todd)
|
||
|
||
HADOOP-7305. Eclipse project files are incomplete. (Niels Basjes via eli)
|
||
|
||
HADOOP-7314. Add support for throwing UnknownHostException when a host doesn't
|
||
resolve. (Jeffrey Naisbitt via jitendra)
|
||
|
||
HADOOP-7465. Several tiny improvements for the LOG format.
|
||
(Xie Xianshan via eli)
|
||
|
||
HADOOP-7434. Display error when using "daemonlog -setlevel" with
|
||
illegal level. (yanjinshuang via eli)
|
||
|
||
HADOOP-7463. Adding a configuration parameter to SecurityInfo interface.
|
||
(mahadev)
|
||
|
||
HADOOP-7298. Add test utility for writing multi-threaded tests. (todd and
|
||
Harsh J Chouraria via todd)
|
||
|
||
HADOOP-7485. Add -h option to ls to list file sizes in human readable
|
||
format. (XieXianshan via suresh)
|
||
|
||
HADOOP-7378. Add -d option to ls to not expand directories.
|
||
(Daryn Sharp via suresh)
|
||
|
||
HADOOP-7474. Refactor ClientCache out of WritableRpcEngine. (jitendra)
|
||
|
||
HADOOP-7491. hadoop command should respect HADOOP_OPTS when given
|
||
a class name. (eli)
|
||
|
||
HADOOP-7178. Add a parameter, useRawLocalFileSystem, to copyToLocalFile(..)
|
||
in FileSystem. (Uma Maheswara Rao G via szetszwo)
|
||
|
||
HADOOP-6671. Use maven for hadoop common builds. (Alejandro Abdelnur
|
||
via tomwhite)
|
||
|
||
HADOOP-7502. Make generated sources IDE friendly.
|
||
(Alejandro Abdelnur via llu)
|
||
|
||
HADOOP-7501. Publish Hadoop Common artifacts (post HADOOP-6671) to Apache
|
||
SNAPSHOTs repo. (Alejandro Abdelnur via tomwhite)
|
||
|
||
HADOOP-7525. Make arguments to test-patch optional. (tomwhite)
|
||
|
||
HADOOP-7472. RPC client should deal with IP address change.
|
||
(Kihwal Lee via suresh)
|
||
|
||
HADOOP-7499. Add method for doing a sanity check on hostnames in NetUtils.
|
||
(Jeffrey Naisbitt via mahadev)
|
||
|
||
HADOOP-6158. Move CyclicIteration to HDFS. (eli)
|
||
|
||
HADOOP-7526. Add TestPath tests for URI conversion and reserved
|
||
characters. (eli)
|
||
|
||
HADOOP-7531. Add servlet util methods for handling paths in requests. (eli)
|
||
|
||
HADOOP-7493. Add ShortWritable. (Uma Maheswara Rao G via szetszwo)
|
||
|
||
HADOOP-7555. Add eclipse-generated files to .gitignore. (atm)
|
||
|
||
HADOOP-7264. Bump avro version to at least 1.4.1. (Alejandro Abdelnur via
|
||
tomwhite)
|
||
|
||
HADOOP-7498. Remove legacy TAR layout creation. (Alejandro Abdelnur via
|
||
tomwhite)
|
||
|
||
HADOOP-7496. Break Maven TAR & bintar profiles into just LAYOUT & TAR proper.
|
||
(Alejandro Abdelnur via tomwhite)
|
||
|
||
HADOOP-7561. Make test-patch only run tests for changed modules. (tomwhite)
|
||
|
||
HADOOP-7547. Add generic type in WritableComparable subclasses.
|
||
(Uma Maheswara Rao G via szetszwo)
|
||
|
||
HADOOP-7579. Rename package names from alfredo to auth.
|
||
(Alejandro Abdelnur via szetszwo)
|
||
|
||
HADOOP-7594. Support HTTP REST in HttpServer. (szetszwo)
|
||
|
||
HADOOP-7552. FileUtil#fullyDelete doesn't throw IOE but lists it
|
||
in the throws clause. (eli)
|
||
|
||
HADOOP-7580. Add a version of getLocalPathForWrite to LocalDirAllocator
|
||
which doesn't create dirs. (Chris Douglas & Siddharth Seth via acmurthy)
|
||
|
||
HADOOP-7507. Allow ganglia metrics to include the metrics system tags
|
||
in the gmetric names. (Alejandro Abdelnur via todd)
|
||
|
||
HADOOP-7612. Change test-patch to run tests for all nested modules.
|
||
(tomwhite)
|
||
|
||
HADOOP-7599. Script improvements to setup a secure Hadoop cluster
|
||
(Eric Yang via ddas)
|
||
|
||
HADOOP-7639. Enhance HttpServer to allow passing path-specs for filtering,
|
||
so that servers like Yarn WebApp can filter the paths served by
|
||
their own injected servlets. (Thomas Graves via vinodkv)
|
||
|
||
HADOOP-7575. Enhanced LocalDirAllocator to support fully-qualified
|
||
paths. (Jonathan Eagles via vinodkv)
|
||
|
||
HADOOP-7469. Add a standard handler for socket connection problems which
|
||
improves diagnostics (Uma Maheswara Rao G and stevel via stevel)
|
||
|
||
HADOOP-7710. Added hadoop-setup-application.sh for creating
|
||
application directory (Arpit Gupta via Eric Yang)
|
||
|
||
HADOOP-7707. Added toggle for dfs.support.append, webhdfs and hadoop proxy
|
||
user to setup config script. (Arpit Gupta via Eric Yang)
|
||
|
||
HADOOP-7720. Added parameter for HBase user to setup config script.
|
||
(Arpit Gupta via Eric Yang)
|
||
|
||
HADOOP-7624. Set things up for a top level hadoop-tools module. (tucu)
|
||
|
||
HADOOP-7627. Improve MetricsAsserts to give more understandable output
|
||
on failure. (todd)
|
||
|
||
HADOOP-7642. create hadoop-dist module where TAR stitching would happen.
|
||
(Thomas White via tucu)
|
||
|
||
HADOOP-7709. Running a set of methods in a Single Test Class.
|
||
(Jonathan Eagles via mahadev)
|
||
|
||
HADOOP-7705. Add a log4j back end that can push out JSON data,
|
||
one per line. (stevel)
|
||
|
||
HADOOP-7749. Add a NetUtils createSocketAddr call which provides more
|
||
help in exception messages. (todd)
|
||
|
||
HADOOP-7762. Common side of MR-2736. (eli)
|
||
|
||
HADOOP-7668. Add a NetUtils method that can tell if an InetAddress
|
||
belongs to local host. (suresh)
|
||
|
||
HADOOP-7509. Improve exception message thrown when Authentication is
|
||
required. (Ravi Prakash via suresh)
|
||
|
||
HADOOP-7745. Fix wrong variable name in exception message introduced
|
||
in HADOOP-7509. (Ravi Prakash via suresh)
|
||
|
||
MAPREDUCE-2764. Fix renewal of dfs delegation tokens. (Owen via jitendra)
|
||
|
||
HADOOP-7360. Preserve relative paths that do not contain globs in FsShell.
|
||
(Daryn Sharp and Kihwal Lee via szetszwo)
|
||
|
||
HADOOP-7771. FsShell -copyToLocal, -get, etc. commands throw NPE if the
|
||
destination directory does not exist. (John George and Daryn Sharp
|
||
via szetszwo)
|
||
|
||
HADOOP-7782. Aggregate project javadocs. (tomwhite)
|
||
|
||
HADOOP-7789. Improvements to site navigation. (acmurthy)
|
||
|
||
OPTIMIZATIONS
|
||
|
||
HADOOP-7333. Performance improvement in PureJavaCrc32. (Eric Caspole
|
||
via todd)
|
||
|
||
HADOOP-7445. Implement bulk checksum verification using efficient native
|
||
code. (todd)
|
||
|
||
HADOOP-7753. Support fadvise and sync_file_range in NativeIO. Add
|
||
ReadaheadPool infrastructure for use in HDFS and MR. (todd)
|
||
|
||
HADOOP-7446. Implement CRC32C native code using SSE4.2 instructions.
|
||
(Kihwal Lee and todd via todd)
|
||
|
||
HADOOP-7763. Add top-level navigation to APT docs. (tomwhite)
|
||
|
||
HADOOP-7785. Add equals, hashcode, toString to DataChecksum (todd)
|
||
|
||
BUG FIXES
|
||
|
||
HADOOP-7740. Fixed security audit logger configuration. (Arpit Gupta via Eric Yang)
|
||
|
||
HADOOP-7630. hadoop-metrics2.properties should have a property *.period
|
||
set to a default value for metrics. (Eric Yang via mattf)
|
||
|
||
HADOOP-7327. FileSystem.listStatus() throws NullPointerException instead of
|
||
IOException upon access permission failure. (mattf)
|
||
|
||
HADOOP-7015. RawLocalFileSystem#listStatus does not deal with a directory
|
||
whose entries are changing (e.g. in a multi-thread or multi-process
|
||
environment). (Sanjay Radia via eli)
|
||
|
||
HADOOP-7045. TestDU fails on systems with local file systems with
|
||
extended attributes. (eli)
|
||
|
||
HADOOP-6939. Inconsistent lock ordering in
|
||
AbstractDelegationTokenSecretManager. (Todd Lipcon via tomwhite)
|
||
|
||
HADOOP-7129. Fix typo in method name getProtocolSigature (todd)
|
||
|
||
HADOOP-7048. Wrong description of Block-Compressed SequenceFile Format in
|
||
SequenceFile's javadoc. (Jingguo Yao via tomwhite)
|
||
|
||
HADOOP-7153. MapWritable violates contract of Map interface for equals()
|
||
and hashCode(). (Nicholas Telford via todd)
|
||
|
||
HADOOP-6754. DefaultCodec.createOutputStream() leaks memory.
|
||
(Aaron Kimball via tomwhite)
|
||
|
||
HADOOP-7098. Tasktracker property not set in conf/hadoop-env.sh.
|
||
(Bernd Fondermann via tomwhite)
|
||
|
||
HADOOP-7131. Exceptions thrown by Text methods should include the causing
|
||
exception. (Uma Maheswara Rao G via todd)
|
||
|
||
HADOOP-6912. Guard against NPE when calling UGI.isLoginKeytabBased().
|
||
(Kan Zhang via jitendra)
|
||
|
||
HADOOP-7204. remove local unused fs variable from CmdHandler
|
||
and FsShellPermissions.changePermissions (boryas)
|
||
|
||
HADOOP-7210. Chown command is not working from FSShell
|
||
(Uma Maheswara Rao G via todd)
|
||
|
||
HADOOP-7215. RPC clients must use network interface corresponding to
|
||
the host in the client's kerberos principal key. (suresh)
|
||
|
||
HADOOP-7019. Refactor build targets to enable faster cross project dev
|
||
cycles. (Luke Lu via cos)
|
||
|
||
HADOOP-7216. Add FsCommand.runAll() with deprecated annotation for the
|
||
transition of Command base class improvement. (Daryn Sharp via szetszwo)
|
||
|
||
HADOOP-7207. fs member of FSShell is not really needed (boryas)
|
||
|
||
HADOOP-7223. FileContext createFlag combinations are not clearly defined.
|
||
(suresh)
|
||
|
||
HADOOP-7231. Fix synopsis for -count. (Daryn Sharp via eli)
|
||
|
||
HADOOP-7261. Disable IPV6 for junit tests. (suresh)
|
||
|
||
HADOOP-7268. FileContext.getLocalFSFileContext() behavior needs to be fixed
|
||
w.r.t tokens. (jitendra)
|
||
|
||
HADOOP-7290. Unit test failure in
|
||
TestUserGroupInformation.testGetServerSideGroups. (Trevor Robison via eli)
|
||
|
||
HADOOP-7292. Fix racy test case TestSinkQueue. (Luke Lu via todd)
|
||
|
||
HADOOP-7282. ipc.Server.getRemoteIp() may return null. (John George
|
||
via szetszwo)
|
||
|
||
HADOOP-7208. Fix implementation of equals() and hashCode() in
|
||
StandardSocketFactory. (Uma Maheswara Rao G via todd)
|
||
|
||
HADOOP-7336. TestFileContextResolveAfs will fail with default
|
||
test.build.data property. (jitendra)
|
||
|
||
HADOOP-7284. Trash and shell's rm do not work for viewfs (Sanjay Radia)
|
||
|
||
HADOOP-7341. Fix options parsing in CommandFormat (Daryn Sharp via todd)
|
||
|
||
HADOOP-7353. Cleanup FsShell and prevent masking of RTE stack traces.
|
||
(Daryn Sharp via todd)
|
||
|
||
HADOOP-7356. RPM packages broke bin/hadoop script in developer environment.
|
||
(Eric Yang via todd)
|
||
|
||
HADOOP-7389. Use of TestingGroups by tests causes subsequent tests to fail.
|
||
(atm via tomwhite)
|
||
|
||
HADOOP-7377. Fix command name handling affecting DFSAdmin. (Daryn Sharp
|
||
via mattf)
|
||
|
||
HADOOP-7402. TestConfiguration doesn't clean up after itself. (atm via eli)
|
||
|
||
HADOOP-7428. IPC connection is orphaned with null 'out' member.
|
||
(todd via eli)
|
||
|
||
HADOOP-7437. IOUtils.copyBytes will suppress the stream closure exceptions.
|
||
(Uma Maheswara Rao G via szetszwo)
|
||
|
||
HADOOP-7090. Fix resource leaks in s3.INode, BloomMapFile, WritableUtils
|
||
and CBZip2OutputStream. (Uma Maheswara Rao G via szetszwo)
|
||
|
||
HADOOP-7440. HttpServer.getParameterValues throws NPE for missing
|
||
parameters. (Uma Maheswara Rao G and todd via todd)
|
||
|
||
HADOOP-7442. Docs in core-default.xml still reference deprecated config
|
||
"topology.script.file.name" (atm)
|
||
|
||
HADOOP-7419. new hadoop-config.sh doesn't manage classpath for
|
||
HADOOP_CONF_DIR correctly. (Bing Zheng and todd via todd)
|
||
|
||
HADOOP-7448. merge from yahoo-merge branch (via mattf):
|
||
-r 1079157: Fix content type for /stacks servlet to be
|
||
plain text (Luke Lu)
|
||
-r 1079164: No need to escape plain text (Luke Lu)
|
||
|
||
HADOOP-7471. The saveVersion.sh script sometimes fails to extract SVN URL.
|
||
(Alejandro Abdelnur via eli)
|
||
|
||
HADOOP-2081. Configuration getInt, getLong, and getFloat replace
|
||
invalid numbers with the default value. (Harsh J via eli)
|
||
|
||
HADOOP-7111. Several TFile tests failing when native libraries are
|
||
present. (atm)
|
||
|
||
HADOOP-7438. Fix deprecated warnings from hadoop-daemon.sh script.
|
||
(Ravi Prakash via suresh)
|
||
|
||
HADOOP-7468. hadoop-core JAR contains a log4j.properties file.
|
||
(Jolly Chen)
|
||
|
||
HADOOP-7508. Compiled nativelib is in wrong directory and it is not picked
|
||
up by surefire setup. (Alejandro Abdelnur via tomwhite)
|
||
|
||
HADOOP-7520. Fix to add distribution management info to hadoop-main
|
||
(Alejandro Abdelnur via gkesavan)
|
||
|
||
HADOOP-7515. test-patch reports the wrong number of javadoc warnings.
|
||
(tomwhite)
|
||
|
||
HADOOP-7523. Test org.apache.hadoop.fs.TestFilterFileSystem fails due to
|
||
java.lang.NoSuchMethodException. (John Lee via tomwhite)
|
||
|
||
HADOOP-7528. Maven build fails in Windows. (Alejandro Abdelnur via
|
||
tomwhite)
|
||
|
||
HADOOP-7533. Allow test-patch to be run from any subproject directory.
|
||
(tomwhite)
|
||
|
||
HADOOP-7512. Fix example mistake in WritableComparable javadocs.
|
||
(Harsh J via eli)
|
||
|
||
HADOOP-7357. hadoop.io.compress.TestCodec#main() should exit with
|
||
non-zero exit code if test failed. (Philip Zeyliger via eli)
|
||
|
||
HADOOP-6622. Token should not print the password in toString. (eli)
|
||
|
||
HADOOP-7529. Fix lock cycles in metrics system. (llu)
|
||
|
||
HADOOP-7545. Common -tests JAR should not include properties and configs.
|
||
(todd)
|
||
|
||
HADOOP-7536. Correct the dependency version regressions introduced in
|
||
HADOOP-6671. (Alejandro Abdelnur via tomwhite)
|
||
|
||
HADOOP-7566. MR tests are failing webapps/hdfs not found in CLASSPATH.
|
||
(Alejandro Abdelnur via mahadev)
|
||
|
||
HADOOP-7567. 'mvn eclipse:eclipse' fails for hadoop-alfredo (auth).
|
||
(Alejandro Abdelnur via tomwhite)
|
||
|
||
HADOOP-7563. Setup HADOOP_HDFS_HOME, HADOOP_MAPRED_HOME and classpath
|
||
correction. (Eric Yang via acmurthy)
|
||
|
||
HADOOP-7560. Change src layout to be hierarchical. (Alejandro Abdelnur
|
||
via acmurthy)
|
||
|
||
HADOOP-7576. Fix findbugs warnings and javac warnings in hadoop-auth.
|
||
(szetszwo)
|
||
|
||
HADOOP-7593. Fix AssertionError in TestHttpServer.testMaxThreads().
|
||
(Uma Maheswara Rao G via szetszwo)
|
||
|
||
HADOOP-7598. Fix smart-apply-patch.sh to handle patching from a sub
|
||
directory correctly. (Robert Evans via acmurthy)
|
||
|
||
HADOOP-7328. When a serializer class is missing, return null, not throw
|
||
an NPE. (Harsh J Chouraria via todd)
|
||
|
||
HADOOP-7626. Bugfix for a config generator (Eric Yang via ddas)
|
||
|
||
HADOOP-7629. Allow immutable FsPermission objects to be used as IPC
|
||
parameters. (todd)
|
||
|
||
HADOOP-7608. SnappyCodec check for Hadoop native lib is wrong
|
||
(Alejandro Abdelnur via todd)
|
||
|
||
HADOOP-7637. Fix to include FairScheduler configuration file in
|
||
RPM. (Eric Yang via ddas)
|
||
|
||
HADOOP-7633. Adds log4j.properties to the hadoop-conf dir on
|
||
deploy (Eric Yang via ddas)
|
||
|
||
HADOOP-7631. Fixes a config problem to do with running streaming jobs
|
||
(Eric Yang via ddas)
|
||
|
||
HADOOP-7662. Fixed logs servlet to use the pathspec '/*' instead of '/'
|
||
for correct filtering. (Thomas Graves via vinodkv)
|
||
|
||
HADOOP-7691. Fixed conflict uid for install packages. (Eric Yang)
|
||
|
||
HADOOP-7603. Set hdfs, mapred uid, and hadoop uid to fixed numbers.
|
||
(Eric Yang)
|
||
|
||
HADOOP-7658. Fixed HADOOP_SECURE_DN_USER environment variable in
|
||
hadoop-env.sh (Eric Yang)
|
||
|
||
HADOOP-7684. Added init.d script for jobhistory server and
|
||
secondary namenode. (Eric Yang)
|
||
|
||
HADOOP-7715. Removed unnecessary security logger configuration. (Eric Yang)
|
||
|
||
HADOOP-7685. Improved directory ownership check function in
|
||
hadoop-setup-conf.sh. (Eric Yang)
|
||
|
||
HADOOP-7711. Fixed recursive sourcing of HADOOP_OPTS environment
|
||
variables (Arpit Gupta via Eric Yang)
|
||
|
||
HADOOP-7681. Fixed security and hdfs audit log4j properties
|
||
(Arpit Gupta via Eric Yang)
|
||
|
||
HADOOP-7708. Fixed hadoop-setup-conf.sh to handle config files
|
||
consistently. (Eric Yang)
|
||
|
||
HADOOP-7724. Fixed hadoop-setup-conf.sh to put proxy user in
|
||
core-site.xml. (Arpit Gupta via Eric Yang)
|
||
|
||
HADOOP-7755. Detect MapReduce PreCommit Trunk builds silently failing
|
||
when running test-patch.sh. (Jonathan Eagles via tomwhite)
|
||
|
||
HADOOP-7744. Ensure failed tests exit with proper error code. (Jonathan
|
||
Eagles via acmurthy)
|
||
|
||
HADOOP-7764. Allow HttpServer to set both ACL list and path spec filters.
|
||
(Jonathan Eagles via acmurthy)
|
||
|
||
HADOOP-7766. The auth to local mappings are not being respected, with webhdfs
|
||
and security enabled. (jitendra)
|
||
|
||
HADOOP-7721. Add log before login in KerberosAuthenticationHandler.
|
||
(jitendra)
|
||
|
||
HADOOP-7778. FindBugs warning in Token.getKind(). (tomwhite)
|
||
|
||
HADOOP-7798. Add support for gpg signatures for maven release artifacts.
|
||
(cutting via acmurthy)
|
||
|
||
HADOOP-7797. Fix top-level pom.xml to refer to correct staging maven
|
||
repository. (omalley via acmurthy)
|
||
|
||
HADOOP-7101. UserGroupInformation.getCurrentUser() fails when called from
|
||
non-Hadoop JAAS context. (todd)
|
||
|
||
Release 0.22.1 - Unreleased
|
||
|
||
INCOMPATIBLE CHANGES
|
||
|
||
NEW FEATURES
|
||
|
||
IMPROVEMENTS
|
||
|
||
OPTIMIZATIONS
|
||
|
||
BUG FIXES
|
||
|
||
HADOOP-7937. Forward port SequenceFile#syncFs and friends from Hadoop 1.x.
|
||
(tomwhite)
|
||
|
||
Release 0.22.0 - 2011-11-29
|
||
|
||
INCOMPATIBLE CHANGES
|
||
|
||
HADOOP-7137. Remove hod contrib. (nigel via eli)
|
||
|
||
NEW FEATURES
|
||
|
||
HADOOP-6791. Refresh for proxy superuser config
|
||
(common part for HDFS-1096) (boryas)
|
||
|
||
HADOOP-6581. Add authenticated TokenIdentifiers to UGI so that
|
||
they can be used for authorization (Kan Zhang and Jitendra Pandey
|
||
via jghoman)
|
||
|
||
HADOOP-6584. Provide Kerberized SSL encryption for webservices.
|
||
(jghoman and Kan Zhang via jghoman)
|
||
|
||
HADOOP-6853. Common component of HDFS-1045. (jghoman)
|
||
|
||
HADOOP-6859. Introduce additional statistics to FileSystem to track
|
||
file system operations (suresh)
|
||
|
||
HADOOP-6870. Add a new API getFiles to FileSystem and FileContext that
|
||
lists all files under the input path or the subtree rooted at the
|
||
input path if recursive is true. Block locations are returned together
|
||
with each file's status. (hairong)
|
||
|
||
HADOOP-6888. Add a new FileSystem API closeAllForUGI(..) for closing all
|
||
file systems associated with a particular UGI. (Devaraj Das and Kan Zhang
|
||
via szetszwo)
|
||
|
||
HADOOP-6892. Common component of HDFS-1150 (Verify datanodes' identities
|
||
to clients in secure clusters) (jghoman)
|
||
|
||
HADOOP-6889. Make RPC have an option to time out. (hairong)
|
||
|
||
HADOOP-6996. Allow CodecFactory to return a codec object given a codec's
|
||
class name. (hairong)
|
||
|
||
HADOOP-7013. Add boolean field isCorrupt to BlockLocation.
|
||
(Patrick Kling via hairong)
|
||
|
||
HADOOP-6978. Adds support for NativeIO using JNI.
|
||
(Todd Lipcon, Devaraj Das & Owen O'Malley via ddas)
|
||
|
||
HADOOP-7134. configure files that are generated as part of the released
|
||
tarball need to have executable bit set. (Roman Shaposhnik via cos)
|
||
|
||
IMPROVEMENTS
|
||
|
||
HADOOP-6644. util.Shell getGROUPS_FOR_USER_COMMAND method name
|
||
- should use common naming convention (boryas)
|
||
|
||
HADOOP-6778. add isRunning() method to
|
||
AbstractDelegationTokenSecretManager (for HDFS-1044) (boryas)
|
||
|
||
HADOOP-6633. normalize property names for JT/NN kerberos principal
|
||
names in configuration (boryas)
|
||
|
||
HADOOP-6627. "Bad Connection to FS" message in FSShell should print
|
||
message from the exception (boryas)
|
||
|
||
HADOOP-6600. mechanism for authorization check for inter-server
|
||
protocols. (boryas)
|
||
|
||
HADOOP-6623. Add StringUtils.split for non-escaped single-character
|
||
separator. (Todd Lipcon via tomwhite)
|
||
|
||
HADOOP-6761. The Trash Emptier has the ability to run more frequently.
|
||
(Dmytro Molkov via dhruba)
|
||
|
||
HADOOP-6714. Resolve compressed files using CodecFactory in FsShell::text.
|
||
(Patrick Angeles via cdouglas)
|
||
|
||
HADOOP-6661. User document for UserGroupInformation.doAs.
|
||
(Jitendra Pandey via jghoman)
|
||
|
||
HADOOP-6674. Makes use of the SASL authentication options in the
|
||
SASL RPC. (Jitendra Pandey via ddas)
|
||
|
||
HADOOP-6526. Need mapping from long principal names to local OS
|
||
user names. (boryas)
|
||
|
||
HADOOP-6814. Adds an API in UserGroupInformation to get the real
|
||
authentication method of a passed UGI. (Jitendra Pandey via ddas)
|
||
|
||
HADOOP-6756. Documentation for common configuration keys.
|
||
(Erik Steffl via shv)
|
||
|
||
HADOOP-6835. Add support for concatenated gzip input. (Greg Roelofs via
|
||
cdouglas)
|
||
|
||
HADOOP-6845. Renames the TokenStorage class to Credentials.
|
||
(Jitendra Pandey via ddas)
|
||
|
||
HADOOP-6826. FileStatus needs unit tests. (Rodrigo Schmidt via Eli
|
||
Collins)
|
||
|
||
HADOOP-6905. add buildDTServiceName method to SecurityUtil
|
||
(as part of MAPREDUCE-1718) (boryas)
|
||
|
||
HADOOP-6632. Adds support for using different keytabs for different
|
||
servers in a Hadoop cluster. In the earlier implementation, all servers
|
||
of a certain type (like TaskTracker), would have the same keytab and the
|
||
same principal. Now the principal name is a pattern that has _HOST in it.
|
||
(Kan Zhang & Jitendra Pandey via ddas)
|
||
|
||
HADOOP-6861. Adds new non-static methods in Credentials to read and
|
||
write token storage file. (Jitendra Pandey & Owen O'Malley via ddas)
|
||
|
||
HADOOP-6877. Common part of HDFS-1178 (NameNode servlets should communicate
|
||
with NameNode directly). (Kan Zhang via jghoman)
|
||
|
||
HADOOP-6475. Adding some javadoc to Server.RpcMetrics, UGI.
|
||
(Jitendra Pandey and borya via jghoman)
|
||
|
||
HADOOP-6656. Adds a thread in the UserGroupInformation to renew TGTs
|
||
periodically. (Owen O'Malley and ddas via ddas)
|
||
|
||
HADOOP-6890. Improve listFiles API introduced by HADOOP-6870. (hairong)
|
||
|
||
HADOOP-6862. Adds api to add/remove user and group to AccessControlList
|
||
(amareshwari)
|
||
|
||
HADOOP-6911. doc update for DelegationTokenFetcher (boryas)
|
||
|
||
HADOOP-6900. Make the iterator returned by FileSystem#listLocatedStatus
|
||
throw IOException rather than RuntimeException when there is an IO error
|
||
fetching the next file. (hairong)
|
||
|
||
HADOOP-6905. Better logging messages when a delegation token is invalid.
|
||
(Kan Zhang via jghoman)
|
||
|
||
HADOOP-6693. Add metrics to track Kerberos login activity. (suresh)
|
||
|
||
HADOOP-6803. Add native gzip read/write coverage to TestCodec.
|
||
(Eli Collins via tomwhite)
|
||
|
||
HADOOP-6950. Suggest that HADOOP_CLASSPATH should be preserved in
|
||
hadoop-env.sh.template. (Philip Zeyliger via Eli Collins)
|
||
|
||
HADOOP-6922. Make AccessControlList a writable and update documentation
|
||
for Job ACLs. (Ravi Gummadi via vinodkv)
|
||
|
||
HADOOP-6965. Introduces checks for whether the original tgt is valid
|
||
in the reloginFromKeytab method.
|
||
|
||
HADOOP-6856. Simplify constructors for SequenceFile, and MapFile. (omalley)
|
||
|
||
HADOOP-6987. Use JUnit Rule to optionally fail test cases that run more
|
||
than 10 seconds (jghoman)
|
||
|
||
HADOOP-7005. Update test-patch.sh to remove callback to Hudson. (nigel)
|
||
|
||
HADOOP-6985. Suggest that HADOOP_OPTS be preserved in
|
||
hadoop-env.sh.template. (Ramkumar Vadali via cutting)
|
||
|
||
HADOOP-7007. Update the hudson-test-patch ant target to work with the
|
||
latest test-patch.sh script (gkesavan)
|
||
|
||
HADOOP-7010. Typo in FileSystem.java. (Jingguo Yao via eli)
|
||
|
||
HADOOP-7009. MD5Hash provides a public factory method that creates an
|
||
instance of thread local MessageDigest. (hairong)
|
||
|
||
HADOOP-7008. Enable test-patch.sh to have a configured number of
|
||
acceptable findbugs and javadoc warnings. (nigel and gkesavan)
|
||
|
||
HADOOP-6818. Provides a JNI implementation of group resolution. (ddas)
|
||
|
||
HADOOP-6943. The GroupMappingServiceProvider interface should be public.
|
||
(Aaron T. Myers via tomwhite)
|
||
|
||
HADOOP-4675. Current Ganglia metrics implementation is incompatible with
|
||
Ganglia 3.1. (Brian Bockelman via tomwhite)
|
||
|
||
HADOOP-6977. Herriot daemon clients should vend statistics (cos)
|
||
|
||
HADOOP-7024. Create a test method for adding file systems during tests.
|
||
(Kan Zhang via jghoman)
|
||
|
||
HADOOP-6903. Make AbstractFileSystem methods and some FileContext methods
|
||
to be public. (Sanjay Radia)
|
||
|
||
HADOOP-7034. Add TestPath tests to cover dot, dot dot, and slash
|
||
normalization. (eli)
|
||
|
||
HADOOP-7032. Assert type constraints in the FileStatus constructor. (eli)
|
||
|
||
HADOOP-6562. FileContextSymlinkBaseTest should use FileContextTestHelper.
|
||
(eli)
|
||
|
||
HADOOP-7028. ant eclipse does not include requisite ant.jar in the
|
||
classpath. (Patrick Angeles via eli)
|
||
|
||
HADOOP-6298. Add copyBytes to Text and BytesWritable. (omalley)
|
||
|
||
HADOOP-6578. Configuration should trim whitespace around a lot of value
|
||
types. (Michele Catasta via eli)
|
||
|
||
HADOOP-6811. Remove EC2 bash scripts. They are replaced by Apache Whirr
|
||
(incubating, http://incubator.apache.org/whirr). (tomwhite)
|
||
|
||
HADOOP-7102. Remove "fs.ramfs.impl" field from core-default.xml (shv)
|
||
|
||
HADOOP-7104. Remove unnecessary DNS reverse lookups from RPC layer
|
||
(Kan Zhang via todd)
|
||
|
||
HADOOP-6056. Use java.net.preferIPv4Stack to force IPv4.
|
||
(Michele Catasta via shv)
|
||
|
||
HADOOP-7110. Implement chmod with JNI. (todd)
|
||
|
||
HADOOP-6812. Change documentation for correct placement of configuration
|
||
variables: mapreduce.reduce.input.buffer.percent,
|
||
mapreduce.task.io.sort.factor, mapreduce.task.io.sort.mb
|
||
(Chris Douglas via shv)
|
||
|
||
HADOOP-6436. Remove auto-generated native build files. (rvs via eli)
|
||
|
||
HADOOP-6970. SecurityAuth.audit should be generated under /build. (boryas)
|
||
|
||
HADOOP-7154. Should set MALLOC_ARENA_MAX in hadoop-env.sh (todd)
|
||
|
||
HADOOP-7187. Fix socket leak in GangliaContext. (Uma Maheswara Rao G
|
||
via szetszwo)
|
||
|
||
HADOOP-7241. fix typo of command 'hadoop fs -help tail'.
|
||
(Wei Yongjun via eli)
|
||
|
||
HADOOP-7244. Documentation change for updated configuration keys.
|
||
(tomwhite via eli)
|
||
|
||
HADOOP-7189. Add ability to enable 'debug' property in JAAS configuration.
|
||
(Ted Yu via todd)
|
||
|
||
HADOOP-7192. Update fs -stat docs to reflect the format features. (Harsh
|
||
J Chouraria via todd)
|
||
|
||
HADOOP-7355. Add audience and stability annotations to HttpServer class
|
||
(stack)
|
||
|
||
HADOOP-7346. Send back nicer error message to clients using outdated IPC
|
||
version. (todd)
|
||
|
||
HADOOP-7335. Force entropy to come from non-true random for tests.
|
||
(todd via eli)
|
||
|
||
HADOOP-7325. The hadoop command should not accept class names starting with
|
||
a hyphen. (Brock Noland via todd)
|
||
|
||
HADOOP-7772. javadoc the topology classes (stevel)
|
||
|
||
HADOOP-7786. Remove HDFS-specific config keys defined in FsConfig. (eli)
|
||
|
||
HADOOP-7861. changes2html.pl generates links to HADOOP, HDFS, and MAPREDUCE
|
||
jiras. (shv)
|
||
|
||
OPTIMIZATIONS
|
||
|
||
HADOOP-6884. Add LOG.isDebugEnabled() guard for each LOG.debug(..).
|
||
(Erik Steffl via szetszwo)
|
||
|
||
HADOOP-6683. ZlibCompressor does not fully utilize the buffer.
|
||
(Kang Xiao via eli)
|
||
|
||
HADOOP-6949. Reduce RPC packet size of primitive arrays using
|
||
ArrayPrimitiveWritable instead of ObjectWritable. (Matt Foley via suresh)
|
||
|
||
BUG FIXES
|
||
|
||
HADOOP-6638. try to relogin in a case of failed RPC connection (expired
|
||
tgt) only in case the subject is loginUser or proxyUgi.realUser. (boryas)
|
||
|
||
HADOOP-6781. security audit log shouldn't have exception in it. (boryas)
|
||
|
||
HADOOP-6612. Protocols RefreshUserToGroupMappingsProtocol and
|
||
RefreshAuthorizationPolicyProtocol will fail with security enabled (boryas)
|
||
|
||
HADOOP-6764. Remove verbose logging from the Groups class. (Boris Shkolnik)
|
||
|
||
HADOOP-6730. Bug in FileContext#copy and provide base class for
|
||
FileContext tests. (Ravi Phulari via jghoman)
|
||
|
||
HADOOP-6669. Respect compression configuration when creating DefaultCodec
|
||
instances. (Koji Noguchi via cdouglas)
|
||
|
||
HADOOP-6747. TestNetUtils fails on Mac OS X. (Todd Lipcon via jghoman)
|
||
|
||
HADOOP-6787. Factor out glob pattern code from FileContext and
|
||
Filesystem. Also fix bugs identified in HADOOP-6618 and make the
|
||
glob pattern code less restrictive and more POSIX standard
|
||
compliant. (Luke Lu via eli)
|
||
|
||
HADOOP-6649. login object in UGI should be inside the subject (jnp via
|
||
boryas)
|
||
|
||
HADOOP-6687. user object in the subject in UGI should be reused in case
|
||
of a relogin. (jnp via boryas)
|
||
|
||
HADOOP-6603. Provide workaround for issue with Kerberos not resolving
|
||
cross-realm principal (Kan Zhang and Jitendra Pandey via jghoman)
|
||
|
||
HADOOP-6620. NPE if renewer is passed as null in getDelegationToken.
|
||
(Jitendra Pandey via jghoman)
|
||
|
||
HADOOP-6613. Moves the RPC version check ahead of the AuthMethod check.
|
||
(Kan Zhang via ddas)
|
||
|
||
HADOOP-6682. NetUtils:normalizeHostName does not process hostnames starting
|
||
with [a-f] correctly. (jghoman)
|
||
|
||
HADOOP-6652. Removes the unnecessary cache from
|
||
ShellBasedUnixGroupsMapping. (ddas)
|
||
|
||
HADOOP-6815. refreshSuperUserGroupsConfiguration should use server side
|
||
configuration for the refresh (boryas)
|
||
|
||
HADOOP-6648. Adds a check for null tokens in Credentials.addToken api.
|
||
(ddas)
|
||
|
||
HADOOP-6647. balancer fails with "is not authorized for protocol
|
||
interface NamenodeProtocol" in secure environment (boryas)
|
||
|
||
HADOOP-6834. TFile.append compares initial key against null lastKey
|
||
(hong tang via mahadev)
|
||
|
||
HADOOP-6670. Use the UserGroupInformation's Subject as the criteria for
|
||
equals and hashCode. (Owen O'Malley and Kan Zhang via ddas)
|
||
|
||
HADOOP-6536. Fixes FileUtil.fullyDelete() not to delete the contents of
|
||
the sym-linked directory. (Ravi Gummadi via amareshwari)
|
||
|
||
HADOOP-6873. using delegation token over hftp for long
|
||
running clients (boryas)
|
||
|
||
HADOOP-6706. Improves the sasl failure handling due to expired tickets,
|
||
and other server detected failures. (Jitendra Pandey and ddas via ddas)
|
||
|
||
HADOOP-6715. Fixes AccessControlList.toString() to return a descriptive
|
||
String representation of the ACL. (Ravi Gummadi via amareshwari)
|
||
|
||
HADOOP-6885. Fix java doc warnings in Groups and
|
||
RefreshUserMappingsProtocol. (Eli Collins via jghoman)
|
||
|
||
HADOOP-6482. GenericOptionsParser constructor that takes Options and
|
||
String[] ignores options. (Eli Collins via jghoman)
|
||
|
||
HADOOP-6906. FileContext copy() utility doesn't work with recursive
|
||
copying of directories. (vinod k v via mahadev)
|
||
|
||
HADOOP-6453. Hadoop wrapper script shouldn't ignore an existing
|
||
JAVA_LIBRARY_PATH. (Chad Metcalf via jghoman)
|
||
|
||
HADOOP-6932. Namenode start (init) fails because of invalid kerberos
|
||
key, even when security set to "simple" (boryas)
|
||
|
||
HADOOP-6913. Circular initialization between UserGroupInformation and
|
||
KerberosName (Kan Zhang via boryas)
|
||
|
||
HADOOP-6907. Rpc client doesn't use the per-connection conf to figure
|
||
out server's Kerberos principal (Kan Zhang via hairong)
|
||
|
||
HADOOP-6938. ConnectionId.getRemotePrincipal() should check if security
|
||
is enabled. (Kan Zhang via hairong)
|
||
|
||
HADOOP-6930. AvroRpcEngine doesn't work with generated Avro code.
|
||
(sharad)
|
||
|
||
HADOOP-6940. RawLocalFileSystem's markSupported method misnamed
|
||
markSupport. (Tom White via eli)
|
||
|
||
HADOOP-6951. Distinct minicluster services (e.g. NN and JT) overwrite each
|
||
other's service policies. (Aaron T. Myers via tomwhite)
|
||
|
||
HADOOP-6879. Provide SSH based (Jsch) remote execution API for system
|
||
tests (cos)
|
||
|
||
HADOOP-6989. Correct the parameter for SetFile to set the value type
|
||
for SetFile to be NullWritable instead of the key. (cdouglas via omalley)
|
||
|
||
HADOOP-6984. Combine the compress kind and the codec in the same option
|
||
for SequenceFiles. (cdouglas via omalley)
|
||
|
||
HADOOP-6933. TestListFiles is flaky. (Todd Lipcon via tomwhite)
|
||
|
||
HADOOP-6947. Kerberos relogin should set refreshKrb5Config to true.
|
||
(Todd Lipcon via tomwhite)
|
||
|
||
HADOOP-7006. Fix 'fs -getmerge' command to not be a no-op.
|
||
(Chris Nauroth via cutting)
|
||
|
||
HADOOP-6663. BlockDecompressorStream gets EOF exception when decompressing
|
||
the file compressed from empty file. (Kang Xiao via tomwhite)
|
||
|
||
HADOOP-6991. Fix SequenceFile::Reader to honor file lengths and call
|
||
openFile (cdouglas via omalley)
|
||
|
||
HADOOP-7011. Fix KerberosName.main() to not throw an NPE.
|
||
(Aaron T. Myers via tomwhite)
|
||
|
||
HADOOP-6975. Integer overflow in S3InputStream for blocks > 2GB.
|
||
(Patrick Kling via tomwhite)
|
||
|
||
HADOOP-6758. MapFile.fix does not allow index interval definition.
|
||
(Gianmarco De Francisci Morales via tomwhite)
|
||
|
||
HADOOP-6926. SocketInputStream incorrectly implements read().
|
||
(Todd Lipcon via tomwhite)
|
||
|
||
HADOOP-6899. RawLocalFileSystem#setWorkingDir() does not work for relative names
|
||
(Sanjay Radia)
|
||
|
||
HADOOP-6496. HttpServer sends wrong content-type for CSS files
|
||
(and others). (Todd Lipcon via tomwhite)
|
||
|
||
HADOOP-7057. IOUtils.readFully and IOUtils.skipFully have typo in
|
||
exception creation's message. (cos)
|
||
|
||
HADOOP-7038. saveVersion script includes an additional \r while running
|
||
whoami under windows. (Wang Xu via cos)
|
||
|
||
HADOOP-7082. Configuration.writeXML should not hold lock while outputting
|
||
(todd)
|
||
|
||
HADOOP-7070. JAAS configuration should delegate unknown application names
|
||
to pre-existing configuration. (todd)
|
||
|
||
HADOOP-7087. SequenceFile.createWriter ignores FileSystem parameter (todd)
|
||
|
||
HADOOP-7091. reloginFromKeytab() should happen even if TGT can't be found.
|
||
(Kan Zhang via jghoman)
|
||
|
||
HADOOP-7100. Fix build to not refer to contrib/ec2 removed by HADOOP-6811
|
||
(todd)
|
||
|
||
HADOOP-7097. JAVA_LIBRARY_PATH missing base directory. (Noah Watkins via
|
||
todd)

HADOOP-7093. Servlets should default to text/plain (todd)

HADOOP-7089. Fix link resolution logic in hadoop-config.sh. (eli)

HADOOP-7046. Fix Findbugs warning in Configuration. (Po Cheung via shv)

HADOOP-7118. Fix NPE in Configuration.writeXml (todd)

HADOOP-7122. Fix thread leak when shell commands time out. (todd)

HADOOP-7126. Fix file permission setting for RawLocalFileSystem on Windows.
(Po Cheung via shv)

HADOOP-6642. Fix javac, javadoc, findbugs warnings related to security work.
(Chris Douglas, Po Cheung via shv)

HADOOP-7140. IPC Reader threads do not stop when server stops (todd)

HADOOP-7094. hadoop.css got lost during project split (cos)

HADOOP-7145. Configuration.getLocalPath should trim whitespace from
the provided directories. (todd)

HADOOP-7156. Workaround for unsafe implementations of getpwuid_r (todd)

HADOOP-6898. FileSystem.copyToLocal creates files with 777 permissions.
(Aaron T. Myers via tomwhite)

HADOOP-7229. Do not default to an absolute path for kinit in Kerberos
auto-renewal thread. (Aaron T. Myers via todd)

HADOOP-7172. SecureIO should not check owner on non-secure
clusters that have no native support. (todd via eli)

HADOOP-7184. Remove deprecated config local.cache.size from
core-default.xml (todd)

HADOOP-7245. FsConfig should use constants in CommonConfigurationKeys.
(tomwhite via eli)

HADOOP-7068. Ivy resolve force mode should be turned off by default.
(Luke Lu via tomwhite)

HADOOP-7296. The FsPermission(FsPermission) constructor does not use the
sticky bit. (Siddharth Seth via tomwhite)

HADOOP-7300. Configuration methods that return collections are inconsistent
about mutability. (todd)

HADOOP-7305. Eclipse project classpath should include tools.jar from JDK.
(Niels Basjes via todd)

HADOOP-7318. MD5Hash factory should reset the digester it returns.
(todd via eli)

HADOOP-7287. Configuration deprecation mechanism doesn't work properly for
GenericOptionsParser and Tools. (Aaron T. Myers via todd)

HADOOP-7146. RPC server leaks file descriptors (todd)

HADOOP-7276. Hadoop native builds fail on ARM due to -m32 (Trevor Robinson
via eli)

HADOOP-7121. Exceptions while serializing IPC call responses are not
handled well. (todd)

HADOOP-7351. Regression: HttpServer#getWebAppsPath used to be protected
so subclasses could supply alternate webapps path but it was made private
by HADOOP-6461 (Stack)

HADOOP-7349. HADOOP-7121 accidentally disabled some tests in TestIPC.
(todd)

HADOOP-7390. VersionInfo not generated properly in git after unsplit. (todd
via atm)

HADOOP-7568. SequenceFile should not print into stdout.
(Plamen Jeliazkov via shv)

HADOOP-7663. Fix TestHDFSTrash failure. (Mayank Bansal via shv)

HADOOP-7457. Remove out-of-date Chinese language documentation.
(Jakob Homan via eli)

HADOOP-7783. Add more symlink tests that cover intermediate links. (eli)

Release 0.21.1 - Unreleased

IMPROVEMENTS

HADOOP-6934. Test for ByteWritable comparator.
(Johannes Zillmann via Eli Collins)

HADOOP-6786. test-patch needs to verify Herriot integrity (cos)

HADOOP-7177. CodecPool should report which compressor it is using.
(Allen Wittenauer via eli)

BUG FIXES

HADOOP-6925. BZip2Codec incorrectly implements read().
(Todd Lipcon via Eli Collins)

HADOOP-6833. IPC leaks call parameters when exceptions thrown.
(Todd Lipcon via Eli Collins)

HADOOP-6971. Clover build doesn't generate per-test coverage (cos)

HADOOP-6993. Broken link on cluster setup page of docs. (eli)

HADOOP-6944. [Herriot] Implement functionality for getting proxy user
definitions like groups and hosts. (Vinay Thota via cos)

HADOOP-6954. Sources JARs are not correctly published to the Maven
repository. (tomwhite)

HADOOP-7052. Misspelling of threshold in conf/log4j.properties.
(Jingguo Yao via eli)

HADOOP-7053. Wrong FSNamesystem Audit logging setting in
conf/log4j.properties. (Jingguo Yao via eli)

HADOOP-7120. Fix a syntax error in test-patch.sh. (szetszwo)

HADOOP-7162. Remove a duplicated call to FileSystem.listStatus(..) in FsShell.
(Alexey Diomin via szetszwo)

HADOOP-7117. Remove fs.checkpoint.* from core-default.xml and replace
fs.checkpoint.* with dfs.namenode.checkpoint.* in documentation.
(Harsh J Chouraria via szetszwo)

HADOOP-7193. Correct the "fs -touchz" command help message.
(Uma Maheswara Rao G via szetszwo)

HADOOP-7174. Null is displayed in the "fs -copyToLocal" command.
(Uma Maheswara Rao G via szetszwo)

HADOOP-7194. Fix resource leak in IOUtils.copyBytes(..).
(Devaraj K via szetszwo)

HADOOP-7183. WritableComparator.get should not cache comparator objects.
(tomwhite via eli)

Release 0.21.0 - 2010-08-13

INCOMPATIBLE CHANGES

HADOOP-4895. Remove deprecated methods DFSClient.getHints(..) and
DFSClient.isDirectory(..). (szetszwo)

HADOOP-4941. Remove deprecated FileSystem methods: getBlockSize(Path f),
getLength(Path f) and getReplication(Path src). (szetszwo)

HADOOP-4648. Remove obsolete, deprecated InMemoryFileSystem and
ChecksumDistributedFileSystem. (cdouglas via szetszwo)

HADOOP-4940. Remove a deprecated method FileSystem.delete(Path f). (Enis
Soztutar via szetszwo)

HADOOP-4010. Change semantics for LineRecordReader to read an additional
line per split, rather than moving back one character in the stream, to
work with splittable compression codecs. (Abdul Qadeer via cdouglas)

HADOOP-5094. Show hostname and separate live/dead datanodes in DFSAdmin
report. (Jakob Homan via szetszwo)

HADOOP-4942. Remove deprecated FileSystem methods getName() and
getNamed(String name, Configuration conf). (Jakob Homan via szetszwo)

HADOOP-5486. Removes the CLASSPATH string from the command line and instead
exports it in the environment. (Amareshwari Sriramadasu via ddas)

HADOOP-2827. Remove deprecated NetUtils::getServerAddress. (cdouglas)

HADOOP-5681. Change examples RandomWriter and RandomTextWriter to
use new mapreduce API. (Amareshwari Sriramadasu via sharad)

HADOOP-5680. Change org.apache.hadoop.examples.SleepJob to use new
mapreduce api. (Amareshwari Sriramadasu via sharad)

HADOOP-5699. Change org.apache.hadoop.examples.PiEstimator to use
new mapreduce api. (Amareshwari Sriramadasu via sharad)

HADOOP-5720. Introduces new task types - JOB_SETUP, JOB_CLEANUP
and TASK_CLEANUP. Removes the isMap methods from TaskID/TaskAttemptID
classes. (ddas)

HADOOP-5668. Change TotalOrderPartitioner to use new API. (Amareshwari
Sriramadasu via cdouglas)

HADOOP-5738. Split "waiting_tasks" JobTracker metric into waiting maps and
waiting reduces. (Sreekanth Ramakrishnan via cdouglas)

HADOOP-5679. Resolve findbugs warnings in core/streaming/pipes/examples.
(Jothi Padmanabhan via sharad)

HADOOP-4359. Support for data access authorization checking on Datanodes.
(Kan Zhang via rangadi)

HADOOP-5690. Change org.apache.hadoop.examples.DBCountPageView to use
new mapreduce api. (Amareshwari Sriramadasu via sharad)

HADOOP-5694. Change org.apache.hadoop.examples.dancing to use new
mapreduce api. (Amareshwari Sriramadasu via sharad)

HADOOP-5696. Change org.apache.hadoop.examples.Sort to use new
mapreduce api. (Amareshwari Sriramadasu via sharad)

HADOOP-5698. Change org.apache.hadoop.examples.MultiFileWordCount to
use new mapreduce api. (Amareshwari Sriramadasu via sharad)

HADOOP-5913. Provide the ability for an administrator to stop and start
job queues. (Rahul Kumar Singh and Hemanth Yamijala via yhemanth)

MAPREDUCE-711. Removed Distributed Cache from Common, to move it
under Map/Reduce. (Vinod Kumar Vavilapalli via yhemanth)

HADOOP-6201. Change FileSystem::listStatus contract to throw
FileNotFoundException if the directory does not exist, rather than letting
this be implementation-specific. (Jakob Homan via cdouglas)

HADOOP-6230. Moved process tree and memory calculator related classes
from Common to Map/Reduce. (Vinod Kumar Vavilapalli via yhemanth)

HADOOP-6203. FsShell rm/rmr error message indicates exceeding Trash quota
and suggests using -skipTrash, when moving to trash fails.
(Boris Shkolnik via suresh)

HADOOP-6303. Eclipse .classpath template has outdated jar files and is
missing some new ones. (cos)

HADOOP-6396. Fix uninformative exception message when unable to parse
umask. (jghoman)

HADOOP-6299. Reimplement the UserGroupInformation to use the OS
specific and Kerberos JAAS login. (omalley)

HADOOP-6686. Remove redundant exception class name from the exception
message for the exceptions thrown at RPC client. (suresh)

HADOOP-6701. Fix incorrect exit codes returned from chmod, chown and chgrp
commands from FsShell. (Ravi Phulari via suresh)

NEW FEATURES

HADOOP-6332. Large-scale Automated Test Framework. (sharad, Sreekanth
Ramakrishnan, et al. via cos)

HADOOP-4268. Change fsck to use ClientProtocol methods so that the
corresponding permission requirement for running the ClientProtocol
methods will be enforced. (szetszwo)

HADOOP-3953. Implement sticky bit for directories in HDFS. (Jakob Homan
via szetszwo)

HADOOP-4368. Implement df in FsShell to show the status of a FileSystem.
(Craig Macdonald via szetszwo)

HADOOP-3741. Add a web UI to the SecondaryNameNode for showing its status.
(szetszwo)

HADOOP-5018. Add pipelined writers to Chukwa. (Ari Rabkin via cdouglas)

HADOOP-5052. Add an example computing exact digits of pi using the
Bailey-Borwein-Plouffe algorithm. (Tsz Wo (Nicholas), SZE via cdouglas)

HADOOP-4927. Adds a generic wrapper around outputformat to allow creation of
output on demand (Jothi Padmanabhan via ddas)

HADOOP-5144. Add a new DFSAdmin command for changing the setting of restore
failed storage replicas in namenode. (Boris Shkolnik via szetszwo)

HADOOP-5258. Add a new DFSAdmin command to print a tree of the rack and
datanode topology as seen by the namenode. (Jakob Homan via szetszwo)

HADOOP-4756. A command line tool to access JMX properties on NameNode
and DataNode. (Boris Shkolnik via rangadi)

HADOOP-4539. Introduce backup node and checkpoint node. (shv)

HADOOP-5363. Add support for proxying connections to multiple clusters with
different versions to hdfsproxy. (Zhiyong Zhang via cdouglas)

HADOOP-5528. Add a configurable hash partitioner operating on ranges of
BinaryComparable keys. (Klaas Bosteels via shv)

HADOOP-5257. HDFS servers may start and stop external components through
a plugin interface. (Carlos Valiente via dhruba)

HADOOP-5450. Add application-specific data types to streaming's typed bytes
interface. (Klaas Bosteels via omalley)

HADOOP-5518. Add contrib/mrunit, a MapReduce unit test framework.
(Aaron Kimball via cutting)

HADOOP-5469. Add /metrics servlet to daemons, providing metrics
over HTTP as either text or JSON. (Philip Zeyliger via cutting)

HADOOP-5467. Introduce offline fsimage image viewer. (Jakob Homan via shv)

HADOOP-5752. Add a new hdfs image processor, Delimited, to oiv. (Jakob
Homan via szetszwo)

HADOOP-5266. Adds the capability to do mark/reset of the reduce values
iterator in the Context object API. (Jothi Padmanabhan via ddas)

HADOOP-5745. Allow setting the default value of maxRunningJobs for all
pools. (dhruba via matei)

HADOOP-5643. Adds a way to decommission TaskTrackers while the JobTracker
is running. (Amar Kamat via ddas)

HADOOP-4829. Allow FileSystem shutdown hook to be disabled.
(Todd Lipcon via tomwhite)

HADOOP-5815. Sqoop: A database import tool for Hadoop.
(Aaron Kimball via tomwhite)

HADOOP-4861. Add disk usage with human-readable size (-duh).
(Todd Lipcon via tomwhite)

HADOOP-5844. Use mysqldump when connecting to local mysql instance in Sqoop.
(Aaron Kimball via tomwhite)

HADOOP-5976. Add a new command, classpath, to the hadoop script. (Owen
O'Malley and Gary Murry via szetszwo)

HADOOP-6120. Add support for Avro specific and reflect data.
(sharad via cutting)

HADOOP-6226. Moves BoundedByteArrayOutputStream from the tfile package to
the io package and makes it available to other users (MAPREDUCE-318).
(Jothi Padmanabhan via ddas)

HADOOP-6105. Adds support for automatically handling deprecation of
configuration keys. (V.V.Chaitanya Krishna via yhemanth)

HADOOP-6235. Adds new method to FileSystem for clients to get server
defaults. (Kan Zhang via suresh)

HADOOP-6234. Add new option dfs.umaskmode to set umask in configuration
to use octal or symbolic instead of decimal. (Jakob Homan via suresh)

HADOOP-5073. Add annotation mechanism for interface classification.
(Jakob Homan via suresh)

HADOOP-4012. Provide splitting support for bzip2 compressed files. (Abdul
Qadeer via cdouglas)

HADOOP-6246. Add backward compatibility support to use deprecated decimal
umask from old configuration. (Jakob Homan via suresh)

HADOOP-4952. Add new improved file system interface FileContext for the
application writer (Sanjay Radia via suresh)

HADOOP-6170. Add facility to tunnel Avro RPCs through Hadoop RPCs.
This permits one to take advantage of both Avro's RPC versioning
features and Hadoop's proven RPC scalability. (cutting)

HADOOP-6267. Permit building contrib modules located in external
source trees. (Todd Lipcon via cutting)

HADOOP-6240. Add new FileContext rename operation that is POSIX compliant
and allows overwriting an existing destination. (suresh)

HADOOP-6204. Implementing aspects development and fault injection
framework for Hadoop (cos)

HADOOP-6313. Implement Syncable interface in FSDataOutputStream to expose
flush APIs to application users. (Hairong Kuang via suresh)

HADOOP-6284. Add a new parameter, HADOOP_JAVA_PLATFORM_OPTS, to
hadoop-config.sh so that it allows setting java command options for
JAVA_PLATFORM. (Koji Noguchi via szetszwo)

HADOOP-6337. Updates FilterInitializer class to be more visible,
and the init of the class is made to take a Configuration argument.
(Jakob Homan via ddas)

HADOOP-6223. Add new file system interface AbstractFileSystem with
implementation of some file systems that delegate to old FileSystem.
(Sanjay Radia via suresh)

HADOOP-6433. Introduce asynchronous deletion of files via a pool of
threads. This can be used to delete files in the Distributed
Cache. (Zheng Shao via dhruba)

HADOOP-6415. Adds a common token interface for both job token and
delegation token. (Kan Zhang via ddas)

HADOOP-6408. Add a /conf servlet to dump running configuration.
(Todd Lipcon via tomwhite)

HADOOP-6520. Adds APIs to read/write Token and secret keys. Also
adds the automatic loading of tokens into UserGroupInformation
upon login. The tokens are read from a file specified in the
environment variable. (ddas)

HADOOP-6419. Adds SASL based authentication to RPC.
(Kan Zhang via ddas)

HADOOP-6510. Adds a way for superusers to impersonate other users
in a secure environment. (Jitendra Nath Pandey via ddas)

HADOOP-6421. Adds Symbolic links to FileContext, AbstractFileSystem.
It also adds a limited implementation for the local file system
(RawLocalFs) that allows local symlinks. (Eli Collins via Sanjay Radia)

HADOOP-6577. Add hidden configuration option "ipc.server.max.response.size"
to change the default 1 MB, the maximum size when large IPC handler
response buffer is reset. (suresh)

HADOOP-6568. Adds authorization for the default servlets.
(Vinod Kumar Vavilapalli via ddas)

HADOOP-6586. Log authentication and authorization failures and successes
for RPC (boryas)

HADOOP-6580. UGI should contain authentication method. (jnp via boryas)

HADOOP-6657. Add a capitalization method to StringUtils for MAPREDUCE-1545.
(Luke Lu via Steve Loughran)

HADOOP-6692. Add FileContext#listStatus that returns an iterator.
(hairong)

HADOOP-6869. Functionality to create file or folder on a remote daemon
side (Vinay Thota via cos)

IMPROVEMENTS

HADOOP-6798. Align Ivy version for all Hadoop subprojects. (cos)

HADOOP-6777. Implement functionality to suspend and resume a process.
(Vinay Thota via cos)

HADOOP-6772. Utilities specific to system tests. (Vinay Thota via cos)

HADOOP-6771. Herriot's artifact id for Maven deployment should be set to
hadoop-core-instrumented (cos)

HADOOP-6752. Remote cluster control functionality needs JavaDocs
improvement. (Balaji Rajagopalan via cos)

HADOOP-4565. Added CombineFileInputFormat to use data locality information
to create splits. (dhruba via zshao)

HADOOP-4936. Improvements to TestSafeMode. (shv)

HADOOP-4985. Remove unnecessary "throw IOException" declarations in
FSDirectory related methods. (szetszwo)

HADOOP-5017. Change NameNode.namesystem declaration to private. (szetszwo)

HADOOP-4794. Add branch information from the source version control into
the version information that is compiled into Hadoop. (cdouglas via
omalley)

HADOOP-5070. Increment copyright year to 2009, remove assertions of ASF
copyright to licensed files. (Tsz Wo (Nicholas), SZE via cdouglas)

HADOOP-5037. Deprecate static FSNamesystem.getFSNamesystem(). (szetszwo)

HADOOP-5088. Include releaseaudit target as part of developer test-patch
target. (Giridharan Kesavan via nigel)

HADOOP-2721. Uses setsid when creating new tasks so that subprocesses of
this process will be within this new session (and this process will be
the process leader for all the subprocesses). Killing the process leader,
or the main Java task in Hadoop's case, kills the entire subtree of
processes. (Ravi Gummadi via ddas)

HADOOP-5097. Remove static variable JspHelper.fsn, a static reference to
a non-singleton FSNamesystem object. (szetszwo)

HADOOP-3327. Improves handling of READ_TIMEOUT during map output copying.
(Amareshwari Sriramadasu via ddas)

HADOOP-5124. Choose datanodes randomly instead of starting from the first
datanode for providing fairness. (hairong via szetszwo)

HADOOP-4930. Implement a Linux native executable that can be used to
launch tasks as users. (Sreekanth Ramakrishnan via yhemanth)

HADOOP-5122. Fix format of fs.default.name value in libhdfs test conf.
(Craig Macdonald via tomwhite)

HADOOP-5038. Direct daemon trace to debug log instead of stdout. (Jerome
Boulon via cdouglas)

HADOOP-5101. Improve packaging by adding 'all-jars' target building core,
tools, and example jars. Let findbugs depend on this rather than the 'tar'
target. (Giridharan Kesavan via cdouglas)

HADOOP-4868. Splits the hadoop script into three parts - bin/hadoop,
bin/mapred and bin/hdfs. (Sharad Agarwal via ddas)

HADOOP-1722. Adds support for TypedBytes and RawBytes in Streaming.
(Klaas Bosteels via ddas)

HADOOP-4220. Changes the JobTracker restart tests so that they take much
less time. (Amar Kamat via ddas)

HADOOP-4885. Try to restore failed name-node storage directories at
checkpoint time. (Boris Shkolnik via shv)

HADOOP-5209. Update year to 2009 for javadoc. (szetszwo)

HADOOP-5279. Remove unnecessary targets from test-patch.sh.
(Giridharan Kesavan via nigel)

HADOOP-5120. Remove the use of FSNamesystem.getFSNamesystem() from
UpgradeManagerNamenode and UpgradeObjectNamenode. (szetszwo)

HADOOP-5222. Add offset to datanode clienttrace. (Lei Xu via cdouglas)

HADOOP-5240. Skip re-building javadoc when it is already
up-to-date. (Aaron Kimball via cutting)

HADOOP-5042. Add a cleanup stage to log rollover in Chukwa appender.
(Jerome Boulon via cdouglas)

HADOOP-5264. Removes redundant configuration object from the TaskTracker.
(Sharad Agarwal via ddas)

HADOOP-5232. Enable patch testing to occur on more than one host.
(Giri Kesavan via nigel)

HADOOP-4546. Fix DF reporting for AIX. (Bill Habermaas via cdouglas)

HADOOP-5023. Add Tomcat support to HdfsProxy. (Zhiyong Zhang via cdouglas)

HADOOP-5317. Provide documentation for LazyOutput Feature.
(Jothi Padmanabhan via johan)

HADOOP-5455. Document rpc metrics context to the extent dfs, mapred, and
jvm contexts are documented. (Philip Zeyliger via cdouglas)

HADOOP-5358. Provide scripting functionality to the synthetic load
generator. (Jakob Homan via hairong)

HADOOP-5442. Paginate jobhistory display and add some search
capabilities. (Amar Kamat via acmurthy)

HADOOP-4842. Streaming now allows specifying a command for the combiner.
(Amareshwari Sriramadasu via ddas)

HADOOP-5196. Avoid unnecessary byte[] allocation in
SequenceFile.CompressedBytes and SequenceFile.UncompressedBytes.
(hong tang via mahadev)

HADOOP-4655. New method FileSystem.newInstance() that always returns
a newly allocated FileSystem object. (dhruba)

HADOOP-4788. Set Fair scheduler to assign both a map and a reduce on each
heartbeat by default. (matei)

HADOOP-5491. In contrib/index, better control memory usage.
(Ning Li via cutting)

HADOOP-5423. Include option of preserving file metadata in
SequenceFile::sort. (Michael Tamm via cdouglas)

HADOOP-5331. Add support for KFS appends. (Sriram Rao via cdouglas)

HADOOP-4365. Make Configuration::getProps protected in support of
meaningful subclassing. (Steve Loughran via cdouglas)

HADOOP-2413. Remove the static variable FSNamesystem.fsNamesystemObject.
(Konstantin Shvachko via szetszwo)

HADOOP-4584. Improve datanode block reports and associated file system
scan to avoid interfering with normal datanode operations.
(Suresh Srinivas via rangadi)

HADOOP-5502. Documentation for backup and checkpoint nodes.
(Jakob Homan via shv)

HADOOP-5485. Mask actions in the fair scheduler's servlet UI based on
value of webinterface.private.actions.
(Vinod Kumar Vavilapalli via yhemanth)

HADOOP-5581. HDFS should throw FileNotFoundException when opening
a file that does not exist. (Brian Bockelman via rangadi)

HADOOP-5509. PendingReplicationBlocks does not start monitor in the
constructor. (shv)

HADOOP-5494. Modify sorted map output merger to lazily read values,
rather than buffering at least one record for each segment. (Devaraj Das
via cdouglas)

HADOOP-5396. Provide ability to refresh queue ACLs in the JobTracker
without having to restart the daemon.
(Sreekanth Ramakrishnan and Vinod Kumar Vavilapalli via yhemanth)

HADOOP-4490. Provide ability to run tasks as job owners.
(Sreekanth Ramakrishnan via yhemanth)

HADOOP-5697. Change org.apache.hadoop.examples.Grep to use new
mapreduce api. (Amareshwari Sriramadasu via sharad)

HADOOP-5625. Add operation duration to clienttrace. (Lei Xu via cdouglas)

HADOOP-5705. Improve TotalOrderPartitioner efficiency by updating the trie
construction. (Dick King via cdouglas)

HADOOP-5589. Eliminate source limit of 64 for map-side joins imposed by
TupleWritable encoding. (Jingkei Ly via cdouglas)

HADOOP-5734. Correct block placement policy description in HDFS
Design document. (Konstantin Boudnik via shv)

HADOOP-5657. Validate data in TestReduceFetch to improve merge test
coverage. (cdouglas)

HADOOP-5613. Change S3Exception to checked exception.
(Andrew Hitchcock via tomwhite)

HADOOP-5717. Create public enum class for the Framework counters in
org.apache.hadoop.mapreduce. (Amareshwari Sriramadasu via sharad)

HADOOP-5217. Split AllTestDriver for core, hdfs and mapred. (sharad)

HADOOP-5364. Add certificate expiration warning to HsftpFileSystem and HDFS
proxy. (Zhiyong Zhang via cdouglas)

HADOOP-5733. Add map/reduce slot capacity and blacklisted capacity to
JobTracker metrics. (Sreekanth Ramakrishnan via cdouglas)

HADOOP-5596. Add EnumSetWritable. (He Yongqiang via szetszwo)

HADOOP-5727. Simplify hashcode for ID types. (Shevek via cdouglas)

HADOOP-5500. In DBOutputFormat, where field names are absent permit the
number of fields to be sufficient to construct the select query. (Enis
Soztutar via cdouglas)

HADOOP-5081. Split TestCLI into HDFS, Mapred and Core tests. (sharad)

HADOOP-5015. Separate block management code from FSNamesystem. (Suresh
Srinivas via szetszwo)

HADOOP-5080. Add new test cases to TestMRCLI and TestHDFSCLI
(V.Karthikeyan via nigel)

HADOOP-5135. Splits the tests into different directories based on the
package. Four new test targets have been defined - run-test-core,
run-test-mapred, run-test-hdfs and run-test-hdfs-with-mr.
(Sharad Agarwal via ddas)

HADOOP-5771. Implements unit tests for LinuxTaskController.
(Sreekanth Ramakrishnan and Vinod Kumar Vavilapalli via yhemanth)

HADOOP-5419. Provide a facility to query the Queue ACLs for the
current user.
(Rahul Kumar Singh via yhemanth)

HADOOP-5780. Improve per block message printed by "-metaSave" in HDFS.
(Raghu Angadi)

HADOOP-5823. Added a new class DeprecatedUTF8 to help with removing
UTF8 related javac warnings. These warnings are removed in
FSEditLog.java as a use case. (Raghu Angadi)

HADOOP-5824. Deprecate DataTransferProtocol.OP_READ_METADATA and remove
the corresponding unused code. (Kan Zhang via szetszwo)

HADOOP-5721. Factor out EditLogFileInputStream and EditLogFileOutputStream
into independent classes. (Luca Telloli & Flavio Junqueira via shv)

HADOOP-5838. Fix a few javac warnings in HDFS. (Raghu Angadi)

HADOOP-5854. Fix a few "Inconsistent Synchronization" warnings in HDFS.
(Raghu Angadi)

HADOOP-5369. Small tweaks to reduce MapFile index size. (Ben Maurer
via sharad)

HADOOP-5858. Eliminate UTF8 and fix warnings in test/hdfs-with-mr package.
(shv)

HADOOP-5866. Move DeprecatedUTF8 from o.a.h.io to o.a.h.hdfs since it may
not be used outside hdfs. (Raghu Angadi)

HADOOP-5857. Move normal java methods from hdfs .jsp files to .java files.
(szetszwo)

HADOOP-5873. Remove deprecated methods randomDataNode() and
getDatanodeByIndex(..) in FSNamesystem. (szetszwo)

HADOOP-5572. Improves the progress reporting for the sort phase for both
maps and reduces. (Ravi Gummadi via ddas)

HADOOP-5839. Fix EC2 scripts to allow remote job submission.
(Joydeep Sen Sarma via tomwhite)

HADOOP-5877. Fix javac warnings in TestHDFSServerPorts, TestCheckpoint,
TestNameEditsConfig, TestStartup and TestStorageRestore.
(Jakob Homan via shv)

HADOOP-5438. Provide a single FileSystem method to create or
open-for-append to a file. (He Yongqiang via dhruba)

HADOOP-5472. Change DistCp to support globbing of input paths. (Dhruba
Borthakur and Rodrigo Schmidt via szetszwo)

HADOOP-5175. Don't unpack libjars on classpath. (Todd Lipcon via tomwhite)

HADOOP-5620. Add an option to DistCp for preserving modification and access
times. (Rodrigo Schmidt via szetszwo)

HADOOP-5664. Change map serialization so a lock is obtained only where
contention is possible, rather than for each write. (cdouglas)
|
||
|
||
HADOOP-5896. Remove the dependency of GenericOptionsParser on
|
||
Option.withArgPattern. (Giridharan Kesavan and Sharad Agarwal via
|
||
sharad)
|
||
|
||
HADOOP-5784. Makes the number of heartbeats that should arrive a second
|
||
at the JobTracker configurable. (Amareshwari Sriramadasu via ddas)
|
||
|
||
HADOOP-5955. Changes TestFileOuputFormat so that is uses LOCAL_MR
|
||
instead of CLUSTER_MR. (Jothi Padmanabhan via das)
|
||
|
||
HADOOP-5948. Changes TestJavaSerialization to use LocalJobRunner
|
||
instead of MiniMR/DFS cluster. (Jothi Padmanabhan via das)
|
||
|
||
HADOOP-2838. Add mapred.child.env to pass environment variables to
|
||
tasktracker's child processes. (Amar Kamat via sharad)
|
||
|
||
HADOOP-5961. DataNode process understand generic hadoop command line
|
||
options (like -Ddfs.property=value). (Raghu Angadi)
|
||
|
||
HADOOP-5938. Change org.apache.hadoop.mapred.jobcontrol to use new
|
||
api. (Amareshwari Sriramadasu via sharad)
|
||
|
||
HADOOP-2141. Improves the speculative execution heuristic. The heuristic
|
||
is currently based on the progress-rates of tasks and the expected time
|
||
to complete. Also, statistics about trackers are collected, and speculative
|
||
tasks are not given to the ones deduced to be slow.
|
||
(Andy Konwinski and ddas)
|
||
|
||
HADOOP-5952. Change "-1 tests included" wording in test-patch.sh.
|
||
(Gary Murry via szetszwo)
|
||
|
||
HADOOP-6106. Provides an option in ShellCommandExecutor to timeout
|
||
commands that do not complete within a certain amount of time.
|
||
(Sreekanth Ramakrishnan via yhemanth)
|
||
|
||
HADOOP-5925. EC2 scripts should exit on error. (tomwhite)
|
||
|
||
HADOOP-6109. Change Text to grow its internal buffer exponentially, rather
|
||
than the max of the current length and the proposed length to improve
|
||
performance reading large values. (thushara wijeratna via cdouglas)
|
||
|
||
HADOOP-2366. Support trimmed strings in Configuration. (Michele Catasta
|
||
via szetszwo)
|
||
|
||
HADOOP-6099. The RPC module can be configured to not send periodic pings.
|
||
The default behaviour of sending periodic pings remains unchanged. (dhruba)
|
||
|
||
HADOOP-6142. Update documentation and use of harchives for relative paths
|
||
added in MAPREDUCE-739. (Mahadev Konar via cdouglas)
|
||
|
||
HADOOP-6148. Implement a fast, pure Java CRC32 calculator which outperforms
|
||
java.util.zip.CRC32. (Todd Lipcon and Scott Carey via szetszwo)
|
||
|
||
HADOOP-6146. Upgrade to JetS3t version 0.7.1. (tomwhite)
|
||
|
||
HADOOP-6161. Add get/setEnum methods to Configuration. (cdouglas)
|
||
|
||
HADOOP-6160. Fix releaseaudit target to run on specific directories.
|
||
(gkesavan)
|
||
|
||
HADOOP-6169. Removing deprecated method calls in TFile. (hong tang via
|
||
mahadev)
|
||
|
||
HADOOP-6176. Add a couple package private methods to AccessTokenHandler
|
||
for testing. (Kan Zhang via szetszwo)
|
||
|
||
HADOOP-6182. Fix ReleaseAudit warnings (Giridharan Kesavan and Lee Tucker
|
||
via gkesavan)
|
||
|
||
HADOOP-6173. Change src/native/packageNativeHadoop.sh to package all
|
||
native library files. (Hong Tang via szetszwo)
|
||
|
||
HADOOP-6184. Provide an API to dump Configuration in a JSON format.
|
||
(V.V.Chaitanya Krishna via yhemanth)
|
||
|
||
HADOOP-6224. Add a method to WritableUtils performing a bounded read of an
|
||
encoded String. (Jothi Padmanabhan via cdouglas)
|
||
|
||
HADOOP-6133. Add a caching layer to Configuration::getClassByName to
|
||
alleviate a performance regression introduced in a compatibility layer.
|
||
(Todd Lipcon via cdouglas)
|
||
|
||
HADOOP-6252. Provide a method to determine if a deprecated key is set in
|
||
config file. (Jakob Homan via suresh)
|
||
|
||
HADOOP-5879. Read compression level and strategy from Configuration for
|
||
gzip compression. (He Yongqiang via cdouglas)
|
||
|
||
HADOOP-6216. Support comments in host files. (Ravi Phulari and Dmytro
|
||
Molkov via szetszwo)
|
||
|
||
HADOOP-6217. Update documentation for project split. (Corinne Chandel via
|
||
omalley)
|
||
|
||
HADOOP-6268. Add ivy jar to .gitignore. (Todd Lipcon via cdouglas)
|
||
|
||
HADOOP-6270. Support deleteOnExit in FileContext. (Suresh Srinivas via
|
||
szetszwo)
|
||
|
||
HADOOP-6233. Rename configuration keys towards API standardization and
|
||
backward compatibility. (Jithendra Pandey via suresh)
|
||
|
||
HADOOP-6260. Add additional unit tests for FileContext util methods.
|
||
(Gary Murry via suresh)
|
||
|
||
HADOOP-6309. Change build.xml to run tests with java asserts. (Eli
|
||
Collins via szetszwo)
|
||
|
||
HADOOP-6326. Hudson runs should check for AspectJ warnings and report
|
||
failure if any is present (cos)
|
||
|
||
HADOOP-6329. Add build-fi directory to the ignore lists. (szetszwo)
|
||
|
||
HADOOP-5107. Use Maven ant tasks to publish the subproject jars.
|
||
(Giridharan Kesavan via omalley)
|
||
|
||
HADOOP-6343. Log unexpected throwable object caught in RPC. (Jitendra Nath
|
||
Pandey via szetszwo)
|
||
|
||
HADOOP-6367. Removes Access Token implementation from common.
|
||
(Kan Zhang via ddas)
|
||
|
||
HADOOP-6395. Upgrade some libraries to be consistent across common, hdfs,
|
||
and mapreduce. (omalley)
|
||
|
||
HADOOP-6398. Build is broken after HADOOP-6395 patch has been applied (cos)
|
||
|
||
HADOOP-6413. Move TestReflectionUtils to Common. (Todd Lipcon via tomwhite)
|
||
|
||
HADOOP-6283. Improve the exception messages thrown by
|
||
FileUtil$HardLink.getLinkCount(..). (szetszwo)
|
||
|
||
HADOOP-6279. Add Runtime::maxMemory to JVM metrics. (Todd Lipcon via
|
||
cdouglas)
|
||
|
||
HADOOP-6305. Unify build property names to facilitate cross-projects
|
||
modifications (cos)
|
||
|
||
HADOOP-6312. Remove unnecessary debug logging in Configuration constructor.
|
||
(Aaron Kimball via cdouglas)
|
||
|
||
HADOOP-6366. Reduce ivy console output to observable level (cos)
|
||
|
||
HADOOP-6400. Log errors getting Unix UGI. (Todd Lipcon via tomwhite)
|
||
|
||
HADOOP-6346. Add support for specifying unpack pattern regex to
|
||
RunJar.unJar. (Todd Lipcon via tomwhite)
|
||
|
||
HADOOP-6422. Make RPC backend pluggable, protocol-by-protocol, to
|
||
ease evolution towards Avro. (cutting)
|
||
|
||
HADOOP-5958. Use JDK 1.6 File APIs in DF.java wherever possible.
|
||
(Aaron Kimball via tomwhite)
|
||
|
||
HADOOP-6222. Core doesn't have TestCommonCLI facility. (cos)
|
||
|
||
HADOOP-6394. Add a helper class to simplify FileContext related tests and
|
||
improve code reusability. (Jitendra Nath Pandey via suresh)
|
||
|
||
HADOOP-4656. Add a user to groups mapping service. (boryas, acmurthy)
|
||
|
||
HADOOP-6435. Make RPC.waitForProxy with timeout public. (Steve Loughran
|
||
via tomwhite)
|
||
|
||
HADOOP-6472. Add tokenCache option to GenericOptionsParser for passing
|
||
file with secret keys to a map reduce job. (boryas)
|
||
|
||
HADOOP-3205. Read multiple chunks directly from FSInputChecker subclass
|
||
into user buffers. (Todd Lipcon via tomwhite)
|
||
|
||
HADOOP-6479. TestUTF8 assertions could fail with better text.
|
||
(Steve Loughran via tomwhite)
|
||
|
||
HADOOP-6155. Deprecate RecordIO anticipating Avro. (Tom White via cdouglas)
|
||
|
||
HADOOP-6492. Make some Avro serialization APIs public.
|
||
(Aaron Kimball via cutting)
|
||
|
||
HADOOP-6497. Add an adapter for Avro's SeekableInput interface, so
|
||
that Avro can read FileSystem data.
|
||
(Aaron Kimball via cutting)
|
||
|
||
HADOOP-6495. Identifier should be serialized after the password is
|
||
created in Token constructor. (jnp via boryas)
|
||
|
||
HADOOP-6518. Makes the UGI honor the env var KRB5CCNAME.
|
||
(Owen O'Malley via ddas)
|
||
|
||
HADOOP-6531. Enhance FileUtil with an API to delete all contents of a
|
||
directory. (Amareshwari Sriramadasu via yhemanth)
|
||
|
||
HADOOP-6547. Move DelegationToken into Common, so that it can be used by
|
||
MapReduce also. (devaraj via omalley)
|
||
|
||
HADOOP-6552. Puts renewTGT=true and useTicketCache=true for the keytab
|
||
kerberos options. (ddas)
|
||
|
||
HADOOP-6534. Trim whitespace from directory lists initializing
|
||
LocalDirAllocator. (Todd Lipcon via cdouglas)
|
||
|
||
HADOOP-6559. Makes the RPC client automatically re-login when the SASL
|
||
connection setup fails. This is applicable only to keytab based logins.
|
||
(Devaraj Das)
|
||
|
||
HADOOP-6551. Delegation token renewing and cancelling should provide
|
||
meaningful exceptions when there are failures instead of returning
|
||
false. (omalley)
|
||
|
||
HADOOP-6583. Captures authentication and authorization metrics. (ddas)
|
||
|
||
HADOOP-6543. Allows secure clients to talk to unsecure clusters.
|
||
(Kan Zhang via ddas)
|
||
|
||
HADOOP-6579. Provide a mechanism for encoding/decoding Tokens from
|
||
a url-safe string and change the commons-codec library to 1.4. (omalley)
|
||
|
||
HADOOP-6596. Add a version field to the AbstractDelegationTokenIdentifier's
|
||
serialized value. (omalley)
|
||
|
||
HADOOP-6573. Support for persistent delegation tokens.
|
||
(Jitendra Pandey via shv)
|
||
|
||
HADOOP-6594. Provide a fetchdt tool via bin/hdfs. (jhoman via acmurthy)
|
||
|
||
HADOOP-6589. Provide better error messages when RPC authentication fails.
|
||
(Kan Zhang via omalley)
|
||
|
||
HADOOP-6599. Split existing RpcMetrics into RpcMetrics & RpcDetailedMetrics.
|
||
(Suresh Srinivas via Sanjay Radia)
|
||
|
||
HADOOP-6537. Declare more detailed exceptions in FileContext and
|
||
AbstractFileSystem (Suresh Srinivas via Sanjay Radia)
|
||
|
||
HADOOP-6486. Fix common classes to work with Avro 1.3 reflection.
|
||
(cutting via tomwhite)
|
||
|
||
HADOOP-6591. HarFileSystem can handle paths with whitespace characters.
|
||
(Rodrigo Schmidt via dhruba)
|
||
|
||
HADOOP-6407. Have a way to automatically update Eclipse .classpath file
|
||
when new libs are added to the classpath through Ivy. (tomwhite)
|
||
|
||
HADOOP-3659. Patch to allow hadoop native to compile on Mac OS X.
|
||
(Colin Evans and Allen Wittenauer via tomwhite)
|
||
|
||
HADOOP-6471. StringBuffer -> StringBuilder - conversion of references
|
||
as necessary. (Kay Kay via tomwhite)
|
||
|
||
HADOOP-6646. Move HarFileSystem out of Hadoop Common. (mahadev)
|
||
|
||
HADOOP-6566. Add methods supporting, enforcing narrower permissions on
|
||
local daemon directories. (Arun Murthy and Luke Lu via cdouglas)
|
||
|
||
HADOOP-6705. Fix to work with 1.5 version of jiracli
|
||
(Giridharan Kesavan)
|
||
|
||
HADOOP-6658. Exclude Private elements from generated Javadoc. (tomwhite)
|
||
|
||
HADOOP-6635. Install/deploy source jars to Maven repo.
|
||
(Patrick Angeles via jghoman)
|
||
|
||
HADOOP-6717. Log levels in o.a.h.security.Groups too high
|
||
(Todd Lipcon via jghoman)
|
||
|
||
HADOOP-6667. RPC.waitForProxy should retry through NoRouteToHostException.
|
||
(Todd Lipcon via tomwhite)
|
||
|
||
HADOOP-6677. InterfaceAudience.LimitedPrivate should take a string not an
|
||
enum. (tomwhite)
|
||
|
||
HADOOP-6678. Remove FileContext#isFile, isDirectory, and exists.
|
||
(Eli Collins via hairong)
|
||
|
||
HADOOP-6515. Make maximum number of http threads configurable.
|
||
(Scott Chen via zshao)
|
||
|
||
HADOOP-6563. Add more symlink tests to cover intermediate symlinks
|
||
in paths. (Eli Collins via suresh)
|
||
|
||
HADOOP-6585. Add FileStatus#isDirectory and isFile. (Eli Collins via
|
||
tomwhite)
|
||
|
||
HADOOP-6738. Move cluster_setup.xml from MapReduce to Common.
|
||
(Tom White via tomwhite)
|
||
|
||
HADOOP-6794. Move configuration and script files post split. (tomwhite)
|
||
|
||
HADOOP-6403. Deprecate EC2 bash scripts. (tomwhite)
|
||
|
||
HADOOP-6769. Add an API in FileSystem to get FileSystem instances based
|
||
on users. (ddas via boryas)
|
||
|
||
HADOOP-6813. Add a new newInstance method in FileSystem that takes
|
||
a "user" as argument (ddas via boryas)
|
||
|
||
HADOOP-6668. Apply audience and stability annotations to classes in
|
||
common. (tomwhite)
|
||
|
||
HADOOP-6821. Document changes to memory monitoring. (Hemanth Yamijala
|
||
via tomwhite)
|
||
|
||
OPTIMIZATIONS
|
||
|
||
HADOOP-5595. NameNode does not need to run a replicator to choose a
|
||
random DataNode. (hairong)
|
||
|
||
HADOOP-5603. Improve NameNode's block placement performance. (hairong)
|
||
|
||
HADOOP-5638. More improvement on block placement performance. (hairong)
|
||
|
||
HADOOP-6180. NameNode slowed down when many files with same filename
|
||
were moved to Trash. (Boris Shkolnik via hairong)
|
||
|
||
HADOOP-6166. Further improve the performance of the pure-Java CRC32
|
||
implementation. (Tsz Wo (Nicholas), SZE via cdouglas)
|
||
|
||
HADOOP-6271. Add recursive and non recursive create and mkdir to
|
||
FileContext. (Sanjay Radia via suresh)
|
||
|
||
HADOOP-6261. Add URI based tests for FileContext.
|
||
(Ravi Phulari via suresh)
|
||
|
||
HADOOP-6307. Add a new SequenceFile.Reader constructor in order to support
|
||
reading on un-closed file. (szetszwo)
|
||
|
||
HADOOP-6467. Improve the performance on HarFileSystem.listStatus(..).
|
||
(mahadev via szetszwo)
|
||
|
||
HADOOP-6569. FsShell#cat should avoid calling unnecessary getFileStatus
|
||
before opening a file to read. (hairong)
|
||
|
||
HADOOP-6689. Add directory renaming test to existing FileContext tests.
|
||
(Eli Collins via suresh)
|
||
|
||
HADOOP-6713. The RPC server Listener thread is a scalability bottleneck.
|
||
(Dmytro Molkov via hairong)
|
||
|
||
BUG FIXES
|
||
|
||
HADOOP-6748. Removes hadoop.cluster.administrators, cluster administrators
|
||
acl is passed as parameter in constructor. (amareshwari)
|
||
|
||
HADOOP-6828. Herriot uses old way of accessing logs directories (Sreekanth
|
||
Ramakrishnan via cos)
|
||
|
||
HADOOP-6788. [Herriot] Exception exclusion functionality is not working
|
||
correctly. (Vinay Thota via cos)
|
||
|
||
HADOOP-6773. Ivy folder contains redundant files (cos)
|
||
|
||
HADOOP-5379. CBZip2InputStream to throw IOException on data crc error.
|
||
(Rodrigo Schmidt via zshao)
|
||
|
||
HADOOP-5326. Fixes CBZip2OutputStream data corruption problem.
|
||
(Rodrigo Schmidt via zshao)
|
||
|
||
HADOOP-4963. Fixes a logging problem to do with getting the location of
|
||
map output file. (Amareshwari Sriramadasu via ddas)
|
||
|
||
HADOOP-2337. Trash should close FileSystem on exit and should not start
|
||
emptying thread if disabled. (shv)
|
||
|
||
HADOOP-5072. Fix failure in TestCodec because testSequenceFileGzipCodec
|
||
won't pass without native gzip codec. (Zheng Shao via dhruba)
|
||
|
||
HADOOP-5050. TestDFSShell.testFilePermissions should not assume umask
|
||
setting. (Jakob Homan via szetszwo)
|
||
|
||
HADOOP-4975. Set classloader for nested mapred.join configs. (Jingkei Ly
|
||
via cdouglas)
|
||
|
||
HADOOP-5078. Remove invalid AMI kernel in EC2 scripts. (tomwhite)
|
||
|
||
HADOOP-5045. FileSystem.isDirectory() should not be deprecated. (Suresh
|
||
Srinivas via szetszwo)
|
||
|
||
HADOOP-4960. Use datasource time, rather than system time, during metrics
|
||
demux. (Eric Yang via cdouglas)
|
||
|
||
HADOOP-5032. Export conf dir set in config script. (Eric Yang via cdouglas)
|
||
|
||
HADOOP-5176. Fix a typo in TestDFSIO. (Ravi Phulari via szetszwo)
|
||
|
||
HADOOP-4859. Distinguish daily rolling output dir by adding a timestamp.
|
||
(Jerome Boulon via cdouglas)
|
||
|
||
HADOOP-4959. Correct system metric collection from top on Redhat 5.1. (Eric
|
||
Yang via cdouglas)
|
||
|
||
HADOOP-5039. Fix log rolling regex to process only the relevant
|
||
subdirectories. (Jerome Boulon via cdouglas)
|
||
|
||
HADOOP-5095. Update Chukwa watchdog to accept config parameter. (Jerome
|
||
Boulon via cdouglas)
|
||
|
||
HADOOP-5147. Correct reference to agent list in Chukwa bin scripts. (Ari
|
||
Rabkin via cdouglas)
|
||
|
||
HADOOP-5148. Fix logic disabling watchdog timer in Chukwa daemon scripts.
|
||
(Ari Rabkin via cdouglas)
|
||
|
||
HADOOP-5100. Append, rather than truncate, when creating log4j metrics in
|
||
Chukwa. (Jerome Boulon via cdouglas)
|
||
|
||
HADOOP-5204. Fix broken trunk compilation on Hudson by letting
|
||
task-controller be an independent target in build.xml.
|
||
(Sreekanth Ramakrishnan via yhemanth)
|
||
|
||
HADOOP-5212. Fix the path translation problem introduced by HADOOP-4868
|
||
running on cygwin. (Sharad Agarwal via omalley)
|
||
|
||
HADOOP-5226. Add license headers to html and jsp files. (szetszwo)
|
||
|
||
HADOOP-5172. Disable misbehaving Chukwa unit test until it can be fixed.
|
||
(Jerome Boulon via nigel)
|
||
|
||
HADOOP-4933. Fixes a ConcurrentModificationException problem that shows up
|
||
when the history viewer is accessed concurrently.
|
||
(Amar Kamat via ddas)
|
||
|
||
HADOOP-5253. Remove duplicate call to cn-docs target.
|
||
(Giri Kesavan via nigel)
|
||
|
||
HADOOP-5251. Fix classpath for contrib unit tests to include clover jar.
|
||
(nigel)
|
||
|
||
HADOOP-5206. Synchronize "unprotected*" methods of FSDirectory on the root.
|
||
(Jakob Homan via shv)
|
||
|
||
HADOOP-5292. Fix NPE in KFS::getBlockLocations. (Sriram Rao via lohit)
|
||
|
||
HADOOP-5219. Adds a new property io.seqfile.local.dir for use by
|
||
SequenceFile, which earlier used mapred.local.dir. (Sharad Agarwal
|
||
via ddas)
|
||
|
||
HADOOP-5300. Fix ant javadoc-dev target and the typo in the class name
|
||
NameNodeActivtyMBean. (szetszwo)
|
||
|
||
HADOOP-5218. libhdfs unit test failed because it was unable to
|
||
start namenode/datanode. Fixed. (dhruba)
|
||
|
||
HADOOP-5273. Add license header to TestJobInProgress.java. (Jakob Homan
|
||
via szetszwo)
|
||
|
||
HADOOP-5229. Remove duplicate version variables in build files
|
||
(Stefan Groschupf via johan)
|
||
|
||
HADOOP-5383. Avoid building an unused string in NameNode's
|
||
verifyReplication(). (Raghu Angadi)
|
||
|
||
HADOOP-5347. Create a job output directory for the bbp examples. (szetszwo)
|
||
|
||
HADOOP-5341. Make hadoop-daemon scripts backwards compatible with the
|
||
changes in HADOOP-4868. (Sharad Agarwal via yhemanth)
|
||
|
||
HADOOP-5456. Fix javadoc links to ClientProtocol#restoreFailedStorage(..).
|
||
(Boris Shkolnik via szetszwo)
|
||
|
||
HADOOP-5458. Remove leftover Chukwa entries from build, etc. (cdouglas)
|
||
|
||
HADOOP-5386. Modify hdfsproxy unit test to start on a random port,
|
||
implement clover instrumentation. (Zhiyong Zhang via cdouglas)
|
||
|
||
HADOOP-5511. Add Apache License to EditLogBackupOutputStream. (shv)
|
||
|
||
HADOOP-5507. Fix JMXGet javadoc warnings. (Boris Shkolnik via szetszwo)
|
||
|
||
HADOOP-5191. Accessing HDFS with any ip or hostname should work as long
|
||
as it points to the interface NameNode is listening on. (Raghu Angadi)
|
||
|
||
HADOOP-5561. Add javadoc.maxmemory parameter to build, preventing OOM
|
||
exceptions from javadoc-dev. (Jakob Homan via cdouglas)
|
||
|
||
HADOOP-5149. Modify HistoryViewer to ignore unfamiliar files in the log
|
||
directory. (Hong Tang via cdouglas)
|
||
|
||
HADOOP-5477. Fix rare failure in TestCLI for hosts returning variations of
|
||
'localhost'. (Jakob Homan via cdouglas)
|
||
|
||
HADOOP-5194. Disables setsid for tasks run on cygwin.
|
||
(Ravi Gummadi via ddas)
|
||
|
||
HADOOP-5322. Fix misleading/outdated comments in JobInProgress.
|
||
(Amareshwari Sriramadasu via cdouglas)
|
||
|
||
HADOOP-5198. Fixes a problem to do with the task PID file being absent and
|
||
the JvmManager trying to look for it. (Amareshwari Sriramadasu via ddas)
|
||
|
||
HADOOP-5464. DFSClient did not treat write timeout of 0 properly.
|
||
(Raghu Angadi)
|
||
|
||
HADOOP-4045. Fix processing of IO errors in EditsLog.
|
||
(Boris Shkolnik via shv)
|
||
|
||
HADOOP-5462. Fixed a double free bug in the task-controller
|
||
executable. (Sreekanth Ramakrishnan via yhemanth)
|
||
|
||
HADOOP-5652. Fix a bug where in-memory segments are incorrectly retained in
|
||
memory. (cdouglas)
|
||
|
||
HADOOP-5533. Recovery duration shown on the jobtracker webpage is
|
||
inaccurate. (Amar Kamat via sharad)
|
||
|
||
HADOOP-5647. Fix TestJobHistory to not depend on /tmp. (Ravi Gummadi
|
||
via sharad)
|
||
|
||
HADOOP-5661. Fixes some findbugs warnings in o.a.h.mapred* packages and
|
||
suppresses a bunch of them. (Jothi Padmanabhan via ddas)
|
||
|
||
HADOOP-5704. Fix compilation problems in TestFairScheduler and
|
||
TestCapacityScheduler. (Chris Douglas via szetszwo)
|
||
|
||
HADOOP-5650. Fix safemode messages in the Namenode log. (Suresh Srinivas
|
||
via szetszwo)
|
||
|
||
HADOOP-5488. Removes the pidfile management for the Task JVM from the
|
||
framework and instead passes the PID back and forth between the
|
||
TaskTracker and the Task processes. (Ravi Gummadi via ddas)
|
||
|
||
HADOOP-5658. Fix Eclipse templates. (Philip Zeyliger via shv)
|
||
|
||
HADOOP-5709. Remove redundant synchronization added in HADOOP-5661. (Jothi
|
||
Padmanabhan via cdouglas)
|
||
|
||
HADOOP-5715. Add conf/mapred-queue-acls.xml to the ignore lists.
|
||
(szetszwo)
|
||
|
||
HADOOP-5592. Fix typo in Streaming doc in reference to GzipCodec.
|
||
(Corinne Chandel via tomwhite)
|
||
|
||
HADOOP-5656. Counter for S3N Read Bytes does not work. (Ian Nowland
|
||
via tomwhite)
|
||
|
||
HADOOP-5406. Fix JNI binding for ZlibCompressor::setDictionary. (Lars
|
||
Francke via cdouglas)
|
||
|
||
HADOOP-3426. Fix/provide handling when DNS lookup fails on the loopback
|
||
address. Also cache the result of the lookup. (Steve Loughran via cdouglas)
|
||
|
||
HADOOP-5476. Close the underlying InputStream in SequenceFile::Reader when
|
||
the constructor throws an exception. (Michael Tamm via cdouglas)
|
||
|
||
HADOOP-5675. Do not launch a job if DistCp has no work to do. (Tsz Wo
|
||
(Nicholas), SZE via cdouglas)
|
||
|
||
HADOOP-5737. Fixes a problem in the way the JobTracker used to talk to
|
||
other daemons like the NameNode to get the job's files. Also adds APIs
|
||
in the JobTracker to get the FileSystem objects as per the JobTracker's
|
||
configuration. (Amar Kamat via ddas)
|
||
|
||
HADOOP-5648. Not able to generate gridmix.jar on the already compiled
|
||
version of hadoop. (gkesavan)
|
||
|
||
HADOOP-5808. Fix import never used javac warnings in hdfs. (szetszwo)
|
||
|
||
HADOOP-5203. TT's version build is too restrictive. (Rick Cox via sharad)
|
||
|
||
HADOOP-5818. Revert the renaming from FSNamesystem.checkSuperuserPrivilege
|
||
to checkAccess by HADOOP-5643. (Amar Kamat via szetszwo)
|
||
|
||
HADOOP-5820. Fix findbugs warnings for http related codes in hdfs.
|
||
(szetszwo)
|
||
|
||
HADOOP-5822. Fix javac warnings in several dfs tests related to unnecessary
|
||
casts. (Jakob Homan via szetszwo)
|
||
|
||
HADOOP-5842. Fix a few javac warnings under packages fs and util.
|
||
(Hairong Kuang via szetszwo)
|
||
|
||
HADOOP-5845. Build successful despite test failure on test-core target.
|
||
(sharad)
|
||
|
||
HADOOP-5314. Prevent unnecessary saving of the file system image during
|
||
name-node startup. (Jakob Homan via shv)
|
||
|
||
HADOOP-5855. Fix javac warnings for DisallowedDatanodeException and
|
||
UnsupportedActionException. (szetszwo)
|
||
|
||
HADOOP-5582. Fixes a problem in Hadoop Vaidya to do with reading
|
||
counters from job history files. (Suhas Gogate via ddas)
|
||
|
||
HADOOP-5829. Fix javac warnings found in ReplicationTargetChooser,
|
||
FSImage, Checkpointer, SecondaryNameNode and a few other hdfs classes.
|
||
(Suresh Srinivas via szetszwo)
|
||
|
||
HADOOP-5835. Fix findbugs warnings found in Block, DataNode, NameNode and
|
||
a few other hdfs classes. (Suresh Srinivas via szetszwo)
|
||
|
||
HADOOP-5853. Undeprecate HttpServer.addInternalServlet method. (Suresh
|
||
Srinivas via szetszwo)
|
||
|
||
HADOOP-5801. Fixes the problem: If the hosts file is changed across restart
|
||
then it should be refreshed upon recovery so that the excluded hosts are
|
||
lost and the maps are re-executed. (Amar Kamat via ddas)
|
||
|
||
HADOOP-5841. Resolve findbugs warnings in DistributedFileSystem,
|
||
DatanodeInfo, BlocksMap, DataNodeDescriptor. (Jakob Homan via szetszwo)
|
||
|
||
HADOOP-5878. Fix import and Serializable javac warnings found in hdfs jsp.
|
||
(szetszwo)
|
||
|
||
HADOOP-5782. Revert a few formatting changes introduced in HADOOP-5015.
|
||
(Suresh Srinivas via rangadi)
|
||
|
||
HADOOP-5687. NameNode throws NPE if fs.default.name is the default value.
|
||
(Philip Zeyliger via shv)
|
||
|
||
HADOOP-5867. Fix javac warnings found in NNBench and NNBenchWithoutMR.
|
||
(Konstantin Boudnik via szetszwo)
|
||
|
||
HADOOP-5728. Fixed FSEditLog.printStatistics IndexOutOfBoundsException.
|
||
(Wang Xu via johan)
|
||
|
||
HADOOP-5847. Fixed failing Streaming unit tests (gkesavan)
|
||
|
||
HADOOP-5252. Streaming overrides -inputformat option (Klaas Bosteels
|
||
via sharad)
|
||
|
||
HADOOP-5710. Counter MAP_INPUT_BYTES missing from new mapreduce api.
|
||
(Amareshwari Sriramadasu via sharad)
|
||
|
||
HADOOP-5809. Fix job submission, broken by errant directory creation.
|
||
(Sreekanth Ramakrishnan and Jothi Padmanabhan via cdouglas)
|
||
|
||
HADOOP-5635. Change distributed cache to work with other distributed file
|
||
systems. (Andrew Hitchcock via tomwhite)
|
||
|
||
HADOOP-5856. Fix "unsafe multithreaded use of DateFormat" findbugs warning
|
||
in DataBlockScanner. (Kan Zhang via szetszwo)
|
||
|
||
HADOOP-4864. Fixes a problem to do with -libjars with multiple jars when
|
||
client and cluster reside on different OSs. (Amareshwari Sriramadasu via
|
||
ddas)
|
||
|
||
HADOOP-5623. Fixes a problem to do with status messages getting overwritten
|
||
in streaming jobs. (Rick Cox and Jothi Padmanabhan via ddas)
|
||
|
||
HADOOP-5895. Fixes computation of count of merged bytes for logging.
|
||
(Ravi Gummadi via ddas)
|
||
|
||
HADOOP-5805. problem using top level s3 buckets as input/output
|
||
directories. (Ian Nowland via tomwhite)
|
||
|
||
HADOOP-5940. trunk eclipse-plugin build fails while trying to copy
|
||
commons-cli jar from the lib dir (Giridharan Kesavan via gkesavan)
|
||
|
||
HADOOP-5864. Fix DMI and OBL findbugs in packages hdfs and metrics.
|
||
(hairong)
|
||
|
||
HADOOP-5935. Fix Hudson's broken release audit warnings link.
|
||
(Giridharan Kesavan via gkesavan)
|
||
|
||
HADOOP-5947. Delete empty TestCombineFileInputFormat.java
|
||
|
||
HADOOP-5899. Move a log message in FSEditLog to the right place for
|
||
avoiding unnecessary log. (Suresh Srinivas via szetszwo)
|
||
|
||
HADOOP-5944. Add Apache license header to BlockManager.java. (Suresh
|
||
Srinivas via szetszwo)
|
||
|
||
HADOOP-5891. SecondaryNamenode is able to converse with the NameNode
|
||
even when the default value of dfs.http.address is not overridden.
|
||
(Todd Lipcon via dhruba)
|
||
|
||
HADOOP-5953. The isDirectory(..) and isFile(..) methods in KosmosFileSystem
|
||
should not be deprecated. (szetszwo)
|
||
|
||
HADOOP-5954. Fix javac warnings in TestFileCreation, TestSmallBlock,
|
||
TestFileStatus, TestDFSShellGenericOptions, TestSeekBug and
|
||
TestDFSStartupVersions. (szetszwo)
|
||
|
||
HADOOP-5956. Fix ivy dependency in hdfsproxy and capacity-scheduler.
|
||
(Giridharan Kesavan via szetszwo)
|
||
|
||
HADOOP-5836. Bug in S3N handling of directory markers using an object with
|
||
a trailing "/" causes jobs to fail. (Ian Nowland via tomwhite)
|
||
|
||
HADOOP-5861. s3n files are not getting split by default. (tomwhite)
|
||
|
||
HADOOP-5762. Fix a problem that DistCp does not copy empty directory.
|
||
(Rodrigo Schmidt via szetszwo)
|
||
|
||
HADOOP-5859. Fix "wait() or sleep() with locks held" findbugs warnings in
|
||
DFSClient. (Kan Zhang via szetszwo)
|
||
|
||
HADOOP-5457. Fix to continue to run builds even if contrib test fails
|
||
(Giridharan Kesavan via gkesavan)
|
||
|
||
HADOOP-5963. Remove an unnecessary exception catch in NNBench. (Boris
|
||
Shkolnik via szetszwo)
|
||
|
||
HADOOP-5989. Fix streaming test failure. (gkesavan)
|
||
|
||
HADOOP-5981. Fix a bug in HADOOP-2838 in parsing mapred.child.env.
|
||
(Amar Kamat via sharad)
|
||
|
||
HADOOP-5420. Fix LinuxTaskController to kill tasks using the process
|
||
groups they are launched with.
|
||
(Sreekanth Ramakrishnan via yhemanth)
|
||
|
||
HADOOP-6031. Remove @author tags from Java source files. (Ravi Phulari
|
||
via szetszwo)
|
||
|
||
HADOOP-5980. Fix LinuxTaskController so tasks get passed
|
||
LD_LIBRARY_PATH and other environment variables.
|
||
(Sreekanth Ramakrishnan via yhemanth)
|
||
|
||
HADOOP-4041. IsolationRunner does not work as documented.
|
||
(Philip Zeyliger via tomwhite)
|
||
|
||
HADOOP-6004. Fixes BlockLocation deserialization. (Jakob Homan via
|
||
szetszwo)
|
||
|
||
HADOOP-6079. Serialize proxySource as DatanodeInfo in DataTransferProtocol.
|
||
(szetszwo)
|
||
|
||
HADOOP-6096. Fix Eclipse project and classpath files following project
|
||
split. (tomwhite)
|
||
|
||
HADOOP-6122. The greater-than operator in test-patch.sh should be "-gt" but
|
||
not ">". (szetszwo)
|
||
|
||
HADOOP-6114. Fix javadoc documentation for FileStatus.getLen.
|
||
(Dmitry Rzhevskiy via dhruba)
|
||
|
||
HADOOP-6131. A sysproperty should not be set unless the property
|
||
is set on the ant command line in build.xml (hong tang via mahadev)
|
||
|
||
HADOOP-6137. Fix project specific test-patch requirements
|
||
(Giridharan Kesavan)
|
||
|
||
HADOOP-6138. Eliminate the deprecation warnings introduced by HADOOP-5438.
|
||
(He Yongqiang via szetszwo)
|
||
|
||
HADOOP-6132. RPC client creates an extra connection because of incorrect
|
||
key for connection cache. (Kan Zhang via rangadi)
|
||
|
||
HADOOP-6123. Add missing classpaths in hadoop-config.sh. (Sharad Agarwal
|
||
via szetszwo)
|
||
|
||
HADOOP-6172. Fix jar file names in hadoop-config.sh and include
|
||
${build.src} as a part of the source list in build.xml. (Hong Tang via
|
||
szetszwo)
|
||
|
||
HADOOP-6124. Fix javac warning detection in test-patch.sh. (Giridharan
|
||
Kesavan via szetszwo)
|
||
|
||
HADOOP-6177. FSInputChecker.getPos() would return position greater
|
||
than the file size. (Hong Tang via hairong)
|
||
|
||
HADOOP-6188. TestTrash uses java.io.File api but not hadoop FileSystem api.
|
||
(Boris Shkolnik via szetszwo)
|
||
|
||
HADOOP-6192. Fix Shell.getUlimitMemoryCommand to not rely on Map-Reduce
|
||
specific configs. (acmurthy)
|
||
|
||
HADOOP-6103. Clones the classloader as part of Configuration clone.
|
||
(Amareshwari Sriramadasu via ddas)
|
||
|
||
HADOOP-6152. Fix classpath variables in bin/hadoop-config.sh and some
|
||
other scripts. (Aaron Kimball via szetszwo)
|
||
|
||
HADOOP-6215. Fix GenericOptionsParser to deal with -D with '=' in the
|
||
value. (Amar Kamat via sharad)
|
||
|
||
HADOOP-6227. Fix Configuration to allow final parameters to be set to null
|
||
and prevent them from being overridden.
|
||
(Amareshwari Sriramadasu via yhemanth)
|
||
|
||
HADOOP-6199. Move io.map.skip.index property to core-default from mapred.
|
||
(Amareshwari Sriramadasu via cdouglas)
|
||
|
||
HADOOP-6229. Attempt to make a directory under an existing file on
|
||
LocalFileSystem should throw an Exception. (Boris Shkolnik via tomwhite)
|
||
|
||
HADOOP-6243. Fix a NullPointerException in processing deprecated keys.
|
||
(Sreekanth Ramakrishnan via yhemanth)
|
||
|
||
HADOOP-6009. S3N listStatus incorrectly returns null instead of empty
|
||
array when called on empty root. (Ian Nowland via tomwhite)
|
||
|
||
HADOOP-6181. Fix .eclipse.templates/.classpath for avro and jets3t jar
|
||
files. (Carlos Valiente via szetszwo)
|
||
|
||
HADOOP-6196. Fix a bug in SequenceFile.Reader where syncing within the
|
||
header would cause the reader to read the sync marker as a record. (Jay
|
||
Booth via cdouglas)
|
||
|
||
HADOOP-6250. Modify test-patch to delete copied XML files before running
|
||
patch build. (Rahul Kumar Singh via yhemanth)
|
||
|
||
HADOOP-6257. Two TestFileSystem classes are confusing
|
||
hadoop-hdfs-hdfwithmr. (Philip Zeyliger via tomwhite)
|
||
|
||
HADOOP-6151. Added an input filter to all of the http servlets that quotes
|
||
html characters in the parameters, to prevent cross site scripting
|
||
attacks. (omalley)
|
||
|
||
HADOOP-6274. Fix TestLocalFSFileContextMainOperations test failure.
|
||
(Gary Murry via suresh)
|
||
|
||
HADOOP-6281. Avoid null pointer exceptions when the jsps don't have
|
||
parameters. (omalley)
|
||
|
||
HADOOP-6285. Fix the result type of the getParameterMap method in the
|
||
HttpServer.QuotingInputFilter. (omalley)
|
||
|
||
HADOOP-6286. Fix bugs related to URI handling in glob methods in
|
||
FileContext. (Boris Shkolnik via suresh)
|
||
|
||
HADOOP-6292. Update native libraries guide. (Corinne Chandel via cdouglas)
|
||
|
||
HADOOP-6327. FileContext tests should not use /tmp and should clean up
|
||
files. (Sanjay Radia via szetszwo)
|
||
|
||
HADOOP-6318. Upgrade to Avro 1.2.0. (cutting)
|
||
|
||
HADOOP-6334. Fix GenericOptionsParser to understand URI for -files,
|
||
-libjars and -archives options and fix Path to support URI with fragment.
|
||
(Amareshwari Sriramadasu via szetszwo)
|
||
|
||
HADOOP-6344. Fix rm and rmr immediately delete files rather than sending
|
||
to trash, if a user is over-quota. (Jakob Homan via suresh)
|
||
|
||
HADOOP-6347. run-test-core-fault-inject runs a test case twice if
|
||
-Dtestcase is set (cos)
|
||
|
||
HADOOP-6375. Sync documentation for FsShell du with its implementation.
|
||
(Todd Lipcon via cdouglas)
|
||
|
||
HADOOP-6441. Protect web ui from cross site scripting attacks (XSS) on
|
||
the host http header and using encoded utf-7. (omalley)
|
||
|
||
HADOOP-6451. Fix build to run contrib unit tests. (Tom White via cdouglas)
|
||
|
||
HADOOP-6374. JUnit tests should never depend on anything in conf.
|
||
(Anatoli Fomenko via cos)
|
||
|
||
HADOOP-6290. Prevent duplicate slf4j-simple jar via Avro's classpath.
|
||
(Owen O'Malley via cdouglas)
|
||
|
||
HADOOP-6293. Fix FsShell -text to work on filesystems other than the
|
||
default. (cdouglas)
|
||
|
||
HADOOP-6341. Fix test-patch.sh for checkTests function. (gkesavan)
|
||
|
||
HADOOP-6314. Fix "fs -help" for the "-count" command. (Ravi Phulari via
|
||
szetszwo)
|
||
|
||
HADOOP-6405. Update Eclipse configuration to match changes to Ivy
|
||
configuration (Edwin Chan via cos)
|
||
|
||
HADOOP-6411. Remove deprecated file src/test/hadoop-site.xml. (cos)
|
||
|
||
HADOOP-6386. NameNode's HttpServer can't instantiate InetSocketAddress:
|
||
IllegalArgumentException is thrown (cos)
|
||
|
||
HADOOP-6254. Slow reads cause s3n to fail with SocketTimeoutException.
|
||
(Andrew Hitchcock via tomwhite)
|
||
|
||
HADOOP-6428. HttpServer sleeps with negative values. (cos)
|
||
|
||
HADOOP-6414. Add command line help for -expunge command.
|
||
(Ravi Phulari via tomwhite)
|
||
|
||
HADOOP-6391. Classpath should not be part of command line arguments.
|
||
(Cristian Ivascu via tomwhite)
|
||
|
||
HADOOP-6462. Target "compile" does not exist in contrib/cloud. (tomwhite)
|
||
|
||
HADOOP-6402. testConf.xsl is not well-formed XML. (Steve Loughran
|
||
via tomwhite)
|
||
|
||
HADOOP-6489. Fix 3 findbugs warnings. (Erik Steffl via suresh)
|
||
|
||
HADOOP-6517. Fix UserGroupInformation so that tokens are saved/retrieved
|
||
to/from the embedded Subject (Owen O'Malley & Kan Zhang via ddas)
|
||
|
||
HADOOP-6538. Sets hadoop.security.authentication to simple by default.
|
||
(ddas)
|
||
|
||
HADOOP-6540. Contrib unit tests have invalid XML for core-site, etc.
|
||
(Aaron Kimball via tomwhite)
|
||
|
||
HADOOP-6521. User specified umask using deprecated dfs.umask must override
|
||
server configured using new dfs.umaskmode for backward compatibility.
|
||
(suresh)
|
||
|
||
HADOOP-6522. Fix decoding of codepoint zero in UTF8. (cutting)
|
||
|
||
HADOOP-6505. Use tr rather than sed to effect literal substitution in the
|
||
build script. (Allen Wittenauer via cdouglas)
|
||
|
||
HADOOP-6548. Replace mortbay imports with commons logging. (cdouglas)
|
||
|
||
HADOOP-6560. Handle invalid har:// uri in HarFileSystem. (szetszwo)
|
||
|
||
HADOOP-6549. TestDoAsEffectiveUser should use ip address of the host
|
||
for superuser ip check. (jnp via boryas)
|
||
|
||
HADOOP-6570. RPC#stopProxy throws NPE if getProxyEngine(proxy) returns
|
||
null. (hairong)
|
||
|
||
HADOOP-6558. Return null in HarFileSystem.getFileChecksum(..) since no
|
||
checksum algorithm is implemented. (szetszwo)
|
||
|
||
HADOOP-6572. Makes sure that SASL encryption and push to responder
|
||
queue for the RPC response happens atomically. (Kan Zhang via ddas)
|
||
|
||
HADOOP-6545. Changes the Key for the FileSystem cache to be UGI (ddas)
|
||
|
||
HADOOP-6609. Fixed deadlock in RPC by replacing shared static
|
||
DataOutputBuffer in the UTF8 class with a thread local variable. (omalley)
|
||
|
||
HADOOP-6504. Invalid example in the documentation of
|
||
org.apache.hadoop.util.Tool. (Benoit Sigoure via tomwhite)
|
||
|
||
HADOOP-6546. BloomMapFile can return false negatives. (Clark Jefcoat
|
||
via tomwhite)
|
||
|
||
HADOOP-6593. TextRecordInputStream doesn't close SequenceFile.Reader.
|
||
(Chase Bradford via tomwhite)
|
||
|
||
HADOOP-6175. Incorrect version compilation with es_ES.ISO8859-15 locale
|
||
on Solaris 10. (Urko Benito via tomwhite)
|
||
|
||
HADOOP-6645. Bugs on listStatus for HarFileSystem (rodrigo via mahadev)
|
||
|
||
HADOOP-6645. Re: Bugs on listStatus for HarFileSystem (rodrigo via
|
||
mahadev)
|
||
|
||
HADOOP-6654. Fix code example in WritableComparable javadoc. (Tom White
|
||
via szetszwo)
|
||
|
||
HADOOP-6640. FileSystem.get() does RPC retries within a static
|
||
synchronized block. (hairong)
|
||
|
||
HADOOP-6691. TestFileSystemCaching sometimes hangs. (hairong)
|
||
|
||
HADOOP-6507. Hadoop Common Docs - delete 3 doc files that do not belong
|
||
under Common. (Corinne Chandel via tomwhite)
|
||
|
||
HADOOP-6439. Fixes handling of deprecated keys to follow order in which
|
||
keys are defined. (V.V.Chaitanya Krishna via yhemanth)
|
||
|
||
HADOOP-6690. FilterFileSystem correctly handles setTimes call.
|
||
(Rodrigo Schmidt via dhruba)
|
||
|
||
HADOOP-6703. Prevent renaming a file, directory or symbolic link to
|
||
itself. (Eli Collins via suresh)
|
||
|
||
HADOOP-6710. Symbolic umask for file creation is not conformant with posix.
|
||
(suresh)
|
||
|
||
HADOOP-6719. Insert all missing methods in FilterFs.
|
||
(Rodrigo Schmidt via dhruba)
|
||
|
||
HADOOP-6724. IPC doesn't properly handle IOEs thrown by socket factory.
|
||
(Todd Lipcon via tomwhite)
|
||
|
||
HADOOP-6722. NetUtils.connect should check that it hasn't connected a socket
|
||
to itself. (Todd Lipcon via tomwhite)
|
||
|
||
HADOOP-6634. Fix AccessControlList to use short names to verify access
|
||
control. (Vinod Kumar Vavilapalli via sharad)
|
||
|
||
HADOOP-6709. Re-instate deprecated FileSystem methods that were removed
|
||
after 0.20. (tomwhite)
|
||
|
||
HADOOP-6630. hadoop-config.sh fails to get executed if hadoop wrapper
|
||
scripts are in path. (Allen Wittenauer via tomwhite)
|
||
|
||
HADOOP-6742. Add methods from HADOOP-6709 to TestFilterFileSystem.
|
||
(Eli Collins via tomwhite)
|
||
|
||
HADOOP-6727. Remove UnresolvedLinkException from public FileContext APIs.
|
||
(Eli Collins via tomwhite)
|
||
|
||
HADOOP-6631. Fix FileUtil.fullyDelete() to continue deleting other files
|
||
despite failure at any level. (Contributed by Ravi Gummadi and
|
||
Vinod Kumar Vavilapalli)
|
||
|
||
HADOOP-6723. Unchecked exceptions thrown in IPC Connection should not
|
||
orphan clients. (Todd Lipcon via tomwhite)
|
||
|
||
HADOOP-6404. Rename the generated artifacts to common instead of core.
|
||
(tomwhite)
|
||
|
||
HADOOP-6461. Webapps aren't located correctly post-split.
|
||
(Todd Lipcon and Steve Loughran via tomwhite)
|
||
|
||
HADOOP-6826. Revert FileSystem create method that takes CreateFlags.
|
||
(tomwhite)
|
||
|
||
HADOOP-6800. Harmonize JAR library versions. (tomwhite)
|
||
|
||
HADOOP-6847. Problem staging 0.21.0 artifacts to Apache Nexus Maven
|
||
Repository (Giridharan Kesavan via cos)
|
||
|
||
HADOOP-6819. [Herriot] Shell command for getting the new exceptions in
|
||
the logs returning exitcode 1 after executing successfully. (Vinay Thota
|
||
via cos)
|
||
|
||
HADOOP-6839. [Herriot] Implement functionality for getting the user list
|
||
for creating proxy users. (Vinay Thota via cos)
|
||
|
||
HADOOP-6836. [Herriot]: Generic method for adding/modifying the attributes
|
||
for new configuration. (Vinay Thota via cos)
|
||
|
||
HADOOP-6860. 'compile-fault-inject' should never be called directly.
|
||
(Konstantin Boudnik)
|
||
|
||
HADOOP-6790. Instrumented (Herriot) build uses too wide mask to include
|
||
aspect files. (Konstantin Boudnik)
|
||
|
||
HADOOP-6875. [Herriot] Cleanup of temp. configurations is needed upon
|
||
restart of a cluster (Vinay Thota via cos)
|
||
|
||
Release 0.20.3 - Unreleased
|
||
|
||
NEW FEATURES
|
||
|
||
HADOOP-6637. Benchmark for establishing RPC session. (shv)
|
||
|
||
BUG FIXES
|
||
|
||
HADOOP-6760. WebServer shouldn't increase port number in case of negative
|
||
port setting caused by Jetty's race (cos)
|
||
|
||
HADOOP-6881. Make WritableComparator initialize classes when
|
||
looking for their raw comparator, as classes often register raw
|
||
comparators in initializers, which are no longer automatically run
|
||
in Java 6 when a class is referenced. (cutting via omalley)
|
||
|
||
HADOOP-7072. Remove java5 dependencies from build. (cos)
|
||
|
||
Release 0.20.204.0 - Unreleased
|
||
|
||
NEW FEATURES
|
||
|
||
HADOOP-6255. Create RPM and Debian packages for common. Changes deployment
|
||
layout to be consistent across the binary tgz, rpm, and deb. Adds setup
|
||
scripts for easy one node cluster configuration and user creation.
|
||
(Eric Yang via omalley)
|
||
|
||
Release 0.20.203.0 - 2011-5-11
|
||
|
||
BUG FIXES
|
||
|
||
HADOOP-7258. The Gzip codec should not return null decompressors. (omalley)
|
||
|
||
Release 0.20.2 - 2010-2-16
|
||
|
||
NEW FEATURES
|
||
|
||
HADOOP-6218. Adds a feature where TFile can be split by Record
|
||
Sequence number. (Hong Tang and Raghu Angadi via ddas)
|
||
|
||
BUG FIXES
|
||
|
||
HADOOP-6231. Allow caching of filesystem instances to be disabled on a
|
||
per-instance basis. (tomwhite)
|
||
|
||
HADOOP-5759. Fix for IllegalArgumentException when CombineFileInputFormat
|
||
is used as job InputFormat. (Amareshwari Sriramadasu via dhruba)
|
||
|
||
HADOOP-6097. Fix Path conversion in makeQualified and reset LineReader byte
|
||
count at the start of each block in Hadoop archives. (Ben Slusky, Tom
|
||
White, and Mahadev Konar via cdouglas)
|
||
|
||
HADOOP-6269. Fix threading issue with defaultResource in Configuration.
|
||
(Sreekanth Ramakrishnan via cdouglas)
|
||
|
||
HADOOP-6460. Reinitializes buffers used for serializing responses in ipc
|
||
server on exceeding maximum response size to free up Java heap. (suresh)
|
||
|
||
HADOOP-6315. Avoid incorrect use of BuiltInflater/BuiltInDeflater in
|
||
GzipCodec. (Aaron Kimball via cdouglas)
|
||
|
||
HADOOP-6498. IPC client bug may cause rpc call hang. (Ruyue Ma and
|
||
hairong via hairong)
|
||
|
||
IMPROVEMENTS
|
||
|
||
HADOOP-5611. Fix C++ libraries to build on Debian Lenny. (Todd Lipcon
|
||
via tomwhite)
|
||
|
||
HADOOP-5612. Some c++ scripts are not chmodded before ant execution.
|
||
(Todd Lipcon via tomwhite)
|
||
|
||
HADOOP-1849. Add undocumented configuration parameter for per handler
|
||
call queue size in IPC Server. (shv)
|
||
|
||
Release 0.20.1 - 2009-09-01
|
||
|
||
INCOMPATIBLE CHANGES
|
||
|
||
HADOOP-5726. Remove pre-emption from capacity scheduler code base.
|
||
(Rahul Kumar Singh via yhemanth)
|
||
|
||
HADOOP-5881. Simplify memory monitoring and scheduling related
|
||
configuration. (Vinod Kumar Vavilapalli via yhemanth)
|
||
|
||
NEW FEATURES
|
||
|
||
HADOOP-6080. Introduce -skipTrash option to rm and rmr.
|
||
(Jakob Homan via shv)
|
||
|
||
HADOOP-3315. Add a new, binary file format, TFile. (Hong Tang via cdouglas)
|
||
|
||
IMPROVEMENTS
|
||
|
||
HADOOP-5711. Change Namenode file close log to info. (szetszwo)
|
||
|
||
HADOOP-5736. Update the capacity scheduler documentation for features
|
||
like memory based scheduling, job initialization and removal of pre-emption.
|
||
(Sreekanth Ramakrishnan via yhemanth)
|
||
|
||
HADOOP-5714. Add a metric for NameNode getFileInfo operation. (Jakob Homan
|
||
via szetszwo)
|
||
|
||
HADOOP-4372. Improves the way history filenames are obtained and manipulated.
|
||
(Amar Kamat via ddas)
|
||
|
||
HADOOP-5897. Add name-node metrics to capture java heap usage.
|
||
(Suresh Srinivas via shv)
|
||
|
||
OPTIMIZATIONS
|
||
|
||
BUG FIXES
|
||
|
||
HADOOP-5691. Makes org.apache.hadoop.mapreduce.Reducer concrete class
|
||
instead of abstract. (Amareshwari Sriramadasu via sharad)
|
||
|
||
HADOOP-5646. Fixes a problem in TestQueueCapacities.
|
||
(Vinod Kumar Vavilapalli via ddas)
|
||
|
||
HADOOP-5655. TestMRServerPorts fails on java.net.BindException. (Devaraj
|
||
Das via hairong)
|
||
|
||
HADOOP-5654. TestReplicationPolicy.<init> fails on java.net.BindException.
|
||
(hairong)
|
||
|
||
HADOOP-5688. Fix HftpFileSystem checksum path construction. (Tsz Wo
|
||
(Nicholas) Sze via cdouglas)
|
||
|
||
HADOOP-4674. Fix fs help messages for -test, -text, -tail, -stat
|
||
and -touchz options. (Ravi Phulari via szetszwo)
|
||
|
||
HADOOP-5718. Remove the check for the default queue in capacity scheduler.
|
||
(Sreekanth Ramakrishnan via yhemanth)
|
||
|
||
HADOOP-5719. Remove jobs that failed initialization from the waiting queue
|
||
in the capacity scheduler. (Sreekanth Ramakrishnan via yhemanth)
|
||
|
||
HADOOP-4744. Attaching another fix to the jetty port issue. The TaskTracker
|
||
kills itself if it ever discovers that the port to which jetty is actually
|
||
bound is invalid (-1). (ddas)
|
||
|
||
HADOOP-5349. Fixes a problem in LocalDirAllocator to check for the return
|
||
path value that is returned for the case where the file we want to write
|
||
is of an unknown size. (Vinod Kumar Vavilapalli via ddas)
|
||
|
||
HADOOP-5636. Prevents a job from going to RUNNING state after it has been
|
||
KILLED (this used to happen when the SetupTask would come back with a
|
||
success after the job has been killed). (Amar Kamat via ddas)
|
||
|
||
HADOOP-5641. Fix a NullPointerException in capacity scheduler's memory
|
||
based scheduling code when jobs get retired. (yhemanth)
|
||
|
||
HADOOP-5828. Use absolute path for mapred.local.dir of JobTracker in
|
||
MiniMRCluster. (yhemanth)
|
||
|
||
HADOOP-4981. Fix capacity scheduler to schedule speculative tasks
|
||
correctly in the presence of High RAM jobs.
|
||
(Sreekanth Ramakrishnan via yhemanth)
|
||
|
||
HADOOP-5210. Solves a problem in the progress report of the reduce task.
|
||
(Ravi Gummadi via ddas)
|
||
|
||
HADOOP-5850. Fixes a problem to do with not being able to run jobs with
|
||
0 maps/reduces. (Vinod K V via ddas)
|
||
|
||
HADOOP-4626. Correct the API links in hdfs forrest doc so that they
|
||
point to the same version of hadoop. (szetszwo)
|
||
|
||
HADOOP-5883. Fixed tasktracker memory monitoring to account for
|
||
momentary spurts in memory usage due to java's fork() model.
|
||
(yhemanth)
|
||
|
||
HADOOP-5539. Fixes a problem to do with not preserving intermediate
|
||
output compression for merged data.
|
||
(Jothi Padmanabhan and Billy Pearson via ddas)
|
||
|
||
HADOOP-5932. Fixes a problem in capacity scheduler in computing
|
||
available memory on a tasktracker.
|
||
(Vinod Kumar Vavilapalli via yhemanth)
|
||
|
||
HADOOP-5908. Fixes a problem to do with ArithmeticException in the
|
||
JobTracker when there are jobs with 0 maps. (Amar Kamat via ddas)
|
||
|
||
HADOOP-5924. Fixes a corner case problem to do with job recovery with
|
||
empty history files. Also, after a JT restart, sends KillTaskAction to
|
||
tasks that report back but the corresponding job hasn't been initialized
|
||
yet. (Amar Kamat via ddas)
|
||
|
||
HADOOP-5882. Fixes a reducer progress update problem for new mapreduce
|
||
api. (Amareshwari Sriramadasu via sharad)
|
||
|
||
HADOOP-5746. Fixes a corner case problem in Streaming, where if an exception
|
||
happens in MROutputThread after the last call to the map/reduce method, the
|
||
exception goes undetected. (Amar Kamat via ddas)
|
||
|
||
HADOOP-5884. Fixes accounting in capacity scheduler so that high RAM jobs
|
||
take more slots. (Vinod Kumar Vavilapalli via yhemanth)
|
||
|
||
HADOOP-5937. Correct a safemode message in FSNamesystem. (Ravi Phulari
|
||
via szetszwo)
|
||
|
||
HADOOP-5869. Fix bug in assignment of setup / cleanup task that was
|
||
causing TestQueueCapacities to fail.
|
||
(Sreekanth Ramakrishnan via yhemanth)
|
||
|
||
HADOOP-5921. Fixes a problem in the JobTracker where it sometimes never used
|
||
to come up due to a system file creation on JobTracker's system-dir failing.
|
||
This problem would sometimes show up only when the FS for the system-dir
|
||
(usually HDFS) is started at nearly the same time as the JobTracker.
|
||
(Amar Kamat via ddas)
|
||
|
||
HADOOP-5920. Fixes a testcase failure for TestJobHistory.
|
||
(Amar Kamat via ddas)
|
||
|
||
HADOOP-6139. Fix the FsShell help messages for rm and rmr. (Jakob Homan
|
||
via szetszwo)
|
||
|
||
HADOOP-6145. Fix FsShell rm/rmr error messages when there is a FNFE.
|
||
(Jakob Homan via szetszwo)
|
||
|
||
HADOOP-6150. Users should be able to instantiate comparator using TFile
|
||
API. (Hong Tang via rangadi)
|
||
|
||
Release 0.20.0 - 2009-04-15
|
||
|
||
INCOMPATIBLE CHANGES
|
||
|
||
HADOOP-4210. Fix findbugs warnings for equals implementations of mapred ID
|
||
classes. Removed public, static ID::read and ID::forName; made ID an
|
||
abstract class. (Suresh Srinivas via cdouglas)
|
||
|
||
HADOOP-4253. Fix various warnings generated by findbugs.
|
||
Following deprecated methods in RawLocalFileSystem are removed:
|
||
public String getName()
|
||
public void lock(Path p, boolean shared)
|
||
public void release(Path p)
|
||
(Suresh Srinivas via johan)
|
||
|
||
HADOOP-4618. Move http server from FSNamesystem into NameNode.
|
||
FSNamesystem.getNameNodeInfoPort() is removed.
|
||
FSNamesystem.getDFSNameNodeMachine() and FSNamesystem.getDFSNameNodePort()
|
||
replaced by FSNamesystem.getDFSNameNodeAddress().
|
||
NameNode(bindAddress, conf) is removed.
|
||
(shv)
|
||
|
||
HADOOP-4567. GetFileBlockLocations returns the NetworkTopology
|
||
information of the machines where the blocks reside. (dhruba)
|
||
|
||
HADOOP-4435. The JobTracker WebUI displays the amount of heap memory
|
||
in use. (dhruba)
|
||
|
||
HADOOP-4628. Move Hive into a standalone subproject. (omalley)
|
||
|
||
HADOOP-4188. Removes task's dependency on concrete filesystems.
|
||
(Sharad Agarwal via ddas)
|
||
|
||
HADOOP-1650. Upgrade to Jetty 6. (cdouglas)
|
||
|
||
HADOOP-3986. Remove static Configuration from JobClient. (Amareshwari
|
||
Sriramadasu via cdouglas)
|
||
JobClient::setCommandLineConfig is removed
|
||
JobClient::getCommandLineConfig is removed
|
||
JobShell, TestJobShell classes are removed
|
||
|
||
HADOOP-4422. S3 file systems should not create bucket.
|
||
(David Phillips via tomwhite)
|
||
|
||
HADOOP-4035. Support memory based scheduling in capacity scheduler.
|
||
(Vinod Kumar Vavilapalli via yhemanth)
|
||
|
||
HADOOP-3497. Fix bug in overly restrictive file globbing with a
|
||
PathFilter. (tomwhite)
|
||
|
||
HADOOP-4445. Replace running task counts with running task
|
||
percentage in capacity scheduler UI. (Sreekanth Ramakrishnan via
|
||
yhemanth)
|
||
|
||
HADOOP-4631. Splits the configuration into three parts - one for core,
|
||
one for mapred and the last one for HDFS. (Sharad Agarwal via cdouglas)
|
||
|
||
HADOOP-3344. Fix libhdfs build to use autoconf and build the same
|
||
architecture (32 vs 64 bit) of the JVM running Ant. The libraries for
|
||
pipes, utils, and libhdfs are now all in c++/<os_osarch_jvmdatamodel>/lib.
|
||
(Giridharan Kesavan via nigel)
|
||
|
||
HADOOP-4874. Remove LZO codec because of licensing issues. (omalley)
|
||
|
||
HADOOP-4970. The full path name of a file is preserved inside Trash.
|
||
(Prasad Chakka via dhruba)
|
||
|
||
HADOOP-4103. NameNode keeps a count of missing blocks. It warns on
|
||
WebUI if there are such blocks. '-report' and '-metaSave' have extra
|
||
info to track such blocks. (Raghu Angadi)
|
||
|
||
HADOOP-4783. Change permissions on history files on the jobtracker
|
||
to be only group readable instead of world readable.
|
||
(Amareshwari Sriramadasu via yhemanth)
|
||
|
||
NEW FEATURES
|
||
|
||
HADOOP-4575. Add a proxy service for relaying HsftpFileSystem requests.
|
||
Includes client authentication via user certificates and config-based
|
||
access control. (Kan Zhang via cdouglas)
|
||
|
||
HADOOP-4661. Add DistCh, a new tool for distributed ch{mod,own,grp}.
|
||
(szetszwo)
|
||
|
||
HADOOP-4709. Add several new features and bug fixes to Chukwa.
|
||
Added Hadoop Infrastructure Care Center (UI for visualizing data collected
|
||
by Chukwa)
|
||
Added FileAdaptor for streaming small file in one chunk
|
||
Added compression to archive and demux output
|
||
Added unit tests and validation for agent, collector, and demux map
|
||
reduce job
|
||
Added database loader for loading demux output (sequence file) to jdbc
|
||
connected database
|
||
Added algorithm to distribute collector load more evenly
|
||
(Jerome Boulon, Eric Yang, Andy Konwinski, Ariel Rabkin via cdouglas)
|
||
|
||
HADOOP-4179. Add Vaidya tool to analyze map/reduce job logs for performance
|
||
problems. (Suhas Gogate via omalley)
|
||
|
||
HADOOP-4029. Add NameNode storage information to the dfshealth page and
|
||
move DataNode information to a separated page. (Boris Shkolnik via
|
||
szetszwo)
|
||
|
||
HADOOP-4348. Add service-level authorization for Hadoop. (acmurthy)
|
||
|
||
HADOOP-4826. Introduce admin command saveNamespace. (shv)
|
||
|
||
HADOOP-3063. BloomMapFile - fail-fast version of MapFile for sparsely
|
||
populated key space (Andrzej Bialecki via stack)
|
||
|
||
HADOOP-1230. Add new map/reduce API and deprecate the old one. Generally,
|
||
the old code should work without problem. The new api is in
|
||
org.apache.hadoop.mapreduce and the old classes in org.apache.hadoop.mapred
|
||
are deprecated. Differences in the new API:
|
||
1. All of the methods take Context objects that allow us to add new
|
||
methods without breaking compatibility.
|
||
2. Mapper and Reducer now have a "run" method that is called once and
|
||
contains the control loop for the task, which lets applications
|
||
replace it.
|
||
3. Mapper and Reducer by default are Identity Mapper and Reducer.
|
||
4. The FileOutputFormats use part-r-00000 for the output of reduce 0 and
|
||
part-m-00000 for the output of map 0.
|
||
5. The reduce grouping comparator now uses the raw compare instead of
|
||
object compare.
|
||
6. The number of maps in FileInputFormat is controlled by min and max
|
||
split size rather than min size and the desired number of maps.
|
||
(omalley)
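As a small illustration of item 4 above, the output file naming can be sketched as follows. This is only a sketch of the naming pattern stated in the entry; the helper name part_file_name is hypothetical and is not part of the Hadoop API.

```python
def part_file_name(task_type: str, partition: int) -> str:
    """Build an output file name in the part-<type>-<NNNNN> pattern.

    task_type is "m" for a map task or "r" for a reduce task; the
    partition number is zero-padded to five digits, as in part-r-00000.
    This helper is hypothetical and only mirrors the pattern in the entry.
    """
    if task_type not in ("m", "r"):
        raise ValueError("task_type must be 'm' or 'r'")
    return f"part-{task_type}-{partition:05d}"


if __name__ == "__main__":
    print(part_file_name("r", 0))
    print(part_file_name("m", 0))
```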
|
||
|
||
HADOOP-3305. Use Ivy to manage dependencies. (Giridharan Kesavan
|
||
and Steve Loughran via cutting)
|
||
|
||
IMPROVEMENTS
|
||
|
||
HADOOP-4749. Added a new counter REDUCE_INPUT_BYTES. (Yongqiang He via
|
||
zshao)
|
||
|
||
HADOOP-4234. Fix KFS "glue" layer to allow applications to interface
|
||
with multiple KFS metaservers. (Sriram Rao via lohit)
|
||
|
||
HADOOP-4245. Update to latest version of KFS "glue" library jar.
|
||
(Sriram Rao via lohit)
|
||
|
||
HADOOP-4244. Change test-patch.sh to check Eclipse classpath whether or not
|
||
it is run by Hudson. (szetszwo)
|
||
|
||
HADOOP-3180. Add name of missing class to WritableName.getClass
|
||
IOException. (Pete Wyckoff via omalley)
|
||
|
||
HADOOP-4178. Make the capacity scheduler's default values configurable.
|
||
(Sreekanth Ramakrishnan via omalley)
|
||
|
||
HADOOP-4262. Generate better error message when client exception has null
|
||
message. (stevel via omalley)
|
||
|
||
HADOOP-4226. Refactor and document LineReader to make it more readily
|
||
understandable. (Yuri Pradkin via cdouglas)
|
||
|
||
HADOOP-4238. When listing jobs, if scheduling information isn't available
|
||
print NA instead of empty output. (Sreekanth Ramakrishnan via johan)
|
||
|
||
HADOOP-4284. Support filters that apply to all requests, or global filters,
|
||
to HttpServer. (Kan Zhang via cdouglas)
|
||
|
||
HADOOP-4276. Improve the hashing functions and deserialization of the
|
||
mapred ID classes. (omalley)
|
||
|
||
HADOOP-4485. Add a compile-native ant task, as a shorthand. (enis)
|
||
|
||
HADOOP-4454. Allow # comments in slaves file. (Rama Ramasamy via omalley)
|
||
|
||
HADOOP-3461. Remove hdfs.StringBytesWritable. (szetszwo)
|
||
|
||
HADOOP-4437. Use Halton sequence instead of java.util.Random in
|
||
PiEstimator. (szetszwo)
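For readers unfamiliar with the technique named above, a minimal sketch of estimating pi with a Halton sequence instead of pseudo-random numbers follows. It is not the PiEstimator code; the function names, the choice of bases 2 and 3, and the sample count are assumptions made for illustration.

```python
def halton(index: int, base: int) -> float:
    """Return element `index` (1-based) of the Halton sequence for `base`.

    This is the radical inverse: the base-`base` digits of `index` are
    mirrored around the radix point, giving a value in [0, 1).
    """
    result = 0.0
    fraction = 1.0 / base
    i = index
    while i > 0:
        result += fraction * (i % base)
        i //= base
        fraction /= base
    return result


def estimate_pi(num_points: int) -> float:
    """Estimate pi from 2-D Halton points (bases 2 and 3, an assumption).

    Points are placed in the unit square; the fraction falling inside
    the inscribed circle of radius 0.5 approximates pi / 4.
    """
    inside = 0
    for i in range(1, num_points + 1):
        x = halton(i, 2) - 0.5
        y = halton(i, 3) - 0.5
        if x * x + y * y <= 0.25:
            inside += 1
    return 4.0 * inside / num_points


if __name__ == "__main__":
    print(estimate_pi(20000))
```

Because a Halton sequence covers the square more evenly than independent random draws, the estimate tends to converge faster for the same number of points.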
|
||
|
||
HADOOP-4572. Change INode and its sub-classes to package private.
|
||
(szetszwo)
|
||
|
||
HADOOP-4187. Does a runtime lookup for JobConf/JobConfigurable, and if
|
||
found, invokes the appropriate configure method. (Sharad Agarwal via ddas)
|
||
|
||
HADOOP-4453. Improve ssl configuration and handling in HsftpFileSystem,
|
||
particularly when used with DistCp. (Kan Zhang via cdouglas)
|
||
|
||
HADOOP-4583. Several code optimizations in HDFS. (Suresh Srinivas via
|
||
szetszwo)
|
||
|
||
HADOOP-3923. Remove org.apache.hadoop.mapred.StatusHttpServer. (szetszwo)
|
||
|
||
HADOOP-4622. Explicitly specify interpreter for non-native
|
||
pipes binaries. (Fredrik Hedberg via johan)
|
||
|
||
HADOOP-4505. Add a unit test to test faulty setup task and cleanup
|
||
task killing the job. (Amareshwari Sriramadasu via johan)
|
||
|
||
HADOOP-4608. Don't print a stack trace when the example driver gets an
|
||
unknown program to run. (Edward Yoon via omalley)
|
||
|
||
HADOOP-4645. Package HdfsProxy contrib project without the extra level
|
||
of directories. (Kan Zhang via omalley)
|
||
|
||
HADOOP-4126. Allow access to HDFS web UI on EC2 (tomwhite via omalley)
|
||
|
||
HADOOP-4612. Removes RunJar's dependency on JobClient.
|
||
(Sharad Agarwal via ddas)
|
||
|
||
HADOOP-4185. Adds setVerifyChecksum() method to FileSystem.
|
||
(Sharad Agarwal via ddas)
|
||
|
||
HADOOP-4523. Prevent too many tasks scheduled on a node from bringing
|
||
it down by monitoring for cumulative memory usage across tasks.
|
||
(Vinod Kumar Vavilapalli via yhemanth)
|
||
|
||
HADOOP-4640. Adds an input format that can split lzo compressed
|
||
text files. (johan)
|
||
|
||
HADOOP-4666. Launch reduces only after a few maps have run in the
|
||
Fair Scheduler. (Matei Zaharia via johan)
|
||
|
||
HADOOP-4339. Remove redundant calls from FileSystem/FsShell when
|
||
generating/processing ContentSummary. (David Phillips via cdouglas)
|
||
|
||
HADOOP-2774. Add counters tracking records spilled to disk in MapTask and
|
||
ReduceTask. (Ravi Gummadi via cdouglas)
|
||
|
||
HADOOP-4513. Initialize jobs asynchronously in the capacity scheduler.
|
||
(Sreekanth Ramakrishnan via yhemanth)
|
||
|
||
HADOOP-4649. Improve abstraction for spill indices. (cdouglas)
|
||
|
||
HADOOP-3770. Add gridmix2, an iteration on the gridmix benchmark. (Runping
|
||
Qi via cdouglas)
|
||
|
||
HADOOP-4708. Add support for dfsadmin commands in TestCLI. (Boris Shkolnik
|
||
via cdouglas)
|
||
|
||
HADOOP-4758. Add a splitter for metrics contexts to support more than one
|
||
type of collector. (cdouglas)
|
||
|
||
HADOOP-4722. Add tests for dfsadmin quota error messages. (Boris Shkolnik
|
||
via cdouglas)
|
||
|
||
HADOOP-4690. fuse-dfs - create source file/function + utils + config +
|
||
main source files. (pete wyckoff via mahadev)
|
||
|
||
HADOOP-3750. Fix and enforce module dependencies. (Sharad Agarwal via
|
||
tomwhite)
|
||
|
||
HADOOP-4747. Speed up FsShell::ls by removing redundant calls to the
|
||
filesystem. (David Phillips via cdouglas)
|
||
|
||
HADOOP-4305. Improves the blacklisting strategy, whereby, tasktrackers
|
||
that are blacklisted are not given tasks to run from other jobs, subject
|
||
to the following conditions (all must be met):
|
||
1) The TaskTracker has been blacklisted by at least 4 jobs (configurable)
|
||
2) The TaskTracker has been blacklisted 50% more times than
|
||
the average (configurable)
|
||
3) The cluster has less than 50% trackers blacklisted
|
||
Once in 24 hours, a TaskTracker blacklisted for all jobs is given a chance.
|
||
Restarting the TaskTracker moves it out of the blacklist.
|
||
(Amareshwari Sriramadasu via ddas)
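The three conditions above can be sketched as a single predicate. This is an illustration only, not the JobTracker code: the function and parameter names are hypothetical, and it assumes that "the average" means the mean per-job blacklist count over all trackers in the cluster.

```python
def blacklisted_across_jobs(tracker, blacklist_counts, cluster_blacklisted,
                            min_jobs=4, excess=0.5, max_cluster_fraction=0.5):
    """Decide whether `tracker` should stop receiving tasks from other jobs.

    blacklist_counts maps every tracker name to the number of jobs that
    blacklisted it; cluster_blacklisted is the number of trackers already
    blacklisted cluster-wide. All three conditions must hold.
    """
    count = blacklist_counts[tracker]
    # Assumption: the average is taken over all trackers in the cluster.
    average = sum(blacklist_counts.values()) / len(blacklist_counts)

    enough_jobs = count >= min_jobs                      # condition 1
    above_average = count > average * (1.0 + excess)     # condition 2
    cluster_ok = (cluster_blacklisted / len(blacklist_counts)
                  < max_cluster_fraction)                # condition 3
    return enough_jobs and above_average and cluster_ok


if __name__ == "__main__":
    counts = {"tt1": 6, "tt2": 1, "tt3": 0, "tt4": 1}
    print(blacklisted_across_jobs("tt1", counts, cluster_blacklisted=0))
```

The defaults mirror the values quoted in the entry (4 jobs, 50% above average, less than 50% of the cluster); the entry marks the first two as configurable.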
|
||
|
||
HADOOP-4688. Modify the MiniMRDFSSort unit test to spill multiple times,
|
||
exercising the map-side merge code. (cdouglas)
|
||
|
||
HADOOP-4737. Adds the KILLED notification when jobs get killed.
|
||
(Amareshwari Sriramadasu via ddas)
|
||
|
||
HADOOP-4728. Add a test exercising different namenode configurations.
|
||
(Boris Shkolnik via cdouglas)
|
||
|
||
HADOOP-4807. Adds JobClient commands to get the active/blacklisted tracker
|
||
names. Also adds commands to display running/completed task attempt IDs.
|
||
(ddas)
|
||
|
||
HADOOP-4699. Remove checksum validation from map output servlet. (cdouglas)
|
||
|
||
HADOOP-4838. Added a registry to automate metrics and mbeans management.
|
||
(Sanjay Radia via acmurthy)
|
||
|
||
HADOOP-3136. Fixed the default scheduler to assign multiple tasks to each
|
||
tasktracker per heartbeat, when feasible. To ensure locality isn't hurt
|
||
too badly, the scheduler will not assign more than one off-switch task per
|
||
heartbeat. The heartbeat interval is also halved since the task-tracker is
|
||
fixed to no longer send out heartbeats on each task completion. A
|
||
slow-start for scheduling reduces is introduced to ensure that reduces
|
||
aren't started until a sufficient number of maps are done, else reduces of
|
||
jobs whose maps aren't scheduled might swamp the cluster.
|
||
Configuration changes to mapred-default.xml:
|
||
add mapred.reduce.slowstart.completed.maps
|
||
(acmurthy)
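The slow-start rule described above can be sketched as a simple threshold check. This is a sketch, not the scheduler implementation: the function name is hypothetical, and it assumes mapred.reduce.slowstart.completed.maps is read as the fraction of a job's maps that must finish before its reduces are scheduled.

```python
def reduces_may_start(completed_maps: int, total_maps: int,
                      slowstart_fraction: float) -> bool:
    """Return True once enough maps have completed to schedule reduces.

    slowstart_fraction stands in for the value of
    mapred.reduce.slowstart.completed.maps (assumed to lie in [0, 1]).
    """
    return completed_maps >= slowstart_fraction * total_maps


if __name__ == "__main__":
    # With half of 10 maps required, reduces wait for the fifth map.
    print(reduces_may_start(4, 10, 0.5))
    print(reduces_may_start(5, 10, 0.5))
```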
|
||
|
||
HADOOP-4545. Add example and test case of secondary sort for the reduce.
|
||
(omalley)
|
||
|
||
HADOOP-4753. Refactor gridmix2 to reduce code duplication. (cdouglas)
|
||
|
||
HADOOP-4909. Fix Javadoc and make some of the API more consistent in their
|
||
use of the JobContext instead of Configuration. (omalley)
|
||
|
||
HADOOP-4920. Stop storing Forrest output in Subversion. (cutting)
|
||
|
||
HADOOP-4948. Add parameters java5.home and forrest.home to the ant commands
|
||
in test-patch.sh. (Giridharan Kesavan via szetszwo)
|
||
|
||
HADOOP-4830. Add end-to-end test cases for testing queue capacities.
|
||
(Vinod Kumar Vavilapalli via yhemanth)
|
||
|
||
HADOOP-4980. Improve code layout of capacity scheduler to make it
|
||
easier to fix some blocker bugs. (Vivek Ratan via yhemanth)
|
||
|
||
HADOOP-4916. Make user/location of Chukwa installation configurable by an
|
||
external properties file. (Eric Yang via cdouglas)
|
||
|
||
HADOOP-4950. Make the CompressorStream, DecompressorStream,
|
||
BlockCompressorStream, and BlockDecompressorStream public to facilitate
|
||
non-Hadoop codecs. (omalley)
|
||
|
||
HADOOP-4843. Collect job history and configuration in Chukwa. (Eric Yang
|
||
via cdouglas)
|
||
|
||
HADOOP-5030. Build Chukwa RPM to install into configured directory. (Eric
|
||
Yang via cdouglas)
|
||
|
||
HADOOP-4828. Updates documents to do with configuration (HADOOP-4631).
|
||
(Sharad Agarwal via ddas)
|
||
|
||
HADOOP-4939. Adds a test that would inject random failures for tasks in
|
||
large jobs and would also inject TaskTracker failures. (ddas)
|
||
|
||
HADOOP-4944. A configuration file can include other configuration
|
||
files. (Rama Ramasamy via dhruba)
|
||
|
||
HADOOP-4804. Provide Forrest documentation for the Fair Scheduler.
|
||
(Sreekanth Ramakrishnan via yhemanth)
|
||
|
||
HADOOP-5248. A testcase that checks for the existence of job directory
|
||
after the job completes. Fails if it exists. (ddas)
|
||
|
||
HADOOP-4664. Introduces multiple job initialization threads, where the
|
||
number of threads is configurable via mapred.jobinit.threads.
|
||
(Matei Zaharia and Jothi Padmanabhan via ddas)
|
||
|
||
HADOOP-4191. Adds a testcase for JobHistory. (Ravi Gummadi via ddas)
|
||
|
||
HADOOP-5466. Change documentation CSS style for headers and code. (Corinne
|
||
Chandel via szetszwo)
|
||
|
||
HADOOP-5275. Add ivy directory and files to built tar.
|
||
(Giridharan Kesavan via nigel)
|
||
|
||
HADOOP-5468. Add sub-menus to forrest documentation and make some minor
|
||
edits. (Corinne Chandel via szetszwo)
|
||
|
||
HADOOP-5437. Fix TestMiniMRDFSSort to properly test jvm-reuse. (omalley)
|
||
|
||
HADOOP-5521. Removes dependency of TestJobInProgress on RESTART_COUNT
|
||
JobHistory tag. (Ravi Gummadi via ddas)
|
||
|
||
OPTIMIZATIONS
|
||
|
||
HADOOP-3293. Fixes FileInputFormat to provide locations for splits
|
||
based on the rack/host that has the largest number of bytes.
|
||
(Jothi Padmanabhan via ddas)
|
||
|
||
HADOOP-4683. Fixes Reduce shuffle scheduler to invoke
|
||
getMapCompletionEvents in a separate thread. (Jothi Padmanabhan
|
||
via ddas)
|
||
|
||
BUG FIXES
|
||
|
||
HADOOP-4204. Fix findbugs warnings related to unused variables, naive
|
||
Number subclass instantiation, Map iteration, and badly scoped inner
|
||
classes. (Suresh Srinivas via cdouglas)
|
||
|
||
HADOOP-4207. Update derby jar file to release 10.4.2.
|
||
(Prasad Chakka via dhruba)
|
||
|
||
HADOOP-4325. SocketInputStream.read() should return -1 in case of EOF.
|
||
(Raghu Angadi)
|
||
|
||
HADOOP-4408. FsAction functions need not create new objects. (cdouglas)
|
||
|
||
HADOOP-4440. TestJobInProgressListener tests for jobs killed in queued
|
||
state (Amar Kamat via ddas)
|
||
|
||
HADOOP-4346. Implement blocking connect so that Hadoop is not affected
|
||
by selector problem with JDK default implementation. (Raghu Angadi)
|
||
|
||
HADOOP-4388. If there are invalid blocks in the transfer list, Datanode
|
||
should handle them and keep transferring the remaining blocks. (Suresh
|
||
Srinivas via szetszwo)
|
||
|
||
HADOOP-4587. Fix a typo in Mapper javadoc. (Koji Noguchi via szetszwo)
|
||
|
||
HADOOP-4530. In fsck, HttpServletResponse sendError fails with
|
||
IllegalStateException. (hairong)
|
||
|
||
HADOOP-4377. Fix a race condition in directory creation in
|
||
NativeS3FileSystem. (David Phillips via cdouglas)
|
||
|
||
HADOOP-4621. Fix javadoc warnings caused by duplicate jars. (Kan Zhang via
|
||
cdouglas)
|
||
|
||
HADOOP-4566. Deploy new hive code to support more types.
|
||
(Zheng Shao via dhruba)
|
||
|
||
HADOOP-4571. Add chukwa conf files to svn:ignore list. (Eric Yang via
|
||
szetszwo)
|
||
|
||
HADOOP-4589. Correct PiEstimator output messages and improve the code
|
||
readability. (szetszwo)
|
||
|
||
HADOOP-4650. Correct a mismatch between the default value of
|
||
local.cache.size in the config and the source. (Jeff Hammerbacher via
|
||
cdouglas)
|
||
|
||
HADOOP-4606. Fix cygpath error if the log directory does not exist.
|
||
(szetszwo via omalley)
|
||
|
||
HADOOP-4141. Fix bug in ScriptBasedMapping causing potential infinite
|
||
loop on misconfigured hadoop-site. (Aaron Kimball via tomwhite)
|
||
|
||
HADOOP-4691. Correct a link in the javadoc of IndexedSortable. (szetszwo)
|
||
|
||
HADOOP-4598. '-setrep' command skips under-replicated blocks. (hairong)
|
||
|
||
HADOOP-4429. Set defaults for user, group in UnixUserGroupInformation so
|
||
login fails more predictably when misconfigured. (Alex Loddengaard via
|
||
cdouglas)
|
||
|
||
HADOOP-4676. Fix broken URL in blacklisted tasktrackers page. (Amareshwari
|
||
Sriramadasu via cdouglas)
|
||
|
||
HADOOP-3422. Ganglia counter metrics are all reported with the metric
|
||
name "value", so the counter values can not be seen. (Jason Attributor
|
||
and Brian Bockelman via stack)
|
||
|
||
HADOOP-4704. Fix javadoc typos "the the". (szetszwo)
|
||
|
||
HADOOP-4677. Fix semantics of FileSystem::getBlockLocations to return
|
||
meaningful values. (Hong Tang via cdouglas)
|
||
|
||
HADOOP-4669. Use correct operator when evaluating whether access time is
|
||
enabled. (Dhruba Borthakur via cdouglas)
|
||
|
||
HADOOP-4732. Pass connection and read timeouts in the correct order when
|
||
setting up fetch in reduce. (Amareshwari Sriramadasu via cdouglas)
|
||
|
||
HADOOP-4558. Fix capacity reclamation in capacity scheduler.
|
||
(Amar Kamat via yhemanth)
|
||
|
||
HADOOP-4770. Fix rungridmix_2 script to work with RunJar. (cdouglas)
|
||
|
||
HADOOP-4738. When using git, the saveVersion script will use only the
|
||
commit hash for the version and not the message, which requires escaping.
|
||
(cdouglas)
|
||
|
||
HADOOP-4576. Show pending job count instead of task count in the UI per
|
||
queue in capacity scheduler. (Sreekanth Ramakrishnan via yhemanth)
|
||
|
||
HADOOP-4623. Maintain running tasks even if speculative execution is off.
|
||
(Amar Kamat via yhemanth)
|
||
|
||
HADOOP-4786. Fix compilation error in
|
||
TestTrackerBlacklistAcrossJobs. (yhemanth)
|
||
|
||
HADOOP-4785. Fixes the JobTracker heartbeat to not make two calls to
|
||
System.currentTimeMillis(). (Amareshwari Sriramadasu via ddas)
|
||
|
||
HADOOP-4792. Add generated Chukwa configuration files to version control
|
||
ignore lists. (cdouglas)
|
||
|
||
HADOOP-4796. Fix Chukwa test configuration, remove unused components. (Eric
|
||
Yang via cdouglas)
|
||
|
||
HADOOP-4708. Add binaries missed in the initial checkin for Chukwa. (Eric
|
||
Yang via cdouglas)
|
||
|
||
HADOOP-4805. Remove black list collector from Chukwa Agent HTTP Sender.
|
||
(Eric Yang via cdouglas)
|
||
|
||
HADOOP-4837. Move HADOOP_CONF_DIR configuration to chukwa-env.sh (Jerome
|
||
Boulon via cdouglas)
|
||
|
||
HADOOP-4825. Use ps instead of jps for querying process status in Chukwa.
|
||
(Eric Yang via cdouglas)
|
||
|
||
HADOOP-4844. Fixed javadoc for
|
||
org.apache.hadoop.fs.permission.AccessControlException to document that
|
||
it's deprecated in favour of
|
||
org.apache.hadoop.security.AccessControlException. (acmurthy)
|
||
|
||
HADOOP-4706. Close the underlying output stream in
|
||
IFileOutputStream::close. (Jothi Padmanabhan via cdouglas)
|
||
|
||
HADOOP-4855. Fixed command-specific help messages for refreshServiceAcl in
|
||
DFSAdmin and MRAdmin. (acmurthy)
|
||
|
||
HADOOP-4820. Remove unused method FSNamesystem::deleteInSafeMode. (Suresh
|
||
Srinivas via cdouglas)
|
||
|
||
HADOOP-4698. Lower io.sort.mb to 10 in the tests and raise the junit memory
|
||
limit to 512m from 256m. (Nigel Daley via cdouglas)
|
||
|
||
HADOOP-4860. Split TestFileTailingAdapters into three separate tests to
|
||
avoid contention. (Eric Yang via cdouglas)
|
||
|
||
HADOOP-3921. Fixed clover (code coverage) target to work with JDK 6.
|
||
(tomwhite via nigel)
|
||
|
||
HADOOP-4845. Modify the reduce input byte counter to record only the
|
||
compressed size and add a human-readable label. (Yongqiang He via cdouglas)
|
||
|
||
HADOOP-4458. Add a test creating symlinks in the working directory.
|
||
(Amareshwari Sriramadasu via cdouglas)
|
||
|
||
HADOOP-4879. Fix org.apache.hadoop.mapred.Counters to correctly define
|
||
Object.equals rather than depend on contentEquals api. (omalley via
|
||
acmurthy)
|
||
|
||
HADOOP-4791. Fix rpm build process for Chukwa. (Eric Yang via cdouglas)
|
||
|
||
HADOOP-4771. Correct initialization of the file count for directories
|
||
with quotas. (Ruyue Ma via shv)
|
||
|
||
HADOOP-4878. Fix eclipse plugin classpath file to point to ivy's resolved
|
||
lib directory and added the same to test-patch.sh. (Giridharan Kesavan via
|
||
acmurthy)
|
||
|
||
HADOOP-4774. Fix default values of some capacity scheduler configuration
|
||
items which would otherwise not work on a fresh checkout.
|
||
(Sreekanth Ramakrishnan via yhemanth)
|
||
|
||
HADOOP-4876. Fix capacity scheduler reclamation by updating count of
|
||
pending tasks correctly. (Sreekanth Ramakrishnan via yhemanth)
|
||
|
||
HADOOP-4849. Documentation for Service Level Authorization implemented in
|
||
HADOOP-4348. (acmurthy)
|
||
|
||
HADOOP-4827. Replace Consolidator with Aggregator macros in Chukwa (Eric
|
||
Yang via cdouglas)
|
||
|
||
HADOOP-4894. Correctly parse ps output in Chukwa jettyCollector.sh. (Ari
|
||
Rabkin via cdouglas)
|
||
|
||
HADOOP-4892. Close fds out of Chukwa ExecPlugin. (Ari Rabkin via cdouglas)
|
||
|
||
HADOOP-4889. Fix permissions in RPM packaging. (Eric Yang via cdouglas)
|
||
|
||
HADOOP-4869. Fixes the TT-JT heartbeat to have an explicit flag for
|
||
restart apart from the initialContact flag that there was earlier.
|
||
(Amareshwari Sriramadasu via ddas)
|
||
|
||
HADOOP-4716. Fixes ReduceTask.java to clear out the mapping between
|
||
hosts and MapOutputLocation upon a JT restart. (Amar Kamat via ddas)
|
||
|
||
HADOOP-4880. Removes an unnecessary testcase from TestJobTrackerRestart.
|
||
(Amar Kamat via ddas)
|
||
|
||
HADOOP-4924. Fixes a race condition in TaskTracker re-init. (ddas)
|
||
|
||
HADOOP-4854. Read reclaim capacity interval from capacity scheduler
|
||
configuration. (Sreekanth Ramakrishnan via yhemanth)
|
||
|
||
HADOOP-4896. HDFS Fsck does not load HDFS configuration. (Raghu Angadi)
|
||
|
||
HADOOP-4956. Creates TaskStatus for failed tasks with an empty Counters
|
||
object instead of null. (ddas)
|
||
|
||
HADOOP-4979. Fix capacity scheduler to block cluster for failed high
|
||
RAM requirements across task types. (Vivek Ratan via yhemanth)
|
||
|
||
HADOOP-4949. Fix native compilation. (Chris Douglas via acmurthy)
|
||
|
||
HADOOP-4787. Fixes the testcase TestTrackerBlacklistAcrossJobs which was
|
||
earlier failing randomly. (Amareshwari Sriramadasu via ddas)
|
||
|
||
HADOOP-4914. Add description fields to Chukwa init.d scripts (Eric Yang via
|
||
cdouglas)
|
||
|
||
HADOOP-4884. Make tool tip date format match standard HICC format. (Eric
|
||
Yang via cdouglas)
|
||
|
||
HADOOP-4925. Make Chukwa sender properties configurable. (Ari Rabkin via
|
||
cdouglas)
|
||
|
||
HADOOP-4947. Make Chukwa command parsing more forgiving of whitespace. (Ari
|
||
Rabkin via cdouglas)
|
||
|
||
HADOOP-5026. Make chukwa/bin scripts executable in repository. (Andy
|
||
Konwinski via cdouglas)
|
||
|
||
HADOOP-4977. Fix a deadlock between the reclaimCapacity and assignTasks
|
||
in capacity scheduler. (Vivek Ratan via yhemanth)
|
||
|
||
HADOOP-4988. Fix reclaim capacity to work even when there are queues with
|
||
no capacity. (Vivek Ratan via yhemanth)
|
||
|
||
HADOOP-5065. Remove generic parameters from argument to
|
||
setIn/OutputFormatClass so that it works with SequenceIn/OutputFormat.
|
||
(cdouglas via omalley)
|
||
|
||
HADOOP-4818. Pass user config to instrumentation API. (Eric Yang via
|
||
cdouglas)
|
||
|
||
HADOOP-4993. Fix Chukwa agent configuration and startup to make it both
|
||
more modular and testable. (Ari Rabkin via cdouglas)
|
||
|
||
HADOOP-5048. Fix capacity scheduler to correctly cleanup jobs that are
|
||
killed after initialization, but before running.
|
||
(Sreekanth Ramakrishnan via yhemanth)
|
||
|
||
HADOOP-4671. Mark loop control variables shared between threads as
|
||
volatile. (cdouglas)
|
||
|
||
HADOOP-5079. HashFunction inadvertently destroys some randomness
|
||
(Jonathan Ellis via stack)
|
||
|
||
HADOOP-4999. A failure to write to FsEditsLog results in
|
||
IndexOutOfBounds exception. (Boris Shkolnik via rangadi)
|
||
|
||
HADOOP-5139. Catch IllegalArgumentException during metrics registration
|
||
in RPC. (Hairong Kuang via szetszwo)
|
||
|
||
HADOOP-5085. Copying a file to local with Crc throws an exception.
|
||
(hairong)
|
||
|
||
HADOOP-5211. Fix check for job completion in TestSetupAndCleanupFailure.
|
||
(enis)
|
||
|
||
HADOOP-5254. The Configuration class should be able to work with XML
|
||
parsers that do not support xmlinclude. (Steve Loughran via dhruba)
|
||
|
||
HADOOP-4692. Namenode in infinite loop for replicating/deleting corrupt
|
||
blocks. (hairong)
|
||
|
||
HADOOP-5255. Fix use of Math.abs to avoid overflow. (Jonathan Ellis via
|
||
cdouglas)
|
||
|
||
HADOOP-5269. Fixes a problem to do with tasktracker holding on to
|
||
FAILED_UNCLEAN or KILLED_UNCLEAN tasks forever. (Amareshwari Sriramadasu
|
||
via ddas)
|
||
|
||
HADOOP-5214. Fixes a ConcurrentModificationException while the Fairshare
|
||
Scheduler accesses the tasktrackers stored by the JobTracker.
|
||
(Rahul Kumar Singh via yhemanth)
|
||
|
||
HADOOP-5233. Addresses the three issues - Race condition in updating
|
||
status, NPE in TaskTracker task localization when the conf file is missing
|
||
(HADOOP-5234) and NPE in handling KillTaskAction of a cleanup task
|
||
(HADOOP-5235). (Amareshwari Sriramadasu via ddas)
|
||
|
||
HADOOP-5247. Introduces a broadcast of KillJobAction to all trackers when
|
||
a job finishes. This fixes a bunch of problems to do with NPE when a
|
||
completed job is not in memory and a tasktracker comes to the jobtracker
|
||
with a status report of a task belonging to that job. (Amar Kamat via ddas)
|
||
|
||
HADOOP-5282. Fixed job history logs for task attempts that are
|
||
failed by the JobTracker, say due to lost task trackers. (Amar
|
||
Kamat via yhemanth)
|
||
|
||
HADOOP-5241. Fixes a bug in disk-space resource estimation. Makes
|
||
the estimation formula linear where blowUp =
|
||
Total-Output/Total-Input. (Sharad Agarwal via ddas)
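The linear estimate described in HADOOP-5241 can be sketched as follows. This is only an illustration of the formula blowUp = Total-Output/Total-Input; the function name, its parameters, and the zero-input fallback are assumptions, not the actual Hadoop code.

```python
def estimate_output_size(input_size, total_output, total_input):
    """Linear estimate: scale an input size by the observed blow-up ratio.

    All names here are hypothetical; only the ratio comes from the entry above.
    """
    if total_input == 0:
        # Assumption: with no observed input there is no basis for an estimate.
        return 0.0
    blow_up = total_output / total_input
    return input_size * blow_up


# Observed so far: 1000 bytes in, 1500 bytes out, so blowUp is 1.5.
print(estimate_output_size(200, 1500, 1000))
```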
|
||
|
||
HADOOP-5142. Fix MapWritable#putAll to store key/value classes.
|
||
(Doğacan Güney via enis)
|
||
|
||
HADOOP-4744. Workaround for jetty6 returning -1 when getLocalPort
|
||
is invoked on the connector. The workaround patch retries a few
|
||
times before failing. (Jothi Padmanabhan via yhemanth)
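The retry workaround in HADOOP-4744 follows a common pattern, sketched below under stated assumptions. The get_local_port callable and the max_attempts default are hypothetical stand-ins; the real patch operates on the jetty connector and its retry count is not given here.

```python
def get_port_with_retries(get_local_port, max_attempts=5):
    """Call get_local_port until it returns something other than -1.

    get_local_port and max_attempts are hypothetical; the changelog only
    says the workaround "retries a few times before failing".
    """
    for _ in range(max_attempts):
        port = get_local_port()
        if port != -1:
            return port
    raise RuntimeError("local port still -1 after %d attempts" % max_attempts)


# Simulated connector that reports -1 twice before a usable port.
responses = iter([-1, -1, 8080])
print(get_port_with_retries(lambda: next(responses)))
```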
|
||
|
||
HADOOP-5280. Adds a check to prevent a task state transition from
|
||
FAILED to any of UNASSIGNED, RUNNING, COMMIT_PENDING or
|
||
SUCCEEDED. (ddas)
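A minimal sketch of the kind of check HADOOP-5280 describes, assuming a simple enum of task states. The class and function names are illustrative and do not mirror the real TaskStatus code; only the listed states come from the entry above.

```python
from enum import Enum


class TaskState(Enum):
    UNASSIGNED = 1
    RUNNING = 2
    COMMIT_PENDING = 3
    SUCCEEDED = 4
    FAILED = 5


# Target states that a FAILED task must never move to.
BLOCKED_FROM_FAILED = {
    TaskState.UNASSIGNED,
    TaskState.RUNNING,
    TaskState.COMMIT_PENDING,
    TaskState.SUCCEEDED,
}


def is_transition_allowed(old_state, new_state):
    """Reject only the FAILED -> {UNASSIGNED, RUNNING, COMMIT_PENDING, SUCCEEDED} moves."""
    return not (old_state is TaskState.FAILED and new_state in BLOCKED_FROM_FAILED)


print(is_transition_allowed(TaskState.FAILED, TaskState.RUNNING))
print(is_transition_allowed(TaskState.RUNNING, TaskState.FAILED))
```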
|
||
|
||
HADOOP-5272. Fixes a problem to do with detecting whether an
|
||
attempt is the first attempt of a Task. This affects JobTracker
|
||
restart. (Amar Kamat via ddas)
|
||
|
||
HADOOP-5306. Fixes a problem to do with logging/parsing the http port of a
|
||
lost tracker. Affects JobTracker restart. (Amar Kamat via ddas)
|
||
|
||
HADOOP-5111. Fix Job::set* methods to work with generics. (cdouglas)
|
||
|
||
HADOOP-5274. Fix gridmix2 dependency on wordcount example. (cdouglas)
|
||
|
||
HADOOP-5145. Balancer sometimes runs out of memory after running
|
||
days or weeks. (hairong)
|
||
|
||
HADOOP-5338. Fix jobtracker restart to clear task completion
|
||
events cached by tasktrackers forcing them to fetch all events
|
||
afresh, thus avoiding missed task completion events on the
|
||
tasktrackers. (Amar Kamat via yhemanth)
|
||
|
||
HADOOP-4695. Change TestGlobalFilter so that it allows a web page to be
|
||
filtered more than once for a single access. (Kan Zhang via szetszwo)
|
||
|
||
HADOOP-5298. Change TestServletFilter so that it allows a web page to be
|
||
filtered more than once for a single access. (szetszwo)
|
||
|
||
HADOOP-5432. Disable ssl during unit tests in hdfsproxy, as it is unused
|
||
and causes failures. (cdouglas)
|
||
|
||
HADOOP-5416. Correct the shell command "fs -test" forrest doc description.
|
||
(Ravi Phulari via szetszwo)
|
||
|
||
HADOOP-5327. Fixed job tracker to remove files from system directory on
|
||
ACL check failures and also check ACLs on restart.
|
||
(Amar Kamat via yhemanth)
|
||
|
||
HADOOP-5395. Change the exception message when a job is submitted to an
|
||
invalid queue. (Rahul Kumar Singh via yhemanth)
|
||
|
||
HADOOP-5276. Fixes a problem to do with updating the start time of
|
||
a task when the tracker that ran the task is lost. (Amar Kamat via
|
||
ddas)
|
||
|
||
HADOOP-5278. Fixes a problem to do with logging the finish time of
|
||
a task during recovery (after a JobTracker restart). (Amar Kamat
|
||
via ddas)
|
||
|
||
HADOOP-5490. Fixes a synchronization problem in the
|
||
EagerTaskInitializationListener class. (Jothi Padmanabhan via
|
||
ddas)
|
||
|
||
HADOOP-5493. The shuffle copier threads return the codecs back to
|
||
the pool when the shuffle completes. (Jothi Padmanabhan via ddas)
|
||
|
||
HADOOP-5414. Fixes IO exception while executing hadoop fs -touchz
|
||
fileName by making sure that lease renewal thread exits before dfs
|
||
client exits. (hairong)
|
||
|
||
HADOOP-5103. FileInputFormat now reuses the clusterMap network
|
||
topology object and that brings down the log messages in the
|
||
JobClient to do with NetworkTopology.add significantly. (Jothi
|
||
Padmanabhan via ddas)
|
||
|
||
HADOOP-5483. Fixes a problem in the Directory Cleanup Thread due to which
|
||
TestMiniMRWithDFS sometimes used to fail. (ddas)
|
||
|
||
HADOOP-5281. Prevent sharing incompatible ZlibCompressor instances between
|
||
GzipCodec and DefaultCodec. (cdouglas)
|
||
|
||
HADOOP-5463. Balancer throws "Not a host:port pair" unless port is
|
||
specified in fs.default.name. (Stuart White via hairong)
|
||
|
||
HADOOP-5514. Fix JobTracker metrics and add metrics for waiting, failed
|
||
tasks. (cdouglas)
|
||
|
||
HADOOP-5516. Fix NullPointerException in TaskMemoryManagerThread
|
||
that comes when monitored processes disappear when the thread is
|
||
running. (Vinod Kumar Vavilapalli via yhemanth)
|
||
|
||
HADOOP-5382. Support combiners in the new context object API. (omalley)
|
||
|
||
HADOOP-5471. Fixes a problem to do with updating the log.index file in the
|
||
case where a cleanup task is run. (Amareshwari Sriramadasu via ddas)
|
||
|
||
HADOOP-5534. Fixed a deadlock in Fair scheduler's servlet.
|
||
(Rahul Kumar Singh via yhemanth)
|
||
|
||
HADOOP-5328. Fixes a problem in the renaming of job history files during
|
||
job recovery. (Amar Kamat via ddas)
|
||
|
||
HADOOP-5417. Don't ignore InterruptedExceptions that happen when calling
|
||
into rpc. (omalley)
|
||
|
||
HADOOP-5320. Add a close() in TestMapReduceLocal. (Jothi Padmanabhan
|
||
via szetszwo)
|
||
|
||
HADOOP-5520. Fix a typo in disk quota help message. (Ravi Phulari
|
||
via szetszwo)
|
||
|
||
HADOOP-5519. Remove claims from mapred-default.xml that prime numbers
|
||
of tasks are helpful. (Owen O'Malley via szetszwo)
|
||
|
||
HADOOP-5484. TestRecoveryManager fails with FileAlreadyExistsException.
|
||
(Amar Kamat via hairong)
|
||
|
||
HADOOP-5564. Limit the JVM heap size in the java command for initializing
|
||
JAVA_PLATFORM. (Suresh Srinivas via szetszwo)
|
||
|
||
HADOOP-5565. Add API for failing/finalized jobs to the JT metrics
|
||
instrumentation. (Jerome Boulon via cdouglas)
|
||
|
||
HADOOP-5390. Remove duplicate jars from tarball, src from binary tarball
|
||
added by hdfsproxy. (Zhiyong Zhang via cdouglas)
|
||
|
||
HADOOP-5066. Building binary tarball should not build docs/javadocs, copy
|
||
src, or run jdiff. (Giridharan Kesavan via cdouglas)
|
||
|
||
HADOOP-5459. Fix undetected CRC errors where intermediate output is closed
|
||
before it has been completely consumed. (cdouglas)
|
||
|
||
HADOOP-5571. Remove widening primitive conversion in TupleWritable mask
|
||
manipulation. (Jingkei Ly via cdouglas)
|
||
|
||
HADOOP-5588. Remove an unnecessary call to listStatus(..) in
|
||
FileSystem.globStatusInternal(..). (Hairong Kuang via szetszwo)
|
||
|
||
HADOOP-5473. Solves a race condition in killing a task - the state is KILLED
|
||
if there is a user request pending to kill the task and the TT reported
|
||
the state as SUCCESS. (Amareshwari Sriramadasu via ddas)
|
||
|
||
HADOOP-5576. Fix LocalRunner to work with the new context object API in
|
||
mapreduce. (Tom White via omalley)
|
||
|
||
HADOOP-4374. Installs a shutdown hook in the Task JVM so that log.index is
|
||
updated before the JVM exits. Also makes the update to log.index atomic.
|
||
(Ravi Gummadi via ddas)
|
||
|
||
HADOOP-5577. Add a verbose flag to mapreduce.Job.waitForCompletion to get
|
||
the running job's information printed to the user's stdout as it runs.
|
||
(omalley)
|
||
|
||
HADOOP-5607. Fix NPE in TestCapacityScheduler. (cdouglas)
|
||
|
||
HADOOP-5605. All the replicas incorrectly got marked as corrupt. (hairong)
|
||
|
||
HADOOP-5337. JobTracker, upon restart, now waits for the TaskTrackers to
|
||
join back before scheduling new tasks. This fixes race conditions associated
|
||
with greedy scheduling as was the case earlier. (Amar Kamat via ddas)
|
||
|
||
HADOOP-5227. Fix distcp so -update and -delete can be meaningfully
|
||
combined. (Tsz Wo (Nicholas), SZE via cdouglas)
|
||
|
||
HADOOP-5305. Increase number of files and print debug messages in
|
||
TestCopyFiles. (szetszwo)
|
||
|
||
HADOOP-5548. Add synchronization for JobTracker methods in RecoveryManager.
|
||
(Amareshwari Sriramadasu via sharad)
|
||
|
||
HADOOP-3810. NameNode seems unstable on a cluster with little space left.
|
||
(hairong)
|
||
|
||
HADOOP-5068. Fix NPE in TestCapacityScheduler. (Vinod Kumar Vavilapalli
|
||
via szetszwo)
|
||
|
||
HADOOP-5585. Clear FileSystem statistics between tasks when jvm-reuse
|
||
is enabled. (omalley)
|
||
|
||
HADOOP-5394. JobTracker might schedule 2 attempts of the same task
|
||
with the same attempt id across restarts. (Amar Kamat via sharad)
|
||
|
||
HADOOP-5645. After HADOOP-4920 we need a place to checkin
|
||
releasenotes.html. (nigel)
|
||
|
||
Release 0.19.2 - 2009-06-30
|
||
|
||
BUG FIXES
|
||
|
||
HADOOP-5154. Fixes a deadlock in the fairshare scheduler.
|
||
(Matei Zaharia via yhemanth)
|
||
|
||
HADOOP-5146. Fixes a race condition that causes LocalDirAllocator to miss
|
||
files. (Devaraj Das via yhemanth)
|
||
|
||
HADOOP-4638. Fixes job recovery to not crash the job tracker for problems
|
||
with a single job file. (Amar Kamat via yhemanth)
|
||
|
||
HADOOP-5384. Fix a problem that DataNodeCluster creates blocks with
|
||
generationStamp == 1. (szetszwo)
|
||
|
||
HADOOP-5376. Fixes the code handling lost tasktrackers to set the task state
|
||
to KILLED_UNCLEAN only for relevant type of tasks.
|
||
(Amareshwari Sriramadasu via yhemanth)
|
||
|
||
HADOOP-5285. Fixes the issues - (1) obtainTaskCleanupTask checks whether job is
|
||
inited before trying to lock the JobInProgress (2) Moves the CleanupQueue class
|
||
outside the TaskTracker and makes it a generic class that is used by the
|
||
JobTracker also for deleting the paths on the job's output fs. (3) Moves the
|
||
references to completedJobStore outside the block where the JobTracker is locked.
|
||
(ddas)
|
||
|
||
HADOOP-5392. Fixes a problem to do with JT crashing during recovery when
|
||
the job files are garbled. (Amar Kamat via ddas)
|
||
|
||
HADOOP-5332. Appending to files is not allowed (by default) unless
|
||
dfs.support.append is set to true. (dhruba)
|
||
|
||
HADOOP-5333. libhdfs supports appending to files. (dhruba)
|
||
|
||
HADOOP-3998. Fix dfsclient exception when JVM is shutdown. (dhruba)
|
||
|
||
HADOOP-5440. Fixes a problem to do with removing a taskId from the list
|
||
of taskIds that the TaskTracker's TaskMemoryManager manages.
|
||
(Amareshwari Sriramadasu via ddas)
|
||
|
||
HADOOP-5446. Restore TaskTracker metrics. (cdouglas)
|
||
|
||
HADOOP-5449. Fixes the history cleaner thread.
|
||
(Amareshwari Sriramadasu via ddas)
|
||
|
||
HADOOP-5479. NameNode should not send empty block replication request to
|
||
DataNode. (hairong)
|
||
|
||
HADOOP-5259. Job with output hdfs:/user/<username>/outputpath (no
|
||
authority) fails with Wrong FS. (Doug Cutting via hairong)
|
||
|
||
HADOOP-5522. Documents the setup/cleanup tasks in the mapred tutorial.
|
||
(Amareshwari Sriramadasu via ddas)
|
||
|
||
HADOOP-5549. ReplicationMonitor should schedule both replication and
|
||
deletion work in one iteration. (hairong)
|
||
|
||
HADOOP-5554. DataNodeCluster and CreateEditsLog should create blocks with
|
||
the same generation stamp value. (hairong via szetszwo)
|
||
|
||
HADOOP-5231. Clones the TaskStatus before passing it to the JobInProgress.
|
||
(Amareshwari Sriramadasu via ddas)
|
||
|
||
HADOOP-4719. Fix documentation of 'ls' format for FsShell. (Ravi Phulari
|
||
via cdouglas)
|
||
|
||
HADOOP-5374. Fixes a NPE problem in getTasksToSave method.
|
||
(Amareshwari Sriramadasu via ddas)
|
||
|
||
HADOOP-4780. Cache the size of directories in DistributedCache, avoiding
|
||
long delays in recalculating it. (He Yongqiang via cdouglas)
|
||
|
||
HADOOP-5551. Prevent directory destruction on file create.
|
||
(Brian Bockelman via shv)
|
||
|
||
HADOOP-5671. Fix FNF exceptions when copying from old versions of
|
||
HftpFileSystem. (Tsz Wo (Nicholas), SZE via cdouglas)
|
||
|
||
HADOOP-5213. Fix Null pointer exception caused when bzip2 compression
|
||
was used and user closed an output stream without writing any data.
|
||
(Zheng Shao via dhruba)
|
||
|
||
HADOOP-5579. Set errno correctly in libhdfs for permission, quota, and FNF
|
||
conditions. (Brian Bockelman via cdouglas)
|
||
|
||
HADOOP-5816. Fixes a problem in the KeyFieldBasedComparator to do with
|
||
ArrayIndexOutOfBounds exception. (He Yongqiang via ddas)
|
||
|
||
HADOOP-5951. Add Apache license header to StorageInfo.java. (Suresh
|
||
Srinivas via szetszwo)
|
||
|
||
Release 0.19.1 - 2009-02-23
|
||
|
||
IMPROVEMENTS
|
||
|
||
HADOOP-4739. Fix spelling and grammar, improve phrasing of some sections in
|
||
mapred tutorial. (Vivek Ratan via cdouglas)
|
||
|
||
HADOOP-3894. DFSClient logging improvements. (Steve Loughran via shv)
|
||
|
||
HADOOP-5126. Remove empty file BlocksWithLocations.java (shv)
|
||
|
||
HADOOP-5127. Remove public methods in FSDirectory. (Jakob Homan via shv)
|
||
|
||
BUG FIXES
|
||
|
||
HADOOP-4697. Fix getBlockLocations in KosmosFileSystem to handle multiple
|
||
blocks correctly. (Sriram Rao via cdouglas)
|
||
|
||
HADOOP-4420. Add null checks for job, caused by invalid job IDs.
|
||
(Aaron Kimball via tomwhite)
|
||
|
||
HADOOP-4632. Fix TestJobHistoryVersion to use test.build.dir instead of the
|
||
current working directory for scratch space. (Amar Kamat via cdouglas)
|
||
|
||
HADOOP-4508. Fix FSDataOutputStream.getPos() for append. (dhruba via
|
||
szetszwo)
|
||
|
||
HADOOP-4727. Fix a group checking bug in fill_stat_structure(...) in
|
||
fuse-dfs. (Brian Bockelman via szetszwo)
|
||
|
||
HADOOP-4836. Correct typos in mapred related documentation. (Jordà Polo
|
||
via szetszwo)
|
||
|
||
HADOOP-4821. Usage descriptions in the Quotas guide documentation are
|
||
incorrect. (Boris Shkolnik via hairong)
|
||
|
||
HADOOP-4847. Moves the loading of OutputCommitter to the Task.
|
||
(Amareshwari Sriramadasu via ddas)
|
||
|
||
HADOOP-4966. Marks completed setup tasks for removal.
|
||
(Amareshwari Sriramadasu via ddas)
|
||
|
||
HADOOP-4982. TestFsck should run in Eclipse. (shv)
|
||
|
||
HADOOP-5008. TestReplication#testPendingReplicationRetry leaves an opened
|
||
fd unclosed. (hairong)
|
||
|
||
HADOOP-4906. Fix TaskTracker OOM by keeping a shallow copy of JobConf in
|
||
TaskTracker.TaskInProgress. (Sharad Agarwal via acmurthy)
|
||
|
||
HADOOP-4918. Fix bzip2 compression to work with Sequence Files.
|
||
(Zheng Shao via dhruba)
|
||
|
||
HADOOP-4965. TestFileAppend3 should close FileSystem. (shv)
|
||
|
||
HADOOP-4967. Fixes a race condition in the JvmManager to do with killing
|
||
tasks. (ddas)
|
||
|
||
HADOOP-5009. DataNode#shutdown sometimes leaves data block scanner
|
||
verification log unclosed. (hairong)
|
||
|
||
HADOOP-5086. Use the appropriate FileSystem for trash URIs. (cdouglas)
|
||
|
||
HADOOP-4955. Make DBOutputFormat use column names from setOutput().
|
||
(Kevin Peterson via enis)
|
||
|
||
HADOOP-4862. Minor : HADOOP-3678 did not remove all the cases of
|
||
spurious IOExceptions logged by DataNode. (Raghu Angadi)
|
||
|
||
HADOOP-5034. NameNode should send both replication and deletion requests
|
||
to DataNode in one reply to a heartbeat. (hairong)
|
||
|
||
HADOOP-4759. Removes temporary output directory for failed and killed
|
||
tasks by launching special CLEANUP tasks for the same.
|
||
(Amareshwari Sriramadasu via ddas)
|
||
|
||
HADOOP-5161. Accepted sockets do not get placed in
|
||
DataXceiverServer#childSockets. (hairong)
|
||
|
||
HADOOP-5193. Correct calculation of edits modification time. (shv)
|
||
|
||
HADOOP-4494. Allow libhdfs to append to files.
|
||
(Pete Wyckoff via dhruba)
|
||
|
||
HADOOP-5166. Fix JobTracker restart to work when ACLs are configured
|
||
for the JobTracker. (Amar Kamat via yhemanth)
|
||
|
||
HADOOP-5067. Fixes TaskInProgress.java to keep track of count of failed and
|
||
killed tasks correctly. (Amareshwari Sriramadasu via ddas)
|
||
|
||
HADOOP-4760. HDFS streams should not throw exceptions when closed twice.
|
||
(enis)
|
||
|
||
Release 0.19.0 - 2008-11-18
|
||
|
||
INCOMPATIBLE CHANGES
|
||
|
||
HADOOP-3595. Remove deprecated methods for mapred.combine.once
|
||
functionality, which was necessary for providing backwards
|
||
compatible combiner semantics for 0.18. (cdouglas via omalley)
|
||
|
||
HADOOP-3667. Remove the following deprecated methods from JobConf:
|
||
addInputPath(Path)
|
||
getInputPaths()
|
||
getMapOutputCompressionType()
|
||
getOutputPath()
|
||
getSystemDir()
|
||
setInputPath(Path)
|
||
setMapOutputCompressionType(CompressionType style)
|
||
setOutputPath(Path)
|
||
(Amareshwari Sriramadasu via omalley)
|
||
|
||
HADOOP-3652. Remove deprecated class OutputFormatBase.
|
||
(Amareshwari Sriramadasu via cdouglas)
|
||
|
||
HADOOP-2885. Break the hadoop.dfs package into separate packages under
|
||
hadoop.hdfs that reflect whether they are client, server, protocol,
|
||
etc. DistributedFileSystem and DFSClient have moved and are now
|
||
considered package private. (Sanjay Radia via omalley)
|
||
|
||
HADOOP-2325. Require Java 6. (cutting)
|
||
|
||
HADOOP-372. Add support for multiple input paths with a different
|
||
InputFormat and Mapper for each path. (Chris Smith via tomwhite)
|
||
|
||
HADOOP-1700. Support appending to file in HDFS. (dhruba)
|
||
|
||
HADOOP-3792. Make FsShell -test consistent with unix semantics, returning
|
||
zero for true and non-zero for false. (Ben Slusky via cdouglas)
|
||
|
||
HADOOP-3664. Remove the deprecated method InputFormat.validateInput,
|
||
which is no longer needed. (tomwhite via omalley)
|
||
|
||
HADOOP-3549. Give more meaningful errno's in libhdfs. In particular,
|
||
EACCES is returned for permission problems. (Ben Slusky via omalley)
|
||
|
||
HADOOP-4036. ResourceStatus was added to TaskTrackerStatus by HADOOP-3759,
|
||
so increment the InterTrackerProtocol version. (Hemanth Yamijala via
|
||
omalley)
|
||
|
||
HADOOP-3150. Moves task promotion to tasks. Defines a new interface for
|
||
committing output files. Moves job setup to jobclient, and moves jobcleanup
|
||
to a separate task. (Amareshwari Sriramadasu via ddas)
|
||
|
||
HADOOP-3446. Keep map outputs in memory during the reduce. Remove
|
||
fs.inmemory.size.mb and replace with properties defining in memory map
|
||
output retention during the shuffle and reduce relative to maximum heap
|
||
usage. (cdouglas)
|
||
|
||
HADOOP-3245. Adds the feature for supporting JobTracker restart. Running
|
||
jobs can be recovered from the history file. The history file format has
|
||
been modified to support recovery. The task attempt ID now has the
|
||
JobTracker start time to distinguish attempts of the same TIP across
|
||
restarts. (Amar Ramesh Kamat via ddas)
|
||
|
||
HADOOP-4007. Remove DFSFileInfo - FileStatus is sufficient.
|
||
(Sanjay Radia via hairong)
|
||
|
||
HADOOP-3722. Fixed Hadoop Streaming and Hadoop Pipes to use the Tool
|
||
interface and GenericOptionsParser. (Enis Soztutar via acmurthy)
|
||
|
||
HADOOP-2816. Cluster summary at name node web reports the space
|
||
utilization as:
|
||
Configured Capacity: capacity of all the data directories - Reserved space
|
||
Present Capacity: Space available for dfs, i.e. remaining + used space
|
||
DFS Used%: DFS used space/Present Capacity
|
||
(Suresh Srinivas via hairong)
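As a rough worked example of the three figures defined in HADOOP-2816, assuming made-up byte counts (the variable names are illustrative and are not NameNode fields):

```python
# Hypothetical cluster totals in bytes; the values are for illustration only.
raw_capacity = 1000   # capacity of all the data directories
reserved = 100        # reserved space
dfs_used = 300        # space used by DFS
remaining = 450       # space still available to DFS

configured_capacity = raw_capacity - reserved        # capacity minus reserved
present_capacity = remaining + dfs_used              # remaining + used space
dfs_used_pct = 100.0 * dfs_used / present_capacity   # DFS used / Present Capacity

print(configured_capacity, present_capacity, dfs_used_pct)
```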
|
||
|
||
HADOOP-3938. Disk space quotas for HDFS. This is similar to namespace
|
||
quotas in 0.18. (rangadi)
|
||
|
||
HADOOP-4293. Make Configuration Writable and remove unreleased
|
||
WritableJobConf. Configuration.write is renamed to writeXml. (omalley)
|
||
|
||
HADOOP-4281. Change dfsadmin to report available disk space in a format
|
||
consistent with the web interface as defined in HADOOP-2816. (Suresh
|
||
Srinivas via cdouglas)
|
||
|
||
HADOOP-4430. Further change the cluster summary at name node web that was
|
||
changed in HADOOP-2816:
|
||
Non DFS Used - This indicates the disk space taken by non DFS file from
|
||
the Configured capacity
|
||
DFS Used % - DFS Used % of Configured Capacity
|
||
DFS Remaining % - Remaining % of Configured Capacity available for DFS use
|
||
DFS command line report reflects the same change. Config parameter
|
||
dfs.datanode.du.pct is no longer used and is removed from the
|
||
hadoop-default.xml. (Suresh Srinivas via hairong)
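The revised summary in HADOOP-4430 expresses the percentages relative to Configured Capacity. The sketch below assumes that Non DFS Used is whatever part of the configured capacity is neither DFS-used nor remaining; that formula, the function name, and the sample values are assumptions for illustration.

```python
def cluster_summary(configured_capacity, dfs_used, dfs_remaining):
    """Return (non_dfs_used, dfs_used_pct, dfs_remaining_pct).

    Assumption: non-DFS usage is the part of the configured capacity that is
    neither used by DFS nor still available to it.
    """
    non_dfs_used = configured_capacity - dfs_used - dfs_remaining
    dfs_used_pct = 100.0 * dfs_used / configured_capacity
    dfs_remaining_pct = 100.0 * dfs_remaining / configured_capacity
    return non_dfs_used, dfs_used_pct, dfs_remaining_pct


# Hypothetical values in bytes.
print(cluster_summary(900, 300, 450))
```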
|
||
|
||
HADOOP-4116. Balancer should provide better resource management. (hairong)
|
||
|
||
HADOOP-4599. BlocksMap and BlockInfo made package private. (shv)
|
||
|
||
NEW FEATURES
|
||
|
||
HADOOP-3341. Allow streaming jobs to specify the field separator for map
|
||
and reduce input and output. The new configuration values are:
|
||
stream.map.input.field.separator
|
||
stream.map.output.field.separator
|
||
stream.reduce.input.field.separator
|
||
stream.reduce.output.field.separator
|
||
All of them default to "\t". (Zheng Shao via omalley)
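To illustrate what a field separator does to a streaming record, here is a small sketch that splits a line into key and value at the first separator, with tab as the default. The function is hypothetical and is not the Hadoop Streaming implementation; treating a line without a separator as a key with an empty value is an assumption.

```python
def split_key_value(line, separator="\t"):
    """Split a record into (key, value) at the first occurrence of separator."""
    key, found, value = line.partition(separator)
    if not found:
        # Assumption: no separator means the whole line is the key.
        return line, ""
    return key, value


print(split_key_value("user42\tclicked\thome"))
print(split_key_value("user42,clicked", separator=","))
```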
|
||
|
||
HADOOP-3479. Defines the configuration file for the resource manager in
|
||
Hadoop. You can configure various parameters related to scheduling, such
|
||
as queues and queue properties here. The properties for a queue follow a
|
||
naming convention, such as hadoop.rm.queue.queue-name.property-name.
|
||
(Hemanth Yamijala via ddas)
|
||
|
||
HADOOP-3149. Adds a way in which map/reduce tasks can create multiple
|
||
outputs. (Alejandro Abdelnur via ddas)
|
||
|
||
HADOOP-3714. Add a new contrib, bash-tab-completion, which enables
|
||
bash tab completion for the bin/hadoop script. See the README file
|
||
in the contrib directory for the installation. (Chris Smith via enis)
|
||
|
||
HADOOP-3730. Adds a new JobConf constructor that disables loading
|
||
default configurations. (Alejandro Abdelnur via ddas)
|
||
|
||
HADOOP-3772. Add a new Hadoop Instrumentation api for the JobTracker and
|
||
the TaskTracker, refactor Hadoop Metrics as an implementation of the api.
|
||
(Ari Rabkin via acmurthy)
|
||
|
||
HADOOP-2302. Provides a comparator for numerical sorting of key fields.
|
||
(ddas)
|
||
|
||
HADOOP-153. Provides a way to skip bad records. (Sharad Agarwal via ddas)
|
||
|
||
HADOOP-657. Free disk space should be modelled and used by the scheduler
|
||
to make scheduling decisions. (Ari Rabkin via omalley)
|
||
|
||
HADOOP-3719. Initial checkin of Chukwa, which is a data collection and
|
||
analysis framework. (Jerome Boulon, Andy Konwinski, Ari Rabkin,
|
||
and Eric Yang)
|
||
|
||
HADOOP-3873. Add -filelimit and -sizelimit options to distcp to cap the
|
||
number of files/bytes copied in a particular run to support incremental
|
||
updates and mirroring. (TszWo (Nicholas), SZE via cdouglas)
|
||
|
||
HADOOP-3585. FailMon package for hardware failure monitoring and
|
||
analysis of anomalies. (Ioannis Koltsidas via dhruba)
|
||
|
||
HADOOP-1480. Add counters to the C++ Pipes API. (acmurthy via omalley)
|
||
|
||
HADOOP-3854. Add support for pluggable servlet filters in the HttpServers.
|
||
(Tsz Wo (Nicholas) Sze via omalley)
|
||
|
||
HADOOP-3759. Provides ability to run memory intensive jobs without
|
||
affecting other running tasks on the nodes. (Hemanth Yamijala via ddas)
|
||
|
||
HADOOP-3746. Add a fair share scheduler. (Matei Zaharia via omalley)
|
||
|
||
HADOOP-3754. Add a thrift interface to access HDFS. (dhruba via omalley)
|
||
|
||
HADOOP-3828. Provides a way to write skipped records to DFS.
|
||
(Sharad Agarwal via ddas)
|
||
|
||
HADOOP-3948. Separate name-node edits and fsimage directories.
|
||
(Lohit Vijayarenu via shv)
|
||
|
||
HADOOP-3939. Add an option to DistCp to delete files at the destination
|
||
not present at the source. (Tsz Wo (Nicholas) Sze via cdouglas)
|
||
|
||
HADOOP-3601. Add a new contrib module for Hive, which is a sql-like
|
||
query processing tool that uses map/reduce. (Ashish Thusoo via omalley)
|
||
|
||
HADOOP-3866. Added sort and multi-job updates in the JobTracker web ui.
|
||
(Craig Weisenfluh via omalley)
|
||
|
||
HADOOP-3698. Add access control to control who is allowed to submit or
|
||
modify jobs in the JobTracker. (Hemanth Yamijala via omalley)
|
||
|
||
HADOOP-1869. Support access times for HDFS files. (dhruba)
|
||
|
||
HADOOP-3941. Extend FileSystem API to return file-checksums.
|
||
(szetszwo)
|
||
|
||
HADOOP-3581. Prevents memory intensive user tasks from taking down
|
||
nodes. (Vinod K V via ddas)
|
||
|
||
HADOOP-3970. Provides a way to recover counters written to JobHistory.
|
||
(Amar Kamat via ddas)
|
||
|
||
HADOOP-3702. Adds ChainMapper and ChainReducer classes to allow composing
|
||
chains of Maps and Reduces in a single Map/Reduce job, something like
|
||
MAP+ / REDUCE MAP*. (Alejandro Abdelnur via ddas)
|
||
|
||
HADOOP-3445. Add capacity scheduler that provides guaranteed capacities to
|
||
queues as a percentage of the cluster. (Vivek Ratan via omalley)
|
||
|
||
HADOOP-3992. Add a synthetic load generation facility to the test
|
||
directory. (hairong via szetszwo)
|
||
|
||
HADOOP-3981. Implement a distributed file checksum algorithm in HDFS
|
||
and change DistCp to use file checksum for comparing src and dst files
|
||
(szetszwo)
|
||
|
||
HADOOP-3829. Narrow down skipped records based on user acceptable value.
|
||
(Sharad Agarwal via ddas)
|
||
|
||
HADOOP-3930. Add common interfaces for the pluggable schedulers and the
|
||
cli & gui clients. (Sreekanth Ramakrishnan via omalley)
|
||
|
||
HADOOP-4176. Implement getFileChecksum(Path) in HftpFileSystem. (szetszwo)
|
||
|
||
HADOOP-249. Reuse JVMs across Map-Reduce Tasks.
|
||
Configuration changes to hadoop-default.xml:
|
||
add mapred.job.reuse.jvm.num.tasks
|
||
(Devaraj Das via acmurthy)
|
||
|
||
HADOOP-4070. Provide a mechanism in Hive for registering UDFs from the
|
||
query language. (tomwhite)
|
||
|
||
HADOOP-2536. Implement a JDBC based database input and output formats to
|
||
allow Map-Reduce applications to work with databases. (Fredrik Hedberg and
|
||
Enis Soztutar via acmurthy)
|
||
|
||
HADOOP-3019. A new library to support total order partitions.
|
||
(cdouglas via omalley)
|
||
|
||
HADOOP-3924. Added a 'KILLED' job status. (Subramaniam Krishnan via
|
||
acmurthy)
|
||
|
||
IMPROVEMENTS
|
||
|
||
HADOOP-4205. hive: metastore and ql to use the refactored SerDe library.
|
||
(zshao)
|
||
|
||
HADOOP-4106. libhdfs: add time, permission and user attribute support
|
||
(part 2). (Pete Wyckoff through zshao)
|
||
|
||
HADOOP-4104. libhdfs: add time, permission and user attribute support.
|
||
(Pete Wyckoff through zshao)
|
||
|
||
HADOOP-3908. libhdfs: better error message if libhdfs.so doesn't exist.
|
||
(Pete Wyckoff through zshao)
|
||
|
||
HADOOP-3732. Delay initialization of datanode block verification till
|
||
the verification thread is started. (rangadi)
|
||
|
||
HADOOP-1627. Various small improvements to 'dfsadmin -report' output.
|
||
(rangadi)
|
||
|
||
HADOOP-3577. Tools to inject blocks into name node and simulated
|
||
data nodes for testing. (Sanjay Radia via hairong)
|
||
|
||
HADOOP-2664. Add a lzop compatible codec, so that files compressed by lzop
|
||
may be processed by map/reduce. (cdouglas via omalley)
|
||
|
||
HADOOP-3655. Add additional ant properties to control junit. (Steve
|
||
Loughran via omalley)
|
||
|
||
HADOOP-3543. Update the copyright year to 2008. (cdouglas via omalley)
|
||
|
||
HADOOP-3587. Add a unit test for the contrib/data_join framework.
|
||
(cdouglas)
|
||
|
||
HADOOP-3402. Add terasort example program (omalley)
|
||
|
||
HADOOP-3660. Add replication factor for injecting blocks in simulated
|
||
datanodes. (Sanjay Radia via cdouglas)
|
||
|
||
HADOOP-3684. Add a cloning function to the contrib/data_join framework
|
||
permitting users to define a more efficient method for cloning values from
|
||
the reduce than serialization/deserialization. (Runping Qi via cdouglas)
|
||
|
||
HADOOP-3478. Improves the handling of map output fetching. Now the
|
||
randomization is by the hosts (and not the map outputs themselves).
|
||
(Jothi Padmanabhan via ddas)
|
||
|
||
HADOOP-3617. Removed redundant checks of accounting space in MapTask and
|
||
makes the spill thread persistent so as to avoid creating a new one for
|
||
each spill. (Chris Douglas via acmurthy)
|
||
|
||
HADOOP-3412. Factor the scheduler out of the JobTracker and make
|
||
it pluggable. (Tom White and Brice Arnould via omalley)
|
||
|
||
HADOOP-3756. Minor. Remove unused dfs.client.buffer.dir from
|
||
hadoop-default.xml. (rangadi)
|
||
|
||
HADOOP-3747. Adds counter support for MultipleOutputs.
|
||
(Alejandro Abdelnur via ddas)
|
||
|
||
HADOOP-3169. LeaseChecker daemon should not be started in DFSClient
|
||
constructor. (TszWo (Nicholas), SZE via hairong)
|
||
|
||
HADOOP-3824. Move base functionality of StatusHttpServer to a core
|
||
package. (TszWo (Nicholas), SZE via cdouglas)
|
||
|
||
HADOOP-3646. Add a bzip2 compatible codec, so bzip compressed data
|
||
may be processed by map/reduce. (Abdul Qadeer via cdouglas)
|
||
|
||
HADOOP-3861. MapFile.Reader and Writer should implement Closeable.
|
||
(tomwhite via omalley)
|
||
|
||
HADOOP-3791. Introduce generics into ReflectionUtils. (Chris Smith via
|
||
cdouglas)
|
||
|
||
HADOOP-3694. Improve unit test performance by changing
|
||
MiniDFSCluster to listen only on 127.0.0.1. (cutting)
|
||
|
||
HADOOP-3620. Namenode should synchronously resolve a datanode's network
|
||
location when the datanode registers. (hairong)
|
||
|
||
HADOOP-3860. NNThroughputBenchmark is extended with rename and delete
|
||
benchmarks. (shv)
|
||
|
||
HADOOP-3892. Include unix group name in JobConf. (Matei Zaharia via johan)
|
||
|
||
HADOOP-3875. Change the time period between heartbeats to be relative to
|
||
the end of the heartbeat rpc, rather than the start. This causes better
|
||
behavior if the JobTracker is overloaded. (acmurthy via omalley)
|
||
|
||
HADOOP-3853. Move multiple input format (HADOOP-372) extension to
|
||
library package. (tomwhite via johan)
|
||
|
||
HADOOP-9. Use roulette scheduling for temporary space when the size
|
||
is not known. (Ari Rabkin via omalley)
|
||
|
||
HADOOP-3202. Use recursive delete rather than FileUtil.fullyDelete.
|
||
(Amareshwari Sriramadasu via omalley)
|
||
|
||
HADOOP-3368. Remove common-logging.properties from conf. (Steve Loughran
|
||
via omalley)
|
||
|
||
HADOOP-3851. Fix spelling mistake in FSNamesystemMetrics. (Steve Loughran
|
||
via omalley)
|
||
|
||
HADOOP-3780. Remove asynchronous resolution of network topology in the
|
||
JobTracker (Amar Kamat via omalley)
|
||
|
||
HADOOP-3852. Add ShellCommandExecutor.toString method to make nicer
|
||
error messages. (Steve Loughran via omalley)
|
||
|
||
HADOOP-3844. Include message of local exception in RPC client failures.
|
||
(Steve Loughran via omalley)
|
||
|
||
HADOOP-3935. Split out inner classes from DataNode.java. (johan)
|
||
|
||
HADOOP-3905. Create generic interfaces for edit log streams. (shv)
|
||
|
||
HADOOP-3062. Add metrics to DataNode and TaskTracker to record network
|
||
traffic for HDFS reads/writes and MR shuffling. (cdouglas)
|
||
|
||
HADOOP-3742. Remove HDFS from public java doc and add javadoc-dev for
|
||
generative javadoc for developers. (Sanjay Radia via omalley)
|
||
|
||
HADOOP-3944. Improve documentation for public TupleWritable class in
|
||
join package. (Chris Douglas via enis)
|
||
|
||
HADOOP-2330. Preallocate HDFS transaction log to improve performance.
|
||
(dhruba and hairong)
|
||
|
||
HADOOP-3965. Convert DataBlockScanner into a package private class. (shv)
|
||
|
||
HADOOP-3488. Prevent hadoop-daemon from rsync'ing log files (Stefan
|
||
Groshupf and Craig Macdonald via omalley)
|
||
|
||
HADOOP-3342. Change the kill task actions to require http post instead of
|
||
get to prevent accidental crawls from triggering it. (enis via omalley)
|
||
|
||
HADOOP-3937. Limit the job name in the job history filename to 50
|
||
characters. (Matei Zaharia via omalley)
|
||
|
||
HADOOP-3943. Remove unnecessary synchronization in
|
||
NetworkTopology.pseudoSortByDistance. (hairong via omalley)
|
||
|
||
HADOOP-3498. File globbing alternation should be able to span path
|
||
components. (tomwhite)
|
||
|
||
HADOOP-3361. Implement renames for NativeS3FileSystem.
|
||
(Albert Chern via tomwhite)
|
||
|
||
HADOOP-3605. Make EC2 scripts show an error message if AWS_ACCOUNT_ID is
|
||
unset. (Al Hoang via tomwhite)
|
||
|
||
HADOOP-4147. Remove unused class JobWithTaskContext from class
|
||
JobInProgress. (Amareshwari Sriramadasu via johan)
|
||
|
||
HADOOP-4151. Add a byte-comparable interface that both Text and
|
||
BytesWritable implement. (cdouglas via omalley)
|
||
|
||
HADOOP-4174. Move fs image/edit log methods from ClientProtocol to
|
||
NamenodeProtocol. (shv via szetszwo)
|
||
|
||
HADOOP-4181. Include a .gitignore and saveVersion.sh change to support
|
||
developing under git. (omalley)
|
||
|
||
HADOOP-4186. Factor LineReader out of LineRecordReader. (tomwhite via
|
||
omalley)
|
||
|
||
HADOOP-4184. Break the module dependencies between core, hdfs, and
|
||
mapred. (tomwhite via omalley)
|
||
|
||
HADOOP-4075. test-patch.sh now spits out ant commands that it runs.
|
||
(Ramya R via nigel)
|
||
|
||
HADOOP-4117. Improve configurability of Hadoop EC2 instances.
|
||
(tomwhite)
|
||
|
||
HADOOP-2411. Add support for larger CPU EC2 instance types.
|
||
(Chris K Wensel via tomwhite)
|
||
|
||
HADOOP-4083. Changed the configuration attribute queue.name to
|
||
mapred.job.queue.name. (Hemanth Yamijala via acmurthy)
|
||
|
||
HADOOP-4194. Added the JobConf and JobID to job-related methods in
|
||
JobTrackerInstrumentation for better metrics. (Mac Yang via acmurthy)
|
||
|
||
HADOOP-3975. Change test-patch script to report the working dir
|
||
modifications preventing the suite from being run. (Ramya R via cdouglas)
|
||
|
||
HADOOP-4124. Added a command-line switch to allow users to set job
|
||
priorities, also allow it to be manipulated via the web-ui. (Hemanth
|
||
Yamijala via acmurthy)
|
||
|
||
HADOOP-2165. Augmented JobHistory to include the URIs to the tasks'
|
||
userlogs. (Vinod Kumar Vavilapalli via acmurthy)
|
||
|
||
HADOOP-4062. Remove the synchronization on the output stream when a
|
||
connection is closed and also remove an undesirable exception when
|
||
a client is stopped while there is no pending RPC request. (hairong)
|
||
|
||
HADOOP-4227. Remove the deprecated class org.apache.hadoop.fs.ShellCommand.
|
||
(szetszwo)
|
||
|
||
HADOOP-4006. Clean up FSConstants and move some of the constants to
|
||
better places. (Sanjay Radia via rangadi)
|
||
|
||
HADOOP-4279. Trace the seeds of random sequences in append unit tests to
|
||
make intermittent failures reproducible. (szetszwo via cdouglas)
|
||
|
||
HADOOP-4209. Remove the change to the format of task attempt id by
|
||
incrementing the task attempt numbers by 1000 when the job restarts.
|
||
(Amar Kamat via omalley)
|
||
|
||
HADOOP-4301. Adds forrest doc for the skip bad records feature.
|
||
(Sharad Agarwal via ddas)
|
||
|
||
HADOOP-4354. Separate TestDatanodeDeath.testDatanodeDeath() into 4 tests.
|
||
(szetszwo)
|
||
|
||
HADOOP-3790. Add more unit tests for testing HDFS file append. (szetszwo)
|
||
|
||
HADOOP-4321. Include documentation for the capacity scheduler. (Hemanth
|
||
Yamijala via omalley)
|
||
|
||
HADOOP-4424. Change menu layout for Hadoop documentation (Boris Shkolnik
|
||
via cdouglas).
|
||
|
||
HADOOP-4438. Update forrest documentation to include missing FsShell
|
||
commands. (Suresh Srinivas via cdouglas)
|
||
|
||
HADOOP-4105. Add forrest documentation for libhdfs.
|
||
(Pete Wyckoff via cutting)
|
||
|
||
HADOOP-4510. Make getTaskOutputPath public. (Chris Wensel via omalley)
|
||
|
||
OPTIMIZATIONS
|
||
|
||
HADOOP-3556. Removed lock contention in MD5Hash by changing the
|
||
singleton MessageDigester by an instance per Thread using
|
||
ThreadLocal. (Iván de Prado via omalley)
|
||
|
||
HADOOP-3328. When client is writing data to DFS, only the last
|
||
datanode in the pipeline needs to verify the checksum. Saves around
|
||
30% CPU on intermediate datanodes. (rangadi)
|
||
|
||
HADOOP-3863. Use a thread-local string encoder rather than a static one
|
||
that is protected by a lock. (acmurthy via omalley)
|
||
|
||
HADOOP-3864. Prevent the JobTracker from locking up when a job is being
|
||
initialized. (acmurthy via omalley)
|
||
|
||
HADOOP-3816. Faster directory listing in KFS. (Sriram Rao via omalley)
|
||
|
||
HADOOP-2130. Pipes submit job should have both blocking and non-blocking
|
||
versions. (acmurthy via omalley)
|
||
|
||
HADOOP-3769. Make the SampleMapper and SampleReducer from
|
||
GenericMRLoadGenerator public, so they can be used in other contexts.
|
||
(Lingyun Yang via omalley)
|
||
|
||
HADOOP-3514. Inline the CRCs in intermediate files as opposed to reading
|
||
it from a different .crc file. (Jothi Padmanabhan via ddas)
|
||
|
||
HADOOP-3638. Caches the iFile index files in memory to reduce seeks
|
||
(Jothi Padmanabhan via ddas)
|
||
|
||
HADOOP-4225. FSEditLog.logOpenFile() should persist accessTime
|
||
rather than modificationTime. (shv)
|
||
|
||
HADOOP-4380. Made several new classes (Child, JVMId,
|
||
JobTrackerInstrumentation, QueueManager, ResourceEstimator,
|
||
TaskTrackerInstrumentation, and TaskTrackerMetricsInst) in
|
||
org.apache.hadoop.mapred package private instead of public. (omalley)
|
||
|
||
BUG FIXES
|
||
|
||
HADOOP-3563. Refactor the distributed upgrade code so that it is
|
||
easier to identify datanode and namenode related code. (dhruba)
|
||
|
||
HADOOP-3640. Fix the read method in the NativeS3InputStream. (tomwhite via
|
||
omalley)
|
||
|
||
HADOOP-3711. Fixes the Streaming input parsing to properly find the
|
||
separator. (Amareshwari Sriramadasu via ddas)
|
||
|
||
HADOOP-3725. Prevent TestMiniMRMapDebugScript from swallowing exceptions.
|
||
(Steve Loughran via cdouglas)
|
||
|
||
HADOOP-3726. Throw exceptions from TestCLI setup and teardown instead of
|
||
swallowing them. (Steve Loughran via cdouglas)
|
||
|
||
HADOOP-3721. Refactor CompositeRecordReader and related mapred.join classes
|
||
to make them clearer. (cdouglas)
|
||
|
||
HADOOP-3720. Re-read the config file when dfsadmin -refreshNodes is invoked
|
||
so dfs.hosts and dfs.hosts.exclude are observed. (lohit vijayarenu via
|
||
cdouglas)
|
||
|
||
HADOOP-3485. Allow writing to files over fuse.
|
||
(Pete Wyckoff via dhruba)
|
||
|
||
HADOOP-3723. The flags to the libhdfs.create call can be treated as
|
||
a bitmask. (Pete Wyckoff via dhruba)
|
||
|
||
HADOOP-3643. Filter out completed tasks when asking for running tasks in
|
||
the JobTracker web/ui. (Amar Kamat via omalley)
|
||
|
||
HADOOP-3777. Ensure that Lzo compressors/decompressors correctly handle the
|
||
case where native libraries aren't available. (Chris Douglas via acmurthy)
|
||
|
||
HADOOP-3728. Fix SleepJob so that it doesn't depend on temporary files,
|
||
this ensures we can now run more than one instance of SleepJob
|
||
simultaneously. (Chris Douglas via acmurthy)
|
||
|
||
HADOOP-3795. Fix saving image files on Namenode with different checkpoint
|
||
stamps. (Lohit Vijayarenu via mahadev)
|
||
|
||
HADOOP-3624. Improving createeditslog to create tree directory structure.
|
||
(Lohit Vijayarenu via mahadev)
|
||
|
||
HADOOP-3778. DFSInputStream.seek() did not retry in case of some errors.
|
||
(Luo Ning via rangadi)
|
||
|
||
HADOOP-3661. The handling of moving files deleted through fuse-dfs to
|
||
Trash made similar to the behaviour from dfs shell.
|
||
(Pete Wyckoff via dhruba)
|
||
|
||
HADOOP-3819. Unset LANG and LC_CTYPE in saveVersion.sh to make it
|
||
compatible with non-English locales. (Rong-En Fan via cdouglas)
|
||
|
||
HADOOP-3848. Cache calls to getSystemDir in the TaskTracker instead of
|
||
calling it for each task start. (acmurthy via omalley)
|
||
|
||
HADOOP-3131. Fix reduce progress reporting for compressed intermediate
|
||
data. (Matei Zaharia via acmurthy)
|
||
|
||
HADOOP-3796. fuse-dfs configuration is implemented as file system
|
||
mount options. (Pete Wyckoff via dhruba)
|
||
|
||
HADOOP-3836. Fix TestMultipleOutputs to correctly clean up. (Alejandro
|
||
Abdelnur via acmurthy)
|
||
|
||
HADOOP-3805. Improve fuse-dfs write performance.
|
||
(Pete Wyckoff via zshao)
|
||
|
||
HADOOP-3846. Fix unit test CreateEditsLog to generate paths correctly.
|
||
(Lohit Vijayarenu via cdouglas)
|
||
|
||
HADOOP-3904. Fix unit tests using the old dfs package name.
|
||
(TszWo (Nicholas), SZE via johan)
|
||
|
||
HADOOP-3319. Fix some HOD error messages to go to stderr instead of
|
||
stdout. (Vinod Kumar Vavilapalli via omalley)
|
||
|
||
HADOOP-3907. Move INodeDirectoryWithQuota to its own .java file.
|
||
(Tsz Wo (Nicholas), SZE via hairong)
|
||
|
||
HADOOP-3919. Fix attribute name in hadoop-default for
|
||
mapred.jobtracker.instrumentation. (Ari Rabkin via omalley)
|
||
|
||
HADOOP-3903. Change the package name for the servlets to be hdfs instead of
|
||
dfs. (Tsz Wo (Nicholas) Sze via omalley)
|
||
|
||
HADOOP-3773. Change Pipes to set the default map output key and value
|
||
types correctly. (Koji Noguchi via omalley)
|
||
|
||
HADOOP-3952. Fix compilation error in TestDataJoin referencing dfs package.
|
||
(omalley)
|
||
|
||
HADOOP-3951. Fix package name for FSNamesystem logs and modify other
|
||
hard-coded Logs to use the class name. (cdouglas)
|
||
|
||
HADOOP-3889. Improve error reporting from HftpFileSystem, handling in
|
||
DistCp. (Tsz Wo (Nicholas), SZE via cdouglas)
|
||
|
||
HADOOP-3946. Fix TestMapRed after HADOOP-3664. (tomwhite via omalley)
|
||
|
||
HADOOP-3949. Remove duplicate jars from Chukwa. (Jerome Boulon via omalley)
|
||
|
||
HADOOP-3933. DataNode sometimes sends up to io.bytes.per.checksum bytes
|
||
more than required to client. (Ning Li via rangadi)
|
||
|
||
HADOOP-3962. Shell command "fs -count" should support paths with different
|
||
file systems. (Tsz Wo (Nicholas), SZE via mahadev)
|
||
|
||
HADOOP-3957. Fix javac warnings in DistCp and TestCopyFiles. (Tsz Wo
|
||
(Nicholas), SZE via cdouglas)
|
||
|
||
HADOOP-3958. Fix TestMapRed to check the success of test-job. (omalley via
|
||
acmurthy)
|
||
|
||
HADOOP-3985. Fix TestHDFSServerPorts to use random ports. (Hairong Kuang
|
||
via omalley)
|
||
|
||
HADOOP-3964. Fix javadoc warnings introduced by FailMon. (dhruba)
|
||
|
||
HADOOP-3785. Fix FileSystem cache to be case-insensitive for scheme and
|
||
authority. (Bill de hOra via cdouglas)
|
||
|
||
HADOOP-3506. Fix a rare NPE caused by error handling in S3. (Tom White via
|
||
cdouglas)
|
||
|
||
HADOOP-3705. Fix mapred.join parser to accept InputFormats named with
|
||
underscore and static, inner classes. (cdouglas)
|
||
|
||
HADOOP-4023. Fix javadoc warnings introduced when the HDFS javadoc was
|
||
made private. (omalley)
|
||
|
||
HADOOP-4030. Remove lzop from the default list of codecs. (Arun Murthy via
|
||
cdouglas)
|
||
|
||
HADOOP-3961. Fix task disk space requirement estimates for virtual
|
||
input jobs. Delays limiting task placement until after 10% of the maps
|
||
have finished. (Ari Rabkin via omalley)
|
||
|
||
HADOOP-2168. Fix problem with C++ record reader's progress not being
|
||
reported to framework. (acmurthy via omalley)
|
||
|
||
HADOOP-3966. Copy findbugs generated output files to PATCH_DIR while
|
||
running test-patch. (Ramya R via lohit)
|
||
|
||
HADOOP-4037. Fix the eclipse plugin for versions of kfs and log4j. (nigel
|
||
via omalley)
|
||
|
||
HADOOP-3950. Cause the Mini MR cluster to wait for task trackers to
|
||
register before continuing. (enis via omalley)
|
||
|
||
HADOOP-3910. Remove unused ClusterTestDFSNamespaceLogging and
|
||
ClusterTestDFS. (Tsz Wo (Nicholas), SZE via cdouglas)
|
||
|
||
HADOOP-3954. Disable record skipping by default. (Sharad Agarwal via
|
||
cdouglas)
|
||
|
||
HADOOP-4050. Fix TestFairScheduler to use absolute paths for the work
|
||
directory. (Matei Zaharia via omalley)
|
||
|
||
HADOOP-4069. Keep temporary test files from TestKosmosFileSystem under
|
||
test.build.data instead of /tmp. (lohit via omalley)
|
||
|
||
HADOOP-4078. Create test files for TestKosmosFileSystem in separate
|
||
directory under test.build.data. (lohit)
|
||
|
||
HADOOP-3968. Fix getFileBlockLocations calls to use FileStatus instead
|
||
of Path reflecting the new API. (Pete Wyckoff via lohit)
|
||
|
||
HADOOP-3963. libhdfs does not exit on its own, instead it returns error
|
||
to the caller and behaves as a true library. (Pete Wyckoff via dhruba)
|
||
|
||
HADOOP-4100. Removes the cleanupTask scheduling from the Scheduler
|
||
implementations and moves it to the JobTracker.
|
||
(Amareshwari Sriramadasu via ddas)
|
||
|
||
HADOOP-4097. Make hive work well with speculative execution turned on.
|
||
(Joydeep Sen Sarma via dhruba)
|
||
|
||
HADOOP-4113. Changes to libhdfs to not exit on its own, rather return
|
||
an error code to the caller. (Pete Wyckoff via dhruba)
|
||
|
||
HADOOP-4054. Remove duplicate lease removal during edit log loading.
|
||
(hairong)
|
||
|
||
HADOOP-4071. FSNameSystem.isReplicationInProgress should add an
|
||
underReplicated block to the neededReplication queue using method
|
||
"add" not "update". (hairong)
|
||
|
||
HADOOP-4154. Fix type warnings in WritableUtils. (szetszwo via omalley)
|
||
|
||
HADOOP-4133. Log files generated by Hive should reside in the
|
||
build directory. (Prasad Chakka via dhruba)
|
||
|
||
HADOOP-4094. Hive now has hive-default.xml and hive-site.xml similar
|
||
to core hadoop. (Prasad Chakka via dhruba)
|
||
|
||
HADOOP-4112. Handles cleanupTask in JobHistory
|
||
(Amareshwari Sriramadasu via ddas)
|
||
|
||
HADOOP-3831. Very slow reading clients sometimes failed while reading.
|
||
(rangadi)
|
||
|
||
HADOOP-4155. Use JobTracker's start time while initializing JobHistory's
|
||
JobTracker Unique String. (lohit)
|
||
|
||
HADOOP-4099. Fix null pointer when using HFTP from an 0.18 server.
|
||
(dhruba via omalley)
|
||
|
||
HADOOP-3570. Includes user specified libjar files in the client side
|
||
classpath. (Sharad Agarwal via ddas)
|
||
|
||
HADOOP-4129. Changed memory limits of TaskTracker and Tasks to be in
|
||
KiloBytes rather than bytes. (Vinod Kumar Vavilapalli via acmurthy)
|
||
|
||
HADOOP-4139. Optimize Hive multi group-by.
|
||
(Namit Jain via dhruba)
|
||
|
||
HADOOP-3911. Add a check to fsck options to make sure -files is not
|
||
the first option to resolve conflicts with GenericOptionsParser
|
||
(lohit)
|
||
|
||
HADOOP-3623. Refactor LeaseManager. (szetszwo)
|
||
|
||
HADOOP-4125. Handles Reduce cleanup tip on the web ui.
|
||
(Amareshwari Sriramadasu via ddas)
|
||
|
||
HADOOP-4087. Hive Metastore API for php and python clients.
|
||
(Prasad Chakka via dhruba)
|
||
|
||
HADOOP-4197. Update DATA_TRANSFER_VERSION for HADOOP-3981. (szetszwo)
|
||
|
||
HADOOP-4138. Refactor the Hive SerDe library to better structure
|
||
the interfaces to the serializer and de-serializer.
|
||
(Zheng Shao via dhruba)
|
||
|
||
HADOOP-4195. Close compressor before returning to codec pool.
|
||
(acmurthy via omalley)
|
||
|
||
HADOOP-2403. Escapes some special characters before logging to
|
||
history files. (Amareshwari Sriramadasu via ddas)
|
||
|
||
HADOOP-4200. Fix a bug in the test-patch.sh script.
|
||
(Ramya R via nigel)
|
||
|
||
HADOOP-4084. Add explain plan capabilities to Hive Query Language.
|
||
(Ashish Thusoo via dhruba)
|
||
|
||
HADOOP-4121. Preserve cause for exception if the initialization of
|
||
HistoryViewer for JobHistory fails. (Amareshwari Sri Ramadasu via
|
||
acmurthy)
|
||
|
||
HADOOP-4213. Fixes NPE in TestLimitTasksPerJobTaskScheduler.
|
||
(Sreekanth Ramakrishnan via ddas)
|
||
|
||
HADOOP-4077. Setting access and modification time for a file
|
||
requires write permissions on the file. (dhruba)
|
||
|
||
HADOOP-3592. Fix a couple of possible file leaks in FileUtil
|
||
(Bill de hOra via rangadi)
|
||
|
||
HADOOP-4120. Hive interactive shell records the time taken by a
|
||
query. (Raghotham Murthy via dhruba)
|
||
|
||
HADOOP-4090. The hive scripts pick up hadoop from HADOOP_HOME
|
||
and then the path. (Raghotham Murthy via dhruba)
|
||
|
||
HADOOP-4242. Remove extra ";" in FSDirectory that blocks compilation
|
||
in some IDE's. (szetszwo via omalley)
|
||
|
||
HADOOP-4249. Fix eclipse path to include the hsqldb.jar. (szetszwo via
|
||
omalley)
|
||
|
||
HADOOP-4247. Move InputSampler into org.apache.hadoop.mapred.lib, so that
|
||
examples.jar doesn't depend on tools.jar. (omalley)
|
||
|
||
HADOOP-4269. Fix the deprecation of LineReader by extending the new class
|
||
into the old name and deprecating it. Also update the tests to test the
|
||
new class. (cdouglas via omalley)
|
||
|
||
HADOOP-4280. Fix conversions between seconds in C and milliseconds in
|
||
Java for access times for files. (Pete Wyckoff via rangadi)
|
||
|
||
HADOOP-4254. -setSpaceQuota command does not convert "TB" extension to
|
||
terabytes properly. Implementation now uses StringUtils for parsing this.
|
||
(Raghu Angadi)
|
||
|
||
HADOOP-4259. Findbugs should run over tools.jar also. (cdouglas via
|
||
omalley)
|
||
|
||
HADOOP-4275. Move public method isJobValidName from JobID to a private
|
||
method in JobTracker. (omalley)
|
||
|
||
HADOOP-4173. Fix failures in TestProcfsBasedProcessTree and
|
||
TestTaskTrackerMemoryManager tests. ProcfsBasedProcessTree and
|
||
memory management in TaskTracker are disabled on Windows.
|
||
(Vinod K V via rangadi)
|
||
|
||
HADOOP-4189. Fixes the history blocksize & intertracker protocol version
|
||
issues introduced as part of HADOOP-3245. (Amar Kamat via ddas)
|
||
|
||
HADOOP-4190. Fixes the backward compatibility issue with Job History
|
||
introduced by HADOOP-3245 and HADOOP-2403. (Amar Kamat via ddas)
|
||
|
||
HADOOP-4237. Fixes the TestStreamingBadRecords.testNarrowDown testcase.
|
||
(Sharad Agarwal via ddas)
|
||
|
||
HADOOP-4274. Capacity scheduler accidentally modifies the underlying
|
||
data structures when browsing the job lists. (Hemanth Yamijala via omalley)
|
||
|
||
HADOOP-4309. Fix eclipse-plugin compilation. (cdouglas)
|
||
|
||
HADOOP-4232. Fix race condition in JVM reuse when multiple slots become
|
||
free. (ddas via acmurthy)
|
||
|
||
HADOOP-4302. Fix a race condition in TestReduceFetch that can yield false
|
||
negatives. (cdouglas)
|
||
|
||
HADOOP-3942. Update distcp documentation to include features introduced in
|
||
HADOOP-3873, HADOOP-3939. (Tsz Wo (Nicholas), SZE via cdouglas)
|
||
|
||
HADOOP-4319. fuse-dfs dfs_read function returns as many bytes as it is
|
||
told to read unless end-of-file is reached. (Pete Wyckoff via dhruba)
|
||
|
||
HADOOP-4246. Ensure we have the correct lower bound on the number of
|
||
retries for fetching map-outputs; also fixed the case where the reducer
|
||
automatically kills on too many unique map-outputs could not be fetched
|
||
for small jobs. (Amareshwari Sri Ramadasu via acmurthy)
|
||
|
||
HADOOP-4163. Report FSErrors from map output fetch threads instead of
|
||
merely logging them. (Sharad Agarwal via cdouglas)
|
||
|
||
HADOOP-4261. Adds a setup task for jobs. This is required so that we
|
||
don't setup jobs that haven't been inited yet (since init could lead
|
||
to job failure). Only after the init has successfully happened do we
|
||
launch the setupJob task. (Amareshwari Sriramadasu via ddas)
|
||
|
||
HADOOP-4256. Removes Completed and Failed Job tables from
|
||
jobqueue_details.jsp. (Sreekanth Ramakrishnan via ddas)
|
||
|
||
HADOOP-4267. Occasional exceptions during shutting down HSQLDB are logged
|
||
but not rethrown. (enis)
|
||
|
||
HADOOP-4018. The number of tasks for a single job cannot exceed a
|
||
pre-configured maximum value. (dhruba)
|
||
|
||
HADOOP-4288. Fixes a NPE problem in CapacityScheduler.
|
||
(Amar Kamat via ddas)
|
||
|
||
HADOOP-4014. Create hard links with 'fsutil hardlink' on Windows. (shv)
|
||
|
||
HADOOP-4393. Merged org.apache.hadoop.fs.permission.AccessControlException
|
||
and org.apache.hadoop.security.AccessControlIOException into a single
|
||
class hadoop.security.AccessControlException. (omalley via acmurthy)
|
||
|
||
HADOOP-4287. Fixes an issue to do with maintaining counts of running/pending
|
||
maps/reduces. (Sreekanth Ramakrishnan via ddas)
|
||
|
||
HADOOP-4361. Makes sure that jobs killed from command line are killed
|
||
fast (i.e., there is a slot to run the cleanup task soon).
|
||
(Amareshwari Sriramadasu via ddas)
|
||
|
||
HADOOP-4400. Add "hdfs://" to fs.default.name on quickstart.html.
|
||
(Jeff Hammerbacher via omalley)
|
||
|
||
HADOOP-4378. Fix TestJobQueueInformation to use SleepJob rather than
|
||
WordCount via TestMiniMRWithDFS. (Sreekanth Ramakrishnan via acmurthy)
|
||
|
||
HADOOP-4376. Fix formatting in hadoop-default.xml for
|
||
hadoop.http.filter.initializers. (Enis Soztutar via acmurthy)
|
||
|
||
HADOOP-4410. Adds an extra arg to the API FileUtil.makeShellPath to
|
||
determine whether to canonicalize file paths or not.
|
||
(Amareshwari Sriramadasu via ddas)
|
||
|
||
HADOOP-4236. Ensure un-initialized jobs are killed correctly on
|
||
user-demand. (Sharad Agarwal via acmurthy)
|
||
|
||
HADOOP-4373. Fix calculation of Guaranteed Capacity for the
|
||
capacity-scheduler. (Hemanth Yamijala via acmurthy)
|
||
|
||
HADOOP-4053. Schedulers must be notified when jobs complete. (Amar Kamat via omalley)
|
||
|
||
HADOOP-4335. Fix FsShell -ls for filesystems without owners/groups. (David
|
||
Phillips via cdouglas)
|
||
|
||
HADOOP-4426. TestCapacityScheduler broke due to the two commits HADOOP-4053
|
||
and HADOOP-4373. This patch fixes that. (Hemanth Yamijala via ddas)
|
||
|
||
HADOOP-4418. Updates documentation in forrest for Mapred, streaming and pipes.
|
||
(Amareshwari Sriramadasu via ddas)
|
||
|
||
HADOOP-3155. Ensure that there is only one thread fetching
|
||
TaskCompletionEvents on TaskTracker re-init. (Dhruba Borthakur via
|
||
acmurthy)
|
||
|
||
HADOOP-4425. Fix EditLogInputStream to overload the bulk read method.
|
||
(cdouglas)
|
||
|
||
HADOOP-4427. Adds the new queue/job commands to the manual.
|
||
(Sreekanth Ramakrishnan via ddas)
|
||
|
||
HADOOP-4278. Increase debug logging for unit test TestDatanodeDeath.
|
||
Fix the case when primary is dead. (dhruba via szetszwo)
|
||
|
||
HADOOP-4423. Keep block length when the block recovery is triggered by
|
||
append. (szetszwo)
|
||
|
||
HADOOP-4449. Fix dfsadmin usage. (Raghu Angadi via cdouglas)
|
||
|
||
HADOOP-4455. Added TestSerDe so that unit tests can run successfully.
|
||
(Ashish Thusoo via dhruba)
|
||
|
||
HADOOP-4457. Fixes an input split logging problem introduced by
|
||
HADOOP-3245. (Amareshwari Sriramadasu via ddas)
|
||
|
||
HADOOP-4464. Separate out TestFileCreationClient from TestFileCreation.
|
||
(Tsz Wo (Nicholas), SZE via cdouglas)
|
||
|
||
HADOOP-4404. saveFSImage() removes files from a storage directory that do
|
||
not correspond to its type. (shv)
|
||
|
||
HADOOP-4149. Fix handling of updates to the job priority, by changing the
|
||
list of jobs to be keyed by the priority, submit time, and job tracker id.
|
||
(Amar Kamat via omalley)
|
||
|
||
HADOOP-4296. Fix job client failures by not retiring a job as soon as it
|
||
is finished. (dhruba)
|
||
|
||
HADOOP-4439. Remove configuration variables that aren't usable yet, in
|
||
particular mapred.tasktracker.tasks.maxmemory and mapred.task.max.memory.
|
||
(Hemanth Yamijala via omalley)
|
||
|
||
HADOOP-4230. Fix for serde2 interface, limit operator, select * operator,
|
||
UDF trim functions and sampling. (Ashish Thusoo via dhruba)
|
||
|
||
HADOOP-4358. No need to truncate access time in INode. Also fixes NPE
|
||
in CreateEditsLog. (Raghu Angadi)
|
||
|
||
HADOOP-4387. TestHDFSFileSystemContract fails on windows nightly builds.
|
||
(Raghu Angadi)
|
||
|
||
HADOOP-4466. Ensure that SequenceFileOutputFormat isn't tied to Writables
|
||
and can be used with other Serialization frameworks. (Chris Wensel via
|
||
acmurthy)
|
||
|
||
HADOOP-4525. Fix ipc.server.ipcnodelay originally missed in HADOOP-2232.
|
||
(cdouglas via Clint Morgan)
|
||
|
||
HADOOP-4498. Ensure that JobHistory correctly escapes the job name so that
|
||
regex patterns work. (Chris Wensel via acmurthy)
|
||
|
||
HADOOP-4446. Modify guaranteed capacity labels in capacity scheduler's UI
|
||
to reflect the information being displayed. (Sreekanth Ramakrishnan via
|
||
yhemanth)
|
||
|
||
HADOOP-4282. Some user facing URLs are not filtered by user filters.
|
||
(szetszwo)
|
||
|
||
HADOOP-4595. Fixes two race conditions - one to do with updating free slot count,
|
||
and another to do with starting the MapEventsFetcher thread. (ddas)
|
||
|
||
HADOOP-4552. Fix a deadlock in RPC server. (Raghu Angadi)
|
||
|
||
HADOOP-4471. Sort running jobs by priority in the capacity scheduler.
|
||
(Amar Kamat via yhemanth)
|
||
|
||
HADOOP-4500. Fix MultiFileSplit to get the FileSystem from the relevant
|
||
path rather than the JobClient. (Joydeep Sen Sarma via cdouglas)
|
||
|
||
Release 0.18.4 - Unreleased
|
||
|
||
BUG FIXES
|
||
|
||
HADOOP-5114. Remove timeout for accept() in DataNode. This makes accept()
|
||
fail in JDK on Windows and causes many tests to fail. (Raghu Angadi)
|
||
|
||
HADOOP-5192. Block receiver should not remove a block that's created or
|
||
being written by other threads. (hairong)
|
||
|
||
HADOOP-5134. FSNamesystem#commitBlockSynchronization adds under-construction
|
||
block locations to blocksMap. (Dhruba Borthakur via hairong)
|
||
|
||
HADOOP-5412. Simulated DataNode should not write to a block that's being
|
||
written by another thread. (hairong)
|
||
|
||
HADOOP-5465. Fix the problem of blocks remaining under-replicated by
|
||
providing synchronized modification to the counter xmitsInProgress in
|
||
DataNode. (hairong)
|
||
|
||
HADOOP-5557. Fixes some minor problems in TestOverReplicatedBlocks.
|
||
(szetszwo)
|
||
|
||
HADOOP-5644. Namenode is stuck in safe mode. (suresh Srinivas via hairong)
|
||
|
||
HADOOP-6017. Lease Manager in NameNode does not handle certain characters
|
||
in filenames. This results in fatal errors in Secondary NameNode and while
|
||
restarting NameNode. (Tsz Wo (Nicholas), SZE via rangadi)
|
||
|
||
Release 0.18.3 - 2009-01-27
|
||
|
||
IMPROVEMENTS
|
||
|
||
HADOOP-4150. Include librecordio in hadoop releases. (Giridharan Kesavan
|
||
via acmurthy)
|
||
|
||
HADOOP-4668. Improve documentation for setCombinerClass to clarify the
|
||
restrictions on combiners. (omalley)
|
||
|
||
BUG FIXES
|
||
|
||
HADOOP-4499. DFSClient should invoke checksumOk only once. (Raghu Angadi)
|
||
|
||
HADOOP-4597. Calculate mis-replicated blocks when safe-mode is turned
|
||
off manually. (shv)
|
||
|
||
HADOOP-3121. lsr should keep listing the remaining items but not
|
||
terminate if there is any IOException. (szetszwo)
|
||
|
||
HADOOP-4610. Always calculate mis-replicated blocks when safe-mode is
|
||
turned off. (shv)
|
||
|
||
HADOOP-3883. Limit namenode to assign at most one generation stamp for
|
||
a particular block within a short period. (szetszwo)
|
||
|
||
HADOOP-4556. Block went missing. (hairong)
|
||
|
||
HADOOP-4643. NameNode should exclude excessive replicas when counting
|
||
live replicas for a block. (hairong)
|
||
|
||
HADOOP-4703. Should not wait for proxy forever in lease recovering.
|
||
(szetszwo)
|
||
|
||
HADOOP-4647. NamenodeFsck should close the DFSClient it has created.
|
||
(szetszwo)
|
||
|
||
HADOOP-4616. Fuse-dfs can handle bad values from FileSystem.read call.
|
||
(Pete Wyckoff via dhruba)
|
||
|
||
HADOOP-4061. Throttle Datanode decommission monitoring in Namenode.
|
||
(szetszwo)
|
||
|
||
HADOOP-4659. Root cause of connection failure is being lost to code that
|
||
uses it for delaying startup. (Steve Loughran and Hairong via hairong)
|
||
|
||
HADOOP-4614. Lazily open segments when merging map spills to avoid using
|
||
too many file descriptors. (Yuri Pradkin via cdouglas)
|
||
|
||
HADOOP-4257. The DFS client should pick only one datanode as the candidate
|
||
to initiate lease recovery. (Tsz Wo (Nicholas), SZE via dhruba)
|
||
|
||
HADOOP-4713. Fix librecordio to handle records larger than 64k. (Christian
|
||
Kunz via cdouglas)
|
||
|
||
HADOOP-4635. Fix a memory leak in fuse dfs. (pete wyckoff via mahadev)
|
||
|
||
HADOOP-4714. Report status between merges and make the number of records
|
||
between progress reports configurable. (Jothi Padmanabhan via cdouglas)
|
||
|
||
HADOOP-4726. Fix documentation typos "the the". (Edward J. Yoon via
|
||
szetszwo)
|
||
|
||
HADOOP-4679. Datanode prints tons of log messages: waiting for threadgroup
|
||
to exit, active threads is XX. (hairong)
|
||
|
||
HADOOP-4746. Job output directory should be normalized. (hairong)
|
||
|
||
HADOOP-4717. Removal of default port# in NameNode.getUri() causes a
|
||
map/reduce job to fail to promote temporary output. (hairong)
|
||
|
||
HADOOP-4778. Check for zero size block meta file when updating a block.
|
||
(szetszwo)
|
||
|
||
HADOOP-4742. Replica gets deleted by mistake. (Wang Xu via hairong)
|
||
|
||
HADOOP-4702. Failed block replication leaves an incomplete block in
|
||
receiver's tmp data directory. (hairong)
|
||
|
||
HADOOP-4613. Fix block browsing on Web UI. (Johan Oskarsson via shv)
|
||
|
||
HADOOP-4806. HDFS rename should not use src path as a regular expression.
|
||
(szetszwo)
|
||
|
||
HADOOP-4795. Prevent lease monitor getting into an infinite loop when
|
||
leases and the namespace tree do not match. (szetszwo)
|
||
|
||
HADOOP-4620. Fixes Streaming to correctly handle the cases of map/reduce with empty
|
||
input/output. (Ravi Gummadi via ddas)
|
||
|
||
HADOOP-4857. Fixes TestUlimit to have exactly 1 map in the jobs spawned.
|
||
(Ravi Gummadi via ddas)
|
||
|
||
HADOOP-4810. Data lost at cluster startup time. (hairong)
|
||
|
||
HADOOP-4797. Improve how RPC server reads and writes large buffers. Avoids
|
||
soft-leak of direct buffers and excess copies in NIO layer. (Raghu Angadi)
|
||
|
||
HADOOP-4840. TestNodeCount sometimes fails with NullPointerException.
|
||
(hairong)
|
||
|
||
HADOOP-4904. Fix deadlock while leaving safe mode. (shv)
|
||
|
||
HADOOP-1980. 'dfsadmin -safemode enter' should prevent the namenode from
|
||
leaving safemode automatically. (shv & Raghu Angadi)
|
||
|
||
HADOOP-4951. Lease monitor should acquire the LeaseManager lock but not the
|
||
Monitor lock. (szetszwo)
|
||
|
||
HADOOP-4935. processMisReplicatedBlocks() should not clear
|
||
excessReplicateMap. (shv)
|
||
|
||
HADOOP-4961. Fix ConcurrentModificationException in lease recovery
|
||
of empty files. (shv)
|
||
|
||
HADOOP-4971. A long (unexpected) delay at datanodes could make subsequent
|
||
block reports from many datanodes arrive at the same time. (Raghu Angadi)
|
||
|
||
HADOOP-4910. NameNode should exclude replicas when choosing excessive
|
||
replicas to delete to avoid data loss. (hairong)
|
||
|
||
HADOOP-4983. Fixes a problem in updating Counters in the status reporting.
|
||
(Amareshwari Sriramadasu via ddas)
|
||
|
||
Release 0.18.2 - 2008-11-03
|
||
|
||
BUG FIXES
|
||
|
||
HADOOP-3614. Fix a bug that Datanode may use an old GenerationStamp to get
|
||
meta file. (szetszwo)
|
||
|
||
HADOOP-4314. Simulated datanodes should not include blocks that are still
|
||
being written in their block report. (Raghu Angadi)
|
||
|
||
HADOOP-4228. dfs datanode metrics, bytes_read and bytes_written, overflow
|
||
due to incorrect type used. (hairong)
|
||
|
||
HADOOP-4395. The FSEditLog loading is incorrect for the case OP_SET_OWNER.
|
||
(szetszwo)
|
||
|
||
HADOOP-4351. FSNamesystem.getBlockLocationsInternal throws
|
||
ArrayIndexOutOfBoundsException. (hairong)
|
||
|
||
HADOOP-4403. Make TestLeaseRecovery and TestFileCreation more robust.
|
||
(szetszwo)
|
||
|
||
HADOOP-4292. Do not support append() for LocalFileSystem. (hairong)
|
||
|
||
HADOOP-4399. Make fuse-dfs multi-thread access safe.
|
||
(Pete Wyckoff via dhruba)
|
||
|
||
HADOOP-4369. Use setMetric(...) instead of incrMetric(...) for metrics
|
||
averages. (Brian Bockelman via szetszwo)
|
||
|
||
HADOOP-4469. Rename and add the ant task jar file to the tar file. (nigel)
|
||
|
||
HADOOP-3914. DFSClient sends Checksum Ok only once for a block.
|
||
(Christian Kunz via hairong)
|
||
|
||
HADOOP-4467. SerializationFactory now uses the current context ClassLoader
|
||
allowing for user supplied Serialization instances. (Chris Wensel via
|
||
acmurthy)
|
||
|
||
HADOOP-4517. Release FSDataset lock before joining ongoing create threads.
|
||
(szetszwo)
|
||
|
||
HADOOP-4526. fsck failing with NullPointerException. (hairong)
|
||
|
||
HADOOP-4483. Honor the max parameter in DatanodeDescriptor.getBlockArray(..).
|
||
(Ahad Rana and Hairong Kuang via szetszwo)
|
||
|
||
HADOOP-4340. Correctly set the exit code from JobShell.main so that the
|
||
'hadoop jar' command returns the right code to the user. (acmurthy)
|
||
|
||
NEW FEATURES
|
||
|
||
HADOOP-2421. Add jdiff output to documentation, listing all API
|
||
changes from the prior release. (cutting)
|
||
|
||
Release 0.18.1 - 2008-09-17
|
||
|
||
IMPROVEMENTS
|
||
|
||
HADOOP-3934. Upgrade log4j to 1.2.15. (omalley)
|
||
|
||
BUG FIXES
|
||
|
||
HADOOP-3995. In case of quota failure on HDFS, rename does not restore
|
||
source filename. (rangadi)
|
||
|
||
HADOOP-3821. Prevent SequenceFile and IFile from duplicating codecs in
|
||
CodecPool when closed more than once. (Arun Murthy via cdouglas)
|
||
|
||
HADOOP-4040. Remove coded default of the IPC idle connection timeout
|
||
from the TaskTracker, which was causing HDFS client connections to not be
|
||
collected. (ddas via omalley)
|
||
|
||
HADOOP-4046. Made WritableComparable's constructor protected instead of
|
||
private to re-enable class derivation. (cdouglas via omalley)
|
||
|
||
HADOOP-3940. Fix in-memory merge condition to wait when there are no map
|
||
outputs or when the final map outputs are being fetched without contention.
|
||
(cdouglas)
|
||
|
||
Release 0.18.0 - 2008-08-19
|
||
|
||
INCOMPATIBLE CHANGES
|
||
|
||
HADOOP-2703. The default options to fsck skip checking files
|
||
that are being written to. The output of fsck is incompatible
|
||
with the previous release. (lohit vijayarenu via dhruba)
|
||
|
||
HADOOP-2865. FsShell.ls() printout format changed to print file names
|
||
in the end of the line. (Edward J. Yoon via shv)
|
||
|
||
HADOOP-3283. The Datanode has a RPC server. It currently supports
|
||
two RPCs: the first RPC retrieves the metadata about a block and the
|
||
second RPC sets the generation stamp of an existing block.
|
||
(Tsz Wo (Nicholas), SZE via dhruba)
|
||
|
||
HADOOP-2797. Code related to upgrading to 0.14 (Block CRCs) is
|
||
removed. As a result, upgrade to 0.18 or later from 0.13 or earlier
|
||
is not supported. If upgrading from 0.13 or earlier is required,
|
||
please upgrade to an intermediate version (0.14-0.17) and then
|
||
to this version. (rangadi)
|
||
|
||
HADOOP-544. This issue introduces new classes JobID, TaskID and
|
||
TaskAttemptID, which should be used instead of their string counterparts.
|
||
Functions in JobClient, TaskReport, RunningJob, jobcontrol.Job and
|
||
TaskCompletionEvent that use string arguments are deprecated in favor
|
||
of the corresponding ones that use ID objects. Applications can use
|
||
xxxID.toString() and xxxID.forName() methods to convert/restore objects
|
||
to/from strings. (Enis Soztutar via ddas)
|
||
|
||
HADOOP-2188. RPC client sends a ping rather than throw timeouts.
|
||
RPC server does not throw away old RPCs. If clients and the server are on
|
||
different versions, they are not able to function well. In addition,
|
||
the property ipc.client.timeout is removed from the default hadoop
|
||
configuration. It also removes metrics RpcOpsDiscardedOPsNum. (hairong)
|
||
|
||
HADOOP-2181. This issue adds logging for input splits in Jobtracker log
|
||
and jobHistory log. Also adds web UI for viewing input splits in job UI
|
||
and history UI. (Amareshwari Sriramadasu via ddas)
|
||
|
||
HADOOP-3226. Run combiners multiple times over map outputs as they
|
||
are merged in both the map and the reduce tasks. (cdouglas via omalley)
|
||
|
||
HADOOP-3329. DatanodeDescriptor objects should not be stored in the
|
||
fsimage. (dhruba)
|
||
|
||
HADOOP-2656. The Block object has a generation stamp inside it.
|
||
Existing blocks get a generation stamp of 0. This is needed to support
|
||
appends. (dhruba)
|
||
|
||
HADOOP-3390. Removed deprecated ClientProtocol.abandonFileInProgress().
|
||
(Tsz Wo (Nicholas), SZE via rangadi)
|
||
|
||
HADOOP-3405. Made some map/reduce internal classes non-public:
|
||
MapTaskStatus, ReduceTaskStatus, JobSubmissionProtocol,
|
||
CompletedJobStatusStore. (enis via omalley)
|
||
|
||
HADOOP-3265. Removed deprecated API getFileCacheHints().
|
||
(Lohit Vijayarenu via rangadi)
|
||
|
||
HADOOP-3310. The namenode instructs the primary datanode to do lease
|
||
recovery. The block gets a new generation stamp.
|
||
(Tsz Wo (Nicholas), SZE via dhruba)
|
||
|
||
HADOOP-2909. Improve IPC idle connection management. Property
|
||
ipc.client.maxidletime is removed from the default configuration,
|
||
instead it is defined as twice the ipc.client.connection.maxidletime.
|
||
A connection with outstanding requests won't be treated as idle.
|
||
(hairong)
|
||
|
||
HADOOP-3459. Change in the output format of dfs -ls to more closely match
|
||
/bin/ls. New format is: perm repl owner group size date name
|
||
(Mukund Madhugiri via omalley)
|
||
|
||
HADOOP-3113. An fsync invoked on a HDFS file really really
|
||
persists data! The datanode moves blocks in the tmp directory to
|
||
the real block directory on a datanode-restart. (dhruba)
|
||
|
||
HADOOP-3452. Change fsck to return non-zero status for a corrupt
|
||
FileSystem. (lohit vijayarenu via cdouglas)
|
||
|
||
HADOOP-3193. Include the address of the client that found the corrupted
|
||
block in the log. Also include a CorruptedBlocks metric to track the size
|
||
of the corrupted block map. (cdouglas)
|
||
|
||
HADOOP-3512. Separate out the tools into a tools jar. (omalley)
|
||
|
||
HADOOP-3598. Ensure that temporary task-output directories are not created
|
||
if they are not necessary e.g. for Maps with no side-effect files.
|
||
(acmurthy)
|
||
|
||
HADOOP-3665. Modify WritableComparator so that it only creates instances
|
||
of the keytype if the type does not define a WritableComparator. Calling
|
||
the superclass compare will throw a NullPointerException. Also define
|
||
a RawComparator for NullWritable and permit it to be written as a key
|
||
to SequenceFiles. (cdouglas)
|
||
|
||
HADOOP-3673. Avoid deadlock caused by DataNode RPC recoverBlock().
|
||
(Tsz Wo (Nicholas), SZE via rangadi)
|
||
|
||
NEW FEATURES
|
||
|
||
HADOOP-3074. Provides a UrlStreamHandler for DFS and other FS,
|
||
relying on FileSystem (taton)
|
||
|
||
HADOOP-2585. Name-node imports namespace data from a recent checkpoint
|
||
accessible via a NFS mount. (shv)
|
||
|
||
HADOOP-3061. Writable types for doubles and bytes. (Andrzej
|
||
Bialecki via omalley)
|
||
|
||
HADOOP-2857. Allow libhdfs to set jvm options. (Craig Macdonald
|
||
via omalley)
|
||
|
||
HADOOP-3317. Add default port for HDFS namenode. The port in
|
||
"hdfs:" URIs now defaults to 8020, so that one may simply use URIs
|
||
of the form "hdfs://example.com/dir/file". (cutting)
|
||
|
||
HADOOP-2019. Adds support for .tar, .tgz and .tar.gz files in
|
||
DistributedCache (Amareshwari Sriramadasu via ddas)
|
||
|
||
HADOOP-3058. Add FSNamesystem status metrics.
|
||
(Lohit Vijayarenu via rangadi)
|
||
|
||
HADOOP-1915. Allow users to specify counters via strings instead
|
||
of enumerations. (tomwhite via omalley)
|
||
|
||
HADOOP-2065. Delay invalidating corrupt replicas of a block until it
|
||
is removed from under replicated state. If all replicas are found to
|
||
be corrupt, retain all copies and mark the block as corrupt.
|
||
(Lohit Vijayarenu via rangadi)
|
||
|
||
HADOOP-3221. Adds org.apache.hadoop.mapred.lib.NLineInputFormat, which
|
||
splits files into splits each of N lines. N can be specified by
|
||
configuration property "mapred.line.input.format.linespermap", which
|
||
defaults to 1. (Amareshwari Sriramadasu via ddas)
|
||
|
||
HADOOP-3336. Direct a subset of annotated FSNamesystem calls for audit
|
||
logging. (cdouglas)
|
||
|
||
HADOOP-3400. A new API FileSystem.deleteOnExit() that facilitates
|
||
handling of temporary files in HDFS. (dhruba)
|
||
|
||
HADOOP-4. Add fuse-dfs to contrib, permitting one to mount an
|
||
HDFS filesystem on systems that support FUSE, e.g., Linux.
|
||
(Pete Wyckoff via cutting)
|
||
|
||
HADOOP-3246. Add FTPFileSystem. (Ankur Goel via cutting)
|
||
|
||
HADOOP-3250. Extend FileSystem API to allow appending to files.
|
||
(Tsz Wo (Nicholas), SZE via cdouglas)
|
||
|
||
HADOOP-3177. Implement Syncable interface for FileSystem.
|
||
(Tsz Wo (Nicholas), SZE via dhruba)
|
||
|
||
HADOOP-1328. Implement user counters in streaming. (tomwhite via
|
||
omalley)
|
||
|
||
HADOOP-3187. Quotas for namespace management. (Hairong Kuang via ddas)
|
||
|
||
HADOOP-3307. Support for Archives in Hadoop. (Mahadev Konar via ddas)
|
||
|
||
HADOOP-3460. Add SequenceFileAsBinaryOutputFormat to permit direct
|
||
writes of serialized data. (Koji Noguchi via cdouglas)
|
||
|
||
HADOOP-3230. Add ability to get counter values from command
|
||
line. (tomwhite via omalley)
|
||
|
||
HADOOP-930. Add support for native S3 files. (tomwhite via cutting)
|
||
|
||
HADOOP-3502. Quota API needs documentation in Forrest. (hairong)
|
||
|
||
HADOOP-3413. Allow SequenceFile.Reader to use serialization
|
||
framework. (tomwhite via omalley)
|
||
|
||
HADOOP-3541. Import of the namespace from a checkpoint documented
|
||
in hadoop user guide. (shv)
|
||
|
||
IMPROVEMENTS
|
||
|
||
HADOOP-3677. Simplify generation stamp upgrade by making it a
|
||
local upgrade on datanodes. Deleted distributed upgrade.
|
||
(rangadi)
|
||
|
||
HADOOP-2928. Remove deprecated FileSystem.getContentLength().
|
||
(Lohit Vijayarenu via rangadi)
|
||
|
||
HADOOP-3130. Make the connect timeout smaller for getFile.
|
||
(Amar Ramesh Kamat via ddas)
|
||
|
||
HADOOP-3160. Remove deprecated exists() from ClientProtocol and
|
||
FSNamesystem. (Lohit Vijayarenu via rangadi)
|
||
|
||
HADOOP-2910. Throttle IPC Clients during bursts of requests or
|
||
server slowdown. Clients retry connection for up to 15 minutes
|
||
when socket connection times out. (hairong)
|
||
|
||
HADOOP-3295. Allow TextOutputFormat to use configurable separators.
|
||
(Zheng Shao via cdouglas).
|
||
|
||
HADOOP-3308. Improve QuickSort by excluding values equal to the pivot from the
|
||
partition. (cdouglas)
|
||
|
||
HADOOP-2461. Trim property names in configuration.
|
||
(Tsz Wo (Nicholas), SZE via shv)
|
||
|
||
HADOOP-2799. Deprecate o.a.h.io.Closeable in favor of java.io.Closeable.
|
||
(Tsz Wo (Nicholas), SZE via cdouglas)
|
||
|
||
HADOOP-3345. Enhance the hudson-test-patch target to cleanup messages,
|
||
fix minor defects, and add eclipse plugin and python unit tests. (nigel)
|
||
|
||
HADOOP-3144. Improve robustness of LineRecordReader by defining a maximum
|
||
line length (mapred.linerecordreader.maxlength), thereby avoiding reading
|
||
too far into the following split. (Zheng Shao via cdouglas)
|
||
|
||
HADOOP-3334. Move lease handling from FSNamesystem into a separate class.
|
||
(Tsz Wo (Nicholas), SZE via rangadi)
|
||
|
||
HADOOP-3332. Reduces the amount of logging in Reducer's shuffle phase.
|
||
(Devaraj Das)
|
||
|
||
HADOOP-3355. Enhances Configuration class to accept hex numbers for getInt
|
||
and getLong. (Amareshwari Sriramadasu via ddas)
|
||
|
||
HADOOP-3350. Add an argument to distcp to permit the user to limit the
|
||
number of maps. (cdouglas)
|
||
|
||
HADOOP-3013. Add corrupt block reporting to fsck.
|
||
(lohit vijayarenu via cdouglas)
|
||
|
||
HADOOP-3377. Remove TaskRunner::replaceAll and replace with equivalent
|
||
String::replace. (Brice Arnould via cdouglas)
|
||
|
||
HADOOP-3398. Minor improvement to a utility function that participates
|
||
in backoff calculation. (cdouglas)
|
||
|
||
HADOOP-3381. Clear references when directories are deleted so that
|
||
the effect of memory leaks is not multiplied. (rangadi)
|
||
|
||
HADOOP-2867. Adds the task's CWD to its LD_LIBRARY_PATH.
|
||
(Amareshwari Sriramadasu via ddas)
|
||
|
||
HADOOP-3232. DU class runs the 'du' command in a separate thread so
|
||
that it does not block user. DataNode misses heartbeats in large
|
||
nodes otherwise. (Johan Oskarsson via rangadi)
|
||
|
||
HADOOP-3035. During block transfers between datanodes, the receiving
|
||
datanode can now report corrupt replicas received from the source node to
|
||
the namenode. (Lohit Vijayarenu via rangadi)
|
||
|
||
HADOOP-3434. Retain the cause of the bind failure in Server::bind.
|
||
(Steve Loughran via cdouglas)
|
||
|
||
HADOOP-3429. Increases the size of the buffers used for the communication
|
||
for Streaming jobs. (Amareshwari Sriramadasu via ddas)
|
||
|
||
HADOOP-3486. Change default for initial block report to 0 seconds
|
||
and document it. (Sanjay Radia via omalley)
|
||
|
||
HADOOP-3448. Improve the text in the assertion making sure the
|
||
layout versions are consistent in the data node. (Steve Loughran
|
||
via omalley)
|
||
|
||
HADOOP-2095. Improve the Map-Reduce shuffle/merge by cutting down
|
||
buffer-copies; changed intermediate sort/merge to use the new IFile format
|
||
rather than SequenceFiles and compression of map-outputs is now
|
||
implemented by compressing the entire file rather than SequenceFile
|
||
compression. Shuffle also has been changed to use a simple byte-buffer
|
||
manager rather than the InMemoryFileSystem.
|
||
Configuration changes to hadoop-default.xml:
|
||
deprecated mapred.map.output.compression.type
|
||
(acmurthy)
|
||
|
||
HADOOP-236. JobTracker now refuses connections from a task tracker with a
|
||
different version number. (Sharad Agarwal via ddas)
|
||
|
||
HADOOP-3427. Improves the shuffle scheduler. It now waits for notifications
|
||
from shuffle threads when it has scheduled enough, before scheduling more.
|
||
(ddas)
|
||
|
||
HADOOP-2393. Moves the handling of dir deletions in the tasktracker to
|
||
a separate thread. (Amareshwari Sriramadasu via ddas)
|
||
|
||
HADOOP-3501. Deprecate InMemoryFileSystem. (cutting via omalley)
|
||
|
||
HADOOP-3366. Stall the shuffle while in-memory merge is in progress.
|
||
(acmurthy)
|
||
|
||
HADOOP-2916. Refactor src structure, but leave package structure alone.
|
||
(Raghu Angadi via mukund)
|
||
|
||
HADOOP-3492. Add forrest documentation for user archives.
|
||
(Mahadev Konar via hairong)
|
||
|
||
HADOOP-3467. Improve documentation for FileSystem::deleteOnExit.
|
||
(Tsz Wo (Nicholas), SZE via cdouglas)
|
||
|
||
HADOOP-3379. Documents stream.non.zero.exit.status.is.failure for Streaming.
|
||
(Amareshwari Sriramadasu via ddas)
|
||
|
||
HADOOP-3096. Improves documentation about the Task Execution Environment in
|
||
the Map-Reduce tutorial. (Amareshwari Sriramadasu via ddas)
|
||
|
||
HADOOP-2984. Add forrest documentation for DistCp. (cdouglas)
|
||
|
||
HADOOP-3406. Add forrest documentation for Profiling.
|
||
(Amareshwari Sriramadasu via ddas)
|
||
|
||
HADOOP-2762. Add forrest documentation for controls of memory limits on
|
||
hadoop daemons and Map-Reduce tasks. (Amareshwari Sriramadasu via ddas)
|
||
|
||
HADOOP-3535. Fix documentation and name of IOUtils.close to
|
||
reflect that it should only be used in cleanup contexts. (omalley)
|
||
|
||
HADOOP-3593. Updates the mapred tutorial. (ddas)
|
||
|
||
HADOOP-3547. Documents the way in which native libraries can be distributed
|
||
via the DistributedCache. (Amareshwari Sriramadasu via ddas)
|
||
|
||
HADOOP-3606. Updates the Streaming doc. (Amareshwari Sriramadasu via ddas)
|
||
|
||
HADOOP-3532. Add jdiff reports to the build scripts. (omalley)
|
||
|
||
HADOOP-3100. Develop tests to test the DFS command line interface. (mukund)
|
||
|
||
HADOOP-3688. Fix up HDFS docs. (Robert Chansler via hairong)
|
||
|
||
OPTIMIZATIONS
|
||
|
||
HADOOP-3274. The default constructor of BytesWritable creates empty
|
||
byte array. (Tsz Wo (Nicholas), SZE via shv)
|
||
|
||
HADOOP-3272. Remove redundant copy of Block object in BlocksMap.
|
||
(Lohit Vijayarenu via shv)
|
||
|
||
HADOOP-3164. Reduce DataNode CPU usage by using FileChannel.transferTo().
|
||
On Linux DataNode takes 5 times less CPU while serving data. Results may
|
||
vary on other platforms. (rangadi)
|
||
|
||
HADOOP-3248. Optimization of saveFSImage. (Dhruba via shv)
|
||
|
||
HADOOP-3297. Fetch more task completion events from the job
|
||
tracker and task tracker. (ddas via omalley)
|
||
|
||
HADOOP-3364. Faster image and log edits loading. (shv)
|
||
|
||
HADOOP-3369. Fast block processing during name-node startup. (shv)
|
||
|
||
HADOOP-1702. Reduce buffer copies when data is written to DFS.
|
||
DataNodes take 30% less CPU while writing data. (rangadi)
|
||
|
||
HADOOP-3095. Speed up split generation in the FileInputSplit,
|
||
especially for non-HDFS file systems. Deprecates
|
||
InputFormat.validateInput. (tomwhite via omalley)
|
||
|
||
HADOOP-3552. Add forrest documentation for Hadoop commands.
|
||
(Sharad Agarwal via cdouglas)
|
||
|
||
BUG FIXES
|
||
|
||
HADOOP-2905. 'fsck -move' triggers NPE in NameNode.
|
||
(Lohit Vijayarenu via rangadi)
|
||
|
||
Increment ClientProtocol.versionID missed by HADOOP-2585. (shv)
|
||
|
||
HADOOP-3254. Restructure internal namenode methods that process
|
||
heartbeats to use well-defined BlockCommand object(s) instead of
|
||
using the base java Object. (Tsz Wo (Nicholas), SZE via dhruba)
|
||
|
||
HADOOP-3176. Change lease record when an open-for-write file
|
||
gets renamed. (dhruba)
|
||
|
||
HADOOP-3269. Fix a case when namenode fails to restart
|
||
while processing a lease record. (Tsz Wo (Nicholas), SZE via dhruba)
|
||
|
||
HADOOP-3282. Port issues in TestCheckpoint resolved. (shv)
|
||
|
||
HADOOP-3268. file:// URLs issue in TestUrlStreamHandler under Windows.
|
||
(taton)
|
||
|
||
HADOOP-3127. Deleting files in trash should really remove them.
|
||
(Brice Arnould via omalley)
|
||
|
||
HADOOP-3300. Fix locking of explicit locks in NetworkTopology.
|
||
(tomwhite via omalley)
|
||
|
||
HADOOP-3270. Constant DatanodeCommands are stored in static final
|
||
immutable variables for better code clarity.
|
||
(Tsz Wo (Nicholas), SZE via dhruba)
|
||
|
||
HADOOP-2793. Fix broken links for worst performing shuffle tasks in
|
||
the job history page. (Amareshwari Sriramadasu via ddas)
|
||
|
||
HADOOP-3313. Avoid unnecessary calls to System.currentTimeMillis
|
||
in RPC::Invoker. (cdouglas)
|
||
|
||
HADOOP-3318. Recognize "Darwin" as an alias for "Mac OS X" to
|
||
support Soylatte. (Sam Pullara via omalley)
|
||
|
||
HADOOP-3301. Fix misleading error message when S3 URI hostname
|
||
contains an underscore. (tomwhite via omalley)
|
||
|
||
HADOOP-3338. Fix Eclipse plugin to compile after HADOOP-544 was
|
||
committed. Updated all references to use the new JobID representation.
|
||
(taton via nigel)
|
||
|
||
HADOOP-3337. Loading FSEditLog was broken by HADOOP-3283 since it
changed Writable serialization of DatanodeInfo. This patch handles it.
(Tsz Wo (Nicholas), SZE via rangadi)

HADOOP-3101. Prevent JobClient from throwing an exception when printing
usage. (Edward J. Yoon via cdouglas)

HADOOP-3119. Update javadoc for Text::getBytes to better describe its
behavior. (Tim Nelson via cdouglas)

HADOOP-2294. Fix documentation in libhdfs to refer to the correct free
function. (Craig Macdonald via cdouglas)

HADOOP-3335. Prevent the libhdfs build from deleting the wrong
files on make clean. (cutting via omalley)

HADOOP-2930. Make {start,stop}-balancer.sh work even if hadoop-daemon.sh
is not in the PATH. (Spiros Papadimitriou via hairong)

HADOOP-3085. Catch Exception in metrics util classes to ensure that
misconfigured metrics don't prevent others from updating. (cdouglas)

HADOOP-3299. CompositeInputFormat should configure the sub-input
formats. (cdouglas via omalley)

HADOOP-3309. Lower io.sort.mb and fs.inmemory.size.mb for MiniMRDFSSort
unit test so it passes on Windows. (lohit vijayarenu via cdouglas)

HADOOP-3348. TestUrlStreamHandler should set URLStreamFactory after
DataNodes are initialized. (Lohit Vijayarenu via rangadi)

HADOOP-3371. Ignore InstanceAlreadyExistsException from
MBeanUtil::registerMBean. (lohit vijayarenu via cdouglas)

HADOOP-3349. A file rename was incorrectly changing the name inside a
lease record. (Tsz Wo (Nicholas), SZE via dhruba)

HADOOP-3365. Removes an unnecessary copy of the key from SegmentDescriptor
to MergeQueue. (Devaraj Das)

HADOOP-3388. Fix for TestDatanodeBlockScanner to handle blocks with
generation stamps in them. (dhruba)

HADOOP-3203. Fixes TaskTracker::localizeJob to pass correct file sizes
for the jarfile and the jobfile. (Amareshwari Sriramadasu via ddas)

HADOOP-3391. Fix a findbugs warning introduced by HADOOP-3248 (rangadi)

HADOOP-3393. Fix datanode shutdown to call DataBlockScanner::shutdown and
close its log, even if the scanner thread is not running. (lohit vijayarenu
via cdouglas)

HADOOP-3399. A debug message was logged at info level. (rangadi)

HADOOP-3396. TestDatanodeBlockScanner occasionally fails.
(Lohit Vijayarenu via rangadi)

HADOOP-3339. Some of the failures on 3rd datanode in DFS write pipeline
are not detected properly. This could lead to hard failure of client's
write operation. (rangadi)

HADOOP-3409. Namenode should save the root inode into fsimage. (hairong)

HADOOP-3296. Fix task cache to work for more than two levels in the cache
hierarchy. This also adds a new counter to track cache hits at levels
greater than two. (Amar Kamat via cdouglas)

HADOOP-3375. Lease paths were sometimes not removed from
LeaseManager.sortedLeasesByPath. (Tsz Wo (Nicholas), SZE via dhruba)

HADOOP-3424. Values returned by getPartition should be checked to
make sure they are in the range 0 to #reduces - 1 (cdouglas via
omalley)

HADOOP-3408. Change FSNamesystem to send its metrics as integers to
accommodate collectors that don't support long values. (lohit vijayarenu
via cdouglas)

HADOOP-3403. Fixes a problem in the JobTracker to do with handling of lost
tasktrackers. (Arun Murthy via ddas)

HADOOP-1318. Completed maps are not failed if the number of reducers is
zero. (Amareshwari Sriramadasu via ddas).

HADOOP-3351. Fixes the history viewer tool to not do huge StringBuffer
allocations. (Amareshwari Sriramadasu via ddas)

HADOOP-3419. Fixes TestFsck to wait for updates to happen before
checking results to make the test more reliable. (Lohit Vijaya
Renu via omalley)

HADOOP-3259. Makes failure to read system properties due to a
security manager non-fatal. (Edward Yoon via omalley)

HADOOP-3451. Update libhdfs to use FileSystem::getFileBlockLocations
instead of removed getFileCacheHints. (lohit vijayarenu via cdouglas)

HADOOP-3401. Update FileBench to set the new
"mapred.work.output.dir" property to work post-3041. (cdouglas via omalley)

HADOOP-2669. DFSClient locks pendingCreates appropriately. (dhruba)

HADOOP-3410. Fix KFS implementation to return correct file
modification time. (Sriram Rao via cutting)

HADOOP-3340. Fix DFS metrics for BlocksReplicated, HeartbeatsNum, and
BlockReportsAverageTime. (lohit vijayarenu via cdouglas)

HADOOP-3435. Remove the assumption in the scripts that bash is at
/bin/bash and fix the test patch to require bash instead of sh.
(Brice Arnould via omalley)

HADOOP-3471. Fix spurious errors from TestIndexedSort and add additional
logging to let failures be reproducible. (cdouglas)

HADOOP-3443. Avoid copying map output across partitions when renaming a
single spill. (omalley via cdouglas)

HADOOP-3454. Fix Text::find to search only valid byte ranges. (Chad Whipkey
via cdouglas)

HADOOP-3417. Removes the static configuration variable,
commandLineConfig from JobClient. Moves the cli parsing from
JobShell to GenericOptionsParser. Thus removes the class
org.apache.hadoop.mapred.JobShell. (Amareshwari Sriramadasu via
ddas)

HADOOP-2132. Only RUNNING/PREP jobs can be killed. (Jothi Padmanabhan
via ddas)

HADOOP-3476. Code cleanup in fuse-dfs.
(Peter Wyckoff via dhruba)

HADOOP-2427. Ensure that the cwd of completed tasks is cleaned-up
correctly on task-completion. (Amareshwari Sri Ramadasu via acmurthy)

HADOOP-2565. Remove DFSPath cache of FileStatus.
(Tsz Wo (Nicholas), SZE via hairong)

HADOOP-3326. Cleanup the local-fs and in-memory merge in the ReduceTask by
spawning only one thread each for the on-disk and in-memory merge.
(Sharad Agarwal via acmurthy)

HADOOP-3493. Fix TestStreamingFailure to use FileUtil.fullyDelete to
ensure correct cleanup. (Lohit Vijayarenu via acmurthy)

HADOOP-3455. Fix NPE in ipc.Client in case of connection failure and
improve its synchronization. (hairong)

HADOOP-3240. Fix a testcase to not create files in the current directory.
Instead the file is created in the test directory (Mahadev Konar via ddas)

HADOOP-3496. Fix failure in TestHarFileSystem.testArchives due to change
in HADOOP-3095. (tomwhite)

HADOOP-3135. Get the system directory from the JobTracker instead of from
the conf. (Subramaniam Krishnan via ddas)

HADOOP-3503. Fix a race condition when client and namenode start
simultaneous recovery of the same block. (dhruba & Tsz Wo
(Nicholas), SZE)

HADOOP-3440. Fixes DistributedCache to not create symlinks for paths which
don't have fragments even when createSymLink is true.
(Abhijit Bagri via ddas)

HADOOP-3463. Hadoop-daemons script should cd to $HADOOP_HOME. (omalley)

HADOOP-3489. Fix NPE in SafeModeMonitor. (Lohit Vijayarenu via shv)

HADOOP-3509. Fix NPE in FSNamesystem.close. (Tsz Wo (Nicholas), SZE via
shv)

HADOOP-3491. Name-node shutdown causes InterruptedException in
ResolutionMonitor. (Lohit Vijayarenu via shv)

HADOOP-3511. Fixes namenode image to not set the root's quota to an
invalid value when the quota was not saved in the image. (hairong)

HADOOP-3516. Ensure the JobClient in HadoopArchives is initialized
with a configuration. (Subramaniam Krishnan via omalley)

HADOOP-3513. Improve NNThroughputBenchmark log messages. (shv)

HADOOP-3519. Fix NPE in DFS FileSystem rename. (hairong via tomwhite)

HADOOP-3528. Metrics FilesCreated and files_deleted metrics
do not match. (Lohit via Mahadev)

HADOOP-3418. When a directory is deleted, any leases that point to files
in the subdirectory are removed. (Tsz Wo (Nicholas), SZE via dhruba)

HADOOP-3542. Disables the creation of _logs directory for the archives
directory. (Mahadev Konar via ddas)

HADOOP-3544. Fixes a documentation issue for hadoop archives.
(Mahadev Konar via ddas)

HADOOP-3517. Fixes a problem in the reducer due to which the last InMemory
merge may be missed. (Arun Murthy via ddas)

HADOOP-3548. Fixes build.xml to copy all *.jar files to the dist.
(Owen O'Malley via ddas)

HADOOP-3363. Fix unformatted storage detection in FSImage. (shv)

HADOOP-3560. Fixes a problem to do with split creation in archives.
(Mahadev Konar via ddas)

HADOOP-3545. Fixes an overflow problem in archives.
(Mahadev Konar via ddas)

HADOOP-3561. Prevent the trash from deleting its parent directories.
(cdouglas)

HADOOP-3575. Fix the clover ant target after package refactoring.
(Nigel Daley via cdouglas)

HADOOP-3539. Fix the tool path in the bin/hadoop script under
cygwin. (Tsz Wo (Nicholas), Sze via omalley)

HADOOP-3520. TestDFSUpgradeFromImage triggers a race condition in the
Upgrade Manager. Fixed. (dhruba)

HADOOP-3586. Provide deprecated, backwards compatible semantics for the
combiner to be run once and only once on each record. (cdouglas)

HADOOP-3533. Add deprecated methods to provide API compatibility
between 0.18 and 0.17. Remove the deprecated methods in trunk. (omalley)

HADOOP-3580. Fixes a problem to do with specifying a har as an input to
a job. (Mahadev Konar via ddas)

HADOOP-3333. Don't assign a task to a tasktracker that it failed to
execute earlier (used to happen in the case of lost tasktrackers where
the tasktracker would reinitialize and bind to a different port).
(Jothi Padmanabhan and Arun Murthy via ddas)

HADOOP-3534. Log IOExceptions that happen in closing the name
system when the NameNode shuts down. (Tsz Wo (Nicholas) Sze via omalley)

HADOOP-3546. TaskTracker re-initialization gets stuck in cleaning up.
(Amareshwari Sriramadasu via ddas)

HADOOP-3576. Fix NullPointerException when renaming a directory
to its subdirectory. (Tsz Wo (Nicholas), SZE via hairong)

HADOOP-3320. Fix NullPointerException in NetworkTopology.getDistance().
(hairong)

HADOOP-3569. KFS input stream read() now correctly reads 1 byte
instead of 4. (Sriram Rao via omalley)

HADOOP-3599. Fix JobConf::setCombineOnceOnly to modify the instance rather
than a parameter. (Owen O'Malley via cdouglas)

HADOOP-3590. Null pointer exception in JobTracker when the task tracker is
not yet resolved. (Amar Ramesh Kamat via ddas)

HADOOP-3603. Fix MapOutputCollector to spill when io.sort.spill.percent is
1.0 and to detect spills when emitted records write no data. (cdouglas)

HADOOP-3615. Set DatanodeProtocol.versionID to the correct value.
(Tsz Wo (Nicholas), SZE via cdouglas)

HADOOP-3559. Fix the libhdfs test script and config to work with the
current semantics. (lohit vijayarenu via cdouglas)

HADOOP-3480. Need to update Eclipse template to reflect current trunk.
(Brice Arnould via tomwhite)

HADOOP-3588. Fixed usability issues with archives. (mahadev)

HADOOP-3635. Uncaught exception in DataBlockScanner.
(Tsz Wo (Nicholas), SZE via hairong)

HADOOP-3639. Exception when closing DFSClient while multiple files are
open. (Benjamin Gufler via hairong)

HADOOP-3572. SetQuotas usage interface has some minor bugs. (hairong)

HADOOP-3649. Fix bug in removing blocks from the corrupted block map.
(Lohit Vijayarenu via shv)

HADOOP-3604. Work around a JVM synchronization problem observed while
retrieving the address of direct buffers from compression code by obtaining
a lock during this call. (Arun C Murthy via cdouglas)

HADOOP-3683. Fix dfs metrics to count file listings rather than files
listed. (lohit vijayarenu via cdouglas)

HADOOP-3597. Fix SortValidator to use filesystems other than the default as
input. Validation job still runs on default fs.
(Jothi Padmanabhan via cdouglas)

HADOOP-3693. Fix archives, distcp and native library documentation to
conform to style guidelines. (Amareshwari Sriramadasu via cdouglas)

HADOOP-3653. Fix test-patch target to properly account for Eclipse
classpath jars. (Brice Arnould via nigel)

HADOOP-3692. Fix documentation for Cluster setup and Quick start guides.
(Amareshwari Sriramadasu via ddas)

HADOOP-3691. Fix streaming and tutorial docs. (Jothi Padmanabhan via ddas)

HADOOP-3630. Fix NullPointerException in CompositeRecordReader from empty
sources (cdouglas)

HADOOP-3706. Fix a ClassLoader issue in the mapred.join Parser that
prevents it from loading user-specified InputFormats.
(Jingkei Ly via cdouglas)

HADOOP-3718. Fix KFSOutputStream::write(int) to output a byte instead of
an int, per the OutputStream contract. (Sriram Rao via cdouglas)

HADOOP-3647. Add debug logs to help track down a very occasional,
hard-to-reproduce, bug in shuffle/merge on the reducer. (acmurthy)

HADOOP-3716. Prevent listStatus in KosmosFileSystem from returning
null for valid, empty directories. (Sriram Rao via cdouglas)

HADOOP-3752. Fix audit logging to record rename events. (cdouglas)

HADOOP-3737. Fix CompressedWritable to call Deflater::end to release
compressor memory. (Grant Glouser via cdouglas)

HADOOP-3670. Fixes JobTracker to clear out split bytes when no longer
required. (Amareshwari Sriramadasu via ddas)

HADOOP-3755. Update gridmix to work with HOD 0.4 (Runping Qi via cdouglas)

HADOOP-3743. Fix -libjars, -files, -archives options to work even if
user code does not implement tools. (Amareshwari Sriramadasu via mahadev)

HADOOP-3774. Fix typos in shell output. (Tsz Wo (Nicholas), SZE via
cdouglas)

HADOOP-3762. Fixed FileSystem cache to work with the default port. (cutting
via omalley)

HADOOP-3798. Fix tests compilation. (Mukund Madhugiri via omalley)

HADOOP-3794. Return modification time instead of zero for KosmosFileSystem.
(Sriram Rao via cdouglas)

HADOOP-3806. Remove debug statement to stdout from QuickSort. (cdouglas)

HADOOP-3776. Fix NPE at NameNode when datanode reports a block after it is
deleted at NameNode. (rangadi)

HADOOP-3537. Disallow adding a datanode to a network topology when its
network location is not resolved. (hairong)

HADOOP-3571. Fix bug in block removal used in lease recovery. (shv)

HADOOP-3645. MetricsTimeVaryingRate returns wrong value for
metric_avg_time. (Lohit Vijayarenu via hairong)

HADOOP-3521. Reverted the missing cast to float for sending Counters' values
to Hadoop metrics which was removed by HADOOP-544. (acmurthy)

HADOOP-3820. Fixes two problems in the gridmix-env - a syntax error, and a
wrong definition of USE_REAL_DATASET by default. (Arun Murthy via ddas)

HADOOP-3724. Fixes two problems related to storing and recovering lease
in the fsimage. (dhruba)

HADOOP-3827. Fixed compression of empty map-outputs. (acmurthy)

HADOOP-3865. Remove reference to FSNamesystem from metrics preventing
garbage collection. (Lohit Vijayarenu via cdouglas)

HADOOP-3884. Fix so that Eclipse plugin builds against recent
Eclipse releases. (cutting)

HADOOP-3837. Streaming jobs report progress status. (dhruba)

HADOOP-3897. Fix a NPE in secondary namenode. (Lohit Vijayarenu via
cdouglas)

HADOOP-3901. Fix bin/hadoop to correctly set classpath under cygwin.
(Tsz Wo (Nicholas) Sze via omalley)

HADOOP-3947. Fix a problem in tasktracker reinitialization.
(Amareshwari Sriramadasu via ddas)

Release 0.17.3 - Unreleased

IMPROVEMENTS

HADOOP-4164. Chinese translation of the documentation. (Xuebing Yan via
omalley)

BUG FIXES

HADOOP-4277. Checksum verification was mistakenly disabled for
LocalFileSystem. (Raghu Angadi)

HADOOP-4271. Checksum input stream can sometimes return invalid
data to the user. (Ning Li via rangadi)

HADOOP-4318. DistCp should use absolute paths for cleanup. (szetszwo)

HADOOP-4326. ChecksumFileSystem does not override create(...) correctly.
(szetszwo)

Release 0.17.2 - 2008-08-11

BUG FIXES

HADOOP-3678. Avoid spurious exceptions logged at DataNode when clients
read from DFS. (rangadi)

HADOOP-3707. NameNode keeps a count of number of blocks scheduled
to be written to a datanode and uses it to avoid allocating more
blocks than a datanode can hold. (rangadi)

HADOOP-3760. Fix a bug with HDFS file close() mistakenly introduced
by HADOOP-3681. (Lohit Vijayarenu via rangadi)

HADOOP-3681. DFSClient can get into an infinite loop while closing
a file if there are some errors. (Lohit Vijayarenu via rangadi)

HADOOP-3002. Hold off block removal while in safe mode. (shv)

HADOOP-3685. Unbalanced replication target. (hairong)

HADOOP-3758. Shutdown datanode on version mismatch instead of retrying
continuously, preventing excessive logging at the namenode.
(lohit vijayarenu via cdouglas)

HADOOP-3633. Correct exception handling in DataXceiveServer, and throttle
the number of xceiver threads in a data-node. (shv)

HADOOP-3370. Ensure that the TaskTracker.runningJobs data-structure is
correctly cleaned-up on task completion. (Zheng Shao via acmurthy)

HADOOP-3813. Fix task-output clean-up on HDFS to use the recursive
FileSystem.delete rather than the FileUtil.fullyDelete. (Amareshwari
Sri Ramadasu via acmurthy)

HADOOP-3859. Allow the maximum number of xceivers in the data node to
be configurable. (Johan Oskarsson via omalley)

HADOOP-3931. Fix corner case in the map-side sort that causes some values
to be counted as too large and cause premature spills to disk. Some values
will also bypass the combiner incorrectly. (cdouglas via omalley)

Release 0.17.1 - 2008-06-23

INCOMPATIBLE CHANGES

HADOOP-3565. Fix the Java serialization, which is not enabled by
default, to clear the state of the serializer between objects.
(tomwhite via omalley)

IMPROVEMENTS

HADOOP-3522. Improve documentation on reduce pointing out that
input keys and values will be reused. (omalley)

HADOOP-3487. Balancer uses thread pools for managing its threads;
therefore provides better resource management. (hairong)

BUG FIXES

HADOOP-2159. Namenode stuck in safemode. The counter blockSafe should
not be decremented for invalid blocks. (hairong)

HADOOP-3472. MapFile.Reader getClosest() function returns incorrect results
when before is true (Todd Lipcon via Stack)

HADOOP-3442. Limit recursion depth on the stack for QuickSort to prevent
StackOverflowErrors. To avoid O(n*n) cases, when partitioning depth exceeds
a multiple of log(n), change to HeapSort. (cdouglas)

HADOOP-3477. Fix build to not package contrib/*/bin twice in
distributions. (Adam Heath via cutting)

HADOOP-3475. Fix MapTask to correctly size the accounting allocation of
io.sort.mb. (cdouglas)

HADOOP-3550. Fix the serialization data structures in MapTask where the
value lengths are incorrectly calculated. (cdouglas)

HADOOP-3526. Fix contrib/data_join framework by cloning values retained
in the reduce. (Spyros Blanas via cdouglas)

HADOOP-1979. Speed up fsck by adding a buffered stream. (Lohit
Vijaya Renu via omalley)

Release 0.17.0 - 2008-05-18

INCOMPATIBLE CHANGES

HADOOP-2786. Move hbase out of hadoop core

HADOOP-2345. New HDFS transactions to support appending
to files. Disk layout version changed from -11 to -12. (dhruba)

HADOOP-2192. Error messages from "dfs mv" command improved.
(Mahadev Konar via dhruba)

HADOOP-1902. "dfs du" command without any arguments operates on the
current working directory. (Mahadev Konar via dhruba)

HADOOP-2873. Fixed bad disk format introduced by HADOOP-2345.
Disk layout version changed from -12 to -13. See changelist 630992
(dhruba)

HADOOP-1985. This addresses rack-awareness for Map tasks and for
HDFS in a uniform way. (ddas)

HADOOP-1986. Add support for a general serialization mechanism for
Map Reduce. (tomwhite)

HADOOP-771. FileSystem.delete() takes an explicit parameter that
specifies whether a recursive delete is intended.
(Mahadev Konar via dhruba)

HADOOP-2470. Remove getContentLength(String), open(String, long, long)
and isDir(String) from ClientProtocol. ClientProtocol version changed
from 26 to 27. (Tsz Wo (Nicholas), SZE via cdouglas)

HADOOP-2822. Remove deprecated code for classes InputFormatBase and
PhasedFileSystem. (Amareshwari Sriramadasu via enis)

HADOOP-2116. Changes the layout of the task execution directory.
(Amareshwari Sriramadasu via ddas)

HADOOP-2828. The following deprecated methods in Configuration.java
have been removed
  getObject(String name)
  setObject(String name, Object value)
  get(String name, Object defaultValue)
  set(String name, Object value)
  Iterator entries()
(Amareshwari Sriramadasu via ddas)

HADOOP-2824. Removes one deprecated constructor from MiniMRCluster.
(Amareshwari Sriramadasu via ddas)

HADOOP-2823. Removes deprecated methods getColumn(), getLine() from
org.apache.hadoop.record.compiler.generated.SimpleCharStream.
(Amareshwari Sriramadasu via ddas)

HADOOP-3060. Removes one unused constructor argument from MiniMRCluster.
(Amareshwari Sriramadasu via ddas)

HADOOP-2854. Remove deprecated o.a.h.ipc.Server::getUserInfo().
(lohit vijayarenu via cdouglas)

HADOOP-2563. Remove deprecated FileSystem::listPaths.
(lohit vijayarenu via cdouglas)

HADOOP-2818. Remove deprecated methods in Counters.
(Amareshwari Sriramadasu via tomwhite)

HADOOP-2831. Remove deprecated o.a.h.dfs.INode::getAbsoluteName()
(lohit vijayarenu via cdouglas)

HADOOP-2839. Remove deprecated FileSystem::globPaths.
(lohit vijayarenu via cdouglas)

HADOOP-2634. Deprecate ClientProtocol::exists.
(lohit vijayarenu via cdouglas)

HADOOP-2410. Make EC2 cluster nodes more independent of each other.
Multiple concurrent EC2 clusters are now supported, and nodes may be
added to a cluster on the fly with new nodes starting in the same EC2
availability zone as the cluster. Ganglia monitoring and large
instance sizes have also been added. (Chris K Wensel via tomwhite)

HADOOP-2826. Deprecated FileSplit.getFile(), LineRecordReader.readLine().
(Amareshwari Sriramadasu via ddas)

HADOOP-3239. getFileInfo() returns null for non-existing files instead
of throwing FileNotFoundException. (Lohit Vijayarenu via shv)

HADOOP-3266. Removed HOD changes from CHANGES.txt, as they are now inside
src/contrib/hod (Hemanth Yamijala via ddas)

HADOOP-3280. Separate the configuration of the virtual memory size
(mapred.child.ulimit) from the jvm heap size, so that 64 bit
streaming applications are supported even when running with 32 bit
jvms. (acmurthy via omalley)

NEW FEATURES

HADOOP-1398. Add HBase in-memory block cache. (tomwhite)

HADOOP-2178. Job History on DFS. (Amareshwari Sri Ramadasu via ddas)

HADOOP-2063. A new parameter to dfs -get command to fetch a file
even if it is corrupted. (Tsz Wo (Nicholas), SZE via dhruba)

HADOOP-2219. A new command "dfs -count" that counts the number of
files and directories. (Tsz Wo (Nicholas), SZE via dhruba)

HADOOP-2906. Add an OutputFormat capable of using keys, values, and
config params to map records to different output files.
(Runping Qi via cdouglas)

HADOOP-2346. Utilities to support timeout while writing to sockets.
DFSClient and DataNode sockets have 10min write timeout. (rangadi)

HADOOP-2951. Add a contrib module that provides a utility to
build or update Lucene indexes using Map/Reduce. (Ning Li via cutting)

HADOOP-1622. Allow multiple jar files for map reduce.
(Mahadev Konar via dhruba)

HADOOP-2055. Allows users to set PathFilter on the FileInputFormat.
(Alejandro Abdelnur via ddas)

HADOOP-2551. More environment variables like HADOOP_NAMENODE_OPTS
for better control of HADOOP_OPTS for each component. (rangadi)

HADOOP-3001. Add job counters that measure the number of bytes
read and written to HDFS, S3, KFS, and local file systems. (omalley)

HADOOP-3048. A new Interface and a default implementation to convert
and restore serializations of objects to/from strings. (enis)

IMPROVEMENTS

HADOOP-2655. Copy on write for data and metadata files in the
presence of snapshots. Needed for supporting appends to HDFS
files. (dhruba)

HADOOP-1967. When a Path specifies the same scheme as the default
FileSystem but no authority, the default FileSystem's authority is
used. Also add warnings for old-format FileSystem names, accessor
methods for fs.default.name, and check for null authority in HDFS.
(cutting)

HADOOP-2895. Let the profiling string be configurable.
(Martin Traverso via cdouglas)

HADOOP-910. Enables Reduces to do merges for the on-disk map output files
in parallel with their copying. (Amar Kamat via ddas)

HADOOP-730. Use rename rather than copy for local renames. (cdouglas)

HADOOP-2810. Updated the Hadoop Core logo. (nigel)

HADOOP-2057. Streaming should optionally treat a non-zero exit status
of a child process as a failed task. (Rick Cox via tomwhite)

HADOOP-2765. Enables specifying ulimits for streaming/pipes tasks (ddas)

HADOOP-2888. Make gridmix scripts more readily configurable and amenable
to automated execution. (Mukund Madhugiri via cdouglas)

HADOOP-2908. A document that describes the DFS Shell command.
(Mahadev Konar via dhruba)

HADOOP-2981. Update README.txt to reflect the upcoming use of
cryptography. (omalley)

HADOOP-2804. Add support to publish CHANGES.txt as HTML when running
the Ant 'docs' target. (nigel)

HADOOP-2559. Change DFS block placement to allocate the first replica
locally, the second off-rack, and the third intra-rack from the
second. (lohit vijayarenu via cdouglas)

HADOOP-2939. Make the automated patch testing process an executable
Ant target, test-patch. (nigel)

HADOOP-2239. Add HsftpFileSystem to permit transferring files over ssl.
(cdouglas)

HADOOP-2886. Track individual RPC metrics.
(girish vaitheeswaran via dhruba)

HADOOP-2373. Improvement in safe-mode reporting. (shv)

HADOOP-3091. Modify FsShell command -put to accept multiple sources.
(Lohit Vijaya Renu via cdouglas)

HADOOP-3092. Show counter values from job -status command.
(Tom White via ddas)

HADOOP-1228. Ant task to generate Eclipse project files. (tomwhite)

HADOOP-3093. Adds Configuration.getStrings(name, default-value) and
the corresponding setStrings. (Amareshwari Sriramadasu via ddas)

HADOOP-3106. Adds documentation in forrest for debugging.
(Amareshwari Sriramadasu via ddas)

HADOOP-3099. Add an option to distcp to preserve user, group, and
permission information. (Tsz Wo (Nicholas), SZE via cdouglas)

HADOOP-2841. Unwrap AccessControlException and FileNotFoundException
from RemoteException for DFSClient. (shv)

HADOOP-3152. Make index interval configurable when using
MapFileOutputFormat for map-reduce job. (Rong-En Fan via cutting)

HADOOP-3143. Decrease number of slaves from 4 to 3 in TestMiniMRDFSSort,
as Hudson generates false negatives under the current load.
(Nigel Daley via cdouglas)

HADOOP-3174. Illustrative example for MultipleFileInputFormat. (Enis
Soztutar via acmurthy)

HADOOP-2993. Clarify the usage of JAVA_HOME in the Quick Start guide.
(acmurthy via nigel)

HADOOP-3124. Make DataNode socket write timeout configurable. (rangadi)

OPTIMIZATIONS

HADOOP-2790. Fixed inefficient method hasSpeculativeTask by removing
repetitive calls to get the current time and late checking to see if
we want speculation on at all. (omalley)

HADOOP-2758. Reduce buffer copies in DataNode when data is read from
HDFS, without negatively affecting read throughput. (rangadi)

HADOOP-2399. Input key and value to combiner and reducer is reused.
(Owen O'Malley via ddas).

HADOOP-2423. Code optimization in FSNamesystem.mkdirs.
(Tsz Wo (Nicholas), SZE via dhruba)

HADOOP-2606. ReplicationMonitor selects data-nodes to replicate directly
from needed replication blocks instead of looking up for the blocks for
each live data-node. (shv)

HADOOP-2148. Eliminate redundant data-node blockMap lookups. (shv)

HADOOP-2027. Return the number of bytes in each block in a file
via a single rpc to the namenode to speed up job planning.
(Lohit Vijaya Renu via omalley)

HADOOP-2902. Replace uses of "fs.default.name" with calls to the
accessor methods added in HADOOP-1967. (cutting)

HADOOP-2119. Optimize scheduling of jobs with large numbers of
tasks by replacing static arrays with lists of runnable tasks.
(Amar Kamat via omalley)

HADOOP-2919. Reduce the number of memory copies done during the
map output sorting. Also adds two config variables:
  io.sort.spill.percent - the percentage of io.sort.mb that should
    cause a spill (default 80%)
  io.sort.record.percent - the percentage of io.sort.mb that should
    hold key/value indexes (default 5%)
(cdouglas via omalley)

HADOOP-3140. Doesn't add a task in the commit queue if the task hadn't
generated any output. (Amar Kamat via ddas)

HADOOP-3168. Reduce the amount of logging in streaming to an
exponentially increasing number of records (up to 10,000
records/log). (Zheng Shao via omalley)

BUG FIXES

HADOOP-2195. '-mkdir' behaviour is now closer to Linux shell in case of
errors. (Mahadev Konar via rangadi)

HADOOP-2190. Bring behaviour of '-ls' and '-du' closer to Linux shell
commands in case of errors. (Mahadev Konar via rangadi)

HADOOP-2193. 'fs -rm' and 'fs -rmr' show error message when the target
file does not exist. (Mahadev Konar via rangadi)

HADOOP-2738. Text is not subclassable because set(Text) and compareTo(Object)
access the other instance's private members directly. (jimk)

HADOOP-2779. Remove the references to HBase in the build.xml. (omalley)

HADOOP-2194. dfs cat on a non-existent file throws FileNotFoundException.
(Mahadev Konar via dhruba)

HADOOP-2767. Fix for NetworkTopology erroneously skipping the last leaf
node on a rack. (Hairong Kuang and Mark Butler via dhruba)

HADOOP-1593. FsShell works with paths in non-default FileSystem.
(Mahadev Konar via dhruba)

HADOOP-2191. du and dus command on non-existent directory gives
appropriate error message. (Mahadev Konar via dhruba)

HADOOP-2832. Remove tabs from code of DFSClient for better
indentation. (dhruba)

HADOOP-2844. distcp closes file handles for sequence files.
(Tsz Wo (Nicholas), SZE via dhruba)

HADOOP-2727. Fix links in Web UI of the hadoop daemons and some docs
(Amareshwari Sri Ramadasu via ddas)

HADOOP-2871. Fixes a problem to do with file: URI in the JobHistory init.
(Amareshwari Sri Ramadasu via ddas)

HADOOP-2800. Deprecate SetFile.Writer constructor not the whole class.
(Johan Oskarsson via tomwhite)

HADOOP-2891. DFSClient.close() closes all open files. (dhruba)

HADOOP-2845. Fix dfsadmin disk utilization report on Solaris.
(Martin Traverso via tomwhite)

HADOOP-2912. MiniDFSCluster restart should wait for namenode to exit
safemode. This was causing TestFsck to fail. (Mahadev Konar via dhruba)

HADOOP-2820. The following classes in streaming are removed :
StreamLineRecordReader StreamOutputFormat StreamSequenceRecordReader.
(Amareshwari Sri Ramadasu via ddas)

HADOOP-2819. The following methods in JobConf are removed:
getInputKeyClass() setInputKeyClass getInputValueClass()
setInputValueClass(Class theClass) setSpeculativeExecution
getSpeculativeExecution() (Amareshwari Sri Ramadasu via ddas)

HADOOP-2817. Removes deprecated mapred.tasktracker.tasks.maximum and
|
||
ClusterStatus.getMaxTasks(). (Amareshwari Sri Ramadasu via ddas)
|
||
|
||
HADOOP-2821. Removes deprecated ShellUtil and ToolBase classes from
|
||
the util package. (Amareshwari Sri Ramadasu via ddas)
|
||
|
||
HADOOP-2934. The namenode was encountreing a NPE while loading
|
||
leases from the fsimage. Fixed. (dhruba)
|
||
|
||
HADOOP-2938. Some fs commands did not glob paths.
|
||
(Tsz Wo (Nicholas), SZE via rangadi)
|
||
|
||
HADOOP-2943. Compression of intermediate map output causes failures
|
||
in the merge. (cdouglas)
|
||
|
||
HADOOP-2870. DataNode and NameNode closes all connections while
|
||
shutting down. (Hairong Kuang via dhruba)
|
||
|
||
HADOOP-2973. Fix TestLocalDFS for Windows platform.
|
||
(Tsz Wo (Nicholas), SZE via dhruba)
|
||
|
||
HADOOP-2971. select multiple times if it returns early in
|
||
SocketIOWithTimeout. (rangadi)
|
||
|
||
HADOOP-2955. Fix TestCrcCorruption test failures caused by HADOOP-2758
|
||
(rangadi)
|
||
|
||
HADOOP-2657. A flush call on the DFSOutputStream flushes the last
|
||
partial CRC chunk too. (dhruba)
|
||
|
||
HADOOP-2974. IPC unit tests used "0.0.0.0" to connect to server, which
is not always supported. (rangadi)

HADOOP-2996. Fixes uses of StringBuffer in StreamUtils class.
(Dave Brosius via ddas)

HADOOP-2995. Fixes StreamBaseRecordReader's getProgress to return a
floating point number. (Dave Brosius via ddas)

HADOOP-2972. Fix for a NPE in FSDataset.invalidate.
(Mahadev Konar via dhruba)

HADOOP-2994. Code cleanup for DFSClient: remove redundant
conversions from string to string. (Dave Brosius via dhruba)

HADOOP-3009. TestFileCreation sometimes fails because restarting
minidfscluster sometimes creates datanodes with ports that are
different from their original instance. (dhruba)

HADOOP-2992. Distributed Upgrade framework works correctly with
more than one upgrade object. (Konstantin Shvachko via dhruba)

HADOOP-2679. Fix a typo in libhdfs. (Jason via dhruba)

HADOOP-2976. When a lease expires, the Namenode ensures that
blocks of the file are adequately replicated. (dhruba)

HADOOP-2901. Fixes the creation of info servers in the JobClient
and JobTracker. Removes the creation from JobClient and removes
additional info server from the JobTracker. Also adds the command
line utility to view the history files (HADOOP-2896), and fixes
bugs in JSPs to do with analysis - HADOOP-2742, HADOOP-2792.
(Amareshwari Sri Ramadasu via ddas)

HADOOP-2890. If different datanodes report the same block but
with different sizes to the namenode, the namenode picks the
replica(s) with the largest size as the only valid replica(s). (dhruba)

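Illustration for HADOOP-2890 (not the NameNode code): a minimal sketch of
the selection rule described in that entry, assuming each report can be
reduced to a (datanode name, reported block length) pair. The class name,
the method name and the map-based input are hypothetical and exist only
for this example.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Map;

// Hypothetical helper, not part of Hadoop.
final class LargestReplicaSketch {
    /** Returns the datanodes whose reported length equals the largest reported length. */
    static List<String> validReplicas(Map<String, Long> reportedLengths) {
        long max = Long.MIN_VALUE;
        for (long len : reportedLengths.values()) {
            max = Math.max(max, len);
        }
        List<String> valid = new ArrayList<>();
        for (Map.Entry<String, Long> e : reportedLengths.entrySet()) {
            // Several datanodes may tie for the largest size; all of them are kept.
            if (e.getValue() == max) {
                valid.add(e.getKey());
            }
        }
        return valid;
    }
}
```

With reports of 100, 120 and 120 bytes, only the two 120-byte replicas
would be treated as valid under this rule.
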
HADOOP-2825. Deprecated MapOutputLocation.getFile() is removed.
(Amareshwari Sri Ramadasu via ddas)

HADOOP-2806. Fixes a streaming document.
(Amareshwari Sriramadasu via ddas)

HADOOP-3008. SocketIOWithTimeout throws InterruptedIOException if the
thread is interrupted while it is waiting. (rangadi)

HADOOP-3006. Fix wrong packet size reported by DataNode when a block
is being replicated. (rangadi)

HADOOP-3029. Datanode prints log message "firstbadlink" only if
it detects a bad connection to another datanode in the pipeline. (dhruba)

HADOOP-3030. Release reserved space for file in InMemoryFileSystem if
checksum reservation fails. (Devaraj Das via cdouglas)

HADOOP-3036. Fix findbugs warnings in UpgradeUtilities. (Konstantin
Shvachko via cdouglas)

HADOOP-3025. ChecksumFileSystem supports the delete method with
the recursive flag. (Mahadev Konar via dhruba)

HADOOP-3012. dfs -mv file to user home directory throws exception if
the user home directory does not exist. (Mahadev Konar via dhruba)

HADOOP-3066. Should not require superuser privilege to query if hdfs is in
safe mode (jimk)

HADOOP-3040. If the input line starts with the separator char, the key
is set as empty. (Amareshwari Sriramadasu via ddas)

HADOOP-3080. Removes flush calls from JobHistory.
(Amareshwari Sriramadasu via ddas)

HADOOP-3086. Adds the testcase missed during commit of HADOOP-3040.
(Amareshwari Sriramadasu via ddas)

HADOOP-3046. Fix the raw comparators for Text and BytesWritables
to use the provided length rather than recompute it. (omalley)

HADOOP-3094. Fix BytesWritable.toString to avoid extending the sign bit
(Owen O'Malley via cdouglas)

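Illustration for HADOOP-3094 (not the BytesWritable source): a minimal
sketch of the general Java pitfall the entry refers to. A negative byte
is widened to a negative int before formatting, so the sign bit is
extended unless the value is masked with 0xff. The class name and the
space-separated two-digit output format are assumptions made for this
example.

```java
// Hypothetical helper, not part of Hadoop.
final class HexSketch {
    /** Formats bytes as space-separated two-digit hex values. */
    static String toHex(byte[] bytes) {
        StringBuilder sb = new StringBuilder();
        for (int i = 0; i < bytes.length; i++) {
            if (i > 0) {
                sb.append(' ');
            }
            // Mask to 0..255; without the mask, (byte) 0x80 widens to 0xffffff80.
            int unsigned = bytes[i] & 0xff;
            if (unsigned < 0x10) {
                sb.append('0'); // pad single-digit values to two digits
            }
            sb.append(Integer.toHexString(unsigned));
        }
        return sb.toString();
    }
}
```

For the bytes 0x80 and 0x0a this yields "80 0a", whereas calling
Integer.toHexString directly on the byte 0x80 yields "ffffff80".
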
HADOOP-3067. DFSInputStream's position read does not close the sockets.
(rangadi)

HADOOP-3073. close() on SocketInputStream or SocketOutputStream should
close the underlying channel. (rangadi)

HADOOP-3087. Fixes a problem to do with refreshing of loadHistory.jsp.
(Amareshwari Sriramadasu via ddas)

HADOOP-3065. Better logging message if the rack location of a datanode
cannot be determined. (Devaraj Das via dhruba)

HADOOP-3064. Commas in a file path should not be treated as delimiters.
(Hairong Kuang via shv)

HADOOP-2997. Adds test for non-writable serializer. Also fixes a problem
introduced by HADOOP-2399. (Tom White via ddas)

HADOOP-3114. Fix TestDFSShell on Windows. (Lohit Vijaya Renu via cdouglas)

HADOOP-3118. Fix Namenode NPE while loading fsimage after a cluster
upgrade from older disk format. (dhruba)

HADOOP-3161. Fix FileUtil.HardLink.getLinkCount on Mac OS. (nigel
via omalley)

HADOOP-2927. Fix TestDU to accurately calculate the expected file size.
(shv via nigel)

HADOOP-3123. Fix the native library build scripts to work on Solaris.
(tomwhite via omalley)

HADOOP-3089. Streaming should accept stderr from task before
first key arrives. (Rick Cox via tomwhite)

HADOOP-3146. A DFSOutputStream.flush method is renamed as
DFSOutputStream.fsync. (dhruba)

HADOOP-3165. -put/-copyFromLocal did not treat input file "-" as stdin.
(Lohit Vijayarenu via rangadi)

HADOOP-3041. Deprecate JobConf.setOutputPath and JobConf.getOutputPath.
Deprecate OutputFormatBase. Add FileOutputFormat. Existing output formats
extending OutputFormatBase now extend FileOutputFormat. Add the following
APIs in FileOutputFormat: setOutputPath, getOutputPath, getWorkOutputPath.
(Amareshwari Sriramadasu via nigel)

HADOOP-3083. The fsimage does not store leases. This would have to be
reworked in the next release to support appends. (dhruba)

HADOOP-3166. Fix an ArrayIndexOutOfBoundsException in the spill thread
and make exception handling more promiscuous to catch this condition.
(cdouglas)

HADOOP-3050. DataNode sends one and only one block report after
it registers with the namenode. (Hairong Kuang)

HADOOP-3044. NNBench sets the right configuration for the mapper.
(Hairong Kuang)

HADOOP-3178. Fix GridMix scripts for small and medium jobs
to handle input paths differently. (Mukund Madhugiri via nigel)

HADOOP-1911. Fix an infinite loop in DFSClient when all replicas of a
block are bad (cdouglas)

HADOOP-3157. Fix path handling in DistributedCache and TestMiniMRLocalFS.
(Doug Cutting via rangadi)

HADOOP-3018. Fix the eclipse plug-in contrib wrt removed deprecated
methods (taton)

HADOOP-3183. Fix TestJobShell to use 'ls' instead of java.io.File::exists
since cygwin symlinks are unsupported.
(Mahadev Konar via cdouglas)

HADOOP-3175. Fix FsShell.CommandFormat to handle "-" in arguments.
(Edward J. Yoon via rangadi)

HADOOP-3220. Safemode message corrected. (shv)

HADOOP-3208. Fix WritableDeserializer to set the Configuration on
deserialized Writables. (Enis Soztutar via cdouglas)

HADOOP-3224. 'dfs -du /dir' does not return correct size.
(Lohit Vijayarenu via rangadi)

HADOOP-3223. Fix typo in help message for -chmod. (rangadi)

HADOOP-1373. checkPath() should ignore case when it compares authority.
(Edward J. Yoon via rangadi)

HADOOP-3204. Fixes a problem to do with ReduceTask's LocalFSMerger not
catching Throwable. (Amar Ramesh Kamat via ddas)

HADOOP-3229. Report progress when collecting records from the mapper and
the combiner. (Doug Cutting via cdouglas)

HADOOP-3225. Unwrapping methods of RemoteException should initialize
detailedMassage field. (Mahadev Konar, shv, cdouglas)

HADOOP-3247. Fix gridmix scripts to use the correct globbing syntax and
change maxentToSameCluster to run the correct number of jobs.
(Runping Qi via cdouglas)

HADOOP-3242. Fix the RecordReader of SequenceFileAsBinaryInputFormat to
correctly read from the start of the split and not the beginning of the
file. (cdouglas via acmurthy)

HADOOP-3256. Encodes the job name used in the filename for history files.
(Arun Murthy via ddas)

HADOOP-3162. Ensure that comma-separated input paths are treated correctly
as multiple input paths. (Amareshwari Sri Ramadasu via acmurthy)

HADOOP-3263. Ensure that the job-history log file always follows the
pattern of hostname_timestamp_jobid_username_jobname even if username
and/or jobname are not specified. This helps to avoid wrong assumptions
made about the job-history log filename in jobhistory.jsp. (acmurthy)

HADOOP-3251. Fixes getFilesystemName in JobTracker and LocalJobRunner to
use FileSystem.getUri instead of FileSystem.getName. (Arun Murthy via ddas)

HADOOP-3237. Fixes TestDFSShell.testErrOutPut on Windows platform.
(Mahadev Konar via ddas)

HADOOP-3279. TaskTracker checks for SUCCEEDED task status in addition to
COMMIT_PENDING status when it fails maps due to lost map.
(Devaraj Das)

HADOOP-3286. Prevent collisions in gridmix output dirs by increasing the
granularity of the timestamp. (Runping Qi via cdouglas)

HADOOP-3285. Fix input split locality when the splits align to
fs blocks. (omalley)

HADOOP-3372. Fix heap management in streaming tests. (Arun Murthy via
cdouglas)

HADOOP-3031. Fix javac warnings in test classes. (cdouglas)

HADOOP-3382. Fix memory leak when files are not cleanly closed (rangadi)

HADOOP-3322. Fix to push MetricsRecord for rpc metrics. (Eric Yang via
mukund)

Release 0.16.4 - 2008-05-05

BUG FIXES

HADOOP-3138. DFS mkdirs() should not throw an exception if the directory
already exists. (rangadi via mukund)

HADOOP-3294. Fix distcp to check the destination length and retry the copy
if it doesn't match the src length. (Tsz Wo (Nicholas), SZE via mukund)

HADOOP-3186. Fix incorrect permission checking for mv and renameTo
in HDFS. (Tsz Wo (Nicholas), SZE via mukund)

Release 0.16.3 - 2008-04-16

BUG FIXES

HADOOP-3010. Fix ConcurrentModificationException in ipc.Server.Responder.
(rangadi)

HADOOP-3154. Catch all Throwables from the SpillThread in MapTask, rather
than IOExceptions only. (ddas via cdouglas)

HADOOP-3159. Avoid file system cache being overwritten whenever
configuration is modified. (Tsz Wo (Nicholas), SZE via hairong)

HADOOP-3139. Remove the consistency check for the FileSystem cache in
closeAll() that causes spurious warnings and a deadlock.
(Tsz Wo (Nicholas), SZE via cdouglas)

HADOOP-3195. Fix TestFileSystem to be deterministic.
(Tsz Wo (Nicholas), SZE via cdouglas)

HADOOP-3069. Primary name-node should not truncate image when transferring
it from the secondary. (shv)

HADOOP-3182. Change permissions of the job-submission directory to 777
from 733 to ensure sharing of HOD clusters works correctly. (Tsz Wo
(Nicholas), SZE and Amareshwari Sri Ramadasu via acmurthy)

Release 0.16.2 - 2008-04-02

BUG FIXES

HADOOP-3011. Prohibit distcp from overwriting directories on the
destination filesystem with files. (cdouglas)

HADOOP-3033. The BlockReceiver thread in the datanode writes data to
the block file, changes file position (if needed) and flushes all by
itself. The PacketResponder thread does not flush block file. (dhruba)

HADOOP-2978. Fixes the JobHistory log format for counters.
(Runping Qi via ddas)

HADOOP-2985. Fixes LocalJobRunner to tolerate null job output path.
Also makes the _temporary a constant in MRConstants.java.
(Amareshwari Sriramadasu via ddas)

HADOOP-3003. FileSystem cache key is updated after a
FileSystem object is created. (Tsz Wo (Nicholas), SZE via dhruba)

HADOOP-3042. Updates the Javadoc in JobConf.getOutputPath to reflect
the actual temporary path. (Amareshwari Sriramadasu via ddas)

HADOOP-3007. Tolerate mirror failures while DataNode is replicating
blocks as it used to before. (rangadi)

HADOOP-2944. Fixes a "Run on Hadoop" wizard NPE when creating a
Location from the wizard. (taton)

HADOOP-3049. Fixes a problem in MultiThreadedMapRunner to do with
catching RuntimeExceptions. (Alejandro Abdelnur via ddas)

HADOOP-3039. Fixes a problem to do with exceptions in tasks not
killing jobs. (Amareshwari Sriramadasu via ddas)

HADOOP-3027. Fixes a problem to do with adding a shutdown hook in
FileSystem. (Amareshwari Sriramadasu via ddas)

HADOOP-3056. Fix distcp when the target is an empty directory by
making sure the directory is created first. (cdouglas and acmurthy
via omalley)

HADOOP-3070. Protect the trash emptier thread from null pointer
exceptions. (Koji Noguchi via omalley)

HADOOP-3084. Fix HftpFileSystem to work for zero-length files.
(cdouglas)

HADOOP-3107. Fix NPE when fsck invokes getListings. (dhruba)

HADOOP-3104. Limit MultithreadedMapRunner to have a fixed length queue
between the RecordReader and the map threads. (Alejandro Abdelnur via
omalley)

HADOOP-2833. Do not use "Dr. Who" as the default user in JobClient.
A valid user name is required. (Tsz Wo (Nicholas), SZE via rangadi)

HADOOP-3128. Throw RemoteException in setPermissions and setOwner of
DistributedFileSystem. (shv via nigel)

Release 0.16.1 - 2008-03-13

INCOMPATIBLE CHANGES

HADOOP-2869. Deprecate SequenceFile.setCompressionType in favor of
SequenceFile.createWriter, SequenceFileOutputFormat.setCompressionType,
and JobConf.setMapOutputCompressionType. (Arun C Murthy via cdouglas)
Configuration changes to hadoop-default.xml:
  deprecated io.seqfile.compression.type

IMPROVEMENTS

HADOOP-2371. User guide for file permissions in HDFS.
(Robert Chansler via rangadi)

HADOOP-3098. Allow more characters in user and group names while
using -chown and -chgrp commands. (rangadi)

BUG FIXES

HADOOP-2789. Race condition in IPC Server Responder that could close
connections early. (Raghu Angadi)

HADOOP-2785. minor. Fix a typo in Datanode block verification
(Raghu Angadi)

HADOOP-2788. minor. Fix help message for chgrp shell command (Raghu Angadi).

HADOOP-1188. fstime file is updated when a storage directory containing
namespace image becomes inaccessible. (shv)

HADOOP-2787. An application can set a configuration variable named
dfs.umask to set the umask that is used by DFS.
(Tsz Wo (Nicholas), SZE via dhruba)

HADOOP-2780. The default socket buffer size for DataNodes is 128K.
(dhruba)

HADOOP-2716. Superuser privileges for the Balancer.
(Tsz Wo (Nicholas), SZE via shv)

HADOOP-2754. Filter out .crc files from local file system listing.
(Hairong Kuang via shv)

HADOOP-2733. Fix compiler warnings in test code.
(Tsz Wo (Nicholas), SZE via cdouglas)

HADOOP-2725. Modify distcp to avoid leaving partially copied files at
the destination after encountering an error. (Tsz Wo (Nicholas), SZE
via cdouglas)

HADOOP-2391. Cleanup job output directory before declaring a job as
SUCCESSFUL. (Amareshwari Sri Ramadasu via ddas)

HADOOP-2808. Minor fix to FileUtil::copy to mind the overwrite
formal. (cdouglas)

HADOOP-2683. Moving UGI out of the RPC Server.
(Tsz Wo (Nicholas), SZE via shv)

HADOOP-2814. Fix for NPE in datanode in unit test TestDataTransferProtocol.
(Raghu Angadi via dhruba)

HADOOP-2811. Dump of counters in job history does not add comma between
groups. (runping via omalley)

HADOOP-2735. Enables setting TMPDIR for tasks.
(Amareshwari Sri Ramadasu via ddas)

HADOOP-2843. Fix protections on map-side join classes to enable derivation.
(cdouglas via omalley)

HADOOP-2840. Fix gridmix scripts to correctly invoke the java sort through
the proper jar. (Mukund Madhugiri via cdouglas)

HADOOP-2769. TestNNThroughputBenchmark should not use a fixed port for
the namenode http port. (omalley)

HADOOP-2852. Update gridmix benchmark to avoid an artificially long tail.
(cdouglas)

HADOOP-2894. Fix a problem to do with tasktrackers failing to connect to
JobTracker upon reinitialization. (Owen O'Malley via ddas).

HADOOP-2903. Fix exception generated by Metrics while using pushMetric().
(girish vaitheeswaran via dhruba)

HADOOP-2904. Fix to RPC metrics to log the correct host name.
(girish vaitheeswaran via dhruba)

HADOOP-2918. Improve error logging so that dfs writes failure with
"No lease on file" can be diagnosed. (dhruba)

HADOOP-2923. Add SequenceFileAsBinaryInputFormat, which was
missed in the commit for HADOOP-2603. (cdouglas via omalley)

HADOOP-2931. IOException thrown by DFSOutputStream had wrong stack
trace in some cases. (Michael Bieniosek via rangadi)

HADOOP-2883. Write failures and data corruptions on HDFS files.
The write timeout is back to what it was on 0.15 release. Also, the
datanode flushes the block file buffered output stream before
sending a positive ack for the packet back to the client. (dhruba)

HADOOP-2756. NPE in DFSClient while closing DFSOutputStreams
under load. (rangadi)

HADOOP-2958. Fixed FileBench which broke due to HADOOP-2391 which performs
a check for existence of the output directory and a trivial bug in
GenericMRLoadGenerator where min/max word lengths were identical since
they were looking at the same config variables (Chris Douglas via
acmurthy)

HADOOP-2915. Fixed FileSystem.CACHE so that a username is included
in the cache key. (Tsz Wo (Nicholas), SZE via nigel)

HADOOP-2813. TestDU unit test uses its own directory to run its
sequence of tests. (Mahadev Konar via dhruba)

Release 0.16.0 - 2008-02-07

INCOMPATIBLE CHANGES

HADOOP-1245. Use the mapred.tasktracker.tasks.maximum value
configured on each tasktracker when allocating tasks, instead of
the value configured on the jobtracker. InterTrackerProtocol
version changed from 5 to 6. (Michael Bieniosek via omalley)

HADOOP-1843. Removed code from Configuration and JobConf deprecated by
HADOOP-785 and a minor fix to Configuration.toString. Specifically the
important change is that mapred-default.xml is no longer supported and
Configuration no longer supports the notion of default/final resources.
(acmurthy)

HADOOP-1302. Remove deprecated abacus code from the contrib directory.
This also fixes a configuration bug in AggregateWordCount, so that the
job now works. (enis)

HADOOP-2288. Enhance FileSystem API to support access control.
(Tsz Wo (Nicholas), SZE via dhruba)

HADOOP-2184. RPC Support for user permissions and authentication.
(Raghu Angadi via dhruba)

HADOOP-2185. RPC Server uses any available port if the specified
port is zero. Otherwise it uses the specified port. Also combines
the configuration attributes for the servers' bind address and
port from "x.x.x.x" and "y" to "x.x.x.x:y".
Deprecated configuration variables:
  dfs.info.bindAddress
  dfs.info.port
  dfs.datanode.bindAddress
  dfs.datanode.port
  dfs.datanode.info.bindAddress
  dfs.datanode.info.port
  dfs.secondary.info.bindAddress
  dfs.secondary.info.port
  mapred.job.tracker.info.bindAddress
  mapred.job.tracker.info.port
  mapred.task.tracker.report.bindAddress
  tasktracker.http.bindAddress
  tasktracker.http.port
New configuration variables (post HADOOP-2404):
  dfs.secondary.http.address
  dfs.datanode.address
  dfs.datanode.http.address
  dfs.http.address
  mapred.job.tracker.http.address
  mapred.task.tracker.report.address
  mapred.task.tracker.http.address
(Konstantin Shvachko via dhruba)

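Illustration for HADOOP-2185 (not Hadoop's Configuration code): a minimal
sketch of how a separate bind address and port collapse into the single
"x.x.x.x:y" form. It uses java.util.Properties as a stand-in for a
configuration object. The class and method names are hypothetical, and
the pairing of dfs.info.bindAddress/dfs.info.port with dfs.http.address
as well as the port value 50070 are assumptions chosen for the example;
the entry above lists the old and new names but does not state the
mapping.

```java
import java.util.Properties;

// Hypothetical helper, not part of Hadoop.
final class AddressMigrationSketch {
    /** Joins a legacy bind address and port into the combined "host:port" form. */
    static String combine(String bindAddress, String port) {
        return bindAddress + ":" + port;
    }

    /**
     * Returns the value of newKey if set; otherwise falls back to the
     * combined legacy pair, or null when neither form is configured.
     */
    static String resolve(Properties conf, String newKey,
                          String oldAddressKey, String oldPortKey) {
        String value = conf.getProperty(newKey);
        if (value != null) {
            return value;
        }
        String host = conf.getProperty(oldAddressKey);
        String port = conf.getProperty(oldPortKey);
        return (host != null && port != null) ? combine(host, port) : null;
    }
}
```

A port of "0" in the combined value, for example "0.0.0.0:0", corresponds
to the "any available port" behaviour described in the entry.
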
HADOOP-2401. Only the current leaseholder can abandon a block for
a HDFS file. ClientProtocol version changed from 20 to 21.
(Tsz Wo (Nicholas), SZE via dhruba)

HADOOP-2381. Support permission information in FileStatus. Client
Protocol version changed from 21 to 22. (Raghu Angadi via dhruba)

HADOOP-2110. Block report processing creates fewer transient objects.
Datanode Protocol version changed from 10 to 11.
(Sanjay Radia via dhruba)

HADOOP-2567. Add FileSystem#getHomeDirectory(), which returns the
user's home directory in a FileSystem as a fully-qualified path.
FileSystem#getWorkingDirectory() is also changed to return a
fully-qualified path, which can break applications that attempt
to, e.g., pass LocalFileSystem#getWorkingDir().toString() directly
to java.io methods that accept file names. (cutting)

HADOOP-2514. Change trash feature to maintain a per-user trash
directory, named ".Trash" in the user's home directory. The
"fs.trash.root" parameter is no longer used. Full source paths
are also no longer reproduced within the trash.

HADOOP-2012. Periodic data verification on Datanodes.
(Raghu Angadi via dhruba)

HADOOP-1707. The DFSClient does not use a local disk file to cache
writes to a HDFS file. Changed Data Transfer Version from 7 to 8.
(dhruba)

HADOOP-2652. Fix permission issues for HftpFileSystem. This is an
incompatible change since distcp may not be able to copy files
from cluster A (compiled with this patch) to cluster B (compiled
with previous versions). (Tsz Wo (Nicholas), SZE via dhruba)

NEW FEATURES

HADOOP-1857. Ability to run a script when a task fails to capture stack
traces. (Amareshwari Sri Ramadasu via ddas)

HADOOP-2299. Definition of a login interface. A simple implementation for
Unix users and groups. (Hairong Kuang via dhruba)

HADOOP-1652. A utility to balance data among datanodes in a HDFS cluster.
(Hairong Kuang via dhruba)

HADOOP-2085. A library to support map-side joins of consistently
partitioned and sorted data sets. (Chris Douglas via omalley)

HADOOP-2336. Shell commands to modify file permissions. (rangadi)

HADOOP-1298. Implement file permissions for HDFS.
(Tsz Wo (Nicholas) & taton via cutting)

HADOOP-2447. HDFS can be configured to limit the total number of
objects (inodes and blocks) in the file system. (dhruba)

HADOOP-2487. Added an option to get statuses for all submitted/run jobs.
This information can be used to develop tools for analysing jobs.
(Amareshwari Sri Ramadasu via acmurthy)

HADOOP-1873. Implement user permissions for Map/Reduce framework.
(Hairong Kuang via shv)

HADOOP-2532. Add to MapFile a getClosest method that returns the key
that comes just before if the key is not present. (stack via tomwhite)

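Illustration for HADOOP-2532 (an analogy, not the MapFile API): a minimal
sketch of the "key that comes just before" lookup described in that
entry, using java.util.TreeMap in place of a MapFile. The class and
method names are hypothetical. TreeMap.floorKey returns the greatest key
less than or equal to the requested one, which matches the described
behaviour for sorted keys.

```java
import java.util.TreeMap;

// Hypothetical helper, not part of Hadoop.
final class ClosestKeySketch {
    /**
     * Returns the key itself when present, otherwise the greatest key
     * that sorts before it, or null when no such key exists.
     */
    static String closestBefore(TreeMap<String, String> sorted, String key) {
        return sorted.floorKey(key);
    }
}
```

With keys "b" and "d" stored, a lookup for "c" would resolve to "b", and
a lookup for "a" would find nothing.
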
HADOOP-1883. Add versioning to Record I/O. (Vivek Ratan via ddas)

HADOOP-2603. Add SequenceFileAsBinaryInputFormat, which reads
sequence files as BytesWritable/BytesWritable regardless of the
key and value types used to write the file. (cdouglas via omalley)

HADOOP-2367. Add ability to profile a subset of map/reduce tasks and fetch
the result to the local filesystem of the submitting application. Also
includes a general IntegerRanges extension to Configuration for setting
positive, ranged parameters. (Owen O'Malley via cdouglas)

IMPROVEMENTS

HADOOP-2045. Change committer list on website to a table, so that
folks can list their organization, timezone, etc. (cutting)

HADOOP-2058. Facilitate creating new datanodes dynamically in
MiniDFSCluster. (Hairong Kuang via dhruba)

HADOOP-1855. fsck verifies block placement policies and reports
violations. (Konstantin Shvachko via dhruba)

HADOOP-1604. A system administrator can finalize namenode upgrades
without running the cluster. (Konstantin Shvachko via dhruba)

HADOOP-1839. Link-ify the Pending/Running/Complete/Killed grid in
jobdetails.jsp to help quickly narrow down and see categorized TIPs'
details via jobtasks.jsp. (Amar Kamat via acmurthy)

HADOOP-1210. Log counters in job history. (Owen O'Malley via ddas)

HADOOP-1912. Datanode has two new commands COPY and REPLACE. These are
needed for supporting data rebalance. (Hairong Kuang via dhruba)

HADOOP-2086. This patch adds the ability to add dependencies to a job
(run via JobControl) after construction. (Adrian Woodhead via ddas)

HADOOP-1185. Support changing the logging level of a server without
restarting the server. (Tsz Wo (Nicholas), SZE via dhruba)

HADOOP-2134. Remove developer-centric requirements from overview.html and
keep it end-user focussed, specifically sections related to subversion and
building Hadoop. (Jim Kellerman via acmurthy)

HADOOP-1989. Support simulated DataNodes. This helps creating large virtual
clusters for testing purposes. (Sanjay Radia via dhruba)

HADOOP-1274. Support different number of mappers and reducers per
TaskTracker to allow administrators to better configure and utilize
heterogeneous clusters.
Configuration changes to hadoop-default.xml:
  add mapred.tasktracker.map.tasks.maximum (default value of 2)
  add mapred.tasktracker.reduce.tasks.maximum (default value of 2)
  remove mapred.tasktracker.tasks.maximum (deprecated for 0.16.0)
(Amareshwari Sri Ramadasu via acmurthy)

HADOOP-2104. Adds a description to the ant targets. This makes the
output of "ant -projecthelp" sensible. (Chris Douglas via ddas)

HADOOP-2127. Added a pipes sort example to benchmark trivial pipes
application versus trivial java application. (omalley via acmurthy)

HADOOP-2113. A new shell command "dfs -text" to view the contents of
a gzipped or SequenceFile. (Chris Douglas via dhruba)

HADOOP-2207. Add a "package" target for contrib modules that
permits each to determine what files are copied into release
builds. (stack via cutting)

HADOOP-1984. Makes the backoff for failed fetches exponential.
Earlier, it was a random backoff from an interval.
(Amar Kamat via ddas)

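Illustration for HADOOP-1984 (not the shuffle code): a minimal sketch of
an exponential backoff, where the delay doubles with each consecutive
failed fetch. The class name, the method name, the base delay and the
upper bound are hypothetical; the entry above does not state the values
Hadoop uses, nor whether the delay is capped.

```java
// Hypothetical helper, not part of Hadoop.
final class FetchBackoffSketch {
    /**
     * Returns the delay before the next retry: baseMillis after the first
     * failure, doubling for each further failure, never above maxMillis.
     */
    static long backoffMillis(int failures, long baseMillis, long maxMillis) {
        if (failures <= 0) {
            return 0L; // nothing has failed yet, so no delay
        }
        long delay = baseMillis;
        // Stop doubling once the cap is reached, which also avoids overflow.
        for (int i = 1; i < failures && delay < maxMillis; i++) {
            delay *= 2;
        }
        return Math.min(delay, maxMillis);
    }
}
```

With a base of 1000 ms and a cap of 60000 ms, successive failures would
wait 1000, 2000, 4000 ms and so on until the cap is reached.
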
HADOOP-1327. Include website documentation for streaming. (Rob Weltman
via omalley)

HADOOP-2000. Rewrite NNBench to measure namenode performance accurately.
It now uses the map-reduce framework for load generation.
(Mukund Madhugiri via dhruba)

HADOOP-2248. Speeds up the framework w.r.t Counters. Also has API
updates to the Counters part. (Owen O'Malley via ddas)

HADOOP-2326. The initial block report at Datanode startup time has
a random backoff period. (Sanjay Radia via dhruba)

HADOOP-2432. HDFS includes the name of the file while throwing
"File does not exist" exception. (Jim Kellerman via dhruba)

HADOOP-2457. Added a 'forrest.home' property to the 'docs' target in
build.xml. (acmurthy)

HADOOP-2149. A new benchmark for three name-node operations: file create,
open, and block report, to evaluate the name-node performance
for optimizations or new features. (Konstantin Shvachko via shv)

HADOOP-2466. Change FileInputFormat.computeSplitSize to a protected
non-static method to allow sub-classes to provide alternate
implementations. (Alejandro Abdelnur via acmurthy)

HADOOP-2425. Change TextOutputFormat to handle Text specifically for better
performance. Make NullWritable implement Comparable. Make TextOutputFormat
treat NullWritable like null. (omalley)

HADOOP-1719. Improves the utilization of shuffle copier threads.
(Amar Kamat via ddas)

HADOOP-2390. Added documentation for user-controls for intermediate
map-outputs & final job-outputs and native-hadoop libraries. (acmurthy)

HADOOP-1660. Add the cwd of the map/reduce task to the java.library.path
of the child-jvm to support loading of native libraries distributed via
the DistributedCache. (acmurthy)

HADOOP-2285. Speeds up TextInputFormat. Also includes updates to the
Text API. (Owen O'Malley via cdouglas)

HADOOP-2233. Adds a generic load generator for modeling MR jobs. (cdouglas)

HADOOP-2369. Adds a set of scripts for simulating a mix of user map/reduce
workloads. (Runping Qi via cdouglas)

HADOOP-2547. Removes use of a 'magic number' in build.xml.
(Hrishikesh via nigel)

HADOOP-2268. Fix org.apache.hadoop.mapred.jobcontrol classes to use the
List/Map interfaces rather than concrete ArrayList/HashMap classes
internally. (Adrian Woodhead via acmurthy)

HADOOP-2406. Add a benchmark for measuring read/write performance through
the InputFormat interface, particularly with compression. (cdouglas)

HADOOP-2131. Allow finer-grained control over speculative-execution. Now
users can set it for maps and reduces independently.
Configuration changes to hadoop-default.xml:
  deprecated mapred.speculative.execution
  add mapred.map.tasks.speculative.execution
  add mapred.reduce.tasks.speculative.execution
(Amareshwari Sri Ramadasu via acmurthy)

HADOOP-1965. Interleave sort/spill in the map-task along with calls to the
Mapper.map method. This is done by splitting the 'io.sort.mb' buffer into
two and using one half for collecting map-outputs and the other half for
sort/spill. (Amar Kamat via acmurthy)

HADOOP-2464. Unit tests for chmod, chown, and chgrp using DFS.
(Raghu Angadi)

HADOOP-1876. Persist statuses of completed jobs in HDFS so that the
JobClient can query and get information about decommissioned jobs and also
across JobTracker restarts.
Configuration changes to hadoop-default.xml:
  add mapred.job.tracker.persist.jobstatus.active (default value of false)
  add mapred.job.tracker.persist.jobstatus.hours (default value of 0)
  add mapred.job.tracker.persist.jobstatus.dir (default value of
    /jobtracker/jobsInfo)
(Alejandro Abdelnur via acmurthy)

HADOOP-2077. Added version and build information to STARTUP_MSG for all
hadoop daemons to aid error-reporting, debugging etc. (acmurthy)

HADOOP-2398. Additional instrumentation for NameNode and RPC server.
|
||
Add support for accessing instrumentation statistics via JMX.
|
||
(Sanjay radia via dhruba)
|
||
|
||
HADOOP-2449. A return of the non-MR version of NNBench.
|
||
(Sanjay Radia via shv)
|
||
|
||
HADOOP-1989. Remove 'datanodecluster' command from bin/hadoop.
|
||
(Sanjay Radia via shv)
|
||
|
||
HADOOP-1742. Improve JavaDoc documentation for ClientProtocol, DFSClient,
|
||
and FSNamesystem. (Konstantin Shvachko)
|
||
|
||
HADOOP-2298. Add Ant target for a binary-only distribution.
|
||
(Hrishikesh via nigel)
|
||
|
||
HADOOP-2509. Add Ant target for Rat report (Apache license header
|
||
reports). (Hrishikesh via nigel)
|
||
|
||
HADOOP-2469. WritableUtils.clone should take a Configuration
|
||
instead of a JobConf. (stack via omalley)
|
||
|
||
HADOOP-2659. Introduce superuser permissions for admin operations.
|
||
(Tsz Wo (Nicholas), SZE via shv)
|
||
|
||
HADOOP-2596. Added a SequenceFile.createWriter api which allows the user
|
||
to specify the blocksize, replication factor and the buffersize to be
|
||
used for the underlying HDFS file. (Alejandro Abdelnur via acmurthy)
|
||
|
||
HADOOP-2431. Test HDFS File Permissions. (Hairong Kuang via shv)
|
||
|
||
HADOOP-2232. Add an option to disable Nagle's algorithm in the IPC stack.
|
||
(Clint Morgan via cdouglas)
|
||
|
||
HADOOP-2342. Created a micro-benchmark for measuring
|
||
local-file versus hdfs reads. (Owen O'Malley via nigel)
|
||
|
||
HADOOP-2529. First version of HDFS User Guide. (Raghu Angadi)
|
||
|
||
HADOOP-2690. Add jar-test target to build.xml, separating compilation
|
||
and packaging of the test classes. (Enis Soztutar via cdouglas)
|
||
|
||
OPTIMIZATIONS

HADOOP-1898. Release the lock protecting the time of the last stack
dump while the dump is happening. (Amareshwari Sri Ramadasu via omalley)

HADOOP-1900. Makes the heartbeat and task event queries interval
dependent on the cluster size. (Amareshwari Sri Ramadasu via ddas)

HADOOP-2208. Counter update frequency (from TaskTracker to JobTracker) is
capped at 1 minute. (Amareshwari Sri Ramadasu via ddas)

HADOOP-2284. Reduce the number of progress updates during the sorting in
the map task. (Amar Kamat via ddas)

BUG FIXES

HADOOP-2583. Fixes a bug in the Eclipse plug-in UI to edit locations.
Plug-in version is now synchronized with Hadoop version.

HADOOP-2100. Remove faulty check for existence of $HADOOP_PID_DIR and let
'mkdir -p' check & create it. (Michael Bieniosek via acmurthy)

HADOOP-1642. Ensure jobids generated by LocalJobRunner are unique to
avoid collisions and hence job-failures. (Doug Cutting via acmurthy)

HADOOP-2096. Close open file-descriptors held by streams while localizing
job.xml in the JobTracker and while displaying it on the webui in
jobconf.jsp. (Amar Kamat via acmurthy)

HADOOP-2098. Log start & completion of empty jobs to JobHistory, which
also ensures that we close the file-descriptor of the job's history log
opened during job-submission. (Amar Kamat via acmurthy)

HADOOP-2112. Adding back changes to build.xml lost while reverting
HADOOP-1622 i.e. http://svn.apache.org/viewvc?view=rev&revision=588771.
(acmurthy)

HADOOP-2089. Fixes the command line argument handling to handle multiple
-cacheArchive in Hadoop streaming. (Lohit Vijayarenu via ddas)

HADOOP-2071. Fix StreamXmlRecordReader to use a BufferedInputStream
wrapped over the DFSInputStream since mark/reset aren't supported by
DFSInputStream anymore. (Lohit Vijayarenu via acmurthy)

HADOOP-1348. Allow XML comments inside configuration files.
(Rajagopal Natarajan and Enis Soztutar via enis)

HADOOP-1952. Improve handling of invalid, user-specified classes while
configuring streaming jobs such as combiner, input/output formats etc.
Now invalid options are caught, logged and jobs are failed early. (Lohit
Vijayarenu via acmurthy)

HADOOP-2151. FileSystem.globPaths validates the list of Paths that
it returns. (Lohit Vijayarenu via dhruba)

HADOOP-2121. Clean up DFSOutputStream when the stream encountered errors
when Datanodes became full. (Raghu Angadi via dhruba)

HADOOP-1130. The FileSystem.closeAll() method closes all existing
DFSClients. (Chris Douglas via dhruba)

HADOOP-2204. DFSTestUtil.waitReplication was not waiting for all replicas
to get created, thus causing unit test failure.
(Raghu Angadi via dhruba)

HADOOP-2078. A zero-size file may have no blocks associated with it.
(Konstantin Shvachko via dhruba)

HADOOP-2212. ChecksumFileSystem.getSumBufferSize might throw
java.lang.ArithmeticException. The fix is to initialize bytesPerChecksum
to 0. (Michael Bieniosek via ddas)

HADOOP-2216. Fix jobtasks.jsp to ensure that it first collects the
taskids which satisfy the filtering criteria and then uses that list to
print out only the required task-reports. Previously it was oblivious to
the filtering and hence used the wrong index into the array of task-reports.
(Amar Kamat via acmurthy)

HADOOP-2272. Fix findbugs target to reflect changes made to the location
of the streaming jar file by HADOOP-2207. (Adrian Woodhead via nigel)

HADOOP-2244. Fixes the MapWritable.readFields to clear the instance
field variable every time readFields is called. (Michael Stack via ddas)

HADOOP-2245. Fixes LocalJobRunner to include a jobId in the mapId. Also,
adds a testcase for JobControl. (Adrian Woodhead via ddas)

HADOOP-2275. Fix erroneous detection of corrupted file when namenode
fails to allocate any datanodes for newly allocated block.
(Dhruba Borthakur via dhruba)

HADOOP-2256. Fix a bug in the namenode that could cause it to encounter
an infinite loop while deleting excess replicas that were created by
block rebalancing. (Hairong Kuang via dhruba)

HADOOP-2209. SecondaryNamenode process exits if it encounters exceptions
that it cannot handle. (Dhruba Borthakur via dhruba)

HADOOP-2314. Prevent TestBlockReplacement from occasionally getting
into an infinite loop. (Hairong Kuang via dhruba)

HADOOP-2300. This fixes a bug where mapred.tasktracker.tasks.maximum
would be ignored even if it was set in hadoop-site.xml.
(Amareshwari Sri Ramadasu via ddas)

HADOOP-2349. Improve code layout in file system transaction logging code.
(Tsz Wo (Nicholas), SZE via dhruba)

HADOOP-2368. Fix unit tests on Windows.
(Tsz Wo (Nicholas), SZE via dhruba)

HADOOP-2363. This fix allows running multiple instances of the unit test
in parallel. The bug was introduced in HADOOP-2185 that changed
port-rolling behaviour. (Konstantin Shvachko via dhruba)

HADOOP-2271. Fix chmod task to be non-parallel. (Adrian Woodhead via
omalley)

HADOOP-2313. Fail the build if building libhdfs fails. (nigel via omalley)

HADOOP-2359. Remove warning for interrupted exception when closing down
minidfs. (dhruba via omalley)

HADOOP-1841. Prevent slow clients from consuming threads in the NameNode.
(dhruba)

HADOOP-2323. JobTracker.close() should not print stack traces for
normal exit. (jimk via cutting)

HADOOP-2376. Prevents sort example from overriding the number of maps.
(Owen O'Malley via ddas)

HADOOP-2434. FSDatasetInterface read interface causes HDFS reads to occur
in 1 byte chunks, causing performance degradation.
(Raghu Angadi via dhruba)

HADOOP-2459. Fix package target so that src/docs/build files are not
included in the release. (nigel)

HADOOP-2215. Fix documentation in cluster_setup.html &
mapred_tutorial.html to reflect that mapred.tasktracker.tasks.maximum has
been superseded by mapred.tasktracker.{map|reduce}.tasks.maximum.
(Amareshwari Sri Ramadasu via acmurthy)

HADOOP-2352. Remove AC_CHECK_LIB for libz and liblzo to ensure that
libhadoop.so doesn't have a dependency on them. (acmurthy)

HADOOP-2453. Fix the configuration for wordcount-simple example in Hadoop
Pipes which currently produces an XML parsing error. (Amareshwari Sri
Ramadasu via acmurthy)

HADOOP-2476. Unit test failure while reading permission bits of local
file system (on Windows) fixed. (Raghu Angadi via dhruba)

HADOOP-2247. Fine-tune the strategies for killing mappers and reducers
due to failures while fetching map-outputs. Now the map-completion times
and number of currently running reduces are taken into account by the
JobTracker before killing the mappers, while the progress made by the
reducer and the number of fetch-failures vis-a-vis total number of
fetch-attempts are taken into account before the reducer kills itself.
(Amar Kamat via acmurthy)

HADOOP-2452. Fix eclipse plug-in build.xml to refer to the right
location where hadoop-*-core.jar is generated. (taton)

HADOOP-2492. Additional debugging in the rpc server to better
diagnose ConcurrentModificationException. (dhruba)

HADOOP-2344. Enhance the utility for executing shell commands to read the
stdout/stderr streams while waiting for the command to finish (to free up
the buffers). Also, this patch throws away stderr of the DF utility.
@deprecated
  org.apache.hadoop.fs.ShellCommand for org.apache.hadoop.util.Shell
  org.apache.hadoop.util.ShellUtil for
  org.apache.hadoop.util.Shell.ShellCommandExecutor
(Amar Kamat via acmurthy)

HADOOP-2511. Fix a javadoc warning in org.apache.hadoop.util.Shell
introduced by HADOOP-2344. (acmurthy)

HADOOP-2442. Fix TestLocalFileSystemPermission.testLocalFSsetOwner
to work on more platforms. (Raghu Angadi via nigel)

HADOOP-2488. Fix a regression in random read performance.
(Michael Stack via rangadi)

HADOOP-2523. Fix TestDFSShell.testFilePermissions on Windows.
(Raghu Angadi via nigel)

HADOOP-2535. Removed support for deprecated mapred.child.heap.size and
fixed some indentation issues in TaskRunner. (acmurthy)
Configuration changes to hadoop-default.xml:
  remove mapred.child.heap.size

HADOOP-2512. Fix error stream handling in Shell. Use exit code to
detect shell command errors in RawLocalFileSystem. (Raghu Angadi)

HADOOP-2446. Fixes TestHDFSServerPorts and TestMRServerPorts so they
do not rely on statically configured ports and clean up better. (nigel)

HADOOP-2537. Make build process compatible with Ant 1.7.0.
(Hrishikesh via nigel)

HADOOP-1281. Ensure running tasks of completed map TIPs (e.g. speculative
tasks) are killed as soon as the TIP completed. (acmurthy)

HADOOP-2571. Suppress a spurious warning in test code. (cdouglas)

HADOOP-2481. NNBench reports its progress periodically.
(Hairong Kuang via dhruba)

HADOOP-2601. Start name-node on a free port for TestNNThroughputBenchmark.
(Konstantin Shvachko)

HADOOP-2494. Set +x on contrib/*/bin/* in packaged tar bundle.
(stack via tomwhite)

HADOOP-2605. Remove bogus leading slash in task-tracker report bindAddress.
(Konstantin Shvachko)

HADOOP-2620. Trivial. 'bin/hadoop fs -help' did not list chmod, chown, and
chgrp. (Raghu Angadi)

HADOOP-2614. The DFS WebUI accesses are configured to be from the user
specified by dfs.web.ugi. (Tsz Wo (Nicholas), SZE via dhruba)

HADOOP-2543. Implement a "no-permission-checking" mode for smooth
upgrade from a pre-0.16 install of HDFS.
(Hairong Kuang via dhruba)

HADOOP-290. A DataNode log message now prints the target of a replication
request correctly. (dhruba)

HADOOP-2538. Redirect to a warning, if plaintext parameter is true but
the filter parameter is not given in TaskLogServlet.
(Michael Bieniosek via enis)

HADOOP-2582. Prevent 'bin/hadoop fs -copyToLocal' from creating
zero-length files when the src does not exist.
(Lohit Vijayarenu via cdouglas)

HADOOP-2189. Incrementing user counters should count as progress. (ddas)

HADOOP-2649. The NameNode periodically computes replication work for
the datanodes. The periodicity of this computation is now configurable.
(dhruba)

HADOOP-2549. Correct disk size computation so that data-nodes could switch
to other local drives if current is full. (Hairong Kuang via shv)

HADOOP-2633. Fsck should call name-node methods directly rather than
through rpc. (Tsz Wo (Nicholas), SZE via shv)

HADOOP-2687. Modify a few log messages generated by dfs client to be
logged only at INFO level. (stack via dhruba)

HADOOP-2402. Fix BlockCompressorStream to ensure it buffers data before
sending it down to the compressor so that each write call doesn't
compress. (Chris Douglas via acmurthy)

HADOOP-2645. The Metrics initialization code does not throw
exceptions when servers are restarted by MiniDFSCluster.
(Sanjay Radia via dhruba)

HADOOP-2691. Fix a race condition that was causing the DFSClient
to erroneously remove a good datanode from a pipeline that actually
had another datanode that was bad. (dhruba)

HADOOP-1195. All code in FSNamesystem checks the return value
of getDataNode for null before using it. (dhruba)

HADOOP-2640. Fix a bug in MultiFileSplitInputFormat that was always
returning 1 split in some circumstances. (Enis Soztutar via nigel)

HADOOP-2626. Fix paths with special characters to work correctly
with the local filesystem. (Thomas Friol via cutting)

HADOOP-2646. Fix SortValidator to work with fully-qualified
working directories. (Arun C Murthy via nigel)

HADOOP-2092. Added a ping mechanism to the pipes' task to periodically
check if the parent Java task is running, and exit if the parent isn't
alive and responding. (Amareshwari Sri Ramadasu via acmurthy)

HADOOP-2714. TestDecommission failed on Windows because the replication
request was timing out. (dhruba)

HADOOP-2576. Namenode performance degradation over time triggered by
large heartbeat interval. (Raghu Angadi)

HADOOP-2713. TestDatanodeDeath failed on Windows because the replication
request was timing out. (dhruba)

HADOOP-2639. Fixes a problem to do with incorrect maintenance of values
for runningMapTasks/runningReduceTasks. (Amar Kamat and Arun Murthy
via ddas)

HADOOP-2723. Fixed the check for whether to do user task
profiling. (Amareshwari Sri Ramadasu via omalley)

HADOOP-2734. Link forrest docs to new http://hadoop.apache.org
(Doug Cutting via nigel)

HADOOP-2641. Added Apache license headers to 95 files. (nigel)

HADOOP-2732. Fix bug in path globbing. (Hairong Kuang via nigel)

HADOOP-2404. Fix backwards compatibility with hadoop-0.15 configuration
files that was broken by HADOOP-2185. (omalley)

HADOOP-2755. Fix fsck performance degradation because of permissions
issue. (Tsz Wo (Nicholas), SZE via dhruba)

HADOOP-2768. Fix performance regression caused by HADOOP-1707.
(dhruba borthakur via nigel)

HADOOP-3108. Fix NPE in setPermission and setOwner. (shv)

Release 0.15.3 - 2008-01-18

BUG FIXES

HADOOP-2562. globPaths supports {ab,cd}. (Hairong Kuang via dhruba)

HADOOP-2540. fsck reports missing blocks incorrectly. (dhruba)

HADOOP-2570. "work" directory created unconditionally, and symlinks
created from the task cwds.

HADOOP-2574. Fixed mapred_tutorial.xml to correct minor errors with the
WordCount examples. (acmurthy)

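Note on HADOOP-2562: the {ab,cd} syntax lets one glob pattern stand for
several alternatives. The sketch below is only an illustration of that
expansion, not the Hadoop implementation; expand_braces is a hypothetical
helper name, and nested braces are deliberately not handled.

```python
def expand_braces(pattern):
    """Expand {a,b,...} groups left to right and return the resulting patterns.

    Illustrative only: nested braces are not supported, and a pattern with an
    unclosed "{" is returned unchanged.
    """
    start = pattern.find("{")
    if start == -1:
        return [pattern]
    end = pattern.find("}", start)
    if end == -1:
        return [pattern]  # unbalanced brace: treat the pattern literally
    prefix, suffix = pattern[:start], pattern[end + 1:]
    results = []
    for alternative in pattern[start + 1:end].split(","):
        # Recurse so that later groups in the suffix are expanded too.
        results.extend(expand_braces(prefix + alternative + suffix))
    return results


if __name__ == "__main__":
    for path in expand_braces("/logs/{ab,cd}/part-*"):
        print(path)
```

Each expanded pattern can then be matched on its own, which is why a single
glob with braces may return paths from several directories.
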
Release 0.15.2 - 2008-01-02

BUG FIXES

HADOOP-2246. Moved the changelog for HADOOP-1851 from the NEW FEATURES
section to the INCOMPATIBLE CHANGES section. (acmurthy)

HADOOP-2238. Fix TaskGraphServlet so that it sets the content type of
the response appropriately. (Paul Saab via enis)

HADOOP-2129. Fix so that distcp works correctly when source is
HDFS but not the default filesystem. HDFS paths returned by the
listStatus() method are now fully-qualified. (cutting)

HADOOP-2378. Fixes a problem where the last task completion event would
get created after the job completes. (Alejandro Abdelnur via ddas)

HADOOP-2228. Checks whether a job with a certain jobId is already running
and then tries to create the JobInProgress object.
(Johan Oskarsson via ddas)

HADOOP-2422. dfs -cat multiple files fail with 'Unable to write to
output stream'. (Raghu Angadi via dhruba)

HADOOP-2460. When the namenode encounters ioerrors on writing a
transaction log, it stops writing new transactions to that one.
(Raghu Angadi via dhruba)

HADOOP-2227. Use the LocalDirAllocator uniformly for handling all of the
temporary storage required for a given task. It also implies that
mapred.local.dir.minspacestart is handled by checking if there is enough
free-space on any one of the available disks. (Amareshwari Sri Ramadasu
via acmurthy)

HADOOP-2437. Fix the LocalDirAllocator to choose the seed for the
round-robin disk selections randomly. This helps in spreading data across
multiple partitions much better. (acmurthy)

HADOOP-2486. When the list of files from the InMemoryFileSystem is obtained
for merging, this patch will ensure that only those files whose checksums
have also got created (renamed) are returned. (ddas)

HADOOP-2456. Hardcode English locale to prevent NumberFormatException
from occurring when starting the NameNode with certain locales.
(Matthias Friedrich via nigel)

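Note on HADOOP-2437: the idea of a round-robin selection with a randomly
chosen starting point can be sketched as follows. This is a minimal
illustration under stated assumptions, not the LocalDirAllocator code: the
class name RoundRobinDirs and its methods are hypothetical, and the free-space
checks mentioned in HADOOP-2227 are left out.

```python
import random


class RoundRobinDirs:
    """Cycle through directories, starting at a randomly chosen index."""

    def __init__(self, dirs, rng=None):
        if not dirs:
            raise ValueError("at least one directory is required")
        self._dirs = list(dirs)
        rng = rng or random.Random()
        # Random seed position: independent allocators do not all begin with
        # the first directory, so data spreads across the disks.
        self._next = rng.randrange(len(self._dirs))

    def pick(self):
        """Return the next directory and advance the round-robin cursor."""
        chosen = self._dirs[self._next]
        self._next = (self._next + 1) % len(self._dirs)
        return chosen


if __name__ == "__main__":
    allocator = RoundRobinDirs(["/disk0", "/disk1", "/disk2"])
    print([allocator.pick() for _ in range(6)])
```

Without the random start, every task would write its first file to the same
disk, which is the skew the entry describes.
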
IMPROVEMENTS

HADOOP-2160. Remove project-level, non-user documentation from
releases, since it's now maintained in a separate tree. (cutting)

HADOOP-1327. Add user documentation for streaming. (cutting)

HADOOP-2382. Add hadoop-default.html to subversion. (cutting)

HADOOP-2158. hdfsListDirectory calls FileSystem.listStatus instead
of FileSystem.listPaths. This reduces the number of RPC calls on the
namenode, thereby improving scalability. (Christian Kunz via dhruba)

Release 0.15.1 - 2007-11-27

INCOMPATIBLE CHANGES

HADOOP-713. Reduce CPU usage on namenode while listing directories.
FileSystem.listPaths does not return the size of the entire subtree.
Introduced a new API ClientProtocol.getContentLength that returns the
size of the subtree. (Dhruba Borthakur via dhruba)

IMPROVEMENTS

HADOOP-1917. Addition of guides/tutorial for better overall
documentation for Hadoop. Specifically:
* quickstart.html is targeted towards first-time users and helps them
  set up a single-node cluster and play with Hadoop.
* cluster_setup.html helps admins to configure and set up non-trivial
  hadoop clusters.
* mapred_tutorial.html is a comprehensive Map-Reduce tutorial.
(acmurthy)

BUG FIXES

HADOOP-2174. Removed the unnecessary Reporter.setStatus call from
FSCopyFilesMapper.close which led to a NPE since the reporter isn't valid
in the close method. (Chris Douglas via acmurthy)

HADOOP-2172. Restore performance of random access to local files
by caching positions of local input streams, avoiding a system
call. (cutting)

HADOOP-2205. Regenerate the Hadoop website since some of the changes made
by HADOOP-1917 weren't correctly copied over to the trunk/docs directory.
Also fixed a couple of minor typos and broken links. (acmurthy)

Release 0.15.0 - 2007-11-02

INCOMPATIBLE CHANGES

HADOOP-1708. Make files appear in namespace as soon as they are
created. (Dhruba Borthakur via dhruba)

HADOOP-999. An HDFS Client immediately informs the NameNode of a new
file creation. ClientProtocol version changed from 14 to 15.
(Tsz Wo (Nicholas), SZE via dhruba)

HADOOP-932. File locking interfaces and implementations (that were
earlier deprecated) are removed. Client Protocol version changed
from 15 to 16. (Raghu Angadi via dhruba)

HADOOP-1621. FileStatus is now a concrete class and FileSystem.listPaths
is deprecated and replaced with listStatus. (Chris Douglas via omalley)

HADOOP-1656. The blockSize of a file is stored persistently in the file
inode. (Dhruba Borthakur via dhruba)

HADOOP-1838. The blocksize of files created with an earlier release is
set to the default block size. (Dhruba Borthakur via dhruba)

HADOOP-785. Add support for 'final' Configuration parameters,
removing support for 'mapred-default.xml', and changing
'hadoop-site.xml' to not override other files. Now folks should
generally use 'hadoop-site.xml' for all configurations. Values
with a 'final' tag may not be overridden by subsequently loaded
configuration files, e.g., by jobs. (Arun C. Murthy via cutting)

HADOOP-1846. DatanodeReport in ClientProtocol can report live
datanodes, dead datanodes or all datanodes. Client Protocol version
changed from 17 to 18. (Hairong Kuang via dhruba)

HADOOP-1851. Permit specification of map output compression type
and codec, independent of the final output's compression
parameters. (Arun C Murthy via cutting)

HADOOP-1819. Jobtracker cleanups, including binding ports before
clearing state directories, so that inadvertently starting a
second jobtracker doesn't trash one that's already running. Removed
method JobTracker.getTracker() because the static variable, which
stored the value caused initialization problems.
(omalley via cutting)

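Note on HADOOP-785: the 'final' rule says that a value marked final in an
earlier-loaded resource cannot be overridden by a resource loaded later. The
sketch below models only that rule and is not Hadoop's Configuration class;
load_resources, the tuple layout and the property name example.local.dir are
all hypothetical.

```python
def load_resources(resources):
    """Merge configuration resources in load order, honouring 'final' values.

    `resources` is a list of dicts mapping a property name to a
    (value, is_final) tuple. Later resources override earlier ones unless an
    earlier resource marked the property as final.
    """
    values = {}
    finals = set()
    for resource in resources:
        for name, (value, is_final) in resource.items():
            if name in finals:
                continue  # locked by an earlier resource; ignore the override
            values[name] = value
            if is_final:
                finals.add(name)
    return values


if __name__ == "__main__":
    site = {"example.local.dir": ("/data/local", True),
            "io.sort.mb": ("100", False)}
    job = {"example.local.dir": ("/tmp/elsewhere", False),
           "io.sort.mb": ("200", False)}
    print(load_resources([site, job]))
```

In this model the job may still change io.sort.mb, while the site-wide final
value for example.local.dir is kept.
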
NEW FEATURES

HADOOP-89. A client can access file data even before the creator
has closed the file. Introduce a new command "tail" from dfs shell.
(Dhruba Borthakur via dhruba)

HADOOP-1636. Allow configuration of the number of jobs kept in
memory by the JobTracker. (Michael Bieniosek via omalley)

HADOOP-1667. Reorganize CHANGES.txt into sections to make it
easier to read. Also remove numbering, to make merging easier.
(cutting)

HADOOP-1610. Add metrics for failed tasks.
(Devaraj Das via tomwhite)

HADOOP-1767. Add "bin/hadoop job -list" sub-command. (taton via cutting)

HADOOP-1351. Add "bin/hadoop job [-fail-task|-kill-task]" sub-commands
to terminate a particular task-attempt. (Enis Soztutar via acmurthy)

HADOOP-1880. SleepJob : An example job that sleeps at each map and
reduce task. (enis)

HADOOP-1809. Add a link in web site to #hadoop IRC channel. (enis)

HADOOP-1894. Add percentage graphs and mapred task completion graphs
to Web User Interface. Users not using Firefox may install a plugin to
their browsers to see svg graphics. (enis)

HADOOP-1914. Introduce a new NamenodeProtocol to allow secondary
namenodes and rebalancing processes to communicate with a primary
namenode. (Hairong Kuang via dhruba)

HADOOP-1963. Add a FileSystem implementation for the Kosmos
Filesystem (KFS). (Sriram Rao via cutting)

HADOOP-1822. Allow the specialization and configuration of socket
factories. Provide a StandardSocketFactory, and a SocksSocketFactory to
allow the use of SOCKS proxies. (taton)

HADOOP-1968. FileSystem supports wildcard input syntax "{ }".
(Hairong Kuang via dhruba)

HADOOP-2566. Add globStatus method to the FileSystem interface
and deprecate globPath and listPath. (Hairong Kuang via hairong)

OPTIMIZATIONS

HADOOP-1910. Reduce the number of RPCs that DistributedFileSystem.create()
makes to the namenode. (Raghu Angadi via dhruba)

HADOOP-1565. Reduce memory usage of NameNode by replacing
TreeMap in HDFS Namespace with ArrayList.
(Dhruba Borthakur via dhruba)

HADOOP-1743. Change DFS INode from a nested class to standalone
class, with specialized subclasses for directories and files, to
save memory on the namenode. (Konstantin Shvachko via cutting)

HADOOP-1759. Change file name in INode from String to byte[],
saving memory on the namenode. (Konstantin Shvachko via cutting)

HADOOP-1766. Save memory in namenode by having BlockInfo extend
Block, and replace many uses of Block with BlockInfo.
(Konstantin Shvachko via cutting)

HADOOP-1687. Save memory in namenode by optimizing BlockMap
representation. (Konstantin Shvachko via cutting)

HADOOP-1774. Remove use of INode.parent in Block CRC upgrade.
(Raghu Angadi via dhruba)

HADOOP-1788. Increase the buffer size on the Pipes command socket.
(Amareshwari Sri Ramadasu and Christian Kunz via omalley)

BUG FIXES

HADOOP-1946. The Datanode code does not need to invoke du on
every heartbeat. (Hairong Kuang via dhruba)

HADOOP-1935. Fix a NullPointerException in internalReleaseCreate.
(Dhruba Borthakur)

HADOOP-1933. The nodes listed in include and exclude files
are always listed in the datanode report.
(Raghu Angadi via dhruba)

HADOOP-1953. The job tracker should wait between calls to try and delete
the system directory. (Owen O'Malley via devaraj)

HADOOP-1932. TestFileCreation fails with message saying filestatus.dat
is of incorrect size. (Dhruba Borthakur via dhruba)

HADOOP-1573. Support for 0 reducers in PIPES.
(Owen O'Malley via devaraj)

HADOOP-1500. Fix typographical errors in the DFS WebUI.
(Nigel Daley via dhruba)

HADOOP-1076. Periodic checkpoint can continue even if an earlier
checkpoint encountered an error. (Dhruba Borthakur via dhruba)

HADOOP-1887. The Namenode encounters an ArrayIndexOutOfBoundsException
while listing a directory that had a file that was
being actively written to. (Dhruba Borthakur via dhruba)

HADOOP-1904. The Namenode encounters an exception because the
list of blocks per datanode-descriptor was corrupted.
(Konstantin Shvachko via dhruba)

HADOOP-1762. The Namenode fsimage does not contain a list of
Datanodes. (Raghu Angadi via dhruba)

HADOOP-1890. Removed debugging prints introduced by HADOOP-1774.
(Raghu Angadi via dhruba)

HADOOP-1763. Too many lost task trackers on large clusters due to
insufficient number of RPC handler threads on the JobTracker.
(Devaraj Das)

HADOOP-1463. HDFS reports correct usage statistics for disk space
used by HDFS. (Hairong Kuang via dhruba)

HADOOP-1692. In DFS ant task, don't cache the Configuration.
(Chris Douglas via cutting)

HADOOP-1726. Remove lib/jetty-ext/ant.jar. (omalley)

HADOOP-1772. Fix hadoop-daemon.sh script to get correct hostname
under Cygwin. (Tsz Wo (Nicholas), SZE via cutting)

HADOOP-1749. Change TestDFSUpgrade to sort files, fixing sporadic
test failures. (Enis Soztutar via cutting)

HADOOP-1748. Fix tasktracker to be able to launch tasks when log
directory is relative. (omalley via cutting)

HADOOP-1775. Fix a NullPointerException and an
IllegalArgumentException in MapWritable.
(Jim Kellerman via cutting)

HADOOP-1795. Fix so that jobs can generate output file names with
special characters. (Frédéric Bertin via cutting)

HADOOP-1810. Fix incorrect value type in MRBench (SmallJobs).
(Devaraj Das via tomwhite)

HADOOP-1806. Fix ant task to compile again, also fix default
builds to compile ant tasks. (Chris Douglas via cutting)

HADOOP-1758. Fix escape processing in librecordio to not be
quadratic. (Vivek Ratan via cutting)

HADOOP-1817. Fix MultiFileSplit to read and write the split
length, so that it is not always zero in map tasks.
(Thomas Friol via cutting)

HADOOP-1853. Fix contrib/streaming to accept multiple -cacheFile
options. (Prachi Gupta via cutting)

HADOOP-1818. Fix MultiFileInputFormat so that it does not return
empty splits when numPaths < numSplits. (Thomas Friol via enis)

HADOOP-1840. Fix race condition which leads to task's diagnostic
messages getting lost. (acmurthy)

HADOOP-1885. Fix race condition in MiniDFSCluster shutdown.
(Chris Douglas via nigel)

HADOOP-1889. Fix path in EC2 scripts for building your own AMI.
(tomwhite)

HADOOP-1892. Fix a NullPointerException in the JobTracker when
|
||
trying to fetch a task's diagnostic messages from the JobClient.
|
||
(Amar Kamat via acmurthy)
|
||
|
||
HADOOP-1897. Completely remove about.html page from the web site.
|
||
(enis)
|
||
|
||
HADOOP-1907. Fix null pointer exception when getting task diagnostics
|
||
in JobClient. (Christian Kunz via omalley)
|
||
|
||
HADOOP-1882. Remove spurious asterisks from decimal number displays.
|
||
(Raghu Angadi via cutting)
|
||
|
||
HADOOP-1783. Make S3 FileSystem return Paths fully-qualified with
|
||
scheme and host. (tomwhite)
|
||
|
||
HADOOP-1925. Make pipes' autoconf script look for libsocket and libnsl, so
|
||
that it can compile under Solaris. (omalley)
|
||
|
||
HADOOP-1940. TestDFSUpgradeFromImage must shut down its MiniDFSCluster.
|
||
(Chris Douglas via nigel)
|
||
|
||
HADOOP-1930. Fix the blame for failed fetchs on the right host. (Arun C.
|
||
Murthy via omalley)
|
||
|
||
HADOOP-1934. Fix the platform name on Mac to use underscores rather than
|
||
spaces. (omalley)
|
||
|
||
HADOOP-1959. Use "/" instead of File.separator in the StatusHttpServer.
|
||
(jimk via omalley)
|
||
|
||
HADOOP-1626. Improve dfsadmin help messages.
|
||
(Lohit Vijayarenu via dhruba)
|
||
|
||
HADOOP-1695. The SecondaryNamenode waits for the Primary NameNode to
|
||
start up. (Dhruba Borthakur)
|
||
|
||
HADOOP-1983. Have Pipes flush the command socket when progress is sent
|
||
to prevent timeouts during long computations. (omalley)
|
||
|
||
HADOOP-1875. Non-existant directories or read-only directories are
|
||
filtered from dfs.client.buffer.dir. (Hairong Kuang via dhruba)
|
||
|
||
HADOOP-1992. Fix the performance degradation in the sort validator.
|
||
(acmurthy via omalley)
|
||
|
||
HADOOP-1874. Move task-outputs' promotion/discard to a separate thread
|
||
distinct from the main heartbeat-processing thread. The main upside being
|
||
that we do not lock-up the JobTracker during HDFS operations, which
|
||
otherwise may lead to lost tasktrackers if the NameNode is unresponsive.
|
||
(Devaraj Das via acmurthy)
|
||
|
||
HADOOP-2026. Namenode prints out one log line for "Number of transactions"
|
||
at most once every minute. (Dhruba Borthakur)
|
||
|
||
HADOOP-2022. Ensure that status information for successful tasks is correctly
|
||
recorded at the JobTracker, so that, for example, one may view correct
|
||
information via taskdetails.jsp. This bug was introduced by HADOOP-1874.
|
||
(Amar Kamat via acmurthy)
|
||
|
||
HADOOP-2031. Correctly maintain the taskid which takes the TIP to
|
||
completion, failing which the case of lost tasktrackers isn't handled
|
||
properly, i.e. the map TIP is incorrectly left marked as 'complete' and it
|
||
is never rescheduled elsewhere, leading to hung reduces.
|
||
(Devaraj Das via acmurthy)
|
||
|
||
HADOOP-2018. The source datanode of a data transfer waits for
|
||
a response from the target datanode before closing the data stream.
|
||
(Hairong Kuang via dhruba)
|
||
|
||
HADOOP-2023. Disable TestLocalDirAllocator on Windows.
|
||
(Hairong Kuang via nigel)
|
||
|
||
HADOOP-2016. Ignore status-updates from FAILED/KILLED tasks at the
|
||
TaskTracker. This fixes a race-condition which caused the tasks to wrongly
|
||
remain in the RUNNING state even after being killed by the JobTracker and
|
||
thus handicap the cleanup of the task's output sub-directory. (acmurthy)
|
||
|
||
HADOOP-1771. Fix a NullPointerException in streaming caused by an
|
||
IOException in MROutputThread. (lohit vijayarenu via nigel)
|
||
|
||
HADOOP-2028. Fix distcp so that the log dir does not need to be
|
||
specified and the destination does not need to exist.
|
||
(Chris Douglas via nigel)
|
||
|
||
HADOOP-2044. The namenode protects all lease manipulations using a
|
||
sortedLease lock. (Dhruba Borthakur)
|
||
|
||
HADOOP-2051. The TaskCommit thread should not die for exceptions other
|
||
than the InterruptedException. This behavior is there for the other long
|
||
running threads in the JobTracker. (Arun C Murthy via ddas)
|
||
|
||
HADOOP-1973. The FileSystem object would be accessed on the JobTracker
|
||
through an RPC in the InterTrackerProtocol. The check for the object being
|
||
null was missing and hence NPE would be thrown sometimes. This issue fixes
|
||
that problem. (Amareshwari Sri Ramadasu via ddas)
|
||
|
||
HADOOP-2033. The SequenceFile.Writer.sync method was a no-op, which caused
|
||
very uneven splits for applications like distcp that count on them.
|
||
(omalley)
|
||
|
||
HADOOP-2070. Added a flush method to pipes' DownwardProtocol and call
|
||
that before waiting for the application to finish to ensure all buffered
|
||
data is flushed. (Owen O'Malley via acmurthy)
|
||
|
||
HADOOP-2080. Fixed calculation of the checksum file size when the values
|
||
are large. (omalley)
|
||
|
||
HADOOP-2048. Change error handling in distcp so that each map copies
|
||
as much as possible before reporting the error. Also report progress on
|
||
every copy. (Chris Douglas via omalley)
|
||
|
||
HADOOP-2073. Change size of VERSION file after writing contents to it.
|
||
(Konstantin Shvachko via dhruba)
|
||
|
||
HADOOP-2102. Fix the deprecated ToolBase to pass its Configuration object
|
||
to the superseding ToolRunner to ensure it picks up the appropriate
|
||
configuration resources. (Dennis Kubes and Enis Soztutar via acmurthy)
|
||
|
||
HADOOP-2103. Fix minor javadoc bugs introduced by HADOOP-2046. (Nigel
|
||
Daley via acmurthy)
|
||
|
||
IMPROVEMENTS
|
||
|
||
HADOOP-1908. Restructure data node code so that block sending and
|
||
receiving are separated from data transfer header handling.
|
||
(Hairong Kuang via dhruba)
|
||
|
||
HADOOP-1921. Save the configuration of completed/failed jobs and make them
|
||
available via the web-ui. (Amar Kamat via devaraj)
|
||
|
||
HADOOP-1266. Remove dependency of package org.apache.hadoop.net on
|
||
org.apache.hadoop.dfs. (Hairong Kuang via dhruba)
|
||
|
||
HADOOP-1779. Replace INodeDirectory.getINode() by a getExistingPathINodes()
|
||
to allow the retrieval of all existing INodes along a given path in a
|
||
single lookup. This facilitates removal of the 'parent' field in the
|
||
inode. (Christophe Taton via dhruba)
|
||
|
||
HADOOP-1756. Add toString() to some Writable-s. (ab)
|
||
|
||
HADOOP-1727. New classes: MapWritable and SortedMapWritable.
|
||
(Jim Kellerman via ab)
|
||
|
||
HADOOP-1651. Improve progress reporting.
|
||
(Devaraj Das via tomwhite)
|
||
|
||
HADOOP-1595. dfsshell can wait for a file to achieve its intended
|
||
replication target. (Tsz Wo (Nicholas), SZE via dhruba)
|
||
|
||
HADOOP-1693. Remove un-needed log fields in DFS replication classes,
|
||
since the log may be accessed statically. (Konstantin Shvachko via cutting)
|
||
|
||
HADOOP-1231. Add generics to Mapper and Reducer interfaces.
|
||
(tomwhite via cutting)
|
||
|
||
HADOOP-1436. Improved command-line APIs, so that all tools need
|
||
not subclass ToolBase, and generic parameter parser is public.
|
||
(Enis Soztutar via cutting)
|
||
|
||
HADOOP-1703. DFS-internal code cleanups, removing several uses of
|
||
the obsolete UTF8. (Christophe Taton via cutting)
|
||
|
||
HADOOP-1731. Add Hadoop's version to contrib jar file names.
|
||
(cutting)
|
||
|
||
HADOOP-1689. Make shell scripts more portable. All shell scripts
|
||
now explicitly depend on bash, but do not require that bash be
|
||
installed in a particular location, as long as it is on $PATH.
|
||
(cutting)
|
||
|
||
HADOOP-1744. Remove many uses of the deprecated UTF8 class from
|
||
the HDFS namenode. (Christophe Taton via cutting)
|
||
|
||
HADOOP-1654. Add IOUtils class, containing generic io-related
|
||
utility methods. (Enis Soztutar via cutting)
|
||
|
||
HADOOP-1158. Change JobTracker to record map-output transmission
|
||
errors and use them to trigger speculative re-execution of tasks.
|
||
(Arun C Murthy via cutting)
|
||
|
||
HADOOP-1601. Change GenericWritable to use ReflectionUtils for
|
||
instance creation, avoiding classloader issues, and to implement
|
||
Configurable. (Enis Soztutar via cutting)
|
||
|
||
HADOOP-1750. Log standard output and standard error when forking
|
||
task processes. (omalley via cutting)
|
||
|
||
HADOOP-1803. Generalize build.xml to make files in all
|
||
src/contrib/*/bin directories executable. (stack via cutting)
|
||
|
||
HADOOP-1739. Let OS always choose the tasktracker's umbilical
|
||
port. Also switch default address for umbilical connections to
|
||
loopback. (cutting)
|
||
|
||
HADOOP-1812. Let OS choose ports for IPC and RPC unit tests. (cutting)
|
||
|
||
HADOOP-1825. Create $HADOOP_PID_DIR when it does not exist.
|
||
(Michael Bieniosek via cutting)
|
||
|
||
HADOOP-1425. Replace uses of ToolBase with the Tool interface.
|
||
(Enis Soztutar via cutting)
|
||
|
||
HADOOP-1569. Reimplement DistCP to use the standard FileSystem/URI
|
||
code in Hadoop so that you can copy from and to all of the supported file
|
||
systems. (Chris Douglas via omalley)
|
||
|
||
HADOOP-1018. Improve documentation w.r.t. handling of lost heartbeats between
|
||
TaskTrackers and JobTracker. (acmurthy)
|
||
|
||
HADOOP-1718. Add ant targets for measuring code coverage with clover.
|
||
(simonwillnauer via nigel)
|
||
|
||
HADOOP-1592. Log error messages to the client console when tasks
|
||
fail. (Amar Kamat via cutting)
|
||
|
||
HADOOP-1879. Remove some unneeded casts. (Nilay Vaish via cutting)
|
||
|
||
HADOOP-1878. Add space between priority links on job details
|
||
page. (Thomas Friol via cutting)
|
||
|
||
HADOOP-120. In ArrayWritable, prevent creation with null value
|
||
class, and improve documentation. (Cameron Pope via cutting)
|
||
|
||
HADOOP-1926. Add a random text writer example/benchmark so that we can
|
||
benchmark compression codecs on random data. (acmurthy via omalley)
|
||
|
||
HADOOP-1906. Warn the user if they have an obsolete mapred-default.xml
|
||
file in their configuration directory. (acmurthy via omalley)
|
||
|
||
HADOOP-1971. Warn when job does not specify a jar. (enis via cutting)
|
||
|
||
HADOOP-1942. Increase the concurrency of transaction logging to
|
||
edits log. Reduce the number of syncs by double-buffering the changes
|
||
to the transaction log. (Dhruba Borthakur)
|
||
|
||
HADOOP-2046. Improve mapred javadoc. (Arun C. Murthy via cutting)
|
||
|
||
HADOOP-2105. Improve overview.html to clarify supported platforms,
|
||
software pre-requisites for hadoop, how to install them on various
|
||
platforms and a better general description of hadoop and its utility.
|
||
(Jim Kellerman via acmurthy)
|
||
|
||
|
||
Release 0.14.4 - 2007-11-26
|
||
|
||
BUG FIXES
|
||
|
||
HADOOP-2140. Add missing Apache Licensing text at the front of several
|
||
C and C++ files.
|
||
|
||
HADOOP-2169. Fix the DT_SONAME field of libhdfs.so to set it to the
|
||
correct value of 'libhdfs.so'; currently it is set to the absolute path of
|
||
libhdfs.so. (acmurthy)
|
||
|
||
HADOOP-2001. Make the job priority updates and job kills synchronized on
|
||
the JobTracker. Deadlock was seen in the JobTracker because of the lack of
|
||
this synchronization. (Arun C Murthy via ddas)
|
||
|
||
|
||
Release 0.14.3 - 2007-10-19
|
||
|
||
BUG FIXES
|
||
|
||
HADOOP-2053. Fixed a dangling reference to a memory buffer in the map
|
||
output sorter. (acmurthy via omalley)
|
||
|
||
HADOOP-2036. Fix a NullPointerException in JvmMetrics class. (nigel)
|
||
|
||
HADOOP-2043. Release 0.14.2 was compiled with Java 1.6 rather than
|
||
Java 1.5. (cutting)
|
||
|
||
|
||
Release 0.14.2 - 2007-10-09
|
||
|
||
BUG FIXES
|
||
|
||
HADOOP-1948. Removed spurious error message during block crc upgrade.
|
||
(Raghu Angadi via dhruba)
|
||
|
||
HADOOP-1862. reduces are getting stuck trying to find map outputs.
|
||
(Arun C. Murthy via ddas)
|
||
|
||
HADOOP-1977. Fixed handling of ToolBase cli options in JobClient.
|
||
(enis via omalley)
|
||
|
||
HADOOP-1972. Fix LzoCompressor to ensure the user has actually asked
|
||
to finish compression. (arun via omalley)
|
||
|
||
HADOOP-1970. Fix deadlock in progress reporting in the task. (Vivek
|
||
Ratan via omalley)
|
||
|
||
HADOOP-1978. Name-node removes edits.new after a successful startup.
|
||
(Konstantin Shvachko via dhruba)
|
||
|
||
HADOOP-1955. The Namenode tries to not pick the same source Datanode for
|
||
a replication request if the earlier replication request for the same
|
||
block and that source Datanode had failed.
|
||
(Raghu Angadi via dhruba)
|
||
|
||
HADOOP-1961. The -get option to dfs-shell works when a single filename
|
||
is specified. (Raghu Angadi via dhruba)
|
||
|
||
HADOOP-1997. TestCheckpoint closes the edits file after writing to it,
|
||
otherwise the rename of this file on Windows fails.
|
||
(Konstantin Shvachko via dhruba)
|
||
|
||
Release 0.14.1 - 2007-09-04
|
||
|
||
BUG FIXES
|
||
|
||
HADOOP-1740. Fix null pointer exception in sorting map outputs. (Devaraj
|
||
Das via omalley)
|
||
|
||
HADOOP-1790. Fix tasktracker to work correctly on multi-homed
|
||
boxes. (Torsten Curdt via cutting)
|
||
|
||
HADOOP-1798. Fix jobtracker to correctly account for failed
|
||
tasks. (omalley via cutting)
|
||
|
||
|
||
Release 0.14.0 - 2007-08-17
|
||
|
||
INCOMPATIBLE CHANGES
|
||
|
||
1. HADOOP-1134.
|
||
CONFIG/API - dfs.block.size must now be a multiple of
|
||
io.bytes.per.checksum, otherwise new files cannot be written.
|
||
LAYOUT - DFS layout version changed from -6 to -7, which will require an
|
||
upgrade from previous versions.
|
||
PROTOCOL - Datanode RPC protocol version changed from 7 to 8.
|
||
|
||
2. HADOOP-1283
|
||
API - deprecated file locking API.
|
||
|
||
3. HADOOP-894
|
||
PROTOCOL - changed ClientProtocol to fetch parts of block locations.
|
||
|
||
4. HADOOP-1336
|
||
CONFIG - Enable speculative execution by default.
|
||
|
||
5. HADOOP-1197
|
||
API - deprecated method for Configuration.getObject, because
|
||
Configurations should only contain strings.
|
||
|
||
6. HADOOP-1343
|
||
API - deprecate Configuration.set(String,Object) so that only strings are
|
||
put in Configurations.
|
||
|
||
7. HADOOP-1207
|
||
CLI - Fix FsShell 'rm' command to continue when a non-existent file is
|
||
encountered.
|
||
|
||
8. HADOOP-1473
|
||
CLI/API - Job, TIP, and Task id formats have changed and are now unique
|
||
across job tracker restarts.
|
||
|
||
9. HADOOP-1400
|
||
API - JobClient constructor now takes a JobConf object instead of a
|
||
Configuration object.
|
||
|
||
NEW FEATURES and BUG FIXES
|
||
|
||
1. HADOOP-1197. In Configuration, deprecate getObject() and add
|
||
getRaw(), which skips variable expansion. (omalley via cutting)
|
||
|
||
2. HADOOP-1343. In Configuration, deprecate set(String,Object) and
|
||
implement Iterable. (omalley via cutting)
|
||
|
||
3. HADOOP-1344. Add RunningJob#getJobName(). (Michael Bieniosek via cutting)
|
||
|
||
4. HADOOP-1342. In aggregators, permit one to limit the number of
|
||
unique values per key. (Runping Qi via cutting)
|
||
|
||
5. HADOOP-1340. Set the replication factor of the MD5 file in the filecache
|
||
to be the same as the replication factor of the original file.
|
||
(Dhruba Borthakur via tomwhite)
|
||
|
||
6. HADOOP-1355. Fix null pointer dereference in
|
||
TaskLogAppender.append(LoggingEvent). (Arun C Murthy via tomwhite)
|
||
|
||
7. HADOOP-1357. Fix CopyFiles to correctly avoid removing "/".
|
||
(Arun C Murthy via cutting)
|
||
|
||
8. HADOOP-234. Add pipes facility, which permits writing MapReduce
|
||
programs in C++.
|
||
|
||
9. HADOOP-1359. Fix a potential NullPointerException in HDFS.
|
||
(Hairong Kuang via cutting)
|
||
|
||
10. HADOOP-1364. Fix inconsistent synchronization in SequenceFile.
|
||
(omalley via cutting)
|
||
|
||
11. HADOOP-1379. Add findbugs target to build.xml.
|
||
(Nigel Daley via cutting)
|
||
|
||
12. HADOOP-1364. Fix various inconsistent synchronization issues.
|
||
(Devaraj Das via cutting)
|
||
|
||
13. HADOOP-1393. Remove a potential unexpected negative number from
|
||
uses of random number generator. (omalley via cutting)
|
||
|
||
14. HADOOP-1387. A number of "performance" code-cleanups suggested
|
||
by findbugs. (Arun C Murthy via cutting)
|
||
|
||
15. HADOOP-1401. Add contrib/hbase javadoc to tree. (stack via cutting)
|
||
|
||
16. HADOOP-894. Change HDFS so that the client only retrieves a limited
|
||
number of block locations per request from the namenode.
|
||
(Konstantin Shvachko via cutting)
|
||
|
||
17. HADOOP-1406. Plug a leak in MapReduce's use of metrics.
|
||
(David Bowen via cutting)
|
||
|
||
18. HADOOP-1394. Implement "performance" code-cleanups in HDFS
|
||
suggested by findbugs. (Raghu Angadi via cutting)
|
||
|
||
19. HADOOP-1413. Add example program that uses Knuth's dancing links
|
||
algorithm to solve pentomino problems. (omalley via cutting)
|
||
|
||
20. HADOOP-1226. Change HDFS so that paths it returns are always
|
||
fully qualified. (Dhruba Borthakur via cutting)
|
||
|
||
21. HADOOP-800. Improvements to HDFS web-based file browser.
|
||
(Enis Soztutar via cutting)
|
||
|
||
22. HADOOP-1408. Fix a compiler warning by adding a class to replace
|
||
a generic. (omalley via cutting)
|
||
|
||
23. HADOOP-1376. Modify RandomWriter example so that it can generate
|
||
data for the Terasort benchmark. (Devaraj Das via cutting)
|
||
|
||
24. HADOOP-1429. Stop logging exceptions during normal IPC server
|
||
shutdown. (stack via cutting)
|
||
|
||
25. HADOOP-1461. Fix the synchronization of the task tracker to
|
||
avoid lockups in job cleanup. (Arun C Murthy via omalley)
|
||
|
||
26. HADOOP-1446. Update the TaskTracker metrics while the task is
|
||
running. (Devaraj via omalley)
|
||
|
||
27. HADOOP-1414. Fix a number of issues identified by FindBugs as
|
||
"Bad Practice". (Dhruba Borthakur via cutting)
|
||
|
||
28. HADOOP-1392. Fix "correctness" bugs identified by FindBugs in
|
||
fs and dfs packages. (Raghu Angadi via cutting)
|
||
|
||
29. HADOOP-1412. Fix "dodgy" bugs identified by FindBugs in fs and
|
||
io packages. (Hairong Kuang via cutting)
|
||
|
||
30. HADOOP-1261. Remove redundant events from HDFS namenode's edit
|
||
log when a datanode restarts. (Raghu Angadi via cutting)
|
||
|
||
31. HADOOP-1336. Re-enable speculative execution by
|
||
default. (omalley via cutting)
|
||
|
||
32. HADOOP-1311. Fix a bug in BytesWritable#set() where start offset
|
||
was ignored. (Dhruba Borthakur via cutting)
|
||
|
||
33. HADOOP-1450. Move checksumming closer to user code, so that
|
||
checksums are created before data is stored in large buffers and
|
||
verified after data is read from large buffers, to better catch
|
||
memory errors. (cutting)
|
||
|
||
34. HADOOP-1447. Add support in contrib/data_join for text inputs.
|
||
(Senthil Subramanian via cutting)
|
||
|
||
35. HADOOP-1456. Fix TestDecommission assertion failure by setting
|
||
the namenode to ignore the load on datanodes while allocating
|
||
replicas. (Dhruba Borthakur via tomwhite)
|
||
|
||
36. HADOOP-1396. Fix FileNotFoundException on DFS block.
|
||
(Dhruba Borthakur via tomwhite)
|
||
|
||
37. HADOOP-1467. Remove redundant counters from WordCount example.
|
||
(Owen O'Malley via tomwhite)
|
||
|
||
38. HADOOP-1139. Log HDFS block transitions at INFO level, to better
|
||
enable diagnosis of problems. (Dhruba Borthakur via cutting)
|
||
|
||
39. HADOOP-1269. Finer grained locking in HDFS namenode.
|
||
(Dhruba Borthakur via cutting)
|
||
|
||
40. HADOOP-1438. Improve HDFS documentation, correcting typos and
|
||
making images appear in PDF. Also update copyright date for all
|
||
docs. (Luke Nezda via cutting)
|
||
|
||
41. HADOOP-1457. Add counters for monitoring task assignments.
|
||
(Arun C Murthy via tomwhite)
|
||
|
||
42. HADOOP-1472. Fix so that timed-out tasks are counted as failures
|
||
rather than as killed. (Arun C Murthy via cutting)
|
||
|
||
43. HADOOP-1234. Fix a race condition in file cache that caused
|
||
tasktracker to not be able to find cached files.
|
||
(Arun C Murthy via cutting)
|
||
|
||
44. HADOOP-1482. Fix secondary namenode to roll info port.
|
||
(Dhruba Borthakur via cutting)
|
||
|
||
45. HADOOP-1300. Improve removal of excess block replicas to be
|
||
rack-aware. Attempts are now made to keep replicas on more
|
||
racks. (Hairong Kuang via cutting)
|
||
|
||
46. HADOOP-1417. Disable a few FindBugs checks that generate a lot
|
||
of spurious warnings. (Nigel Daley via cutting)
|
||
|
||
47. HADOOP-1320. Rewrite RandomWriter example to bypass reduce.
|
||
(Arun C Murthy via cutting)
|
||
|
||
48. HADOOP-1449. Add some examples to contrib/data_join.
|
||
(Senthil Subramanian via cutting)
|
||
|
||
49. HADOOP-1459. Fix so that, in HDFS, getFileCacheHints() returns
|
||
hostnames instead of IP addresses. (Dhruba Borthakur via cutting)
|
||
|
||
50. HADOOP-1493. Permit specification of "java.library.path" system
|
||
property in "mapred.child.java.opts" configuration property.
|
||
(Enis Soztutar via cutting)
|
||
|
||
51. HADOOP-1372. Use LocalDirAllocator for HDFS temporary block
|
||
files, so that disk space, writability, etc. is considered.
|
||
(Dhruba Borthakur via cutting)
|
||
|
||
52. HADOOP-1193. Pool allocation of compression codecs. This
|
||
eliminates a memory leak that could cause OutOfMemoryException,
|
||
and also substantially improves performance.
|
||
(Arun C Murthy via cutting)
|
||
|
||
53. HADOOP-1492. Fix a NullPointerException handling version
|
||
mismatch during datanode registration.
|
||
(Konstantin Shvachko via cutting)
|
||
|
||
54. HADOOP-1442. Fix handling of zero-length input splits.
|
||
(Senthil Subramanian via cutting)
|
||
|
||
55. HADOOP-1444. Fix HDFS block id generation to check pending
|
||
blocks for duplicates. (Dhruba Borthakur via cutting)
|
||
|
||
56. HADOOP-1207. Fix FsShell's 'rm' command to not stop when one of
|
||
the named files does not exist. (Tsz Wo Sze via cutting)
|
||
|
||
57. HADOOP-1475. Clear tasktracker's file cache before it
|
||
re-initializes, to avoid confusion. (omalley via cutting)
|
||
|
||
58. HADOOP-1505. Remove spurious stacktrace in ZlibFactory
|
||
introduced in HADOOP-1093. (Michael Stack via tomwhite)
|
||
|
||
59. HADOOP-1484. Permit one to kill jobs from the web ui. Note that
|
||
this is disabled by default. One must set
|
||
"webinterface.private.actions" to enable this.
|
||
(Enis Soztutar via cutting)
|
||
|
||
60. HADOOP-1003. Remove flushing of namenode edit log from primary
|
||
namenode lock, increasing namenode throughput.
|
||
(Dhruba Borthakur via cutting)
|
||
|
||
61. HADOOP-1023. Add links to searchable mail archives.
|
||
(tomwhite via cutting)
|
||
|
||
62. HADOOP-1504. Fix terminate-hadoop-cluster script in contrib/ec2
|
||
to only terminate Hadoop instances, and not other instances
|
||
started by the same user. (tomwhite via cutting)
|
||
|
||
63. HADOOP-1462. Improve task progress reporting. Progress reports
|
||
are no longer blocking since i/o is performed in a separate
|
||
thread. Reporting during sorting and more is also more
|
||
consistent. (Vivek Ratan via cutting)
|
||
|
||
64. [ intentionally blank ]
|
||
|
||
65. HADOOP-1453. Remove some unneeded calls to FileSystem#exists()
|
||
when opening files, reducing the namenode load somewhat.
|
||
(Raghu Angadi via cutting)
|
||
|
||
66. HADOOP-1489. Fix text input truncation bug due to mark/reset.
|
||
Add a unittest. (Bwolen Yang via cutting)
|
||
|
||
67. HADOOP-1455. Permit specification of arbitrary job options on
|
||
pipes command line. (Devaraj Das via cutting)
|
||
|
||
68. HADOOP-1501. Better randomize sending of block reports to
|
||
namenode, to reduce load spikes. (Dhruba Borthakur via cutting)
|
||
|
||
69. HADOOP-1147. Remove @author tags from Java source files.
|
||
|
||
70. HADOOP-1283. Convert most uses of UTF8 in the namenode to be
|
||
String. (Konstantin Shvachko via cutting)
|
||
|
||
71. HADOOP-1511. Speedup hbase unit tests. (stack via cutting)
|
||
|
||
72. HADOOP-1517. Remove some synchronization in namenode to permit
|
||
finer grained locking previously added. (Konstantin Shvachko via cutting)
|
||
|
||
73. HADOOP-1512. Fix failing TestTextInputFormat on Windows.
|
||
(Senthil Subramanian via nigel)
|
||
|
||
74. HADOOP-1518. Add a session id to job metrics, for use by HOD.
|
||
(David Bowen via cutting)
|
||
|
||
75. HADOOP-1292. Change 'bin/hadoop fs -get' to first copy files to
|
||
a temporary name, then rename them to their final name, so that
|
||
failures don't leave partial files. (Tsz Wo Sze via cutting)
|
||
|
||
76. HADOOP-1377. Add support for modification time to FileSystem and
|
||
implement in HDFS and local implementations. Also, alter access
|
||
to file properties to be through a new FileStatus interface.
|
||
(Dhruba Borthakur via cutting)
|
||
|
||
77. HADOOP-1515. Add MultiFileInputFormat, which can pack multiple,
|
||
typically small, input files into each split. (Enis Soztutar via cutting)
|
||
|
||
78. HADOOP-1514. Make reducers report progress while waiting for map
|
||
outputs, so they're not killed. (Vivek Ratan via cutting)
|
||
|
||
79. HADOOP-1508. Add an Ant task for FsShell operations. Also add
|
||
new FsShell commands "touchz", "test" and "stat".
|
||
(Chris Douglas via cutting)
|
||
|
||
80. HADOOP-1028. Add log messages for server startup and shutdown.
|
||
(Tsz Wo Sze via cutting)
|
||
|
||
81. HADOOP-1485. Add metrics for monitoring shuffle.
|
||
(Devaraj Das via cutting)
|
||
|
||
82. HADOOP-1536. Remove file locks from libhdfs tests.
|
||
(Dhruba Borthakur via nigel)
|
||
|
||
83. HADOOP-1520. Add appropriate synchronization to FSEditsLog.
|
||
(Dhruba Borthakur via nigel)
|
||
|
||
84. HADOOP-1513. Fix a race condition in directory creation.
|
||
(Devaraj via omalley)
|
||
|
||
85. HADOOP-1546. Remove spurious column from HDFS web UI.
|
||
(Dhruba Borthakur via cutting)
|
||
|
||
86. HADOOP-1556. Make LocalJobRunner delete working files at end of
|
||
job run. (Devaraj Das via tomwhite)
|
||
|
||
87. HADOOP-1571. Add contrib lib directories to root build.xml
|
||
javadoc classpath. (Michael Stack via tomwhite)
|
||
|
||
88. HADOOP-1554. Log killed tasks to the job history and display them on the
|
||
web/ui. (Devaraj Das via omalley)
|
||
|
||
89. HADOOP-1533. Add persistent error logging for distcp. The logs are stored
|
||
into a specified hdfs directory. (Senthil Subramanian via omalley)
|
||
|
||
90. HADOOP-1286. Add support to HDFS for distributed upgrades, which
|
||
permits coordinated upgrade of datanode data.
|
||
(Konstantin Shvachko via cutting)
|
||
|
||
91. HADOOP-1580. Improve contrib/streaming so that subprocess exit
|
||
status is displayed for errors. (John Heidemann via cutting)
|
||
|
||
92. HADOOP-1448. In HDFS, randomize lists of non-local block
|
||
locations returned to client, so that load is better balanced.
|
||
(Hairong Kuang via cutting)
|
||
|
||
93. HADOOP-1578. Fix datanode to send its storage id to namenode
|
||
during registration. (Konstantin Shvachko via cutting)
|
||
|
||
94. HADOOP-1584. Fix a bug in GenericWritable which limited it to
|
||
128 types instead of 256. (Espen Amble Kolstad via cutting)
|
||
|
||
95. HADOOP-1473. Make job ids unique across jobtracker restarts.
|
||
(omalley via cutting)
|
||
|
||
96. HADOOP-1582. Fix libhdfs to return 0 instead of -1 at
|
||
end-of-file, per C conventions. (Christian Kunz via cutting)
|
||
|
||
97. HADOOP-911. Fix a multithreading bug in libhdfs.
|
||
(Christian Kunz)
|
||
|
||
98. HADOOP-1486. Fix so that fatal exceptions in namenode cause it
|
||
to exit. (Dhruba Borthakur via cutting)
|
||
|
||
99. HADOOP-1470. Factor checksum generation and validation out of
|
||
ChecksumFileSystem so that it can be reused by FileSystems with
|
||
built-in checksumming. (Hairong Kuang via cutting)
|
||
|
||
100. HADOOP-1590. Use relative urls in jobtracker jsp pages, so that
|
||
webapp can be used in non-root contexts. (Thomas Friol via cutting)
|
||
|
||
101. HADOOP-1596. Fix the parsing of taskids by streaming and improve the
|
||
error reporting. (omalley)
|
||
|
||
102. HADOOP-1535. Fix the user-controlled grouping to the reduce function.
|
||
(Vivek Ratan via omalley)
|
||
|
||
103. HADOOP-1585. Modify GenericWritable to declare the classes as subtypes
|
||
of Writable. (Espen Amble Kolstad via omalley)
|
||
|
||
104. HADOOP-1576. Fix errors in count of completed tasks when
|
||
speculative execution is enabled. (Arun C Murthy via cutting)
|
||
|
||
105. HADOOP-1598. Fix license headers: adding missing; updating old.
|
||
(Enis Soztutar via cutting)
|
||
|
||
106. HADOOP-1547. Provide examples for aggregate library.
|
||
(Runping Qi via tomwhite)
|
||
|
||
107. HADOOP-1570. Permit jobs to enable and disable the use of
|
||
hadoop's native library. (Arun C Murthy via cutting)
|
||
|
||
108. HADOOP-1433. Add job priority. (Johan Oskarsson via tomwhite)
|
||
|
||
109. HADOOP-1597. Add status reports and post-upgrade options to HDFS
|
||
distributed upgrade. (Konstantin Shvachko via cutting)
|
||
|
||
110. HADOOP-1524. Permit user task logs to appear as they're
|
||
created. (Michael Bieniosek via cutting)
|
||
|
||
111. HADOOP-1599. Fix distcp bug on Windows. (Senthil Subramanian via cutting)
|
||
|
||
112. HADOOP-1562. Add JVM metrics, including GC and logging stats.
|
||
(David Bowen via cutting)
|
||
|
||
113. HADOOP-1613. Fix "DFS Health" page to display correct time of
|
||
last contact. (Dhruba Borthakur via cutting)
|
||
|
||
114. HADOOP-1134. Add optimized checksum support to HDFS. Checksums
|
||
are now stored with each block, rather than as parallel files.
|
||
This reduces the namenode's memory requirements and increases
|
||
data integrity. (Raghu Angadi via cutting)
|
||
|
||
115. HADOOP-1400. Make JobClient retry requests, so that clients can
|
||
survive jobtracker problems. (omalley via cutting)
|
||
|
||
116. HADOOP-1564. Add unit tests for HDFS block-level checksums.
|
||
(Dhruba Borthakur via cutting)
|
||
|
||
117. HADOOP-1620. Reduce the number of abstract FileSystem methods,
|
||
simplifying implementations. (cutting)
|
||
|
||
118. HADOOP-1625. Fix a "could not move files" exception in datanode.
|
||
(Raghu Angadi via cutting)
|
||
|
||
119. HADOOP-1624. Fix an infinite loop in datanode. (Raghu Angadi via cutting)
|
||
|
||
120. HADOOP-1084. Switch mapred file cache to use file modification
|
||
time instead of checksum to detect file changes, as checksums are
|
||
no longer easily accessed. (Arun C Murthy via cutting)
|
||
|
||
130. HADOOP-1623. Fix an infinite loop when copying directories.
|
||
(Dhruba Borthakur via cutting)
|
||
|
||
131. HADOOP-1603. Fix a bug in namenode initialization where
|
||
default replication is sometimes reset to one on restart.
|
||
(Raghu Angadi via cutting)
|
||
|
||
132. HADOOP-1635. Remove hardcoded keypair name and fix launch-hadoop-cluster
|
||
to support later versions of ec2-api-tools. (Stu Hood via tomwhite)
|
||
|
||
133. HADOOP-1638. Fix contrib EC2 scripts to support NAT addressing.
|
||
(Stu Hood via tomwhite)
|
||
|
||
134. HADOOP-1632. Fix an IllegalArgumentException in fsck.
|
||
(Hairong Kuang via cutting)
|
||
|
||
135. HADOOP-1619. Fix FSInputChecker to not attempt to read past EOF.
|
||
(Hairong Kuang via cutting)
|
||
|
||
136. HADOOP-1640. Fix TestDecommission on Windows.
|
||
(Dhruba Borthakur via cutting)
|
||
|
||
137. HADOOP-1587. Fix TestSymLink to get required system properties.
|
||
(Devaraj Das via omalley)
|
||
|
||
138. HADOOP-1628. Add block CRC protocol unit tests. (Raghu Angadi via omalley)
|
||
|
||
139. HADOOP-1653. FSDirectory code-cleanups. FSDirectory.INode
|
||
becomes a static class. (Christophe Taton via dhruba)
|
||
|
||
140. HADOOP-1066. Restructure documentation to make more user
|
||
friendly. (Connie Kleinjans and Jeff Hammerbacher via cutting)
|
||
|
||
141. HADOOP-1551. libhdfs supports setting replication factor and
|
||
retrieving modification time of files. (Sameer Paranjpye via dhruba)
|
||
|
||
141. HADOOP-1647. FileSystem.getFileStatus returns valid values for "/".
|
||
(Dhruba Borthakur via dhruba)
|
||
|
||
142. HADOOP-1657. Fix NNBench to ensure that the block size is a
|
||
multiple of bytes.per.checksum. (Raghu Angadi via dhruba)
|
||
|
||
143. HADOOP-1553. Replace user task output and log capture code to use shell
|
||
redirection instead of copier threads in the TaskTracker. Capping the
|
||
size of the output is now done via tail in memory and thus should not be
|
||
large. The output of the tasklog servlet is not forced into UTF8 and is
|
||
not buffered entirely in memory. (omalley)
|
||
Configuration changes to hadoop-default.xml:
|
||
remove mapred.userlog.num.splits
|
||
remove mapred.userlog.purge.splits
|
||
change default mapred.userlog.limit.kb to 0 (no limit)
|
||
change default mapred.userlog.retain.hours to 24
|
||
Configuration changes to log4j.properties:
|
||
remove log4j.appender.TLA.noKeepSplits
|
||
remove log4j.appender.TLA.purgeLogSplits
|
||
remove log4j.appender.TLA.logsRetainHours
|
||
URL changes:
|
||
http://<tasktracker>/tasklog.jsp -> http://<tasktracker>/tasklog with
|
||
parameters limited to start and end, which may be positive (from
|
||
start) or negative (from end).
|
||
Environment:
|
||
require bash (v2 or later) and tail
|
||
|
||
144. HADOOP-1659. Fix a job id/job name mixup. (Arun C. Murthy via omalley)
|
||
|
||
145. HADOOP-1665. With HDFS Trash enabled, when the same file is created
|
||
and deleted more than once, the succeeding deletions create Trash item
|
||
names suffixed with an integer. (Dhruba Borthakur via dhruba)
|
||
|
||
146. HADOOP-1666. FsShell object can be used for multiple fs commands.
|
||
(Dhruba Borthakur via dhruba)
|
||
|
||
147. HADOOP-1654. Remove performance regression introduced by Block CRC.
|
||
(Raghu Angadi via dhruba)
|
||
|
||
148. HADOOP-1680. Improvements to Block CRC upgrade messages.
|
||
(Raghu Angadi via dhruba)
|
||
|
||
149. HADOOP-71. Allow Text and SequenceFile Map/Reduce inputs from non-default
|
||
filesystems. (omalley)
|
||
|
||
150. HADOOP-1568. Expose HDFS as xml/http filesystem to provide cross-version
|
||
compatibility. (Chris Douglas via omalley)
|
||
|
||
151. HADOOP-1668. Added an INCOMPATIBILITY section to CHANGES.txt. (nigel)
|
||
|
||
152. HADOOP-1629. Added an upgrade test for HADOOP-1134.
|
||
(Raghu Angadi via nigel)
|
||
|
||
153. HADOOP-1698. Fix performance problems on map output sorting for jobs
|
||
with large numbers of reduces. (Devaraj Das via omalley)
|
||
|
||
154. HADOOP-1716. Fix a Pipes wordcount example to remove the 'file:'
|
||
schema from its output path. (omalley via cutting)
|
||
|
||
155. HADOOP-1714. Fix TestDFSUpgradeFromImage to work on Windows.
|
||
(Raghu Angadi via nigel)
|
||
|
||
156. HADOOP-1663. Return a non-zero exit code if streaming fails. (Lohit Renu
|
||
via omalley)
|
||
|
||
157. HADOOP-1712. Fix an unhandled exception on datanode during block
|
||
CRC upgrade. (Raghu Angadi via cutting)
|
||
|
||
158. HADOOP-1717. Fix TestDFSUpgradeFromImage to work on Solaris.
|
||
(nigel via cutting)
|
||
|
||
159. HADOOP-1437. Add Eclipse plugin in contrib.
|
||
(Eugene Hung and Christophe Taton via cutting)
|
||
|
||
|
||
Release 0.13.0 - 2007-06-08

1. HADOOP-1047. Fix TestReplication to succeed more reliably.
(Hairong Kuang via cutting)

2. HADOOP-1063. Fix a race condition in MiniDFSCluster test code.
(Hairong Kuang via cutting)

3. HADOOP-1101. In web ui, split shuffle statistics from reduce
statistics, and add some task averages. (Devaraj Das via cutting)

4. HADOOP-1071. Improve handling of protocol version mismatch in
JobTracker. (Tahir Hashmi via cutting)

5. HADOOP-1116. Increase heap size used for contrib unit tests.
(Philippe Gassmann via cutting)

6. HADOOP-1120. Add contrib/data_join, tools to simplify joining
data from multiple sources using MapReduce. (Runping Qi via cutting)

7. HADOOP-1064. Reduce log level of some DFSClient messages.
(Dhruba Borthakur via cutting)

8. HADOOP-1137. Fix StatusHttpServer to work correctly when
resources are in a jar file. (Benjamin Reed via cutting)

9. HADOOP-1094. Optimize generated Writable implementations for
records to not allocate a new BinaryOutputArchive or
BinaryInputArchive per call. (Milind Bhandarkar via cutting)

10. HADOOP-1068. Improve error message for clusters with 0 datanodes.
(Dhruba Borthakur via tomwhite)

11. HADOOP-1122. Fix divide-by-zero exception in FSNamesystem
chooseTarget method. (Dhruba Borthakur via tomwhite)

12. HADOOP-1131. Add a closeAll() static method to FileSystem.
(Philippe Gassmann via tomwhite)

13. HADOOP-1085. Improve port selection in HDFS and MapReduce test
code. Ports are now selected by the OS during testing rather than
by probing for free ports, improving test reliability.
(Arun C Murthy via cutting)

14. HADOOP-1153. Fix HDFS daemons to correctly stop their threads.
(Konstantin Shvachko via cutting)

15. HADOOP-1146. Add a counter for reduce input keys and rename the
"reduce input records" counter to be "reduce input groups".
(David Bowen via cutting)

16. HADOOP-1165. In records, replace identical generated toString
methods with a method on the base class. (Milind Bhandarkar via cutting)

17. HADOOP-1164. Fix TestReplicationPolicy to specify port zero, so
that a free port is automatically selected. (omalley via cutting)

18. HADOOP-1166. Add a NullOutputFormat and use it in the
RandomWriter example. (omalley via cutting)

19. HADOOP-1169. Fix a cut/paste error in CopyFiles utility so that
S3-based source files are correctly copied. (Michael Stack via cutting)

20. HADOOP-1167. Remove extra synchronization in InMemoryFileSystem.
(omalley via cutting)

21. HADOOP-1110. Fix an off-by-one error counting map inputs.
(David Bowen via cutting)

22. HADOOP-1178. Fix a NullPointerException during namenode startup.
(Dhruba Borthakur via cutting)

23. HADOOP-1011. Fix a ConcurrentModificationException when viewing
job history. (Tahir Hashmi via cutting)

24. HADOOP-672. Improve help for fs shell commands.
(Dhruba Borthakur via cutting)

25. HADOOP-1170. Improve datanode performance by removing device
checks from common operations. (Igor Bolotin via cutting)

26. HADOOP-1090. Fix SortValidator's detection of whether the input
file belongs to the sort-input or sort-output directory.
(Arun C Murthy via tomwhite)

27. HADOOP-1081. Fix bin/hadoop on Darwin. (Michael Bieniosek via cutting)

28. HADOOP-1045. Add contrib/hbase, a BigTable-like online database.
(Jim Kellerman via cutting)

29. HADOOP-1156. Fix a NullPointerException in MiniDFSCluster.
(Hairong Kuang via cutting)

30. HADOOP-702. Add tools to help automate HDFS upgrades.
(Konstantin Shvachko via cutting)

31. HADOOP-1163. Fix ganglia metrics to aggregate metrics from different
hosts properly. (Michael Bieniosek via tomwhite)

32. HADOOP-1194. Make compression style record level for map output
compression. (Arun C Murthy via tomwhite)

33. HADOOP-1187. Improve DFS Scalability: avoid scanning entire list of
datanodes in getAdditionalBlocks. (Dhruba Borthakur via tomwhite)

34. HADOOP-1133. Add tool to analyze and debug namenode on a production
cluster. (Dhruba Borthakur via tomwhite)

35. HADOOP-1151. Remove spurious printing to stderr in streaming
PipeMapRed. (Koji Noguchi via tomwhite)

36. HADOOP-988. Change namenode to use a single map of blocks to metadata.
(Raghu Angadi via tomwhite)

37. HADOOP-1203. Change UpgradeUtilities used by DFS tests to use
MiniDFSCluster to start and stop NameNode/DataNodes.
(Nigel Daley via tomwhite)

38. HADOOP-1217. Add test.timeout property to build.xml, so that
long-running unit tests may be automatically terminated.
(Nigel Daley via cutting)

39. HADOOP-1149. Improve DFS Scalability: make
processOverReplicatedBlock() a no-op if blocks are not
over-replicated. (Raghu Angadi via tomwhite)

40. HADOOP-1149. Improve DFS Scalability: optimize getDistance(),
contains(), and isOnSameRack() in NetworkTopology.
(Hairong Kuang via tomwhite)

41. HADOOP-1218. Make synchronization on TaskTracker's RunningJob
object consistent. (Devaraj Das via tomwhite)

42. HADOOP-1219. Ignore progress report once a task has reported as
'done'. (Devaraj Das via tomwhite)

43. HADOOP-1114. Permit user to specify additional CLASSPATH elements
with a HADOOP_CLASSPATH environment variable. (cutting)

44. HADOOP-1198. Remove ipc.client.timeout parameter override from
unit test configuration. Using the default is more robust and
has almost the same run time. (Arun C Murthy via tomwhite)

45. HADOOP-1211. Remove deprecated constructor and unused static
members in DataNode class. (Konstantin Shvachko via tomwhite)

46. HADOOP-1136. Fix ArrayIndexOutOfBoundsException in
FSNamesystem$UnderReplicatedBlocks add() method.
(Hairong Kuang via tomwhite)

47. HADOOP-978. Add the client name and the address of the node that
previously started to create the file to the description of
AlreadyBeingCreatedException. (Konstantin Shvachko via tomwhite)

48. HADOOP-1001. Check the type of keys and values generated by the
mapper against the types specified in JobConf.
(Tahir Hashmi via tomwhite)

49. HADOOP-971. Improve DFS Scalability: Improve name node performance
by adding a hostname to datanodes map. (Hairong Kuang via tomwhite)

50. HADOOP-1189. Fix 'No space left on device' exceptions on datanodes.
(Raghu Angadi via tomwhite)

51. HADOOP-819. Change LineRecordWriter to not insert a tab between
key and value when either is null, and to print nothing when both
are null. (Runping Qi via cutting)

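The formatting rule described in HADOOP-819 can be written down as a small
Python sketch. This is an illustration of the rule as stated, not
LineRecordWriter itself; the function name is hypothetical, and the trailing
newline on each non-empty record is an assumption.

```python
def format_record(key, value):
    """Return the text written for one (key, value) pair."""
    if key is None and value is None:
        return ""                # both null: print nothing at all
    if key is None:
        return f"{value}\n"      # value only, no leading tab
    if value is None:
        return f"{key}\n"        # key only, no trailing tab
    return f"{key}\t{value}\n"   # normal case: tab-separated


lines = [format_record("k", "v"), format_record("k", None),
         format_record(None, "v"), format_record(None, None)]
```
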
52. HADOOP-1204. Rename InputFormatBase to be FileInputFormat, and
deprecate InputFormatBase. Also make LineRecordReader easier to
extend. (Runping Qi via cutting)

53. HADOOP-1213. Improve logging of errors by IPC server, to
consistently include the service name and the call. (cutting)

54. HADOOP-1238. Fix metrics reporting by TaskTracker to correctly
track maps_running and reduces_running.
(Michael Bieniosek via cutting)

55. HADOOP-1093. Fix a race condition in HDFS where blocks were
sometimes erased before they were reported written.
(Dhruba Borthakur via cutting)

56. HADOOP-1239. Add a package name to some testjar test classes.
(Jim Kellerman via cutting)

57. HADOOP-1241. Fix NullPointerException in processReport when
namenode is restarted. (Dhruba Borthakur via tomwhite)

58. HADOOP-1244. Fix stop-dfs.sh to no longer incorrectly specify
slaves file for stopping datanode.
(Michael Bieniosek via tomwhite)

59. HADOOP-1253. Fix ConcurrentModificationException and
NullPointerException in JobControl.
(Johan Oskarson via tomwhite)

60. HADOOP-1256. Fix NameNode so that multiple DataNodeDescriptors
can no longer be created on startup. (Hairong Kuang via cutting)

61. HADOOP-1214. Replace streaming classes with new counterparts
from Hadoop core. (Runping Qi via tomwhite)

62. HADOOP-1250. Move a chmod utility from streaming to FileUtil.
(omalley via cutting)

63. HADOOP-1258. Fix TestCheckpoint test case to wait for
MiniDFSCluster to be active. (Nigel Daley via tomwhite)

64. HADOOP-1148. Re-indent all Java source code to consistently use
two spaces per indent level. (cutting)

65. HADOOP-1251. Add a method to Reporter to get the map InputSplit.
(omalley via cutting)

66. HADOOP-1224. Fix "Browse the filesystem" link to no longer point
to dead datanodes. (Enis Soztutar via tomwhite)

67. HADOOP-1154. Fail a streaming task if the threads reading from or
writing to the streaming process fail. (Koji Noguchi via tomwhite)

68. HADOOP-968. Move shuffle and sort to run in reduce's child JVM,
rather than in TaskTracker. (Devaraj Das via cutting)

69. HADOOP-1111. Add support for client notification of job
completion. If the job configuration has a job.end.notification.url
property it will make an HTTP GET request to the specified URL.
The number of retries and the interval between retries are also
configurable. (Alejandro Abdelnur via tomwhite)

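The notification flow in HADOOP-1111 can be sketched as follows. This is a
Python illustration under stated assumptions, not the JobTracker's code: the
function name, the `retries` and `interval_s` parameters, and the injected
`http_get` callable are all hypothetical. Only the job.end.notification.url
property name comes from the entry.

```python
import time


def notify_job_end(conf, http_get, sleep=time.sleep, retries=3, interval_s=1.0):
    """Send one GET to the configured URL, retrying on I/O errors.

    `conf` is a plain dict standing in for the job configuration and
    `http_get` is any callable that performs the request.
    Returns True once a request succeeds, False otherwise.
    """
    url = conf.get("job.end.notification.url")
    if not url:
        return False  # property absent: no notification is sent
    for attempt in range(retries + 1):
        try:
            http_get(url)
            return True
        except OSError:
            if attempt < retries:
                sleep(interval_s)  # wait before the next attempt
    return False
```

In real use `http_get` could be `urllib.request.urlopen`, whose errors derive
from `OSError`.
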
70. HADOOP-1275. Fix misspelled job notification property in
hadoop-default.xml. (Alejandro Abdelnur via tomwhite)

71. HADOOP-1152. Fix race condition in MapOutputCopier.copyOutput file
rename causing possible reduce task hang.
(Tahir Hashmi via tomwhite)

72. HADOOP-1050. Distinguish between failed and killed tasks so as to
not count a lost tasktracker against the job.
(Arun C Murthy via tomwhite)

73. HADOOP-1271. Fix StreamBaseRecordReader to be able to log record
data that's not UTF-8. (Arun C Murthy via tomwhite)

74. HADOOP-1190. Fix unchecked warnings in main Hadoop code.
(tomwhite)

75. HADOOP-1127. Fix AlreadyBeingCreatedException in namenode for
jobs run with speculative execution.
(Arun C Murthy via tomwhite)

76. HADOOP-1282. Omnibus HBase patch. Improved tests & configuration.
(Jim Kellerman via cutting)

77. HADOOP-1262. Make dfs client try to read from a different replica
of the checksum file when a checksum error is detected.
(Hairong Kuang via tomwhite)

78. HADOOP-1279. Fix JobTracker to maintain list of recently
completed jobs by order of completion, not submission.
(Arun C Murthy via cutting)

79. HADOOP-1284. In contrib/streaming, permit flexible specification
of field delimiter and fields for partitioning and sorting.
(Runping Qi via cutting)

80. HADOOP-1176. Fix a bug where reduce would hang when a map had
more than 2GB of output for it. (Arun C Murthy via cutting)

81. HADOOP-1293. Fix contrib/streaming to print more than the first
twenty lines of standard error. (Koji Noguchi via cutting)

82. HADOOP-1297. Fix datanode so that requests to remove blocks that
do not exist no longer cause block reports to be re-sent every
second. (Dhruba Borthakur via cutting)

83. HADOOP-1216. Change MapReduce so that, when numReduceTasks is
zero, map outputs are written directly as final output, skipping
shuffle, sort and reduce. Use this to implement reduce=NONE
option in contrib/streaming. (Runping Qi via cutting)

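The behavior described in HADOOP-1216 can be illustrated with a toy,
single-process job runner. This is a Python sketch of the control flow only;
`run_job`, `word_mapper`, and `sum_reducer` are hypothetical names, and
nothing here reflects how the framework is actually implemented.

```python
from itertools import groupby


def run_job(records, mapper, reducer, num_reduce_tasks):
    """Run a toy map/reduce job in memory and return its output pairs."""
    mapped = [pair for record in records for pair in mapper(record)]
    if num_reduce_tasks == 0:
        # Map-only job: map output is the final output, with no
        # shuffle, sort, or reduce step.
        return mapped
    mapped.sort(key=lambda kv: kv[0])
    return [reducer(key, [value for _, value in group])
            for key, group in groupby(mapped, key=lambda kv: kv[0])]


def word_mapper(line):
    return [(word, 1) for word in line.split()]


def sum_reducer(key, values):
    return (key, sum(values))


lines = ["b a", "a"]
map_only = run_job(lines, word_mapper, sum_reducer, num_reduce_tasks=0)
reduced = run_job(lines, word_mapper, sum_reducer, num_reduce_tasks=1)
```
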
84. HADOOP-1294. Fix unchecked warnings in main Hadoop code under
Java 6. (tomwhite)

85. HADOOP-1299. Fix so that RPC will restart after RPC.stopClient()
has been called. (Michael Stack via cutting)

86. HADOOP-1278. Improve blacklisting of TaskTrackers by JobTracker,
to reduce false positives. (Arun C Murthy via cutting)

87. HADOOP-1290. Move contrib/abacus into mapred/lib/aggregate.
(Runping Qi via cutting)

88. HADOOP-1272. Extract inner classes from FSNamesystem into separate
classes. (Dhruba Borthakur via tomwhite)

89. HADOOP-1247. Add support to contrib/streaming for aggregate
package, formerly called Abacus. (Runping Qi via cutting)

90. HADOOP-1061. Fix bug in listing files in the S3 filesystem.
NOTE: this change is not backwards compatible! You should use the
MigrationTool supplied to migrate existing S3 filesystem data to
the new format. Please back up your data first before upgrading
(using 'hadoop distcp' for example). (tomwhite)

91. HADOOP-1304. Make configurable the maximum number of task
attempts before a job fails. (Devaraj Das via cutting)

92. HADOOP-1308. Use generics to restrict types when classes are
passed as parameters to JobConf methods. (Michael Bieniosek via cutting)

93. HADOOP-1312. Fix a ConcurrentModificationException in NameNode
that killed the heartbeat monitoring thread.
(Dhruba Borthakur via cutting)

94. HADOOP-1315. Clean up contrib/streaming, switching it to use core
classes more and removing unused code. (Runping Qi via cutting)

95. HADOOP-485. Allow a different comparator for grouping keys in
calls to reduce. (Tahir Hashmi via cutting)

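What a separate grouping comparator makes possible (HADOOP-485) is easiest to
see with composite keys. In the Python sketch below, pairs are sorted by the
full key but grouped by only its first field, so each reduce call sees its
values in a defined order. The function and parameter names are hypothetical
and this is not the JobConf API.

```python
from itertools import groupby


def reduce_calls(pairs, sort_key, group_key):
    """Return one (group, values) tuple per simulated reduce call."""
    ordered = sorted(pairs, key=lambda kv: sort_key(kv[0]))
    return [(group, [value for _, value in items])
            for group, items in groupby(ordered,
                                        key=lambda kv: group_key(kv[0]))]


# Keys are (user, sequence number); group on the user alone.
pairs = [(("alice", 3), "c"), (("bob", 2), "b"), (("alice", 1), "a")]
calls = reduce_calls(pairs, sort_key=lambda k: k, group_key=lambda k: k[0])
```
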
96. HADOOP-1322. Fix TaskTracker blacklisting to work correctly in
one- and two-node clusters. (Arun C Murthy via cutting)

97. HADOOP-1144. Permit one to specify a maximum percentage of tasks
that can fail before a job is aborted. The default is zero.
(Arun C Murthy via cutting)

98. HADOOP-1184. Fix HDFS decommissioning to complete when the only
copy of a block is on a decommissioned node. (Dhruba Borthakur via cutting)

99. HADOOP-1263. Change DFSClient to retry certain namenode calls
with a random, exponentially increasing backoff time, to avoid
overloading the namenode on, e.g., job start. (Hairong Kuang via cutting)

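The retry policy named in HADOOP-1263 is a randomized exponential backoff. A
minimal Python sketch of the idea follows; the base delay, the jitter range,
and the function names are assumptions chosen for illustration and are not
taken from DFSClient.

```python
import random
import time


def backoff_delays(retries, base_s=0.4, rng=random.random):
    """Yield one delay per retry; the upper bound doubles every time."""
    for attempt in range(retries):
        ceiling = base_s * (2 ** attempt)
        # The random factor (between 0.5 and 1.0) spreads out clients
        # that all start retrying at the same moment.
        yield ceiling * (0.5 + rng() / 2)


def call_with_backoff(operation, retries=5, sleep=time.sleep):
    """Call `operation`, sleeping with growing delays after I/O errors."""
    for delay in backoff_delays(retries):
        try:
            return operation()
        except OSError:
            sleep(delay)
    return operation()  # final attempt; any error propagates to the caller
```

The randomization matters as much as the doubling: without it, many clients
that fail together would also retry together.
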
100. HADOOP-1325. First complete, functioning version of HBase.
(Jim Kellerman via cutting)

101. HADOOP-1276. Make tasktracker expiry interval configurable.
(Arun C Murthy via cutting)

102. HADOOP-1326. Change JobClient#RunJob() to return the job.
(omalley via cutting)

103. HADOOP-1270. Randomize the fetch of map outputs, speeding the
shuffle. (Arun C Murthy via cutting)

104. HADOOP-1200. Restore disk checking lost in HADOOP-1170.
(Hairong Kuang via cutting)

105. HADOOP-1252. Changed MapReduce's allocation of local files to
use round-robin among available devices, rather than a hashcode.
More care is also taken to not allocate files on full or offline
drives. (Devaraj Das via cutting)

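The allocation policy in HADOOP-1252 can be sketched as a round-robin cursor
that skips unusable directories. The Python below is an illustration only;
the class name, the `free_space` callback, and its "None means offline"
convention are hypothetical.

```python
class RoundRobinAllocator:
    """Hand out local directories in turn, skipping full or offline ones."""

    def __init__(self, dirs):
        self.dirs = list(dirs)
        self.next_index = 0

    def allocate(self, size, free_space):
        """Return a directory with at least `size` bytes free.

        `free_space(dir)` returns the free bytes, or None if the
        directory's device is offline.
        """
        for _ in range(len(self.dirs)):
            candidate = self.dirs[self.next_index]
            # Advance the cursor whether or not the candidate is usable.
            self.next_index = (self.next_index + 1) % len(self.dirs)
            free = free_space(candidate)
            if free is not None and free >= size:
                return candidate
        raise OSError("no local directory has room for %d bytes" % size)


space = {"/disk1": 100, "/disk2": None, "/disk3": 5}
allocator = RoundRobinAllocator(space)
picks = [allocator.allocate(10, space.get),
         allocator.allocate(10, space.get),
         allocator.allocate(1, space.get)]
```
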
106. HADOOP-1324. Change so that an FSError kills only the task that
generates it rather than the entire task tracker.
(Arun C Murthy via cutting)

107. HADOOP-1310. Fix unchecked warnings in aggregate code. (tomwhite)

108. HADOOP-1255. Fix a bug where the namenode falls into an infinite
loop trying to remove a dead node. (Hairong Kuang via cutting)

109. HADOOP-1160. Fix DistributedFileSystem.close() to close the
underlying FileSystem, correctly aborting files being written.
(Hairong Kuang via cutting)

110. HADOOP-1341. Fix intermittent failures in HBase unit tests
caused by deadlock. (Jim Kellerman via cutting)

111. HADOOP-1350. Fix shuffle performance problem caused by forcing
chunked encoding of map outputs. (Devaraj Das via cutting)

112. HADOOP-1345. Fix HDFS to correctly retry another replica when a
checksum error is encountered. (Hairong Kuang via cutting)

113. HADOOP-1205. Improve synchronization around HDFS block map.
(Hairong Kuang via cutting)

114. HADOOP-1353. Fix a potential NullPointerException in namenode.
(Dhruba Borthakur via cutting)

115. HADOOP-1354. Fix a potential NullPointerException in FsShell.
(Hairong Kuang via cutting)

116. HADOOP-1358. Fix a potential bug when DFSClient calls skipBytes.
(Hairong Kuang via cutting)

117. HADOOP-1356. Fix a bug in ValueHistogram. (Runping Qi via cutting)

118. HADOOP-1363. Fix locking bug in JobClient#waitForCompletion().
(omalley via cutting)

119. HADOOP-1368. Fix inconsistent synchronization in JobInProgress.
(omalley via cutting)

120. HADOOP-1369. Fix inconsistent synchronization in TaskTracker.
(omalley via cutting)

121. HADOOP-1361. Fix various calls to skipBytes() to check return
value. (Hairong Kuang via cutting)

122. HADOOP-1388. Fix a potential NullPointerException in web ui.
(Devaraj Das via cutting)

123. HADOOP-1385. Fix MD5Hash#hashCode() to generally hash to more
than 256 values. (omalley via cutting)

124. HADOOP-1386. Fix Path to not permit the empty string as a
path, as this has led to accidental file deletion. Instead
force applications to use "." to name the default directory.
(Hairong Kuang via cutting)

125. HADOOP-1407. Fix integer division bug in JobInProgress which
meant failed tasks didn't cause the job to fail.
(Arun C Murthy via tomwhite)

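The class of bug fixed in HADOOP-1407 is easy to reproduce. The entry does
not show the faulty expression, so the Python sketch below is only an
assumed example of how integer division can hide failures when a failure
percentage is compared against a threshold. All names are hypothetical.

```python
def failure_percent_truncated(failed, total):
    # Integer division happens first, so any failed < total yields 0.
    return (failed // total) * 100


def failure_percent(failed, total):
    # Multiply before dividing and keep the fraction.
    return failed * 100.0 / total


def job_should_fail(failed, total, max_failure_percent=0):
    return failure_percent(failed, total) > max_failure_percent


truncated = failure_percent_truncated(3, 10)
correct = failure_percent(3, 10)
```

With three of ten tasks failed, the truncated form reports zero percent, so a
check against a threshold of zero would never trip.
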
126. HADOOP-1427. Fix a typo that caused GzipCodec to incorrectly use
a very small input buffer. (Espen Amble Kolstad via cutting)

127. HADOOP-1435. Fix globbing code to no longer use the empty string
to indicate the default directory, per HADOOP-1386.
(Hairong Kuang via cutting)

128. HADOOP-1411. Make task retry framework handle
AlreadyBeingCreatedException when wrapped as a RemoteException.
(Hairong Kuang via tomwhite)

129. HADOOP-1242. Improve handling of DFS upgrades.
(Konstantin Shvachko via cutting)

130. HADOOP-1332. Fix so that TaskTracker exits reliably during unit
tests on Windows. (omalley via cutting)

131. HADOOP-1431. Fix so that sort progress reporting during map runs
only while sorting, so that stuck maps are correctly terminated.
(Devaraj Das and Arun C Murthy via cutting)

132. HADOOP-1452. Change TaskTracker.MapOutputServlet.doGet.totalRead
to a long, permitting map outputs to exceed 2^31 bytes.
(omalley via cutting)

133. HADOOP-1443. Fix a bug opening zero-length files in HDFS.
(Konstantin Shvachko via cutting)


Release 0.12.3 - 2007-04-06

1. HADOOP-1162. Fix bug in record CSV and XML serialization of
binary values. (Milind Bhandarkar via cutting)

2. HADOOP-1123. Fix NullPointerException in LocalFileSystem when
trying to recover from a checksum error.
(Hairong Kuang & Nigel Daley via tomwhite)

3. HADOOP-1177. Fix bug where IOException in MapOutputLocation.getFile
was not being logged. (Devaraj Das via tomwhite)

4. HADOOP-1175. Fix bugs in JSP for displaying a task's log messages.
(Arun C Murthy via cutting)

5. HADOOP-1191. Fix map tasks to wait until sort progress thread has
stopped before reporting the task done. (Devaraj Das via cutting)

6. HADOOP-1192. Fix an integer overflow bug in FSShell's 'dus'
command and a performance problem in HDFS's implementation of it.
(Hairong Kuang via cutting)

7. HADOOP-1105. Fix reducers to make "progress" while iterating
through values. (Devaraj Das & Owen O'Malley via tomwhite)

8. HADOOP-1179. Make Task Tracker close index file as soon as the read
is done when serving get-map-output requests.
(Devaraj Das via tomwhite)


Release 0.12.2 - 2007-23-17

1. HADOOP-1135. Fix bug in block report processing which may cause
the namenode to delete blocks. (Dhruba Borthakur via tomwhite)

2. HADOOP-1145. Make XML serializer and deserializer classes public
in record package. (Milind Bhandarkar via cutting)

3. HADOOP-1140. Fix a deadlock in metrics. (David Bowen via cutting)

4. HADOOP-1150. Fix streaming -reducer and -mapper to give them
defaults. (Owen O'Malley via tomwhite)


Release 0.12.1 - 2007-03-17

1. HADOOP-1035. Fix a StackOverflowError in FSDataSet.
(Raghu Angadi via cutting)

2. HADOOP-1053. Fix VInt representation of negative values. Also
remove references in generated record code to methods outside of
the record package and improve some record documentation.
(Milind Bhandarkar via cutting)

3. HADOOP-1067. Compile fails if Checkstyle jar is present in lib
directory. Also remove dependency on a particular Checkstyle
version number. (tomwhite)

4. HADOOP-1060. Fix an IndexOutOfBoundsException in the JobTracker
that could cause jobs to hang. (Arun C Murthy via cutting)

5. HADOOP-1077. Fix a race condition fetching map outputs that could
hang reduces. (Devaraj Das via cutting)

6. HADOOP-1083. Fix so that when a cluster restarts with a missing
datanode, its blocks are replicated. (Hairong Kuang via cutting)

7. HADOOP-1082. Fix a NullPointerException in ChecksumFileSystem.
(Hairong Kuang via cutting)

8. HADOOP-1088. Fix record serialization of negative values.
(Milind Bhandarkar via cutting)

9. HADOOP-1080. Fix bug in bin/hadoop on Windows when native
libraries are present. (ab via cutting)

10. HADOOP-1091. Fix a NullPointerException in MetricsRecord.
(David Bowen via tomwhite)

11. HADOOP-1092. Fix a NullPointerException in HeartbeatMonitor
thread. (Hairong Kuang via tomwhite)

12. HADOOP-1112. Fix a race condition in Hadoop metrics.
(David Bowen via tomwhite)

13. HADOOP-1108. Checksummed file system should retry reading if a
different replica is found when handling ChecksumException.
(Hairong Kuang via tomwhite)

14. HADOOP-1070. Fix a problem with number of racks and datanodes
temporarily doubling. (Konstantin Shvachko via tomwhite)

15. HADOOP-1099. Fix NullPointerException in JobInProgress.
(Gautam Kowshik via tomwhite)

16. HADOOP-1115. Fix bug where FsShell copyToLocal doesn't
copy directories. (Hairong Kuang via tomwhite)

17. HADOOP-1109. Fix NullPointerException in StreamInputFormat.
(Koji Noguchi via tomwhite)

18. HADOOP-1117. Fix DFS scalability: when the namenode is
restarted it consumes 80% CPU. (Dhruba Borthakur via
tomwhite)

19. HADOOP-1089. Make the C++ version of write and read v-int
agree with the Java versions. (Milind Bhandarkar via
tomwhite)

20. HADOOP-1096. Rename InputArchive and OutputArchive and
make them public. (Milind Bhandarkar via tomwhite)

21. HADOOP-1128. Fix missing progress information in map tasks.
(Espen Amble Kolstad, Andrzej Bialecki, and Owen O'Malley
via tomwhite)

22. HADOOP-1129. Fix DFSClient to not hide IOExceptions in
flush method. (Hairong Kuang via tomwhite)

23. HADOOP-1126. Optimize CPU usage for under replicated blocks
when cluster restarts. (Hairong Kuang via tomwhite)


Release 0.12.0 - 2007-03-02

1. HADOOP-975. Separate stdout and stderr from tasks.
(Arun C Murthy via cutting)

2. HADOOP-982. Add some setters and a toString() method to
BytesWritable. (omalley via cutting)

3. HADOOP-858. Move contrib/smallJobsBenchmark to src/test, removing
obsolete bits. (Nigel Daley via cutting)

4. HADOOP-992. Fix MiniMR unit tests to use MiniDFS when specified,
rather than the local FS. (omalley via cutting)

5. HADOOP-954. Change use of metrics to use callback mechanism.
Also rename utility class Metrics to MetricsUtil.
(David Bowen & Nigel Daley via cutting)

6. HADOOP-893. Improve HDFS client's handling of dead datanodes.
The set is no longer reset with each block, but rather is now
maintained for the life of an open file. (Raghu Angadi via cutting)

7. HADOOP-882. Upgrade to jets3t version 0.5, used by the S3
FileSystem. This version supports retries. (Michael Stack via cutting)

8. HADOOP-977. Send task's stdout and stderr to JobClient's stdout
and stderr respectively, with each line tagged by the task's name.
(Arun C Murthy via cutting)

9. HADOOP-761. Change unit tests to not use /tmp. (Nigel Daley via cutting)

10. HADOOP-1007. Make names of metrics used in Hadoop unique.
(Nigel Daley via cutting)

11. HADOOP-491. Change mapred.task.timeout to be per-job, and make a
value of zero mean no timeout. Also change contrib/streaming to
disable task timeouts. (Arun C Murthy via cutting)

12. HADOOP-1010. Add Reporter.NULL, a Reporter implementation that
does nothing. (Runping Qi via cutting)

13. HADOOP-923. In HDFS NameNode, move replication computation to a
separate thread, to improve heartbeat processing time.
(Dhruba Borthakur via cutting)

14. HADOOP-476. Rewrite contrib/streaming command-line processing,
improving parameter validation. (Sanjay Dahiya via cutting)

15. HADOOP-973. Improve error messages in Namenode. This should help
to track down a problem that was appearing as a
NullPointerException. (Dhruba Borthakur via cutting)

16. HADOOP-649. Fix so that jobs with no tasks are not lost.
(Thomas Friol via cutting)

17. HADOOP-803. Reduce memory use by HDFS namenode, phase I.
(Raghu Angadi via cutting)

18. HADOOP-1021. Fix MRCaching-based unit tests on Windows.
(Nigel Daley via cutting)

19. HADOOP-889. Remove duplicate code from HDFS unit tests.
(Milind Bhandarkar via cutting)

20. HADOOP-943. Improve HDFS's fsck command to display the filename
for under-replicated blocks. (Dhruba Borthakur via cutting)

21. HADOOP-333. Add validator for sort benchmark output.
(Arun C Murthy via cutting)

22. HADOOP-947. Improve performance of datanode decommissioning.
(Dhruba Borthakur via cutting)

23. HADOOP-442. Permit one to specify hosts allowed to connect to
namenode and jobtracker with include and exclude files. (Wendy
Chien via cutting)

24. HADOOP-1017. Cache constructors, for improved performance.
(Ron Bodkin via cutting)

25. HADOOP-867. Move split creation out of JobTracker to client.
Splits are now saved in a separate file, read by task processes
directly, so that user code is no longer required in the
JobTracker. (omalley via cutting)

26. HADOOP-1006. Remove obsolete '-local' option from test code.
(Gautam Kowshik via cutting)

27. HADOOP-952. Create a public (shared) Hadoop EC2 AMI.
The EC2 scripts now support launch of public AMIs.
(tomwhite)

28. HADOOP-1025. Remove some obsolete code in ipc.Server. (cutting)

29. HADOOP-997. Implement S3 retry mechanism for failed block
transfers. This includes a generic retry mechanism for use
elsewhere in Hadoop. (tomwhite)

30. HADOOP-990. Improve HDFS support for full datanode volumes.
(Raghu Angadi via cutting)

31. HADOOP-564. Replace uses of "dfs://" URIs with the more standard
"hdfs://". (Wendy Chien via cutting)

32. HADOOP-1030. In unit tests, unify setting of ipc.client.timeout.
Also increase the value used from one to two seconds, in hopes of
making tests complete more reliably. (cutting)

33. HADOOP-654. Stop assigning tasks to a tasktracker if it has
failed more than a specified number in the job.
(Arun C Murthy via cutting)

34. HADOOP-985. Change HDFS to identify nodes by IP address rather
than by DNS hostname. (Raghu Angadi via cutting)

35. HADOOP-248. Optimize location of map outputs to not use random
probes. (Devaraj Das via cutting)

36. HADOOP-1029. Fix streaming's input format to correctly seek to
the start of splits. (Arun C Murthy via cutting)

37. HADOOP-492. Add per-job and per-task counters. These are
incremented via the Reporter interface and available through the
web ui and the JobClient API. The mapreduce framework maintains a
few basic counters, and applications may add their own. Counters
are also passed to the metrics system.
(David Bowen via cutting)

38. HADOOP-1034. Fix datanode to better log exceptions.
(Philippe Gassmann via cutting)

39. HADOOP-878. In contrib/streaming, fix reducer=NONE to work with
multiple maps. (Arun C Murthy via cutting)

40. HADOOP-1039. In HDFS's TestCheckpoint, avoid restarting
MiniDFSCluster so often, speeding this test. (Dhruba Borthakur via cutting)

41. HADOOP-1040. Update RandomWriter example to use counters and
user-defined input and output formats. (omalley via cutting)

42. HADOOP-1027. Fix problems with in-memory merging during shuffle
and re-enable this optimization. (Devaraj Das via cutting)

43. HADOOP-1036. Fix exception handling in TaskTracker to keep tasks
from being lost. (Arun C Murthy via cutting)

44. HADOOP-1042. Improve the handling of failed map output fetches.
(Devaraj Das via cutting)

45. HADOOP-928. Make checksums optional per FileSystem.
|
||
(Hairong Kuang via cutting)
|
||
|
||
46. HADOOP-1044. Fix HDFS's TestDecommission to not spuriously fail.
|
||
(Wendy Chien via cutting)
|
||
|
||
47. HADOOP-972. Optimize HDFS's rack-aware block placement algorithm.
|
||
(Hairong Kuang via cutting)
|
||
|
||
48. HADOOP-1043. Optimize shuffle, increasing parallelism.
|
||
(Devaraj Das via cutting)
|
||
|
||
49. HADOOP-940. Improve HDFS's replication scheduling.
|
||
(Dhruba Borthakur via cutting)
|
||
|
||
50. HADOOP-1020. Fix a bug in Path resolution, and a with unit tests
|
||
on Windows. (cutting)
|
||
|
||
51. HADOOP-941. Enhance record facility.
|
||
(Milind Bhandarkar via cutting)
|
||
|
||
52. HADOOP-1000. Fix so that log messages in task subprocesses are
|
||
not written to a task's standard error. (Arun C Murthy via cutting)
|
||
|
||
53. HADOOP-1037. Fix bin/slaves.sh, which currently only works with
|
||
/bin/bash, to specify /bin/bash rather than /bin/sh. (cutting)
|
||
|
||
54. HADOOP-1046. Clean up tmp from partially received stale block files. (ab)
|
||
|
||
55. HADOOP-1041. Optimize mapred counter implementation. Also group
|
||
counters by their declaring Enum. (David Bowen via cutting)
|
||
|
||
56. HADOOP-1032. Permit one to specify jars that will be cached
|
||
across multiple jobs. (Gautam Kowshik via cutting)
|
||
|
||
57. HADOOP-1051. Add optional checkstyle task to build.xml. To use
|
||
this developers must download the (LGPL'd) checkstyle jar
|
||
themselves. (tomwhite via cutting)
|
||
|
||
58. HADOOP-1049. Fix a race condition in IPC client.
|
||
(Devaraj Das via cutting)
|
||
|
||
60. HADOOP-1056. Check HDFS include/exclude node lists with both IP
|
||
address and hostname. (Wendy Chien via cutting)
|
||
|
||
61. HADOOP-994. In HDFS, limit the number of blocks invalidated at
|
||
once. Large lists were causing datanodes to time out.
|
||
(Dhruba Borthakur via cutting)
|
||
|
||
62. HADOOP-432. Add a trash feature, disabled by default. When
|
||
enabled, the FSShell 'rm' command will move things to a trash
|
||
directory in the filesystem. In HDFS, a thread periodically
|
||
checkpoints the trash and removes old checkpoints. (cutting)
|
||
|
||
|
||
Release 0.11.2 - 2007-02-16
|
||
|
||
1. HADOOP-1009. Fix an infinite loop in the HDFS namenode.
|
||
(Dhruba Borthakur via cutting)
|
||
|
||
2. HADOOP-1014. Disable in-memory merging during shuffle, as this is
|
||
causing data corruption. (Devaraj Das via cutting)
|
||
|
||
|
||
Release 0.11.1 - 2007-02-09
|
||
|
||
1. HADOOP-976. Make SequenceFile.Metadata public. (Runping Qi via cutting)
|
||
|
||
2. HADOOP-917. Fix a NullPointerException in SequenceFile's merger
|
||
with large map outputs. (omalley via cutting)
|
||
|
||
3. HADOOP-984. Fix a bug in shuffle error handling introduced by
|
||
HADOOP-331. If a map output is unavailable, the job tracker is
|
||
once more informed. (Arun C Murthy via cutting)
|
||
|
||
4. HADOOP-987. Fix a problem in HDFS where blocks were not removed
|
||
from neededReplications after a replication target was selected.
|
||
(Hairong Kuang via cutting)
|
||
|
||
Release 0.11.0 - 2007-02-02
|
||
|
||
1. HADOOP-781. Remove methods deprecated in 0.10 that are no longer
|
||
widely used. (cutting)
|
||
|
||
2. HADOOP-842. Change HDFS protocol so that the open() method is
|
||
passed the client hostname, to permit the namenode to order block
|
||
locations on the basis of network topology.
|
||
(Hairong Kuang via cutting)
|
||
|
||
3. HADOOP-852. Add an ant task to compile record definitions, and
|
||
use it to compile record unit tests. (Milind Bhandarkar via cutting)
|
||
|
||
4. HADOOP-757. Fix "Bad File Descriptor" exception in HDFS client
|
||
when an output file is closed twice. (Raghu Angadi via cutting)
|
||
|
||
5. [ intentionally blank ]
|
||
|
||
6. HADOOP-890. Replace dashes in metric names with underscores,
|
||
for better compatibility with some monitoring systems.
|
||
(Nigel Daley via cutting)
|
||
|
||
7. HADOOP-801. Add to jobtracker a log of task completion events.
|
||
(Sanjay Dahiya via cutting)
|
||
|
||
8. HADOOP-855. In HDFS, try to repair files with checksum errors.
|
||
An exception is still thrown, but corrupt blocks are now removed
|
||
when they have replicas. (Wendy Chien via cutting)
|
||
|
||
9. HADOOP-886. Reduce number of timer threads created by metrics API
|
||
by pooling contexts. (Nigel Daley via cutting)
|
||
|
||
10. HADOOP-897. Add a "javac.args" property to build.xml that permits
|
||
one to pass arbitrary options to javac. (Milind Bhandarkar via cutting)
|
||
|
||
11. HADOOP-899. Update libhdfs for changes in HADOOP-871.
|
||
(Sameer Paranjpye via cutting)
|
||
|
||
12. HADOOP-905. Remove some dead code from JobClient. (cutting)
|
||
|
||
13. HADOOP-902. Fix a NullPointerException in HDFS client when
|
||
closing output streams. (Raghu Angadi via cutting)
|
||
|
||
14. HADOOP-735. Switch generated record code to use BytesWritable to
|
||
represent fields of type 'buffer'. (Milind Bhandarkar via cutting)
|
||
|
||
15. HADOOP-830. Improve mapreduce merge performance by buffering and
|
||
merging multiple map outputs as they arrive at reduce nodes before
|
||
they're written to disk. (Devaraj Das via cutting)
|
||
|
||
16. HADOOP-908. Add a new contrib package, Abacus, that simplifies
|
||
counting and aggregation, built on MapReduce. (Runping Qi via cutting)
|
||
|
||
17. HADOOP-901. Add support for recursive renaming to the S3 filesystem.
|
||
(Tom White via cutting)
|
||
|
||
18. HADOOP-912. Fix a bug in TaskTracker.isIdle() that was
|
||
sporadically causing unit test failures. (Arun C Murthy via cutting)
|
||
|
||
19. HADOOP-909. Fix the 'du' command to correctly compute the size of
|
||
FileSystem directory trees. (Hairong Kuang via cutting)
|
||
|
||
20. HADOOP-731. When a checksum error is encountered on a file stored
|
||
in HDFS, try another replica of the data, if any.
|
||
(Wendy Chien via cutting)
|
||
|
||
21. HADOOP-732. Add support to SequenceFile for arbitrary metadata,
|
||
as a set of attribute value pairs. (Runping Qi via cutting)
|
||
|
||
22. HADOOP-929. Fix PhasedFileSystem to pass configuration to
|
||
underlying FileSystem. (Sanjay Dahiya via cutting)
|
||
|
||
23. HADOOP-935. Fix contrib/abacus to not delete pre-existing output
|
||
files, but rather to fail in this case. (Runping Qi via cutting)
|
||
|
||
24. HADOOP-936. More metric renamings, as in HADOOP-890.
|
||
(Nigel Daley via cutting)
|
||
|
||
25. HADOOP-856. Fix HDFS's fsck command to not report that
|
||
non-existent filesystems are healthy. (Milind Bhandarkar via cutting)
|
||
|
||
26. HADOOP-602. Remove the dependency on Lucene's PriorityQueue
|
||
utility, by copying it into Hadoop. This facilitates using Hadoop
|
||
with different versions of Lucene without worrying about CLASSPATH
|
||
order. (Milind Bhandarkar via cutting)
|
||
|
||
27. [ intentionally blank ]
|
||
|
||
28. HADOOP-227. Add support for backup namenodes, which periodically
|
||
get snapshots of the namenode state. (Dhruba Borthakur via cutting)
|
||
|
||
29. HADOOP-884. Add scripts in contrib/ec2 to facilitate running
|
||
Hadoop on an Amazon EC2 cluster. (Tom White via cutting)
|
||
|
||
30. HADOOP-937. Change the namenode to request re-registration of
|
||
datanodes in more circumstances. (Hairong Kuang via cutting)
|
||
|
||
31. HADOOP-922. Optimize small forward seeks in HDFS. If data is
|
||
likely already in flight, skip ahead rather than re-opening the
|
||
block. (Dhruba Borthakur via cutting)
|
||
|
||
32. HADOOP-961. Add a 'job -events' sub-command that prints job
|
||
events, including task completions and failures. (omalley via cutting)
|
||
|
||
33. HADOOP-959. Fix namenode snapshot code added in HADOOP-227 to
|
||
work on Windows. (Dhruba Borthakur via cutting)
|
||
|
||
34. HADOOP-934. Fix TaskTracker to catch metrics exceptions that were
|
||
causing heartbeats to fail. (Arun Murthy via cutting)
|
||
|
||
35. HADOOP-881. Fix JobTracker web interface to display the correct
|
||
number of task failures. (Sanjay Dahiya via cutting)
|
||
|
||
36. HADOOP-788. Change contrib/streaming to subclass TextInputFormat,
|
||
permitting it to take advantage of native compression facilities.
|
||
(Sanjay Dahiya via cutting)
|
||
|
||
37. HADOOP-962. In contrib/ec2: make scripts executable in tar file;
|
||
add a README; make the environment file use a template.
|
||
(Tom White via cutting)
|
||
|
||
38. HADOOP-549. Fix a NullPointerException in TaskReport's
|
||
serialization. (omalley via cutting)
|
||
|
||
39. HADOOP-963. Fix remote exceptions to have the stack trace of the
|
||
caller thread, not the IPC listener thread. (omalley via cutting)
|
||
|
||
40. HADOOP-967. Change RPC clients to start sending a version header.
|
||
(omalley via cutting)
|
||
|
||
41. HADOOP-964. Fix a bug introduced by HADOOP-830 where jobs failed
|
||
whose comparators and/or i/o types were in the job's jar.
|
||
(Dennis Kubes via cutting)
|
||
|
||
42. HADOOP-969. Fix a deadlock in JobTracker. (omalley via cutting)
|
||
|
||
43. HADOOP-862. Add support for the S3 FileSystem to the CopyFiles
|
||
tool. (Michael Stack via cutting)
|
||
|
||
44. HADOOP-965. Fix IsolationRunner so that job's jar can be found.
|
||
(Dennis Kubes via cutting)
|
||
|
||
45. HADOOP-309. Fix two NullPointerExceptions in StatusHttpServer.
|
||
(navychen via cutting)
|
||
|
||
46. HADOOP-692. Add rack awareness to HDFS's placement of blocks.
|
||
(Hairong Kuang via cutting)
|
||
|
||
|
||
Release 0.10.1 - 2007-01-10
|
||
|
||
1. HADOOP-857. Fix S3 FileSystem implementation to permit its use
|
||
for MapReduce input and output. (Tom White via cutting)
|
||
|
||
2. HADOOP-863. Reduce logging verbosity introduced by HADOOP-813.
|
||
(Devaraj Das via cutting)
|
||
|
||
3. HADOOP-815. Fix memory leaks in JobTracker. (Arun C Murthy via cutting)
|
||
|
||
4. HADOOP-600. Fix a race condition in JobTracker.
|
||
(Arun C Murthy via cutting)
|
||
|
||
5. HADOOP-864. Fix 'bin/hadoop -jar' to operate correctly when
|
||
hadoop.tmp.dir does not yet exist. (omalley via cutting)
|
||
|
||
6. HADOOP-866. Fix 'dfs -get' command to remove existing crc files,
|
||
if any. (Milind Bhandarkar via cutting)
|
||
|
||
7. HADOOP-871. Fix a bug in bin/hadoop setting JAVA_LIBRARY_PATH.
|
||
(Arun C Murthy via cutting)
|
||
|
||
8. HADOOP-868. Decrease the number of open files during map,
|
||
respecting io.sort.factor. (Devaraj Das via cutting)
|
||
|
||
9. HADOOP-865. Fix S3 FileSystem so that partially created files can
|
||
be deleted. (Tom White via cutting)
|
||
|
||
10. HADOOP-873. Pass java.library.path correctly to child processes.
|
||
(omalley via cutting)
|
||
|
||
11. HADOOP-851. Add support for the LZO codec. This is much faster
|
||
than the default, zlib-based compression, but it is only available
|
||
when the native library is built. (Arun C Murthy via cutting)
|
||
|
||
12. HADOOP-880. Fix S3 FileSystem to remove directories.
|
||
(Tom White via cutting)
|
||
|
||
13. HADOOP-879. Fix InputFormatBase to handle output generated by
|
||
MapFileOutputFormat. (cutting)
|
||
|
||
14. HADOOP-659. In HDFS, prioritize replication of blocks based on
|
||
current replication level. Blocks which are severely
|
||
under-replicated should be further replicated before blocks which
|
||
are less under-replicated. (Hairong Kuang via cutting)
|
||
|
||
15. HADOOP-726. Deprecate FileSystem locking methods. They are not
|
||
currently usable. Locking should eventually be provided as an
|
||
independent service. (Raghu Angadi via cutting)
|
||
|
||
16. HADOOP-758. Fix exception handling during reduce so that root
|
||
exceptions are not masked by exceptions in cleanups.
|
||
(Raghu Angadi via cutting)
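
The ordering described in item 14 (HADOOP-659) can be illustrated with a
small sketch. This is not the namenode's code: the Block class, its fields,
and the "deficit" measure are hypothetical names chosen for illustration, and
sorting by missing replicas is only one simple way to put severely
under-replicated blocks first.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Comparator;
import java.util.List;

public class ReplicationOrder {
    // Hypothetical record of a block's replica counts; not a Hadoop class.
    static final class Block {
        final String id;
        final int current;   // replicas that exist now
        final int expected;  // replicas the file asks for

        Block(String id, int current, int expected) {
            this.id = id;
            this.current = current;
            this.expected = expected;
        }

        // Number of replicas still missing.
        int deficit() {
            return expected - current;
        }
    }

    // Returns a copy ordered so that blocks missing the most replicas come first.
    static List<Block> byUrgency(List<Block> blocks) {
        List<Block> sorted = new ArrayList<>(blocks);
        Comparator<Block> byDeficit = Comparator.comparingInt(Block::deficit);
        sorted.sort(byDeficit.reversed());
        return sorted;
    }

    public static void main(String[] args) {
        List<Block> queue = byUrgency(Arrays.asList(
                new Block("a", 2, 3),
                new Block("b", 1, 3),
                new Block("c", 3, 3)));
        for (Block b : queue) {
            System.out.println(b.id + " missing " + b.deficit());
        }
    }
}
```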
|
||
|
||
|
||
Release 0.10.0 - 2007-01-05
|
||
|
||
1. HADOOP-763. Change DFS namenode benchmark to not use MapReduce.
|
||
(Nigel Daley via cutting)
|
||
|
||
2. HADOOP-777. Use fully-qualified hostnames for tasktrackers and
|
||
datanodes. (Mahadev Konar via cutting)
|
||
|
||
3. HADOOP-621. Change 'dfs -cat' to exit sooner when output has been
|
||
closed. (Dhruba Borthakur via cutting)
|
||
|
||
4. HADOOP-752. Rationalize some synchronization in DFS namenode.
|
||
(Dhruba Borthakur via cutting)
|
||
|
||
5. HADOOP-629. Fix RPC services to better check the protocol name and
|
||
version. (omalley via cutting)
|
||
|
||
6. HADOOP-774. Limit the number of invalid blocks returned with
|
||
heartbeats by the namenode to datanodes. Transmitting and
|
||
processing very large invalid block lists can tie up both the
|
||
namenode and datanode for too long. (Dhruba Borthakur via cutting)
|
||
|
||
7. HADOOP-738. Change 'dfs -get' command to not create CRC files by
|
||
default, adding a -crc option to force their creation.
|
||
(Milind Bhandarkar via cutting)
|
||
|
||
8. HADOOP-676. Improved exceptions and error messages for common job
|
||
input specification errors. (Sanjay Dahiya via cutting)
|
||
|
||
9. [Included in 0.9.2 release]
|
||
|
||
10. HADOOP-756. Add new dfsadmin option to wait for filesystem to be
|
||
operational. (Dhruba Borthakur via cutting)
|
||
|
||
11. HADOOP-770. Fix jobtracker web interface to display, on restart,
|
||
jobs that were running when it was last stopped.
|
||
(Sanjay Dahiya via cutting)
|
||
|
||
12. HADOOP-331. Write all map outputs to a single file with an index,
|
||
rather than to a separate file per reduce task. This should both
|
||
speed the shuffle and make things more scalable.
|
||
(Devaraj Das via cutting)
|
||
|
||
13. HADOOP-818. Fix contrib unit tests to not depend on core unit
|
||
tests. (omalley via cutting)
|
||
|
||
14. HADOOP-786. Log common exception at debug level.
|
||
(Sanjay Dahiya via cutting)
|
||
|
||
15. HADOOP-796. Provide more convenient access to failed task
|
||
information in the web interface. (Sanjay Dahiya via cutting)
|
||
|
||
16. HADOOP-764. Somewhat reduce memory allocations in the namenode.
|
||
(Dhruba Borthakur via cutting)
|
||
|
||
17. HADOOP-802. Update description of mapred.speculative.execution to
|
||
mention reduces. (Nigel Daley via cutting)
|
||
|
||
18. HADOOP-806. Include link to datanodes on front page of namenode
|
||
web interface. (Raghu Angadi via cutting)
|
||
|
||
19. HADOOP-618. Make JobSubmissionProtocol public.
|
||
(Arun C Murthy via cutting)
|
||
|
||
20. HADOOP-782. Fully remove killed tasks. (Arun C Murthy via cutting)
|
||
|
||
21. HADOOP-792. Fix 'dfs -mv' to return correct status.
|
||
(Dhruba Borthakur via cutting)
|
||
|
||
22. HADOOP-673. Give each task its own working directory again.
|
||
(Mahadev Konar via cutting)
|
||
|
||
23. HADOOP-571. Extend the syntax of Path to be a URI; to be
|
||
optionally qualified with a scheme and authority. The scheme
|
||
determines the FileSystem implementation, while the authority
|
||
determines the FileSystem instance. New FileSystem
|
||
implementations may be provided by defining an fs.<scheme>.impl
|
||
property, naming the FileSystem implementation class. This
|
||
permits easy integration of new FileSystem implementations.
|
||
(cutting)
|
||
|
||
24. HADOOP-720. Add an HDFS white paper to website.
|
||
(Dhruba Borthakur via cutting)
|
||
|
||
25. HADOOP-794. Fix a divide-by-zero exception when a job specifies
|
||
zero map tasks. (omalley via cutting)
|
||
|
||
26. HADOOP-454. Add a 'dfs -dus' command that provides summary disk
|
||
usage. (Hairong Kuang via cutting)
|
||
|
||
27. HADOOP-574. Add an Amazon S3 implementation of FileSystem. To
|
||
use this, one need only specify paths of the form
|
||
s3://id:secret@bucket/. Alternately, the AWS access key id and
|
||
secret can be specified in your config, with the properties
|
||
fs.s3.awsAccessKeyId and fs.s3.awsSecretAccessKey.
|
||
(Tom White via cutting)
|
||
|
||
28. HADOOP-824. Rename DFSShell to be FsShell, since it applies
|
||
generically to all FileSystem implementations. (cutting)
|
||
|
||
29. HADOOP-813. Fix map output sorting to report progress, so that
|
||
sorts which take longer than the task timeout do not fail.
|
||
(Devaraj Das via cutting)
|
||
|
||
30. HADOOP-825. Fix HDFS daemons when configured with new URI syntax.
|
||
(omalley via cutting)
|
||
|
||
31. HADOOP-596. Fix a bug in phase reporting during reduce.
|
||
(Sanjay Dahiya via cutting)
|
||
|
||
32. HADOOP-811. Add a utility, MultithreadedMapRunner.
|
||
(Alejandro Abdelnur via cutting)
|
||
|
||
33. HADOOP-829. Within HDFS, clearly separate three different
|
||
representations for datanodes: one for RPCs, one for
|
||
namenode-internal use, and one for namespace persistence.
|
||
(Dhruba Borthakur via cutting)
|
||
|
||
34. HADOOP-823. Fix problem starting datanode when not all configured
|
||
data directories exist. (Bryan Pendleton via cutting)
|
||
|
||
35. HADOOP-451. Add a Split interface. CAUTION: This incompatibly
|
||
changes the InputFormat and RecordReader interfaces. Not only is
|
||
FileSplit replaced with Split, but a FileSystem parameter is no
|
||
longer passed in several methods, input validation has changed,
|
||
etc. (omalley via cutting)
|
||
|
||
36. HADOOP-814. Optimize locking in namenode. (Dhruba Borthakur via cutting)
|
||
|
||
37. HADOOP-738. Change 'fs -put' and 'fs -get' commands to accept
|
||
standard input and output, respectively. Standard i/o is
|
||
specified by a file named '-'. (Wendy Chien via cutting)
|
||
|
||
38. HADOOP-835. Fix a NullPointerException reading record-compressed
|
||
SequenceFiles. (Hairong Kuang via cutting)
|
||
|
||
39. HADOOP-836. Fix a MapReduce bug on Windows, where the wrong
|
||
FileSystem was used. Also add a static FileSystem.getLocal()
|
||
method and better Path checking in HDFS, to help avoid such issues
|
||
in the future. (omalley via cutting)
|
||
|
||
40. HADOOP-837. Improve RunJar utility to unpack jar file to
|
||
hadoop.tmp.dir, rather than the system temporary directory.
|
||
(Hairong Kuang via cutting)
|
||
|
||
41. HADOOP-841. Fix native library to build 32-bit version even when
|
||
on a 64-bit host, if a 32-bit JVM is used. (Arun C Murthy via cutting)
|
||
|
||
42. HADOOP-838. Fix tasktracker to pass java.library.path to
|
||
sub-processes, so that libhadoop.a is found.
|
||
(Arun C Murthy via cutting)
|
||
|
||
43. HADOOP-844. Send metrics messages on a fixed-delay schedule
|
||
instead of a fixed-rate schedule. (David Bowen via cutting)
|
||
|
||
44. HADOOP-849. Fix OutOfMemory exceptions in TaskTracker due to a
|
||
file handle leak in SequenceFile. (Devaraj Das via cutting)
|
||
|
||
45. HADOOP-745. Fix a synchronization bug in the HDFS namenode.
|
||
(Dhruba Borthakur via cutting)
|
||
|
||
46. HADOOP-850. Add Writable implementations for variable-length
|
||
integers. (ab via cutting)
|
||
|
||
47. HADOOP-525. Add raw comparators to record types. This greatly
|
||
improves record sort performance. (Milind Bhandarkar via cutting)
|
||
|
||
48. HADOOP-628. Fix a problem with 'fs -cat' command, where some
|
||
characters were replaced with question marks. (Wendy Chien via cutting)
|
||
|
||
49. HADOOP-804. Reduce verbosity of MapReduce logging.
|
||
(Sanjay Dahiya via cutting)
|
||
|
||
50. HADOOP-853. Rename 'site' to 'docs', in preparation for inclusion
|
||
in releases. (cutting)
|
||
|
||
51. HADOOP-371. Include contrib jars and site documentation in
|
||
distributions. Also add contrib and example documentation to
|
||
distributed javadoc, in separate sections. (Nigel Daley via cutting)
|
||
|
||
52. HADOOP-846. Report progress during entire map, as sorting of
|
||
intermediate outputs may happen at any time, potentially causing
|
||
task timeouts. (Devaraj Das via cutting)
|
||
|
||
53. HADOOP-840. In task tracker, queue task cleanups and perform them
|
||
in a separate thread. (omalley & Mahadev Konar via cutting)
|
||
|
||
54. HADOOP-681. Add to HDFS the ability to decommission nodes. This
|
||
causes their blocks to be re-replicated on other nodes, so that
|
||
they may be removed from a cluster. (Dhruba Borthakur via cutting)
|
||
|
||
55. HADOOP-470. In HDFS web ui, list the datanodes containing each
|
||
copy of a block. (Hairong Kuang via cutting)
|
||
|
||
56. HADOOP-700. Change bin/hadoop to only include core jar file on
|
||
classpath, not example, test, etc. Also rename core jar to
|
||
hadoop-${version}-core.jar so that it can be more easily
|
||
identified. (Nigel Daley via cutting)
|
||
|
||
57. HADOOP-619. Extend InputFormatBase to accept individual files and
|
||
glob patterns as MapReduce inputs, not just directories. Also
|
||
change contrib/streaming to use this. (Sanjay Dahiya via cutting)
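
Item 23 (HADOOP-571) says the scheme of a Path selects the FileSystem
implementation through an fs.<scheme>.impl property. A minimal sketch of
that lookup key, using only java.net.URI, might look as follows. The class
and method names are hypothetical, and falling back to a caller-supplied
default scheme for unqualified paths is an assumption made for illustration,
not something the entry states.

```java
import java.net.URI;

public class SchemeLookup {
    // Builds the property name fs.<scheme>.impl for a path URI.
    // defaultScheme is used when the URI carries no scheme (an assumption).
    static String implKey(URI uri, String defaultScheme) {
        String scheme = uri.getScheme() != null ? uri.getScheme() : defaultScheme;
        return "fs." + scheme + ".impl";
    }

    public static void main(String[] args) {
        URI qualified = URI.create("hdfs://namenode:9000/user/data");
        URI bare = URI.create("/user/data");

        System.out.println(implKey(qualified, "file")); // fs.hdfs.impl
        System.out.println(qualified.getAuthority());   // namenode:9000
        System.out.println(implKey(bare, "file"));      // fs.file.impl
    }
}
```

The authority part (namenode:9000 here) is what the entry describes as
selecting the FileSystem instance once the implementation class is known.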
|
||
|
||
|
||
Release 0.9.2 - 2006-12-15
|
||
|
||
1. HADOOP-639. Restructure InterTrackerProtocol to make task
|
||
accounting more reliable. (Arun C Murthy via cutting)
|
||
|
||
2. HADOOP-827. Turn off speculative execution by default, since it's
|
||
currently broken. (omalley via cutting)
|
||
|
||
3. HADOOP-791. Fix a deadlock in the task tracker.
|
||
(Mahadev Konar via cutting)
|
||
|
||
|
||
Release 0.9.1 - 2006-12-06
|
||
|
||
1. HADOOP-780. Use ReflectionUtils to instantiate key and value
|
||
objects. (ab)
|
||
|
||
2. HADOOP-779. Fix contrib/streaming to work correctly with gzipped
|
||
input files. (Hairong Kuang via cutting)
|
||
|
||
|
||
Release 0.9.0 - 2006-12-01
|
||
|
||
1. HADOOP-655. Remove most deprecated code. A few deprecated things
|
||
remain, notably UTF8 and some methods that are still required.
|
||
Also cleaned up constructors for SequenceFile, MapFile, SetFile,
|
||
and ArrayFile a bit. (cutting)
|
||
|
||
2. HADOOP-565. Upgrade to Jetty version 6. (Sanjay Dahiya via cutting)
|
||
|
||
3. HADOOP-682. Fix DFS format command to work correctly when
|
||
configured with a non-existent directory. (Sanjay Dahiya via cutting)
|
||
|
||
4. HADOOP-645. Fix a bug in contrib/streaming when -reducer is NONE.
|
||
(Dhruba Borthakur via cutting)
|
||
|
||
5. HADOOP-687. Fix a classpath bug in bin/hadoop that blocked the
|
||
servers from starting. (Sameer Paranjpye via omalley)
|
||
|
||
6. HADOOP-683. Remove a script dependency on bash, so it works with
|
||
dash, the new default for /bin/sh on Ubuntu. (James Todd via cutting)
|
||
|
||
7. HADOOP-382. Extend unit tests to run multiple datanodes.
|
||
(Milind Bhandarkar via cutting)
|
||
|
||
8. HADOOP-604. Fix some synchronization issues and a
|
||
NullPointerException in DFS datanode. (Raghu Angadi via cutting)
|
||
|
||
9. HADOOP-459. Fix memory leaks and a host of other issues with
|
||
libhdfs. (Sameer Paranjpye via cutting)
|
||
|
||
10. HADOOP-694. Fix a NullPointerException in jobtracker.
|
||
(Mahadev Konar via cutting)
|
||
|
||
11. HADOOP-637. Fix a memory leak in the IPC server. Direct buffers
|
||
are not collected like normal buffers, and provided little
|
||
advantage. (Raghu Angadi via cutting)
|
||
|
||
12. HADOOP-696. Fix TestTextInputFormat unit test to not rely on the
|
||
order of directory listings. (Sameer Paranjpye via cutting)
|
||
|
||
13. HADOOP-611. Add support for iterator-based merging to
|
||
SequenceFile. (Devaraj Das via cutting)
|
||
|
||
14. HADOOP-688. Move DFS administrative commands to a separate
|
||
command named 'dfsadmin'. (Dhruba Borthakur via cutting)
|
||
|
||
15. HADOOP-708. Fix test-libhdfs to return the correct status, so
|
||
that failures will break the build. (Nigel Daley via cutting)
|
||
|
||
16. HADOOP-646. Fix namenode to handle edits files larger than 2GB.
|
||
(Milind Bhandarkar via cutting)
|
||
|
||
17. HADOOP-705. Fix a bug in the JobTracker when failed jobs were
|
||
not completely cleaned up. (Mahadev Konar via cutting)
|
||
|
||
18. HADOOP-613. Perform final merge while reducing. This removes one
|
||
sort pass over the data and should consequently significantly
|
||
decrease overall processing time. (Devaraj Das via cutting)
|
||
|
||
19. HADOOP-661. Make each job's configuration visible through the web
|
||
ui. (Arun C Murthy via cutting)
|
||
|
||
20. HADOOP-489. In MapReduce, separate user logs from system logs.
|
||
Each task's log output is now available through the web ui. (Arun
|
||
C Murthy via cutting)
|
||
|
||
21. HADOOP-712. Fix record io's xml serialization to correctly handle
|
||
control-characters. (Milind Bhandarkar via cutting)
|
||
|
||
22. HADOOP-668. Improvements to the web-based DFS browser.
|
||
(Hairong Kuang via cutting)
|
||
|
||
23. HADOOP-715. Fix build.xml so that test logs are written in build
|
||
directory, rather than in CWD. (Arun C Murthy via cutting)
|
||
|
||
24. HADOOP-538. Add support for building an optional native library,
|
||
libhadoop.so, that improves the performance of zlib-based
|
||
compression. To build this, specify -Dcompile.native to Ant.
|
||
(Arun C Murthy via cutting)
|
||
|
||
25. HADOOP-610. Fix a problem when the DFS block size is configured
|
||
to be smaller than the buffer size, typically only when debugging.
|
||
(Milind Bhandarkar via cutting)
|
||
|
||
26. HADOOP-695. Fix a NullPointerException in contrib/streaming.
|
||
(Hairong Kuang via cutting)
|
||
|
||
27. HADOOP-652. In DFS, when a file is deleted, the block count is
|
||
now decremented. (Vladimir Krokhmalyov via cutting)
|
||
|
||
28. HADOOP-725. In DFS, optimize block placement algorithm,
|
||
previously a performance bottleneck. (Milind Bhandarkar via cutting)
|
||
|
||
29. HADOOP-723. In MapReduce, fix a race condition during the
|
||
shuffle, which resulted in FileNotFoundExceptions. (omalley via cutting)
|
||
|
||
30. HADOOP-447. In DFS, fix getBlockSize(Path) to work with relative
|
||
paths. (Raghu Angadi via cutting)
|
||
|
||
31. HADOOP-733. Make exit codes in DFSShell consistent and add a unit
|
||
test. (Dhruba Borthakur via cutting)
|
||
|
||
32. HADOOP-709. Fix contrib/streaming to work with commands that
|
||
contain control characters. (Dhruba Borthakur via cutting)
|
||
|
||
33. HADOOP-677. In IPC, permit a version header to be transmitted
|
||
when connections are established. This will permit us to change
|
||
the format of IPC requests back-compatibly in subsequent releases.
|
||
(omalley via cutting)
|
||
|
||
34. HADOOP-699. Fix DFS web interface so that filesystem browsing
|
||
works correctly, using the right port number. Also add support
|
||
for sorting datanode list by various columns.
|
||
(Raghu Angadi via cutting)
|
||
|
||
35. HADOOP-76. Implement speculative reduce. Now when a job is
|
||
configured for speculative execution, both maps and reduces will
|
||
execute speculatively. Reduce outputs are written to temporary
|
||
location and moved to the final location when reduce is complete.
|
||
(Sanjay Dahiya via cutting)
|
||
|
||
36. HADOOP-736. Roll back to Jetty 5.1.4, due to performance problems
|
||
with Jetty 6.0.1.
|
||
|
||
37. HADOOP-739. Fix TestIPC to use different port number, making it
|
||
more reliable. (Nigel Daley via cutting)
|
||
|
||
38. HADOOP-749. Fix a NullPointerException in jobfailures.jsp.
|
||
(omalley via cutting)
|
||
|
||
39. HADOOP-747. Fix record serialization to work correctly when
|
||
records are embedded in Maps. (Milind Bhandarkar via cutting)
|
||
|
||
40. HADOOP-698. Fix HDFS client not to retry the same datanode on
|
||
read failures. (Milind Bhandarkar via cutting)
|
||
|
||
41. HADOOP-689. Add GenericWritable, to facilitate polymorphism in
|
||
MapReduce, SequenceFile, etc. (Feng Jiang via cutting)
|
||
|
||
42. HADOOP-430. Stop datanode's HTTP server when registration with
|
||
namenode fails. (Wendy Chien via cutting)
|
||
|
||
43. HADOOP-750. Fix a potential race condition during mapreduce
|
||
shuffle. (omalley via cutting)
|
||
|
||
44. HADOOP-728. Fix contrib/streaming-related issues, including
|
||
'-reducer NONE'. (Sanjay Dahiya via cutting)
|
||
|
||
|
||
Release 0.8.0 - 2006-11-03
|
||
|
||
1. HADOOP-477. Extend contrib/streaming to scan the PATH environment
|
||
variable when resolving executable program names.
|
||
(Dhruba Borthakur via cutting)
|
||
|
||
2. HADOOP-583. In DFSClient, reduce the log level of re-connect
|
||
attempts from 'info' to 'debug', so they are not normally shown.
|
||
(Konstantin Shvachko via cutting)
|
||
|
||
3. HADOOP-498. Re-implement DFS integrity checker to run server-side,
|
||
for much improved performance. (Milind Bhandarkar via cutting)
|
||
|
||
4. HADOOP-586. Use the jar name for otherwise un-named jobs.
|
||
(Sanjay Dahiya via cutting)
|
||
|
||
5. HADOOP-514. Make DFS heartbeat interval configurable.
|
||
(Milind Bhandarkar via cutting)
|
||
|
||
6. HADOOP-588. Fix logging and accounting of failed tasks.
|
||
(Sanjay Dahiya via cutting)
|
||
|
||
7. HADOOP-462. Improve command line parsing in DFSShell, so that
|
||
incorrect numbers of arguments result in informative errors rather
|
||
than ArrayIndexOutOfBoundsException. (Dhruba Borthakur via cutting)
|
||
|
||
8. HADOOP-561. Fix DFS so that one replica of each block is written
|
||
locally, if possible. This was the intent, but there was a bug.
|
||
(Dhruba Borthakur via cutting)
|
||
|
||
9. HADOOP-610. Fix TaskTracker to survive more exceptions, keeping
|
||
tasks from becoming lost. (omalley via cutting)
|
||
|
||
10. HADOOP-625. Add a servlet to all http daemons that displays a
|
||
stack dump, useful for debugging. (omalley via cutting)
|
||
|
||
11. HADOOP-554. Fix DFSShell to return -1 for errors.
|
||
(Dhruba Borthakur via cutting)
|
||
|
||
12. HADOOP-626. Correct the documentation in the NNBench example
|
||
code, and also remove a mistaken call there.
|
||
(Nigel Daley via cutting)
|
||
|
||
13. HADOOP-634. Add missing license to many files.
|
||
(Nigel Daley via cutting)
|
||
|
||
14. HADOOP-627. Fix some synchronization problems in MiniMRCluster
|
||
that sometimes caused unit tests to fail. (Nigel Daley via cutting)
|
||
|
||
15. HADOOP-563. Improve the NameNode's lease policy so that leases
|
||
are held for one hour without renewal (instead of one minute).
|
||
However another attempt to create the same file will still succeed
|
||
if the lease has not been renewed within a minute. This prevents
|
||
communication or scheduling problems from causing a write to fail
|
||
for up to an hour, barring some other process trying to create the
|
||
same file. (Dhruba Borthakur via cutting)
|
||
|
||
16. HADOOP-635. In DFSShell, permit specification of multiple files
|
||
as the source for file copy and move commands.
|
||
(Dhruba Borthakur via cutting)
|
||
|
||
17. HADOOP-641. Change NameNode to request a fresh block report from
|
||
a re-discovered DataNode, so that no-longer-needed replications
|
||
are stopped promptly. (Konstantin Shvachko via cutting)
|
||
|
||
18. HADOOP-642. Change IPC client to specify an explicit connect
|
||
timeout. (Konstantin Shvachko via cutting)
|
||
|
||
19. HADOOP-638. Fix an unsynchronized access to TaskTracker's
|
||
internal state. (Nigel Daley via cutting)
|
||
|
||
20. HADOOP-624. Fix servlet path to stop a Jetty warning on startup.
|
||
(omalley via cutting)
|
||
|
||
21. HADOOP-578. Failed tasks are no longer placed at the end of the
|
||
task queue. This was originally done to work around other
|
||
problems that have now been fixed. Re-executing failed tasks
|
||
sooner causes buggy jobs to fail faster. (Sanjay Dahiya via cutting)
|
||
|
||
22. HADOOP-658. Update source file headers per Apache policy. (cutting)
|
||
|
||
23. HADOOP-636. Add MapFile & ArrayFile constructors which accept a
|
||
Progressable, and pass it down to SequenceFile. This permits
|
||
reduce tasks which use MapFile to still report progress while
|
||
writing blocks to the filesystem. (cutting)
|
||
|
||
24. HADOOP-576. Enable contrib/streaming to use the file cache. Also
|
||
extend the cache to permit symbolic links to cached items, rather
|
||
than local file copies. (Mahadev Konar via cutting)
|
||
|
||
25. HADOOP-482. Fix unit tests to work when a cluster is running on
|
||
the same machine, removing port conflicts. (Wendy Chien via cutting)
|
||
|
||
26. HADOOP-90. Permit dfs.name.dir to list multiple directories,
|
||
where namenode data is to be replicated. (Milind Bhandarkar via cutting)
|
||
|
||
27. HADOOP-651. Fix DFSCk to correctly pass parameters to the servlet
|
||
on the namenode. (Milind Bhandarkar via cutting)
|
||
|
||
28. HADOOP-553. Change main() routines of DataNode and NameNode to
|
||
log exceptions rather than letting the JVM print them to standard
|
||
error. Also, change the hadoop-daemon.sh script to rotate
|
||
standard i/o log files. (Raghu Angadi via cutting)
|
||
|
||
29. HADOOP-399. Fix javadoc warnings. (Nigel Daley via cutting)
|
||
|
||
30. HADOOP-599. Fix web ui and command line to correctly report DFS
|
||
filesystem size statistics. Also improve web layout.
|
||
(Raghu Angadi via cutting)
|
||
|
||
31. HADOOP-660. Permit specification of junit test output format.
(Nigel Daley via cutting)

32. HADOOP-663. Fix a few unit test issues. (Mahadev Konar via cutting)

33. HADOOP-664. Cause entire build to fail if libhdfs tests fail.
(Nigel Daley via cutting)

34. HADOOP-633. Keep jobtracker from dying when job initialization
throws exceptions. Also improve exception handling in a few other
places and add more informative thread names.
(omalley via cutting)

35. HADOOP-669. Fix a problem introduced by HADOOP-90 that can cause
DFS to lose files. (Milind Bhandarkar via cutting)

36. HADOOP-373. Consistently check the value returned by
FileSystem.mkdirs(). (Wendy Chien via cutting)

37. HADOOP-670. Code cleanups in some DFS internals: use generic
types, replace Vector with ArrayList, etc.
(Konstantin Shvachko via cutting)

38. HADOOP-647. Permit map outputs to use a different compression
type than the job output. (omalley via cutting)

39. HADOOP-671. Fix file cache to check for pre-existence before
creating. (Mahadev Konar via cutting)

40. HADOOP-665. Extend many DFSShell commands to accept multiple
arguments. Now commands like "ls", "rm", etc. will operate on
multiple files. (Dhruba Borthakur via cutting)


Release 0.7.2 - 2006-10-18

1. HADOOP-607. Fix a bug where classes included in job jars were not
found by tasks. (Mahadev Konar via cutting)

2. HADOOP-609. Add a unit test that checks that classes in job jars
can be found by tasks. Also modify unit tests to specify multiple
local directories. (Mahadev Konar via cutting)


Release 0.7.1 - 2006-10-11

1. HADOOP-593. Fix a NullPointerException in the JobTracker.
(omalley via cutting)

2. HADOOP-592. Fix a NullPointerException in the IPC Server. Also
consistently log when stale calls are discarded. (omalley via cutting)

3. HADOOP-594. Increase the DFS safe-mode threshold from .95 to
.999, so that nearly all blocks must be reported before filesystem
modifications are permitted. (Konstantin Shvachko via cutting)

4. HADOOP-598. Fix tasks to retry when reporting completion, so that
a single RPC timeout won't fail a task. (omalley via cutting)

5. HADOOP-597. Fix TaskTracker to not discard map outputs for errors
in transmitting them to reduce nodes. (omalley via cutting)


Release 0.7.0 - 2006-10-06

1. HADOOP-243. Fix rounding in the display of task and job progress
so that things are not shown to be 100% complete until they are in
fact finished. (omalley via cutting)

2. HADOOP-438. Limit the length of absolute paths in DFS, since the
file format used to store pathnames has some limitations.
(Wendy Chien via cutting)

3. HADOOP-530. Improve error messages in SequenceFile when keys or
values are of the wrong type. (Hairong Kuang via cutting)

4. HADOOP-288. Add a file caching system and use it in MapReduce to
cache job jar files on slave nodes. (Mahadev Konar via cutting)

5. HADOOP-533. Fix unit test to not modify conf directory.
(Hairong Kuang via cutting)

6. HADOOP-527. Permit specification of the local address that various
Hadoop daemons should bind to. (Philippe Gassmann via cutting)

7. HADOOP-542. Updates to contrib/streaming: reformatted source code,
on-the-fly merge sort, a fix for HADOOP-540, etc.
(Michel Tourn via cutting)

8. HADOOP-545. Remove an unused config file parameter.
(Philippe Gassmann via cutting)

9. HADOOP-548. Add an Ant property "test.output" to build.xml that
causes test output to be logged to the console. (omalley via cutting)

10. HADOOP-261. Record an error message when map output is lost.
(omalley via cutting)

11. HADOOP-293. Report the full list of task error messages in the
web ui, not just the most recent. (omalley via cutting)

12. HADOOP-551. Restore JobClient's console printouts to only include
a maximum of one update per one percent of progress.
(omalley via cutting)

13. HADOOP-306. Add a "safe" mode to DFS. The name node enters this
when less than a specified percentage of file data is complete.
Currently safe mode is only used on startup, but eventually it
will also be entered when datanodes disconnect and file data
becomes incomplete. While in safe mode no filesystem
modifications are permitted and block replication is inhibited.
(Konstantin Shvachko via cutting)

14. HADOOP-431. Change 'dfs -rm' to not operate recursively and add a
new command, 'dfs -rmr' which operates recursively.
(Sameer Paranjpye via cutting)

15. HADOOP-263. Include timestamps for job transitions. The web
interface now displays the start and end times of tasks and the
start times of sorting and reducing for reduce tasks. Also,
extend ObjectWritable to handle enums, so that they can be passed
as RPC parameters. (Sanjay Dahiya via cutting)

16. HADOOP-556. Contrib/streaming: send keep-alive reports to task
tracker every 10 seconds rather than every 100 records, to avoid
task timeouts. (Michel Tourn via cutting)

17. HADOOP-547. Fix reduce tasks to ping tasktracker while copying
data, rather than only between copies, avoiding task timeouts.
(Sanjay Dahiya via cutting)

18. HADOOP-537. Fix src/c++/libhdfs build process to create files in
build/, no longer modifying the source tree.
(Arun C Murthy via cutting)

19. HADOOP-487. Throw a more informative exception for unknown RPC
hosts. (Sameer Paranjpye via cutting)

20. HADOOP-559. Add file name globbing (pattern matching) support to
the FileSystem API, and use it in DFSShell ('bin/hadoop dfs')
commands. (Hairong Kuang via cutting)

21. HADOOP-508. Fix a bug in FSDataInputStream. Incorrect data was
returned after seeking to a random location.
(Milind Bhandarkar via cutting)

22. HADOOP-560. Add a "killed" task state. This can be used to
distinguish kills from other failures. Task state has also been
converted to use an enum type instead of an int, uncovering a bug
elsewhere. The web interface is also updated to display killed
tasks. (omalley via cutting)

23. HADOOP-423. Normalize Paths containing directories named "." and
"..", using the standard, unix interpretation. Also add checks in
DFS, prohibiting the use of "." or ".." as directory or file
names. (Wendy Chien via cutting)

24. HADOOP-513. Replace map output handling with a servlet, rather
than a JSP page. This fixes an issue where
IllegalStateExceptions were logged, sets content-length
correctly, and better handles some errors. (omalley via cutting)

25. HADOOP-552. Improved error checking when copying map output files
to reduce nodes. (omalley via cutting)

26. HADOOP-566. Fix scripts to work correctly when accessed through
relative symbolic links. (Lee Faris via cutting)

27. HADOOP-519. Add positioned read methods to FSInputStream. These
permit one to read from a stream without moving its position, and
can hence be performed by multiple threads at once on a single
stream. Implement an optimized version for DFS and local FS.
(Milind Bhandarkar via cutting)

28. HADOOP-522. Permit block compression with MapFile and SetFile.
Since these formats are always sorted, block compression can
provide a big advantage. (cutting)

29. HADOOP-567. Record version and revision information in builds. A
package manifest is added to the generated jar file containing
version information, and a VersionInfo utility is added that
includes further information, including the build date and user,
and the subversion revision and repository. A 'bin/hadoop
version' command is added to show this information, and it is also
added to various web interfaces. (omalley via cutting)

30. HADOOP-568. Fix so that errors while initializing tasks on a
tasktracker correctly report the task as failed to the jobtracker,
so that it will be rescheduled. (omalley via cutting)

31. HADOOP-550. Disable automatic UTF-8 validation in Text. This
permits, e.g., TextInputFormat to again operate on non-UTF-8 data.
(Hairong and Mahadev via cutting)

32. HADOOP-343. Fix mapred copying so that a failed tasktracker
doesn't cause other copies to slow. (Sameer Paranjpye via cutting)

33. HADOOP-239. Add a persistent job history mechanism, so that basic
job statistics are not lost after 24 hours and/or when the
jobtracker is restarted. (Sanjay Dahiya via cutting)

34. HADOOP-506. Ignore heartbeats from stale task trackers.
(Sanjay Dahiya via cutting)

35. HADOOP-255. Discard stale, queued IPC calls. Do not process
calls whose clients will likely time out before they receive a
response. When the queue is full, new calls are now received and
queued, and the oldest calls are discarded, so that, when servers
get bogged down, they no longer develop a backlog on the socket.
This should improve some DFS namenode failure modes.
(omalley via cutting)

36. HADOOP-581. Fix datanode to not reset itself on communications
errors with the namenode. If a request to the namenode fails, the
datanode should retry, not restart. This reduces the load on the
namenode, since restarts cause a resend of the block report.
(omalley via cutting)


Release 0.6.2 - 2006-09-18

1. HADOOP-532. Fix a bug reading value-compressed sequence files,
where an exception was thrown reporting that the full value had not
been read. (omalley via cutting)

2. HADOOP-534. Change the default value class in JobConf to be Text
instead of the now-deprecated UTF8. This fixes the Grep example
program, which was updated to use Text, but relies on this
default. (Hairong Kuang via cutting)


Release 0.6.1 - 2006-09-13

1. HADOOP-520. Fix a bug in libhdfs, where write failures were not
correctly returning error codes. (Arun C Murthy via cutting)

2. HADOOP-523. Fix a NullPointerException when TextInputFormat is
explicitly specified. Also add a test case for this.
(omalley via cutting)

3. HADOOP-521. Fix another NullPointerException finding the
ClassLoader when using libhdfs. (omalley via cutting)

4. HADOOP-526. Fix a NullPointerException when attempting to start
two datanodes in the same directory. (Milind Bhandarkar via cutting)

5. HADOOP-529. Fix a NullPointerException when opening
value-compressed sequence files generated by pre-0.6.0 Hadoop.
(omalley via cutting)


Release 0.6.0 - 2006-09-08

1. HADOOP-427. Replace some uses of DatanodeDescriptor in the DFS
web UI code with DatanodeInfo, the preferred public class.
(Devaraj Das via cutting)

2. HADOOP-426. Fix streaming contrib module to work correctly on
Solaris. This was causing nightly builds to fail.
(Michel Tourn via cutting)

3. HADOOP-400. Improvements to task assignment. Tasks are no longer
re-run on nodes where they have failed (unless no other node is
available). Also, tasks are better load-balanced among nodes.
(omalley via cutting)

4. HADOOP-324. Fix datanode to not exit when a disk is full, but
rather simply to fail writes. (Wendy Chien via cutting)

5. HADOOP-434. Change smallJobsBenchmark to use standard Hadoop
scripts. (Sanjay Dahiya via cutting)

6. HADOOP-453. Fix a bug in Text.setCapacity(). (siren via cutting)

7. HADOOP-450. Change so that input types are determined by the
RecordReader rather than specified directly in the JobConf. This
facilitates jobs with a variety of input types.

WARNING: This contains incompatible API changes! The RecordReader
interface has two new methods that all user-defined InputFormats
must now define. Also, the values returned by TextInputFormat are
no longer of class UTF8, but now of class Text.

8. HADOOP-436. Fix an error-handling bug in the web ui.
(Devaraj Das via cutting)

9. HADOOP-455. Fix a bug in Text, where DEL was not permitted.
(Hairong Kuang via cutting)

10. HADOOP-456. Change the DFS namenode to keep a persistent record
of the set of known datanodes. This will be used to implement a
"safe mode" where filesystem changes are prohibited when a
critical percentage of the datanodes are unavailable.
(Konstantin Shvachko via cutting)

11. HADOOP-322. Add a job control utility. This permits one to
specify job interdependencies. Each job is submitted only after
the jobs it depends on have successfully completed.
(Runping Qi via cutting)

12. HADOOP-176. Fix a bug in IntWritable.Comparator.
(Dick King via cutting)

13. HADOOP-421. Replace uses of String in recordio package with Text
class, for improved handling of UTF-8 data.
(Milind Bhandarkar via cutting)

14. HADOOP-464. Improved error message when job jar not found.
(Michel Tourn via cutting)

15. HADOOP-469. Fix /bin/bash specifics that have crept into our
/bin/sh scripts since HADOOP-352.
(Jean-Baptiste Quenot via cutting)

16. HADOOP-468. Add HADOOP_NICENESS environment variable to set
scheduling priority for daemons. (Vetle Roeim via cutting)

17. HADOOP-473. Fix TextInputFormat to correctly handle more EOL
formats. Things now work correctly with CR, LF or CRLF.
(Dennis Kubes & James White via cutting)

18. HADOOP-461. Make Java 1.5 an explicit requirement. (cutting)

19. HADOOP-54. Add block compression to SequenceFile. One may now
specify that blocks of keys and values are compressed together,
improving compression for small keys and values.
SequenceFile.Writer's constructor is now deprecated and replaced
with a factory method. (Arun C Murthy via cutting)

20. HADOOP-281. Prohibit DFS files that are also directories.
(Wendy Chien via cutting)

21. HADOOP-486. Add the job username to JobStatus instances returned
by JobClient. (Mahadev Konar via cutting)

22. HADOOP-437. contrib/streaming: Add support for gzipped inputs.
(Michel Tourn via cutting)

23. HADOOP-463. Add variable expansion to config files.
Configuration property values may now contain variable
expressions. A variable is referenced with the syntax
'${variable}'. Variable values are found first in the
configuration, and then in Java system properties. The default
configuration is modified so that temporary directories are now
under ${hadoop.tmp.dir}, which is, by default,
/tmp/hadoop-${user.name}. (Michel Tourn via cutting)

24. HADOOP-419. Fix a NullPointerException finding the ClassLoader
when using libhdfs. (omalley via cutting)

25. HADOOP-460. Fix contrib/smallJobsBenchmark to use Text instead of
UTF8. (Sanjay Dahiya via cutting)

26. HADOOP-196. Fix Configuration(Configuration) constructor to work
correctly. (Sami Siren via cutting)

27. HADOOP-501. Fix Configuration.toString() to handle URL resources.
(Thomas Friol via cutting)

28. HADOOP-499. Reduce the use of Strings in contrib/streaming,
replacing them with Text for better performance.
(Hairong Kuang via cutting)

29. HADOOP-64. Manage multiple volumes with a single DataNode.
Previously DataNode would create a separate daemon per configured
volume, each with its own connection to the NameNode. Now all
volumes are handled by a single DataNode daemon, reducing the load
on the NameNode. (Milind Bhandarkar via cutting)

30. HADOOP-424. Fix MapReduce so that jobs which generate zero splits
do not fail. (Frédéric Bertin via cutting)

31. HADOOP-408. Adjust some timeouts and remove some others so that
unit tests run faster. (cutting)

32. HADOOP-507. Fix an IllegalAccessException in DFS.
(omalley via cutting)

33. HADOOP-320. Fix so that checksum files are correctly copied when
the destination of a file copy is a directory.
(Hairong Kuang via cutting)

34. HADOOP-286. In DFSClient, avoid pinging the NameNode with
renewLease() calls when no files are being written.
(Konstantin Shvachko via cutting)

35. HADOOP-312. Close idle IPC connections. All IPC connections were
cached forever. Now, after a connection has been idle for more
than a configurable amount of time (one second by default), the
connection is closed, conserving resources on both client and
server. (Devaraj Das via cutting)

36. HADOOP-497. Permit the specification of the network interface and
nameserver to be used when determining the local hostname
advertised by datanodes and tasktrackers.
(Lorenzo Thione via cutting)

37. HADOOP-441. Add a compression codec API and extend SequenceFile
to use it. This will permit the use of alternate compression
codecs in SequenceFile. (Arun C Murthy via cutting)

38. HADOOP-483. Improvements to libhdfs build and documentation.
(Arun C Murthy via cutting)

39. HADOOP-458. Fix a memory corruption bug in libhdfs.
(Arun C Murthy via cutting)

40. HADOOP-517. Fix a contrib/streaming bug in end-of-line detection.
(Hairong Kuang via cutting)

41. HADOOP-474. Add CompressionCodecFactory, and use it in
TextInputFormat and TextOutputFormat. Compressed input files are
automatically decompressed when they have the correct extension.
Output files will, when output compression is specified, be
generated with an appropriate extension. Also add a gzip codec and
fix problems with UTF8 text inputs. (omalley via cutting)


Release 0.5.0 - 2006-08-04

1. HADOOP-352. Fix shell scripts to use /bin/sh instead of
/bin/bash, for better portability.
(Jean-Baptiste Quenot via cutting)

2. HADOOP-313. Permit task state to be saved so that single tasks
may be manually re-executed when debugging. (omalley via cutting)

3. HADOOP-339. Add method to JobClient API listing jobs that are
not yet complete, i.e., that are queued or running.
(Mahadev Konar via cutting)

4. HADOOP-355. Updates to the streaming contrib module, including
API fixes, making reduce optional, and adding an input type for
StreamSequenceRecordReader. (Michel Tourn via cutting)

5. HADOOP-358. Fix a NPE bug in Path.equals().
(Frédéric Bertin via cutting)

6. HADOOP-327. Fix ToolBase to not call System.exit() when
exceptions are thrown. (Hairong Kuang via cutting)

7. HADOOP-359. Permit map output to be compressed.
(omalley via cutting)

8. HADOOP-341. Permit input URI to CopyFiles to use the HTTP
protocol. This lets one, e.g., more easily copy log files into
DFS. (Arun C Murthy via cutting)

9. HADOOP-361. Remove unix dependencies from streaming contrib
module tests, making them pure java. (Michel Tourn via cutting)

10. HADOOP-354. Make public methods to stop DFS daemons.
(Barry Kaplan via cutting)

11. HADOOP-252. Add versioning to RPC protocols.
(Milind Bhandarkar via cutting)

12. HADOOP-356. Add contrib to "compile" and "test" build targets, so
that this code is better maintained. (Michel Tourn via cutting)

13. HADOOP-307. Add smallJobsBenchmark contrib module. This runs
lots of small jobs, in order to determine per-task overheads.
(Sanjay Dahiya via cutting)

14. HADOOP-342. Add a tool for log analysis: Logalyzer.
(Arun C Murthy via cutting)

15. HADOOP-347. Add web-based browsing of DFS content. The namenode
redirects browsing requests to datanodes. Content requests are
redirected to datanodes where the data is local when possible.
(Devaraj Das via cutting)

16. HADOOP-351. Make Hadoop IPC kernel independent of Jetty.
(Devaraj Das via cutting)

17. HADOOP-237. Add metric reporting to DFS and MapReduce. With only
minor configuration changes, one can now monitor many Hadoop
system statistics using Ganglia or other monitoring systems.
(Milind Bhandarkar via cutting)

18. HADOOP-376. Fix datanode's HTTP server to scan for a free port.
(omalley via cutting)

19. HADOOP-260. Add --config option to shell scripts, specifying an
alternate configuration directory. (Milind Bhandarkar via cutting)

20. HADOOP-381. Permit developers to save the temporary files for
tasks whose names match a regular expression, to facilitate
debugging. (omalley via cutting)

21. HADOOP-344. Fix some Windows-related problems with DF.
(Konstantin Shvachko via cutting)

22. HADOOP-380. Fix reduce tasks to poll less frequently for map
outputs. (Mahadev Konar via cutting)

23. HADOOP-321. Refactor DatanodeInfo, in preparation for
HADOOP-306. (Konstantin Shvachko & omalley via cutting)

24. HADOOP-385. Fix some bugs in record io code generation.
(Milind Bhandarkar via cutting)

25. HADOOP-302. Add new Text class to replace UTF8, removing
limitations of that class. Also refactor utility methods for
writing zero-compressed integers (VInts and VLongs).
(Hairong Kuang via cutting)

26. HADOOP-335. Refactor DFS namespace/transaction logging in
namenode. (Konstantin Shvachko via cutting)

27. HADOOP-375. Fix handling of the datanode HTTP daemon's port so
that multiple datanodes can be run on a single host.
(Devaraj Das via cutting)

28. HADOOP-386. When removing excess DFS block replicas, remove those
on nodes with the least free space first.
(Johan Oskarson via cutting)

29. HADOOP-389. Fix intermittent failures of mapreduce unit tests.
Also fix some build dependencies.
(Mahadev & Konstantin via cutting)

30. HADOOP-362. Fix a problem where jobs hang when status messages
are received out-of-order. (omalley via cutting)

31. HADOOP-394. Change order of DFS shutdown in unit tests to
minimize errors logged. (Konstantin Shvachko via cutting)

32. HADOOP-396. Make DatanodeID implement Writable.
(Konstantin Shvachko via cutting)

33. HADOOP-377. Permit one to add URL resources to a Configuration.
(Jean-Baptiste Quenot via cutting)

34. HADOOP-345. Permit iteration over Configuration key/value pairs.
(Michel Tourn via cutting)

35. HADOOP-409. Streaming contrib module: make configuration
properties available to commands as environment variables.
(Michel Tourn via cutting)

36. HADOOP-369. Add -getmerge option to dfs command that appends all
files in a directory into a single local file.
(Johan Oskarson via cutting)

37. HADOOP-410. Replace some TreeMaps with HashMaps in DFS, for
a 17% performance improvement. (Milind Bhandarkar via cutting)

38. HADOOP-411. Add unit tests for command line parser.
(Hairong Kuang via cutting)

39. HADOOP-412. Add MapReduce input formats that support filtering
of SequenceFile data, including sampling and regex matching.
Also, move JobConf.newInstance() to a new utility class.
(Hairong Kuang via cutting)

40. HADOOP-226. Fix fsck command to properly consider replication
counts, now that these can vary per file. (Bryan Pendleton via cutting)

41. HADOOP-425. Add a Python MapReduce example, using Jython.
(omalley via cutting)


Release 0.4.0 - 2006-06-28

1. HADOOP-298. Improved progress reports for CopyFiles utility, the
distributed file copier. (omalley via cutting)

2. HADOOP-299. Fix the task tracker, permitting multiple jobs to
more easily execute at the same time. (omalley via cutting)

3. HADOOP-250. Add an HTTP user interface to the namenode, running
on port 50070. (Devaraj Das via cutting)

4. HADOOP-123. Add MapReduce unit tests that run a jobtracker and
tasktracker, greatly increasing code coverage.
(Milind Bhandarkar via cutting)

5. HADOOP-271. Add links from jobtracker's web ui to tasktracker's
web ui. Also attempt to log a thread dump of child processes
before they're killed. (omalley via cutting)

6. HADOOP-210. Change RPC server to use a selector instead of a
thread per connection. This should make it easier to scale to
larger clusters. Note that this incompatibly changes the RPC
protocol: clients and servers must both be upgraded to the new
version to ensure correct operation. (Devaraj Das via cutting)

7. HADOOP-311. Change DFS client to retry failed reads, so that a
single read failure will not alone cause failure of a task.
(omalley via cutting)

8. HADOOP-314. Remove the "append" phase when reducing. Map output
files are now directly passed to the sorter, without first
appending them into a single file. Now, the first third of reduce
progress is "copy" (transferring map output to reduce nodes), the
middle third is "sort" (sorting map output) and the last third is
"reduce" (generating output). Long-term, the "sort" phase will
also be removed. (omalley via cutting)

9. HADOOP-316. Fix a potential deadlock in the jobtracker.
(omalley via cutting)

10. HADOOP-319. Fix FileSystem.close() to remove the FileSystem
instance from the cache. (Hairong Kuang via cutting)

11. HADOOP-135. Fix potential deadlock in JobTracker by acquiring
locks in a consistent order. (omalley via cutting)

12. HADOOP-278. Check for existence of input directories before
starting MapReduce jobs, making it easier to debug this common
error. (omalley via cutting)

13. HADOOP-304. Improve error message for
UnregisteredDatanodeException to include expected node name.
(Konstantin Shvachko via cutting)

14. HADOOP-305. Fix TaskTracker to ask for new tasks as soon as a
task is finished, rather than waiting for the next heartbeat.
This improves performance when tasks are short.
(Mahadev Konar via cutting)

15. HADOOP-59. Add support for generic command line options. One may
now specify the filesystem (-fs), the MapReduce jobtracker (-jt),
a config file (-conf) or any configuration property (-D). The
"dfs", "fsck", "job", and "distcp" commands currently support
this, with more to be added. (Hairong Kuang via cutting)

16. HADOOP-296. Permit specification of the amount of reserved space
on a DFS datanode. One may specify both the percentage free and
the number of bytes. (Johan Oskarson via cutting)

17. HADOOP-325. Fix a problem initializing RPC parameter classes, and
remove the workaround used to initialize classes.
(omalley via cutting)

18. HADOOP-328. Add an option to the "distcp" command to ignore read
errors while copying. (omalley via cutting)

19. HADOOP-27. Don't allocate tasks to trackers whose local free
space is too low. (Johan Oskarson via cutting)

20. HADOOP-318. Keep slow DFS output from causing task timeouts.
This incompatibly changes some public interfaces, adding a
parameter to OutputFormat.getRecordWriter() and the new method
Reporter.progress(), but it makes lots of tasks succeed that were
previously failing. (Milind Bhandarkar via cutting)


Release 0.3.2 - 2006-06-09

1. HADOOP-275. Update the streaming contrib module to use log4j for
its logging. (Michel Tourn via cutting)

2. HADOOP-279. Provide defaults for log4j logging parameters, so
that things still work reasonably when Hadoop-specific system
properties are not provided. (omalley via cutting)

3. HADOOP-280. Fix a typo in AllTestDriver which caused the wrong
test to be run when "DistributedFSCheck" was specified.
(Konstantin Shvachko via cutting)

4. HADOOP-240. DFS's mkdirs() implementation no longer logs a warning
when the directory already exists. (Hairong Kuang via cutting)

5. HADOOP-285. Fix DFS datanodes to be able to re-join the cluster
after the connection to the namenode is lost. (omalley via cutting)

6. HADOOP-277. Fix a race condition when creating directories.
(Sameer Paranjpye via cutting)

7. HADOOP-289. Improved exception handling in DFS datanode.
(Konstantin Shvachko via cutting)

8. HADOOP-292. Fix client-side logging to go to standard error
rather than standard output, so that it can be distinguished from
application output. (omalley via cutting)

9. HADOOP-294. Fixed bug where conditions for retrying after errors
in the DFS client were reversed. (omalley via cutting)


Release 0.3.1 - 2006-06-05

1. HADOOP-272. Fix a bug in bin/hadoop setting log
parameters. (omalley & cutting)

2. HADOOP-274. Change applications to log to standard output rather
than to a rolling log file like daemons. (omalley via cutting)

3. HADOOP-262. Fix reduce tasks to report progress while they're
waiting for map outputs, so that they do not time out.
(Mahadev Konar via cutting)

4. HADOOP-245 and HADOOP-246. Improvements to record io package.
(Mahadev Konar via cutting)

5. HADOOP-276. Add logging config files to jar file so that they're
always found. (omalley via cutting)


Release 0.3.0 - 2006-06-02

1. HADOOP-208. Enhance MapReduce web interface, adding new pages
for failed tasks, and tasktrackers. (omalley via cutting)

2. HADOOP-204. Tweaks to metrics package. (David Bowen via cutting)

3. HADOOP-209. Add a MapReduce-based file copier. This will
copy files within or between file systems in parallel.
(Milind Bhandarkar via cutting)

4. HADOOP-146. Fix DFS to check when randomly generating a new block
id that no existing blocks already have that id.
(Milind Bhandarkar via cutting)

5. HADOOP-180. Make a daemon thread that does the actual task clean ups, so
that the main offerService thread in the taskTracker doesn't get stuck
and miss its heartbeat window. This was killing many task trackers as
big jobs finished (300+ tasks / node). (omalley via cutting)

6. HADOOP-200. Avoid transmitting entire list of map task names to
reduce tasks. Instead just transmit the number of map tasks and
henceforth refer to them by number when collecting map output.
(omalley via cutting)

7. HADOOP-219. Fix a NullPointerException when handling a checksum
exception under SequenceFile.Sorter.sort(). (cutting & stack)

8. HADOOP-212. Permit alteration of the file block size in DFS. The
default block size for new files may now be specified in the
configuration with the dfs.block.size property. The block size
may also be specified when files are opened.
(omalley via cutting)

9. HADOOP-218. Avoid accessing configuration while looping through
tasks in JobTracker. (Mahadev Konar via cutting)

10. HADOOP-161. Add hashCode() method to DFS's Block.
|
||
(Milind Bhandarkar via cutting)
|
||
|
||
11. HADOOP-115. Map output types may now be specified. These are also
|
||
used as reduce input types, thus permitting reduce input types to
|
||
differ from reduce output types. (Runping Qi via cutting)
|
||
|
||
12. HADOOP-216. Add task progress to task status page.
|
||
(Bryan Pendelton via cutting)
|
||
|
||
13. HADOOP-233. Add web server to task tracker that shows running
|
||
tasks and logs. Also add log access to job tracker web interface.
|
||
(omalley via cutting)
|
||
|
||
14. HADOOP-205. Incorporate pending tasks into tasktracker load
|
||
calculations. (Mahadev Konar via cutting)
|
||
|
||
15. HADOOP-247. Fix sort progress to better handle exceptions.
|
||
(Mahadev Konar via cutting)
|
||
|
||
16. HADOOP-195. Improve performance of the transfer of map outputs to
reduce nodes by performing multiple transfers in parallel, each on
a separate socket. (Sameer Paranjpye via cutting)

17. HADOOP-251. Fix task processes to be tolerant of failed progress
reports to their parent process. (omalley via cutting)

18. HADOOP-325. Improve the FileNotFound exceptions thrown by
LocalFileSystem to include the name of the file.
(Benjamin Reed via cutting)

19. HADOOP-254. Use HTTP to transfer map output data to reduce
nodes. This, together with HADOOP-195, greatly improves the
performance of these transfers. (omalley via cutting)

20. HADOOP-163. Cause datanodes that are unable to either read or
write data to exit, so that the namenode will no longer target
them for new blocks and will replicate their data on other nodes.
(Hairong Kuang via cutting)

21. HADOOP-222. Add a -setrep option to the dfs commands that alters
file replication levels. (Johan Oskarson via cutting)

22. HADOOP-75. In DFS, only check for a complete file when the file
is closed, rather than as each block is written.
(Milind Bhandarkar via cutting)

23. HADOOP-124. Change DFS so that datanodes are identified by a
persistent ID rather than by host and port. This solves a number
of filesystem integrity problems, when, e.g., datanodes are
restarted. (Konstantin Shvachko via cutting)

24. HADOOP-256. Add a C API for DFS. (Arun C Murthy via cutting)

25. HADOOP-211. Switch to use the Jakarta Commons logging internally,
configured to use log4j by default. (Arun C Murthy and cutting)

26. HADOOP-265. Tasktracker now fails to start if it does not have a
writable local directory for temporary files. In this case, it
logs a message to the JobTracker and exits. (Hairong Kuang via cutting)

27. HADOOP-270. Fix potential deadlock in datanode shutdown.
(Hairong Kuang via cutting)

Release 0.2.1 - 2006-05-12

1. HADOOP-199. Fix reduce progress (broken by HADOOP-182).
(omalley via cutting)

2. HADOOP-201. Fix 'bin/hadoop dfs -report'. (cutting)

3. HADOOP-207. Fix JDK 1.4 incompatibility introduced by HADOOP-96.
System.getenv() does not work in JDK 1.4. (Hairong Kuang via cutting)


Release 0.2.0 - 2006-05-05

1. Fix HADOOP-126. 'bin/hadoop dfs -cp' now correctly copies .crc
files. (Konstantin Shvachko via cutting)

2. Fix HADOOP-51. Change DFS to support per-file replication counts.
(Konstantin Shvachko via cutting)

3. Fix HADOOP-131. Add scripts to start/stop dfs and mapred daemons.
Use these in start/stop-all scripts. (Chris Mattmann via cutting)

4. Stop using ssh options by default that are not yet in widely used
versions of ssh. Folks can still enable their use by uncommenting
a line in conf/hadoop-env.sh. (cutting)

5. Fix HADOOP-92. Show information about all attempts to run each
task in the web ui. (Mahadev Konar via cutting)

6. Fix HADOOP-128. Improved DFS error handling. (Owen O'Malley via cutting)

7. Fix HADOOP-129. Replace uses of java.io.File with new class named
Path. This fixes bugs where java.io.File methods were called
directly when FileSystem methods were desired, and reduces the
likelihood of such bugs in the future. It also makes the handling
of pathnames more consistent between local and dfs FileSystems and
between Windows and Unix. java.io.File-based methods are still
available for back-compatibility, but are deprecated and will be
removed once 0.2 is released. (cutting)

8. Change dfs.data.dir and mapred.local.dir to be comma-separated
lists of directories, no longer space-separated. This fixes
several bugs on Windows. (cutting)

9. Fix HADOOP-144. Use mapred task id for dfs client id, to
facilitate debugging. (omalley via cutting)

10. Fix HADOOP-143. Do not line-wrap stack-traces in web ui.
(omalley via cutting)

11. Fix HADOOP-118. In DFS, improve clean up of abandoned file
creations. (omalley via cutting)

12. Fix HADOOP-138. Stop multiple tasks in a single heartbeat, rather
than one per heartbeat. (Stefan via cutting)

13. Fix HADOOP-139. Remove a potential deadlock in
LocalFileSystem.lock(). (Igor Bolotin via cutting)

14. Fix HADOOP-134. Don't hang jobs when the tasktracker is
misconfigured to use an un-writable local directory. (omalley via cutting)

15. Fix HADOOP-115. Correct an error message. (Stack via cutting)

16. Fix HADOOP-133. Retry pings from child to parent, in case of
(local) communication problems. Also log exit status, so that one
can distinguish patricide from other deaths. (omalley via cutting)

17. Fix HADOOP-142. Avoid re-running a task on a host where it has
previously failed. (omalley via cutting)

18. Fix HADOOP-148. Maintain a task failure count for each
tasktracker and display it in the web ui. (omalley via cutting)

19. Fix HADOOP-151. Close a potential socket leak, where new IPC
connection pools were created per configuration instance that RPCs
use. Now a global RPC connection pool is used again, as
originally intended. (cutting)

20. Fix HADOOP-69. Don't throw a NullPointerException when getting
hints for non-existing file split. (Bryan Pendelton via cutting)

21. Fix HADOOP-157. When a task that writes dfs files (e.g., a reduce
task) failed and was retried, it would fail again and again,
eventually failing the job. The problem was that dfs did not yet
know that the failed task had abandoned the files, and would not
yet let another task create files with the same names. Dfs now
retries when creating a file long enough for locks on abandoned
files to expire. (omalley via cutting)

22. Fix HADOOP-150. Improved task names that include job
names. (omalley via cutting)

23. Fix HADOOP-162. Fix ConcurrentModificationException when
releasing file locks. (omalley via cutting)

24. Fix HADOOP-132. Initial check-in of new Metrics API, including
implementations for writing metric data to a file and for sending
it to Ganglia. (David Bowen via cutting)

25. Fix HADOOP-160. Remove some unneeded synchronization around
time-consuming operations in the TaskTracker. (omalley via cutting)

26. Fix HADOOP-166. RPCs failed when passed subclasses of a declared
parameter type. This is fixed by changing ObjectWritable to store
both the declared type and the instance type for Writables. Note
that this incompatibly changes the format of ObjectWritable and
will render unreadable any ObjectWritables stored in files.
Nutch only uses ObjectWritable in intermediate files, so this
should not be a problem for Nutch. (Stefan & cutting)

27. Fix HADOOP-168. MapReduce RPC protocol methods should all declare
IOException, so that timeouts are handled appropriately.
(omalley via cutting)

28. Fix HADOOP-169. Don't fail a reduce task if a call to the
jobtracker to locate map outputs fails. (omalley via cutting)

29. Fix HADOOP-170. Permit FileSystem clients to examine and modify
the replication count of individual files. Also fix a few
replication-related bugs. (Konstantin Shvachko via cutting)

30. Permit specification of higher replication levels for job
submission files (job.xml and job.jar). This helps with large
clusters, since these files are read by every node. (cutting)

31. HADOOP-173. Optimize allocation of tasks with local data. (cutting)

32. HADOOP-167. Reduce number of Configurations and JobConf's
created. (omalley via cutting)

33. NUTCH-256. Change FileSystem#createNewFile() to create a .crc
file. The lack of a .crc file was causing warnings. (cutting)

34. HADOOP-174. Change JobClient to not abort job until it has failed
to contact the job tracker for five attempts, not just one as
before. (omalley via cutting)

35. HADOOP-177. Change MapReduce web interface to page through tasks.
Previously, when jobs had more than a few thousand tasks they
could crash web browsers. (Mahadev Konar via cutting)

36. HADOOP-178. In DFS, piggyback blockwork requests from datanodes
on heartbeat responses from namenode. This reduces the volume of
RPC traffic. Also move startup delay in blockwork from datanode
to namenode. This fixes a problem where restarting the namenode
triggered a lot of unneeded replication. (Hairong Kuang via cutting)

37. HADOOP-183. If the DFS namenode is restarted with different
minimum and/or maximum replication counts, existing files'
replication counts are now automatically adjusted to be within the
newly configured bounds. (Hairong Kuang via cutting)

38. HADOOP-186. Better error handling in TaskTracker's top-level
loop. Also improve calculation of time to send next heartbeat.
(omalley via cutting)

39. HADOOP-187. Add two MapReduce examples/benchmarks. One creates
files containing random data. The second sorts the output of the
first. (omalley via cutting)

40. HADOOP-185. Fix so that, when a task tracker times out making the
RPC asking for a new task to run, the job tracker does not think
that it is actually running the task returned. (omalley via cutting)

41. HADOOP-190. If a child process hangs after it has reported
completion, its output should not be lost. (Stack via cutting)

42. HADOOP-184. Re-structure some test code to better support testing
on a cluster. (Mahadev Konar via cutting)

43. HADOOP-191. Add streaming package, Hadoop's first contrib module.
This permits folks to easily submit MapReduce jobs whose map and
reduce functions are implemented by shell commands. Use
'bin/hadoop jar build/hadoop-streaming.jar' to get details.
(Michel Tourn via cutting)

44. HADOOP-189. Fix MapReduce in standalone configuration to
correctly handle job jar files that contain a lib directory with
nested jar files. (cutting)

45. HADOOP-65. Initial version of record I/O framework that enables
the specification of record types and generates marshalling code
in both Java and C++. Generated Java code implements
WritableComparable, but is not yet otherwise used by
Hadoop. (Milind Bhandarkar via cutting)

46. HADOOP-193. Add a MapReduce-based FileSystem benchmark.
(Konstantin Shvachko via cutting)

47. HADOOP-194. Add a MapReduce-based FileSystem checker. This reads
every block in every file in the filesystem. (Konstantin Shvachko
via cutting)

48. HADOOP-182. Fix so that lost task trackers do not change the
status of reduce tasks or completed jobs. Also fixes the progress
meter so that failed tasks are subtracted. (omalley via cutting)

49. HADOOP-96. Logging improvements. Log files are now separate from
standard output and standard error files. Logs are now rolled.
Logging of all DFS state changes can be enabled, to facilitate
debugging. (Hairong Kuang via cutting)


Release 0.1.1 - 2006-04-08

1. Added CHANGES.txt, logging all significant changes to Hadoop. (cutting)

2. Fix MapReduceBase.close() to throw IOException, as declared in the
Closeable interface. This permits subclasses which override this
method to throw that exception. (cutting)

3. Fix HADOOP-117. Pathnames were mistakenly transposed in
JobConf.getLocalFile() causing many mapred temporary files to not
be removed. (Raghavendra Prabhu via cutting)

4. Fix HADOOP-116. Clean up job submission files when jobs complete.
(cutting)

5. Fix HADOOP-125. Fix handling of absolute paths on Windows. (cutting)

Release 0.1.0 - 2006-04-01

1. The first release of Hadoop.