HADOOP-11791. Update src/site/markdown/releases to include old versions of Hadoop (aw)

(cherry picked from commit d759b4bd74)
Allen Wittenauer 2015-10-21 11:30:30 -07:00 committed by Andrew Wang
parent 9f4b3eaac0
commit 140a33df9b
232 changed files with 35454 additions and 0 deletions


@@ -0,0 +1,117 @@
<!---
# Licensed to the Apache Software Foundation (ASF) under one
# or more contributor license agreements. See the NOTICE file
# distributed with this work for additional information
# regarding copyright ownership. The ASF licenses this file
# to you under the Apache License, Version 2.0 (the
# "License"); you may not use this file except in compliance
# with the License. You may obtain a copy of the License at
#
# http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing, software
# distributed under the License is distributed on an "AS IS" BASIS,
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# limitations under the License.
-->
# Apache Hadoop Changelog
## Release 0.1.0 - 2006-04-02
### INCOMPATIBLE CHANGES:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
### NEW FEATURES:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
| [HADOOP-80](https://issues.apache.org/jira/browse/HADOOP-80) | binary key | Major | io | Owen O'Malley | Owen O'Malley |
| [HADOOP-46](https://issues.apache.org/jira/browse/HADOOP-46) | user-specified job names | Major | . | Doug Cutting | Owen O'Malley |
| [HADOOP-44](https://issues.apache.org/jira/browse/HADOOP-44) | RPC exceptions should include remote stack trace | Major | ipc | Doug Cutting | Doug Cutting |
| [HADOOP-37](https://issues.apache.org/jira/browse/HADOOP-37) | A way to determine the size and overall activity of the cluster | Major | . | Owen O'Malley | |
### IMPROVEMENTS:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
| [HADOOP-103](https://issues.apache.org/jira/browse/HADOOP-103) | introduce a common parent class for Mapper and Reducer | Minor | . | Owen O'Malley | Owen O'Malley |
| [HADOOP-87](https://issues.apache.org/jira/browse/HADOOP-87) | SequenceFile performance degrades substantially when compression is on and large values are encountered | Major | io | Sameer Paranjpye | Doug Cutting |
| [HADOOP-79](https://issues.apache.org/jira/browse/HADOOP-79) | listFiles optimization | Major | . | Konstantin Shvachko | Konstantin Shvachko |
| [HADOOP-67](https://issues.apache.org/jira/browse/HADOOP-67) | Added statistic/reporting info to DFS | Trivial | . | Barry Kaplan | Doug Cutting |
| [HADOOP-60](https://issues.apache.org/jira/browse/HADOOP-60) | Specification of alternate conf. directory | Minor | . | stack | |
| [HADOOP-49](https://issues.apache.org/jira/browse/HADOOP-49) | JobClient cannot use a non-default server (unlike DFSShell) | Major | . | Michel Tourn | Michel Tourn |
| [HADOOP-45](https://issues.apache.org/jira/browse/HADOOP-45) | JobTracker should log task errors | Major | . | Doug Cutting | Doug Cutting |
| [HADOOP-41](https://issues.apache.org/jira/browse/HADOOP-41) | JAVA\_OPTS for the TaskRunner Child | Minor | conf | stack | |
| [HADOOP-38](https://issues.apache.org/jira/browse/HADOOP-38) | default splitter should incorporate fs block size | Major | . | Doug Cutting | |
| [HADOOP-36](https://issues.apache.org/jira/browse/HADOOP-36) | Adding some uniformity/convenience to environment management | Major | conf | Bryan Pendleton | |
| [HADOOP-33](https://issues.apache.org/jira/browse/HADOOP-33) | DF enhancement: performance and win XP support | Minor | fs | Konstantin Shvachko | Konstantin Shvachko |
| [HADOOP-30](https://issues.apache.org/jira/browse/HADOOP-30) | DFS shell: support for ls -r and cat | Major | . | Michel Tourn | |
| [HADOOP-25](https://issues.apache.org/jira/browse/HADOOP-25) | a new map/reduce example and moving the examples from src/java to src/examples | Minor | . | Owen O'Malley | Owen O'Malley |
| [HADOOP-20](https://issues.apache.org/jira/browse/HADOOP-20) | Mapper, Reducer need an occasion to cleanup after the last record is processed. | Major | . | Michel Tourn | |
### BUG FIXES:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
| [HADOOP-112](https://issues.apache.org/jira/browse/HADOOP-112) | copyFromLocal should exclude .crc files | Minor | . | Monu Ogbe | Doug Cutting |
| [HADOOP-110](https://issues.apache.org/jira/browse/HADOOP-110) | new key and value instances are allocated before each map | Major | . | Owen O'Malley | Owen O'Malley |
| [HADOOP-107](https://issues.apache.org/jira/browse/HADOOP-107) | Namenode errors "Failed to complete filename.crc because dir.getFile()==null and null" | Major | . | Igor Bolotin | Doug Cutting |
| [HADOOP-102](https://issues.apache.org/jira/browse/HADOOP-102) | Two identical consecutive loops in FSNamesystem.chooseTarget() | Major | . | Konstantin Shvachko | Konstantin Shvachko |
| [HADOOP-100](https://issues.apache.org/jira/browse/HADOOP-100) | Inconsistent locking of the JobTracker.taskTrackers field | Major | . | Owen O'Malley | Owen O'Malley |
| [HADOOP-98](https://issues.apache.org/jira/browse/HADOOP-98) | The JobTracker's count of the number of running maps and reduces is wrong | Major | . | Owen O'Malley | Owen O'Malley |
| [HADOOP-97](https://issues.apache.org/jira/browse/HADOOP-97) | DFSShell.cat returns NullPointerException if file does not exist | Major | . | Konstantin Shvachko | Konstantin Shvachko |
| [HADOOP-93](https://issues.apache.org/jira/browse/HADOOP-93) | allow minimum split size configurable | Major | . | Hairong Kuang | Doug Cutting |
| [HADOOP-86](https://issues.apache.org/jira/browse/HADOOP-86) | If corrupted map outputs, reducers get stuck fetching forever | Major | . | stack | Doug Cutting |
| [HADOOP-84](https://issues.apache.org/jira/browse/HADOOP-84) | client should report file name in which IO exception occurs | Minor | . | Yoram Arnon | Konstantin Shvachko |
| [HADOOP-83](https://issues.apache.org/jira/browse/HADOOP-83) | infinite retries accessing a missing block | Major | . | Yoram Arnon | Konstantin Shvachko |
| [HADOOP-82](https://issues.apache.org/jira/browse/HADOOP-82) | JobTracker loses it: NoSuchElementException | Minor | . | stack | |
| [HADOOP-81](https://issues.apache.org/jira/browse/HADOOP-81) | speculative execution is only controllable from the default config | Major | . | Owen O'Malley | Owen O'Malley |
| [HADOOP-78](https://issues.apache.org/jira/browse/HADOOP-78) | rpc commands not buffered | Major | ipc | Owen O'Malley | Owen O'Malley |
| [HADOOP-77](https://issues.apache.org/jira/browse/HADOOP-77) | hang / crash when input folder does not exist. | Critical | . | Stefan Groschupf | |
| [HADOOP-70](https://issues.apache.org/jira/browse/HADOOP-70) | the two file system tests TestDFS and TestFileSystem take too long | Major | . | Owen O'Malley | Owen O'Malley |
| [HADOOP-66](https://issues.apache.org/jira/browse/HADOOP-66) | dfs client writes all data for a chunk to /tmp | Major | . | Sameer Paranjpye | Doug Cutting |
| [HADOOP-57](https://issues.apache.org/jira/browse/HADOOP-57) | hadoop dfs -ls / does not show root of file system | Minor | . | Yoram Arnon | |
| [HADOOP-52](https://issues.apache.org/jira/browse/HADOOP-52) | mapred input and output dirs must be absolute | Major | . | Doug Cutting | Owen O'Malley |
| [HADOOP-42](https://issues.apache.org/jira/browse/HADOOP-42) | PositionCache decrements its position for reads at the end of file | Major | fs | Konstantin Shvachko | |
| [HADOOP-40](https://issues.apache.org/jira/browse/HADOOP-40) | bufferSize argument is ignored in FileSystem.create(File, boolean, int) | Minor | fs | Konstantin Shvachko | |
| [HADOOP-34](https://issues.apache.org/jira/browse/HADOOP-34) | Build Paths Relative to PWD in build.xml | Trivial | . | Jeremy Bensley | |
| [HADOOP-28](https://issues.apache.org/jira/browse/HADOOP-28) | webapps broken | Major | . | Owen O'Malley | |
| [HADOOP-22](https://issues.apache.org/jira/browse/HADOOP-22) | remove unused imports | Trivial | . | Sami Siren | |
| [HADOOP-21](https://issues.apache.org/jira/browse/HADOOP-21) | the webapps need to be updated for the move from nutch | Minor | . | Owen O'Malley | |
| [HADOOP-19](https://issues.apache.org/jira/browse/HADOOP-19) | Datanode corruption | Critical | . | Rod Taylor | Doug Cutting |
| [HADOOP-16](https://issues.apache.org/jira/browse/HADOOP-16) | RPC call times out while indexing map task is computing splits | Major | . | Chris Schneider | Mike Cafarella |
| [HADOOP-12](https://issues.apache.org/jira/browse/HADOOP-12) | InputFormat used in job must be in JobTracker classpath (not loaded from job JAR) | Minor | . | Bryan Pendleton | |
| [HADOOP-10](https://issues.apache.org/jira/browse/HADOOP-10) | ndfs.replication is not documented within the nutch-default.xml configuration file. | Trivial | . | Rod Taylor | |
| [HADOOP-7](https://issues.apache.org/jira/browse/HADOOP-7) | MapReduce has a series of problems concerning task-allocation to worker nodes | Major | . | Mike Cafarella | |
| [HADOOP-6](https://issues.apache.org/jira/browse/HADOOP-6) | missing build directory in classpath | Minor | . | Owen O'Malley | |
| [HADOOP-5](https://issues.apache.org/jira/browse/HADOOP-5) | need commons-logging-api jar file | Minor | . | Owen O'Malley | |
| [HADOOP-3](https://issues.apache.org/jira/browse/HADOOP-3) | Output directories are not cleaned up before the reduces run | Minor | . | Owen O'Malley | Owen O'Malley |
| [HADOOP-2](https://issues.apache.org/jira/browse/HADOOP-2) | Reused Keys and Values fail with a Combiner | Major | . | Owen O'Malley | Owen O'Malley |
### TESTS:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
### SUB-TASKS:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
### OTHER:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
| [HADOOP-1](https://issues.apache.org/jira/browse/HADOOP-1) | initial import of code from Nutch | Major | . | Doug Cutting | Doug Cutting |


@@ -0,0 +1,24 @@
# Apache Hadoop 0.1.0 Release Notes
These release notes cover new developer and user-facing incompatibilities, features, and major improvements.


@@ -0,0 +1,67 @@
# Apache Hadoop Changelog
## Release 0.1.1 - 2006-04-07
### INCOMPATIBLE CHANGES:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
### NEW FEATURES:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
### IMPROVEMENTS:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
| [HADOOP-116](https://issues.apache.org/jira/browse/HADOOP-116) | cleaning up /tmp/hadoop/mapred/system | Major | . | raghavendra prabhu | Doug Cutting |
### BUG FIXES:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
| [HADOOP-125](https://issues.apache.org/jira/browse/HADOOP-125) | LocalFileSystem.makeAbsolute bug on Windows | Minor | fs | p sutter | Doug Cutting |
| [HADOOP-117](https://issues.apache.org/jira/browse/HADOOP-117) | mapred temporary files not deleted | Blocker | . | raghavendra prabhu | Doug Cutting |
### TESTS:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
### SUB-TASKS:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
### OTHER:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |


@@ -0,0 +1,24 @@
# Apache Hadoop 0.1.1 Release Notes
These release notes cover new developer and user-facing incompatibilities, features, and major improvements.


@@ -0,0 +1,123 @@
# Apache Hadoop Changelog
## Release 0.10.0 - 2007-01-05
### INCOMPATIBLE CHANGES:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
### NEW FEATURES:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
| [HADOOP-811](https://issues.apache.org/jira/browse/HADOOP-811) | Patch to support multi-threaded MapRunnable | Major | . | Alejandro Abdelnur | Doug Cutting |
| [HADOOP-681](https://issues.apache.org/jira/browse/HADOOP-681) | Administrative hook to pull live nodes out of a HDFS cluster | Major | . | dhruba borthakur | dhruba borthakur |
| [HADOOP-574](https://issues.apache.org/jira/browse/HADOOP-574) | want FileSystem implementation for Amazon S3 | Major | fs | Doug Cutting | |
| [HADOOP-454](https://issues.apache.org/jira/browse/HADOOP-454) | hadoop du optionally behave like unix's du -s | Trivial | . | Marco Nicosia | Hairong Kuang |
### IMPROVEMENTS:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
| [HADOOP-853](https://issues.apache.org/jira/browse/HADOOP-853) | Move site directories to docs directories | Minor | documentation | Nigel Daley | Doug Cutting |
| [HADOOP-850](https://issues.apache.org/jira/browse/HADOOP-850) | Add Writable implementations for variable-length integer types. | Minor | io | Andrzej Bialecki | Andrzej Bialecki |
| [HADOOP-837](https://issues.apache.org/jira/browse/HADOOP-837) | RunJar should unpack jar files into hadoop.tmp.dir | Major | util | Hairong Kuang | Hairong Kuang |
| [HADOOP-806](https://issues.apache.org/jira/browse/HADOOP-806) | NameNode WebUI : Include link to each of datanodes | Minor | . | Raghu Angadi | Raghu Angadi |
| [HADOOP-804](https://issues.apache.org/jira/browse/HADOOP-804) | Cut down on the "mumbling" in the Task process' stdout/stderr | Major | . | Owen O'Malley | Sanjay Dahiya |
| [HADOOP-796](https://issues.apache.org/jira/browse/HADOOP-796) | Node failing tasks and failed tasks should be more easily accessible through jobtracker history. | Minor | . | Sanjay Dahiya | Sanjay Dahiya |
| [HADOOP-783](https://issues.apache.org/jira/browse/HADOOP-783) | Hadoop dfs -put and -get accept '-' to indicate stdin/stdout | Minor | . | Marco Nicosia | Wendy Chien |
| [HADOOP-763](https://issues.apache.org/jira/browse/HADOOP-763) | NameNode benchmark using mapred is insufficient | Minor | test | Nigel Daley | Nigel Daley |
| [HADOOP-756](https://issues.apache.org/jira/browse/HADOOP-756) | new dfsadmin command to wait until safe mode is exited | Minor | . | Owen O'Malley | dhruba borthakur |
| [HADOOP-720](https://issues.apache.org/jira/browse/HADOOP-720) | Write a white paper on Hadoop File System Architecture, Design and Features | Major | documentation | dhruba borthakur | dhruba borthakur |
| [HADOOP-717](https://issues.apache.org/jira/browse/HADOOP-717) | When there are few reducers, sorting should be done by mappers | Major | . | arkady borkovsky | Owen O'Malley |
| [HADOOP-676](https://issues.apache.org/jira/browse/HADOOP-676) | JobClient should print user friendly messages for standard errors | Major | . | Owen O'Malley | Sanjay Dahiya |
| [HADOOP-621](https://issues.apache.org/jira/browse/HADOOP-621) | When a dfs -cat command is killed by the user, the corresponding hadoop process does not get aborted | Minor | . | dhruba borthakur | dhruba borthakur |
| [HADOOP-619](https://issues.apache.org/jira/browse/HADOOP-619) | Unify Map-Reduce and Streaming to take the same globbed input specification | Major | . | eric baldeschwieler | Sanjay Dahiya |
| [HADOOP-618](https://issues.apache.org/jira/browse/HADOOP-618) | JobProfile and JobSubmissionProtocol should be public | Major | . | Runping Qi | Arun C Murthy |
| [HADOOP-571](https://issues.apache.org/jira/browse/HADOOP-571) | Path should use URI syntax | Major | fs | Doug Cutting | Doug Cutting |
| [HADOOP-525](https://issues.apache.org/jira/browse/HADOOP-525) | Need raw comparators for hadoop record types | Major | record | Sameer Paranjpye | Milind Bhandarkar |
| [HADOOP-470](https://issues.apache.org/jira/browse/HADOOP-470) | Some improvements in the DFS content browsing UI | Minor | . | Devaraj Das | Hairong Kuang |
| [HADOOP-451](https://issues.apache.org/jira/browse/HADOOP-451) | Add a Split interface | Major | . | Doug Cutting | Owen O'Malley |
| [HADOOP-371](https://issues.apache.org/jira/browse/HADOOP-371) | ant tar should package contrib jars | Major | build | Michel Tourn | Nigel Daley |
| [HADOOP-331](https://issues.apache.org/jira/browse/HADOOP-331) | map outputs should be written to a single output file with an index | Major | . | eric baldeschwieler | Devaraj Das |
### BUG FIXES:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
| [HADOOP-849](https://issues.apache.org/jira/browse/HADOOP-849) | randomwriter fails with 'java.lang.OutOfMemoryError: Java heap space' in the 'reduce' task | Major | . | Arun C Murthy | Devaraj Das |
| [HADOOP-846](https://issues.apache.org/jira/browse/HADOOP-846) | Progress report is not sent during the intermediate sorts in the map phase | Major | . | Devaraj Das | Devaraj Das |
| [HADOOP-844](https://issues.apache.org/jira/browse/HADOOP-844) | Metrics messages are sent on a fixed-delay schedule instead of a fixed-rate schedule | Minor | metrics | David Bowen | |
| [HADOOP-841](https://issues.apache.org/jira/browse/HADOOP-841) | native hadoop libraries don't build properly with 64-bit OS and a 32-bit jvm | Major | build | Arun C Murthy | Arun C Murthy |
| [HADOOP-840](https://issues.apache.org/jira/browse/HADOOP-840) | the task tracker is getting blocked by long deletes of local files | Major | . | Owen O'Malley | Mahadev konar |
| [HADOOP-838](https://issues.apache.org/jira/browse/HADOOP-838) | TaskRunner.run() doesn't pass along the 'java.library.path' to the child (task) jvm | Major | . | Arun C Murthy | Arun C Murthy |
| [HADOOP-836](https://issues.apache.org/jira/browse/HADOOP-836) | unit tests fail on windows (/C:/cygwin/... is invalid) | Major | . | Owen O'Malley | Owen O'Malley |
| [HADOOP-835](https://issues.apache.org/jira/browse/HADOOP-835) | conf not set for the default Codec when initializing a Reader for a record-compressed sequence file | Major | io | Hairong Kuang | Hairong Kuang |
| [HADOOP-829](https://issues.apache.org/jira/browse/HADOOP-829) | Separate the datanode contents that is written to the fsimage vs the contents used in over-the-wire communication | Major | . | dhruba borthakur | dhruba borthakur |
| [HADOOP-825](https://issues.apache.org/jira/browse/HADOOP-825) | If the default file system is set using the new uri syntax, the namenode will not start | Major | . | Owen O'Malley | Owen O'Malley |
| [HADOOP-824](https://issues.apache.org/jira/browse/HADOOP-824) | DFSShell should become FSShell | Major | . | Doug Cutting | |
| [HADOOP-823](https://issues.apache.org/jira/browse/HADOOP-823) | DataNode will not start up if any directories from dfs.data.dir are missing | Major | . | Bryan Pendleton | Sameer Paranjpye |
| [HADOOP-818](https://issues.apache.org/jira/browse/HADOOP-818) | ant clean test-contrib doesn't work | Major | . | Owen O'Malley | Owen O'Malley |
| [HADOOP-814](https://issues.apache.org/jira/browse/HADOOP-814) | Increase dfs scalability by optimizing locking on namenode. | Major | . | dhruba borthakur | dhruba borthakur |
| [HADOOP-813](https://issues.apache.org/jira/browse/HADOOP-813) | map tasks lost during sort | Major | . | Owen O'Malley | Devaraj Das |
| [HADOOP-802](https://issues.apache.org/jira/browse/HADOOP-802) | mapred.speculative.execution description in hadoop-defauls.xml is not complete | Trivial | conf | Nigel Daley | Nigel Daley |
| [HADOOP-794](https://issues.apache.org/jira/browse/HADOOP-794) | JobTracker crashes with ArithmeticException | Major | . | Nigel Daley | Owen O'Malley |
| [HADOOP-792](https://issues.apache.org/jira/browse/HADOOP-792) | Invalid dfs -mv can trash your entire dfs | Major | . | Chris Schneider | |
| [HADOOP-786](https://issues.apache.org/jira/browse/HADOOP-786) | PhasedFileSystem should use debug level log for ignored exception. | Trivial | . | Sanjay Dahiya | Sanjay Dahiya |
| [HADOOP-782](https://issues.apache.org/jira/browse/HADOOP-782) | TaskTracker.java:killOverflowingTasks & TaskTracker.java:markUnresponsiveTasks only put the tip in tasksToCleanup queue, they don't update the runningJobs | Major | . | Arun C Murthy | Arun C Murthy |
| [HADOOP-777](https://issues.apache.org/jira/browse/HADOOP-777) | the tasktracker hostname is not fully qualified | Minor | . | Mahadev konar | Mahadev konar |
| [HADOOP-774](https://issues.apache.org/jira/browse/HADOOP-774) | Datanodes fails to heartbeat when a directory with a large number of blocks is deleted | Major | . | dhruba borthakur | dhruba borthakur |
| [HADOOP-770](https://issues.apache.org/jira/browse/HADOOP-770) | When JobTracker gets restarted, Job Tracker History doesn't show the jobs that were running. (incomplete jobs) | Minor | . | Koji Noguchi | Sanjay Dahiya |
| [HADOOP-764](https://issues.apache.org/jira/browse/HADOOP-764) | The memory consumption of processReport() in the namenode can be reduced | Major | . | dhruba borthakur | dhruba borthakur |
| [HADOOP-752](https://issues.apache.org/jira/browse/HADOOP-752) | Possible locking issues in HDFS Namenode | Major | . | dhruba borthakur | dhruba borthakur |
| [HADOOP-745](https://issues.apache.org/jira/browse/HADOOP-745) | NameNode throws FileNotFoundException: Parent path does not exist on startup | Major | . | Nigel Daley | dhruba borthakur |
| [HADOOP-738](https://issues.apache.org/jira/browse/HADOOP-738) | dfs get or copyToLocal should not copy crc file | Major | . | Milind Bhandarkar | Milind Bhandarkar |
| [HADOOP-737](https://issues.apache.org/jira/browse/HADOOP-737) | TaskTracker's job cleanup loop should check for finished job before deleting local directories | Critical | . | Sanjay Dahiya | Arun C Murthy |
| [HADOOP-700](https://issues.apache.org/jira/browse/HADOOP-700) | bin/hadoop includes in classpath all jar files in HADOOP\_HOME | Major | scripts | Nigel Daley | Doug Cutting |
| [HADOOP-673](https://issues.apache.org/jira/browse/HADOOP-673) | the task execution environment should have a current working directory that is task specific | Major | . | Owen O'Malley | Mahadev konar |
| [HADOOP-629](https://issues.apache.org/jira/browse/HADOOP-629) | none of the rpc servers check the protocol name for validity | Major | ipc | Owen O'Malley | Owen O'Malley |
| [HADOOP-628](https://issues.apache.org/jira/browse/HADOOP-628) | hadoop hdfs -cat replaces some characters with question marks. | Major | . | arkady borkovsky | Wendy Chien |
| [HADOOP-596](https://issues.apache.org/jira/browse/HADOOP-596) | TaskTracker taskstatus's phase doesn't get updated on phase transition causing wrong values displayed in WI | Major | . | Sanjay Dahiya | Sanjay Dahiya |
| [HADOOP-546](https://issues.apache.org/jira/browse/HADOOP-546) | Task tracker doesn't generate job.xml in jobcache for some tasks (possibly for only rescheduled tasks) | Critical | . | Sanjay Dahiya | Arun C Murthy |
### TESTS:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
### SUB-TASKS:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
### OTHER:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |


@@ -0,0 +1,24 @@
# Apache Hadoop 0.10.0 Release Notes
These release notes cover new developer and user-facing incompatibilities, features, and major improvements.


@@ -0,0 +1,77 @@
# Apache Hadoop Changelog
## Release 0.10.1 - 2007-01-10
### INCOMPATIBLE CHANGES:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
### NEW FEATURES:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
| [HADOOP-851](https://issues.apache.org/jira/browse/HADOOP-851) | Implement the LzoCodec with support for the lzo compression algorithms | Major | io | Arun C Murthy | Arun C Murthy |
### IMPROVEMENTS:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
### BUG FIXES:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
| [HADOOP-880](https://issues.apache.org/jira/browse/HADOOP-880) | Recursive delete for an S3 directory does not actually delete files or subdirectories | Major | fs | Tom White | Tom White |
| [HADOOP-879](https://issues.apache.org/jira/browse/HADOOP-879) | SequenceFileInputFormat can no longer read from data produced by MapFileOutputFormat | Major | . | Bryan Pendleton | Doug Cutting |
| [HADOOP-873](https://issues.apache.org/jira/browse/HADOOP-873) | native libraries aren't loaded unless the user specifies the java.library.path in the child jvm options | Major | util | Owen O'Malley | Owen O'Malley |
| [HADOOP-871](https://issues.apache.org/jira/browse/HADOOP-871) | java.library.path is wrongly initialized by bin/hadoop when only pre-built libs are present, but custom-built ones aren't | Major | . | Arun C Murthy | Arun C Murthy |
| [HADOOP-868](https://issues.apache.org/jira/browse/HADOOP-868) | Fix the merge method on Maps to limit the number of open files | Major | . | Devaraj Das | Devaraj Das |
| [HADOOP-866](https://issues.apache.org/jira/browse/HADOOP-866) | dfs -get should remove existing crc file if -crc is not specified | Major | fs | Milind Bhandarkar | Milind Bhandarkar |
| [HADOOP-865](https://issues.apache.org/jira/browse/HADOOP-865) | Files written to S3 but never closed can't be deleted | Major | fs | Bryan Pendleton | Tom White |
| [HADOOP-864](https://issues.apache.org/jira/browse/HADOOP-864) | bin/hadoop jar throws file creation exception for temp files | Minor | . | Sanjay Dahiya | Sanjay Dahiya |
| [HADOOP-863](https://issues.apache.org/jira/browse/HADOOP-863) | MapTask prints info log message when the progress-reporting thread starts | Major | . | Devaraj Das | Devaraj Das |
| [HADOOP-857](https://issues.apache.org/jira/browse/HADOOP-857) | IOException when running map reduce on S3 filesystem | Major | fs | Tom White | |
| [HADOOP-815](https://issues.apache.org/jira/browse/HADOOP-815) | Investigate and fix the extremely large memory-footprint of JobTracker | Major | . | Arun C Murthy | Arun C Murthy |
| [HADOOP-600](https://issues.apache.org/jira/browse/HADOOP-600) | Race condition in JobTracker updating the task tracker's status while declaring it lost | Major | . | Owen O'Malley | Arun C Murthy |
### TESTS:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
### SUB-TASKS:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
### OTHER:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |

@ -0,0 +1,24 @@
<!---
# Licensed to the Apache Software Foundation (ASF) under one
# or more contributor license agreements. See the NOTICE file
# distributed with this work for additional information
# regarding copyright ownership. The ASF licenses this file
# to you under the Apache License, Version 2.0 (the
# "License"); you may not use this file except in compliance
# with the License. You may obtain a copy of the License at
#
# http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing, software
# distributed under the License is distributed on an "AS IS" BASIS,
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# limitations under the License.
-->
# Apache Hadoop 0.10.1 Release Notes
These release notes cover new developer and user-facing incompatibilities, features, and major improvements.

@ -0,0 +1,118 @@
<!---
# Licensed to the Apache Software Foundation (ASF) under one
# or more contributor license agreements. See the NOTICE file
# distributed with this work for additional information
# regarding copyright ownership. The ASF licenses this file
# to you under the Apache License, Version 2.0 (the
# "License"); you may not use this file except in compliance
# with the License. You may obtain a copy of the License at
#
# http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing, software
# distributed under the License is distributed on an "AS IS" BASIS,
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# limitations under the License.
-->
# Apache Hadoop Changelog
## Release 0.11.0 - 2007-02-02
### INCOMPATIBLE CHANGES:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
### NEW FEATURES:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
| [HADOOP-967](https://issues.apache.org/jira/browse/HADOOP-967) | flip boolean to have rpc clients send a header | Major | ipc | Owen O'Malley | Owen O'Malley |
| [HADOOP-961](https://issues.apache.org/jira/browse/HADOOP-961) | a cli tool to get the event logs from a job | Major | . | Owen O'Malley | Owen O'Malley |
| [HADOOP-918](https://issues.apache.org/jira/browse/HADOOP-918) | Examples of Abacus using Python plugins | Major | . | Runping Qi | Runping Qi |
| [HADOOP-908](https://issues.apache.org/jira/browse/HADOOP-908) | Hadoop Abacus, a package for performing simple counting/aggregation | Major | . | Runping Qi | Runping Qi |
| [HADOOP-884](https://issues.apache.org/jira/browse/HADOOP-884) | Create scripts to run Hadoop on Amazon EC2 | Major | scripts | Tom White | Tom White |
| [HADOOP-852](https://issues.apache.org/jira/browse/HADOOP-852) | want ant task for record definitions | Major | record | Doug Cutting | Milind Bhandarkar |
| [HADOOP-732](https://issues.apache.org/jira/browse/HADOOP-732) | SequenceFile's header should allow to store metadata in the form of key/value pairs | Major | io | Runping Qi | Runping Qi |
### IMPROVEMENTS:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
| [HADOOP-963](https://issues.apache.org/jira/browse/HADOOP-963) | improve the stack trace returned by RPC client | Major | ipc | Owen O'Malley | Owen O'Malley |
| [HADOOP-936](https://issues.apache.org/jira/browse/HADOOP-936) | More updates to metric names to conform to HADOOP-887 | Minor | . | Nigel Daley | |
| [HADOOP-897](https://issues.apache.org/jira/browse/HADOOP-897) | Need a simpler way to specify arbitrary options to java compiler while building Hadoop | Minor | build | Milind Bhandarkar | Milind Bhandarkar |
| [HADOOP-890](https://issues.apache.org/jira/browse/HADOOP-890) | Update tag and metric names to conform to HADOOP-887 | Minor | metrics | Nigel Daley | Nigel Daley |
| [HADOOP-862](https://issues.apache.org/jira/browse/HADOOP-862) | Add handling of s3 to CopyFile tool | Minor | util | stack | |
| [HADOOP-842](https://issues.apache.org/jira/browse/HADOOP-842) | change the open method in ClientProtocol to take an additional argument: clientMachine | Major | . | Hairong Kuang | Hairong Kuang |
| [HADOOP-833](https://issues.apache.org/jira/browse/HADOOP-833) | need documentation of native build requirements | Major | documentation | Doug Cutting | Arun C Murthy |
| [HADOOP-830](https://issues.apache.org/jira/browse/HADOOP-830) | Improve the performance of the Merge phase | Major | . | Devaraj Das | Devaraj Das |
| [HADOOP-805](https://issues.apache.org/jira/browse/HADOOP-805) | JobClient should print the Task's stdout and stderr to the clients console | Major | . | Owen O'Malley | Sanjay Dahiya |
| [HADOOP-801](https://issues.apache.org/jira/browse/HADOOP-801) | job tracker should keep a log of task completion and failure | Major | . | Owen O'Malley | Sanjay Dahiya |
| [HADOOP-788](https://issues.apache.org/jira/browse/HADOOP-788) | Streaming should use a subclass of TextInputFormat for reading text inputs. | Major | . | Owen O'Malley | Sanjay Dahiya |
| [HADOOP-726](https://issues.apache.org/jira/browse/HADOOP-726) | HDFS locking mechanisms should be simplified or removed | Minor | . | Sameer Paranjpye | Raghu Angadi |
| [HADOOP-692](https://issues.apache.org/jira/browse/HADOOP-692) | Rack-aware Replica Placement | Major | . | Hairong Kuang | Hairong Kuang |
| [HADOOP-659](https://issues.apache.org/jira/browse/HADOOP-659) | Boost the priority of re-replicating blocks that are far from their replication target | Major | . | Konstantin Shvachko | Hairong Kuang |
| [HADOOP-602](https://issues.apache.org/jira/browse/HADOOP-602) | Remove Lucene dependency | Major | . | Andrzej Bialecki | Milind Bhandarkar |
### BUG FIXES:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
| [HADOOP-969](https://issues.apache.org/jira/browse/HADOOP-969) | deadlock in job tracker RetireJobs | Critical | . | Owen O'Malley | Owen O'Malley |
| [HADOOP-965](https://issues.apache.org/jira/browse/HADOOP-965) | Isolation Runner looking for job.jar in wrong directory | Major | . | Dennis Kubes | |
| [HADOOP-964](https://issues.apache.org/jira/browse/HADOOP-964) | ClassNotFoundException in ReduceTaskRunner | Blocker | . | Dennis Kubes | |
| [HADOOP-962](https://issues.apache.org/jira/browse/HADOOP-962) | Hadoop EC2 scripts are not executable | Major | scripts | Tom White | Tom White |
| [HADOOP-959](https://issues.apache.org/jira/browse/HADOOP-959) | TestCheckpoint fails on Windows | Major | . | dhruba borthakur | dhruba borthakur |
| [HADOOP-937](https://issues.apache.org/jira/browse/HADOOP-937) | data node re-registration | Major | . | Hairong Kuang | Hairong Kuang |
| [HADOOP-935](https://issues.apache.org/jira/browse/HADOOP-935) | Abacus should not delete the output dir | Major | . | Runping Qi | Runping Qi |
| [HADOOP-934](https://issues.apache.org/jira/browse/HADOOP-934) | TaskTracker sends duplicate status when updating task metrics throws exception | Major | . | Nigel Daley | Arun C Murthy |
| [HADOOP-929](https://issues.apache.org/jira/browse/HADOOP-929) | PhasedFileSystem should implement get/set configuration | Major | . | Sanjay Dahiya | Sanjay Dahiya |
| [HADOOP-922](https://issues.apache.org/jira/browse/HADOOP-922) | Optimize small reads and seeks | Major | . | dhruba borthakur | dhruba borthakur |
| [HADOOP-920](https://issues.apache.org/jira/browse/HADOOP-920) | MapFileOutputFormat and SequenceFileOutputFormat use incorrect key/value classes in map/reduce tasks | Major | . | Andrzej Bialecki | Andrzej Bialecki |
| [HADOOP-916](https://issues.apache.org/jira/browse/HADOOP-916) | HADOOP-908 patch causes javadoc warnings | Trivial | . | Nigel Daley | Nigel Daley |
| [HADOOP-912](https://issues.apache.org/jira/browse/HADOOP-912) | TestMiniMRWithDFS fails sporadically | Major | . | Nigel Daley | Arun C Murthy |
| [HADOOP-909](https://issues.apache.org/jira/browse/HADOOP-909) | dfs "du" shows that the size of a subdirectory is 0 | Major | . | Hairong Kuang | Hairong Kuang |
| [HADOOP-905](https://issues.apache.org/jira/browse/HADOOP-905) | Code to qualify inputDirs doesn't affect path validation | Major | fs | Kenji Matsuoka | |
| [HADOOP-902](https://issues.apache.org/jira/browse/HADOOP-902) | NPE in DFSOutputStream.closeBackupStream() | Major | . | Raghu Angadi | Raghu Angadi |
| [HADOOP-901](https://issues.apache.org/jira/browse/HADOOP-901) | Make S3FileSystem do recursive renames | Major | fs | Tom White | |
| [HADOOP-899](https://issues.apache.org/jira/browse/HADOOP-899) | Removal of deprecated code (in v0.10.0) from trunk breaks libhdfs | Major | . | Sameer Paranjpye | Sameer Paranjpye |
| [HADOOP-898](https://issues.apache.org/jira/browse/HADOOP-898) | namenode generates infinite stream of null pointers | Major | . | Owen O'Malley | Raghu Angadi |
| [HADOOP-886](https://issues.apache.org/jira/browse/HADOOP-886) | thousands of TimerThreads created by metrics API | Major | metrics | Nigel Daley | Nigel Daley |
| [HADOOP-881](https://issues.apache.org/jira/browse/HADOOP-881) | job history web/ui does not count task failures correctly | Major | . | Owen O'Malley | Sanjay Dahiya |
| [HADOOP-856](https://issues.apache.org/jira/browse/HADOOP-856) | fsck reports a non-existent DFS path as healthy | Minor | . | Nigel Daley | Milind Bhandarkar |
| [HADOOP-855](https://issues.apache.org/jira/browse/HADOOP-855) | HDFS should repair corrupted files | Major | . | Wendy Chien | Wendy Chien |
| [HADOOP-781](https://issues.apache.org/jira/browse/HADOOP-781) | Remove from trunk things deprecated in 0.10 branch. | Major | . | Doug Cutting | Doug Cutting |
| [HADOOP-758](https://issues.apache.org/jira/browse/HADOOP-758) | FileNotFound on DFS block file | Major | . | Owen O'Malley | Raghu Angadi |
| [HADOOP-757](https://issues.apache.org/jira/browse/HADOOP-757) | "Bad File Descriptor" in closing DFS file | Major | . | Owen O'Malley | Raghu Angadi |
| [HADOOP-735](https://issues.apache.org/jira/browse/HADOOP-735) | The underlying data structure, ByteArrayOutputStream, for buffer type of Hadoop record is inappropriate | Major | record | Runping Qi | Milind Bhandarkar |
| [HADOOP-731](https://issues.apache.org/jira/browse/HADOOP-731) | Sometimes when a dfs file is accessed and one copy has a checksum error the I/O command fails, even if another copy is alright. | Major | . | Dick King | Wendy Chien |
| [HADOOP-549](https://issues.apache.org/jira/browse/HADOOP-549) | NullPointerException in TaskReport's serialization code | Major | . | Michel Tourn | Owen O'Malley |
| [HADOOP-405](https://issues.apache.org/jira/browse/HADOOP-405) | Duplicate browseDirectory.jsp | Minor | . | Konstantin Shvachko | navychen |
| [HADOOP-309](https://issues.apache.org/jira/browse/HADOOP-309) | NullPointerException in StatusHttpServer | Minor | . | Konstantin Shvachko | navychen |
| [HADOOP-227](https://issues.apache.org/jira/browse/HADOOP-227) | Namespace check pointing is not performed until the namenode restarts. | Major | . | Konstantin Shvachko | dhruba borthakur |
### TESTS:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
### SUB-TASKS:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
### OTHER:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |

@ -0,0 +1,24 @@
<!---
# Licensed to the Apache Software Foundation (ASF) under one
# or more contributor license agreements. See the NOTICE file
# distributed with this work for additional information
# regarding copyright ownership. The ASF licenses this file
# to you under the Apache License, Version 2.0 (the
# "License"); you may not use this file except in compliance
# with the License. You may obtain a copy of the License at
#
# http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing, software
# distributed under the License is distributed on an "AS IS" BASIS,
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# limitations under the License.
-->
# Apache Hadoop 0.11.0 Release Notes
These release notes cover new developer and user-facing incompatibilities, features, and major improvements.

@ -0,0 +1,68 @@
<!---
# Licensed to the Apache Software Foundation (ASF) under one
# or more contributor license agreements. See the NOTICE file
# distributed with this work for additional information
# regarding copyright ownership. The ASF licenses this file
# to you under the Apache License, Version 2.0 (the
# "License"); you may not use this file except in compliance
# with the License. You may obtain a copy of the License at
#
# http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing, software
# distributed under the License is distributed on an "AS IS" BASIS,
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# limitations under the License.
-->
# Apache Hadoop Changelog
## Release 0.11.1 - 2007-02-09
### INCOMPATIBLE CHANGES:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
### NEW FEATURES:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
### IMPROVEMENTS:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
### BUG FIXES:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
| [HADOOP-987](https://issues.apache.org/jira/browse/HADOOP-987) | pendingTransfer does not remove a block from neededReplications correctly after targets are chosen | Major | . | Hairong Kuang | Hairong Kuang |
| [HADOOP-984](https://issues.apache.org/jira/browse/HADOOP-984) | Exception while retrieving a map output index file is not reported to JobTracker | Critical | . | Nigel Daley | Arun C Murthy |
| [HADOOP-976](https://issues.apache.org/jira/browse/HADOOP-976) | SequenceFile.Metadata class should be public | Major | . | Runping Qi | Runping Qi |
| [HADOOP-917](https://issues.apache.org/jira/browse/HADOOP-917) | NPE in org.apache.hadoop.io.SequenceFile$Sorter$MergeQueue | Major | io | Sami Siren | Owen O'Malley |
### TESTS:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
### SUB-TASKS:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
### OTHER:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |

@ -0,0 +1,24 @@
<!---
# Licensed to the Apache Software Foundation (ASF) under one
# or more contributor license agreements. See the NOTICE file
# distributed with this work for additional information
# regarding copyright ownership. The ASF licenses this file
# to you under the Apache License, Version 2.0 (the
# "License"); you may not use this file except in compliance
# with the License. You may obtain a copy of the License at
#
# http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing, software
# distributed under the License is distributed on an "AS IS" BASIS,
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# limitations under the License.
-->
# Apache Hadoop 0.11.1 Release Notes
These release notes cover new developer and user-facing incompatibilities, features, and major improvements.

@ -0,0 +1,67 @@
<!---
# Licensed to the Apache Software Foundation (ASF) under one
# or more contributor license agreements. See the NOTICE file
# distributed with this work for additional information
# regarding copyright ownership. The ASF licenses this file
# to you under the Apache License, Version 2.0 (the
# "License"); you may not use this file except in compliance
# with the License. You may obtain a copy of the License at
#
# http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing, software
# distributed under the License is distributed on an "AS IS" BASIS,
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# limitations under the License.
-->
# Apache Hadoop Changelog
## Release 0.11.2 - 2007-02-16
### INCOMPATIBLE CHANGES:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
### NEW FEATURES:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
### IMPROVEMENTS:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
### BUG FIXES:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
| [HADOOP-1014](https://issues.apache.org/jira/browse/HADOOP-1014) | map/reduce is corrupting data between map and reduce | Blocker | . | Owen O'Malley | Devaraj Das |
| [HADOOP-1013](https://issues.apache.org/jira/browse/HADOOP-1013) | ArithmeticException when number of racks is zero | Major | . | James P. White | Hairong Kuang |
| [HADOOP-1009](https://issues.apache.org/jira/browse/HADOOP-1009) | Infinite loop in chooseTarget | Major | . | dhruba borthakur | dhruba borthakur |
### TESTS:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
### SUB-TASKS:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
### OTHER:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |

@ -0,0 +1,24 @@
<!---
# Licensed to the Apache Software Foundation (ASF) under one
# or more contributor license agreements. See the NOTICE file
# distributed with this work for additional information
# regarding copyright ownership. The ASF licenses this file
# to you under the Apache License, Version 2.0 (the
# "License"); you may not use this file except in compliance
# with the License. You may obtain a copy of the License at
#
# http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing, software
# distributed under the License is distributed on an "AS IS" BASIS,
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# limitations under the License.
-->
# Apache Hadoop 0.11.2 Release Notes
These release notes cover new developer and user-facing incompatibilities, features, and major improvements.

@ -0,0 +1,129 @@
<!---
# Licensed to the Apache Software Foundation (ASF) under one
# or more contributor license agreements. See the NOTICE file
# distributed with this work for additional information
# regarding copyright ownership. The ASF licenses this file
# to you under the Apache License, Version 2.0 (the
# "License"); you may not use this file except in compliance
# with the License. You may obtain a copy of the License at
#
# http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing, software
# distributed under the License is distributed on an "AS IS" BASIS,
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# limitations under the License.
-->
# Apache Hadoop Changelog
## Release 0.12.0 - 2007-03-02
### INCOMPATIBLE CHANGES:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
### NEW FEATURES:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
| [HADOOP-1032](https://issues.apache.org/jira/browse/HADOOP-1032) | Support for caching Job JARs | Minor | . | Gautam Kowshik | Gautam Kowshik |
| [HADOOP-492](https://issues.apache.org/jira/browse/HADOOP-492) | Global counters | Major | . | arkady borkovsky | David Bowen |
| [HADOOP-491](https://issues.apache.org/jira/browse/HADOOP-491) | streaming jobs should allow programs that don't do any IO for a long time | Major | . | arkady borkovsky | Arun C Murthy |
### IMPROVEMENTS:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
| [HADOOP-1043](https://issues.apache.org/jira/browse/HADOOP-1043) | Optimize the shuffle phase (increase the parallelism) | Major | . | Devaraj Das | Devaraj Das |
| [HADOOP-1042](https://issues.apache.org/jira/browse/HADOOP-1042) | Improve the handling of failed map output fetches | Major | . | Devaraj Das | Devaraj Das |
| [HADOOP-1041](https://issues.apache.org/jira/browse/HADOOP-1041) | Counter names are ugly | Major | . | Owen O'Malley | David Bowen |
| [HADOOP-1040](https://issues.apache.org/jira/browse/HADOOP-1040) | Improvement of RandomWriter example to use custom InputFormat, OutputFormat, and Counters | Major | . | Owen O'Malley | Owen O'Malley |
| [HADOOP-1033](https://issues.apache.org/jira/browse/HADOOP-1033) | Rewrite AmazonEC2 wiki page | Minor | scripts | Tom White | Tom White |
| [HADOOP-1030](https://issues.apache.org/jira/browse/HADOOP-1030) | in unit tests, set ipc timeout in one place | Minor | test | Doug Cutting | Doug Cutting |
| [HADOOP-1025](https://issues.apache.org/jira/browse/HADOOP-1025) | remove dead code in Server.java | Minor | ipc | Doug Cutting | Doug Cutting |
| [HADOOP-1017](https://issues.apache.org/jira/browse/HADOOP-1017) | Optimization: Reduce Overhead from ReflectionUtils.newInstance | Major | util | Ron Bodkin | |
| [HADOOP-1007](https://issues.apache.org/jira/browse/HADOOP-1007) | Names used for map, reduce, and shuffle metrics should be unique | Trivial | metrics | Nigel Daley | Nigel Daley |
| [HADOOP-997](https://issues.apache.org/jira/browse/HADOOP-997) | Implement S3 retry mechanism for failed block transfers | Major | fs | Tom White | Tom White |
| [HADOOP-985](https://issues.apache.org/jira/browse/HADOOP-985) | Namenode should identify DataNodes as ip:port instead of hostname:port | Major | . | Raghu Angadi | Raghu Angadi |
| [HADOOP-982](https://issues.apache.org/jira/browse/HADOOP-982) | A couple setter functions and toString method for BytesWritable. | Major | io | Owen O'Malley | Owen O'Malley |
| [HADOOP-977](https://issues.apache.org/jira/browse/HADOOP-977) | The output from the user's task should be tagged and sent to the respective console streams. | Major | . | Owen O'Malley | Arun C Murthy |
| [HADOOP-975](https://issues.apache.org/jira/browse/HADOOP-975) | Separation of user tasks' stdout and stderr streams | Major | . | Arun C Murthy | Arun C Murthy |
| [HADOOP-972](https://issues.apache.org/jira/browse/HADOOP-972) | Improve the rack-aware replica placement performance | Major | . | Hairong Kuang | Hairong Kuang |
| [HADOOP-954](https://issues.apache.org/jira/browse/HADOOP-954) | Metrics should offer complete set of static report methods or none at all | Minor | metrics | Nigel Daley | David Bowen |
| [HADOOP-952](https://issues.apache.org/jira/browse/HADOOP-952) | Create a public (shared) Hadoop EC2 AMI | Major | scripts | Tom White | Tom White |
| [HADOOP-943](https://issues.apache.org/jira/browse/HADOOP-943) | fsck to show the filename of the corrupted file | Trivial | . | Koji Noguchi | dhruba borthakur |
| [HADOOP-941](https://issues.apache.org/jira/browse/HADOOP-941) | Enhancements to Hadoop record I/O - Part 1 | Major | record | Milind Bhandarkar | Milind Bhandarkar |
| [HADOOP-940](https://issues.apache.org/jira/browse/HADOOP-940) | pendingReplications of FSNamesystem is not informative | Major | . | Hairong Kuang | dhruba borthakur |
| [HADOOP-928](https://issues.apache.org/jira/browse/HADOOP-928) | make checksums optional per FileSystem | Major | fs | Doug Cutting | Hairong Kuang |
| [HADOOP-889](https://issues.apache.org/jira/browse/HADOOP-889) | DFS unit tests have duplicate code | Minor | test | Doug Cutting | Milind Bhandarkar |
| [HADOOP-882](https://issues.apache.org/jira/browse/HADOOP-882) | S3FileSystem should retry if there is a communication problem with S3 | Major | fs | Tom White | Tom White |
| [HADOOP-867](https://issues.apache.org/jira/browse/HADOOP-867) | job client should generate input fragments before the job is submitted | Major | . | Owen O'Malley | |
| [HADOOP-858](https://issues.apache.org/jira/browse/HADOOP-858) | clean up smallJobsBenchmark and move to src/test/org/apache/hadoop/mapred | Minor | build | Nigel Daley | Nigel Daley |
| [HADOOP-432](https://issues.apache.org/jira/browse/HADOOP-432) | support undelete, snapshots, or other mechanism to recover lost files | Major | . | Yoram Arnon | Doug Cutting |
| [HADOOP-333](https://issues.apache.org/jira/browse/HADOOP-333) | we should have some checks that the sort benchmark generates correct outputs | Major | . | Owen O'Malley | Arun C Murthy |
| [HADOOP-248](https://issues.apache.org/jira/browse/HADOOP-248) | locating map outputs via random probing is inefficient | Major | . | Owen O'Malley | Devaraj Das |
### BUG FIXES:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
| [HADOOP-1056](https://issues.apache.org/jira/browse/HADOOP-1056) | Decommission only recognizes IP addresses in hosts and exclude files on refresh. | Major | . | Wendy Chien | Wendy Chien |
| [HADOOP-1049](https://issues.apache.org/jira/browse/HADOOP-1049) | race condition in setting up ipc connections | Major | ipc | Owen O'Malley | Devaraj Das |
| [HADOOP-1046](https://issues.apache.org/jira/browse/HADOOP-1046) | Datanode should periodically clean up /tmp from partially received (and not completed) block files | Major | . | Andrzej Bialecki | Andrzej Bialecki |
| [HADOOP-1044](https://issues.apache.org/jira/browse/HADOOP-1044) | TestDecommission fails because it attempts to transfer block to a dead datanode | Major | test | Wendy Chien | Wendy Chien |
| [HADOOP-1039](https://issues.apache.org/jira/browse/HADOOP-1039) | Reduce the time taken by TestCheckpoint | Major | . | dhruba borthakur | dhruba borthakur |
| [HADOOP-1037](https://issues.apache.org/jira/browse/HADOOP-1037) | bin/slaves.sh not compatible with /bin/dash | Major | . | Doug Cutting | |
| [HADOOP-1036](https://issues.apache.org/jira/browse/HADOOP-1036) | task gets lost during assignment | Critical | . | Owen O'Malley | Arun C Murthy |
| [HADOOP-1034](https://issues.apache.org/jira/browse/HADOOP-1034) | RuntimeException and Error not caught in DataNode.DataXceiver.run() | Major | . | Philippe Gassmann | |
| [HADOOP-1029](https://issues.apache.org/jira/browse/HADOOP-1029) | streaming doesn't work with multiple maps | Blocker | . | Arun C Murthy | Arun C Murthy |
| [HADOOP-1027](https://issues.apache.org/jira/browse/HADOOP-1027) | Fix the RAM FileSystem/Merge problems (reported in HADOOP-1014) | Major | . | Devaraj Das | Devaraj Das |
| [HADOOP-1021](https://issues.apache.org/jira/browse/HADOOP-1021) | TestMiniMRLocalFS and TestMiniMRCaching broken on Windows | Major | test | Nigel Daley | |
| [HADOOP-1020](https://issues.apache.org/jira/browse/HADOOP-1020) | Path class on Windows seems broken | Major | . | Nigel Daley | Doug Cutting |
| [HADOOP-1010](https://issues.apache.org/jira/browse/HADOOP-1010) | getRecordReader method of InputFormat class should handle null reporter argument | Major | . | Runping Qi | Runping Qi |
| [HADOOP-1006](https://issues.apache.org/jira/browse/HADOOP-1006) | The "-local" option does work properly with test programs | Minor | test | Gautam Kowshik | Doug Cutting |
| [HADOOP-1000](https://issues.apache.org/jira/browse/HADOOP-1000) | Loggers in the Task framework should not write to the Task's stderr | Major | . | Owen O'Malley | Arun C Murthy |
| [HADOOP-994](https://issues.apache.org/jira/browse/HADOOP-994) | DFS Scalability : a BlockReport that returns large number of blocks-to-be-deleted causes datanode to lose connectivity to namenode | Major | . | dhruba borthakur | dhruba borthakur |
| [HADOOP-992](https://issues.apache.org/jira/browse/HADOOP-992) | The mini/mr cluster for testing always uses the local file system rather than the namenode that was passed in | Major | test | Owen O'Malley | Owen O'Malley |
| [HADOOP-990](https://issues.apache.org/jira/browse/HADOOP-990) | Datanode doesn't retry when write to one (full)drive fail | Major | . | Koji Noguchi | Raghu Angadi |
| [HADOOP-973](https://issues.apache.org/jira/browse/HADOOP-973) | NPE in FSDataset during heavy Namenode load | Major | . | Nigel Daley | dhruba borthakur |
| [HADOOP-947](https://issues.apache.org/jira/browse/HADOOP-947) | isReplicationInProgress() is very heavyweight | Major | . | dhruba borthakur | dhruba borthakur |
| [HADOOP-933](https://issues.apache.org/jira/browse/HADOOP-933) | Application defined InputSplits do not work | Major | . | Benjamin Reed | Owen O'Malley |
| [HADOOP-923](https://issues.apache.org/jira/browse/HADOOP-923) | DFS Scalability: datanode heartbeat timeouts cause cascading timeouts of other datanodes | Major | . | dhruba borthakur | dhruba borthakur |
| [HADOOP-893](https://issues.apache.org/jira/browse/HADOOP-893) | dead datanode set should be maintained in the file handle or file system for hdfs | Major | . | Owen O'Malley | Raghu Angadi |
| [HADOOP-878](https://issues.apache.org/jira/browse/HADOOP-878) | reducer NONE does not work with multiple maps | Minor | . | Mahadev konar | Arun C Murthy |
| [HADOOP-803](https://issues.apache.org/jira/browse/HADOOP-803) | Reducing memory consumption on Namenode : Part 1 | Major | . | Raghu Angadi | Raghu Angadi |
| [HADOOP-761](https://issues.apache.org/jira/browse/HADOOP-761) | Unit tests should cleanup created files in /tmp. It causes tests to fail if more than one users run tests on same machine. | Minor | test | Sanjay Dahiya | Nigel Daley |
| [HADOOP-654](https://issues.apache.org/jira/browse/HADOOP-654) | jobs fail with some hardware/system failures on a small number of nodes | Minor | . | Yoram Arnon | Arun C Murthy |
| [HADOOP-649](https://issues.apache.org/jira/browse/HADOOP-649) | Jobs without any map and reduce operations seems to be lost after their execution | Major | . | Thomas Friol | Owen O'Malley |
| [HADOOP-564](https://issues.apache.org/jira/browse/HADOOP-564) | we should use hdfs:// in all API URIs | Major | . | eric baldeschwieler | Wendy Chien |
| [HADOOP-476](https://issues.apache.org/jira/browse/HADOOP-476) | Streaming should check for correctness of the task | Major | . | arkady borkovsky | Arun C Murthy |
| [HADOOP-442](https://issues.apache.org/jira/browse/HADOOP-442) | slaves file should include an 'exclude' section, to prevent "bad" datanodes and tasktrackers from disrupting a cluster | Major | conf | Yoram Arnon | Wendy Chien |
| [HADOOP-338](https://issues.apache.org/jira/browse/HADOOP-338) | the number of maps in the JobConf does not match reality | Major | . | Owen O'Malley | Owen O'Malley |
| [HADOOP-109](https://issues.apache.org/jira/browse/HADOOP-109) | Blocks are not replicated when... | Major | . | Konstantin Shvachko | Konstantin Shvachko |
### TESTS:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
| [HADOOP-1051](https://issues.apache.org/jira/browse/HADOOP-1051) | Add checkstyle target to ant build file | Major | build, test | Tom White | Tom White |
### SUB-TASKS:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
### OTHER:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |

@ -0,0 +1,24 @@
<!---
# Licensed to the Apache Software Foundation (ASF) under one
# or more contributor license agreements. See the NOTICE file
# distributed with this work for additional information
# regarding copyright ownership. The ASF licenses this file
# to you under the Apache License, Version 2.0 (the
# "License"); you may not use this file except in compliance
# with the License. You may obtain a copy of the License at
#
# http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing, software
# distributed under the License is distributed on an "AS IS" BASIS,
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# limitations under the License.
-->
# Apache Hadoop 0.12.0 Release Notes
These release notes cover new developer and user-facing incompatibilities, features, and major improvements.

@ -0,0 +1,87 @@
<!---
# Licensed to the Apache Software Foundation (ASF) under one
# or more contributor license agreements. See the NOTICE file
# distributed with this work for additional information
# regarding copyright ownership. The ASF licenses this file
# to you under the Apache License, Version 2.0 (the
# "License"); you may not use this file except in compliance
# with the License. You may obtain a copy of the License at
#
# http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing, software
# distributed under the License is distributed on an "AS IS" BASIS,
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# limitations under the License.
-->
# Apache Hadoop Changelog
## Release 0.12.1 - 2007-03-17
### INCOMPATIBLE CHANGES:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
### NEW FEATURES:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
### IMPROVEMENTS:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
| [HADOOP-1096](https://issues.apache.org/jira/browse/HADOOP-1096) | Rename InputArchive and OutputArchive and make them public | Major | record | Milind Bhandarkar | Milind Bhandarkar |
| [HADOOP-1053](https://issues.apache.org/jira/browse/HADOOP-1053) | Make Record I/O functionally modular from the rest of Hadoop | Major | record | Milind Bhandarkar | Milind Bhandarkar |
### BUG FIXES:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
| [HADOOP-1129](https://issues.apache.org/jira/browse/HADOOP-1129) | The DFSClient hides IOExceptions in flush | Major | . | Owen O'Malley | Hairong Kuang |
| [HADOOP-1128](https://issues.apache.org/jira/browse/HADOOP-1128) | Missing progress information in map tasks | Major | . | Andrzej Bialecki | Andrzej Bialecki |
| [HADOOP-1126](https://issues.apache.org/jira/browse/HADOOP-1126) | Optimize CPU usage when cluster restarts | Major | . | dhruba borthakur | Hairong Kuang |
| [HADOOP-1117](https://issues.apache.org/jira/browse/HADOOP-1117) | DFS Scalability: When the namenode is restarted it consumes 80% CPU | Blocker | . | dhruba borthakur | dhruba borthakur |
| [HADOOP-1115](https://issues.apache.org/jira/browse/HADOOP-1115) | copyToLocal doesn't copy directories | Blocker | . | Nigel Daley | |
| [HADOOP-1112](https://issues.apache.org/jira/browse/HADOOP-1112) | Race condition in Hadoop metrics | Major | . | David Bowen | |
| [HADOOP-1109](https://issues.apache.org/jira/browse/HADOOP-1109) | Streaming, NPE when reading sequencefile | Major | . | Koji Noguchi | |
| [HADOOP-1108](https://issues.apache.org/jira/browse/HADOOP-1108) | Checksummed file system should retry reading if a different replica is found when handling ChecksumException | Blocker | . | dhruba borthakur | Hairong Kuang |
| [HADOOP-1099](https://issues.apache.org/jira/browse/HADOOP-1099) | NullPointerException in JobInProgress.getTaskInProgress | Major | . | Nigel Daley | Gautam Kowshik |
| [HADOOP-1092](https://issues.apache.org/jira/browse/HADOOP-1092) | NullPointerException in HeartbeatMonitor thread | Blocker | . | Nigel Daley | Hairong Kuang |
| [HADOOP-1091](https://issues.apache.org/jira/browse/HADOOP-1091) | NPE from Simon in JT stdout | Major | . | David Bowen | David Bowen |
| [HADOOP-1089](https://issues.apache.org/jira/browse/HADOOP-1089) | The c++ version of write and read v-int don't agree with the java versions | Major | record | Owen O'Malley | Milind Bhandarkar |
| [HADOOP-1088](https://issues.apache.org/jira/browse/HADOOP-1088) | Csv and Xml serialization for buffers do not work for byte value of -1 | Blocker | record | Milind Bhandarkar | Milind Bhandarkar |
| [HADOOP-1083](https://issues.apache.org/jira/browse/HADOOP-1083) | Replication not occurring after cluster restart when datanodes missing | Blocker | . | Nigel Daley | Hairong Kuang |
| [HADOOP-1082](https://issues.apache.org/jira/browse/HADOOP-1082) | NullPointerException in ChecksumFileSystem$FSInputChecker.seek | Blocker | . | Hairong Kuang | Hairong Kuang |
| [HADOOP-1080](https://issues.apache.org/jira/browse/HADOOP-1080) | Cygwin path translation should occur earlier in bin/hadoop | Major | scripts | Andrzej Bialecki | |
| [HADOOP-1077](https://issues.apache.org/jira/browse/HADOOP-1077) | Race condition in fetching map outputs (might lead to hung reduces) | Blocker | . | Devaraj Das | Devaraj Das |
| [HADOOP-1070](https://issues.apache.org/jira/browse/HADOOP-1070) | Number of racks and datanode double temporarily when upgrading from 0.10.1 to 0.11.2 | Blocker | . | Nigel Daley | Konstantin Shvachko |
| [HADOOP-1067](https://issues.apache.org/jira/browse/HADOOP-1067) | Compile fails if Checkstyle jar is present in lib directory | Major | build | Tom White | Tom White |
| [HADOOP-1060](https://issues.apache.org/jira/browse/HADOOP-1060) | IndexOutOfBoundsException in JobInProgress.updateTaskStatus leads to hung jobs | Blocker | . | Arun C Murthy | Arun C Murthy |
| [HADOOP-1035](https://issues.apache.org/jira/browse/HADOOP-1035) | StackOverflowError in FSDataSet | Blocker | . | Philippe Gassmann | Raghu Angadi |
### TESTS:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
### SUB-TASKS:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
### OTHER:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |

@ -0,0 +1,24 @@
<!---
# Licensed to the Apache Software Foundation (ASF) under one
# or more contributor license agreements. See the NOTICE file
# distributed with this work for additional information
# regarding copyright ownership. The ASF licenses this file
# to you under the Apache License, Version 2.0 (the
# "License"); you may not use this file except in compliance
# with the License. You may obtain a copy of the License at
#
# http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing, software
# distributed under the License is distributed on an "AS IS" BASIS,
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# limitations under the License.
-->
# Apache Hadoop 0.12.1 Release Notes
These release notes cover new developer and user-facing incompatibilities, features, and major improvements.

@ -0,0 +1,68 @@
<!---
# Licensed to the Apache Software Foundation (ASF) under one
# or more contributor license agreements. See the NOTICE file
# distributed with this work for additional information
# regarding copyright ownership. The ASF licenses this file
# to you under the Apache License, Version 2.0 (the
# "License"); you may not use this file except in compliance
# with the License. You may obtain a copy of the License at
#
# http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing, software
# distributed under the License is distributed on an "AS IS" BASIS,
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# limitations under the License.
-->
# Apache Hadoop Changelog
## Release 0.12.2 - 2007-03-23
### INCOMPATIBLE CHANGES:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
### NEW FEATURES:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
### IMPROVEMENTS:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
### BUG FIXES:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
| [HADOOP-1150](https://issues.apache.org/jira/browse/HADOOP-1150) | Streaming -reducer and -mapper no longer have defaults | Major | . | Nigel Daley | Owen O'Malley |
| [HADOOP-1145](https://issues.apache.org/jira/browse/HADOOP-1145) | XmlRecordInput class should be public | Major | record | Runping Qi | Milind Bhandarkar |
| [HADOOP-1140](https://issues.apache.org/jira/browse/HADOOP-1140) | Deadlock bug involving the o.a.h.metrics package | Major | metrics | David Bowen | David Bowen |
| [HADOOP-1135](https://issues.apache.org/jira/browse/HADOOP-1135) | A block report processing may incorrectly cause the namenode to delete blocks | Major | . | dhruba borthakur | dhruba borthakur |
### TESTS:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
### SUB-TASKS:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
### OTHER:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |

@ -0,0 +1,24 @@
<!---
# Licensed to the Apache Software Foundation (ASF) under one
# or more contributor license agreements. See the NOTICE file
# distributed with this work for additional information
# regarding copyright ownership. The ASF licenses this file
# to you under the Apache License, Version 2.0 (the
# "License"); you may not use this file except in compliance
# with the License. You may obtain a copy of the License at
#
# http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing, software
# distributed under the License is distributed on an "AS IS" BASIS,
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# limitations under the License.
-->
# Apache Hadoop 0.12.2 Release Notes
These release notes cover new developer and user-facing incompatibilities, features, and major improvements.

@ -0,0 +1,72 @@
<!---
# Licensed to the Apache Software Foundation (ASF) under one
# or more contributor license agreements. See the NOTICE file
# distributed with this work for additional information
# regarding copyright ownership. The ASF licenses this file
# to you under the Apache License, Version 2.0 (the
# "License"); you may not use this file except in compliance
# with the License. You may obtain a copy of the License at
#
# http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing, software
# distributed under the License is distributed on an "AS IS" BASIS,
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# limitations under the License.
-->
# Apache Hadoop Changelog
## Release 0.12.3 - 2007-04-06
### INCOMPATIBLE CHANGES:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
### NEW FEATURES:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
### IMPROVEMENTS:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
### BUG FIXES:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
| [HADOOP-1192](https://issues.apache.org/jira/browse/HADOOP-1192) | Du command takes a noticeably longer time to execute on a large dfs than the 0.11 release | Major | . | Hairong Kuang | Hairong Kuang |
| [HADOOP-1191](https://issues.apache.org/jira/browse/HADOOP-1191) | MapTask should wait for the status reporting thread to die before invoking the TaskUmbilicalProtocol.done(taskid) | Major | . | Devaraj Das | Doug Cutting |
| [HADOOP-1179](https://issues.apache.org/jira/browse/HADOOP-1179) | task Tracker should be restarted if its jetty http server cannot serve get-map-output files | Major | . | Runping Qi | Devaraj Das |
| [HADOOP-1177](https://issues.apache.org/jira/browse/HADOOP-1177) | Lack of logging of exceptions in MapOutputLocation.getFile | Major | . | Owen O'Malley | Devaraj Das |
| [HADOOP-1175](https://issues.apache.org/jira/browse/HADOOP-1175) | the user output/logs are broken from the web ui | Major | . | Owen O'Malley | Arun C Murthy |
| [HADOOP-1162](https://issues.apache.org/jira/browse/HADOOP-1162) | Record IO: serializing a byte buffer to CSV fails if buffer contains bytes less than 16. | Major | . | David Bowen | Milind Bhandarkar |
| [HADOOP-1123](https://issues.apache.org/jira/browse/HADOOP-1123) | LocalFileSystem gets a NullPointerException when tries to recover from ChecksumError | Major | fs | Hairong Kuang | Hairong Kuang |
| [HADOOP-1105](https://issues.apache.org/jira/browse/HADOOP-1105) | Reducers don't make "progress" while iterating through values | Major | . | Owen O'Malley | Owen O'Malley |
### TESTS:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
### SUB-TASKS:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
### OTHER:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |

@ -0,0 +1,24 @@
<!---
# Licensed to the Apache Software Foundation (ASF) under one
# or more contributor license agreements. See the NOTICE file
# distributed with this work for additional information
# regarding copyright ownership. The ASF licenses this file
# to you under the Apache License, Version 2.0 (the
# "License"); you may not use this file except in compliance
# with the License. You may obtain a copy of the License at
#
# http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing, software
# distributed under the License is distributed on an "AS IS" BASIS,
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# limitations under the License.
-->
# Apache Hadoop 0.12.3 Release Notes
These release notes cover new developer and user-facing incompatibilities, features, and major improvements.

@ -0,0 +1,195 @@
<!---
# Licensed to the Apache Software Foundation (ASF) under one
# or more contributor license agreements. See the NOTICE file
# distributed with this work for additional information
# regarding copyright ownership. The ASF licenses this file
# to you under the Apache License, Version 2.0 (the
# "License"); you may not use this file except in compliance
# with the License. You may obtain a copy of the License at
#
# http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing, software
# distributed under the License is distributed on an "AS IS" BASIS,
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# limitations under the License.
-->
# Apache Hadoop Changelog
## Release 0.13.0 - 2007-06-08
### INCOMPATIBLE CHANGES:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
### NEW FEATURES:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
| [HADOOP-1251](https://issues.apache.org/jira/browse/HADOOP-1251) | A method to get the InputSplit from a Mapper | Major | . | Owen O'Malley | Owen O'Malley |
| [HADOOP-1247](https://issues.apache.org/jira/browse/HADOOP-1247) | Make Hadoop Abacus work with Hadoop Streaming | Major | . | Runping Qi | Runping Qi |
| [HADOOP-1217](https://issues.apache.org/jira/browse/HADOOP-1217) | Specify a junit test timeout in build.xml files | Minor | build | Nigel Daley | Nigel Daley |
| [HADOOP-1216](https://issues.apache.org/jira/browse/HADOOP-1216) | Hadoop should support reduce none option | Major | . | Runping Qi | Runping Qi |
| [HADOOP-1120](https://issues.apache.org/jira/browse/HADOOP-1120) | Contribute some code helping implement map/reduce apps for joining data from multiple sources | Major | . | Runping Qi | Runping Qi |
| [HADOOP-1111](https://issues.apache.org/jira/browse/HADOOP-1111) | Job completion notification to a job configured URL | Major | . | Alejandro Abdelnur | |
| [HADOOP-702](https://issues.apache.org/jira/browse/HADOOP-702) | DFS Upgrade Proposal | Major | . | Konstantin Shvachko | Konstantin Shvachko |
| [HADOOP-485](https://issues.apache.org/jira/browse/HADOOP-485) | allow a different comparator for grouping keys in calls to reduce | Major | . | Owen O'Malley | Tahir Hashmi |
### IMPROVEMENTS:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
| [HADOOP-1326](https://issues.apache.org/jira/browse/HADOOP-1326) | Return the RunningJob from JobClient.runJob | Major | . | Owen O'Malley | Owen O'Malley |
| [HADOOP-1324](https://issues.apache.org/jira/browse/HADOOP-1324) | FSError encountered by one running task should not be fatal to other tasks on that node | Major | . | Devaraj Das | Arun C Murthy |
| [HADOOP-1315](https://issues.apache.org/jira/browse/HADOOP-1315) | Hadoop Streaming code clean up | Major | . | Runping Qi | Runping Qi |
| [HADOOP-1308](https://issues.apache.org/jira/browse/HADOOP-1308) | Tighten generic Class restrictions in JobConf.java | Minor | . | Michael Bieniosek | Michael Bieniosek |
| [HADOOP-1304](https://issues.apache.org/jira/browse/HADOOP-1304) | MAX\_TASK\_FAILURES should be configurable | Major | . | Christian Kunz | Devaraj Das |
| [HADOOP-1290](https://issues.apache.org/jira/browse/HADOOP-1290) | Move Hadoop Abacus to hadoop.mapred.lib | Major | . | Runping Qi | |
| [HADOOP-1284](https://issues.apache.org/jira/browse/HADOOP-1284) | clean up the protocol between stream mapper/reducer and the framework | Major | . | Runping Qi | Runping Qi |
| [HADOOP-1276](https://issues.apache.org/jira/browse/HADOOP-1276) | TaskTracker expiry interval is not configurable | Major | . | Alejandro Abdelnur | Arun C Murthy |
| [HADOOP-1270](https://issues.apache.org/jira/browse/HADOOP-1270) | Randomize the fetch of map outputs | Major | . | Arun C Murthy | Arun C Murthy |
| [HADOOP-1263](https://issues.apache.org/jira/browse/HADOOP-1263) | retry logic when dfs exist or open fails temporarily, e.g. because of a timeout | Major | . | Christian Kunz | Hairong Kuang |
| [HADOOP-1260](https://issues.apache.org/jira/browse/HADOOP-1260) | need code review guidelines | Major | build | Nigel Daley | Nigel Daley |
| [HADOOP-1250](https://issues.apache.org/jira/browse/HADOOP-1250) | Remove the MustangFile class from streaming and promote the chmod into FileUtils | Major | . | Owen O'Malley | Owen O'Malley |
| [HADOOP-1214](https://issues.apache.org/jira/browse/HADOOP-1214) | the first step for streaming clean up | Major | . | Runping Qi | Runping Qi |
| [HADOOP-1213](https://issues.apache.org/jira/browse/HADOOP-1213) | When RPC call fails then log call message detail | Minor | ipc | Nigel Daley | Doug Cutting |
| [HADOOP-1194](https://issues.apache.org/jira/browse/HADOOP-1194) | map output should not do block level compression | Major | . | Runping Qi | Arun C Murthy |
| [HADOOP-1190](https://issues.apache.org/jira/browse/HADOOP-1190) | Fix unchecked warnings | Major | . | Tom White | Tom White |
| [HADOOP-1167](https://issues.apache.org/jira/browse/HADOOP-1167) | InMemoryFileSystem uses synchronizedMaps with maps that are locked anyways | Minor | fs | Owen O'Malley | Owen O'Malley |
| [HADOOP-1166](https://issues.apache.org/jira/browse/HADOOP-1166) | Pull the NullOutputFormat into the lib package | Major | . | Owen O'Malley | Owen O'Malley |
| [HADOOP-1165](https://issues.apache.org/jira/browse/HADOOP-1165) | Code for toString in code generated by Record I/O Compiler can be generic | Minor | record | Milind Bhandarkar | Milind Bhandarkar |
| [HADOOP-1161](https://issues.apache.org/jira/browse/HADOOP-1161) | need improved release process | Major | build | Doug Cutting | Doug Cutting |
| [HADOOP-1148](https://issues.apache.org/jira/browse/HADOOP-1148) | re-indent all code | Minor | . | Doug Cutting | Doug Cutting |
| [HADOOP-1144](https://issues.apache.org/jira/browse/HADOOP-1144) | Hadoop should allow a configurable percentage of failed map tasks before declaring a job failed. | Major | . | Christian Kunz | Arun C Murthy |
| [HADOOP-1133](https://issues.apache.org/jira/browse/HADOOP-1133) | Tools to analyze and debug namenode on a production cluster | Major | . | dhruba borthakur | dhruba borthakur |
| [HADOOP-1131](https://issues.apache.org/jira/browse/HADOOP-1131) | Add a closeAll() static method to FileSystem | Minor | . | Philippe Gassmann | |
| [HADOOP-1127](https://issues.apache.org/jira/browse/HADOOP-1127) | Speculative Execution and output of Reduce tasks | Major | . | Arun C Murthy | Arun C Murthy |
| [HADOOP-1116](https://issues.apache.org/jira/browse/HADOOP-1116) | Add maxmemory="256m" in the junit call of build-contrib.xml | Major | build | Philippe Gassmann | |
| [HADOOP-1101](https://issues.apache.org/jira/browse/HADOOP-1101) | Add more statistics in the web-ui to do with tasks | Major | . | Devaraj Das | Devaraj Das |
| [HADOOP-1094](https://issues.apache.org/jira/browse/HADOOP-1094) | Optimize readFields and write methods in record I/O | Major | record | Milind Bhandarkar | Milind Bhandarkar |
| [HADOOP-1068](https://issues.apache.org/jira/browse/HADOOP-1068) | Improve error message for 0 datanode case | Major | . | Owen O'Malley | dhruba borthakur |
| [HADOOP-988](https://issues.apache.org/jira/browse/HADOOP-988) | Namenode should use single map for block to its meta data. | Major | . | Raghu Angadi | Raghu Angadi |
| [HADOOP-978](https://issues.apache.org/jira/browse/HADOOP-978) | AlreadyBeingCreatedException detail message could contain more useful info | Minor | . | Nigel Daley | Konstantin Shvachko |
| [HADOOP-971](https://issues.apache.org/jira/browse/HADOOP-971) | DFS Scalability: Improve name node performance by adding a hostname to datanodes map | Major | . | Hairong Kuang | Hairong Kuang |
| [HADOOP-968](https://issues.apache.org/jira/browse/HADOOP-968) | Reduce shuffle and merge should be done in a child JVM | Major | . | Owen O'Malley | Devaraj Das |
| [HADOOP-819](https://issues.apache.org/jira/browse/HADOOP-819) | LineRecordWriter should not always insert tab char between key and value | Major | . | Runping Qi | Runping Qi |
### BUG FIXES:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
| [HADOOP-1452](https://issues.apache.org/jira/browse/HADOOP-1452) | map output transfers of more than 2^31 bytes output are failing | Blocker | . | Christian Kunz | Owen O'Malley |
| [HADOOP-1435](https://issues.apache.org/jira/browse/HADOOP-1435) | FileSystem.globPaths should not create a Path from an empty string | Blocker | fs | Hairong Kuang | Hairong Kuang |
| [HADOOP-1431](https://issues.apache.org/jira/browse/HADOOP-1431) | Map tasks can't timeout for failing to call progress | Blocker | . | Owen O'Malley | Arun C Murthy |
| [HADOOP-1427](https://issues.apache.org/jira/browse/HADOOP-1427) | Typo in GzipCodec.createInputStream - bufferSize | Blocker | io | Espen Amble Kolstad | Espen Amble Kolstad |
| [HADOOP-1411](https://issues.apache.org/jira/browse/HADOOP-1411) | AlreadyBeingCreatedException from task retries | Blocker | . | Nigel Daley | Hairong Kuang |
| [HADOOP-1407](https://issues.apache.org/jira/browse/HADOOP-1407) | Failed tasks not killing job | Blocker | . | Nigel Daley | Arun C Murthy |
| [HADOOP-1388](https://issues.apache.org/jira/browse/HADOOP-1388) | Possible Null Pointer Dereference in taskdetails.jsp | Major | . | Devaraj Das | Devaraj Das |
| [HADOOP-1386](https://issues.apache.org/jira/browse/HADOOP-1386) | The constructor of Path should not take an empty string as a parameter | Blocker | . | Hairong Kuang | Hairong Kuang |
| [HADOOP-1385](https://issues.apache.org/jira/browse/HADOOP-1385) | MD5Hash has a bad hash function | Major | io | Owen O'Malley | Owen O'Malley |
| [HADOOP-1369](https://issues.apache.org/jira/browse/HADOOP-1369) | Inconsistent synchronization of TaskTracker fields | Blocker | . | Nigel Daley | Owen O'Malley |
| [HADOOP-1368](https://issues.apache.org/jira/browse/HADOOP-1368) | Inconsistent synchronization of 3 fields in JobInProgress.java | Blocker | . | Nigel Daley | Owen O'Malley |
| [HADOOP-1363](https://issues.apache.org/jira/browse/HADOOP-1363) | waitForCompletion() calls Thread.sleep() with a lock held | Blocker | . | Nigel Daley | Owen O'Malley |
| [HADOOP-1361](https://issues.apache.org/jira/browse/HADOOP-1361) | seek calls in 3 io classes ignore result of skipBytes(int) | Blocker | io | Nigel Daley | Hairong Kuang |
| [HADOOP-1358](https://issues.apache.org/jira/browse/HADOOP-1358) | seek call ignores result of skipBytes(int) | Blocker | . | Nigel Daley | Hairong Kuang |
| [HADOOP-1356](https://issues.apache.org/jira/browse/HADOOP-1356) | ValueHistogram.addNextValue(Object) ignores return value of String.substring(int, int) | Blocker | . | Nigel Daley | Runping Qi |
| [HADOOP-1354](https://issues.apache.org/jira/browse/HADOOP-1354) | Null pointer dereference of paths in FsShell.dus(String) | Blocker | fs | Nigel Daley | Hairong Kuang |
| [HADOOP-1353](https://issues.apache.org/jira/browse/HADOOP-1353) | Null pointer dereference of nodeInfo in FSNamesystem.removeDatanode(DatanodeID) | Blocker | . | Nigel Daley | dhruba borthakur |
| [HADOOP-1350](https://issues.apache.org/jira/browse/HADOOP-1350) | Shuffle started taking a very long time after the HADOOP-1176 fix | Blocker | . | Devaraj Das | Devaraj Das |
| [HADOOP-1345](https://issues.apache.org/jira/browse/HADOOP-1345) | Checksum object does not get restored to the old state in retries when handling ChecksumException | Blocker | . | Hairong Kuang | Hairong Kuang |
| [HADOOP-1332](https://issues.apache.org/jira/browse/HADOOP-1332) | Sporadic unit test failures (TestMiniMRClasspath, TestMiniMRLocalFS, TestMiniMRDFSCaching) | Blocker | . | Nigel Daley | Arun C Murthy |
| [HADOOP-1322](https://issues.apache.org/jira/browse/HADOOP-1322) | Tasktracker blacklist leads to hung jobs in single-node cluster | Critical | . | Arun C Murthy | Arun C Murthy |
| [HADOOP-1312](https://issues.apache.org/jira/browse/HADOOP-1312) | heartbeat monitor thread goes away | Blocker | . | dhruba borthakur | dhruba borthakur |
| [HADOOP-1310](https://issues.apache.org/jira/browse/HADOOP-1310) | Fix unchecked warnings in aggregate code | Major | . | Tom White | Tom White |
| [HADOOP-1299](https://issues.apache.org/jira/browse/HADOOP-1299) | Once RPC.stopClient has been called, RPC can not be used again | Minor | ipc | stack | stack |
| [HADOOP-1297](https://issues.apache.org/jira/browse/HADOOP-1297) | datanode sending block reports to namenode once every second | Major | . | dhruba borthakur | dhruba borthakur |
| [HADOOP-1294](https://issues.apache.org/jira/browse/HADOOP-1294) | Fix unchecked warnings in main Hadoop code under Java 6. | Major | test | Tom White | Tom White |
| [HADOOP-1293](https://issues.apache.org/jira/browse/HADOOP-1293) | stderr from streaming skipped after first 20 lines. | Minor | . | Koji Noguchi | Koji Noguchi |
| [HADOOP-1279](https://issues.apache.org/jira/browse/HADOOP-1279) | list of completed jobs purges jobs based on submission not on completion age | Major | . | Alejandro Abdelnur | Arun C Murthy |
| [HADOOP-1278](https://issues.apache.org/jira/browse/HADOOP-1278) | Fix the per-job tasktracker 'blacklist' | Major | . | Arun C Murthy | Arun C Murthy |
| [HADOOP-1275](https://issues.apache.org/jira/browse/HADOOP-1275) | job notification property in hadoop-default.xml is misspelled | Trivial | . | Alejandro Abdelnur | |
| [HADOOP-1272](https://issues.apache.org/jira/browse/HADOOP-1272) | Extract InnerClasses from FSNamesystem into separate classes | Major | . | dhruba borthakur | dhruba borthakur |
| [HADOOP-1271](https://issues.apache.org/jira/browse/HADOOP-1271) | The StreamBaseRecordReader is unable to log record data that's not UTF-8 | Minor | . | Gautam Kowshik | Arun C Murthy |
| [HADOOP-1262](https://issues.apache.org/jira/browse/HADOOP-1262) | file corruption detected because dfs client does not use replica blocks for checksum file | Major | . | dhruba borthakur | Hairong Kuang |
| [HADOOP-1258](https://issues.apache.org/jira/browse/HADOOP-1258) | TestCheckpoint test case doesn't wait for MiniDFSCluster to be active | Trivial | test | Nigel Daley | Nigel Daley |
| [HADOOP-1256](https://issues.apache.org/jira/browse/HADOOP-1256) | Dfs image loading and edits loading creates multiple instances of DatanodeDescriptor for the same datanode | Major | . | Hairong Kuang | Hairong Kuang |
| [HADOOP-1255](https://issues.apache.org/jira/browse/HADOOP-1255) | Name-node falls into infinite loop trying to remove a dead node. | Blocker | . | Konstantin Shvachko | Hairong Kuang |
| [HADOOP-1253](https://issues.apache.org/jira/browse/HADOOP-1253) | ConcurrentModificationException and NPE in JobControl | Minor | . | Johan Oskarsson | Johan Oskarsson |
| [HADOOP-1252](https://issues.apache.org/jira/browse/HADOOP-1252) | Disk problems should be handled better by the MR framework | Major | . | Devaraj Das | Devaraj Das |
| [HADOOP-1244](https://issues.apache.org/jira/browse/HADOOP-1244) | stop-dfs.sh incorrectly specifies slaves file for stopping datanode | Minor | . | Michael Bieniosek | dhruba borthakur |
| [HADOOP-1243](https://issues.apache.org/jira/browse/HADOOP-1243) | ClientProtocol.versionID should be 11 | Major | . | Konstantin Shvachko | dhruba borthakur |
| [HADOOP-1242](https://issues.apache.org/jira/browse/HADOOP-1242) | dfs upgrade/downgrade problems | Blocker | . | Owen O'Malley | Konstantin Shvachko |
| [HADOOP-1241](https://issues.apache.org/jira/browse/HADOOP-1241) | NullPointerException in processReport when namenode is restarted | Major | . | dhruba borthakur | dhruba borthakur |
| [HADOOP-1239](https://issues.apache.org/jira/browse/HADOOP-1239) | Classes in src/test/testjar need package name | Trivial | test | Jim Kellerman | Jim Kellerman |
| [HADOOP-1238](https://issues.apache.org/jira/browse/HADOOP-1238) | maps\_running metric is only updated at the end of the task | Minor | metrics | Michael Bieniosek | David Bowen |
| [HADOOP-1224](https://issues.apache.org/jira/browse/HADOOP-1224) | "Browse the filesystem" link pointing to a dead data-node | Major | . | Konstantin Shvachko | Enis Soztutar |
| [HADOOP-1219](https://issues.apache.org/jira/browse/HADOOP-1219) | Spurious progress messages should be discarded after a task is done | Major | . | Devaraj Das | Devaraj Das |
| [HADOOP-1218](https://issues.apache.org/jira/browse/HADOOP-1218) | In TaskTracker the access to RunningJob object is not synchronized in one place | Major | . | Devaraj Das | Devaraj Das |
| [HADOOP-1211](https://issues.apache.org/jira/browse/HADOOP-1211) | Remove deprecated constructor and unused static members in DataNode class | Major | . | Konstantin Shvachko | Konstantin Shvachko |
| [HADOOP-1205](https://issues.apache.org/jira/browse/HADOOP-1205) | The open method of FSNamesystem should be synchronized | Blocker | . | Hairong Kuang | Hairong Kuang |
| [HADOOP-1204](https://issues.apache.org/jira/browse/HADOOP-1204) | Re-factor InputFormat/RecordReader related classes | Major | . | Runping Qi | Runping Qi |
| [HADOOP-1203](https://issues.apache.org/jira/browse/HADOOP-1203) | UpgradeUtilities should use MiniDFSCluster to start and stop NameNode/DataNodes | Major | test | Nigel Daley | Nigel Daley |
| [HADOOP-1200](https://issues.apache.org/jira/browse/HADOOP-1200) | Datanode should periodically do a disk check | Blocker | . | Hairong Kuang | Hairong Kuang |
| [HADOOP-1198](https://issues.apache.org/jira/browse/HADOOP-1198) | ipc.client.timeout of 2000ms for test cases seems too small; causes too many timeouts and leads to hung test cases | Major | test | Arun C Murthy | Arun C Murthy |
| [HADOOP-1189](https://issues.apache.org/jira/browse/HADOOP-1189) | Still seeing some unexpected 'No space left on device' exceptions | Major | . | Raghu Angadi | Raghu Angadi |
| [HADOOP-1187](https://issues.apache.org/jira/browse/HADOOP-1187) | DFS Scalability: avoid scanning entire list of datanodes in getAdditionalBlocks | Major | . | dhruba borthakur | dhruba borthakur |
| [HADOOP-1184](https://issues.apache.org/jira/browse/HADOOP-1184) | Decommission fails if a block that needs replication has only one replica | Major | . | dhruba borthakur | |
| [HADOOP-1178](https://issues.apache.org/jira/browse/HADOOP-1178) | NullPointer Exception in org.apache.hadoop.dfs.NameNode.isDir on namenode restart | Major | . | dhruba borthakur | dhruba borthakur |
| [HADOOP-1176](https://issues.apache.org/jira/browse/HADOOP-1176) | Reduce hang on huge map output | Major | . | Hairong Kuang | Arun C Murthy |
| [HADOOP-1170](https://issues.apache.org/jira/browse/HADOOP-1170) | Very high CPU usage on data nodes because of FSDataset.checkDataDir() on every connect | Major | . | Igor Bolotin | |
| [HADOOP-1169](https://issues.apache.org/jira/browse/HADOOP-1169) | CopyFiles skips src files of s3 urls | Minor | util | stack | |
| [HADOOP-1164](https://issues.apache.org/jira/browse/HADOOP-1164) | TestReplicationPolicy doesn't use port 0 for the NameNode | Major | test | Owen O'Malley | Owen O'Malley |
| [HADOOP-1163](https://issues.apache.org/jira/browse/HADOOP-1163) | Ganglia metrics reporting is misconfigured | Minor | metrics | Michael Bieniosek | |
| [HADOOP-1160](https://issues.apache.org/jira/browse/HADOOP-1160) | DistributedFileSystem doesn't close the RawDistributedFileSystem on close. | Blocker | . | Owen O'Malley | Hairong Kuang |
| [HADOOP-1156](https://issues.apache.org/jira/browse/HADOOP-1156) | NullPointerException in MiniDFSCluster | Major | . | Konstantin Shvachko | Hairong Kuang |
| [HADOOP-1154](https://issues.apache.org/jira/browse/HADOOP-1154) | streaming hang. (PipeMapRed$MROutputThread gone) | Major | . | Koji Noguchi | Koji Noguchi |
| [HADOOP-1153](https://issues.apache.org/jira/browse/HADOOP-1153) | DataNode and FSNamesystem don't shutdown cleanly | Major | . | Nigel Daley | Konstantin Shvachko |
| [HADOOP-1152](https://issues.apache.org/jira/browse/HADOOP-1152) | Reduce task hang failing in MapOutputCopier.copyOutput | Major | . | Koji Noguchi | Tahir Hashmi |
| [HADOOP-1151](https://issues.apache.org/jira/browse/HADOOP-1151) | streaming PipeMapRed prints system info to stderr | Trivial | . | Koji Noguchi | Koji Noguchi |
| [HADOOP-1149](https://issues.apache.org/jira/browse/HADOOP-1149) | DFS Scalability: high cpu usage in addStoredBlock | Major | . | dhruba borthakur | Raghu Angadi |
| [HADOOP-1146](https://issues.apache.org/jira/browse/HADOOP-1146) | "Reduce input records" counter name is misleading | Major | . | David Bowen | David Bowen |
| [HADOOP-1137](https://issues.apache.org/jira/browse/HADOOP-1137) | StatusHttpServer assumes that resources for /static are in files | Major | . | Benjamin Reed | |
| [HADOOP-1136](https://issues.apache.org/jira/browse/HADOOP-1136) | exception in UnderReplicatedBlocks:add when there are more replicas of a block than required | Major | . | dhruba borthakur | Hairong Kuang |
| [HADOOP-1122](https://issues.apache.org/jira/browse/HADOOP-1122) | Divide-by-zero exception in chooseTarget | Major | . | dhruba borthakur | dhruba borthakur |
| [HADOOP-1114](https://issues.apache.org/jira/browse/HADOOP-1114) | bin/hadoop script clobbers CLASSPATH | Minor | scripts | Michael Bieniosek | Doug Cutting |
| [HADOOP-1110](https://issues.apache.org/jira/browse/HADOOP-1110) | JobTracker WebUI "Map input records" a little off. | Trivial | . | Koji Noguchi | David Bowen |
| [HADOOP-1093](https://issues.apache.org/jira/browse/HADOOP-1093) | NNBench generates millions of NotReplicatedYetException in Namenode log | Major | . | Nigel Daley | dhruba borthakur |
| [HADOOP-1090](https://issues.apache.org/jira/browse/HADOOP-1090) | In SortValidator, the check for whether a file belongs to sort-input or sort-output dir is weak | Major | . | Devaraj Das | Arun C Murthy |
| [HADOOP-1085](https://issues.apache.org/jira/browse/HADOOP-1085) | Remove 'port rolling' from Mini{DFS\|MR}Cluster | Major | test | Arun C Murthy | Arun C Murthy |
| [HADOOP-1081](https://issues.apache.org/jira/browse/HADOOP-1081) | JAVA\_PLATFORM with spaces (i.e. Mac OS X-ppc-32) breaks bin/hadoop script | Major | scripts | Andrzej Bialecki | |
| [HADOOP-1073](https://issues.apache.org/jira/browse/HADOOP-1073) | DFS Scalability: high CPU usage in choosing replication targets and file open | Major | . | dhruba borthakur | Hairong Kuang |
| [HADOOP-1071](https://issues.apache.org/jira/browse/HADOOP-1071) | RPC$VersionMismatch exception is not fatal to JobTracker | Major | . | Nigel Daley | Tahir Hashmi |
| [HADOOP-1064](https://issues.apache.org/jira/browse/HADOOP-1064) | dfsclient logging messages should have appropriate log levels | Major | . | dhruba borthakur | dhruba borthakur |
| [HADOOP-1063](https://issues.apache.org/jira/browse/HADOOP-1063) | MiniDFSCluster exists a race condition that lead to data node resources are not properly released | Major | test | Hairong Kuang | Hairong Kuang |
| [HADOOP-1061](https://issues.apache.org/jira/browse/HADOOP-1061) | S3 listSubPaths bug | Critical | fs | Mike Smith | |
| [HADOOP-1050](https://issues.apache.org/jira/browse/HADOOP-1050) | Do not count lost tasktracker against the job | Major | . | Arun C Murthy | Arun C Murthy |
| [HADOOP-1047](https://issues.apache.org/jira/browse/HADOOP-1047) | TestReplication fails because DFS does not guarantee all the replicas are placed when a file is closed | Major | . | Hairong Kuang | Hairong Kuang |
| [HADOOP-1011](https://issues.apache.org/jira/browse/HADOOP-1011) | ConcurrentModificationException in JobHistory | Major | . | Nigel Daley | Tahir Hashmi |
| [HADOOP-1001](https://issues.apache.org/jira/browse/HADOOP-1001) | the output of the map is not type checked against the specified types | Major | . | Owen O'Malley | Tahir Hashmi |
| [HADOOP-672](https://issues.apache.org/jira/browse/HADOOP-672) | dfs shell enhancements | Minor | . | Yoram Arnon | dhruba borthakur |
### TESTS:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
### SUB-TASKS:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
### OTHER:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |


@ -0,0 +1,31 @@
<!---
# Licensed to the Apache Software Foundation (ASF) under one
# or more contributor license agreements. See the NOTICE file
# distributed with this work for additional information
# regarding copyright ownership. The ASF licenses this file
# to you under the Apache License, Version 2.0 (the
# "License"); you may not use this file except in compliance
# with the License. You may obtain a copy of the License at
#
# http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing, software
# distributed under the License is distributed on an "AS IS" BASIS,
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# limitations under the License.
-->
# Apache Hadoop 0.13.0 Release Notes
These release notes cover new developer and user-facing incompatibilities, features, and major improvements.

---
* [HADOOP-1063](https://issues.apache.org/jira/browse/HADOOP-1063) | *Major* | **MiniDFSCluster exists a race condition that lead to data node resources are not properly released**
Resolved a race condition during MiniDFSCluster data node shutdown that prevented resources from being released properly.


@ -0,0 +1,64 @@
<!---
# Licensed to the Apache Software Foundation (ASF) under one
# or more contributor license agreements. See the NOTICE file
# distributed with this work for additional information
# regarding copyright ownership. The ASF licenses this file
# to you under the Apache License, Version 2.0 (the
# "License"); you may not use this file except in compliance
# with the License. You may obtain a copy of the License at
#
# http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing, software
# distributed under the License is distributed on an "AS IS" BASIS,
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# limitations under the License.
-->
# Apache Hadoop Changelog
## Release 0.13.1 - Unreleased
### INCOMPATIBLE CHANGES:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
### NEW FEATURES:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
### IMPROVEMENTS:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
### BUG FIXES:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
### TESTS:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
### SUB-TASKS:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
### OTHER:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |


@ -0,0 +1,24 @@
<!---
# Licensed to the Apache Software Foundation (ASF) under one
# or more contributor license agreements. See the NOTICE file
# distributed with this work for additional information
# regarding copyright ownership. The ASF licenses this file
# to you under the Apache License, Version 2.0 (the
# "License"); you may not use this file except in compliance
# with the License. You may obtain a copy of the License at
#
# http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing, software
# distributed under the License is distributed on an "AS IS" BASIS,
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# limitations under the License.
-->
# Apache Hadoop 0.13.1 Release Notes
These release notes cover new developer and user-facing incompatibilities, features, and major improvements.


@ -0,0 +1,224 @@
<!---
# Licensed to the Apache Software Foundation (ASF) under one
# or more contributor license agreements. See the NOTICE file
# distributed with this work for additional information
# regarding copyright ownership. The ASF licenses this file
# to you under the Apache License, Version 2.0 (the
# "License"); you may not use this file except in compliance
# with the License. You may obtain a copy of the License at
#
# http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing, software
# distributed under the License is distributed on an "AS IS" BASIS,
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# limitations under the License.
-->
# Apache Hadoop Changelog
## Release 0.14.0 - 2007-08-20
### INCOMPATIBLE CHANGES:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
### NEW FEATURES:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
| [HADOOP-1597](https://issues.apache.org/jira/browse/HADOOP-1597) | Distributed upgrade status reporting and post upgrade features. | Blocker | . | Konstantin Shvachko | Konstantin Shvachko |
| [HADOOP-1570](https://issues.apache.org/jira/browse/HADOOP-1570) | Add a per-job configuration knob to control loading of native hadoop libraries | Major | io | Arun C Murthy | Arun C Murthy |
| [HADOOP-1568](https://issues.apache.org/jira/browse/HADOOP-1568) | NameNode Schema for HttpFileSystem | Major | fs | Chris Douglas | Chris Douglas |
| [HADOOP-1562](https://issues.apache.org/jira/browse/HADOOP-1562) | Report Java VM metrics | Major | metrics | David Bowen | David Bowen |
| [HADOOP-1515](https://issues.apache.org/jira/browse/HADOOP-1515) | MultiFileSplit, MultiFileInputFormat | Major | . | Enis Soztutar | Enis Soztutar |
| [HADOOP-1508](https://issues.apache.org/jira/browse/HADOOP-1508) | ant Task for FsShell operations | Minor | build, fs | Chris Douglas | Chris Douglas |
| [HADOOP-1469](https://issues.apache.org/jira/browse/HADOOP-1469) | Asynchronous table creation | Minor | . | James Kennedy | stack |
| [HADOOP-1447](https://issues.apache.org/jira/browse/HADOOP-1447) | Support for textInputFormat in contrib/data\_join | Minor | . | Senthil Subramanian | Senthil Subramanian |
| [HADOOP-1437](https://issues.apache.org/jira/browse/HADOOP-1437) | Eclipse plugin for developing and executing MapReduce programs on Hadoop | Major | . | Eugene Hung | Christophe Taton |
| [HADOOP-1433](https://issues.apache.org/jira/browse/HADOOP-1433) | Add job priority | Minor | . | Johan Oskarsson | Johan Oskarsson |
| [HADOOP-1379](https://issues.apache.org/jira/browse/HADOOP-1379) | Integrate Findbugs into nightly build process | Major | test | Nigel Daley | Nigel Daley |
| [HADOOP-1377](https://issues.apache.org/jira/browse/HADOOP-1377) | Creation time and modification time for hadoop files and directories | Major | . | dhruba borthakur | dhruba borthakur |
| [HADOOP-1134](https://issues.apache.org/jira/browse/HADOOP-1134) | Block level CRCs in HDFS | Major | . | Raghu Angadi | Raghu Angadi |
| [HADOOP-234](https://issues.apache.org/jira/browse/HADOOP-234) | Hadoop Pipes for writing map/reduce jobs in C++ and python | Major | . | Sanjay Dahiya | Owen O'Malley |
### IMPROVEMENTS:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
| [HADOOP-1653](https://issues.apache.org/jira/browse/HADOOP-1653) | FSDirectory class code cleanup | Trivial | . | Christophe Taton | Christophe Taton |
| [HADOOP-1620](https://issues.apache.org/jira/browse/HADOOP-1620) | FileSystem should have fewer abstract methods | Major | . | Doug Cutting | Doug Cutting |
| [HADOOP-1585](https://issues.apache.org/jira/browse/HADOOP-1585) | GenericWritable should use generics | Minor | io | Espen Amble Kolstad | Espen Amble Kolstad |
| [HADOOP-1582](https://issues.apache.org/jira/browse/HADOOP-1582) | hdfsRead and hdfsPread should return 0 instead of -1 at end-of-file. | Minor | . | Christian Kunz | Christian Kunz |
| [HADOOP-1580](https://issues.apache.org/jira/browse/HADOOP-1580) | provide better error message when subprocesses fail in hadoop streaming | Minor | . | John Heidemann | John Heidemann |
| [HADOOP-1547](https://issues.apache.org/jira/browse/HADOOP-1547) | Provide examples for aggregate library | Major | . | Tom White | Runping Qi |
| [HADOOP-1518](https://issues.apache.org/jira/browse/HADOOP-1518) | Add session id to metric data | Major | . | David Bowen | David Bowen |
| [HADOOP-1485](https://issues.apache.org/jira/browse/HADOOP-1485) | Metrics should be there for reporting shuffle failures/successes | Major | . | Devaraj Das | Devaraj Das |
| [HADOOP-1484](https://issues.apache.org/jira/browse/HADOOP-1484) | Kill jobs from web interface | Minor | . | Enis Soztutar | Enis Soztutar |
| [HADOOP-1478](https://issues.apache.org/jira/browse/HADOOP-1478) | The blockStream of DFSClient.FSInputStream should not be buffered | Major | . | Hairong Kuang | Hairong Kuang |
| [HADOOP-1473](https://issues.apache.org/jira/browse/HADOOP-1473) | Make jobids unique across jobtracker restarts | Major | . | Owen O'Malley | Owen O'Malley |
| [HADOOP-1470](https://issues.apache.org/jira/browse/HADOOP-1470) | Rework FSInputChecker and FSOutputSummer to support checksum code sharing between ChecksumFileSystem and block level crc dfs | Major | fs | Hairong Kuang | Hairong Kuang |
| [HADOOP-1467](https://issues.apache.org/jira/browse/HADOOP-1467) | Remove redundant counters from WordCount example | Major | . | Owen O'Malley | Owen O'Malley |
| [HADOOP-1462](https://issues.apache.org/jira/browse/HADOOP-1462) | Better progress reporting from a Task | Major | . | Vivek Ratan | Vivek Ratan |
| [HADOOP-1457](https://issues.apache.org/jira/browse/HADOOP-1457) | Counters for monitoring task assignments | Minor | . | Devaraj Das | Arun C Murthy |
| [HADOOP-1455](https://issues.apache.org/jira/browse/HADOOP-1455) | Allow any key-value pair on the command line of 'hadoop pipes' to be added to the JobConf | Major | . | Christian Kunz | Devaraj Das |
| [HADOOP-1450](https://issues.apache.org/jira/browse/HADOOP-1450) | checksums should be closer to data generation and consumption | Major | fs | Doug Cutting | Doug Cutting |
| [HADOOP-1440](https://issues.apache.org/jira/browse/HADOOP-1440) | JobClient should not sort input-splits | Major | . | Milind Bhandarkar | Senthil Subramanian |
| [HADOOP-1438](https://issues.apache.org/jira/browse/HADOOP-1438) | Grammatical / wording / copy edits for Hadoop Distributed File System: Architecture and Design white paper | Trivial | documentation | Luke Nezda | |
| [HADOOP-1429](https://issues.apache.org/jira/browse/HADOOP-1429) | RPC Server won't go quietly | Minor | ipc | stack | stack |
| [HADOOP-1417](https://issues.apache.org/jira/browse/HADOOP-1417) | Exclude some Findbugs detectors | Minor | build | Nigel Daley | Nigel Daley |
| [HADOOP-1413](https://issues.apache.org/jira/browse/HADOOP-1413) | A new example to do tile placements using distributed dancing links | Major | . | Owen O'Malley | Owen O'Malley |
| [HADOOP-1408](https://issues.apache.org/jira/browse/HADOOP-1408) | fix warning about cast of Map\<String, Map\<String, JobInfo\>\> in jobhistory.jsp | Major | . | Owen O'Malley | Owen O'Malley |
| [HADOOP-1376](https://issues.apache.org/jira/browse/HADOOP-1376) | RandomWriter should be tweaked to generate input data for terasort | Major | . | Devaraj Das | Devaraj Das |
| [HADOOP-1343](https://issues.apache.org/jira/browse/HADOOP-1343) | Deprecate the Configuration.set(String,Object) method and make Configuration Iterable | Major | conf | Owen O'Malley | Owen O'Malley |
| [HADOOP-1342](https://issues.apache.org/jira/browse/HADOOP-1342) | A configurable limit on the number of unique values should be set on the UniqueValueCount and ValueHistogram aggregators | Major | . | Runping Qi | Runping Qi |
| [HADOOP-1340](https://issues.apache.org/jira/browse/HADOOP-1340) | md5 file in filecache should inherit replication factor from the file it belongs to. | Major | . | Christian Kunz | dhruba borthakur |
| [HADOOP-1320](https://issues.apache.org/jira/browse/HADOOP-1320) | Rewrite 'random-writer' to use '-reducer NONE' | Minor | . | Arun C Murthy | Arun C Murthy |
| [HADOOP-1292](https://issues.apache.org/jira/browse/HADOOP-1292) | dfs -copyToLocal should guarantee file is complete | Major | . | eric baldeschwieler | Tsz Wo Nicholas Sze |
| [HADOOP-1286](https://issues.apache.org/jira/browse/HADOOP-1286) | Distributed cluster upgrade | Major | . | Konstantin Shvachko | Konstantin Shvachko |
| [HADOOP-1283](https://issues.apache.org/jira/browse/HADOOP-1283) | Eliminate internal UTF8 to String and vice versa conversions in the name-node. | Major | . | Konstantin Shvachko | Konstantin Shvachko |
| [HADOOP-1147](https://issues.apache.org/jira/browse/HADOOP-1147) | remove all @author tags from source | Minor | . | Doug Cutting | Doug Cutting |
| [HADOOP-1066](https://issues.apache.org/jira/browse/HADOOP-1066) | http://lucene.apache.org/hadoop/ front page is not user-friendly | Minor | documentation | Marco Nicosia | Doug Cutting |
| [HADOOP-1028](https://issues.apache.org/jira/browse/HADOOP-1028) | Servers should log startup and shutdown messages | Major | . | Nigel Daley | Tsz Wo Nicholas Sze |
| [HADOOP-1023](https://issues.apache.org/jira/browse/HADOOP-1023) | better links to mailing list archives | Major | documentation | Daniel Naber | Tom White |
| [HADOOP-1003](https://issues.apache.org/jira/browse/HADOOP-1003) | Proposal to batch commits to edits log. | Major | . | Raghu Angadi | dhruba borthakur |
| [HADOOP-894](https://issues.apache.org/jira/browse/HADOOP-894) | dfs client protocol should allow asking for parts of the block map | Major | . | Owen O'Malley | Konstantin Shvachko |
| [HADOOP-800](https://issues.apache.org/jira/browse/HADOOP-800) | More improvements to DFS browsing WI | Major | . | arkady borkovsky | Enis Soztutar |
### BUG FIXES:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
| [HADOOP-1717](https://issues.apache.org/jira/browse/HADOOP-1717) | TestDFSUpgradeFromImage fails on Solaris | Blocker | test | Nigel Daley | Raghu Angadi |
| [HADOOP-1716](https://issues.apache.org/jira/browse/HADOOP-1716) | TestPipes.testPipes fails | Blocker | . | Nigel Daley | Owen O'Malley |
| [HADOOP-1714](https://issues.apache.org/jira/browse/HADOOP-1714) | TestDFSUpgradeFromImage fails on Windows | Blocker | test | Nigel Daley | Raghu Angadi |
| [HADOOP-1712](https://issues.apache.org/jira/browse/HADOOP-1712) | Unhandled exception in Block CRC upgrade on datanode. | Blocker | . | Raghu Angadi | Raghu Angadi |
| [HADOOP-1698](https://issues.apache.org/jira/browse/HADOOP-1698) | 7500+ reducers/partitions causes job to hang | Blocker | . | Srikanth Kakani | Devaraj Das |
| [HADOOP-1681](https://issues.apache.org/jira/browse/HADOOP-1681) | Re organize StreamJob::submitAndMonitorJob() Exception handling | Minor | . | Lohit Vijayarenu | |
| [HADOOP-1680](https://issues.apache.org/jira/browse/HADOOP-1680) | Improvements to Block CRC upgrade messages | Blocker | . | Raghu Angadi | Raghu Angadi |
| [HADOOP-1668](https://issues.apache.org/jira/browse/HADOOP-1668) | add INCOMPATIBLE CHANGES section to CHANGES.txt for Hadoop 0.14 | Blocker | documentation | Nigel Daley | Nigel Daley |
| [HADOOP-1666](https://issues.apache.org/jira/browse/HADOOP-1666) | The FsShell Object cannot be used for multiple fs commands. | Minor | fs | dhruba borthakur | dhruba borthakur |
| [HADOOP-1665](https://issues.apache.org/jira/browse/HADOOP-1665) | DFS Trash feature bugs | Blocker | . | Nigel Daley | dhruba borthakur |
| [HADOOP-1663](https://issues.apache.org/jira/browse/HADOOP-1663) | streaming returning 0 when submitJob fails with Exception | Major | . | Koji Noguchi | Koji Noguchi |
| [HADOOP-1659](https://issues.apache.org/jira/browse/HADOOP-1659) | job id / job name mix-up | Blocker | . | Christian Kunz | Arun C Murthy |
| [HADOOP-1657](https://issues.apache.org/jira/browse/HADOOP-1657) | NNBench benchmark hangs with trunk | Blocker | . | Raghu Angadi | Raghu Angadi |
| [HADOOP-1649](https://issues.apache.org/jira/browse/HADOOP-1649) | Performance regression with Block CRCs | Blocker | . | Raghu Angadi | Raghu Angadi |
| [HADOOP-1647](https://issues.apache.org/jira/browse/HADOOP-1647) | DistributedFileSystem.getFileStatus() fails for path "/" | Blocker | . | Enis Soztutar | dhruba borthakur |
| [HADOOP-1640](https://issues.apache.org/jira/browse/HADOOP-1640) | TestDecommission fails on Windows | Blocker | . | Nigel Daley | dhruba borthakur |
| [HADOOP-1639](https://issues.apache.org/jira/browse/HADOOP-1639) | TestSymLink is failing fairly often and is blocking the regression | Major | . | Owen O'Malley | Mahadev konar |
| [HADOOP-1638](https://issues.apache.org/jira/browse/HADOOP-1638) | Master node unable to bind to DNS hostname | Minor | contrib/cloud | Stu Hood | |
| [HADOOP-1635](https://issues.apache.org/jira/browse/HADOOP-1635) | Keypair Name Hardcoded | Minor | contrib/cloud | Stu Hood | |
| [HADOOP-1634](https://issues.apache.org/jira/browse/HADOOP-1634) | EC2 launch-hadoop-cluster awk Problem | Minor | contrib/cloud | Stu Hood | |
| [HADOOP-1632](https://issues.apache.org/jira/browse/HADOOP-1632) | IllegalArgumentException in fsck | Blocker | . | Konstantin Shvachko | Hairong Kuang |
| [HADOOP-1625](https://issues.apache.org/jira/browse/HADOOP-1625) | "could not move files" exception in DataXceiver | Blocker | . | Konstantin Shvachko | Raghu Angadi |
| [HADOOP-1624](https://issues.apache.org/jira/browse/HADOOP-1624) | Unknown op code exception in DataXceiver. | Blocker | . | Konstantin Shvachko | Raghu Angadi |
| [HADOOP-1623](https://issues.apache.org/jira/browse/HADOOP-1623) | dfs -cp infinite loop creating sub-directories | Blocker | . | Koji Noguchi | dhruba borthakur |
| [HADOOP-1619](https://issues.apache.org/jira/browse/HADOOP-1619) | FSInputChecker attempts to seek past EOF | Blocker | fs | Nigel Daley | Hairong Kuang |
| [HADOOP-1613](https://issues.apache.org/jira/browse/HADOOP-1613) | The dfs webui (dfshealth) shows "Last Contact" as a negative number | Minor | . | dhruba borthakur | dhruba borthakur |
| [HADOOP-1603](https://issues.apache.org/jira/browse/HADOOP-1603) | Replication gets set to 1 sometimes when Namenode restarted. | Blocker | . | Raghu Angadi | Raghu Angadi |
| [HADOOP-1599](https://issues.apache.org/jira/browse/HADOOP-1599) | TestCopyFiles with IllegalArgumentException on Windows | Blocker | fs | Nigel Daley | Senthil Subramanian |
| [HADOOP-1596](https://issues.apache.org/jira/browse/HADOOP-1596) | TestSymLink is failing | Blocker | . | Doug Cutting | Owen O'Malley |
| [HADOOP-1590](https://issues.apache.org/jira/browse/HADOOP-1590) | Jobtracker web interface contains several absolute href links instead of relative ones | Major | . | Thomas Friol | |
| [HADOOP-1587](https://issues.apache.org/jira/browse/HADOOP-1587) | Tasks run by MiniMRCluster don't get sysprops from TestCases | Blocker | test | Alejandro Abdelnur | Devaraj Das |
| [HADOOP-1584](https://issues.apache.org/jira/browse/HADOOP-1584) | Bug in readFields of GenericWritable | Minor | io | Espen Amble Kolstad | Espen Amble Kolstad |
| [HADOOP-1578](https://issues.apache.org/jira/browse/HADOOP-1578) | Data-nodes should send storage ID to the name-node during registration | Blocker | . | Konstantin Shvachko | Konstantin Shvachko |
| [HADOOP-1576](https://issues.apache.org/jira/browse/HADOOP-1576) | web interface inconsistencies when using speculative execution | Blocker | . | Christian Kunz | Arun C Murthy |
| [HADOOP-1564](https://issues.apache.org/jira/browse/HADOOP-1564) | Write unit tests to detect CRC corruption | Major | . | dhruba borthakur | dhruba borthakur |
| [HADOOP-1556](https://issues.apache.org/jira/browse/HADOOP-1556) | 9 unit test failures: file.out.index already exists | Major | . | Nigel Daley | Devaraj Das |
| [HADOOP-1554](https://issues.apache.org/jira/browse/HADOOP-1554) | Fix the JobHistory to display things like the number of nodes the job ran on, the number of killed/failed tasks | Major | . | Devaraj Das | Devaraj Das |
| [HADOOP-1553](https://issues.apache.org/jira/browse/HADOOP-1553) | Extensive logging of C++ application can slow down task by an order of magnitude | Blocker | . | Christian Kunz | Owen O'Malley |
| [HADOOP-1551](https://issues.apache.org/jira/browse/HADOOP-1551) | libhdfs API is out of sync with Filesystem API | Blocker | . | Christian Kunz | Sameer Paranjpye |
| [HADOOP-1546](https://issues.apache.org/jira/browse/HADOOP-1546) | The DFS WebUI shows an incorrect column for file Creation Time | Major | . | dhruba borthakur | dhruba borthakur |
| [HADOOP-1542](https://issues.apache.org/jira/browse/HADOOP-1542) | Incorrect task/tip being scheduled (looks like speculative execution) | Blocker | . | Nigel Daley | Owen O'Malley |
| [HADOOP-1536](https://issues.apache.org/jira/browse/HADOOP-1536) | libhdfs tests failing | Blocker | . | Nigel Daley | dhruba borthakur |
| [HADOOP-1535](https://issues.apache.org/jira/browse/HADOOP-1535) | Wrong comparator used to merge files in Reduce phase | Major | . | Vivek Ratan | Vivek Ratan |
| [HADOOP-1524](https://issues.apache.org/jira/browse/HADOOP-1524) | Task Logs userlogs don't show up for a while | Major | . | Michael Bieniosek | Michael Bieniosek |
| [HADOOP-1520](https://issues.apache.org/jira/browse/HADOOP-1520) | IndexOutOfBoundsException in FSEditLog.processIOError | Blocker | . | Nigel Daley | dhruba borthakur |
| [HADOOP-1517](https://issues.apache.org/jira/browse/HADOOP-1517) | Three methods in FSNamesystem should not be synchronized. | Critical | . | Konstantin Shvachko | Konstantin Shvachko |
| [HADOOP-1514](https://issues.apache.org/jira/browse/HADOOP-1514) | Progress reporting not handled for the case where a reducer currently doesn't have anything to fetch | Blocker | . | Devaraj Das | Vivek Ratan |
| [HADOOP-1513](https://issues.apache.org/jira/browse/HADOOP-1513) | A likely race condition between the creation of a directory and checking for its existence in the DiskChecker class | Critical | fs | Devaraj Das | Devaraj Das |
| [HADOOP-1512](https://issues.apache.org/jira/browse/HADOOP-1512) | TestTextInputFormat fails on Windows | Major | . | Konstantin Shvachko | |
| [HADOOP-1504](https://issues.apache.org/jira/browse/HADOOP-1504) | terminate-hadoop-cluster may be overzealous | Blocker | fs/s3 | Doug Cutting | Tom White |
| [HADOOP-1503](https://issues.apache.org/jira/browse/HADOOP-1503) | Fix for broken build by HADOOP-1498 | Major | . | stack | stack |
| [HADOOP-1501](https://issues.apache.org/jira/browse/HADOOP-1501) | Block reports from all datanodes arrive at the namenode within a small band of time | Major | . | dhruba borthakur | dhruba borthakur |
| [HADOOP-1493](https://issues.apache.org/jira/browse/HADOOP-1493) | possible double setting of java.library.path introduced by HADOOP-838 | Major | . | Enis Soztutar | Enis Soztutar |
| [HADOOP-1492](https://issues.apache.org/jira/browse/HADOOP-1492) | DataNode version mismatch during handshake() causes NullPointerException. | Major | . | Konstantin Shvachko | Konstantin Shvachko |
| [HADOOP-1489](https://issues.apache.org/jira/browse/HADOOP-1489) | Input file get truncated for text files with \r\n | Major | io | Bwolen Yang | |
| [HADOOP-1486](https://issues.apache.org/jira/browse/HADOOP-1486) | ReplicationMonitor thread goes away | Blocker | . | Koji Noguchi | dhruba borthakur |
| [HADOOP-1482](https://issues.apache.org/jira/browse/HADOOP-1482) | SecondaryNameNode does not roll ports | Major | . | dhruba borthakur | dhruba borthakur |
| [HADOOP-1479](https://issues.apache.org/jira/browse/HADOOP-1479) | NPE in HStore#get if StoreFile only has keys \< than passed key | Minor | . | stack | stack |
| [HADOOP-1475](https://issues.apache.org/jira/browse/HADOOP-1475) | local filecache disappears | Blocker | . | Christian Kunz | Owen O'Malley |
| [HADOOP-1472](https://issues.apache.org/jira/browse/HADOOP-1472) | Timed-out tasks are marked as 'KILLED' rather than as 'FAILED' which means the framework doesn't fail a TIP with 4 or more timed-out attempts | Blocker | . | Arun C Murthy | Arun C Murthy |
| [HADOOP-1461](https://issues.apache.org/jira/browse/HADOOP-1461) | Corner-case deadlock in TaskTracker | Blocker | . | Arun C Murthy | Arun C Murthy |
| [HADOOP-1459](https://issues.apache.org/jira/browse/HADOOP-1459) | FileSystem.getFileCacheHints returns IP addresses rather than hostnames, which breaks 'data-locality' in map-reduce | Blocker | . | Arun C Murthy | dhruba borthakur |
| [HADOOP-1456](https://issues.apache.org/jira/browse/HADOOP-1456) | TestDecommission fails with assertion Number of replicas for block1 expected:\<3\> but was:\<2\> | Major | . | dhruba borthakur | dhruba borthakur |
| [HADOOP-1453](https://issues.apache.org/jira/browse/HADOOP-1453) | exists() not necessary before DFS.open | Major | . | Raghu Angadi | Raghu Angadi |
| [HADOOP-1448](https://issues.apache.org/jira/browse/HADOOP-1448) | Setting the replication factor of a file too high causes namenode cpu overload | Major | . | dhruba borthakur | Hairong Kuang |
| [HADOOP-1446](https://issues.apache.org/jira/browse/HADOOP-1446) | Metrics from the TaskTracker are updated only when map/reduce tasks start/end/fail | Major | . | Devaraj Das | Devaraj Das |
| [HADOOP-1444](https://issues.apache.org/jira/browse/HADOOP-1444) | Block allocation method does not check pendingCreates for duplicate block ids | Major | . | dhruba borthakur | dhruba borthakur |
| [HADOOP-1443](https://issues.apache.org/jira/browse/HADOOP-1443) | TestFileCorruption fails with ArrayIndexOutOfBoundsException | Critical | . | Nigel Daley | Konstantin Shvachko |
| [HADOOP-1442](https://issues.apache.org/jira/browse/HADOOP-1442) | Zero-byte input files are not included in InputSplit | Major | . | Milind Bhandarkar | Senthil Subramanian |
| [HADOOP-1428](https://issues.apache.org/jira/browse/HADOOP-1428) | ChecksumFileSystem : some operations implicitly not supported. | Major | fs | Raghu Angadi | |
| [HADOOP-1414](https://issues.apache.org/jira/browse/HADOOP-1414) | Findbugs - Bad Practice | Major | . | dhruba borthakur | dhruba borthakur |
| [HADOOP-1412](https://issues.apache.org/jira/browse/HADOOP-1412) | FindBugs: Dodgy bugs in fs, filecache, io, and util packages | Major | . | Hairong Kuang | Hairong Kuang |
| [HADOOP-1406](https://issues.apache.org/jira/browse/HADOOP-1406) | Metrics based on Map-Reduce Counters are not cleaned up | Major | . | David Bowen | David Bowen |
| [HADOOP-1400](https://issues.apache.org/jira/browse/HADOOP-1400) | JobClient rpc times out getting job status | Blocker | . | Nigel Daley | Owen O'Malley |
| [HADOOP-1396](https://issues.apache.org/jira/browse/HADOOP-1396) | FileNotFound exception on DFS block | Blocker | . | Devaraj Das | dhruba borthakur |
| [HADOOP-1394](https://issues.apache.org/jira/browse/HADOOP-1394) | FindBugs : Performance : in dfs | Major | . | Raghu Angadi | Raghu Angadi |
| [HADOOP-1393](https://issues.apache.org/jira/browse/HADOOP-1393) | using Math.abs(Random.getInt()) does not guarantee a positive number | Major | . | Owen O'Malley | Owen O'Malley |
| [HADOOP-1392](https://issues.apache.org/jira/browse/HADOOP-1392) | FindBugs : Fix some correctness bugs reported in DFS, FS, etc. | Major | . | Raghu Angadi | Raghu Angadi |
| [HADOOP-1390](https://issues.apache.org/jira/browse/HADOOP-1390) | Inconsistent Synchronization cleanup for {Configuration, TaskLog, MapTask, Server}.java | Minor | conf, ipc | Devaraj Das | Devaraj Das |
| [HADOOP-1387](https://issues.apache.org/jira/browse/HADOOP-1387) | FindBugs -\> Performance | Major | . | Arun C Murthy | Arun C Murthy |
| [HADOOP-1372](https://issues.apache.org/jira/browse/HADOOP-1372) | DFS Clients should start using the org.apache.hadoop.fs.LocalDirAllocator | Major | . | Devaraj Das | dhruba borthakur |
| [HADOOP-1367](https://issues.apache.org/jira/browse/HADOOP-1367) | Inconsistent synchronization of NetworkTopology.distFrom; locked 50% of time | Major | io | Nigel Daley | Hairong Kuang |
| [HADOOP-1364](https://issues.apache.org/jira/browse/HADOOP-1364) | Inconsistent synchronization of SequenceFile$Reader.noBufferedValues; locked 66% of time | Minor | io | Nigel Daley | Owen O'Malley |
| [HADOOP-1359](https://issues.apache.org/jira/browse/HADOOP-1359) | Variable dereferenced then later checked for null | Minor | . | Nigel Daley | Hairong Kuang |
| [HADOOP-1357](https://issues.apache.org/jira/browse/HADOOP-1357) | Call to equals() comparing different types in CopyFiles.cleanup(Configuration, JobConf, String, String) | Minor | util | Nigel Daley | Arun C Murthy |
| [HADOOP-1355](https://issues.apache.org/jira/browse/HADOOP-1355) | Possible null pointer dereference in TaskLogAppender.append(LoggingEvent) | Minor | . | Nigel Daley | Arun C Murthy |
| [HADOOP-1344](https://issues.apache.org/jira/browse/HADOOP-1344) | getJobName not accessible from JobClient | Minor | . | Michael Bieniosek | |
| [HADOOP-1335](https://issues.apache.org/jira/browse/HADOOP-1335) | C++ reducers under hadoop-pipes are not started when there are no key-value pairs to be reduced | Major | . | Christian Kunz | Owen O'Malley |
| [HADOOP-1311](https://issues.apache.org/jira/browse/HADOOP-1311) | Bug in BytesWritable.set(byte[] newData, int offset, int length) | Major | io | Srikanth Kakani | dhruba borthakur |
| [HADOOP-1300](https://issues.apache.org/jira/browse/HADOOP-1300) | deletion of excess replicas does not take into account 'rack-locality' | Major | . | Koji Noguchi | Hairong Kuang |
| [HADOOP-1285](https://issues.apache.org/jira/browse/HADOOP-1285) | ChecksumFileSystem : Can't read when io.file.buffer.size \< bytePerChecksum | Major | fs | Raghu Angadi | |
| [HADOOP-1269](https://issues.apache.org/jira/browse/HADOOP-1269) | DFS Scalability: namenode throughput impacted because of global FSNamesystem lock | Major | . | dhruba borthakur | dhruba borthakur |
| [HADOOP-1261](https://issues.apache.org/jira/browse/HADOOP-1261) | Restart of the same data-node should not generate edits log records. | Minor | . | Konstantin Shvachko | Raghu Angadi |
| [HADOOP-1234](https://issues.apache.org/jira/browse/HADOOP-1234) | map tasks fail because they do not find application in file cache | Major | . | Christian Kunz | Arun C Murthy |
| [HADOOP-1226](https://issues.apache.org/jira/browse/HADOOP-1226) | makeQualified should return an instance of a DfsPath when passed a DfsPath | Major | . | Koji Noguchi | dhruba borthakur |
| [HADOOP-1207](https://issues.apache.org/jira/browse/HADOOP-1207) | hdfs -rm should NOT fail if one of the files to remove is missing | Minor | . | arkady borkovsky | Tsz Wo Nicholas Sze |
| [HADOOP-1197](https://issues.apache.org/jira/browse/HADOOP-1197) | The misleading Configuration.set(String, Object) should be removed | Minor | conf | Owen O'Malley | Owen O'Malley |
| [HADOOP-1193](https://issues.apache.org/jira/browse/HADOOP-1193) | Map/reduce job gets OutOfMemoryException when set map out to be compressed | Blocker | . | Hairong Kuang | Arun C Murthy |
| [HADOOP-1139](https://issues.apache.org/jira/browse/HADOOP-1139) | All block transitions should be logged at log level INFO | Major | . | dhruba borthakur | dhruba borthakur |
| [HADOOP-1124](https://issues.apache.org/jira/browse/HADOOP-1124) | ChecksumFileSystem does not handle ChecksumError correctly | Major | fs | Hairong Kuang | Hairong Kuang |
| [HADOOP-1084](https://issues.apache.org/jira/browse/HADOOP-1084) | updating a hdfs file, doesn't cause the distributed file cache to update itself | Blocker | . | Owen O'Malley | Arun C Murthy |
| [HADOOP-911](https://issues.apache.org/jira/browse/HADOOP-911) | Multithreading issue with libhdfs library | Blocker | . | Christian Kunz | Christian Kunz |
| [HADOOP-690](https://issues.apache.org/jira/browse/HADOOP-690) | NPE in jobcontrol | Major | . | Johan Oskarsson | Owen O'Malley |
| [HADOOP-71](https://issues.apache.org/jira/browse/HADOOP-71) | The SequenceFileRecordReader uses the default FileSystem rather than the supplied one | Major | . | Owen O'Malley | Owen O'Malley |
### TESTS:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
| [HADOOP-1629](https://issues.apache.org/jira/browse/HADOOP-1629) | Block CRC Unit Tests: upgrade test | Blocker | . | Nigel Daley | Raghu Angadi |
| [HADOOP-1628](https://issues.apache.org/jira/browse/HADOOP-1628) | Block CRC Unit Tests: protocol tests | Blocker | . | Nigel Daley | Raghu Angadi |
### SUB-TASKS:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
### OTHER:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
| [HADOOP-1449](https://issues.apache.org/jira/browse/HADOOP-1449) | Example for contrib/data\_join | Minor | . | Senthil Subramanian | Senthil Subramanian |
| [HADOOP-1336](https://issues.apache.org/jira/browse/HADOOP-1336) | turn on speculative execution by default | Major | . | Owen O'Malley | Owen O'Malley |

@ -0,0 +1,24 @@
<!---
# Licensed to the Apache Software Foundation (ASF) under one
# or more contributor license agreements. See the NOTICE file
# distributed with this work for additional information
# regarding copyright ownership. The ASF licenses this file
# to you under the Apache License, Version 2.0 (the
# "License"); you may not use this file except in compliance
# with the License. You may obtain a copy of the License at
#
# http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing, software
# distributed under the License is distributed on an "AS IS" BASIS,
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# limitations under the License.
-->
# Apache Hadoop 0.14.0 Release Notes
These release notes cover new developer and user-facing incompatibilities, features, and major improvements.

@ -0,0 +1,67 @@
<!---
# Licensed to the Apache Software Foundation (ASF) under one
# or more contributor license agreements. See the NOTICE file
# distributed with this work for additional information
# regarding copyright ownership. The ASF licenses this file
# to you under the Apache License, Version 2.0 (the
# "License"); you may not use this file except in compliance
# with the License. You may obtain a copy of the License at
#
# http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing, software
# distributed under the License is distributed on an "AS IS" BASIS,
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# limitations under the License.
-->
# Apache Hadoop Changelog
## Release 0.14.1 - 2007-09-03
### INCOMPATIBLE CHANGES:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
### NEW FEATURES:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
### IMPROVEMENTS:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
### BUG FIXES:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
| [HADOOP-1798](https://issues.apache.org/jira/browse/HADOOP-1798) | The counts of currently running maps and reduces isn't maintained correctly when task trackers fail | Major | . | Owen O'Malley | Owen O'Malley |
| [HADOOP-1790](https://issues.apache.org/jira/browse/HADOOP-1790) | hadoop seems not to support multi-homed installations | Blocker | . | Torsten Curdt | Doug Cutting |
| [HADOOP-1740](https://issues.apache.org/jira/browse/HADOOP-1740) | Certain Pipes tasks fail, after exiting the C++ application | Blocker | . | Srikanth Kakani | Devaraj Das |
### TESTS:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
### SUB-TASKS:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
### OTHER:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |

@ -0,0 +1,24 @@
<!---
# Licensed to the Apache Software Foundation (ASF) under one
# or more contributor license agreements. See the NOTICE file
# distributed with this work for additional information
# regarding copyright ownership. The ASF licenses this file
# to you under the Apache License, Version 2.0 (the
# "License"); you may not use this file except in compliance
# with the License. You may obtain a copy of the License at
#
# http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing, software
# distributed under the License is distributed on an "AS IS" BASIS,
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# limitations under the License.
-->
# Apache Hadoop 0.14.1 Release Notes
These release notes cover new developer and user-facing incompatibilities, features, and major improvements.

@ -0,0 +1,74 @@
<!---
# Licensed to the Apache Software Foundation (ASF) under one
# or more contributor license agreements. See the NOTICE file
# distributed with this work for additional information
# regarding copyright ownership. The ASF licenses this file
# to you under the Apache License, Version 2.0 (the
# "License"); you may not use this file except in compliance
# with the License. You may obtain a copy of the License at
#
# http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing, software
# distributed under the License is distributed on an "AS IS" BASIS,
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# limitations under the License.
-->
# Apache Hadoop Changelog
## Release 0.14.2 - 2007-10-09
### INCOMPATIBLE CHANGES:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
### NEW FEATURES:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
### IMPROVEMENTS:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
### BUG FIXES:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
| [HADOOP-1997](https://issues.apache.org/jira/browse/HADOOP-1997) | TestCheckpoint fails on Windows | Blocker | . | Konstantin Shvachko | Konstantin Shvachko |
| [HADOOP-1978](https://issues.apache.org/jira/browse/HADOOP-1978) | Name-node should remove edits.new during startup rather than renaming it to edits. | Blocker | . | Konstantin Shvachko | Konstantin Shvachko |
| [HADOOP-1977](https://issues.apache.org/jira/browse/HADOOP-1977) | hadoop job -kill , -status causes NullPointerException | Blocker | . | Lohit Vijayarenu | Enis Soztutar |
| [HADOOP-1972](https://issues.apache.org/jira/browse/HADOOP-1972) | LzoCompressor.finished doesn't check whether the user actually wants to finish compressing | Blocker | . | Arun C Murthy | Arun C Murthy |
| [HADOOP-1970](https://issues.apache.org/jira/browse/HADOOP-1970) | tasktracker hang in reduce. Deadlock between main and comm thread | Blocker | . | Koji Noguchi | Vivek Ratan |
| [HADOOP-1961](https://issues.apache.org/jira/browse/HADOOP-1961) | -get, -copyToLocal fail when single filename is passed | Blocker | . | Koji Noguchi | Raghu Angadi |
| [HADOOP-1955](https://issues.apache.org/jira/browse/HADOOP-1955) | Corrupted block replication retries for ever | Blocker | . | Koji Noguchi | Raghu Angadi |
| [HADOOP-1948](https://issues.apache.org/jira/browse/HADOOP-1948) | Spurious error message during block crc upgrade. | Blocker | . | Raghu Angadi | Raghu Angadi |
| [HADOOP-1862](https://issues.apache.org/jira/browse/HADOOP-1862) | reduces are getting stuck trying to find map outputs | Blocker | . | Owen O'Malley | Arun C Murthy |
| [HADOOP-1783](https://issues.apache.org/jira/browse/HADOOP-1783) | keyToPath in Jets3tFileSystemStore needs to return absolute path | Major | fs/s3 | Ahad Rana | Tom White |
### TESTS:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
### SUB-TASKS:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
### OTHER:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |

@ -0,0 +1,24 @@
<!---
# Licensed to the Apache Software Foundation (ASF) under one
# or more contributor license agreements. See the NOTICE file
# distributed with this work for additional information
# regarding copyright ownership. The ASF licenses this file
# to you under the Apache License, Version 2.0 (the
# "License"); you may not use this file except in compliance
# with the License. You may obtain a copy of the License at
#
# http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing, software
# distributed under the License is distributed on an "AS IS" BASIS,
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# limitations under the License.
-->
# Apache Hadoop 0.14.2 Release Notes
These release notes cover new developer and user-facing incompatibilities, features, and major improvements.

@ -0,0 +1,68 @@
<!---
# Licensed to the Apache Software Foundation (ASF) under one
# or more contributor license agreements. See the NOTICE file
# distributed with this work for additional information
# regarding copyright ownership. The ASF licenses this file
# to you under the Apache License, Version 2.0 (the
# "License"); you may not use this file except in compliance
# with the License. You may obtain a copy of the License at
#
# http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing, software
# distributed under the License is distributed on an "AS IS" BASIS,
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# limitations under the License.
-->
# Apache Hadoop Changelog
## Release 0.14.3 - 2007-10-19
### INCOMPATIBLE CHANGES:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
### NEW FEATURES:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
### IMPROVEMENTS:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
### BUG FIXES:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
| [HADOOP-2072](https://issues.apache.org/jira/browse/HADOOP-2072) | RawLocalFileStatus is causing Path problems | Major | fs | Dennis Kubes | |
| [HADOOP-2053](https://issues.apache.org/jira/browse/HADOOP-2053) | OutOfMemoryError : Java heap space errors in hadoop 0.14 | Blocker | . | Lohit Vijayarenu | Arun C Murthy |
| [HADOOP-2043](https://issues.apache.org/jira/browse/HADOOP-2043) | 0.14.2 release compiled with Java 1.6 instead of Java 1.5 | Blocker | build | Doug Cutting | Doug Cutting |
| [HADOOP-2036](https://issues.apache.org/jira/browse/HADOOP-2036) | NPE in JvmMetrics.doThreadUpdates | Blocker | metrics | Koji Noguchi | Nigel Daley |
### TESTS:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
### SUB-TASKS:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
### OTHER:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |

@ -0,0 +1,24 @@
<!---
# Licensed to the Apache Software Foundation (ASF) under one
# or more contributor license agreements. See the NOTICE file
# distributed with this work for additional information
# regarding copyright ownership. The ASF licenses this file
# to you under the Apache License, Version 2.0 (the
# "License"); you may not use this file except in compliance
# with the License. You may obtain a copy of the License at
#
# http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing, software
# distributed under the License is distributed on an "AS IS" BASIS,
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# limitations under the License.
-->
# Apache Hadoop 0.14.3 Release Notes
These release notes cover new developer and user-facing incompatibilities, features, and major improvements.

@ -0,0 +1,67 @@
<!---
# Licensed to the Apache Software Foundation (ASF) under one
# or more contributor license agreements. See the NOTICE file
# distributed with this work for additional information
# regarding copyright ownership. The ASF licenses this file
# to you under the Apache License, Version 2.0 (the
# "License"); you may not use this file except in compliance
# with the License. You may obtain a copy of the License at
#
# http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing, software
# distributed under the License is distributed on an "AS IS" BASIS,
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# limitations under the License.
-->
# Apache Hadoop Changelog
## Release 0.14.4 - 2007-11-26
### INCOMPATIBLE CHANGES:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
### NEW FEATURES:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
### IMPROVEMENTS:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
### BUG FIXES:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
| [HADOOP-2169](https://issues.apache.org/jira/browse/HADOOP-2169) | libhdfs makefile wrongly sets up DT\_SONAME field of libhdfs.so | Critical | . | Arun C Murthy | Arun C Murthy |
| [HADOOP-2001](https://issues.apache.org/jira/browse/HADOOP-2001) | Deadlock in jobtracker | Blocker | . | Michael Bieniosek | Arun C Murthy |
### TESTS:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
### SUB-TASKS:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
### OTHER:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
| [HADOOP-2140](https://issues.apache.org/jira/browse/HADOOP-2140) | C and C++ files are missing Apache license header | Blocker | . | Owen O'Malley | Owen O'Malley |

@ -0,0 +1,24 @@
<!---
# Licensed to the Apache Software Foundation (ASF) under one
# or more contributor license agreements. See the NOTICE file
# distributed with this work for additional information
# regarding copyright ownership. The ASF licenses this file
# to you under the Apache License, Version 2.0 (the
# "License"); you may not use this file except in compliance
# with the License. You may obtain a copy of the License at
#
# http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing, software
# distributed under the License is distributed on an "AS IS" BASIS,
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# limitations under the License.
-->
# Apache Hadoop 0.14.4 Release Notes
These release notes cover new developer and user-facing incompatibilities, features, and major improvements.

@ -0,0 +1,206 @@
<!---
# Licensed to the Apache Software Foundation (ASF) under one
# or more contributor license agreements. See the NOTICE file
# distributed with this work for additional information
# regarding copyright ownership. The ASF licenses this file
# to you under the Apache License, Version 2.0 (the
# "License"); you may not use this file except in compliance
# with the License. You may obtain a copy of the License at
#
# http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing, software
# distributed under the License is distributed on an "AS IS" BASIS,
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# limitations under the License.
-->
# Apache Hadoop Changelog
## Release 0.15.0 - 2007-10-19
### INCOMPATIBLE CHANGES:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
### NEW FEATURES:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
| [HADOOP-1963](https://issues.apache.org/jira/browse/HADOOP-1963) | Code contribution of Kosmos Filesystem implementation of Hadoop Filesystem interface | Major | fs | Sriram Rao | Sriram Rao |
| [HADOOP-1914](https://issues.apache.org/jira/browse/HADOOP-1914) | HDFS should have a NamenodeProtocol to allow secondary namenodes and rebalancing processes to communicate with a primary namenode | Major | . | Hairong Kuang | Hairong Kuang |
| [HADOOP-1894](https://issues.apache.org/jira/browse/HADOOP-1894) | Add fancy graphs for mapred task statuses | Major | . | Enis Soztutar | Enis Soztutar |
| [HADOOP-1880](https://issues.apache.org/jira/browse/HADOOP-1880) | SleepJob | Major | . | Enis Soztutar | Enis Soztutar |
| [HADOOP-1851](https://issues.apache.org/jira/browse/HADOOP-1851) | Map output compression codec cannot be set independently of job output compression codec | Major | . | Riccardo Boscolo | Arun C Murthy |
| [HADOOP-1822](https://issues.apache.org/jira/browse/HADOOP-1822) | Allow SOCKS proxy configuration to remotely access the DFS and submit Jobs | Minor | ipc | Christophe Taton | Christophe Taton |
| [HADOOP-1809](https://issues.apache.org/jira/browse/HADOOP-1809) | Add link to irc channel #hadoop | Major | . | Enis Soztutar | Enis Soztutar |
| [HADOOP-1727](https://issues.apache.org/jira/browse/HADOOP-1727) | Make ...hbase.io.MapWritable more generic so that it can be included in ...hadoop.io | Minor | io | Jim Kellerman | Jim Kellerman |
| [HADOOP-1351](https://issues.apache.org/jira/browse/HADOOP-1351) | Want to kill a particular task or attempt | Major | . | Owen O'Malley | Enis Soztutar |
| [HADOOP-789](https://issues.apache.org/jira/browse/HADOOP-789) | DFS shell should return a list of nodes for a file saying that where the blocks for these files are located. | Minor | . | Mahadev konar | Mahadev konar |
### IMPROVEMENTS:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
| [HADOOP-2046](https://issues.apache.org/jira/browse/HADOOP-2046) | Documentation: improve mapred javadocs | Blocker | documentation | Arun C Murthy | Arun C Murthy |
| [HADOOP-1971](https://issues.apache.org/jira/browse/HADOOP-1971) | Constructing a JobConf without a class leads to a very misleading error message. | Minor | . | Ted Dunning | Enis Soztutar |
| [HADOOP-1968](https://issues.apache.org/jira/browse/HADOOP-1968) | Wildcard input syntax (glob) should support {} | Major | fs | eric baldeschwieler | Hairong Kuang |
| [HADOOP-1942](https://issues.apache.org/jira/browse/HADOOP-1942) | Increase the concurrency of transaction logging to edits log | Blocker | . | dhruba borthakur | dhruba borthakur |
| [HADOOP-1933](https://issues.apache.org/jira/browse/HADOOP-1933) | Consider include/exclude files while listing datanodes. | Major | . | Raghu Angadi | Raghu Angadi |
| [HADOOP-1926](https://issues.apache.org/jira/browse/HADOOP-1926) | Design/implement a set of compression benchmarks for the map-reduce framework | Major | . | Arun C Murthy | Arun C Murthy |
| [HADOOP-1921](https://issues.apache.org/jira/browse/HADOOP-1921) | Save the configuration of completed/failed jobs and make them available via the web-ui. | Major | . | Arun C Murthy | Amar Kamat |
| [HADOOP-1908](https://issues.apache.org/jira/browse/HADOOP-1908) | Restructure data node code so that block sending/receiving is separated from data transfer header handling | Major | . | Hairong Kuang | Hairong Kuang |
| [HADOOP-1906](https://issues.apache.org/jira/browse/HADOOP-1906) | JobConf should warn about the existence of obsolete mapred-default.xml. | Major | conf | Owen O'Malley | Arun C Murthy |
| [HADOOP-1881](https://issues.apache.org/jira/browse/HADOOP-1881) | Update documentation for hadoop's configuration post HADOOP-785 | Major | documentation | Arun C Murthy | Arun C Murthy |
| [HADOOP-1878](https://issues.apache.org/jira/browse/HADOOP-1878) | Change priority feature in the job details JSP page misses spaces between each priority link | Trivial | . | Thomas Friol | Thomas Friol |
| [HADOOP-1803](https://issues.apache.org/jira/browse/HADOOP-1803) | Generalize making contrib bin content executable in ant package target | Minor | build | stack | stack |
| [HADOOP-1779](https://issues.apache.org/jira/browse/HADOOP-1779) | Small INodeDirectory enhancement to get all existing INodes components on a path | Trivial | . | Christophe Taton | Christophe Taton |
| [HADOOP-1777](https://issues.apache.org/jira/browse/HADOOP-1777) | Typo issue in the job details JSP page | Trivial | . | Thomas Friol | Thomas Friol |
| [HADOOP-1774](https://issues.apache.org/jira/browse/HADOOP-1774) | Remove use of INode.parent in Block CRC upgrade | Major | . | Raghu Angadi | Raghu Angadi |
| [HADOOP-1767](https://issues.apache.org/jira/browse/HADOOP-1767) | JobClient CLI cleanup and improvement | Minor | . | Christophe Taton | Christophe Taton |
| [HADOOP-1766](https://issues.apache.org/jira/browse/HADOOP-1766) | Merging Block and BlockInfo classes on name-node. | Major | . | Konstantin Shvachko | Konstantin Shvachko |
| [HADOOP-1762](https://issues.apache.org/jira/browse/HADOOP-1762) | Namenode does not need to store storageID and datanodeID persistently | Major | . | Raghu Angadi | Raghu Angadi |
| [HADOOP-1759](https://issues.apache.org/jira/browse/HADOOP-1759) | File name should be represented by a byte array instead of a String | Major | . | Konstantin Shvachko | Konstantin Shvachko |
| [HADOOP-1756](https://issues.apache.org/jira/browse/HADOOP-1756) | Add toString() methods to some Writable types | Major | io | Andrzej Bialecki | Andrzej Bialecki |
| [HADOOP-1750](https://issues.apache.org/jira/browse/HADOOP-1750) | We should log better if something goes wrong with the process fork | Major | . | Owen O'Malley | Owen O'Malley |
| [HADOOP-1744](https://issues.apache.org/jira/browse/HADOOP-1744) | Small cleanup of DistributedFileSystem and DFSClient (next) | Trivial | . | Christophe Taton | Christophe Taton |
| [HADOOP-1743](https://issues.apache.org/jira/browse/HADOOP-1743) | INode refactoring | Major | . | Konstantin Shvachko | Konstantin Shvachko |
| [HADOOP-1731](https://issues.apache.org/jira/browse/HADOOP-1731) | contrib jar file names should include hadoop version number | Major | . | Doug Cutting | Doug Cutting |
| [HADOOP-1718](https://issues.apache.org/jira/browse/HADOOP-1718) | Test coverage target in build files using clover | Major | build | Nigel Daley | Nigel Daley |
| [HADOOP-1703](https://issues.apache.org/jira/browse/HADOOP-1703) | Small cleanup of DistributedFileSystem and DFSClient | Trivial | . | Christophe Taton | Christophe Taton |
| [HADOOP-1693](https://issues.apache.org/jira/browse/HADOOP-1693) | Remove LOG members from PendingReplicationBlocks and ReplicationTargetChooser. | Major | . | Konstantin Shvachko | Konstantin Shvachko |
| [HADOOP-1687](https://issues.apache.org/jira/browse/HADOOP-1687) | Name-node memory size estimates and optimization proposal. | Major | . | Konstantin Shvachko | Konstantin Shvachko |
| [HADOOP-1667](https://issues.apache.org/jira/browse/HADOOP-1667) | organize CHANGES.txt messages into sections for future releases | Major | documentation | Doug Cutting | Doug Cutting |
| [HADOOP-1654](https://issues.apache.org/jira/browse/HADOOP-1654) | IOUtils class | Major | io | Enis Soztutar | Enis Soztutar |
| [HADOOP-1626](https://issues.apache.org/jira/browse/HADOOP-1626) | DFSAdmin. Help messages are missing for -finalizeUpgrade and -metasave. | Blocker | . | Konstantin Shvachko | Lohit Vijayarenu |
| [HADOOP-1621](https://issues.apache.org/jira/browse/HADOOP-1621) | Make FileStatus a concrete class | Major | fs | Chris Douglas | Chris Douglas |
| [HADOOP-1610](https://issues.apache.org/jira/browse/HADOOP-1610) | Add metrics for failed tasks | Major | . | Devaraj Das | Devaraj Das |
| [HADOOP-1595](https://issues.apache.org/jira/browse/HADOOP-1595) | Add an option to setReplication method to wait for completion of replication | Major | . | Christian Kunz | Tsz Wo Nicholas Sze |
| [HADOOP-1592](https://issues.apache.org/jira/browse/HADOOP-1592) | Print the diagnostic error messages for FAILED task-attempts to the user console via TaskCompletionEvents | Major | . | Arun C Murthy | Amar Kamat |
| [HADOOP-1500](https://issues.apache.org/jira/browse/HADOOP-1500) | typos in dfs webui | Trivial | . | Nigel Daley | Nigel Daley |
| [HADOOP-1436](https://issues.apache.org/jira/browse/HADOOP-1436) | Redesign Tool and ToolBase API and related functionality | Major | util | Enis Soztutar | Enis Soztutar |
| [HADOOP-1425](https://issues.apache.org/jira/browse/HADOOP-1425) | Rework the various programs in 'examples' to extend ToolBase | Minor | . | Arun C Murthy | Enis Soztutar |
| [HADOOP-1266](https://issues.apache.org/jira/browse/HADOOP-1266) | Remove DatanodeDescriptor dependency from NetworkTopology | Major | . | Konstantin Shvachko | Hairong Kuang |
| [HADOOP-1231](https://issues.apache.org/jira/browse/HADOOP-1231) | Add generics to Mapper and Reducer interfaces | Major | . | Owen O'Malley | Tom White |
| [HADOOP-1158](https://issues.apache.org/jira/browse/HADOOP-1158) | JobTracker should collect statistics of failed map output fetches, and take decisions to reexecute map tasks and/or restart the (possibly faulty) Jetty server on the TaskTracker | Major | . | Devaraj Das | Arun C Murthy |
| [HADOOP-785](https://issues.apache.org/jira/browse/HADOOP-785) | Divide the server and client configurations | Major | conf | Owen O'Malley | Arun C Murthy |
### BUG FIXES:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
| [HADOOP-2103](https://issues.apache.org/jira/browse/HADOOP-2103) | HADOOP-2046 caused some javadoc anomalies | Major | documentation | Nigel Daley | Nigel Daley |
| [HADOOP-2102](https://issues.apache.org/jira/browse/HADOOP-2102) | ToolBase doesn't keep configuration | Blocker | util | Dennis Kubes | Dennis Kubes |
| [HADOOP-2080](https://issues.apache.org/jira/browse/HADOOP-2080) | ChecksumFileSystem checksum file size incorrect. | Blocker | fs | Richard Lee | Owen O'Malley |
| [HADOOP-2073](https://issues.apache.org/jira/browse/HADOOP-2073) | Datanode corruption if machine dies while writing VERSION file | Blocker | . | Michael Bieniosek | Konstantin Shvachko |
| [HADOOP-2072](https://issues.apache.org/jira/browse/HADOOP-2072) | RawLocalFileStatus is causing Path problems | Major | fs | Dennis Kubes | |
| [HADOOP-2070](https://issues.apache.org/jira/browse/HADOOP-2070) | Test org.apache.hadoop.mapred.pipes.TestPipes.unknown failed | Blocker | . | Mukund Madhugiri | Owen O'Malley |
| [HADOOP-2051](https://issues.apache.org/jira/browse/HADOOP-2051) | JobTracker's TaskCommitQueue is vulnerable to non-IOExceptions | Blocker | . | Arun C Murthy | Arun C Murthy |
| [HADOOP-2048](https://issues.apache.org/jira/browse/HADOOP-2048) | DISTCP mapper should report progress more often | Blocker | . | Runping Qi | Chris Douglas |
| [HADOOP-2044](https://issues.apache.org/jira/browse/HADOOP-2044) | Namenode encounters ClassCastException exceptions for INodeFileUnderConstruction | Blocker | . | dhruba borthakur | dhruba borthakur |
| [HADOOP-2033](https://issues.apache.org/jira/browse/HADOOP-2033) | In SequenceFile sync doesn't work unless the file is compressed (block or record) | Blocker | . | Owen O'Malley | Owen O'Malley |
| [HADOOP-2031](https://issues.apache.org/jira/browse/HADOOP-2031) | Lost tasktracker not handled properly leading to tips wrongly being kept as completed, and hence not rescheduled | Blocker | . | Devaraj Das | Devaraj Das |
| [HADOOP-2028](https://issues.apache.org/jira/browse/HADOOP-2028) | distcp fails if log dir not specified and destination not present | Blocker | util | Chris Douglas | Chris Douglas |
| [HADOOP-2026](https://issues.apache.org/jira/browse/HADOOP-2026) | Namenode prints out too many log lines for "Number of transactions" | Blocker | . | dhruba borthakur | dhruba borthakur |
| [HADOOP-2023](https://issues.apache.org/jira/browse/HADOOP-2023) | TestLocalDirAllocator fails on Windows | Blocker | fs | Mukund Madhugiri | Hairong Kuang |
| [HADOOP-2022](https://issues.apache.org/jira/browse/HADOOP-2022) | Task times are not saved correctly (bug in hadoop-1874) | Blocker | . | Devaraj Das | Amar Kamat |
| [HADOOP-2018](https://issues.apache.org/jira/browse/HADOOP-2018) | Broken pipe SocketException in DataNode$DataXceiver | Blocker | . | Konstantin Shvachko | Hairong Kuang |
| [HADOOP-2016](https://issues.apache.org/jira/browse/HADOOP-2016) | Race condition in removing a KILLED task from tasktracker | Blocker | . | Devaraj Das | Arun C Murthy |
| [HADOOP-1997](https://issues.apache.org/jira/browse/HADOOP-1997) | TestCheckpoint fails on Windows | Blocker | . | Konstantin Shvachko | Konstantin Shvachko |
| [HADOOP-1992](https://issues.apache.org/jira/browse/HADOOP-1992) | Sort validation is taking considerably longer than before | Blocker | . | Mukund Madhugiri | Arun C Murthy |
| [HADOOP-1983](https://issues.apache.org/jira/browse/HADOOP-1983) | jobs using pipes interface with tasks not using java output format have a good chance of not updating progress and timing out | Major | . | Christian Kunz | Owen O'Malley |
| [HADOOP-1978](https://issues.apache.org/jira/browse/HADOOP-1978) | Name-node should remove edits.new during startup rather than renaming it to edits. | Blocker | . | Konstantin Shvachko | Konstantin Shvachko |
| [HADOOP-1973](https://issues.apache.org/jira/browse/HADOOP-1973) | NPE at JobTracker startup.. | Blocker | . | Gautam Kowshik | Amareshwari Sriramadasu |
| [HADOOP-1961](https://issues.apache.org/jira/browse/HADOOP-1961) | -get, -copyToLocal fail when single filename is passed | Blocker | . | Koji Noguchi | Raghu Angadi |
| [HADOOP-1959](https://issues.apache.org/jira/browse/HADOOP-1959) | Use of File.separator in StatusHttpServer prevents running Junit tests inside eclipse on Windows | Minor | . | Jim Kellerman | Jim Kellerman |
| [HADOOP-1955](https://issues.apache.org/jira/browse/HADOOP-1955) | Corrupted block replication retries for ever | Blocker | . | Koji Noguchi | Raghu Angadi |
| [HADOOP-1953](https://issues.apache.org/jira/browse/HADOOP-1953) | the job tracker should wait between calls to try and delete the system directory | Blocker | . | Owen O'Malley | Owen O'Malley |
| [HADOOP-1948](https://issues.apache.org/jira/browse/HADOOP-1948) | Spurious error message during block crc upgrade. | Blocker | . | Raghu Angadi | Raghu Angadi |
| [HADOOP-1946](https://issues.apache.org/jira/browse/HADOOP-1946) | du should not be called on every heartbeat | Blocker | . | Konstantin Shvachko | Hairong Kuang |
| [HADOOP-1944](https://issues.apache.org/jira/browse/HADOOP-1944) | Maps which ran on trackers declared 'lost' are being marked as FAILED rather than KILLED | Blocker | . | Arun C Murthy | Devaraj Das |
| [HADOOP-1940](https://issues.apache.org/jira/browse/HADOOP-1940) | TestDFSUpgradeFromImage doesn't shut down its MiniDFSCluster | Major | test | Chris Douglas | Chris Douglas |
| [HADOOP-1935](https://issues.apache.org/jira/browse/HADOOP-1935) | NullPointerException in internalReleaseCreate | Blocker | . | Konstantin Shvachko | dhruba borthakur |
| [HADOOP-1934](https://issues.apache.org/jira/browse/HADOOP-1934) | the os.name string on Mac OS contains spaces, which causes the c++ compilation to fail | Major | . | Owen O'Malley | Owen O'Malley |
| [HADOOP-1932](https://issues.apache.org/jira/browse/HADOOP-1932) | Test dfs.TestFileCreation.testFileCreation failed on Windows | Blocker | test | Mukund Madhugiri | dhruba borthakur |
| [HADOOP-1930](https://issues.apache.org/jira/browse/HADOOP-1930) | Too many fetch-failures issue | Blocker | . | Christian Kunz | Arun C Murthy |
| [HADOOP-1925](https://issues.apache.org/jira/browse/HADOOP-1925) | Hadoop Pipes doesn't compile on solaris | Major | . | Owen O'Malley | Owen O'Malley |
| [HADOOP-1910](https://issues.apache.org/jira/browse/HADOOP-1910) | Extra checks in DFS.create() are not necessary. | Minor | . | Raghu Angadi | Raghu Angadi |
| [HADOOP-1907](https://issues.apache.org/jira/browse/HADOOP-1907) | JobClient.runJob kills the job for failed tasks with no diagnostics | Major | . | Christian Kunz | Christian Kunz |
| [HADOOP-1904](https://issues.apache.org/jira/browse/HADOOP-1904) | ArrayIndexOutOfBoundException in BlocksMap | Blocker | . | Konstantin Shvachko | Konstantin Shvachko |
| [HADOOP-1897](https://issues.apache.org/jira/browse/HADOOP-1897) | about.html page is there but not linked. | Major | . | Enis Soztutar | Enis Soztutar |
| [HADOOP-1892](https://issues.apache.org/jira/browse/HADOOP-1892) | In the Job UI, some links don't work | Major | . | Devaraj Das | Amar Kamat |
| [HADOOP-1890](https://issues.apache.org/jira/browse/HADOOP-1890) | Revert a debug patch. | Trivial | . | Raghu Angadi | Raghu Angadi |
| [HADOOP-1889](https://issues.apache.org/jira/browse/HADOOP-1889) | Fix path in EC2 scripts for building your own AMI | Major | contrib/cloud | Tom White | Tom White |
| [HADOOP-1887](https://issues.apache.org/jira/browse/HADOOP-1887) | ArrayIndexOutOfBoundsException with trunk | Major | . | Raghu Angadi | dhruba borthakur |
| [HADOOP-1885](https://issues.apache.org/jira/browse/HADOOP-1885) | Race condition in MiniDFSCluster shutdown | Major | test | Chris Douglas | Chris Douglas |
| [HADOOP-1882](https://issues.apache.org/jira/browse/HADOOP-1882) | Remove extra '\*'s from FsShell.limitDecimal() | Minor | . | Raghu Angadi | Raghu Angadi |
| [HADOOP-1875](https://issues.apache.org/jira/browse/HADOOP-1875) | multiple dfs.client.buffer.dir directories are not treated as alternatives | Blocker | fs | Christian Kunz | Hairong Kuang |
| [HADOOP-1874](https://issues.apache.org/jira/browse/HADOOP-1874) | lost task trackers -- jobs hang | Blocker | . | Christian Kunz | Devaraj Das |
| [HADOOP-1846](https://issues.apache.org/jira/browse/HADOOP-1846) | DatanodeReport should distinguish live datanodes from dead datanodes | Major | . | Hairong Kuang | Hairong Kuang |
| [HADOOP-1840](https://issues.apache.org/jira/browse/HADOOP-1840) | Task's diagnostic messages are lost sometimes | Critical | . | Arun C Murthy | Arun C Murthy |
| [HADOOP-1838](https://issues.apache.org/jira/browse/HADOOP-1838) | Files created with a pre-0.15 version get blocksize as zero, causing performance degradation | Blocker | . | dhruba borthakur | dhruba borthakur |
| [HADOOP-1832](https://issues.apache.org/jira/browse/HADOOP-1832) | listTables() returns duplicate tables | Major | . | Andrew Hitchcock | Jim Kellerman |
| [HADOOP-1825](https://issues.apache.org/jira/browse/HADOOP-1825) | hadoop-daemon.sh script fails if HADOOP\_PID\_DIR doesn't exist | Minor | scripts | Michael Bieniosek | Michael Bieniosek |
| [HADOOP-1819](https://issues.apache.org/jira/browse/HADOOP-1819) | The JobTracker should ensure that it is running on the right host. | Major | . | Owen O'Malley | Owen O'Malley |
| [HADOOP-1818](https://issues.apache.org/jira/browse/HADOOP-1818) | MultiFileInputFormat returns "empty" MultiFileSplit when number of paths \< number of splits | Major | . | Thomas Friol | Thomas Friol |
| [HADOOP-1817](https://issues.apache.org/jira/browse/HADOOP-1817) | MultiFileSplit does not write and read the total length | Major | . | Thomas Friol | Thomas Friol |
| [HADOOP-1812](https://issues.apache.org/jira/browse/HADOOP-1812) | TestIPC and TestRPC should use dynamically allocated ports | Major | ipc | Doug Cutting | Doug Cutting |
| [HADOOP-1810](https://issues.apache.org/jira/browse/HADOOP-1810) | Incorrect Value type in MRBench (SmallJobs) | Blocker | . | Devaraj Das | Devaraj Das |
| [HADOOP-1806](https://issues.apache.org/jira/browse/HADOOP-1806) | DfsTask no longer compiles | Major | build | Chris Douglas | Chris Douglas |
| [HADOOP-1795](https://issues.apache.org/jira/browse/HADOOP-1795) | Task.moveTaskOutputs is escaping special characters in output filenames | Critical | . | Frédéric Bertin | Frédéric Bertin |
| [HADOOP-1792](https://issues.apache.org/jira/browse/HADOOP-1792) | df command doesn't exist under windows | Major | fs | Benjamin Francisoud | Mahadev konar |
| [HADOOP-1788](https://issues.apache.org/jira/browse/HADOOP-1788) | Increase the buffer size of pipes from 1k to 128k | Blocker | . | Owen O'Malley | Amareshwari Sriramadasu |
| [HADOOP-1783](https://issues.apache.org/jira/browse/HADOOP-1783) | keyToPath in Jets3tFileSystemStore needs to return absolute path | Major | fs/s3 | Ahad Rana | Tom White |
| [HADOOP-1775](https://issues.apache.org/jira/browse/HADOOP-1775) | MapWritable and SortedMapWritable - Writable problems | Major | io | Jim Kellerman | Jim Kellerman |
| [HADOOP-1772](https://issues.apache.org/jira/browse/HADOOP-1772) | Hadoop does not run in Cygwin in Windows | Critical | scripts | Tsz Wo Nicholas Sze | Tsz Wo Nicholas Sze |
| [HADOOP-1771](https://issues.apache.org/jira/browse/HADOOP-1771) | streaming hang when IOException in MROutputThread. (NPE) | Blocker | . | Koji Noguchi | Lohit Vijayarenu |
| [HADOOP-1758](https://issues.apache.org/jira/browse/HADOOP-1758) | processing escapes in a jute record is quadratic | Blocker | record | Dick King | Vivek Ratan |
| [HADOOP-1749](https://issues.apache.org/jira/browse/HADOOP-1749) | TestDFSUpgrade sometimes fails with an assert | Major | . | Raghu Angadi | Enis Soztutar |
| [HADOOP-1748](https://issues.apache.org/jira/browse/HADOOP-1748) | Task Trackers fail to launch tasks when they have relative log directories configured | Major | . | Owen O'Malley | Owen O'Malley |
| [HADOOP-1739](https://issues.apache.org/jira/browse/HADOOP-1739) | ConnectException in TaskTracker Child | Major | . | Srikanth Kakani | Doug Cutting |
| [HADOOP-1708](https://issues.apache.org/jira/browse/HADOOP-1708) | make files visible in the namespace as soon as they are created | Major | . | dhruba borthakur | dhruba borthakur |
| [HADOOP-1695](https://issues.apache.org/jira/browse/HADOOP-1695) | Secondary Namenode halt when SocketTimeoutException at startup | Blocker | . | Koji Noguchi | dhruba borthakur |
| [HADOOP-1692](https://issues.apache.org/jira/browse/HADOOP-1692) | DfsTask cache interferes with operation | Minor | util | Chris Douglas | Chris Douglas |
| [HADOOP-1689](https://issues.apache.org/jira/browse/HADOOP-1689) | .sh scripts do not work on Solaris | Minor | scripts | David Biesack | Doug Cutting |
| [HADOOP-1656](https://issues.apache.org/jira/browse/HADOOP-1656) | HDFS does not record the blocksize for a file | Major | . | Sameer Paranjpye | dhruba borthakur |
| [HADOOP-1651](https://issues.apache.org/jira/browse/HADOOP-1651) | Some improvements in progress reporting | Major | . | Devaraj Das | Devaraj Das |
| [HADOOP-1636](https://issues.apache.org/jira/browse/HADOOP-1636) | constant should be user-configurable: MAX\_COMPLETE\_USER\_JOBS\_IN\_MEMORY | Major | . | Michael Bieniosek | Michael Bieniosek |
| [HADOOP-1601](https://issues.apache.org/jira/browse/HADOOP-1601) | GenericWritable should use ReflectionUtils.newInstance to avoid problems with classloaders | Major | io | Owen O'Malley | Enis Soztutar |
| [HADOOP-1573](https://issues.apache.org/jira/browse/HADOOP-1573) | Support for 0 reducers in PIPES | Major | . | Christian Kunz | Owen O'Malley |
| [HADOOP-1569](https://issues.apache.org/jira/browse/HADOOP-1569) | distcp should use the Path -\> FileSystem interface like the rest of Hadoop | Major | util | Owen O'Malley | Chris Douglas |
| [HADOOP-1565](https://issues.apache.org/jira/browse/HADOOP-1565) | DFSScalability: reduce memory usage of namenode | Major | . | dhruba borthakur | dhruba borthakur |
| [HADOOP-1463](https://issues.apache.org/jira/browse/HADOOP-1463) | dfs.datanode.du.reserved semantics being violated | Blocker | . | Hairong Kuang | Hairong Kuang |
| [HADOOP-1316](https://issues.apache.org/jira/browse/HADOOP-1316) | "Go to parent directory" does not work on windows. | Minor | . | Konstantin Shvachko | Mahadev konar |
| [HADOOP-1076](https://issues.apache.org/jira/browse/HADOOP-1076) | Periodic checkpointing cannot resume if the secondary name-node fails. | Major | . | Konstantin Shvachko | dhruba borthakur |
| [HADOOP-1018](https://issues.apache.org/jira/browse/HADOOP-1018) | Single lost heartbeat leads to a "Lost task tracker" | Major | . | Andrzej Bialecki | Arun C Murthy |
| [HADOOP-999](https://issues.apache.org/jira/browse/HADOOP-999) | DFS Client should create file when the user creates the file | Major | . | Owen O'Malley | Tsz Wo Nicholas Sze |
| [HADOOP-932](https://issues.apache.org/jira/browse/HADOOP-932) | File locking interface and implementation should be removed. | Minor | fs | Raghu Angadi | Raghu Angadi |
| [HADOOP-795](https://issues.apache.org/jira/browse/HADOOP-795) | hdfs -cp /a/b/c /x/y acts like hdfs -cp /a/b/c/\* /x/y | Minor | . | arkady borkovsky | Mahadev konar |
| [HADOOP-120](https://issues.apache.org/jira/browse/HADOOP-120) | Reading an ArrayWriter does not work because valueClass does not get initialized | Major | io | Dick King | Cameron Pope |
| [HADOOP-89](https://issues.apache.org/jira/browse/HADOOP-89) | files are not visible until they are closed | Critical | . | Yoram Arnon | dhruba borthakur |
### TESTS:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
### SUB-TASKS:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
### OTHER:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
| [HADOOP-1879](https://issues.apache.org/jira/browse/HADOOP-1879) | Warnings With JDK1.6.0\_02 | Minor | . | Nilay Vaish | Nilay Vaish |

@ -0,0 +1,24 @@
<!---
# Licensed to the Apache Software Foundation (ASF) under one
# or more contributor license agreements. See the NOTICE file
# distributed with this work for additional information
# regarding copyright ownership. The ASF licenses this file
# to you under the Apache License, Version 2.0 (the
# "License"); you may not use this file except in compliance
# with the License. You may obtain a copy of the License at
#
# http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing, software
# distributed under the License is distributed on an "AS IS" BASIS,
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# limitations under the License.
-->
# Apache Hadoop 0.15.0 Release Notes
These release notes cover new developer and user-facing incompatibilities, features, and major improvements.

@ -0,0 +1,71 @@
<!---
# Licensed to the Apache Software Foundation (ASF) under one
# or more contributor license agreements. See the NOTICE file
# distributed with this work for additional information
# regarding copyright ownership. The ASF licenses this file
# to you under the Apache License, Version 2.0 (the
# "License"); you may not use this file except in compliance
# with the License. You may obtain a copy of the License at
#
# http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing, software
# distributed under the License is distributed on an "AS IS" BASIS,
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# limitations under the License.
-->
# Apache Hadoop Changelog
## Release 0.15.1 - 2007-11-27
### INCOMPATIBLE CHANGES:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
### NEW FEATURES:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
### IMPROVEMENTS:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
| [HADOOP-1917](https://issues.apache.org/jira/browse/HADOOP-1917) | Need configuration guides for Hadoop | Critical | documentation | Sameer Paranjpye | Arun C Murthy |
| [HADOOP-713](https://issues.apache.org/jira/browse/HADOOP-713) | dfs list operation is too expensive | Blocker | . | Hairong Kuang | dhruba borthakur |
### BUG FIXES:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
| [HADOOP-2205](https://issues.apache.org/jira/browse/HADOOP-2205) | Regenerate entire hadoop website since site.xml was changed by HADOOP-1917 | Blocker | documentation | Arun C Murthy | Arun C Murthy |
| [HADOOP-2174](https://issues.apache.org/jira/browse/HADOOP-2174) | distcp throws a NullPointerException in the close() method of mapper class due to the Reporter becoming invalid | Critical | util | Riccardo Boscolo | Chris Douglas |
| [HADOOP-2172](https://issues.apache.org/jira/browse/HADOOP-2172) | PositionCache was removed from FSDataInputStream, causes extremely bad MapFile performance | Blocker | fs | Johan Oskarsson | Doug Cutting |
| [HADOOP-2169](https://issues.apache.org/jira/browse/HADOOP-2169) | libhdfs makefile wrongly sets up DT\_SONAME field of libhdfs.so | Critical | . | Arun C Murthy | Arun C Murthy |
### TESTS:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
### SUB-TASKS:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
### OTHER:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
| [HADOOP-2140](https://issues.apache.org/jira/browse/HADOOP-2140) | C and C++ files are missing Apache license header | Blocker | . | Owen O'Malley | Owen O'Malley |

@ -0,0 +1,24 @@
<!---
# Licensed to the Apache Software Foundation (ASF) under one
# or more contributor license agreements. See the NOTICE file
# distributed with this work for additional information
# regarding copyright ownership. The ASF licenses this file
# to you under the Apache License, Version 2.0 (the
# "License"); you may not use this file except in compliance
# with the License. You may obtain a copy of the License at
#
# http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing, software
# distributed under the License is distributed on an "AS IS" BASIS,
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# limitations under the License.
-->
# Apache Hadoop 0.15.1 Release Notes
These release notes cover new developer and user-facing incompatibilities, features, and major improvements.

@ -0,0 +1,79 @@
<!---
# Licensed to the Apache Software Foundation (ASF) under one
# or more contributor license agreements. See the NOTICE file
# distributed with this work for additional information
# regarding copyright ownership. The ASF licenses this file
# to you under the Apache License, Version 2.0 (the
# "License"); you may not use this file except in compliance
# with the License. You may obtain a copy of the License at
#
# http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing, software
# distributed under the License is distributed on an "AS IS" BASIS,
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# limitations under the License.
-->
# Apache Hadoop Changelog
## Release 0.15.2 - 2008-01-08
### INCOMPATIBLE CHANGES:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
### NEW FEATURES:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
### IMPROVEMENTS:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
| [HADOOP-2382](https://issues.apache.org/jira/browse/HADOOP-2382) | include hadoop-default.html in subversion | Minor | documentation | Doug Cutting | |
| [HADOOP-2160](https://issues.apache.org/jira/browse/HADOOP-2160) | separate website from user documentation | Major | documentation | Doug Cutting | Doug Cutting |
| [HADOOP-1327](https://issues.apache.org/jira/browse/HADOOP-1327) | Doc on Streaming | Major | documentation | Runping Qi | Rob Weltman |
### BUG FIXES:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
| [HADOOP-2486](https://issues.apache.org/jira/browse/HADOOP-2486) | Dropping records at reducer. InMemoryFileSystem NPE. | Blocker | . | Koji Noguchi | Devaraj Das |
| [HADOOP-2460](https://issues.apache.org/jira/browse/HADOOP-2460) | NameNode could delete wrong edits file when there is an error | Major | . | Raghu Angadi | dhruba borthakur |
| [HADOOP-2456](https://issues.apache.org/jira/browse/HADOOP-2456) | German locale makes NameNode web interface crash | Minor | . | Matthias Friedrich | Matthias Friedrich |
| [HADOOP-2437](https://issues.apache.org/jira/browse/HADOOP-2437) | final map output not evenly distributed across multiple disks | Blocker | . | Christian Kunz | Arun C Murthy |
| [HADOOP-2422](https://issues.apache.org/jira/browse/HADOOP-2422) | dfs -cat multiple files fail with 'Unable to write to output stream.' | Blocker | . | Koji Noguchi | Raghu Angadi |
| [HADOOP-2378](https://issues.apache.org/jira/browse/HADOOP-2378) | last TaskCompletionEvent gets added to the job after the job is marked as completed | Blocker | . | Alejandro Abdelnur | Devaraj Das |
| [HADOOP-2246](https://issues.apache.org/jira/browse/HADOOP-2246) | In CHANGES.txt, move HADOOP-1851 & HADOOP-1231 to INCOMPATIBLE CHANGES section | Blocker | documentation | Devaraj Das | Arun C Murthy |
| [HADOOP-2238](https://issues.apache.org/jira/browse/HADOOP-2238) | TaskGraphServlet does not set Content-Type | Major | . | Paul Saab | |
| [HADOOP-2228](https://issues.apache.org/jira/browse/HADOOP-2228) | Jobs fail because job.xml exists | Major | . | Johan Oskarsson | Johan Oskarsson |
| [HADOOP-2227](https://issues.apache.org/jira/browse/HADOOP-2227) | wrong usage of mapred.local.dir.minspacestart | Critical | . | Christian Kunz | Amareshwari Sriramadasu |
| [HADOOP-2158](https://issues.apache.org/jira/browse/HADOOP-2158) | hdfsListDirectory in libhdfs does not scale | Blocker | . | Christian Kunz | Christian Kunz |
| [HADOOP-2129](https://issues.apache.org/jira/browse/HADOOP-2129) | distcp between two clusters does not work if it is run on the target cluster | Critical | util | Murtaza A. Basrai | Doug Cutting |
### TESTS:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
### SUB-TASKS:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
### OTHER:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |

@ -0,0 +1,24 @@
<!---
# Licensed to the Apache Software Foundation (ASF) under one
# or more contributor license agreements. See the NOTICE file
# distributed with this work for additional information
# regarding copyright ownership. The ASF licenses this file
# to you under the Apache License, Version 2.0 (the
# "License"); you may not use this file except in compliance
# with the License. You may obtain a copy of the License at
#
# http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing, software
# distributed under the License is distributed on an "AS IS" BASIS,
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# limitations under the License.
-->
# Apache Hadoop 0.15.2 Release Notes
These release notes cover new developer and user-facing incompatibilities, features, and major improvements.

@ -0,0 +1,69 @@
<!---
# Licensed to the Apache Software Foundation (ASF) under one
# or more contributor license agreements. See the NOTICE file
# distributed with this work for additional information
# regarding copyright ownership. The ASF licenses this file
# to you under the Apache License, Version 2.0 (the
# "License"); you may not use this file except in compliance
# with the License. You may obtain a copy of the License at
#
# http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing, software
# distributed under the License is distributed on an "AS IS" BASIS,
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# limitations under the License.
-->
# Apache Hadoop Changelog
## Release 0.15.3 - 2008-01-21
### INCOMPATIBLE CHANGES:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
### NEW FEATURES:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
### IMPROVEMENTS:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
### BUG FIXES:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
| [HADOOP-2574](https://issues.apache.org/jira/browse/HADOOP-2574) | bugs in mapred tutorial | Major | documentation | Doug Cutting | Arun C Murthy |
| [HADOOP-2570](https://issues.apache.org/jira/browse/HADOOP-2570) | streaming jobs fail after HADOOP-2227 | Blocker | . | Lohit Vijayarenu | Amareshwari Sriramadasu |
| [HADOOP-2562](https://issues.apache.org/jira/browse/HADOOP-2562) | globPaths does not support {ab,cd} as it claims to | Blocker | fs | Hairong Kuang | Hairong Kuang |
| [HADOOP-2540](https://issues.apache.org/jira/browse/HADOOP-2540) | Empty blocks make fsck report corrupt, even when it isn't | Blocker | . | Allen Wittenauer | dhruba borthakur |
| [HADOOP-650](https://issues.apache.org/jira/browse/HADOOP-650) | bin/hadoop dfs -lsr / is broken | Minor | . | Mahadev konar | Mahadev konar |
### TESTS:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
### SUB-TASKS:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
### OTHER:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |

@ -0,0 +1,24 @@
<!---
# Licensed to the Apache Software Foundation (ASF) under one
# or more contributor license agreements. See the NOTICE file
# distributed with this work for additional information
# regarding copyright ownership. The ASF licenses this file
# to you under the Apache License, Version 2.0 (the
# "License"); you may not use this file except in compliance
# with the License. You may obtain a copy of the License at
#
# http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing, software
# distributed under the License is distributed on an "AS IS" BASIS,
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# limitations under the License.
-->
# Apache Hadoop 0.15.3 Release Notes
These release notes cover new developer and user-facing incompatibilities, features, and major improvements.

@ -0,0 +1,65 @@
<!---
# Licensed to the Apache Software Foundation (ASF) under one
# or more contributor license agreements. See the NOTICE file
# distributed with this work for additional information
# regarding copyright ownership. The ASF licenses this file
# to you under the Apache License, Version 2.0 (the
# "License"); you may not use this file except in compliance
# with the License. You may obtain a copy of the License at
#
# http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing, software
# distributed under the License is distributed on an "AS IS" BASIS,
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# limitations under the License.
-->
# Apache Hadoop Changelog
## Release 0.15.4 - Unreleased
### INCOMPATIBLE CHANGES:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
### NEW FEATURES:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
### IMPROVEMENTS:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
### BUG FIXES:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
| [HADOOP-2647](https://issues.apache.org/jira/browse/HADOOP-2647) | dfs -put hangs | Major | . | Lohit Vijayarenu | Raghu Angadi |
### TESTS:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
### SUB-TASKS:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
### OTHER:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |

@ -0,0 +1,24 @@
<!---
# Licensed to the Apache Software Foundation (ASF) under one
# or more contributor license agreements. See the NOTICE file
# distributed with this work for additional information
# regarding copyright ownership. The ASF licenses this file
# to you under the Apache License, Version 2.0 (the
# "License"); you may not use this file except in compliance
# with the License. You may obtain a copy of the License at
#
# http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing, software
# distributed under the License is distributed on an "AS IS" BASIS,
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# limitations under the License.
-->
# Apache Hadoop 0.15.4 Release Notes
These release notes cover new developer and user-facing incompatibilities, features, and major improvements.

@ -0,0 +1,241 @@
<!---
# Licensed to the Apache Software Foundation (ASF) under one
# or more contributor license agreements. See the NOTICE file
# distributed with this work for additional information
# regarding copyright ownership. The ASF licenses this file
# to you under the Apache License, Version 2.0 (the
# "License"); you may not use this file except in compliance
# with the License. You may obtain a copy of the License at
#
# http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing, software
# distributed under the License is distributed on an "AS IS" BASIS,
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# limitations under the License.
-->
# Apache Hadoop Changelog
## Release 0.16.0 - 2008-02-07
### INCOMPATIBLE CHANGES:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
### NEW FEATURES:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
| [HADOOP-2603](https://issues.apache.org/jira/browse/HADOOP-2603) | SequenceFileAsBinaryInputFormat | Major | . | Chris Douglas | Chris Douglas |
| [HADOOP-2567](https://issues.apache.org/jira/browse/HADOOP-2567) | add FileSystem#getHomeDirectory() method | Major | fs | Doug Cutting | Doug Cutting |
| [HADOOP-2543](https://issues.apache.org/jira/browse/HADOOP-2543) | No-permission-checking mode for smooth transition to 0.16's permissions features. | Major | . | Sanjay Radia | Hairong Kuang |
| [HADOOP-2529](https://issues.apache.org/jira/browse/HADOOP-2529) | DFS User Guide | Major | documentation | Raghu Angadi | Raghu Angadi |
| [HADOOP-2514](https://issues.apache.org/jira/browse/HADOOP-2514) | Trash and permissions don't mix | Major | . | Robert Chansler | Doug Cutting |
| [HADOOP-2487](https://issues.apache.org/jira/browse/HADOOP-2487) | Provide an option to get job status for all jobs run by or submitted to a job tracker | Major | . | Hemanth Yamijala | Amareshwari Sriramadasu |
| [HADOOP-2447](https://issues.apache.org/jira/browse/HADOOP-2447) | HDFS should be capable of limiting the total number of inodes in the system | Major | . | Sameer Paranjpye | dhruba borthakur |
| [HADOOP-2398](https://issues.apache.org/jira/browse/HADOOP-2398) | Additional Instrumentation for NameNode, RPC Layer and JMX support | Major | . | Sanjay Radia | Sanjay Radia |
| [HADOOP-2381](https://issues.apache.org/jira/browse/HADOOP-2381) | Support permission information in FileStatus | Major | fs | Tsz Wo Nicholas Sze | Raghu Angadi |
| [HADOOP-2367](https://issues.apache.org/jira/browse/HADOOP-2367) | Get representative hprof information from tasks | Major | . | Owen O'Malley | Owen O'Malley |
| [HADOOP-2336](https://issues.apache.org/jira/browse/HADOOP-2336) | Shell commands to access and modify file permissions | Major | fs | Raghu Angadi | Raghu Angadi |
| [HADOOP-2288](https://issues.apache.org/jira/browse/HADOOP-2288) | Change FileSystem API to support access control. | Major | fs | Tsz Wo Nicholas Sze | Tsz Wo Nicholas Sze |
| [HADOOP-2229](https://issues.apache.org/jira/browse/HADOOP-2229) | Provide a simple login implementation | Major | fs | Tsz Wo Nicholas Sze | Hairong Kuang |
| [HADOOP-2184](https://issues.apache.org/jira/browse/HADOOP-2184) | RPC Support for user permissions and authentication. | Major | ipc | Tsz Wo Nicholas Sze | Raghu Angadi |
| [HADOOP-2145](https://issues.apache.org/jira/browse/HADOOP-2145) | need 'doc' target that runs forrest | Major | build | Doug Cutting | Doug Cutting |
| [HADOOP-2085](https://issues.apache.org/jira/browse/HADOOP-2085) | Map-side joins on sorted, equally-partitioned datasets | Major | . | Chris Douglas | Chris Douglas |
| [HADOOP-2045](https://issues.apache.org/jira/browse/HADOOP-2045) | credits page should have more information | Major | documentation | Doug Cutting | Doug Cutting |
| [HADOOP-2012](https://issues.apache.org/jira/browse/HADOOP-2012) | Periodic verification at the Datanode | Major | . | Raghu Angadi | Raghu Angadi |
| [HADOOP-1912](https://issues.apache.org/jira/browse/HADOOP-1912) | Datanode should support block replacement | Major | . | Hairong Kuang | Hairong Kuang |
| [HADOOP-1652](https://issues.apache.org/jira/browse/HADOOP-1652) | Rebalance data blocks when new data nodes added or data nodes become full | Major | . | Hairong Kuang | Hairong Kuang |
| [HADOOP-1604](https://issues.apache.org/jira/browse/HADOOP-1604) | admins should be able to finalize namenode upgrades without running the cluster | Critical | . | Owen O'Malley | Konstantin Shvachko |
| [HADOOP-1301](https://issues.apache.org/jira/browse/HADOOP-1301) | resource management provisioning for Hadoop | Major | . | Pete Wyckoff | Hemanth Yamijala |
| [HADOOP-1298](https://issues.apache.org/jira/browse/HADOOP-1298) | adding user info to file | Major | fs | Kurtis Heimerl | Tsz Wo Nicholas Sze |
### IMPROVEMENTS:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
| [HADOOP-2645](https://issues.apache.org/jira/browse/HADOOP-2645) | Additional metrics & jmx beans and cleanup to use the recent metrics libraries | Major | . | Sanjay Radia | Sanjay Radia |
| [HADOOP-2596](https://issues.apache.org/jira/browse/HADOOP-2596) | add SequenceFile.createWriter() method that takes block size as parameter | Minor | io | Alejandro Abdelnur | Alejandro Abdelnur |
| [HADOOP-2566](https://issues.apache.org/jira/browse/HADOOP-2566) | need FileSystem#globStatus method | Major | fs | Doug Cutting | Hairong Kuang |
| [HADOOP-2552](https://issues.apache.org/jira/browse/HADOOP-2552) | enable hdfs permission checking by default | Major | . | Doug Cutting | Doug Cutting |
| [HADOOP-2547](https://issues.apache.org/jira/browse/HADOOP-2547) | remove use of 'magic number' in build.xml | Trivial | build | Hrishikesh | Hrishikesh |
| [HADOOP-2469](https://issues.apache.org/jira/browse/HADOOP-2469) | WritableUtils.clone should take Configuration rather than JobConf | Minor | io | stack | stack |
| [HADOOP-2466](https://issues.apache.org/jira/browse/HADOOP-2466) | FileInputFormat computeSplitSize() method, change visibility to protected and make it a member method | Major | . | Alejandro Abdelnur | Alejandro Abdelnur |
| [HADOOP-2464](https://issues.apache.org/jira/browse/HADOOP-2464) | Test permissions related shell commands with DFS | Major | . | Raghu Angadi | Raghu Angadi |
| [HADOOP-2457](https://issues.apache.org/jira/browse/HADOOP-2457) | Add a 'forrest.home' property for the 'docs' target in build.xml | Minor | documentation | Arun C Murthy | Arun C Murthy |
| [HADOOP-2432](https://issues.apache.org/jira/browse/HADOOP-2432) | If HDFS is going to throw an exception "File does not exist" it should include the name of the file | Minor | . | Jim Kellerman | Jim Kellerman |
| [HADOOP-2425](https://issues.apache.org/jira/browse/HADOOP-2425) | TextOutputFormat should special case Text | Major | . | Owen O'Malley | Owen O'Malley |
| [HADOOP-2401](https://issues.apache.org/jira/browse/HADOOP-2401) | Lease holder information should be passed in ClientProtocol.abandonBlock(...) | Major | . | Tsz Wo Nicholas Sze | Tsz Wo Nicholas Sze |
| [HADOOP-2390](https://issues.apache.org/jira/browse/HADOOP-2390) | Document the user-controls for intermediate/output compression via forrest | Major | documentation | Arun C Murthy | Arun C Murthy |
| [HADOOP-2349](https://issues.apache.org/jira/browse/HADOOP-2349) | FSEditLog.logEdit(byte op, Writable w1, Writable w2) should accept variable numbers of Writable, instead of two. | Minor | . | Tsz Wo Nicholas Sze | Tsz Wo Nicholas Sze |
| [HADOOP-2298](https://issues.apache.org/jira/browse/HADOOP-2298) | ant target without source and docs | Major | build | Gautam Kowshik | Hrishikesh |
| [HADOOP-2268](https://issues.apache.org/jira/browse/HADOOP-2268) | JobControl classes should use interfaces rather than implementations | Minor | . | Adrian Woodhead | Adrian Woodhead |
| [HADOOP-2233](https://issues.apache.org/jira/browse/HADOOP-2233) | General example for modeling m/r load in Java | Minor | test | Chris Douglas | Chris Douglas |
| [HADOOP-2232](https://issues.apache.org/jira/browse/HADOOP-2232) | Add option to disable Nagle's algorithm in the IPC Server | Major | ipc | Clint Morgan | Clint Morgan |
| [HADOOP-2208](https://issues.apache.org/jira/browse/HADOOP-2208) | Reduce frequency of Counter updates in the task tracker status | Major | . | Amareshwari Sriramadasu | Amareshwari Sriramadasu |
| [HADOOP-2185](https://issues.apache.org/jira/browse/HADOOP-2185) | Server ports: to roll or not to roll. | Major | conf | Konstantin Shvachko | Konstantin Shvachko |
| [HADOOP-2149](https://issues.apache.org/jira/browse/HADOOP-2149) | Pure name-node benchmarks. | Major | . | Konstantin Shvachko | Konstantin Shvachko |
| [HADOOP-2134](https://issues.apache.org/jira/browse/HADOOP-2134) | Remove developer-centric requirements from overview.html | Major | documentation | Arun C Murthy | Jim Kellerman |
| [HADOOP-2131](https://issues.apache.org/jira/browse/HADOOP-2131) | Speculative execution should be allowed for reducers only | Critical | . | Srikanth Kakani | Amareshwari Sriramadasu |
| [HADOOP-2127](https://issues.apache.org/jira/browse/HADOOP-2127) | Add pipes sort example | Major | . | Owen O'Malley | Owen O'Malley |
| [HADOOP-2113](https://issues.apache.org/jira/browse/HADOOP-2113) | Add "-text" command to FsShell to decode SequenceFile to stdout | Minor | fs | Chris Douglas | Chris Douglas |
| [HADOOP-2105](https://issues.apache.org/jira/browse/HADOOP-2105) | Clarify requirements for Hadoop in overview.html | Minor | . | Jim Kellerman | Jim Kellerman |
| [HADOOP-2086](https://issues.apache.org/jira/browse/HADOOP-2086) | ability to add dependencies to a job after construction | Major | . | Adrian Woodhead | Adrian Woodhead |
| [HADOOP-2077](https://issues.apache.org/jira/browse/HADOOP-2077) | Logging version number (and compiled date) at STARTUP\_MSG | Trivial | . | Koji Noguchi | Arun C Murthy |
| [HADOOP-2058](https://issues.apache.org/jira/browse/HADOOP-2058) | Allow adding additional datanodes to MiniDFSCluster | Major | test | Hairong Kuang | Hairong Kuang |
| [HADOOP-1989](https://issues.apache.org/jira/browse/HADOOP-1989) | Add support for simulated Data Nodes - helpful for testing and performance benchmarking of the Name Node without having a large cluster | Minor | . | Sanjay Radia | Sanjay Radia |
| [HADOOP-1965](https://issues.apache.org/jira/browse/HADOOP-1965) | Handle map output buffers better | Major | . | Devaraj Das | Amar Kamat |
| [HADOOP-1900](https://issues.apache.org/jira/browse/HADOOP-1900) | the heartbeat and task event queries interval should be set dynamically by the JobTracker | Major | . | Owen O'Malley | Amareshwari Sriramadasu |
| [HADOOP-1876](https://issues.apache.org/jira/browse/HADOOP-1876) | Persisting completed jobs status | Critical | . | Alejandro Abdelnur | Alejandro Abdelnur |
| [HADOOP-1873](https://issues.apache.org/jira/browse/HADOOP-1873) | User permissions for Map/Reduce | Major | . | Raghu Angadi | Hairong Kuang |
| [HADOOP-1857](https://issues.apache.org/jira/browse/HADOOP-1857) | Ability to run a script when a task fails to capture stack traces | Major | . | Amareshwari Sriramadasu | Amareshwari Sriramadasu |
| [HADOOP-1855](https://issues.apache.org/jira/browse/HADOOP-1855) | fsck should verify block placement | Major | . | dhruba borthakur | Konstantin Shvachko |
| [HADOOP-1848](https://issues.apache.org/jira/browse/HADOOP-1848) | Redesign of Eclipse plug-in interface with Hadoop | Major | . | Christophe Taton | Christophe Taton |
| [HADOOP-1843](https://issues.apache.org/jira/browse/HADOOP-1843) | Remove deprecated code in Configuration/JobConf | Major | conf | Arun C Murthy | Arun C Murthy |
| [HADOOP-1841](https://issues.apache.org/jira/browse/HADOOP-1841) | IPC server should write responses asynchronously | Major | ipc | Doug Cutting | dhruba borthakur |
| [HADOOP-1839](https://issues.apache.org/jira/browse/HADOOP-1839) | Link-ify the Pending/Running/Complete/Killed tasks/task-attempts on jobdetails.jsp | Major | . | Arun C Murthy | Amar Kamat |
| [HADOOP-1742](https://issues.apache.org/jira/browse/HADOOP-1742) | FSNamesystem.startFile() javadoc is inconsistent | Minor | . | Konstantin Shvachko | Konstantin Shvachko |
| [HADOOP-1719](https://issues.apache.org/jira/browse/HADOOP-1719) | Improve the utilization of shuffle copier threads | Major | . | Devaraj Das | Amar Kamat |
| [HADOOP-1707](https://issues.apache.org/jira/browse/HADOOP-1707) | Remove the DFS Client disk-based cache | Major | . | dhruba borthakur | dhruba borthakur |
| [HADOOP-1660](https://issues.apache.org/jira/browse/HADOOP-1660) | add support for native library to DistributedCache | Major | . | Alejandro Abdelnur | Arun C Murthy |
| [HADOOP-1302](https://issues.apache.org/jira/browse/HADOOP-1302) | Remove deprecated contrib/abacus code | Major | . | Doug Cutting | Enis Soztutar |
| [HADOOP-1274](https://issues.apache.org/jira/browse/HADOOP-1274) | Configuring different number of mappers and reducers per TaskTracker | Major | . | Koji Noguchi | Amareshwari Sriramadasu |
| [HADOOP-1210](https://issues.apache.org/jira/browse/HADOOP-1210) | Log counters in job history | Minor | . | Albert Chern | Owen O'Malley |
| [HADOOP-1185](https://issues.apache.org/jira/browse/HADOOP-1185) | dynamically change log levels | Major | util | dhruba borthakur | Tsz Wo Nicholas Sze |
### BUG FIXES:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
| [HADOOP-2768](https://issues.apache.org/jira/browse/HADOOP-2768) | DFSIO write performance benchmark shows a regression | Blocker | . | Mukund Madhugiri | dhruba borthakur |
| [HADOOP-2755](https://issues.apache.org/jira/browse/HADOOP-2755) | dfs fsck extremely slow, dfs ls times out | Blocker | . | Christian Kunz | Tsz Wo Nicholas Sze |
| [HADOOP-2740](https://issues.apache.org/jira/browse/HADOOP-2740) | Modify HOD to work with changes mentioned in HADOOP-2404 | Blocker | contrib/hod | Hemanth Yamijala | Hemanth Yamijala |
| [HADOOP-2734](https://issues.apache.org/jira/browse/HADOOP-2734) | docs link to lucene.apache.org | Major | documentation | Doug Cutting | Doug Cutting |
| [HADOOP-2732](https://issues.apache.org/jira/browse/HADOOP-2732) | ab{5[6-9],[6-9][6-9]}.gz should not be treated as an illegal glob | Blocker | fs | Hairong Kuang | Hairong Kuang |
| [HADOOP-2723](https://issues.apache.org/jira/browse/HADOOP-2723) | Hadoop 2367- Does not respect JobConf.getProfileEnabled() | Blocker | . | Clint Morgan | Amareshwari Sriramadasu |
| [HADOOP-2720](https://issues.apache.org/jira/browse/HADOOP-2720) | Update HOD in Hadoop 0.16 | Blocker | contrib/hod | Hemanth Yamijala | Hemanth Yamijala |
| [HADOOP-2714](https://issues.apache.org/jira/browse/HADOOP-2714) | Unit test fails on Windows: org.apache.hadoop.dfs.TestDecommission | Blocker | . | Mukund Madhugiri | dhruba borthakur |
| [HADOOP-2713](https://issues.apache.org/jira/browse/HADOOP-2713) | Unit test fails on Windows: org.apache.hadoop.dfs.TestDatanodeDeath | Blocker | . | Mukund Madhugiri | dhruba borthakur |
| [HADOOP-2691](https://issues.apache.org/jira/browse/HADOOP-2691) | Some junit tests fail with the exception: All datanodes are bad. Aborting... | Major | . | Hairong Kuang | dhruba borthakur |
| [HADOOP-2687](https://issues.apache.org/jira/browse/HADOOP-2687) | 1707 added errant INFO-level logging to DFSClient | Blocker | . | stack | stack |
| [HADOOP-2659](https://issues.apache.org/jira/browse/HADOOP-2659) | The commands in DFSAdmin should require admin privilege | Major | . | Tsz Wo Nicholas Sze | Tsz Wo Nicholas Sze |
| [HADOOP-2652](https://issues.apache.org/jira/browse/HADOOP-2652) | Fix permission issues for HftpFileSystem | Blocker | . | Tsz Wo Nicholas Sze | Tsz Wo Nicholas Sze |
| [HADOOP-2649](https://issues.apache.org/jira/browse/HADOOP-2649) | The ReplicationMonitor sleep period should be configurable | Blocker | . | dhruba borthakur | dhruba borthakur |
| [HADOOP-2646](https://issues.apache.org/jira/browse/HADOOP-2646) | SortValidator broken with fully-qualified working directories | Blocker | test | Doug Cutting | Arun C Murthy |
| [HADOOP-2640](https://issues.apache.org/jira/browse/HADOOP-2640) | MultiFileSplitInputFormat always returns 1 split when avgLengthPerSplit \> Integer.MAX\_VALUE | Blocker | . | Frédéric Bertin | Enis Soztutar |
| [HADOOP-2639](https://issues.apache.org/jira/browse/HADOOP-2639) | Reducers stuck in shuffle | Blocker | . | Amareshwari Sriramadasu | Arun C Murthy |
| [HADOOP-2633](https://issues.apache.org/jira/browse/HADOOP-2633) | Revert change to fsck made as part of permissions implementation | Blocker | . | Robert Chansler | Tsz Wo Nicholas Sze |
| [HADOOP-2626](https://issues.apache.org/jira/browse/HADOOP-2626) | RawLocalFileStatus is badly handling URIs | Major | fs | Frédéric Bertin | Doug Cutting |
| [HADOOP-2620](https://issues.apache.org/jira/browse/HADOOP-2620) | 'bin/hadoop fs -help' does not list file permissions commands. | Trivial | fs | Raghu Angadi | Raghu Angadi |
| [HADOOP-2614](https://issues.apache.org/jira/browse/HADOOP-2614) | dfs web interfaces should run as a configurable user account | Major | . | Tsz Wo Nicholas Sze | Tsz Wo Nicholas Sze |
| [HADOOP-2605](https://issues.apache.org/jira/browse/HADOOP-2605) | leading slash in mapred.task.tracker.report.bindAddress | Major | conf | Konstantin Shvachko | Konstantin Shvachko |
| [HADOOP-2601](https://issues.apache.org/jira/browse/HADOOP-2601) | TestNNThroughput should not use a fixed namenode port | Major | . | Hairong Kuang | Konstantin Shvachko |
| [HADOOP-2583](https://issues.apache.org/jira/browse/HADOOP-2583) | Potential Eclipse plug-in UI loop when editing location parameters | Minor | contrib/eclipse-plugin | Christophe Taton | Christophe Taton |
| [HADOOP-2582](https://issues.apache.org/jira/browse/HADOOP-2582) | hadoop dfs -copyToLocal creates zero byte files when source file does not exist | Major | . | Lohit Vijayarenu | Lohit Vijayarenu |
| [HADOOP-2576](https://issues.apache.org/jira/browse/HADOOP-2576) | Namenode performance degradation over time | Blocker | . | Christian Kunz | Raghu Angadi |
| [HADOOP-2571](https://issues.apache.org/jira/browse/HADOOP-2571) | javac generates a warning in test/o.a.h.io.FileBench | Trivial | test | Chris Douglas | Chris Douglas |
| [HADOOP-2549](https://issues.apache.org/jira/browse/HADOOP-2549) | hdfs does not honor dfs.du.reserved setting | Critical | . | Joydeep Sen Sarma | Hairong Kuang |
| [HADOOP-2538](https://issues.apache.org/jira/browse/HADOOP-2538) | NPE in TaskLog.java | Trivial | . | Michael Bieniosek | Michael Bieniosek |
| [HADOOP-2537](https://issues.apache.org/jira/browse/HADOOP-2537) | make build process compatible with Ant 1.7.0 | Major | build | Nigel Daley | Hrishikesh |
| [HADOOP-2535](https://issues.apache.org/jira/browse/HADOOP-2535) | Remove support for deprecated mapred.child.heap.size and indentation fix in TaskRunner.java | Minor | . | Arun C Murthy | Arun C Murthy |
| [HADOOP-2523](https://issues.apache.org/jira/browse/HADOOP-2523) | Unit test fails on Windows: TestDFSShell.testFilePermissions | Blocker | . | Mukund Madhugiri | Raghu Angadi |
| [HADOOP-2512](https://issues.apache.org/jira/browse/HADOOP-2512) | error stream handling in Shell executor | Blocker | util | Raghu Angadi | Raghu Angadi |
| [HADOOP-2511](https://issues.apache.org/jira/browse/HADOOP-2511) | HADOOP-2344 introduced a javadoc warning | Major | documentation | Arun C Murthy | Arun C Murthy |
| [HADOOP-2509](https://issues.apache.org/jira/browse/HADOOP-2509) | Add rat target to build | Major | build | Nigel Daley | Hrishikesh |
| [HADOOP-2503](https://issues.apache.org/jira/browse/HADOOP-2503) | REST Insert / Select | Critical | . | Billy Pearson | Bryan Duxbury |
| [HADOOP-2494](https://issues.apache.org/jira/browse/HADOOP-2494) | Set +x on contrib/\*/bin/\* in packaged tar bundle | Major | scripts | stack | stack |
| [HADOOP-2492](https://issues.apache.org/jira/browse/HADOOP-2492) | ConcurrentModificationException in org.apache.hadoop.ipc.Server.Responder | Major | ipc | Devaraj Das | dhruba borthakur |
| [HADOOP-2481](https://issues.apache.org/jira/browse/HADOOP-2481) | NNBench should periodically report its progress | Major | test | Hairong Kuang | Hairong Kuang |
| [HADOOP-2476](https://issues.apache.org/jira/browse/HADOOP-2476) | Unit test fails on Windows: TestCopyFiles.testCopyFromLocalToLocal | Blocker | fs | Mukund Madhugiri | Raghu Angadi |
| [HADOOP-2459](https://issues.apache.org/jira/browse/HADOOP-2459) | Running 'ant docs tar' includes src/docs/build in the resulting tar file | Minor | build | Nigel Daley | Nigel Daley |
| [HADOOP-2453](https://issues.apache.org/jira/browse/HADOOP-2453) | wordcount-simple example gives ParseException with examples configuration file | Blocker | . | Amareshwari Sriramadasu | Amareshwari Sriramadasu |
| [HADOOP-2452](https://issues.apache.org/jira/browse/HADOOP-2452) | Eclipse plug-in build.xml issue | Trivial | build | Christophe Taton | Christophe Taton |
| [HADOOP-2446](https://issues.apache.org/jira/browse/HADOOP-2446) | TestHDFSServerPorts fails. | Major | test | Raghu Angadi | Nigel Daley |
| [HADOOP-2442](https://issues.apache.org/jira/browse/HADOOP-2442) | Unit test failed: org.apache.hadoop.fs.TestLocalFileSystemPermission.testLocalFSsetOwner | Critical | fs | Mukund Madhugiri | Raghu Angadi |
| [HADOOP-2434](https://issues.apache.org/jira/browse/HADOOP-2434) | MapFile.get on HDFS in TRUNK is WAY!!! slower than 0.15.x | Blocker | io | stack | stack |
| [HADOOP-2420](https://issues.apache.org/jira/browse/HADOOP-2420) | Use exit code to detect normal errors while executing 'ls' in Local FS | Blocker | fs | Raghu Angadi | Raghu Angadi |
| [HADOOP-2404](https://issues.apache.org/jira/browse/HADOOP-2404) | HADOOP-2185 breaks compatibility with hadoop-0.15.0 | Blocker | conf | Arun C Murthy | Owen O'Malley |
| [HADOOP-2402](https://issues.apache.org/jira/browse/HADOOP-2402) | Lzo compression compresses each write from TextOutputFormat | Major | io, native | Chris Douglas | Chris Douglas |
| [HADOOP-2376](https://issues.apache.org/jira/browse/HADOOP-2376) | The sort example shouldn't override the number of maps | Major | . | Owen O'Malley | Owen O'Malley |
| [HADOOP-2368](https://issues.apache.org/jira/browse/HADOOP-2368) | Lots of unit tests fail on Windows with exception: Login failed: CreateProcess | Blocker | . | Mukund Madhugiri | Tsz Wo Nicholas Sze |
| [HADOOP-2365](https://issues.apache.org/jira/browse/HADOOP-2365) | Result of HashFunction.hash() contains all identical values | Minor | . | Andrzej Bialecki | Jim Kellerman |
| [HADOOP-2363](https://issues.apache.org/jira/browse/HADOOP-2363) | Unit tests fail if there is another instance of Hadoop | Major | test | Raghu Angadi | Konstantin Shvachko |
| [HADOOP-2359](https://issues.apache.org/jira/browse/HADOOP-2359) | PendingReplicationMonitor thread received exception. java.lang.InterruptedException | Major | . | Owen O'Malley | dhruba borthakur |
| [HADOOP-2352](https://issues.apache.org/jira/browse/HADOOP-2352) | Remove AC\_LIB\_CHECK from src/native/configure.ac to ensure libhadoop.so doesn't have a dependency on libz.so/liblzo.so | Major | native | Arun C Murthy | Arun C Murthy |
| [HADOOP-2344](https://issues.apache.org/jira/browse/HADOOP-2344) | Free up the buffers (input and error) while executing a shell command before waiting for it to finish. | Major | . | Amar Kamat | Amar Kamat |
| [HADOOP-2323](https://issues.apache.org/jira/browse/HADOOP-2323) | JobTracker.close() prints stack traces for exceptions that are not errors | Minor | . | Jim Kellerman | Jim Kellerman |
| [HADOOP-2314](https://issues.apache.org/jira/browse/HADOOP-2314) | TestBlockReplacement occasionally gets into an infinite loop | Major | . | Hairong Kuang | Hairong Kuang |
| [HADOOP-2313](https://issues.apache.org/jira/browse/HADOOP-2313) | build does not fail when libhdfs build fails | Minor | . | Nigel Daley | Nigel Daley |
| [HADOOP-2300](https://issues.apache.org/jira/browse/HADOOP-2300) | mapred.tasktracker.tasks.maximum is completely ignored | Blocker | . | Owen O'Malley | Amareshwari Sriramadasu |
| [HADOOP-2285](https://issues.apache.org/jira/browse/HADOOP-2285) | TextInputFormat is slow compared to reading files. | Major | . | Owen O'Malley | Owen O'Malley |
| [HADOOP-2284](https://issues.apache.org/jira/browse/HADOOP-2284) | BasicTypeSorterBase.compare calls progress on each compare | Major | . | Owen O'Malley | Amar Kamat |
| [HADOOP-2275](https://issues.apache.org/jira/browse/HADOOP-2275) | Erroneous detection of corrupted file when namenode fails to allocate any datanodes for newly allocated block | Major | . | dhruba borthakur | dhruba borthakur |
| [HADOOP-2272](https://issues.apache.org/jira/browse/HADOOP-2272) | findbugs currently fails due to hadoop-streaming having moved | Major | build | Adrian Woodhead | stack |
| [HADOOP-2271](https://issues.apache.org/jira/browse/HADOOP-2271) | chmod in ant package target fails | Major | build | Adrian Woodhead | Adrian Woodhead |
| [HADOOP-2256](https://issues.apache.org/jira/browse/HADOOP-2256) | TestBlockReplacement unit test failed. | Major | . | Raghu Angadi | Hairong Kuang |
| [HADOOP-2248](https://issues.apache.org/jira/browse/HADOOP-2248) | Word count example is spending 24% of the time in incrCounter | Major | . | Owen O'Malley | Owen O'Malley |
| [HADOOP-2247](https://issues.apache.org/jira/browse/HADOOP-2247) | Mappers fail easily due to repeated failures | Blocker | . | Srikanth Kakani | Amar Kamat |
| [HADOOP-2245](https://issues.apache.org/jira/browse/HADOOP-2245) | TestRecordMR and TestAggregates fail once in a while | Major | . | Devaraj Das | Adrian Woodhead |
| [HADOOP-2244](https://issues.apache.org/jira/browse/HADOOP-2244) | MapWritable.readFields needs to clear internal hash else instance accumulates entries forever | Major | io | stack | stack |
| [HADOOP-2228](https://issues.apache.org/jira/browse/HADOOP-2228) | Jobs fail because job.xml exists | Major | . | Johan Oskarsson | Johan Oskarsson |
| [HADOOP-2220](https://issues.apache.org/jira/browse/HADOOP-2220) | Reduce tasks fail too easily because of repeated fetch failures | Blocker | . | Christian Kunz | Amar Kamat |
| [HADOOP-2216](https://issues.apache.org/jira/browse/HADOOP-2216) | Job UI does not show running tasks and complete tasks correctly. | Major | . | Amareshwari Sriramadasu | Amar Kamat |
| [HADOOP-2215](https://issues.apache.org/jira/browse/HADOOP-2215) | Change documentation in cluster\_setup.html and mapred\_tutorial.html post HADOOP-1274 | Major | documentation | Amareshwari Sriramadasu | Amareshwari Sriramadasu |
| [HADOOP-2212](https://issues.apache.org/jira/browse/HADOOP-2212) | java.lang.ArithmeticException: / by zero in ChecksumFileSystem.open | Critical | fs | Michael Bieniosek | Michael Bieniosek |
| [HADOOP-2209](https://issues.apache.org/jira/browse/HADOOP-2209) | SecondaryNamenode process should exit if it encounters Runtime exceptions | Major | . | dhruba borthakur | |
| [HADOOP-2204](https://issues.apache.org/jira/browse/HADOOP-2204) | DFSTestUtil.waitReplication does not wait. | Major | . | Raghu Angadi | Raghu Angadi |
| [HADOOP-2189](https://issues.apache.org/jira/browse/HADOOP-2189) | Incrementing user counters should count as progress | Blocker | . | Owen O'Malley | Devaraj Das |
| [HADOOP-2151](https://issues.apache.org/jira/browse/HADOOP-2151) | FileSystem.globPaths does not validate the return list of Paths | Major | . | Lohit Vijayarenu | Lohit Vijayarenu |
| [HADOOP-2121](https://issues.apache.org/jira/browse/HADOOP-2121) | Unexpected IOException in DFSOutputStream.close() | Major | . | Raghu Angadi | Raghu Angadi |
| [HADOOP-2112](https://issues.apache.org/jira/browse/HADOOP-2112) | TestMiniMRMapRedDebugScript fails due to a missing file | Blocker | . | Devaraj Das | Arun C Murthy |
| [HADOOP-2104](https://issues.apache.org/jira/browse/HADOOP-2104) | clover description attribute suppresses all other targets in -projecthelp | Trivial | build | Chris Douglas | Chris Douglas |
| [HADOOP-2100](https://issues.apache.org/jira/browse/HADOOP-2100) | hadoop-daemon.sh script fails if HADOOP\_PID\_DIR doesn't exist | Major | scripts | Arun C Murthy | Michael Bieniosek |
| [HADOOP-2098](https://issues.apache.org/jira/browse/HADOOP-2098) | File handles for log files are still open in case of jobs with 0 maps | Major | . | Amar Kamat | Amar Kamat |
| [HADOOP-2096](https://issues.apache.org/jira/browse/HADOOP-2096) | The file used to localize job.xml should be closed. | Minor | . | Amar Kamat | Amar Kamat |
| [HADOOP-2092](https://issues.apache.org/jira/browse/HADOOP-2092) | Pipes C++ task does not die even if the Java tasks die | Blocker | . | Amareshwari Sriramadasu | Amareshwari Sriramadasu |
| [HADOOP-2089](https://issues.apache.org/jira/browse/HADOOP-2089) | Multiple cacheArchive does not work in Hadoop streaming | Critical | . | Milind Bhandarkar | Lohit Vijayarenu |
| [HADOOP-2078](https://issues.apache.org/jira/browse/HADOOP-2078) | Name-node should be able to close empty files. | Major | . | Konstantin Shvachko | Konstantin Shvachko |
| [HADOOP-2072](https://issues.apache.org/jira/browse/HADOOP-2072) | RawLocalFileStatus is causing Path problems | Major | fs | Dennis Kubes | |
| [HADOOP-2071](https://issues.apache.org/jira/browse/HADOOP-2071) | StreamXmlRecordReader throws java.io.IOException: Mark/reset exception in hadoop 0.14 | Major | . | Lohit Vijayarenu | Lohit Vijayarenu |
| [HADOOP-1984](https://issues.apache.org/jira/browse/HADOOP-1984) | some reducer stuck at copy phase and progress extremely slowly | Critical | . | Runping Qi | Amar Kamat |
| [HADOOP-1952](https://issues.apache.org/jira/browse/HADOOP-1952) | Streaming does not handle invalid -inputformat (typo by users for example) | Minor | . | Lohit Vijayarenu | Lohit Vijayarenu |
| [HADOOP-1898](https://issues.apache.org/jira/browse/HADOOP-1898) | locking for the ReflectionUtils.logThreadInfo is too conservative | Major | util | Owen O'Malley | Amareshwari Sriramadasu |
| [HADOOP-1642](https://issues.apache.org/jira/browse/HADOOP-1642) | Jobs using LocalJobRunner + JobControl fails | Critical | . | Johan Oskarsson | Doug Cutting |
| [HADOOP-1348](https://issues.apache.org/jira/browse/HADOOP-1348) | Configuration XML bug: comments inside values | Critical | conf | Eelco Lempsink | Rajagopal Natarajan |
| [HADOOP-1281](https://issues.apache.org/jira/browse/HADOOP-1281) | Speculative map tasks aren't getting killed although the TIP completed | Critical | . | Arun C Murthy | Arun C Murthy |
| [HADOOP-1245](https://issues.apache.org/jira/browse/HADOOP-1245) | value for mapred.tasktracker.tasks.maximum taken from jobtracker, not tasktracker | Major | . | Michael Bieniosek | Michael Bieniosek |
| [HADOOP-1195](https://issues.apache.org/jira/browse/HADOOP-1195) | NullPointerException in FSNamesystem due to getDatanode() return value is not checked | Major | . | Konstantin Shvachko | dhruba borthakur |
| [HADOOP-1130](https://issues.apache.org/jira/browse/HADOOP-1130) | Remove unused ClientFinalizer in DFSClient | Major | . | Philippe Gassmann | Chris Douglas |
| [HADOOP-290](https://issues.apache.org/jira/browse/HADOOP-290) | Fix Datanode transfer thread logging | Minor | . | Dennis Kubes | dhruba borthakur |
### TESTS:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
| [HADOOP-2449](https://issues.apache.org/jira/browse/HADOOP-2449) | Restore the old NN Bench that was replaced by a MR NN Bench | Major | . | Sanjay Radia | Sanjay Radia |
| [HADOOP-2431](https://issues.apache.org/jira/browse/HADOOP-2431) | Test HDFS File Permissions | Major | test | Hairong Kuang | Hairong Kuang |
| [HADOOP-2406](https://issues.apache.org/jira/browse/HADOOP-2406) | Micro-benchmark to measure read/write times through InputFormats | Major | fs, test | Chris Douglas | Chris Douglas |
| [HADOOP-2369](https://issues.apache.org/jira/browse/HADOOP-2369) | Representative mix of jobs for large cluster throughput benchmarking | Major | test | Chris Douglas | Runping Qi |
| [HADOOP-2342](https://issues.apache.org/jira/browse/HADOOP-2342) | create a micro-benchmark to measure local-file versus hdfs read | Major | . | Owen O'Malley | Owen O'Malley |
| [HADOOP-2000](https://issues.apache.org/jira/browse/HADOOP-2000) | Re-write NNBench to use MapReduce | Major | test | Mukund Madhugiri | Mukund Madhugiri |
### SUB-TASKS:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
### OTHER:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |

@ -0,0 +1,24 @@
<!---
# Licensed to the Apache Software Foundation (ASF) under one
# or more contributor license agreements. See the NOTICE file
# distributed with this work for additional information
# regarding copyright ownership. The ASF licenses this file
# to you under the Apache License, Version 2.0 (the
# "License"); you may not use this file except in compliance
# with the License. You may obtain a copy of the License at
#
# http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing, software
# distributed under the License is distributed on an "AS IS" BASIS,
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# limitations under the License.
-->
# Apache Hadoop 0.16.0 Release Notes
These release notes cover new developer and user-facing incompatibilities, features, and major improvements.

@ -0,0 +1,104 @@
<!---
# Licensed to the Apache Software Foundation (ASF) under one
# or more contributor license agreements. See the NOTICE file
# distributed with this work for additional information
# regarding copyright ownership. The ASF licenses this file
# to you under the Apache License, Version 2.0 (the
# "License"); you may not use this file except in compliance
# with the License. You may obtain a copy of the License at
#
# http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing, software
# distributed under the License is distributed on an "AS IS" BASIS,
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# limitations under the License.
-->
# Apache Hadoop Changelog
## Release 0.16.1 - 2008-03-14
### INCOMPATIBLE CHANGES:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
### NEW FEATURES:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
| [HADOOP-2923](https://issues.apache.org/jira/browse/HADOOP-2923) | Check in missing files from HADOOP-2603 | Major | . | Owen O'Malley | Chris Douglas |
| [HADOOP-2735](https://issues.apache.org/jira/browse/HADOOP-2735) | Setting default tmp directory for java createTempFile (java.io.tmpdir) | Critical | . | Koji Noguchi | Amareshwari Sriramadasu |
| [HADOOP-2371](https://issues.apache.org/jira/browse/HADOOP-2371) | Candidate user guide for permissions feature of Hadoop DFS | Major | . | Robert Chansler | Robert Chansler |
### IMPROVEMENTS:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
| [HADOOP-2911](https://issues.apache.org/jira/browse/HADOOP-2911) | [HOD] Make the information printed by allocate and info commands less verbose and clearer | Major | contrib/hod | Hemanth Yamijala | Vinod Kumar Vavilapalli |
| [HADOOP-2861](https://issues.apache.org/jira/browse/HADOOP-2861) | [HOD] Improve the user interface for the HOD commands | Major | contrib/hod | Hemanth Yamijala | Vinod Kumar Vavilapalli |
| [HADOOP-2730](https://issues.apache.org/jira/browse/HADOOP-2730) | Update HOD documentation | Major | contrib/hod | Hemanth Yamijala | Vinod Kumar Vavilapalli |
### BUG FIXES:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
| [HADOOP-2958](https://issues.apache.org/jira/browse/HADOOP-2958) | Test utility no longer works in trunk | Minor | test | Chris Douglas | Chris Douglas |
| [HADOOP-2931](https://issues.apache.org/jira/browse/HADOOP-2931) | exception in DFSClient.create: Stream closed | Major | . | Michael Bieniosek | Michael Bieniosek |
| [HADOOP-2925](https://issues.apache.org/jira/browse/HADOOP-2925) | [HOD] Create mapred system directory using a naming convention that will avoid clashes in multi-user shared cluster scenario. | Major | contrib/hod | Hemanth Yamijala | Hemanth Yamijala |
| [HADOOP-2915](https://issues.apache.org/jira/browse/HADOOP-2915) | mapred output files and directories should be created as the job submitter, not tasktracker or jobtracker | Blocker | . | Tsz Wo Nicholas Sze | Tsz Wo Nicholas Sze |
| [HADOOP-2904](https://issues.apache.org/jira/browse/HADOOP-2904) | 3 minor fixes in the rpc metrics area. | Major | . | girish vaitheeswaran | dhruba borthakur |
| [HADOOP-2903](https://issues.apache.org/jira/browse/HADOOP-2903) | Data type mismatch exception raised from pushMetric | Major | metrics | girish vaitheeswaran | girish vaitheeswaran |
| [HADOOP-2894](https://issues.apache.org/jira/browse/HADOOP-2894) | task trackers can't survive a job tracker bounce | Blocker | . | Owen O'Malley | Owen O'Malley |
| [HADOOP-2883](https://issues.apache.org/jira/browse/HADOOP-2883) | Extensive write failures | Blocker | . | Christian Kunz | dhruba borthakur |
| [HADOOP-2869](https://issues.apache.org/jira/browse/HADOOP-2869) | Deprecate and remove SequenceFile.setCompressionType | Major | . | Arun C Murthy | Arun C Murthy |
| [HADOOP-2847](https://issues.apache.org/jira/browse/HADOOP-2847) | [HOD] Idle cluster cleanup does not work if the JobTracker becomes unresponsive to RPC calls | Blocker | contrib/hod | Hemanth Yamijala | Hemanth Yamijala |
| [HADOOP-2843](https://issues.apache.org/jira/browse/HADOOP-2843) | mapred.join access control is overly restrictive | Major | . | Chris Douglas | Chris Douglas |
| [HADOOP-2840](https://issues.apache.org/jira/browse/HADOOP-2840) | Gridmix test script fails to run java sort tests | Major | test | Mukund Madhugiri | Mukund Madhugiri |
| [HADOOP-2814](https://issues.apache.org/jira/browse/HADOOP-2814) | NPE in datanode during TestDataTransferProtocol. | Major | . | Raghu Angadi | Raghu Angadi |
| [HADOOP-2813](https://issues.apache.org/jira/browse/HADOOP-2813) | Unit test fails on Linux: org.apache.hadoop.fs.TestDU.testDU | Blocker | fs | Mukund Madhugiri | Mahadev konar |
| [HADOOP-2811](https://issues.apache.org/jira/browse/HADOOP-2811) | method Counters.makeCompactString() does not insert separator char ',' between the counters of different groups. | Critical | . | Runping Qi | Runping Qi |
| [HADOOP-2809](https://issues.apache.org/jira/browse/HADOOP-2809) | [HOD] Syslog configuration, syslog-address, does not work in HOD 0.4 | Critical | contrib/hod | Hemanth Yamijala | Vinod Kumar Vavilapalli |
| [HADOOP-2808](https://issues.apache.org/jira/browse/HADOOP-2808) | FileUtil::copy ignores "overwrite" formal | Major | . | Chris Douglas | Chris Douglas |
| [HADOOP-2789](https://issues.apache.org/jira/browse/HADOOP-2789) | Race condition in ipc.Server prevents response being written back to client. | Critical | ipc | Clint Morgan | Raghu Angadi |
| [HADOOP-2788](https://issues.apache.org/jira/browse/HADOOP-2788) | chgrp missing from hadoop dfs options | Critical | . | Mukund Madhugiri | Raghu Angadi |
| [HADOOP-2787](https://issues.apache.org/jira/browse/HADOOP-2787) | The constant org.apache.hadoop.fs.permission.FsPermission.UMASK\_LABEL should be "dfs.umask", instead of "hadoop.dfs.umask" | Blocker | . | Tsz Wo Nicholas Sze | Tsz Wo Nicholas Sze |
| [HADOOP-2785](https://issues.apache.org/jira/browse/HADOOP-2785) | Typo in periodic block verification patch | Trivial | . | Raghu Angadi | Raghu Angadi |
| [HADOOP-2780](https://issues.apache.org/jira/browse/HADOOP-2780) | Socket receive buffer size on datanode too large | Major | . | dhruba borthakur | dhruba borthakur |
| [HADOOP-2769](https://issues.apache.org/jira/browse/HADOOP-2769) | TestNNThroughputBenchmark should not use a fixed http port | Major | test | Owen O'Malley | Owen O'Malley |
| [HADOOP-2766](https://issues.apache.org/jira/browse/HADOOP-2766) | [HOD] No way to set HADOOP\_OPTS environment variable to the Hadoop daemons through HOD | Critical | contrib/hod | Hemanth Yamijala | Vinod Kumar Vavilapalli |
| [HADOOP-2756](https://issues.apache.org/jira/browse/HADOOP-2756) | NPE in DFSClient in hbase under load | Minor | . | stack | Raghu Angadi |
| [HADOOP-2754](https://issues.apache.org/jira/browse/HADOOP-2754) | Path filter for Local file system list .crc files | Major | . | Amareshwari Sriramadasu | Hairong Kuang |
| [HADOOP-2733](https://issues.apache.org/jira/browse/HADOOP-2733) | Compiler warnings in TestClusterMapReduceTestCase and TestJobStatusPersistency | Major | test | Konstantin Shvachko | Tsz Wo Nicholas Sze |
| [HADOOP-2725](https://issues.apache.org/jira/browse/HADOOP-2725) | Distcp truncates some files when copying | Critical | util | Murtaza A. Basrai | Tsz Wo Nicholas Sze |
| [HADOOP-2716](https://issues.apache.org/jira/browse/HADOOP-2716) | Balancer should require superuser privilege | Major | . | Tsz Wo Nicholas Sze | Tsz Wo Nicholas Sze |
| [HADOOP-2683](https://issues.apache.org/jira/browse/HADOOP-2683) | Provide a way to specify login outside an RPC | Blocker | . | Raghu Angadi | Tsz Wo Nicholas Sze |
| [HADOOP-2391](https://issues.apache.org/jira/browse/HADOOP-2391) | Speculative Execution race condition with output paths | Major | . | Dennis Kubes | Amareshwari Sriramadasu |
| [HADOOP-1188](https://issues.apache.org/jira/browse/HADOOP-1188) | processIOError() should update fstime file | Blocker | . | Konstantin Shvachko | Konstantin Shvachko |
### TESTS:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
| [HADOOP-2852](https://issues.apache.org/jira/browse/HADOOP-2852) | Update gridmix to avoid artificially long tail | Major | test | Chris Douglas | Chris Douglas |
### SUB-TASKS:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
### OTHER:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
| [HADOOP-2918](https://issues.apache.org/jira/browse/HADOOP-2918) | Enhance log messages to better debug "No lease on file" message | Major | . | dhruba borthakur | dhruba borthakur |

@ -0,0 +1,24 @@
<!---
# Licensed to the Apache Software Foundation (ASF) under one
# or more contributor license agreements. See the NOTICE file
# distributed with this work for additional information
# regarding copyright ownership. The ASF licenses this file
# to you under the Apache License, Version 2.0 (the
# "License"); you may not use this file except in compliance
# with the License. You may obtain a copy of the License at
#
# http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing, software
# distributed under the License is distributed on an "AS IS" BASIS,
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# limitations under the License.
-->
# Apache Hadoop 0.16.1 Release Notes
These release notes cover new developer and user-facing incompatibilities, features, and major improvements.

@ -0,0 +1,87 @@
<!---
# Licensed to the Apache Software Foundation (ASF) under one
# or more contributor license agreements. See the NOTICE file
# distributed with this work for additional information
# regarding copyright ownership. The ASF licenses this file
# to you under the Apache License, Version 2.0 (the
# "License"); you may not use this file except in compliance
# with the License. You may obtain a copy of the License at
#
# http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing, software
# distributed under the License is distributed on an "AS IS" BASIS,
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# limitations under the License.
-->
# Apache Hadoop Changelog
## Release 0.16.2 - 2008-04-02
### INCOMPATIBLE CHANGES:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
### NEW FEATURES:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
### IMPROVEMENTS:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
### BUG FIXES:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
| [HADOOP-3128](https://issues.apache.org/jira/browse/HADOOP-3128) | TestDFSPermission due to not throwing exception. | Major | . | Konstantin Shvachko | Konstantin Shvachko |
| [HADOOP-3108](https://issues.apache.org/jira/browse/HADOOP-3108) | NPE in FSDirectory.unprotectedSetPermission | Blocker | . | Koji Noguchi | Konstantin Shvachko |
| [HADOOP-3107](https://issues.apache.org/jira/browse/HADOOP-3107) | fsck failing with NPE | Blocker | . | Koji Noguchi | dhruba borthakur |
| [HADOOP-3105](https://issues.apache.org/jira/browse/HADOOP-3105) | compile-core-test fails for branch 0.16 | Blocker | . | Amareshwari Sriramadasu | Alejandro Abdelnur |
| [HADOOP-3104](https://issues.apache.org/jira/browse/HADOOP-3104) | MultithreadedMapRunner keeps consuming records even if threads are not available | Critical | . | Alejandro Abdelnur | Alejandro Abdelnur |
| [HADOOP-3103](https://issues.apache.org/jira/browse/HADOOP-3103) | [HOD] Hadoop.tmp.dir should not be set to cluster directory | Blocker | contrib/hod | Vinod Kumar Vavilapalli | Vinod Kumar Vavilapalli |
| [HADOOP-3098](https://issues.apache.org/jira/browse/HADOOP-3098) | dfs -chown does not like "\_" underscore in user name | Blocker | fs | Koji Noguchi | Raghu Angadi |
| [HADOOP-3084](https://issues.apache.org/jira/browse/HADOOP-3084) | distcp fails for files with zero length | Blocker | util | Mukund Madhugiri | Chris Douglas |
| [HADOOP-3070](https://issues.apache.org/jira/browse/HADOOP-3070) | Trash not being expunged, Trash Emptier thread gone by NPE | Blocker | fs | Koji Noguchi | Koji Noguchi |
| [HADOOP-3056](https://issues.apache.org/jira/browse/HADOOP-3056) | distcp seems to be broken in 0.16.1 | Blocker | util | Christian Kunz | Chris Douglas |
| [HADOOP-3049](https://issues.apache.org/jira/browse/HADOOP-3049) | MultithreadedMapRunner eats RuntimeExceptions | Blocker | . | Alejandro Abdelnur | Alejandro Abdelnur |
| [HADOOP-3042](https://issues.apache.org/jira/browse/HADOOP-3042) | Update the Javadoc in JobConf.getOutputPath to reflect the actual temporary path | Major | documentation | Devaraj Das | Amareshwari Sriramadasu |
| [HADOOP-3039](https://issues.apache.org/jira/browse/HADOOP-3039) | Runtime exceptions not killing job | Blocker | . | Amareshwari Sriramadasu | Amareshwari Sriramadasu |
| [HADOOP-3033](https://issues.apache.org/jira/browse/HADOOP-3033) | Datanode fails write to DFS file with exception message "Trying to change block file offset" | Major | . | dhruba borthakur | dhruba borthakur |
| [HADOOP-3027](https://issues.apache.org/jira/browse/HADOOP-3027) | JobTracker shuts down during initialization if the NameNode is down | Blocker | . | Amareshwari Sriramadasu | Amareshwari Sriramadasu |
| [HADOOP-3011](https://issues.apache.org/jira/browse/HADOOP-3011) | Distcp deleting target directory | Blocker | util | Koji Noguchi | Chris Douglas |
| [HADOOP-3007](https://issues.apache.org/jira/browse/HADOOP-3007) | DataNode pipelining : failure on mirror results in failure on upstream datanode | Blocker | . | Raghu Angadi | Raghu Angadi |
| [HADOOP-3003](https://issues.apache.org/jira/browse/HADOOP-3003) | FileSystem cache key should be updated after a FileSystem object is created | Major | fs | Tsz Wo Nicholas Sze | Tsz Wo Nicholas Sze |
| [HADOOP-2985](https://issues.apache.org/jira/browse/HADOOP-2985) | LocalJobRunner gets NullPointerException if there is no output directory | Critical | . | Owen O'Malley | Amareshwari Sriramadasu |
| [HADOOP-2978](https://issues.apache.org/jira/browse/HADOOP-2978) | JobHistory log format for COUNTER is ambiguous | Major | . | Runping Qi | Runping Qi |
| [HADOOP-2944](https://issues.apache.org/jira/browse/HADOOP-2944) | redesigned plugin has missing functionality | Major | contrib/eclipse-plugin | Chris Dyer | Christophe Taton |
| [HADOOP-2833](https://issues.apache.org/jira/browse/HADOOP-2833) | JobClient.submitJob(...) should not use "Dr Who" as a default username | Major | . | Tsz Wo Nicholas Sze | Tsz Wo Nicholas Sze |
### TESTS:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
### SUB-TASKS:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
### OTHER:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
| [HADOOP-3111](https://issues.apache.org/jira/browse/HADOOP-3111) | Remove HBase from Hadoop contrib | Major | . | Jim Kellerman | Jim Kellerman |

@ -0,0 +1,31 @@
<!---
# Licensed to the Apache Software Foundation (ASF) under one
# or more contributor license agreements. See the NOTICE file
# distributed with this work for additional information
# regarding copyright ownership. The ASF licenses this file
# to you under the Apache License, Version 2.0 (the
# "License"); you may not use this file except in compliance
# with the License. You may obtain a copy of the License at
#
# http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing, software
# distributed under the License is distributed on an "AS IS" BASIS,
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# limitations under the License.
-->
# Apache Hadoop 0.16.2 Release Notes
These release notes cover new developer and user-facing incompatibilities, features, and major improvements.
---
* [HADOOP-3098](https://issues.apache.org/jira/browse/HADOOP-3098) | *Blocker* | **dfs -chown does not like "\_" underscore in user name**
The chown and chgrp commands accept more flexible user and group names compared to 0.16, including names that contain an underscore. See 'fs -help chown'. Inform Users : YES.

@ -0,0 +1,71 @@
<!---
# Licensed to the Apache Software Foundation (ASF) under one
# or more contributor license agreements. See the NOTICE file
# distributed with this work for additional information
# regarding copyright ownership. The ASF licenses this file
# to you under the Apache License, Version 2.0 (the
# "License"); you may not use this file except in compliance
# with the License. You may obtain a copy of the License at
#
# http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing, software
# distributed under the License is distributed on an "AS IS" BASIS,
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# limitations under the License.
-->
# Apache Hadoop Changelog
## Release 0.16.3 - 2008-04-16
### INCOMPATIBLE CHANGES:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
### NEW FEATURES:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
### IMPROVEMENTS:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
### BUG FIXES:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
| [HADOOP-3195](https://issues.apache.org/jira/browse/HADOOP-3195) | TestFileSystem fails randomly | Minor | test | Tsz Wo Nicholas Sze | Tsz Wo Nicholas Sze |
| [HADOOP-3182](https://issues.apache.org/jira/browse/HADOOP-3182) | JobClient creates submitJobDir with SYSTEM\_DIR\_PERMISSION ( rwx-wx-wx) | Blocker | . | Lohit Vijayarenu | Tsz Wo Nicholas Sze |
| [HADOOP-3159](https://issues.apache.org/jira/browse/HADOOP-3159) | FileSystem cache keep overwriting cached value | Blocker | . | Tsz Wo Nicholas Sze | Tsz Wo Nicholas Sze |
| [HADOOP-3154](https://issues.apache.org/jira/browse/HADOOP-3154) | Job successful but dropping records (when disk full) | Blocker | . | Koji Noguchi | Devaraj Das |
| [HADOOP-3139](https://issues.apache.org/jira/browse/HADOOP-3139) | DistributedFileSystem.close() deadlock and FileSystem.closeAll() warning | Blocker | . | Tsz Wo Nicholas Sze | Tsz Wo Nicholas Sze |
| [HADOOP-3069](https://issues.apache.org/jira/browse/HADOOP-3069) | A failure on SecondaryNameNode truncates the primary NameNode image. | Blocker | . | Konstantin Shvachko | Konstantin Shvachko |
| [HADOOP-3010](https://issues.apache.org/jira/browse/HADOOP-3010) | ConcurrentModificationException from org.apache.hadoop.ipc.Server$Responder in JobTracker | Major | ipc | Amar Kamat | Raghu Angadi |
### TESTS:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
### SUB-TASKS:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
### OTHER:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |

@ -0,0 +1,31 @@
<!---
# Licensed to the Apache Software Foundation (ASF) under one
# or more contributor license agreements. See the NOTICE file
# distributed with this work for additional information
# regarding copyright ownership. The ASF licenses this file
# to you under the Apache License, Version 2.0 (the
# "License"); you may not use this file except in compliance
# with the License. You may obtain a copy of the License at
#
# http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing, software
# distributed under the License is distributed on an "AS IS" BASIS,
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# limitations under the License.
-->
# Apache Hadoop 0.16.3 Release Notes
These release notes cover new developer and user-facing incompatibilities, features, and major improvements.
---
* [HADOOP-3182](https://issues.apache.org/jira/browse/HADOOP-3182) | *Blocker* | **JobClient creates submitJobDir with SYSTEM\_DIR\_PERMISSION ( rwx-wx-wx)**
Changed the permissions of \<job-dir\> from 733 to 777, so that a shared JobTracker can be started by a non-superuser account.
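
As a rough illustration of what the two octal modes mean, the sketch below decodes them into the familiar `rwx` notation. The helper name `mode_to_string` is hypothetical and is not part of Hadoop; it only shows that 733 corresponds to the `rwx-wx-wx` string quoted in the issue title, while 777 grants read, write, and execute to owner, group, and others.

```python
def mode_to_string(mode: int) -> str:
    """Render the low nine permission bits of a mode as rwx text.

    Hypothetical helper for illustration only; not a Hadoop API.
    """
    out = []
    for shift in (6, 3, 0):  # owner, group, other
        triplet = (mode >> shift) & 0o7
        for bit, flag in zip((4, 2, 1), "rwx"):
            out.append(flag if triplet & bit else "-")
    return "".join(out)


OLD_MODE = 0o733  # previous <job-dir> permissions named in the note
NEW_MODE = 0o777  # permissions after this change

print(mode_to_string(OLD_MODE))  # rwx-wx-wx
print(mode_to_string(NEW_MODE))  # rwxrwxrwx
```

With mode 733, group and other users can create entries in the directory but cannot list it; mode 777 adds the missing read bit for both.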

@ -0,0 +1,68 @@
<!---
# Licensed to the Apache Software Foundation (ASF) under one
# or more contributor license agreements. See the NOTICE file
# distributed with this work for additional information
# regarding copyright ownership. The ASF licenses this file
# to you under the Apache License, Version 2.0 (the
# "License"); you may not use this file except in compliance
# with the License. You may obtain a copy of the License at
#
# http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing, software
# distributed under the License is distributed on an "AS IS" BASIS,
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# limitations under the License.
-->
# Apache Hadoop Changelog
## Release 0.16.4 - 2008-05-05
### INCOMPATIBLE CHANGES:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
### NEW FEATURES:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
### IMPROVEMENTS:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
### BUG FIXES:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
| [HADOOP-3304](https://issues.apache.org/jira/browse/HADOOP-3304) | [HOD] logcondense fails if DFS has files that are not log files, but match a certain pattern | Blocker | contrib/hod | Hemanth Yamijala | Hemanth Yamijala |
| [HADOOP-3294](https://issues.apache.org/jira/browse/HADOOP-3294) | distcp leaves empty blocks after successful execution | Blocker | util | Christian Kunz | Tsz Wo Nicholas Sze |
| [HADOOP-3186](https://issues.apache.org/jira/browse/HADOOP-3186) | Incorrect permission checking on mv | Blocker | . | Koji Noguchi | Tsz Wo Nicholas Sze |
| [HADOOP-3138](https://issues.apache.org/jira/browse/HADOOP-3138) | distcp fail copying to /user/\<username\>/\<newtarget\> (with permission on) | Blocker | . | Koji Noguchi | Raghu Angadi |
### TESTS:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
### SUB-TASKS:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
### OTHER:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |

@ -0,0 +1,31 @@
<!---
# Licensed to the Apache Software Foundation (ASF) under one
# or more contributor license agreements. See the NOTICE file
# distributed with this work for additional information
# regarding copyright ownership. The ASF licenses this file
# to you under the Apache License, Version 2.0 (the
# "License"); you may not use this file except in compliance
# with the License. You may obtain a copy of the License at
#
# http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing, software
# distributed under the License is distributed on an "AS IS" BASIS,
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# limitations under the License.
-->
# Apache Hadoop 0.16.4 Release Notes
These release notes cover new developer and user-facing incompatibilities, features, and major improvements.
---
* [HADOOP-3304](https://issues.apache.org/jira/browse/HADOOP-3304) | *Blocker* | **[HOD] logcondense fails if DFS has files that are not log files, but match a certain pattern**
Fixed the regular expression used by the logcondense utility to generate the list of log files to purge from HDFS. Also, improved error handling, so a single invalid log file does not fail the purge process.

@ -0,0 +1,265 @@
<!---
# Licensed to the Apache Software Foundation (ASF) under one
# or more contributor license agreements. See the NOTICE file
# distributed with this work for additional information
# regarding copyright ownership. The ASF licenses this file
# to you under the Apache License, Version 2.0 (the
# "License"); you may not use this file except in compliance
# with the License. You may obtain a copy of the License at
#
# http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing, software
# distributed under the License is distributed on an "AS IS" BASIS,
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# limitations under the License.
-->
# Apache Hadoop Changelog
## Release 0.17.0 - 2008-05-20
### INCOMPATIBLE CHANGES:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
| [HADOOP-3280](https://issues.apache.org/jira/browse/HADOOP-3280) | virtual address space limits break streaming apps | Blocker | . | Rick Cox | Arun C Murthy |
| [HADOOP-3266](https://issues.apache.org/jira/browse/HADOOP-3266) | Remove HOD changes from CHANGES.txt, as they are now inside src/contrib/hod | Major | contrib/hod | Hemanth Yamijala | Hemanth Yamijala |
| [HADOOP-3239](https://issues.apache.org/jira/browse/HADOOP-3239) | exists() calls logs FileNotFoundException in namenode log | Major | . | Lohit Vijayarenu | Lohit Vijayarenu |
| [HADOOP-3137](https://issues.apache.org/jira/browse/HADOOP-3137) | [HOD] Update hod version number | Major | contrib/hod | Hemanth Yamijala | Hemanth Yamijala |
| [HADOOP-3091](https://issues.apache.org/jira/browse/HADOOP-3091) | hadoop dfs -put should support multiple src | Major | . | Lohit Vijayarenu | Lohit Vijayarenu |
| [HADOOP-3060](https://issues.apache.org/jira/browse/HADOOP-3060) | MiniMRCluster is ignoring parameter taskTrackerFirst | Major | . | Amareshwari Sriramadasu | Amareshwari Sriramadasu |
| [HADOOP-2873](https://issues.apache.org/jira/browse/HADOOP-2873) | Namenode fails to re-start after cluster shutdown - DFSClient: Could not obtain blocks even all datanodes were up & live | Major | . | André Martin | dhruba borthakur |
| [HADOOP-2854](https://issues.apache.org/jira/browse/HADOOP-2854) | Remove the deprecated ipc.Server.getUserInfo() | Blocker | . | Tsz Wo Nicholas Sze | Lohit Vijayarenu |
| [HADOOP-2839](https://issues.apache.org/jira/browse/HADOOP-2839) | Remove deprecated methods in FileSystem | Blocker | fs | Hairong Kuang | Lohit Vijayarenu |
| [HADOOP-2831](https://issues.apache.org/jira/browse/HADOOP-2831) | Remove the deprecated INode.getAbsoluteName() | Blocker | . | Tsz Wo Nicholas Sze | Lohit Vijayarenu |
| [HADOOP-2828](https://issues.apache.org/jira/browse/HADOOP-2828) | Remove deprecated methods in Configuration.java | Major | conf | Amareshwari Sriramadasu | Amareshwari Sriramadasu |
| [HADOOP-2826](https://issues.apache.org/jira/browse/HADOOP-2826) | FileSplit.getFile(), LineRecordReader.readLine() need to be removed | Major | . | Amareshwari Sriramadasu | Amareshwari Sriramadasu |
| [HADOOP-2825](https://issues.apache.org/jira/browse/HADOOP-2825) | MapOutputLocation.getFile() needs to be removed | Major | . | Amareshwari Sriramadasu | Amareshwari Sriramadasu |
| [HADOOP-2824](https://issues.apache.org/jira/browse/HADOOP-2824) | One of MiniMRCluster constructors needs to be removed | Major | . | Amareshwari Sriramadasu | Amareshwari Sriramadasu |
| [HADOOP-2823](https://issues.apache.org/jira/browse/HADOOP-2823) | SimpleCharStream.getColumn(), getLine() methods to be removed. | Major | record | Amareshwari Sriramadasu | Amareshwari Sriramadasu |
| [HADOOP-2822](https://issues.apache.org/jira/browse/HADOOP-2822) | Remove deprecated classes in mapred | Major | . | Amareshwari Sriramadasu | Amareshwari Sriramadasu |
| [HADOOP-2821](https://issues.apache.org/jira/browse/HADOOP-2821) | Remove deprecated classes in util | Major | util | Amareshwari Sriramadasu | Amareshwari Sriramadasu |
| [HADOOP-2820](https://issues.apache.org/jira/browse/HADOOP-2820) | Remove deprecated classes in streaming | Major | . | Amareshwari Sriramadasu | Amareshwari Sriramadasu |
| [HADOOP-2819](https://issues.apache.org/jira/browse/HADOOP-2819) | Remove deprecated methods in JobConf() | Major | . | Amareshwari Sriramadasu | Amareshwari Sriramadasu |
| [HADOOP-2818](https://issues.apache.org/jira/browse/HADOOP-2818) | Remove deprecated Counters.getDisplayName(), getCounterNames(), getCounter(String counterName) | Major | . | Amareshwari Sriramadasu | Amareshwari Sriramadasu |
| [HADOOP-2765](https://issues.apache.org/jira/browse/HADOOP-2765) | setting memory limits for tasks | Major | . | Joydeep Sen Sarma | Amareshwari Sriramadasu |
| [HADOOP-2634](https://issues.apache.org/jira/browse/HADOOP-2634) | Deprecate exists() and isDir() to simplify ClientProtocol. | Blocker | . | Konstantin Shvachko | Lohit Vijayarenu |
| [HADOOP-2563](https://issues.apache.org/jira/browse/HADOOP-2563) | Remove deprecated FileSystem#listPaths() | Blocker | fs | Doug Cutting | Lohit Vijayarenu |
| [HADOOP-2470](https://issues.apache.org/jira/browse/HADOOP-2470) | Open and isDir should be removed from ClientProtocol | Major | . | Hairong Kuang | Tsz Wo Nicholas Sze |
| [HADOOP-2410](https://issues.apache.org/jira/browse/HADOOP-2410) | Make EC2 cluster nodes more independent of each other | Major | contrib/cloud | Tom White | Chris K Wensel |
| [HADOOP-2399](https://issues.apache.org/jira/browse/HADOOP-2399) | Input key and value to combiner and reducer should be reused | Major | . | Owen O'Malley | Owen O'Malley |
| [HADOOP-2345](https://issues.apache.org/jira/browse/HADOOP-2345) | new transactions to support HDFS Appends | Major | . | dhruba borthakur | dhruba borthakur |
| [HADOOP-2219](https://issues.apache.org/jira/browse/HADOOP-2219) | du like command to count number of files under a given directory | Major | . | Koji Noguchi | Tsz Wo Nicholas Sze |
| [HADOOP-2192](https://issues.apache.org/jira/browse/HADOOP-2192) | dfs mv command differs from POSIX standards | Major | . | Mukund Madhugiri | Mahadev konar |
| [HADOOP-2178](https://issues.apache.org/jira/browse/HADOOP-2178) | Job history on HDFS | Major | . | Amareshwari Sriramadasu | Amareshwari Sriramadasu |
| [HADOOP-2116](https://issues.apache.org/jira/browse/HADOOP-2116) | Job.local.dir to be exposed to tasks | Major | . | Milind Bhandarkar | Amareshwari Sriramadasu |
| [HADOOP-2027](https://issues.apache.org/jira/browse/HADOOP-2027) | FileSystem should provide byte ranges for file locations | Major | fs | Owen O'Malley | Lohit Vijayarenu |
| [HADOOP-1986](https://issues.apache.org/jira/browse/HADOOP-1986) | Add support for a general serialization mechanism for Map Reduce | Major | . | Tom White | Tom White |
| [HADOOP-1985](https://issues.apache.org/jira/browse/HADOOP-1985) | Abstract node to switch mapping into a topology service class used by namenode and jobtracker | Major | . | eric baldeschwieler | Devaraj Das |
| [HADOOP-771](https://issues.apache.org/jira/browse/HADOOP-771) | Namenode should return error when trying to delete non-empty directory | Major | . | Milind Bhandarkar | Mahadev konar |
### NEW FEATURES:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
| [HADOOP-3152](https://issues.apache.org/jira/browse/HADOOP-3152) | Make index interval configurable when using MapFileOutputFormat for map-reduce job | Minor | io | Rong-En Fan | Doug Cutting |
| [HADOOP-3048](https://issues.apache.org/jira/browse/HADOOP-3048) | Stringifier | Blocker | io | Enis Soztutar | Enis Soztutar |
| [HADOOP-3001](https://issues.apache.org/jira/browse/HADOOP-3001) | FileSystems should track how many bytes are read and written | Blocker | fs | Owen O'Malley | Owen O'Malley |
| [HADOOP-2951](https://issues.apache.org/jira/browse/HADOOP-2951) | contrib package provides a utility to build or update an index. A contrib package to update an index using Map/Reduce | Major | . | Ning Li | Doug Cutting |
| [HADOOP-2906](https://issues.apache.org/jira/browse/HADOOP-2906) | output format classes that can write to different files depending on keys and/or config variable | Major | . | Runping Qi | Runping Qi |
| [HADOOP-2657](https://issues.apache.org/jira/browse/HADOOP-2657) | Enhancements to DFSClient to support flushing data at any point in time | Major | . | dhruba borthakur | dhruba borthakur |
| [HADOOP-2063](https://issues.apache.org/jira/browse/HADOOP-2063) | Command to pull corrupted files | Blocker | fs | Koji Noguchi | Tsz Wo Nicholas Sze |
| [HADOOP-2055](https://issues.apache.org/jira/browse/HADOOP-2055) | JobConf should have a setInputPathFilter method | Minor | . | Alejandro Abdelnur | Alejandro Abdelnur |
| [HADOOP-1593](https://issues.apache.org/jira/browse/HADOOP-1593) | FsShell should work with paths in non-default FileSystem | Major | fs | Doug Cutting | Mahadev konar |
### IMPROVEMENTS:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
| [HADOOP-3174](https://issues.apache.org/jira/browse/HADOOP-3174) | Improve documentation and supply an example for MultiFileInputFormat | Major | documentation | Enis Soztutar | Enis Soztutar |
| [HADOOP-3143](https://issues.apache.org/jira/browse/HADOOP-3143) | Decrease the number of slaves in TestMiniMRDFSSort to 3. | Major | test | Owen O'Malley | Nigel Daley |
| [HADOOP-3123](https://issues.apache.org/jira/browse/HADOOP-3123) | Build native libraries on Solaris | Major | build | Tom White | Tom White |
| [HADOOP-3099](https://issues.apache.org/jira/browse/HADOOP-3099) | Need new options in distcp for preserving owner, group and permission | Blocker | util | Tsz Wo Nicholas Sze | Tsz Wo Nicholas Sze |
| [HADOOP-3092](https://issues.apache.org/jira/browse/HADOOP-3092) | Show counter values from "job -status" command | Major | scripts | Tom White | Tom White |
| [HADOOP-3046](https://issues.apache.org/jira/browse/HADOOP-3046) | Text and BytesWritable's raw comparators should use the lengths provided instead of rebuilding them from scratch using readInt | Blocker | . | Owen O'Malley | Owen O'Malley |
| [HADOOP-2996](https://issues.apache.org/jira/browse/HADOOP-2996) | StreamUtils abuses StringBuffers | Trivial | . | Dave Brosius | Dave Brosius |
| [HADOOP-2994](https://issues.apache.org/jira/browse/HADOOP-2994) | DFSClient calls toString on strings. | Trivial | . | Dave Brosius | Dave Brosius |
| [HADOOP-2993](https://issues.apache.org/jira/browse/HADOOP-2993) | Specify which JAVA\_HOME should be set | Major | documentation | Jason Rennie | Arun C Murthy |
| [HADOOP-2947](https://issues.apache.org/jira/browse/HADOOP-2947) | [HOD] Hod should redirect stderr and stdout of Hadoop daemons to assist debugging | Blocker | contrib/hod | Hemanth Yamijala | Vinod Kumar Vavilapalli |
| [HADOOP-2939](https://issues.apache.org/jira/browse/HADOOP-2939) | Make the Hudson patch process an executable ant target | Minor | test | Nigel Daley | Nigel Daley |
| [HADOOP-2919](https://issues.apache.org/jira/browse/HADOOP-2919) | Create fewer copies of buffer data during sort/spill | Blocker | . | Chris Douglas | Chris Douglas |
| [HADOOP-2902](https://issues.apache.org/jira/browse/HADOOP-2902) | replace accesses of "fs.default.name" with FileSystem accessor methods | Major | fs | Doug Cutting | Doug Cutting |
| [HADOOP-2895](https://issues.apache.org/jira/browse/HADOOP-2895) | String for configuring profiling should be customizable | Major | . | Martin Traverso | Martin Traverso |
| [HADOOP-2888](https://issues.apache.org/jira/browse/HADOOP-2888) | Enhancements to gridmix scripts | Major | test | Mukund Madhugiri | Mukund Madhugiri |
| [HADOOP-2886](https://issues.apache.org/jira/browse/HADOOP-2886) | Track individual RPC metrics. | Major | metrics | girish vaitheeswaran | dhruba borthakur |
| [HADOOP-2841](https://issues.apache.org/jira/browse/HADOOP-2841) | Dfs methods should not throw RemoteException | Major | . | Hairong Kuang | Konstantin Shvachko |
| [HADOOP-2810](https://issues.apache.org/jira/browse/HADOOP-2810) | Need new Hadoop Core logo | Minor | documentation | Nigel Daley | Nigel Daley |
| [HADOOP-2804](https://issues.apache.org/jira/browse/HADOOP-2804) | Formatable changes log as html | Minor | documentation | Nigel Daley | Nigel Daley |
| [HADOOP-2796](https://issues.apache.org/jira/browse/HADOOP-2796) | For script option hod should exit with distinguishable exit codes for script code and hod exit code. | Major | contrib/hod | Karam Singh | Hemanth Yamijala |
| [HADOOP-2758](https://issues.apache.org/jira/browse/HADOOP-2758) | Reduce memory copies when data is read from DFS | Major | . | Raghu Angadi | Raghu Angadi |
| [HADOOP-2690](https://issues.apache.org/jira/browse/HADOOP-2690) | Adding support into build.xml to build a special hadoop jar file that has the MiniDFSCluster and MiniMRCluster classes among others necessary for building and running the unit tests of Pig on the local mini cluster | Major | build | Xu Zhang | Enis Soztutar |
| [HADOOP-2559](https://issues.apache.org/jira/browse/HADOOP-2559) | DFS should place one replica per rack | Major | . | Runping Qi | Lohit Vijayarenu |
| [HADOOP-2555](https://issues.apache.org/jira/browse/HADOOP-2555) | Refactor the HTable#get and HTable#getRow methods to avoid repetition of retry-on-failure logic | Minor | . | Peter Dolan | Bryan Duxbury |
| [HADOOP-2551](https://issues.apache.org/jira/browse/HADOOP-2551) | hadoop-env.sh needs finer granularity | Blocker | scripts | Allen Wittenauer | Raghu Angadi |
| [HADOOP-2473](https://issues.apache.org/jira/browse/HADOOP-2473) | EC2 termination script should support termination by group | Major | contrib/cloud | Tom White | Chris K Wensel |
| [HADOOP-2423](https://issues.apache.org/jira/browse/HADOOP-2423) | The code in FSDirectory.mkdirs(...) is inefficient. | Major | . | Tsz Wo Nicholas Sze | Tsz Wo Nicholas Sze |
| [HADOOP-2239](https://issues.apache.org/jira/browse/HADOOP-2239) | Security: Need to be able to encrypt Hadoop socket connections | Major | . | Allen Wittenauer | Chris Douglas |
| [HADOOP-2148](https://issues.apache.org/jira/browse/HADOOP-2148) | Inefficient FSDataset.getBlockFile() | Major | . | Konstantin Shvachko | Konstantin Shvachko |
| [HADOOP-2057](https://issues.apache.org/jira/browse/HADOOP-2057) | streaming should optionally treat a non-zero exit status of a child process as a failed task | Major | . | Rick Cox | Rick Cox |
| [HADOOP-1677](https://issues.apache.org/jira/browse/HADOOP-1677) | improve semantics of the hadoop dfs command | Minor | . | Nigel Daley | Mahadev konar |
| [HADOOP-1622](https://issues.apache.org/jira/browse/HADOOP-1622) | Hadoop should provide a way to allow the user to specify jar file(s) the user job depends on | Major | . | Runping Qi | Mahadev konar |
| [HADOOP-1228](https://issues.apache.org/jira/browse/HADOOP-1228) | Eclipse project files | Minor | build | Albert Strasheim | Tom White |
| [HADOOP-910](https://issues.apache.org/jira/browse/HADOOP-910) | Reduces can do merges for the on-disk map output files in parallel with their copying | Major | . | Devaraj Das | Amar Kamat |
| [HADOOP-730](https://issues.apache.org/jira/browse/HADOOP-730) | Local file system uses copy to implement rename | Major | fs | Owen O'Malley | Chris Douglas |
### BUG FIXES:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
| [HADOOP-3701](https://issues.apache.org/jira/browse/HADOOP-3701) | Too many trash sockets and trash pipes opened | Major | . | He Yongqiang | |
| [HADOOP-3382](https://issues.apache.org/jira/browse/HADOOP-3382) | Memory leak when files are not cleanly closed | Blocker | . | Raghu Angadi | Raghu Angadi |
| [HADOOP-3372](https://issues.apache.org/jira/browse/HADOOP-3372) | TestUlimit fails on LINUX | Blocker | . | Lohit Vijayarenu | Arun C Murthy |
| [HADOOP-3322](https://issues.apache.org/jira/browse/HADOOP-3322) | Hadoop rpc metrics do not get pushed to the MetricsRecord | Blocker | metrics | girish vaitheeswaran | girish vaitheeswaran |
| [HADOOP-3286](https://issues.apache.org/jira/browse/HADOOP-3286) | Gridmix jobs' output dir names may collide | Major | test | Runping Qi | Runping Qi |
| [HADOOP-3285](https://issues.apache.org/jira/browse/HADOOP-3285) | map tasks with node local splits do not always read from local nodes | Blocker | . | Runping Qi | Owen O'Malley |
| [HADOOP-3279](https://issues.apache.org/jira/browse/HADOOP-3279) | TaskTracker should check for SUCCEEDED task status in addition to COMMIT\_PENDING status when it fails maps due to lost map outputs | Blocker | . | Devaraj Das | Devaraj Das |
| [HADOOP-3263](https://issues.apache.org/jira/browse/HADOOP-3263) | job history browser throws exception if job name or user name is null. | Blocker | . | Amareshwari Sriramadasu | Arun C Murthy |
| [HADOOP-3256](https://issues.apache.org/jira/browse/HADOOP-3256) | JobHistory file on HDFS should not use the 'job name' | Blocker | . | Arun C Murthy | Arun C Murthy |
| [HADOOP-3251](https://issues.apache.org/jira/browse/HADOOP-3251) | WARN message on command line when a hadoop jar command is executed | Blocker | . | Mukund Madhugiri | Arun C Murthy |
| [HADOOP-3247](https://issues.apache.org/jira/browse/HADOOP-3247) | gridmix scripts have a few bugs | Major | test | Runping Qi | Runping Qi |
| [HADOOP-3242](https://issues.apache.org/jira/browse/HADOOP-3242) | SequenceFileAsBinaryRecordReader seems always to read from the start of a file, not the start of the split. | Major | . | Runping Qi | Chris Douglas |
| [HADOOP-3237](https://issues.apache.org/jira/browse/HADOOP-3237) | Unit test failed on windows: TestDFSShell.testErrOutPut | Blocker | . | Mukund Madhugiri | Mahadev konar |
| [HADOOP-3229](https://issues.apache.org/jira/browse/HADOOP-3229) | Map OutputCollector does not report progress on writes | Major | . | Alejandro Abdelnur | Doug Cutting |
| [HADOOP-3225](https://issues.apache.org/jira/browse/HADOOP-3225) | FsShell showing null instead of a error message | Blocker | . | Tsz Wo Nicholas Sze | Mahadev konar |
| [HADOOP-3224](https://issues.apache.org/jira/browse/HADOOP-3224) | hadoop dfs -du /dirPath does not work with hadoop-0.17 branch | Blocker | . | Runping Qi | Lohit Vijayarenu |
| [HADOOP-3223](https://issues.apache.org/jira/browse/HADOOP-3223) | Hadoop dfs -help for permissions contains a typo | Blocker | . | Milind Bhandarkar | Raghu Angadi |
| [HADOOP-3220](https://issues.apache.org/jira/browse/HADOOP-3220) | Safemode log message need to be corrected. | Major | . | Konstantin Shvachko | Konstantin Shvachko |
| [HADOOP-3208](https://issues.apache.org/jira/browse/HADOOP-3208) | WritableDeserializer does not pass the Configuration to deserialized Writables | Blocker | . | Enis Soztutar | Enis Soztutar |
| [HADOOP-3204](https://issues.apache.org/jira/browse/HADOOP-3204) | LocalFSMerger needs to catch throwable | Blocker | . | Koji Noguchi | Amar Kamat |
| [HADOOP-3183](https://issues.apache.org/jira/browse/HADOOP-3183) | Unit test fails on Windows: TestJobShell.testJobShell | Blocker | . | Mukund Madhugiri | Mahadev konar |
| [HADOOP-3178](https://issues.apache.org/jira/browse/HADOOP-3178) | gridmix scripts for small and medium jobs need to be changed to handle input paths differently | Blocker | test | Mukund Madhugiri | Mukund Madhugiri |
| [HADOOP-3175](https://issues.apache.org/jira/browse/HADOOP-3175) | "-get file -" does not work | Blocker | fs | Raghu Angadi | Edward J. Yoon |
| [HADOOP-3168](https://issues.apache.org/jira/browse/HADOOP-3168) | reduce amount of logging in hadoop streaming | Major | . | Joydeep Sen Sarma | Zheng Shao |
| [HADOOP-3166](https://issues.apache.org/jira/browse/HADOOP-3166) | SpillThread throws ArrayIndexOutOfBoundsException, which is ignored by MapTask | Blocker | . | Chris Douglas | Chris Douglas |
| [HADOOP-3165](https://issues.apache.org/jira/browse/HADOOP-3165) | FsShell no longer accepts stdin as a source for -put/-copyFromLocal | Blocker | . | Chris Douglas | Lohit Vijayarenu |
| [HADOOP-3162](https://issues.apache.org/jira/browse/HADOOP-3162) | Map/reduce stops working with comma separated input paths | Blocker | . | Runping Qi | Amareshwari Sriramadasu |
| [HADOOP-3161](https://issues.apache.org/jira/browse/HADOOP-3161) | TestFileAppend fails on Mac since HADOOP-2655 was committed | Minor | test | Nigel Daley | Nigel Daley |
| [HADOOP-3157](https://issues.apache.org/jira/browse/HADOOP-3157) | TestMiniMRLocalFS fails in trunk on Windows | Blocker | test | Lohit Vijayarenu | Doug Cutting |
| [HADOOP-3153](https://issues.apache.org/jira/browse/HADOOP-3153) | [HOD] Hod should deallocate cluster if there's a problem in writing information to the state file | Major | contrib/hod | Hemanth Yamijala | Vinod Kumar Vavilapalli |
| [HADOOP-3146](https://issues.apache.org/jira/browse/HADOOP-3146) | DFSOutputStream.flush should be renamed as DFSOutputStream.fsync | Blocker | . | Runping Qi | dhruba borthakur |
| [HADOOP-3140](https://issues.apache.org/jira/browse/HADOOP-3140) | JobTracker should not try to promote a (map) task if it does not write to DFS at all | Major | . | Runping Qi | Amar Kamat |
| [HADOOP-3124](https://issues.apache.org/jira/browse/HADOOP-3124) | DFS data node should not use hard coded 10 minutes as write timeout. | Major | . | Runping Qi | Raghu Angadi |
| [HADOOP-3118](https://issues.apache.org/jira/browse/HADOOP-3118) | Namenode NPE while loading fsimage after a cluster upgrade from older disk format | Blocker | . | dhruba borthakur | dhruba borthakur |
| [HADOOP-3114](https://issues.apache.org/jira/browse/HADOOP-3114) | TestDFSShell fails on Windows. | Major | fs | Konstantin Shvachko | Lohit Vijayarenu |
| [HADOOP-3106](https://issues.apache.org/jira/browse/HADOOP-3106) | Update documentation in mapred\_tutorial to add Debugging | Major | documentation | Amareshwari Sriramadasu | Amareshwari Sriramadasu |
| [HADOOP-3094](https://issues.apache.org/jira/browse/HADOOP-3094) | BytesWritable.toString prints bytes above 0x80 as FFFFFF80 | Major | io | Owen O'Malley | Owen O'Malley |
| [HADOOP-3093](https://issues.apache.org/jira/browse/HADOOP-3093) | map/reduce throws the following exception if "io.serializations" is not set: | Major | . | Runping Qi | Amareshwari Sriramadasu |
| [HADOOP-3089](https://issues.apache.org/jira/browse/HADOOP-3089) | streaming should accept stderr from task before first key arrives | Major | . | Rick Cox | Rick Cox |
| [HADOOP-3087](https://issues.apache.org/jira/browse/HADOOP-3087) | JobInfo session object is not refreshed in loadHistory.jsp if same job is accessed again. | Major | . | Amareshwari Sriramadasu | Amareshwari Sriramadasu |
| [HADOOP-3086](https://issues.apache.org/jira/browse/HADOOP-3086) | Test case was missed in commit of HADOOP-3040 | Major | . | Amareshwari Sriramadasu | Amareshwari Sriramadasu |
| [HADOOP-3083](https://issues.apache.org/jira/browse/HADOOP-3083) | Remove lease when file is renamed | Blocker | . | dhruba borthakur | dhruba borthakur |
| [HADOOP-3080](https://issues.apache.org/jira/browse/HADOOP-3080) | Remove flush calls from JobHistory | Blocker | . | Devaraj Das | Amareshwari Sriramadasu |
| [HADOOP-3073](https://issues.apache.org/jira/browse/HADOOP-3073) | SocketOutputStream.close() should close the channel. | Blocker | ipc | Raghu Angadi | Raghu Angadi |
| [HADOOP-3067](https://issues.apache.org/jira/browse/HADOOP-3067) | DFSInputStream 'pread' does not close its sockets | Blocker | . | Raghu Angadi | Raghu Angadi |
| [HADOOP-3066](https://issues.apache.org/jira/browse/HADOOP-3066) | Should not require superuser privilege to query if hdfs is in safe mode | Major | . | Jim Kellerman | Jim Kellerman |
| [HADOOP-3065](https://issues.apache.org/jira/browse/HADOOP-3065) | Namenode does not process block report if the rack-location script is not provided on namenode | Blocker | . | dhruba borthakur | Devaraj Das |
| [HADOOP-3064](https://issues.apache.org/jira/browse/HADOOP-3064) | Exception with file globbing closures | Major | . | Tom White | Hairong Kuang |
| [HADOOP-3050](https://issues.apache.org/jira/browse/HADOOP-3050) | Cluster falls into infinite loop trying to replicate a block to a target that already has this replica. | Blocker | . | Konstantin Shvachko | Hairong Kuang |
| [HADOOP-3044](https://issues.apache.org/jira/browse/HADOOP-3044) | NNBench does not use the right configuration for the mapper | Major | test | Hairong Kuang | Hairong Kuang |
| [HADOOP-3041](https://issues.apache.org/jira/browse/HADOOP-3041) | Within a task, the value of JobConf.getOutputPath() method is modified | Blocker | . | Alejandro Abdelnur | Amareshwari Sriramadasu |
| [HADOOP-3040](https://issues.apache.org/jira/browse/HADOOP-3040) | Streaming should assume an empty key if the first character on a line is the separator (stream.map.output.field.separator, by default, tab) | Major | . | Amareshwari Sriramadasu | Amareshwari Sriramadasu |
| [HADOOP-3036](https://issues.apache.org/jira/browse/HADOOP-3036) | Fix findBugs warnings in UpgradeUtilities. | Major | test | Konstantin Shvachko | Konstantin Shvachko |
| [HADOOP-3031](https://issues.apache.org/jira/browse/HADOOP-3031) | Remove compiler warnings for ant test | Minor | . | Amareshwari Sriramadasu | Chris Douglas |
| [HADOOP-3030](https://issues.apache.org/jira/browse/HADOOP-3030) | InMemoryFileSystem.reserveSpaceWithChecksum does not look at failures while reserving space for the file in question | Major | fs | Devaraj Das | Devaraj Das |
| [HADOOP-3029](https://issues.apache.org/jira/browse/HADOOP-3029) | Misleading log message "firstbadlink" printed by datanodes | Major | . | dhruba borthakur | dhruba borthakur |
| [HADOOP-3025](https://issues.apache.org/jira/browse/HADOOP-3025) | ChecksumFileSystem needs to support the new delete method | Blocker | fs | Devaraj Das | Mahadev konar |
| [HADOOP-3018](https://issues.apache.org/jira/browse/HADOOP-3018) | Eclipse plugin fails to compile due to missing RPC.stopClient() method | Blocker | contrib/eclipse-plugin | Tom White | Christophe Taton |
| [HADOOP-3012](https://issues.apache.org/jira/browse/HADOOP-3012) | dfs -mv file to user home directory fails silently if the user home directory does not exist | Blocker | fs | Mukund Madhugiri | Mahadev konar |
| [HADOOP-3009](https://issues.apache.org/jira/browse/HADOOP-3009) | TestFileCreation fails while restarting cluster | Major | . | dhruba borthakur | dhruba borthakur |
| [HADOOP-3008](https://issues.apache.org/jira/browse/HADOOP-3008) | SocketIOWithTimeout does not handle thread interruption | Major | . | Raghu Angadi | Raghu Angadi |
| [HADOOP-3006](https://issues.apache.org/jira/browse/HADOOP-3006) | DataNode sends wrong length in header while pipelining. | Major | . | Raghu Angadi | Raghu Angadi |
| [HADOOP-2995](https://issues.apache.org/jira/browse/HADOOP-2995) | StreamBaseRecordReader's getProgress returns just 0 or 1 | Minor | . | Dave Brosius | Dave Brosius |
| [HADOOP-2992](https://issues.apache.org/jira/browse/HADOOP-2992) | Sequential distributed upgrades. | Major | test | Konstantin Shvachko | Konstantin Shvachko |
| [HADOOP-2983](https://issues.apache.org/jira/browse/HADOOP-2983) | [HOD] local\_fqdn() returns None when gethostbyname\_ex doesn't return any FQDNs. | Blocker | contrib/hod | Craig Macdonald | Hemanth Yamijala |
| [HADOOP-2982](https://issues.apache.org/jira/browse/HADOOP-2982) | [HOD] checknodes should look for free nodes without the jobs attribute | Blocker | contrib/hod | Hemanth Yamijala | Hemanth Yamijala |
| [HADOOP-2976](https://issues.apache.org/jira/browse/HADOOP-2976) | Blocks staying underreplicated (for unclosed file) | Minor | . | Koji Noguchi | dhruba borthakur |
| [HADOOP-2974](https://issues.apache.org/jira/browse/HADOOP-2974) | ipc unit tests fail due to connection errors | Blocker | ipc | Mukund Madhugiri | Raghu Angadi |
| [HADOOP-2973](https://issues.apache.org/jira/browse/HADOOP-2973) | Unit test fails on Windows: org.apache.hadoop.dfs.TestLocalDFS.testWorkingDirectory | Blocker | . | Mukund Madhugiri | Tsz Wo Nicholas Sze |
| [HADOOP-2972](https://issues.apache.org/jira/browse/HADOOP-2972) | org.apache.hadoop.dfs.TestDFSShell.testErrOutPut fails on Windows with NullPointerException | Blocker | . | Mukund Madhugiri | Mahadev konar |
| [HADOOP-2971](https://issues.apache.org/jira/browse/HADOOP-2971) | SocketTimeoutException in unit tests | Major | io | Raghu Angadi | Raghu Angadi |
| [HADOOP-2970](https://issues.apache.org/jira/browse/HADOOP-2970) | Wrong class definition for hodlib/Hod/hod.py for Python \< 2.5.1 | Major | contrib/hod | Luca Telloli | Vinod Kumar Vavilapalli |
| [HADOOP-2955](https://issues.apache.org/jira/browse/HADOOP-2955) | ant test fail for TestCrcCorruption with OutofMemory. | Blocker | . | Mahadev konar | Raghu Angadi |
| [HADOOP-2943](https://issues.apache.org/jira/browse/HADOOP-2943) | Compression for intermediate map output is broken | Major | . | Chris Douglas | Chris Douglas |
| [HADOOP-2938](https://issues.apache.org/jira/browse/HADOOP-2938) | some of the fs commands don't globPaths. | Major | fs | Raghu Angadi | Tsz Wo Nicholas Sze |
| [HADOOP-2936](https://issues.apache.org/jira/browse/HADOOP-2936) | HOD should generate hdfs://host:port on the client side configs. | Major | contrib/hod | Mahadev konar | Vinod Kumar Vavilapalli |
| [HADOOP-2934](https://issues.apache.org/jira/browse/HADOOP-2934) | NPE while loading FSImage | Major | . | Raghu Angadi | dhruba borthakur |
| [HADOOP-2932](https://issues.apache.org/jira/browse/HADOOP-2932) | Trash initialization generates "deprecated filesystem name" warning even if the name is correct. | Blocker | conf, fs | Konstantin Shvachko | Mahadev konar |
| [HADOOP-2927](https://issues.apache.org/jira/browse/HADOOP-2927) | Unit test fails on Windows: org.apache.hadoop.fs.TestDU.testDU | Blocker | fs | Mukund Madhugiri | Konstantin Shvachko |
| [HADOOP-2924](https://issues.apache.org/jira/browse/HADOOP-2924) | HOD is trying to bring up task tracker on port which is already in close\_wait state | Critical | contrib/hod | Aroop Maliakkal | Vinod Kumar Vavilapalli |
| [HADOOP-2912](https://issues.apache.org/jira/browse/HADOOP-2912) | Unit test fails: org.apache.hadoop.dfs.TestFsck.testFsck. This is a regression | Blocker | . | Mukund Madhugiri | Mahadev konar |
| [HADOOP-2908](https://issues.apache.org/jira/browse/HADOOP-2908) | forrest docs for dfs shell commands and semantics. | Major | documentation | Mahadev konar | Mahadev konar |
| [HADOOP-2901](https://issues.apache.org/jira/browse/HADOOP-2901) | the job tracker should not start 2 info servers | Blocker | . | Owen O'Malley | Amareshwari Sriramadasu |
| [HADOOP-2899](https://issues.apache.org/jira/browse/HADOOP-2899) | [HOD] hdfs:///mapredsystem directory not cleaned up after deallocation | Major | contrib/hod | Luca Telloli | Hemanth Yamijala |
| [HADOOP-2891](https://issues.apache.org/jira/browse/HADOOP-2891) | The dfsclient on exit deletes files that are open and not closed. | Major | . | Mahadev konar | dhruba borthakur |
| [HADOOP-2890](https://issues.apache.org/jira/browse/HADOOP-2890) | HDFS should recover when replicas of block have different sizes (due to corrupted block) | Major | . | Lohit Vijayarenu | dhruba borthakur |
| [HADOOP-2871](https://issues.apache.org/jira/browse/HADOOP-2871) | Unit tests (16) fail on Windows due to java.lang.IllegalArgumentException causing MiniMRCluster to not start up | Blocker | . | Mukund Madhugiri | Amareshwari Sriramadasu |
| [HADOOP-2870](https://issues.apache.org/jira/browse/HADOOP-2870) | Datanode.shutdown() and Namenode.stop() should close all rpc connections | Major | ipc | Hairong Kuang | Hairong Kuang |
| [HADOOP-2863](https://issues.apache.org/jira/browse/HADOOP-2863) | FSDataOutputStream should not flush() inside close(). | Major | fs | Raghu Angadi | Raghu Angadi |
| [HADOOP-2855](https://issues.apache.org/jira/browse/HADOOP-2855) | [HOD] HOD fails to allocate a cluster if the tarball specified is a relative path | Blocker | contrib/hod | Hemanth Yamijala | Vinod Kumar Vavilapalli |
| [HADOOP-2848](https://issues.apache.org/jira/browse/HADOOP-2848) | [HOD] If a cluster directory is deleted, hod -o list must show it, and deallocate should work. | Major | contrib/hod | Hemanth Yamijala | Hemanth Yamijala |
| [HADOOP-2845](https://issues.apache.org/jira/browse/HADOOP-2845) | dfsadmin disk utilization report on Solaris is wrong | Major | fs | Martin Traverso | Martin Traverso |
| [HADOOP-2844](https://issues.apache.org/jira/browse/HADOOP-2844) | A SequenceFile.Reader object is not closed properly in CopyFiles | Major | util | Tsz Wo Nicholas Sze | Tsz Wo Nicholas Sze |
| [HADOOP-2832](https://issues.apache.org/jira/browse/HADOOP-2832) | bad code indentation in DFSClient | Major | . | dhruba borthakur | dhruba borthakur |
| [HADOOP-2817](https://issues.apache.org/jira/browse/HADOOP-2817) | Remove deprecated mapred.tasktracker.tasks.maximum and clusterStatus.getMaxTasks() | Major | . | Amareshwari Sriramadasu | Amareshwari Sriramadasu |
| [HADOOP-2806](https://issues.apache.org/jira/browse/HADOOP-2806) | Streaming has no way to force entire record (or null) as key | Minor | . | Marco Nicosia | Amareshwari Sriramadasu |
| [HADOOP-2800](https://issues.apache.org/jira/browse/HADOOP-2800) | SetFile.Writer deprecated by mistake? | Trivial | io | Johan Oskarsson | Johan Oskarsson |
| [HADOOP-2790](https://issues.apache.org/jira/browse/HADOOP-2790) | TaskInProgress.hasSpeculativeTask is very inefficient | Major | . | Owen O'Malley | Owen O'Malley |
| [HADOOP-2783](https://issues.apache.org/jira/browse/HADOOP-2783) | hod/hodlib/Common/xmlrpc.py uses HodInterruptException without importing it | Minor | contrib/hod | Vinod Kumar Vavilapalli | Vinod Kumar Vavilapalli |
| [HADOOP-2779](https://issues.apache.org/jira/browse/HADOOP-2779) | build scripts broken by moving hbase to subproject | Major | build | Owen O'Malley | Owen O'Malley |
| [HADOOP-2767](https://issues.apache.org/jira/browse/HADOOP-2767) | org.apache.hadoop.net.NetworkTopology.InnerNode#getLeaf does not return the last node on a rack when used with an excluded node | Minor | . | Mark Butler | Hairong Kuang |
| [HADOOP-2738](https://issues.apache.org/jira/browse/HADOOP-2738) | Text is not subclassable because set(Text) and compareTo(Object) access the other instance's private members directly | Minor | io | Jim Kellerman | Jim Kellerman |
| [HADOOP-2727](https://issues.apache.org/jira/browse/HADOOP-2727) | Web UI links to Hadoop homepage has to change to new hadoop homepage | Blocker | . | Amareshwari Sriramadasu | Amareshwari Sriramadasu |
| [HADOOP-2679](https://issues.apache.org/jira/browse/HADOOP-2679) | There is a small typeo in hdfs\_test.c when testing the success of the local hadoop initialization | Trivial | . | Jason | dhruba borthakur |
| [HADOOP-2655](https://issues.apache.org/jira/browse/HADOOP-2655) | Copy on write for data and metadata files in the presence of snapshots | Major | . | dhruba borthakur | dhruba borthakur |
| [HADOOP-2606](https://issues.apache.org/jira/browse/HADOOP-2606) | Namenode unstable when replicating 500k blocks at once | Major | . | Koji Noguchi | Konstantin Shvachko |
| [HADOOP-2373](https://issues.apache.org/jira/browse/HADOOP-2373) | Name node silently changes state | Major | . | Robert Chansler | Konstantin Shvachko |
| [HADOOP-2346](https://issues.apache.org/jira/browse/HADOOP-2346) | DataNode should have timeout on socket writes. | Major | . | Raghu Angadi | Raghu Angadi |
| [HADOOP-2195](https://issues.apache.org/jira/browse/HADOOP-2195) | dfs mkdir command differs from POSIX standards | Major | . | Mukund Madhugiri | Mahadev konar |
| [HADOOP-2194](https://issues.apache.org/jira/browse/HADOOP-2194) | dfs cat on a file that does not exist throws a java IOException | Major | . | Mukund Madhugiri | Mahadev konar |
| [HADOOP-2193](https://issues.apache.org/jira/browse/HADOOP-2193) | dfs rm and rmr commands differ from POSIX standards | Major | . | Mukund Madhugiri | Mahadev konar |
| [HADOOP-2191](https://issues.apache.org/jira/browse/HADOOP-2191) | dfs du and dus commands differ from POSIX standards | Major | . | Mukund Madhugiri | Mahadev konar |
| [HADOOP-2190](https://issues.apache.org/jira/browse/HADOOP-2190) | dfs ls and lsr commands differ from POSIX standards | Major | . | Mukund Madhugiri | Mahadev konar |
| [HADOOP-2119](https://issues.apache.org/jira/browse/HADOOP-2119) | JobTracker becomes non-responsive if the task trackers finish task too fast | Critical | . | Runping Qi | Amar Kamat |
| [HADOOP-1967](https://issues.apache.org/jira/browse/HADOOP-1967) | hadoop dfs -ls, -get, -mv command's source/destination URI are inconsistent | Major | . | Lohit Vijayarenu | Doug Cutting |
| [HADOOP-1911](https://issues.apache.org/jira/browse/HADOOP-1911) | infinite loop in dfs -cat command. | Blocker | . | Koji Noguchi | Chris Douglas |
| [HADOOP-1902](https://issues.apache.org/jira/browse/HADOOP-1902) | du command throws an exception when the directory is not specified | Major | . | Mukund Madhugiri | Mahadev konar |
| [HADOOP-1373](https://issues.apache.org/jira/browse/HADOOP-1373) | checkPath() throws IllegalArgumentException | Blocker | fs | Konstantin Shvachko | Edward J. Yoon |
### TESTS:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
| [HADOOP-2997](https://issues.apache.org/jira/browse/HADOOP-2997) | Add test for non-writable serializer | Blocker | . | Tom White | Tom White |
| [HADOOP-2775](https://issues.apache.org/jira/browse/HADOOP-2775) | [HOD] Put in place unit test framework for HOD | Major | contrib/hod | Hemanth Yamijala | Vinod Kumar Vavilapalli |
### SUB-TASKS:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
### OTHER:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
| [HADOOP-2981](https://issues.apache.org/jira/browse/HADOOP-2981) | Follow Apache process for getting ready to put crypto code in to project | Major | . | Owen O'Malley | Owen O'Malley |


@ -0,0 +1,604 @@
<!---
# Licensed to the Apache Software Foundation (ASF) under one
# or more contributor license agreements. See the NOTICE file
# distributed with this work for additional information
# regarding copyright ownership. The ASF licenses this file
# to you under the Apache License, Version 2.0 (the
# "License"); you may not use this file except in compliance
# with the License. You may obtain a copy of the License at
#
# http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing, software
# distributed under the License is distributed on an "AS IS" BASIS,
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# limitations under the License.
-->
# Apache Hadoop 0.17.0 Release Notes
These release notes cover new developer and user-facing incompatibilities, features, and major improvements.
---
* [HADOOP-3382](https://issues.apache.org/jira/browse/HADOOP-3382) | *Blocker* | **Memory leak when files are not cleanly closed**
Fixed a memory leak associated with 'abandoned' files (i.e. files not cleanly closed). The leak could hold up significant amounts of memory, depending on activity and how long the NameNode had been running.
---
* [HADOOP-3280](https://issues.apache.org/jira/browse/HADOOP-3280) | *Blocker* | **virtual address space limits break streaming apps**
This patch adds the mapred.child.ulimit configuration property, which limits the virtual memory of child processes to the given value.
---
* [HADOOP-3266](https://issues.apache.org/jira/browse/HADOOP-3266) | *Major* | **Remove HOD changes from CHANGES.txt, as they are now inside src/contrib/hod**
Moved HOD change items from CHANGES.txt to a new file src/contrib/hod/CHANGES.txt.
---
* [HADOOP-3239](https://issues.apache.org/jira/browse/HADOOP-3239) | *Major* | **exists() calls logs FileNotFoundException in namenode log**
getFileInfo now returns null when a file is not found instead of throwing FileNotFoundException.
---
* [HADOOP-3223](https://issues.apache.org/jira/browse/HADOOP-3223) | *Blocker* | **Hadoop dfs -help for permissions contains a typo**
Minor typo fix in the help message for chmod. Impact: none.
---
* [HADOOP-3204](https://issues.apache.org/jira/browse/HADOOP-3204) | *Blocker* | **LocalFSMerger needs to catch throwable**
Fixes LocalFSMerger in ReduceTask.java to handle errors/exceptions better. Prior to this all exceptions except IOException would be silently ignored.
---
* [HADOOP-3168](https://issues.apache.org/jira/browse/HADOOP-3168) | *Major* | **reduce amount of logging in hadoop streaming**
Decreases the frequency of logging from streaming from every 100 records to every 10,000 records.
---
* [HADOOP-3162](https://issues.apache.org/jira/browse/HADOOP-3162) | *Blocker* | **Map/reduce stops working with comma separated input paths**
The public methods org.apache.hadoop.mapred.JobConf.setInputPath(Path) and org.apache.hadoop.mapred.JobConf.addInputPath(Path) are deprecated; they retain the semantics of branch 0.16.
The following public APIs are added in org.apache.hadoop.mapred.FileInputFormat :
public static void setInputPaths(JobConf job, Path... paths);
public static void setInputPaths(JobConf job, String commaSeparatedPaths);
public static void addInputPath(JobConf job, Path path);
public static void addInputPaths(JobConf job, String commaSeparatedPaths);
Earlier code calling JobConf.setInputPath(Path) and JobConf.addInputPath(Path) should now call FileInputFormat.setInputPaths(JobConf, Path...) and FileInputFormat.addInputPath(JobConf, Path), respectively.
---
* [HADOOP-3152](https://issues.apache.org/jira/browse/HADOOP-3152) | *Minor* | **Make index interval configuable when using MapFileOutputFormat for map-reduce job**
Add a static method MapFile#setIndexInterval(Configuration, int interval) so that MapReduce jobs that use MapFileOutputFormat can set the index interval.
---
* [HADOOP-3140](https://issues.apache.org/jira/browse/HADOOP-3140) | *Major* | **JobTracker should not try to promote a (map) task if it does not write to DFS at all**
Tasks that don't generate any output are not inserted in the commit queue of the JobTracker. They are marked as SUCCESSFUL by the TaskTracker and the JobTracker updates their state short-circuiting the commit queue.
---
* [HADOOP-3137](https://issues.apache.org/jira/browse/HADOOP-3137) | *Major* | **[HOD] Update hod version number**
Build script was changed to make HOD versions follow Hadoop version numbers. As a result of this change, the next version of HOD would not be 0.5, but would be synchronized to the Hadoop version number. Users who rely on the version number of HOD should note the unexpected jump in version numbers.
---
* [HADOOP-3124](https://issues.apache.org/jira/browse/HADOOP-3124) | *Major* | **DFS data node should not use hard coded 10 minutes as write timeout.**
Makes the DataNode socket write timeout configurable. User impact: none.
---
* [HADOOP-3099](https://issues.apache.org/jira/browse/HADOOP-3099) | *Blocker* | **Need new options in distcp for preserving ower, group and permission**
Added a new option -p to distcp for preserving file/directory status.
-p[rbugp] Preserve status
r: replication number
b: block size
u: user
g: group
p: permission
-p alone is equivalent to -prbugp
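
To make the flag rules above concrete, here is a minimal sketch of how a `-p[rbugp]` argument could be decoded. The class name `PreserveFlags` and the enum `Attr` are hypothetical, and this is not distcp's actual option parser; it only mirrors the letters listed in the note and the rule that a bare `-p` selects everything.

```java
import java.util.EnumSet;

// Hypothetical illustration only; not distcp's real option parser.
class PreserveFlags {
    enum Attr { REPLICATION, BLOCK_SIZE, USER, GROUP, PERMISSION }

    // Decodes an argument such as "-p", "-pug" or "-prbugp".
    static EnumSet<Attr> parse(String arg) {
        if (!arg.startsWith("-p")) {
            throw new IllegalArgumentException("not a -p option: " + arg);
        }
        String letters = arg.substring(2);
        if (letters.isEmpty()) {
            // -p alone is equivalent to -prbugp
            return EnumSet.allOf(Attr.class);
        }
        EnumSet<Attr> attrs = EnumSet.noneOf(Attr.class);
        for (char c : letters.toCharArray()) {
            switch (c) {
                case 'r': attrs.add(Attr.REPLICATION); break;
                case 'b': attrs.add(Attr.BLOCK_SIZE); break;
                case 'u': attrs.add(Attr.USER); break;
                case 'g': attrs.add(Attr.GROUP); break;
                case 'p': attrs.add(Attr.PERMISSION); break;
                default:
                    throw new IllegalArgumentException("unknown preserve flag: " + c);
            }
        }
        return attrs;
    }

    public static void main(String[] args) {
        System.out.println(parse("-p"));   // all five attributes
        System.out.println(parse("-pug")); // [USER, GROUP]
    }
}
```

Note that the first `p` is the option name itself, so only the letters after it are treated as attribute flags.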
---
* [HADOOP-3093](https://issues.apache.org/jira/browse/HADOOP-3093) | *Major* | **ma/reduce throws the following exception if "io.serializations" is not set:**
The following public APIs are added in org.apache.hadoop.conf.Configuration
String[] Configuration.getStrings(String name, String... defaultValue) and
void Configuration.setStrings(String name, String... values)
---
* [HADOOP-3091](https://issues.apache.org/jira/browse/HADOOP-3091) | *Major* | **hadoop dfs -put should support multiple src**
hadoop dfs -put accepts multiple sources when destination is a directory.
---
* [HADOOP-3073](https://issues.apache.org/jira/browse/HADOOP-3073) | *Blocker* | **SocketOutputStream.close() should close the channel.**
SocketOutputStream.close() now closes the underlying channel, which increases compatibility with java.net.Socket.getOutputStream. User impact: none.
---
* [HADOOP-3060](https://issues.apache.org/jira/browse/HADOOP-3060) | *Major* | **MiniMRCluster is ignoring parameter taskTrackerFirst**
The parameter boolean taskTrackerFirst is removed from org.apache.hadoop.mapred.MiniMRCluster constructors.
Thus the signatures of the following constructors
public MiniMRCluster(int jobTrackerPort, int taskTrackerPort, int numTaskTrackers, String namenode, boolean taskTrackerFirst, int numDir)
public MiniMRCluster(int jobTrackerPort, int taskTrackerPort, int numTaskTrackers, String namenode, boolean taskTrackerFirst, int numDir, String[] racks)
public MiniMRCluster(int jobTrackerPort, int taskTrackerPort, int numTaskTrackers, String namenode, boolean taskTrackerFirst, int numDir, String[] racks, String[] hosts)
public MiniMRCluster(int jobTrackerPort, int taskTrackerPort, int numTaskTrackers, String namenode, boolean taskTrackerFirst, int numDir, String[] racks, String[] hosts, UnixUserGroupInformation ugi )
are changed to
public MiniMRCluster(int jobTrackerPort, int taskTrackerPort, int numTaskTrackers, String namenode, int numDir)
public MiniMRCluster(int jobTrackerPort, int taskTrackerPort, int numTaskTrackers, String namenode, int numDir, String[] racks)
public MiniMRCluster(int jobTrackerPort, int taskTrackerPort, int numTaskTrackers, String namenode, int numDir, String[] racks, String[] hosts)
public MiniMRCluster(int jobTrackerPort, int taskTrackerPort, int numTaskTrackers, String namenode, int numDir, String[] racks, String[] hosts, UnixUserGroupInformation ugi )
respectively.
Since the old signatures were not deprecated, any code using the old constructors must be changed to use the new constructors.
---
* [HADOOP-3048](https://issues.apache.org/jira/browse/HADOOP-3048) | *Blocker* | **Stringifier**
Adds a new interface and a default implementation that convert objects to string serializations and restore them.
---
* [HADOOP-3041](https://issues.apache.org/jira/browse/HADOOP-3041) | *Blocker* | **Within a task, the value ofJobConf.getOutputPath() method is modified**
1. Deprecates JobConf.setOutputPath and JobConf.getOutputPath
JobConf.getOutputPath() still returns the same value that it used to return.
2. Deprecates OutputFormatBase. Adds FileOutputFormat. Existing output formats extending OutputFormatBase, now extend FileOutputFormat.
3. Adds the following APIs in FileOutputFormat :
public static void setOutputPath(JobConf conf, Path outputDir); // sets mapred.output.dir
public static Path getOutputPath(JobConf conf) ; // gets mapred.output.dir
public static Path getWorkOutputPath(JobConf conf); // gets mapred.work.output.dir
4. static void setWorkOutputPath(JobConf conf, Path outputDir) is also added to FileOutputFormat. The framework uses it to set mapred.work.output.dir as the task's temporary output directory.
---
* [HADOOP-3040](https://issues.apache.org/jira/browse/HADOOP-3040) | *Major* | **Streaming should assume an empty key if the first character on a line is the seperator (stream.map.output.field.separator, by default, tab)**
If the first character on a line is the separator, an empty key is assumed and the whole line is the value (previously, due to a bug, this was not the case).
---
* [HADOOP-3001](https://issues.apache.org/jira/browse/HADOOP-3001) | *Blocker* | **FileSystems should track how many bytes are read and written**
Adds new framework map/reduce counters that track the number of bytes read and written to HDFS, local, KFS, and S3 file systems.
---
* [HADOOP-2982](https://issues.apache.org/jira/browse/HADOOP-2982) | *Blocker* | **[HOD] checknodes should look for free nodes without the jobs attribute**
The number of free nodes in the cluster is computed using a better algorithm that filters out inconsistencies in node status as reported by Torque.
---
* [HADOOP-2947](https://issues.apache.org/jira/browse/HADOOP-2947) | *Blocker* | **[HOD] Hod should redirect stderr and stdout of Hadoop daemons to assist debugging**
The stdout and stderr streams of daemons are redirected to files that are created under the hadoop log directory. Users can now send signal 3 (for example with kill -3) to the daemons to get stack traces and thread dumps for debugging.
---
* [HADOOP-2899](https://issues.apache.org/jira/browse/HADOOP-2899) | *Major* | **[HOD] hdfs:///mapredsystem directory not cleaned up after deallocation**
The mapred system directory generated by HOD is cleaned up at cluster deallocation time.
---
* [HADOOP-2873](https://issues.apache.org/jira/browse/HADOOP-2873) | *Major* | **Namenode fails to re-start after cluster shutdown - DFSClient: Could not obtain blocks even all datanodes were up & live**
**WARNING: No release note provided for this incompatible change.**
---
* [HADOOP-2855](https://issues.apache.org/jira/browse/HADOOP-2855) | *Blocker* | **[HOD] HOD fails to allocate a cluster if the tarball specified is a relative path**
Changes were made to handle relative paths correctly for important HOD options such as the cluster directory, tarball option, and script file.
---
* [HADOOP-2854](https://issues.apache.org/jira/browse/HADOOP-2854) | *Blocker* | **Remove the deprecated ipc.Server.getUserInfo()**
Removes deprecated method Server.getUserInfo()
---
* [HADOOP-2839](https://issues.apache.org/jira/browse/HADOOP-2839) | *Blocker* | **Remove deprecated methods in FileSystem**
Removes deprecated API FileSystem#globPaths()
---
* [HADOOP-2831](https://issues.apache.org/jira/browse/HADOOP-2831) | *Blocker* | **Remove the deprecated INode.getAbsoluteName()**
Removes deprecated method INode#getAbsoluteName()
---
* [HADOOP-2828](https://issues.apache.org/jira/browse/HADOOP-2828) | *Major* | **Remove deprecated methods in Configuration.java**
The following deprecated methods in org.apache.hadoop.conf.Configuration are removed.
public Object getObject(String name)
public void setObject(String name, Object value)
public Object get(String name, Object defaultValue)
public void set(String name, Object value)
and public Iterator entries()
---
* [HADOOP-2826](https://issues.apache.org/jira/browse/HADOOP-2826) | *Major* | **FileSplit.getFile(), LineRecordReader. readLine() need to be removed**
The deprecated methods, public File org.apache.hadoop.mapred.FileSplit.getFile() and
public static long org.apache.hadoop.mapred.LineRecordReader.readLine(InputStream in, OutputStream out)
are removed.
The visibility of the constructor org.apache.hadoop.mapred.LineRecordReader.LineReader(InputStream in, Configuration conf) is made public.
The signature of the public org.apache.hadoop.streaming.UTF8ByteArrayUtils.readLine(InputStream) method is changed to UTF8ByteArrayUtils.readLine(LineReader, Text). Since the old signature is not deprecated, any code using the old method must be changed to use the new method.
---
* [HADOOP-2825](https://issues.apache.org/jira/browse/HADOOP-2825) | *Major* | **MapOutputLocation.getFile() needs to be removed**
The deprecated method, public long org.apache.hadoop.mapred.MapOutputLocation.getFile(FileSystem fileSys, Path localFilename, int reduce, Progressable pingee, int timeout) is removed.
---
* [HADOOP-2824](https://issues.apache.org/jira/browse/HADOOP-2824) | *Major* | **One of MiniMRCluster constructors needs tobe removed**
The deprecated constructor org.apache.hadoop.mapred.MiniMRCluster.MiniMRCluster(int jobTrackerPort, int taskTrackerPort, int numTaskTrackers, String namenode, boolean taskTrackerFirst) is removed.
---
* [HADOOP-2823](https://issues.apache.org/jira/browse/HADOOP-2823) | *Major* | **SimpleCharStream.getColumn(), getLine() methods to be removed.**
The deprecated methods in org.apache.hadoop.record.compiler.generated.SimpleCharStream :
public int getColumn()
and public int getLine() are removed
---
* [HADOOP-2822](https://issues.apache.org/jira/browse/HADOOP-2822) | *Major* | **Remove deprecated classes in mapred**
The deprecated classes org.apache.hadoop.mapred.InputFormatBase and org.apache.hadoop.mapred.PhasedFileSystem are removed.
---
* [HADOOP-2821](https://issues.apache.org/jira/browse/HADOOP-2821) | *Major* | **Remove deprecated classes in util**
The deprecated classes org.apache.hadoop.util.ShellUtil and org.apache.hadoop.util.ToolBase are removed.
---
* [HADOOP-2820](https://issues.apache.org/jira/browse/HADOOP-2820) | *Major* | **Remove deprecated classes in streaming**
The deprecated classes org.apache.hadoop.streaming.StreamLineRecordReader, org.apache.hadoop.streaming.StreamOutputFormat and org.apache.hadoop.streaming.StreamSequenceRecordReader are removed
---
* [HADOOP-2819](https://issues.apache.org/jira/browse/HADOOP-2819) | *Major* | **Remove deprecated methods in JobConf()**
The following deprecated methods are removed from org.apache.hadoop.mapred.JobConf :
public Class getInputKeyClass()
public void setInputKeyClass(Class theClass)
public Class getInputValueClass()
public void setInputValueClass(Class theClass)
The methods public boolean org.apache.hadoop.mapred.JobConf.getSpeculativeExecution() and
public void org.apache.hadoop.mapred.JobConf.setSpeculativeExecution(boolean speculativeExecution) are undeprecated.
---
* [HADOOP-2818](https://issues.apache.org/jira/browse/HADOOP-2818) | *Major* | **Remove deprecated Counters.getDisplayName(), getCounterNames(), getCounter(String counterName)**
The deprecated methods public String org.apache.hadoop.mapred.Counters.getDisplayName(String counter) and
public synchronized Collection\<String\> org.apache.hadoop.mapred.Counters.getCounterNames() are removed.
The deprecated method public synchronized long org.apache.hadoop.mapred.Counters.getCounter(String counterName) is undeprecated.
---
* [HADOOP-2817](https://issues.apache.org/jira/browse/HADOOP-2817) | *Major* | **Remove deprecated mapred.tasktracker.tasks.maximum and clusterStatus.getMaxTasks()**
The deprecated method public int org.apache.hadoop.mapred.ClusterStatus.getMaxTasks() is removed.
The deprecated configuration property "mapred.tasktracker.tasks.maximum" is removed.
---
* [HADOOP-2796](https://issues.apache.org/jira/browse/HADOOP-2796) | *Major* | **For script option hod should exit with distinguishable exit codes for script code and hod exit code.**
A provision to reliably detect a failing script's exit code was added. In case the hod script option returned a non-zero exit code, users can now look for a 'script.exitcode' file written to the HOD cluster directory. If this file is present, it means the script failed with the returned exit code.
---
* [HADOOP-2775](https://issues.apache.org/jira/browse/HADOOP-2775) | *Major* | **[HOD] Put in place unit test framework for HOD**
A unit testing framework based on pyunit is added to HOD. Developers contributing patches to HOD should now contribute unit tests along with the patches where possible.
---
* [HADOOP-2765](https://issues.apache.org/jira/browse/HADOOP-2765) | *Major* | **setting memory limits for tasks**
This feature enables specifying ulimits for streaming/pipes tasks. Pipes and streaming tasks now have the same virtual memory available as the Java process that invokes them. The ulimit value will be the same as the -Xmx value for Java processes provided using mapred.child.java.opts.
---
* [HADOOP-2758](https://issues.apache.org/jira/browse/HADOOP-2758) | *Major* | **Reduce memory copies when data is read from DFS**
DataNode takes 50% less CPU while serving data to clients.
---
* [HADOOP-2657](https://issues.apache.org/jira/browse/HADOOP-2657) | *Major* | **Enhancements to DFSClient to support flushing data at any point in time**
A new API, DFSOutputStream.flush(), flushes all outstanding data to the pipeline of datanodes.
---
* [HADOOP-2634](https://issues.apache.org/jira/browse/HADOOP-2634) | *Blocker* | **Deprecate exists() and isDir() to simplify ClientProtocol.**
Deprecates exists() from ClientProtocol
---
* [HADOOP-2563](https://issues.apache.org/jira/browse/HADOOP-2563) | *Blocker* | **Remove deprecated FileSystem#listPaths()**
Removes deprecated method FileSystem#listPaths()
---
* [HADOOP-2559](https://issues.apache.org/jira/browse/HADOOP-2559) | *Major* | **DFS should place one replica per rack**
Change DFS block placement to allocate the first replica locally, the second off-rack, and the third intra-rack from the second.
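
The placement rule above can be illustrated with a small, self-contained sketch. Everything here is hypothetical (the class `PlacementSketch`, the node and rack names, and the "first match wins" selection); it makes no claim about how HDFS actually picks among candidate nodes, only about the local / off-rack / same-rack-as-second ordering stated in the note.

```java
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

// Hypothetical illustration of the three-replica rule; not HDFS code.
class PlacementSketch {
    // nodeToRack maps node name -> rack id; the writer is assumed to be a key in it.
    static List<String> chooseTargets(String writer, Map<String, String> nodeToRack) {
        List<String> targets = new ArrayList<>();
        targets.add(writer); // first replica: the local node
        String writerRack = nodeToRack.get(writer);

        // Second replica: first node found on a different rack than the writer.
        String second = null;
        for (Map.Entry<String, String> e : nodeToRack.entrySet()) {
            if (!e.getValue().equals(writerRack)) {
                second = e.getKey();
                break;
            }
        }
        if (second == null) {
            return targets; // single-rack cluster: this sketch simply stops here
        }
        targets.add(second);

        // Third replica: another node on the same rack as the second replica.
        String secondRack = nodeToRack.get(second);
        for (Map.Entry<String, String> e : nodeToRack.entrySet()) {
            if (e.getValue().equals(secondRack) && !e.getKey().equals(second)) {
                targets.add(e.getKey());
                break;
            }
        }
        return targets;
    }

    public static void main(String[] args) {
        Map<String, String> cluster = new LinkedHashMap<>();
        cluster.put("n1", "/rackA");
        cluster.put("n2", "/rackA");
        cluster.put("n3", "/rackB");
        cluster.put("n4", "/rackB");
        System.out.println(chooseTargets("n1", cluster)); // [n1, n3, n4]
    }
}
```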
---
* [HADOOP-2551](https://issues.apache.org/jira/browse/HADOOP-2551) | *Blocker* | **hadoop-env.sh needs finer granularity**
New environment variables were introduced to allow finer grained control of Java options passed to server and client JVMs. See the new *\_OPTS variables in conf/hadoop-env.sh.
---
* [HADOOP-2470](https://issues.apache.org/jira/browse/HADOOP-2470) | *Major* | **Open and isDir should be removed from ClientProtocol**
Open and isDir were removed from ClientProtocol.
---
* [HADOOP-2423](https://issues.apache.org/jira/browse/HADOOP-2423) | *Major* | **The codes in FSDirectory.mkdirs(...) is inefficient.**
Improved FSDirectory.mkdirs(...) performance. In NNThroughputBenchmark-create, the ops per second improved by ~54%.
---
* [HADOOP-2410](https://issues.apache.org/jira/browse/HADOOP-2410) | *Major* | **Make EC2 cluster nodes more independent of each other**
The command "hadoop-ec2 run" has been replaced by "hadoop-ec2 launch-cluster \<group\> \<number of instances\>", and "hadoop-ec2 start-hadoop" has been removed since Hadoop is started on instance start up. See http://wiki.apache.org/hadoop/AmazonEC2 for details.
---
* [HADOOP-2399](https://issues.apache.org/jira/browse/HADOOP-2399) | *Major* | **Input key and value to combiner and reducer should be reused**
The key and value objects that are given to the Combiner and Reducer are now reused between calls. This is much more efficient, but user code cannot assume that the objects remain unchanged after the call returns.
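
The practical consequence of object reuse is that code which stores a reference to a value, rather than a copy of its contents, ends up with many references to the same instance. The sketch below demonstrates this with plain Java; `MutableText` and the `reusing` iterator are hypothetical stand-ins for a mutable value type and for the framework's value iterator, not Hadoop classes.

```java
import java.util.ArrayList;
import java.util.Iterator;
import java.util.List;

// Hypothetical illustration of object reuse; no Hadoop classes are used.
class ReuseSketch {
    // Stand-in for a mutable value type.
    static class MutableText {
        private String s = "";
        void set(String v) { s = v; }
        String get() { return s; }
    }

    // Hands back the same instance on every call, refilled in place.
    static Iterator<MutableText> reusing(List<String> data) {
        final MutableText shared = new MutableText();
        final Iterator<String> it = data.iterator();
        return new Iterator<MutableText>() {
            public boolean hasNext() { return it.hasNext(); }
            public MutableText next() { shared.set(it.next()); return shared; }
        };
    }

    // Wrong: keeps references to the reused object, so every entry shows the last value.
    static List<String> collectByReference(List<String> data) {
        List<MutableText> kept = new ArrayList<>();
        Iterator<MutableText> values = reusing(data);
        while (values.hasNext()) {
            kept.add(values.next());
        }
        List<String> out = new ArrayList<>();
        for (MutableText t : kept) {
            out.add(t.get());
        }
        return out;
    }

    // Right: copies the contents out before asking for the next value.
    static List<String> collectByCopy(List<String> data) {
        List<String> out = new ArrayList<>();
        Iterator<MutableText> values = reusing(data);
        while (values.hasNext()) {
            out.add(values.next().get());
        }
        return out;
    }

    public static void main(String[] args) {
        List<String> data = new ArrayList<>();
        data.add("a");
        data.add("b");
        data.add("c");
        System.out.println(collectByReference(data)); // [c, c, c]
        System.out.println(collectByCopy(data));      // [a, b, c]
    }
}
```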
---
* [HADOOP-2345](https://issues.apache.org/jira/browse/HADOOP-2345) | *Major* | **new transactions to support HDFS Appends**
Introduce new namenode transactions to support appending to HDFS files.
---
* [HADOOP-2239](https://issues.apache.org/jira/browse/HADOOP-2239) | *Major* | **Security: Need to be able to encrypt Hadoop socket connections**
This patch adds a new FileSystem, HftpsFileSystem, that allows access to HDFS data over HTTPS.
---
* [HADOOP-2219](https://issues.apache.org/jira/browse/HADOOP-2219) | *Major* | **du like command to count number of files under a given directory**
Added a new fs command fs -count for counting the number of bytes, files and directories under a given path.
Added a new RPC getContentSummary(String path) to ClientProtocol.
---
* [HADOOP-2192](https://issues.apache.org/jira/browse/HADOOP-2192) | *Major* | **dfs mv command differs from POSIX standards**
This patch makes dfs -mv behave more like the Linux mv command: it removes unnecessary output and returns an error message when moving nonexistent files or directories, for example: mv: cannot stat "filename": No such file or directory.
---
* [HADOOP-2178](https://issues.apache.org/jira/browse/HADOOP-2178) | *Major* | **Job history on HDFS**
This feature provides a facility to store job history on DFS. The cluster admin can now provide either a localFS location or a DFS location through the configuration property "mapred.job.history.location" to store job history. History is also logged in a user-specified location, which the user can set with the configuration property "mapred.job.history.user.location".
The classes org.apache.hadoop.mapred.DefaultJobHistoryParser.MasterIndex and org.apache.hadoop.mapred.DefaultJobHistoryParser.MasterIndexParseListener, and public method org.apache.hadoop.mapred.DefaultJobHistoryParser.parseMasterIndex are not available.
The signature of public method org.apache.hadoop.mapred.DefaultJobHistoryParser.parseJobTasks(File jobHistoryFile, JobHistory.JobInfo job) is changed to DefaultJobHistoryParser.parseJobTasks(String jobHistoryFile, JobHistory.JobInfo job, FileSystem fs).
The signature of public method org.apache.hadoop.mapred.JobHistory.parseHistory(File path, Listener l) is changed to JobHistory.parseHistoryFromFS(String path, Listener l, FileSystem fs)
---
* [HADOOP-2119](https://issues.apache.org/jira/browse/HADOOP-2119) | *Critical* | **JobTracker becomes non-responsive if the task trackers finish task too fast**
This removes many inefficiencies in task placement and scheduling logic. The JobTracker would perform linear scans of the list of submitted tasks in cases where it did not find an obvious candidate task for a node. With better data structures for managing job state, all task placement operations now run in constant time (in most cases). Also, the task output promotions are batched.
---
* [HADOOP-2116](https://issues.apache.org/jira/browse/HADOOP-2116) | *Major* | **Job.local.dir to be exposed to tasks**
This issue restructures local job directory on the tasktracker.
Users are provided with a job-specific shared directory (mapred-local/taskTracker/jobcache/$jobid/work) to use as scratch space, exposed through the configuration property and system property "job.local.dir". The directory "../work" is no longer available from the task's cwd.
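
As a minimal sketch of how task code might pick up the scratch directory, the snippet below reads the "job.local.dir" system property named in the note. The class and method names are hypothetical, and returning null when the property is absent is an assumption of this sketch (for instance when the code runs outside a task).

```java
import java.io.File;

// Hypothetical helper; only the property name "job.local.dir" comes from the note.
class JobScratchSketch {
    // Returns the shared scratch directory, or null when the property is not set.
    static File jobLocalDir() {
        String dir = System.getProperty("job.local.dir");
        return dir == null ? null : new File(dir);
    }

    public static void main(String[] args) {
        File scratch = jobLocalDir();
        System.out.println(scratch == null
                ? "job.local.dir is not set"
                : "scratch space: " + scratch);
    }
}
```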
---
* [HADOOP-2063](https://issues.apache.org/jira/browse/HADOOP-2063) | *Blocker* | **Command to pull corrupted files**
Added a new option -ignoreCrc to fs -get, or equivalently, fs -copyToLocal, so that the CRC checksum is ignored for the command. This option is intended for downloading corrupted files.
---
* [HADOOP-2055](https://issues.apache.org/jira/browse/HADOOP-2055) | *Minor* | **JobConf should have a setInputPathFilter method**
This issue lets users specify which paths to ignore when processing the job input directory (in addition to filenames that start with "\_" and "."). It defines two new APIs: FileInputFormat.setInputPathFilter(JobConf, PathFilter) and FileInputFormat.getInputPathFilter(JobConf).
---
* [HADOOP-2027](https://issues.apache.org/jira/browse/HADOOP-2027) | *Major* | **FileSystem should provide byte ranges for file locations**
Adds a new FileSystem API, getFileBlockLocations, which returns the number of bytes in each block of a file via a single RPC to the namenode, speeding up job planning. Deprecates getFileCacheHints.
---
* [HADOOP-1986](https://issues.apache.org/jira/browse/HADOOP-1986) | *Major* | **Add support for a general serialization mechanism for Map Reduce**
Programs that implement the raw Mapper or Reducer interfaces will need modification to compile with this release. For example,
class MyMapper implements Mapper {
public void map(WritableComparable key, Writable val,
OutputCollector out, Reporter reporter) throws IOException {
// ...
}
// ...
}
will need to be changed to refer to the parameterized type. For example:
class MyMapper implements Mapper\<WritableComparable, Writable, WritableComparable, Writable\> {
public void map(WritableComparable key, Writable val,
OutputCollector\<WritableComparable, Writable\> out, Reporter reporter) throws IOException {
// ...
}
// ...
}
Similarly implementations of the following raw interfaces will need modification: InputFormat, OutputCollector, OutputFormat, Partitioner, RecordReader, RecordWriter
---
* [HADOOP-1985](https://issues.apache.org/jira/browse/HADOOP-1985) | *Major* | **Abstract node to switch mapping into a topology service class used by namenode and jobtracker**
This issue introduces rack awareness for map tasks. It also moves the rack resolution logic to the central servers, the NameNode and the JobTracker. The administrator can set topology.node.switch.mapping.impl to a loadable class that implements the rack resolution logic. The class must implement a method resolve(List\<String\> names), where names is the list of DNS names or IP addresses to be resolved. The return value is a list of resolved network paths of the form /foo/rack, where rack is the ID of the rack the node belongs to and foo is the switch to which multiple racks are connected, and so on. The default implementation, packaged with Hadoop, is org.apache.hadoop.net.ScriptBasedMapping, which loads a script that can be used for rack resolution. The script location is configurable through topology.script.file.name and defaults to an empty script. When the script name is empty, /default-rack is returned for all DNS names and IP addresses. The loadable topology.node.switch.mapping.impl gives administrators the flexibility to define how node resolution should happen at their site.
For mapred, one can also specify the level of the cache with respect to the number of levels in the resolved network path; it defaults to two. This means that the JobTracker will cache tasks at the host level and at the rack level.
Known issue: the task caching will not work with levels greater than 2 (beyond racks). This bug is tracked in HADOOP-3296.
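As a rough illustration of the resolve contract described above, the sketch below maps names to network paths and falls back to /default-rack for anything it does not know. It is standalone Java and does not implement the Hadoop interface; the class name `StaticRackResolver`, the lookup table, and the host names are hypothetical.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

// Hypothetical, standalone sketch; it does not implement any Hadoop interface.
final class StaticRackResolver {
    static final String DEFAULT_RACK = "/default-rack";

    // Assumed static table from DNS name or IP address to network path.
    private final Map<String, String> table = new HashMap<>();

    StaticRackResolver(Map<String, String> hostToPath) {
        table.putAll(hostToPath);
    }

    // Returns one network path per input name, in the same order.
    List<String> resolve(List<String> names) {
        List<String> paths = new ArrayList<>(names.size());
        for (String name : names) {
            paths.add(table.getOrDefault(name, DEFAULT_RACK));
        }
        return paths;
    }

    public static void main(String[] args) {
        Map<String, String> known = new HashMap<>();
        known.put("node1.example.com", "/switch1/rack1");
        known.put("10.0.0.7", "/switch1/rack2");
        StaticRackResolver resolver = new StaticRackResolver(known);
        System.out.println(resolver.resolve(
            Arrays.asList("node1.example.com", "10.0.0.7", "unknown-host")));
    }
}
```

Unknown names resolve to /default-rack, mirroring the behavior described for an empty script name.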
---
* [HADOOP-1622](https://issues.apache.org/jira/browse/HADOOP-1622) | *Major* | **Hadoop should provide a way to allow the user to specify jar file(s) the user job depends on**
This patch adds new command line options to hadoop jar:
hadoop jar -files \<comma separated list of files\> -libjars \<comma separated list of jars\> -archives \<comma separated list of archives\>
The -files option lets you specify a comma separated list of paths that will be present in the current working directory of your task.
The -libjars option lets you add jars to the classpaths of the maps and reduces.
The -archives option lets you pass archives as arguments; they are unzipped/unjarred, and a link with the name of the jar/zip is created in the current working directory of the tasks.
---
* [HADOOP-1593](https://issues.apache.org/jira/browse/HADOOP-1593) | *Major* | **FsShell should work with paths in non-default FileSystem**
This fix allows paths in a non-default file system to be specified in FsShell commands.
So, you can now specify hadoop dfs -ls hdfs://remotehost1:port/path
and hadoop dfs -ls hdfs://remotehost2:port/path without changing the config.
---
* [HADOOP-910](https://issues.apache.org/jira/browse/HADOOP-910) | *Major* | **Reduces can do merges for the on-disk map output files in parallel with their copying**
Reducers now perform merges of shuffle data (both in-memory and on disk) while fetching map outputs. Earlier, during shuffle they used to merge only the in-memory outputs.
---
* [HADOOP-771](https://issues.apache.org/jira/browse/HADOOP-771) | *Major* | **Namenode should return error when trying to delete non-empty directory**
This patch adds a new API to the file system, delete(path, boolean), and deprecates the previous delete(path).
The new API deletes files recursively only if the boolean is set to true.
If path is a file, the boolean value does not matter. If path is a non-empty directory, delete(path, false) throws an exception and delete(path, true) deletes all files recursively.
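The rule for the boolean flag can be mirrored on a local file system. The following is a minimal sketch that uses java.nio.file rather than the Hadoop FileSystem API; the class name `RecursiveDeleteSketch` and the exception type it throws are assumptions made for illustration.

```java
import java.io.IOException;
import java.io.UncheckedIOException;
import java.nio.file.DirectoryStream;
import java.nio.file.Files;
import java.nio.file.Path;

// Hypothetical sketch of the delete(path, recursive) rule on a local file system.
final class RecursiveDeleteSketch {

    // Deletes a file, or a directory when it is empty or recursive is true.
    // Throws IllegalStateException for a non-empty directory when recursive is false.
    static void delete(Path path, boolean recursive) {
        try {
            if (Files.isDirectory(path)) {
                try (DirectoryStream<Path> children = Files.newDirectoryStream(path)) {
                    for (Path child : children) {
                        if (!recursive) {
                            throw new IllegalStateException(path + " is not empty");
                        }
                        delete(child, true);
                    }
                }
            }
            Files.delete(path);
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }

    public static void main(String[] args) throws IOException {
        Path dir = Files.createTempDirectory("delete-sketch");
        Files.createFile(dir.resolve("part-00000"));
        try {
            delete(dir, false);   // non-empty directory: rejected
        } catch (IllegalStateException expected) {
            System.out.println("refused: " + expected.getMessage());
        }
        delete(dir, true);        // removes the directory and its contents
        System.out.println("exists after recursive delete: " + Files.exists(dir));
    }
}
```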

@ -0,0 +1,74 @@
# Apache Hadoop Changelog
## Release 0.17.1 - 2008-06-23
### INCOMPATIBLE CHANGES:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
| [HADOOP-3565](https://issues.apache.org/jira/browse/HADOOP-3565) | JavaSerialization can throw java.io.StreamCorruptedException | Major | . | Tom White | Tom White |
### NEW FEATURES:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
### IMPROVEMENTS:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
### BUG FIXES:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
| [HADOOP-3550](https://issues.apache.org/jira/browse/HADOOP-3550) | Reduce tasks failing with OOM | Blocker | . | Arun C Murthy | Chris Douglas |
| [HADOOP-3526](https://issues.apache.org/jira/browse/HADOOP-3526) | contrib/data\_join doesn't work | Blocker | . | Spyros Blanas | Spyros Blanas |
| [HADOOP-3522](https://issues.apache.org/jira/browse/HADOOP-3522) | ValuesIterator.next() doesn't return a new object, thus failing many equals() tests. | Major | . | Spyros Blanas | Owen O'Malley |
| [HADOOP-3477](https://issues.apache.org/jira/browse/HADOOP-3477) | release tar.gz contains duplicate files | Major | build | Adam Heath | Adam Heath |
| [HADOOP-3475](https://issues.apache.org/jira/browse/HADOOP-3475) | MapOutputBuffer allocates 4x as much space to record capacity as intended | Major | . | Chris Douglas | Chris Douglas |
| [HADOOP-3472](https://issues.apache.org/jira/browse/HADOOP-3472) | MapFile.Reader getClosest() function returns incorrect results when before is true | Major | io | Todd Lipcon | stack |
| [HADOOP-3442](https://issues.apache.org/jira/browse/HADOOP-3442) | QuickSort may get into unbounded recursion | Blocker | . | Runping Qi | Chris Douglas |
| [HADOOP-2159](https://issues.apache.org/jira/browse/HADOOP-2159) | Namenode stuck in safemode | Major | . | Christian Kunz | Hairong Kuang |
| [HADOOP-1979](https://issues.apache.org/jira/browse/HADOOP-1979) | fsck on namenode without datanodes takes too much time | Minor | . | Koji Noguchi | Lohit Vijayarenu |
### TESTS:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
### SUB-TASKS:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
### OTHER:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |

@ -0,0 +1,38 @@
# Apache Hadoop 0.17.1 Release Notes
These release notes cover new developer and user-facing incompatibilities, features, and major improvements.
---
* [HADOOP-3565](https://issues.apache.org/jira/browse/HADOOP-3565) | *Major* | **JavaSerialization can throw java.io.StreamCorruptedException**
Change the Java serialization framework, which is not enabled by default, to correctly make the objects independent of the previous objects.
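For background, a single `java.io.ObjectOutputStream` remembers the objects it has already written and emits back-references to them, so a later record can depend on an earlier one. The sketch below shows that general JDK behavior and how `reset()` makes each record self-contained. It is illustrative only and is not the code of the Hadoop patch; the class and method names are hypothetical.

```java
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.io.ObjectInputStream;
import java.io.ObjectOutputStream;
import java.io.UncheckedIOException;
import java.util.ArrayList;
import java.util.List;

// Hypothetical demo of back-references in Java serialization streams.
final class IndependentRecordsSketch {

    // Writes the same mutable list twice, mutating it in between, then reads
    // both records back and returns the size of the second one.
    static int secondRecordSize(boolean resetBetweenRecords) {
        try {
            ByteArrayOutputStream bytes = new ByteArrayOutputStream();
            try (ObjectOutputStream out = new ObjectOutputStream(bytes)) {
                ArrayList<String> record = new ArrayList<>();
                record.add("a");
                out.writeObject(record);
                if (resetBetweenRecords) {
                    out.reset();   // forget previously written objects
                }
                record.add("b");
                out.writeObject(record);
            }
            try (ObjectInputStream in =
                     new ObjectInputStream(new ByteArrayInputStream(bytes.toByteArray()))) {
                in.readObject();   // first record
                List<?> second = (List<?>) in.readObject();
                return second.size();
            }
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        } catch (ClassNotFoundException e) {
            throw new IllegalStateException(e);
        }
    }

    public static void main(String[] args) {
        // Without reset the second record is only a back-reference to the first.
        System.out.println("without reset: " + secondRecordSize(false));
        System.out.println("with reset:    " + secondRecordSize(true));
    }
}
```

Without the reset, the second read returns the stale first copy of the list; with it, the second record carries its own full state.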
---
* [HADOOP-1979](https://issues.apache.org/jira/browse/HADOOP-1979) | *Minor* | **fsck on namenode without datanodes takes too much time**
Improved performance of fsck by better management of the data stream on the client side.

@ -0,0 +1,77 @@
# Apache Hadoop Changelog
## Release 0.17.2 - 2008-08-11
### INCOMPATIBLE CHANGES:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
### NEW FEATURES:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
### IMPROVEMENTS:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
### BUG FIXES:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
| [HADOOP-4773](https://issues.apache.org/jira/browse/HADOOP-4773) | namenode startup error, hadoop-user-namenode.pid permission denied. | Critical | . | Focus | |
| [HADOOP-3931](https://issues.apache.org/jira/browse/HADOOP-3931) | Bug in MapTask.MapOutputBuffer.collect leads to an unnecessary and harmful 'reset' | Blocker | . | Arun C Murthy | Chris Douglas |
| [HADOOP-3859](https://issues.apache.org/jira/browse/HADOOP-3859) | 1000 concurrent read on a single file failing the task/client | Blocker | . | Koji Noguchi | Johan Oskarsson |
| [HADOOP-3813](https://issues.apache.org/jira/browse/HADOOP-3813) | RPC queue overload of JobTracker | Major | . | Christian Kunz | Amareshwari Sriramadasu |
| [HADOOP-3760](https://issues.apache.org/jira/browse/HADOOP-3760) | DFS operations fail because of Stream closed error | Blocker | . | Amar Kamat | Lohit Vijayarenu |
| [HADOOP-3758](https://issues.apache.org/jira/browse/HADOOP-3758) | Excessive exceptions in HDFS namenode log file | Blocker | . | Jim Huang | Lohit Vijayarenu |
| [HADOOP-3707](https://issues.apache.org/jira/browse/HADOOP-3707) | Frequent DiskOutOfSpaceException on almost-full datanodes | Blocker | . | Koji Noguchi | Raghu Angadi |
| [HADOOP-3685](https://issues.apache.org/jira/browse/HADOOP-3685) | Unbalanced replication target | Blocker | . | Koji Noguchi | Hairong Kuang |
| [HADOOP-3681](https://issues.apache.org/jira/browse/HADOOP-3681) | Infinite loop in dfs close | Blocker | . | Koji Noguchi | Lohit Vijayarenu |
| [HADOOP-3678](https://issues.apache.org/jira/browse/HADOOP-3678) | Avoid spurious "DataXceiver: java.io.IOException: Connection reset by peer" errors in DataNode log | Blocker | . | Raghu Angadi | Raghu Angadi |
| [HADOOP-3633](https://issues.apache.org/jira/browse/HADOOP-3633) | Uncaught exception in DataXceiveServer | Blocker | . | Koji Noguchi | Konstantin Shvachko |
| [HADOOP-3370](https://issues.apache.org/jira/browse/HADOOP-3370) | failed tasks may stay forever in TaskTracker.runningJobs | Critical | . | Zheng Shao | Zheng Shao |
| [HADOOP-3002](https://issues.apache.org/jira/browse/HADOOP-3002) | HDFS should not remove blocks while in safemode. | Blocker | . | Konstantin Shvachko | Konstantin Shvachko |
### TESTS:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
### SUB-TASKS:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
### OTHER:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |

@ -0,0 +1,52 @@
# Apache Hadoop 0.17.2 Release Notes
These release notes cover new developer and user-facing incompatibilities, features, and major improvements.
---
* [HADOOP-3859](https://issues.apache.org/jira/browse/HADOOP-3859) | *Blocker* | **1000 concurrent read on a single file failing the task/client**
Allows the user to change the maximum number of xceivers in the datanode.
---
* [HADOOP-3760](https://issues.apache.org/jira/browse/HADOOP-3760) | *Blocker* | **DFS operations fail because of Stream closed error**
Fix a bug with HDFS file close() mistakenly introduced by HADOOP-3681.
---
* [HADOOP-3707](https://issues.apache.org/jira/browse/HADOOP-3707) | *Blocker* | **Frequent DiskOutOfSpaceException on almost-full datanodes**
NameNode keeps a count of the number of blocks scheduled to be written to a datanode and uses it to avoid allocating more blocks than a datanode can hold.
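One way to picture this bookkeeping is sketched below. It is a simplified, hypothetical model and not the NameNode implementation: the class name `DatanodeSpaceTracker`, the fixed block size, and the method names are all assumptions.

```java
// Hypothetical model of per-datanode accounting for scheduled block writes.
final class DatanodeSpaceTracker {
    private final long blockSize;    // assumed fixed block size in bytes
    private long remainingBytes;     // free space last reported by the datanode
    private int scheduledBlocks;     // blocks allocated but not yet reported as written

    DatanodeSpaceTracker(long remainingBytes, long blockSize) {
        this.remainingBytes = remainingBytes;
        this.blockSize = blockSize;
    }

    // True if one more block still fits after accounting for pending writes.
    boolean canAcceptBlock() {
        return remainingBytes - scheduledBlocks * blockSize >= blockSize;
    }

    // Reserves room for a block; returns false if the node would be overcommitted.
    boolean scheduleBlock() {
        if (!canAcceptBlock()) {
            return false;
        }
        scheduledBlocks++;
        return true;
    }

    // Called when the datanode reports a finished block along with its new free space.
    void blockReceived(long newRemainingBytes) {
        if (scheduledBlocks > 0) {
            scheduledBlocks--;
        }
        remainingBytes = newRemainingBytes;
    }

    public static void main(String[] args) {
        DatanodeSpaceTracker node = new DatanodeSpaceTracker(250L, 100L);
        System.out.println(node.scheduleBlock()); // true
        System.out.println(node.scheduleBlock()); // true
        System.out.println(node.scheduleBlock()); // false: only 50 bytes left unreserved
    }
}
```

Counting pending writes matters because the reported free space does not shrink until a block is actually written, so without the counter several allocations could target the same nearly full node.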
---
* [HADOOP-3678](https://issues.apache.org/jira/browse/HADOOP-3678) | *Blocker* | **Avoid spurious "DataXceiver: java.io.IOException: Connection reset by peer" errors in DataNode log**
Avoid spurious exceptions logged at DataNode when clients read from DFS.

@ -0,0 +1,70 @@
# Apache Hadoop Changelog
## Release 0.17.3 - Unreleased
### INCOMPATIBLE CHANGES:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
### NEW FEATURES:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
### IMPROVEMENTS:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
### BUG FIXES:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
| [HADOOP-4326](https://issues.apache.org/jira/browse/HADOOP-4326) | ChecksumFileSystem does not override all create(...) methods | Blocker | fs | Tsz Wo Nicholas Sze | Tsz Wo Nicholas Sze |
| [HADOOP-4318](https://issues.apache.org/jira/browse/HADOOP-4318) | distcp fails | Blocker | . | Christian Kunz | Tsz Wo Nicholas Sze |
| [HADOOP-4277](https://issues.apache.org/jira/browse/HADOOP-4277) | Checksum verification is disabled for LocalFS | Blocker | . | Raghu Angadi | Raghu Angadi |
| [HADOOP-4271](https://issues.apache.org/jira/browse/HADOOP-4271) | Bug in FSInputChecker makes it possible to read from an invalid buffer | Blocker | fs | Ning Li | Ning Li |
| [HADOOP-3217](https://issues.apache.org/jira/browse/HADOOP-3217) | [HOD] Be less aggressive when querying job status from resource manager. | Blocker | contrib/hod | Hemanth Yamijala | Hemanth Yamijala |
### TESTS:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
### SUB-TASKS:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
### OTHER:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
| [HADOOP-4164](https://issues.apache.org/jira/browse/HADOOP-4164) | Chinese translation of core docs | Major | documentation | Xuebing Yan | Xuebing Yan |

@ -0,0 +1,45 @@
# Apache Hadoop 0.17.3 Release Notes
These release notes cover new developer and user-facing incompatibilities, features, and major improvements.
---
* [HADOOP-4277](https://issues.apache.org/jira/browse/HADOOP-4277) | *Blocker* | **Checksum verification is disabled for LocalFS**
Checksum verification was mistakenly disabled for LocalFileSystem.
---
* [HADOOP-4271](https://issues.apache.org/jira/browse/HADOOP-4271) | *Blocker* | **Bug in FSInputChecker makes it possible to read from an invalid buffer**
Checksum input stream can sometimes return invalid data to the user.
---
* [HADOOP-4164](https://issues.apache.org/jira/browse/HADOOP-4164) | *Major* | **Chinese translation of core docs**
Chinese translation for hadoop 0.17.x core docs.

@ -0,0 +1,337 @@
# Apache Hadoop Changelog
## Release 0.18.0 - 2008-08-22
### INCOMPATIBLE CHANGES:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
| [HADOOP-3837](https://issues.apache.org/jira/browse/HADOOP-3837) | Hadoop streaming does not use progress reporting to detect hung tasks | Major | . | dhruba borthakur | dhruba borthakur |
| [HADOOP-3808](https://issues.apache.org/jira/browse/HADOOP-3808) | [HOD] Include job tracker RPC in notes attribute after job submission | Blocker | contrib/hod | Hemanth Yamijala | Hemanth Yamijala |
| [HADOOP-3703](https://issues.apache.org/jira/browse/HADOOP-3703) | [HOD] logcondense needs to use the new pattern of output in hadoop dfs -lsr | Blocker | contrib/hod | Hemanth Yamijala | Vinod Kumar Vavilapalli |
| [HADOOP-3683](https://issues.apache.org/jira/browse/HADOOP-3683) | Hadoop dfs metric FilesListed shows number of files listed instead of operations | Major | metrics | Lohit Vijayarenu | Lohit Vijayarenu |
| [HADOOP-3665](https://issues.apache.org/jira/browse/HADOOP-3665) | WritableComparator newKey() fails for NullWritable | Minor | io | Lukas Vlcek | Chris Douglas |
| [HADOOP-3610](https://issues.apache.org/jira/browse/HADOOP-3610) | [HOD] HOD does not automatically create a cluster directory for the script option | Blocker | contrib/hod | Hemanth Yamijala | Vinod Kumar Vavilapalli |
| [HADOOP-3598](https://issues.apache.org/jira/browse/HADOOP-3598) | Map-Reduce framework needlessly creates temporary \_${taskid} directories for Maps | Blocker | . | Arun C Murthy | Arun C Murthy |
| [HADOOP-3569](https://issues.apache.org/jira/browse/HADOOP-3569) | KFS input stream read() returns 4 bytes instead of 1 | Minor | . | Sriram Rao | Sriram Rao |
| [HADOOP-3512](https://issues.apache.org/jira/browse/HADOOP-3512) | Split map/reduce tools into separate jars | Major | . | Owen O'Malley | Owen O'Malley |
| [HADOOP-3486](https://issues.apache.org/jira/browse/HADOOP-3486) | Change default for initial block report to 0 sec and document it in hadoop-defaults.xml | Major | . | Sanjay Radia | Sanjay Radia |
| [HADOOP-3483](https://issues.apache.org/jira/browse/HADOOP-3483) | [HOD] Improvements with cluster directory handling | Major | contrib/hod | Hemanth Yamijala | Hemanth Yamijala |
| [HADOOP-3459](https://issues.apache.org/jira/browse/HADOOP-3459) | Change dfs -ls listing to closely match format on Linux | Major | . | Mukund Madhugiri | Mukund Madhugiri |
| [HADOOP-3452](https://issues.apache.org/jira/browse/HADOOP-3452) | fsck exit code would be better if non-zero when FS corrupt | Minor | . | Pete Wyckoff | Lohit Vijayarenu |
| [HADOOP-3417](https://issues.apache.org/jira/browse/HADOOP-3417) | JobClient should not have a static configuration for cli parsing | Major | . | Owen O'Malley | Amareshwari Sriramadasu |
| [HADOOP-3405](https://issues.apache.org/jira/browse/HADOOP-3405) | Make mapred internal classes package-local | Major | . | Enis Soztutar | Enis Soztutar |
| [HADOOP-3390](https://issues.apache.org/jira/browse/HADOOP-3390) | Remove deprecated ClientProtocol.abandonFileInProgress() | Major | . | Tsz Wo Nicholas Sze | Tsz Wo Nicholas Sze |
| [HADOOP-3379](https://issues.apache.org/jira/browse/HADOOP-3379) | Document the "stream.non.zero.exit.status.is.failure" knob for streaming | Blocker | documentation | Arun C Murthy | Amareshwari Sriramadasu |
| [HADOOP-3329](https://issues.apache.org/jira/browse/HADOOP-3329) | DatanodeDescriptor objects stored in FSImage may be out dated. | Major | . | Tsz Wo Nicholas Sze | dhruba borthakur |
| [HADOOP-3317](https://issues.apache.org/jira/browse/HADOOP-3317) | add default port for hdfs namenode | Minor | . | Doug Cutting | Doug Cutting |
| [HADOOP-3310](https://issues.apache.org/jira/browse/HADOOP-3310) | Lease recovery for append | Major | . | Tsz Wo Nicholas Sze | Tsz Wo Nicholas Sze |
| [HADOOP-3283](https://issues.apache.org/jira/browse/HADOOP-3283) | Need a mechanism for data nodes to update generation stamps. | Major | . | Tsz Wo Nicholas Sze | Tsz Wo Nicholas Sze |
| [HADOOP-3265](https://issues.apache.org/jira/browse/HADOOP-3265) | Remove deprecated API getFileCacheHints | Major | fs | Lohit Vijayarenu | Lohit Vijayarenu |
| [HADOOP-3226](https://issues.apache.org/jira/browse/HADOOP-3226) | Run combiner when merging spills from map output | Major | . | Chris Douglas | Chris Douglas |
| [HADOOP-3193](https://issues.apache.org/jira/browse/HADOOP-3193) | Discovery of corrupt block reported in name node log | Minor | . | Robert Chansler | Chris Douglas |
| [HADOOP-3184](https://issues.apache.org/jira/browse/HADOOP-3184) | HOD gracefully exclude "bad" nodes during ring formation | Major | contrib/hod | Marco Nicosia | Hemanth Yamijala |
| [HADOOP-3113](https://issues.apache.org/jira/browse/HADOOP-3113) | DFSOutputStream.flush() should flush data to real block file on DataNode. | Major | . | dhruba borthakur | dhruba borthakur |
| [HADOOP-3095](https://issues.apache.org/jira/browse/HADOOP-3095) | Validating input paths and creating splits is slow on S3 | Major | fs, fs/s3 | Tom White | Tom White |
| [HADOOP-3035](https://issues.apache.org/jira/browse/HADOOP-3035) | Data nodes should inform the name-node about block crc errors. | Major | . | Konstantin Shvachko | Lohit Vijayarenu |
| [HADOOP-2909](https://issues.apache.org/jira/browse/HADOOP-2909) | Improve IPC idle connection management | Major | ipc | Hairong Kuang | Hairong Kuang |
| [HADOOP-2865](https://issues.apache.org/jira/browse/HADOOP-2865) | FsShell.ls() should print file attributes first then the path name. | Major | . | Konstantin Shvachko | Edward J. Yoon |
| [HADOOP-2797](https://issues.apache.org/jira/browse/HADOOP-2797) | Withdraw CRC upgrade from HDFS | Critical | . | Robert Chansler | Raghu Angadi |
| [HADOOP-2703](https://issues.apache.org/jira/browse/HADOOP-2703) | New files under lease (before close) still shows up as MISSING files/blocks in fsck | Minor | . | Koji Noguchi | Lohit Vijayarenu |
| [HADOOP-2656](https://issues.apache.org/jira/browse/HADOOP-2656) | Support for upgrading existing cluster to facilitate appends to HDFS files | Major | . | dhruba borthakur | dhruba borthakur |
| [HADOOP-2585](https://issues.apache.org/jira/browse/HADOOP-2585) | Automatic namespace recovery from the secondary image. | Major | . | Konstantin Shvachko | Konstantin Shvachko |
| [HADOOP-2188](https://issues.apache.org/jira/browse/HADOOP-2188) | RPC should send a ping rather than use client timeouts | Major | ipc | Owen O'Malley | Hairong Kuang |
| [HADOOP-2181](https://issues.apache.org/jira/browse/HADOOP-2181) | Input Split details for maps should be logged | Minor | . | Lohit Vijayarenu | Amareshwari Sriramadasu |
| [HADOOP-2065](https://issues.apache.org/jira/browse/HADOOP-2065) | Replication policy for corrupted block | Major | . | Koji Noguchi | Lohit Vijayarenu |
| [HADOOP-1702](https://issues.apache.org/jira/browse/HADOOP-1702) | Reduce buffer copies when data is written to DFS | Major | . | Raghu Angadi | Raghu Angadi |
| [HADOOP-544](https://issues.apache.org/jira/browse/HADOOP-544) | Replace the job, tip and task ids with objects. | Major | . | Owen O'Malley | Enis Soztutar |
### NEW FEATURES:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
| [HADOOP-3502](https://issues.apache.org/jira/browse/HADOOP-3502) | Quota API needs documentation in Forrest | Major | . | Hairong Kuang | Hairong Kuang |
| [HADOOP-3460](https://issues.apache.org/jira/browse/HADOOP-3460) | SequenceFileAsBinaryOutputFormat | Minor | . | Koji Noguchi | Koji Noguchi |
| [HADOOP-3336](https://issues.apache.org/jira/browse/HADOOP-3336) | Direct a subset of namenode RPC events for audit logging | Major | . | Chris Douglas | Chris Douglas |
| [HADOOP-3307](https://issues.apache.org/jira/browse/HADOOP-3307) | Archives in Hadoop. | Major | fs | Mahadev konar | Mahadev konar |
| [HADOOP-3250](https://issues.apache.org/jira/browse/HADOOP-3250) | Extend FileSystem API to allow appending to files | Major | fs | dhruba borthakur | Tsz Wo Nicholas Sze |
| [HADOOP-3246](https://issues.apache.org/jira/browse/HADOOP-3246) | FTP client over HDFS | Major | util | Ankur | Ankur |
| [HADOOP-3230](https://issues.apache.org/jira/browse/HADOOP-3230) | Add command line access to named counters | Major | scripts | Tom White | Tom White |
| [HADOOP-3221](https://issues.apache.org/jira/browse/HADOOP-3221) | Need a "LineBasedTextInputFormat" | Major | . | Milind Bhandarkar | Amareshwari Sriramadasu |
| [HADOOP-3188](https://issues.apache.org/jira/browse/HADOOP-3188) | compaction utility for directories | Major | . | Robert Chansler | Robert Chansler |
| [HADOOP-3187](https://issues.apache.org/jira/browse/HADOOP-3187) | Quotas for name space management | Major | . | Robert Chansler | Hairong Kuang |
| [HADOOP-3074](https://issues.apache.org/jira/browse/HADOOP-3074) | URLStreamHandler for the DFS | Major | util | Christophe Taton | Christophe Taton |
| [HADOOP-3061](https://issues.apache.org/jira/browse/HADOOP-3061) | Writable for single byte and double | Major | io | Andrzej Bialecki | Andrzej Bialecki |
| [HADOOP-3022](https://issues.apache.org/jira/browse/HADOOP-3022) | Fast Cluster Restart | Major | . | Robert Chansler | Konstantin Shvachko |
| [HADOOP-1915](https://issues.apache.org/jira/browse/HADOOP-1915) | adding counters methods using String (as opposed to Enum) | Minor | . | Alejandro Abdelnur | Tom White |
| [HADOOP-1328](https://issues.apache.org/jira/browse/HADOOP-1328) | Hadoop Streaming needs to provide a way for the stream plugin to update global counters | Major | . | Runping Qi | Tom White |
| [HADOOP-930](https://issues.apache.org/jira/browse/HADOOP-930) | Add support for reading regular (non-block-based) files from S3 in S3FileSystem | Major | fs | Tom White | Tom White |
### IMPROVEMENTS:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
| [HADOOP-3599](https://issues.apache.org/jira/browse/HADOOP-3599) | The new setCombineOnceOnly shouldn't take a JobConf, since it is a method on JobConf | Major | . | Owen O'Malley | Owen O'Malley |
| [HADOOP-3572](https://issues.apache.org/jira/browse/HADOOP-3572) | setQuotas usage interface has some minor bugs. | Minor | . | Mahadev konar | Hairong Kuang |
| [HADOOP-3547](https://issues.apache.org/jira/browse/HADOOP-3547) | Improve documentation about distributing native libraries via DistributedCache | Blocker | documentation | Arun C Murthy | Amareshwari Sriramadasu |
| [HADOOP-3535](https://issues.apache.org/jira/browse/HADOOP-3535) | IOUtils.close needs better documentation | Blocker | io | Owen O'Malley | Owen O'Malley |
| [HADOOP-3532](https://issues.apache.org/jira/browse/HADOOP-3532) | Create build targets to create api change reports using jdiff | Major | build | Owen O'Malley | Owen O'Malley |
| [HADOOP-3501](https://issues.apache.org/jira/browse/HADOOP-3501) | deprecate InMemoryFileSystem | Major | fs | Doug Cutting | Doug Cutting |
| [HADOOP-3492](https://issues.apache.org/jira/browse/HADOOP-3492) | add forrest documentation for user archives | Blocker | . | Mahadev konar | Mahadev konar |
| [HADOOP-3467](https://issues.apache.org/jira/browse/HADOOP-3467) | The javadoc for FileSystem.deleteOnExit should have more description | Blocker | documentation | Tsz Wo Nicholas Sze | Tsz Wo Nicholas Sze |
| [HADOOP-3464](https://issues.apache.org/jira/browse/HADOOP-3464) | [HOD] HOD can improve error messages by reporting failures on compute nodes back to hod client | Major | contrib/hod | Vinod Kumar Vavilapalli | Vinod Kumar Vavilapalli |
| [HADOOP-3455](https://issues.apache.org/jira/browse/HADOOP-3455) | IPC.Client synchronisation looks weak | Major | ipc | Steve Loughran | Hairong Kuang |
| [HADOOP-3448](https://issues.apache.org/jira/browse/HADOOP-3448) | Add some more hints of the problem when datanode and namenode don't match | Minor | . | Steve Loughran | Steve Loughran |
| [HADOOP-3434](https://issues.apache.org/jira/browse/HADOOP-3434) | Retain cause of bind failure in Server.bind | Major | . | Steve Loughran | Steve Loughran |
| [HADOOP-3429](https://issues.apache.org/jira/browse/HADOOP-3429) | Increase the buffersize for the streaming parent java process's streams | Major | . | Devaraj Das | Amareshwari Sriramadasu |
| [HADOOP-3406](https://issues.apache.org/jira/browse/HADOOP-3406) | Document controls for profiling maps & reduces | Blocker | documentation | Arun C Murthy | Amareshwari Sriramadasu |
| [HADOOP-3400](https://issues.apache.org/jira/browse/HADOOP-3400) | Facilitate creation of temporary files in HDFS | Major | . | dhruba borthakur | dhruba borthakur |
| [HADOOP-3398](https://issues.apache.org/jira/browse/HADOOP-3398) | ReduceTask::closestPowerOf2 is inefficient | Trivial | . | Chris Douglas | Chris Douglas |
| [HADOOP-3377](https://issues.apache.org/jira/browse/HADOOP-3377) | Use StringUtils#replaceAll instead of | Trivial | . | Brice Arnould | Brice Arnould |
| [HADOOP-3369](https://issues.apache.org/jira/browse/HADOOP-3369) | Fast block processing during name-node startup. | Major | . | Konstantin Shvachko | Konstantin Shvachko |
| [HADOOP-3366](https://issues.apache.org/jira/browse/HADOOP-3366) | Shuffle/Merge improvements | Major | . | Arun C Murthy | Arun C Murthy |
| [HADOOP-3364](https://issues.apache.org/jira/browse/HADOOP-3364) | Faster image and log edits loading. | Major | . | Konstantin Shvachko | Konstantin Shvachko |
| [HADOOP-3355](https://issues.apache.org/jira/browse/HADOOP-3355) | Configuration should accept decimal and hexadecimal values | Major | conf | Amareshwari Sriramadasu | Amareshwari Sriramadasu |
| [HADOOP-3350](https://issues.apache.org/jira/browse/HADOOP-3350) | distcp should permit users to limit the number of maps | Major | . | Chris Douglas | Chris Douglas |
| [HADOOP-3345](https://issues.apache.org/jira/browse/HADOOP-3345) | Enhance the hudson-test-patch target | Minor | build | Nigel Daley | Nigel Daley |
| [HADOOP-3334](https://issues.apache.org/jira/browse/HADOOP-3334) | Move lease handling codes out from FSNamesystem | Major | . | Tsz Wo Nicholas Sze | Tsz Wo Nicholas Sze |
| [HADOOP-3332](https://issues.apache.org/jira/browse/HADOOP-3332) | improving the logging during shuffling | Blocker | . | Runping Qi | Devaraj Das |
| [HADOOP-3308](https://issues.apache.org/jira/browse/HADOOP-3308) | Improve QuickSort by excluding values eq the pivot from the partition | Major | . | Chris Douglas | Chris Douglas |
| [HADOOP-3297](https://issues.apache.org/jira/browse/HADOOP-3297) | The way in which ReduceTask/TaskTracker gets completion events during shuffle can be improved | Major | . | Devaraj Das | Devaraj Das |
| [HADOOP-3295](https://issues.apache.org/jira/browse/HADOOP-3295) | Allow TextOutputFormat to use configurable separators | Minor | io | Zheng Shao | Zheng Shao |
| [HADOOP-3277](https://issues.apache.org/jira/browse/HADOOP-3277) | hod should give better error messages when deallocate is fired on a non-allocated directory. | Minor | contrib/hod | Karam Singh | |
| [HADOOP-3274](https://issues.apache.org/jira/browse/HADOOP-3274) | The default constructor of BytesWritable should not create a 100-byte array. | Minor | io | Tsz Wo Nicholas Sze | Tsz Wo Nicholas Sze |
| [HADOOP-3270](https://issues.apache.org/jira/browse/HADOOP-3270) | Constant DatanodeCommand should be stored in static final immutable variables. | Minor | . | Tsz Wo Nicholas Sze | Tsz Wo Nicholas Sze |
| [HADOOP-3254](https://issues.apache.org/jira/browse/HADOOP-3254) | FSNamesystem.gotHeartbeat(..., Object[] xferResults, Object[] deleteList) should not use Object[] as pass-by-reference parameters | Major | . | Tsz Wo Nicholas Sze | Tsz Wo Nicholas Sze |
| [HADOOP-3177](https://issues.apache.org/jira/browse/HADOOP-3177) | Expose DFSOutputStream.fsync API through the FileSystem interface | Major | . | dhruba borthakur | Tsz Wo Nicholas Sze |
| [HADOOP-3164](https://issues.apache.org/jira/browse/HADOOP-3164) | Use FileChannel.transferTo() when data is read from DataNode. | Major | . | Raghu Angadi | Raghu Angadi |
| [HADOOP-3144](https://issues.apache.org/jira/browse/HADOOP-3144) | better fault tolerance for corrupted text files | Major | . | Joydeep Sen Sarma | Zheng Shao |
| [HADOOP-3096](https://issues.apache.org/jira/browse/HADOOP-3096) | Improve documentation about the Task Execution Environment in the Map-Reduce tutorial | Blocker | documentation | Arun C Murthy | Amareshwari Sriramadasu |
| [HADOOP-3058](https://issues.apache.org/jira/browse/HADOOP-3058) | Hadoop DFS to report more replication metrics | Minor | metrics | Marco Nicosia | Lohit Vijayarenu |
| [HADOOP-3013](https://issues.apache.org/jira/browse/HADOOP-3013) | fsck to show (checksum) corrupted files | Major | . | Koji Noguchi | Lohit Vijayarenu |
| [HADOOP-2987](https://issues.apache.org/jira/browse/HADOOP-2987) | Keep two generations of fsimage | Major | . | Robert Chansler | Konstantin Shvachko |
| [HADOOP-2910](https://issues.apache.org/jira/browse/HADOOP-2910) | Throttle IPC Client/Server during bursts of requests or server slowdown | Major | ipc | Hairong Kuang | Hairong Kuang |
| [HADOOP-2867](https://issues.apache.org/jira/browse/HADOOP-2867) | Add a task's cwd to its LD\_LIBRARY\_PATH | Major | . | Arun C Murthy | Amareshwari Sriramadasu |
| [HADOOP-2857](https://issues.apache.org/jira/browse/HADOOP-2857) | libhdfs: no way to set JVM args other than classpath | Minor | . | Craig Macdonald | Craig Macdonald |
| [HADOOP-2799](https://issues.apache.org/jira/browse/HADOOP-2799) | Replace org.apache.hadoop.io.Closeable with java.io.Closeable | Minor | io | Tsz Wo Nicholas Sze | Tsz Wo Nicholas Sze |
| [HADOOP-2762](https://issues.apache.org/jira/browse/HADOOP-2762) | Better documentation of controls for memory limits on hadoop daemons and Map-Reduce tasks | Blocker | documentation, scripts | Arun C Murthy | Amareshwari Sriramadasu |
| [HADOOP-2661](https://issues.apache.org/jira/browse/HADOOP-2661) | Replicator log should include block id | Minor | . | Raghu Angadi | Raghu Angadi |
| [HADOOP-2461](https://issues.apache.org/jira/browse/HADOOP-2461) | Configuration should trim property names and accept decimal, hexadecimal, and octal numbers | Minor | conf | Tsz Wo Nicholas Sze | Tsz Wo Nicholas Sze |
| [HADOOP-2154](https://issues.apache.org/jira/browse/HADOOP-2154) | Non-interleaved checksums would optimize block transfers. | Major | . | Konstantin Shvachko | Raghu Angadi |
| [HADOOP-2019](https://issues.apache.org/jira/browse/HADOOP-2019) | DistributedFileCache should support .tgz files in addition to jars and zip files | Major | . | Owen O'Malley | Amareshwari Sriramadasu |
| [HADOOP-4](https://issues.apache.org/jira/browse/HADOOP-4) | tool to mount dfs on linux | Major | . | John Xing | Pete Wyckoff |

### BUG FIXES:

| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
| [HADOOP-3947](https://issues.apache.org/jira/browse/HADOOP-3947) | TaskTrackers fail to connect back upon a re-init action | Blocker | . | Amar Kamat | Amareshwari Sriramadasu |
| [HADOOP-3901](https://issues.apache.org/jira/browse/HADOOP-3901) | CLASSPATH in bin/hadoop script is set incorrectly for cygwin | Blocker | scripts | Tsz Wo Nicholas Sze | Tsz Wo Nicholas Sze |
| [HADOOP-3897](https://issues.apache.org/jira/browse/HADOOP-3897) | SecondaryNameNode fails with NullPointerException | Major | . | Lohit Vijayarenu | Lohit Vijayarenu |
| [HADOOP-3884](https://issues.apache.org/jira/browse/HADOOP-3884) | eclipse plugin build is broken with current eclipse versions | Blocker | contrib/eclipse-plugin | Doug Cutting | Doug Cutting |
| [HADOOP-3865](https://issues.apache.org/jira/browse/HADOOP-3865) | SecondaryNameNode runs out of memory | Blocker | . | Lohit Vijayarenu | Lohit Vijayarenu |
| [HADOOP-3855](https://issues.apache.org/jira/browse/HADOOP-3855) | Fix import of MiniDFSCluster in TestCompressedEmptyMapOutputs.java | Blocker | test | Arun C Murthy | Arun C Murthy |
| [HADOOP-3827](https://issues.apache.org/jira/browse/HADOOP-3827) | Jobs with empty map-outputs and intermediate compression fail | Blocker | . | Arun C Murthy | Arun C Murthy |
| [HADOOP-3806](https://issues.apache.org/jira/browse/HADOOP-3806) | Remove debug message from Quicksort | Trivial | . | Chris Douglas | |
| [HADOOP-3798](https://issues.apache.org/jira/browse/HADOOP-3798) | compile-core-test fails to compile | Blocker | build | Mukund Madhugiri | Mukund Madhugiri |
| [HADOOP-3794](https://issues.apache.org/jira/browse/HADOOP-3794) | KFS implementation needs to return directory modification time | Minor | . | Sriram Rao | Sriram Rao |
| [HADOOP-3776](https://issues.apache.org/jira/browse/HADOOP-3776) | NPE in NameNode with unknown blocks | Blocker | . | Raghu Angadi | Raghu Angadi |
| [HADOOP-3774](https://issues.apache.org/jira/browse/HADOOP-3774) | Typos in shell output | Blocker | fs | Tsz Wo Nicholas Sze | Tsz Wo Nicholas Sze |
| [HADOOP-3762](https://issues.apache.org/jira/browse/HADOOP-3762) | Task tracker died due to OOM | Blocker | . | Runping Qi | Doug Cutting |
| [HADOOP-3755](https://issues.apache.org/jira/browse/HADOOP-3755) | the gridmix scripts do not work with hod 0.4 | Major | . | Runping Qi | Runping Qi |
| [HADOOP-3752](https://issues.apache.org/jira/browse/HADOOP-3752) | Audit logging fails to record rename | Blocker | . | Chris Douglas | Chris Douglas |
| [HADOOP-3743](https://issues.apache.org/jira/browse/HADOOP-3743) | -libjars, -files and -archives options do not work with 0.18 | Blocker | . | Mahadev konar | Amareshwari Sriramadasu |
| [HADOOP-3737](https://issues.apache.org/jira/browse/HADOOP-3737) | CompressedWritable throws OutOfMemoryError | Major | io | Grant Glouser | Grant Glouser |
| [HADOOP-3724](https://issues.apache.org/jira/browse/HADOOP-3724) | Namenode does not start due to exception thrown while saving Image | Blocker | . | Lohit Vijayarenu | dhruba borthakur |
| [HADOOP-3718](https://issues.apache.org/jira/browse/HADOOP-3718) | KFS: write(int v) API writes out an integer rather than a byte | Minor | . | Sriram Rao | Sriram Rao |
| [HADOOP-3716](https://issues.apache.org/jira/browse/HADOOP-3716) | KFS listStatus() returns NULL on empty directories | Minor | . | Sriram Rao | Sriram Rao |
| [HADOOP-3706](https://issues.apache.org/jira/browse/HADOOP-3706) | CompositeInputFormat: Unable to wrap custom InputFormats | Major | . | Jingkei Ly | Jingkei Ly |
| [HADOOP-3693](https://issues.apache.org/jira/browse/HADOOP-3693) | Fix documentation for Archives, distcp and native libraries | Blocker | documentation | Amareshwari Sriramadasu | Amareshwari Sriramadasu |
| [HADOOP-3692](https://issues.apache.org/jira/browse/HADOOP-3692) | Fix documentation for Cluster setup and Quick start guides | Blocker | documentation | Amareshwari Sriramadasu | Amareshwari Sriramadasu |
| [HADOOP-3691](https://issues.apache.org/jira/browse/HADOOP-3691) | Fix mapred docs | Blocker | documentation | Amareshwari Sriramadasu | Jothi Padmanabhan |
| [HADOOP-3688](https://issues.apache.org/jira/browse/HADOOP-3688) | Fix up HDFS docs | Blocker | . | Robert Chansler | Robert Chansler |
| [HADOOP-3677](https://issues.apache.org/jira/browse/HADOOP-3677) | Problems with generation stamp upgrade | Blocker | . | Konstantin Shvachko | Raghu Angadi |
| [HADOOP-3673](https://issues.apache.org/jira/browse/HADOOP-3673) | Deadlock in Datanode RPC servers | Blocker | . | dhruba borthakur | Tsz Wo Nicholas Sze |
| [HADOOP-3670](https://issues.apache.org/jira/browse/HADOOP-3670) | JobTracker running out of heap space | Blocker | . | Christian Kunz | Amareshwari Sriramadasu |
| [HADOOP-3668](https://issues.apache.org/jira/browse/HADOOP-3668) | Clean up HOD documentation | Blocker | contrib/hod | Vinod Kumar Vavilapalli | Vinod Kumar Vavilapalli |
| [HADOOP-3653](https://issues.apache.org/jira/browse/HADOOP-3653) | test-patch target not working on hudson.zones.apache.org due to HADOOP-3480 | Blocker | . | Nigel Daley | Brice Arnould |
| [HADOOP-3649](https://issues.apache.org/jira/browse/HADOOP-3649) | ArrayIndexOutOfBounds in FSNamesystem.getBlockLocationsInternal | Blocker | . | Arun C Murthy | Lohit Vijayarenu |
| [HADOOP-3647](https://issues.apache.org/jira/browse/HADOOP-3647) | Corner-case in IFile leads to failed tasks | Blocker | . | Arun C Murthy | Arun C Murthy |
| [HADOOP-3645](https://issues.apache.org/jira/browse/HADOOP-3645) | MetricsTimeVaryingRate returns wrong value for metric\_avg\_time | Blocker | metrics | Lohit Vijayarenu | Lohit Vijayarenu |
| [HADOOP-3639](https://issues.apache.org/jira/browse/HADOOP-3639) | Exception when closing DFSClient while multiple files are open | Blocker | . | Benjamin Gufler | Benjamin Gufler |
| [HADOOP-3635](https://issues.apache.org/jira/browse/HADOOP-3635) | Uncaught exception in DataBlockScanner | Blocker | . | Koji Noguchi | Tsz Wo Nicholas Sze |
| [HADOOP-3630](https://issues.apache.org/jira/browse/HADOOP-3630) | CompositeRecordReader: key and values can be in uninitialized state if files being joined have no records | Major | . | Jingkei Ly | Chris Douglas |
| [HADOOP-3615](https://issues.apache.org/jira/browse/HADOOP-3615) | DatanodeProtocol.versionID should be 16L | Blocker | . | Tsz Wo Nicholas Sze | Tsz Wo Nicholas Sze |
| [HADOOP-3606](https://issues.apache.org/jira/browse/HADOOP-3606) | Update streaming documentation | Blocker | documentation | Amareshwari Sriramadasu | Amareshwari Sriramadasu |
| [HADOOP-3604](https://issues.apache.org/jira/browse/HADOOP-3604) | Reduce stuck at shuffling phase | Blocker | . | Runping Qi | Arun C Murthy |
| [HADOOP-3603](https://issues.apache.org/jira/browse/HADOOP-3603) | Setting spill threshold to 100% fails to detect spill for records | Blocker | . | Chris Douglas | Chris Douglas |
| [HADOOP-3597](https://issues.apache.org/jira/browse/HADOOP-3597) | SortValidator always uses the default file system irrespective of the actual input | Major | test | Jothi Padmanabhan | Jothi Padmanabhan |
| [HADOOP-3593](https://issues.apache.org/jira/browse/HADOOP-3593) | Update MapRed tutorial | Blocker | documentation | Devaraj Das | Devaraj Das |
| [HADOOP-3590](https://issues.apache.org/jira/browse/HADOOP-3590) | Null pointer exception in JobTracker when the task tracker is not yet resolved | Blocker | . | Amar Kamat | Amar Kamat |
| [HADOOP-3588](https://issues.apache.org/jira/browse/HADOOP-3588) | Bug report for archives | Blocker | . | Hairong Kuang | Mahadev konar |
| [HADOOP-3586](https://issues.apache.org/jira/browse/HADOOP-3586) | keep combiner backward compatible with earlier versions of hadoop | Blocker | . | Olga Natkovich | Chris Douglas |
| [HADOOP-3580](https://issues.apache.org/jira/browse/HADOOP-3580) | Using a har file as input for the Sort example fails | Blocker | . | Jothi Padmanabhan | Mahadev konar |
| [HADOOP-3576](https://issues.apache.org/jira/browse/HADOOP-3576) | hadoop dfs -mv throws NullPointerException | Blocker | . | Lohit Vijayarenu | Tsz Wo Nicholas Sze |
| [HADOOP-3575](https://issues.apache.org/jira/browse/HADOOP-3575) | clover target broken after src restructuring | Minor | build | Nigel Daley | Nigel Daley |
| [HADOOP-3571](https://issues.apache.org/jira/browse/HADOOP-3571) | ArrayIndexOutOfBoundsException in BlocksMap$BlockInfo.setPrevious | Blocker | . | Tsz Wo Nicholas Sze | Konstantin Shvachko |
| [HADOOP-3564](https://issues.apache.org/jira/browse/HADOOP-3564) | Sometime after successful hod allocation datanode fails to come up with java.net.BindException for dfs.datanode.ipc.address | Blocker | contrib/hod | Karam Singh | Vinod Kumar Vavilapalli |
| [HADOOP-3561](https://issues.apache.org/jira/browse/HADOOP-3561) | With trash enabled, 'hadoop fs -rmr .' still fully deletes the working dir | Blocker | . | Chris Douglas | Chris Douglas |
| [HADOOP-3560](https://issues.apache.org/jira/browse/HADOOP-3560) | Archives sometimes create empty part files. | Blocker | . | Mahadev konar | Mahadev konar |
| [HADOOP-3559](https://issues.apache.org/jira/browse/HADOOP-3559) | test-libhdfs fails on linux | Blocker | . | Mukund Madhugiri | Lohit Vijayarenu |
| [HADOOP-3552](https://issues.apache.org/jira/browse/HADOOP-3552) | forrest doc for hadoop commands | Blocker | documentation | Sharad Agarwal | Sharad Agarwal |
| [HADOOP-3548](https://issues.apache.org/jira/browse/HADOOP-3548) | The tools.jar is not included in the distribution | Blocker | build | Owen O'Malley | Owen O'Malley |
| [HADOOP-3546](https://issues.apache.org/jira/browse/HADOOP-3546) | TaskTracker re-initialization gets stuck in cleaning up | Blocker | . | Amareshwari Sriramadasu | Amareshwari Sriramadasu |
| [HADOOP-3545](https://issues.apache.org/jira/browse/HADOOP-3545) | archive is failing with "Illegal Capacity" error | Blocker | . | Jothi Padmanabhan | Mahadev konar |
| [HADOOP-3544](https://issues.apache.org/jira/browse/HADOOP-3544) | The command "archive" is missing in the example in docs/hadoop\_archives.html (and pdf) | Blocker | documentation | Jothi Padmanabhan | Jothi Padmanabhan |
| [HADOOP-3542](https://issues.apache.org/jira/browse/HADOOP-3542) | Hadoop archives should not create \_logs file in the final archive directory. | Blocker | . | Mahadev konar | Mahadev konar |
| [HADOOP-3539](https://issues.apache.org/jira/browse/HADOOP-3539) | Cygwin: cygpath displays an error message in running bin/hadoop script | Blocker | scripts | Tsz Wo Nicholas Sze | Tsz Wo Nicholas Sze |
| [HADOOP-3537](https://issues.apache.org/jira/browse/HADOOP-3537) | Datanode not starting up with java.lang.StringIndexOutOfBoundsException in NetworkTopology.remove | Blocker | . | Koji Noguchi | Hairong Kuang |
| [HADOOP-3534](https://issues.apache.org/jira/browse/HADOOP-3534) | The namenode ignores ioexceptions in close | Blocker | . | Owen O'Malley | Tsz Wo Nicholas Sze |
| [HADOOP-3533](https://issues.apache.org/jira/browse/HADOOP-3533) | The api to JobTracker and TaskTracker have changed incompatibly | Blocker | . | Owen O'Malley | Owen O'Malley |
| [HADOOP-3531](https://issues.apache.org/jira/browse/HADOOP-3531) | Hod does not report job tracker failure on hod client side when job tracker fails to come up | Blocker | contrib/hod | Karam Singh | Hemanth Yamijala |
| [HADOOP-3528](https://issues.apache.org/jira/browse/HADOOP-3528) | Metrics FilesCreated and files\_deleted metrics do not match. | Blocker | metrics | Lohit Vijayarenu | Lohit Vijayarenu |
| [HADOOP-3523](https://issues.apache.org/jira/browse/HADOOP-3523) | [HOD] If a job does not exist in Torque's list of jobs, HOD allocate on previously allocated directory fails. | Blocker | contrib/hod | Hemanth Yamijala | Hemanth Yamijala |
| [HADOOP-3521](https://issues.apache.org/jira/browse/HADOOP-3521) | Hadoop mapreduce task metrics, unable to send metrics data. | Blocker | . | Eric Yang | Arun C Murthy |
| [HADOOP-3520](https://issues.apache.org/jira/browse/HADOOP-3520) | Generation stamp upgrade fails TestDFSUpgradeFromImage | Blocker | . | Konstantin Shvachko | dhruba borthakur |
| [HADOOP-3519](https://issues.apache.org/jira/browse/HADOOP-3519) | NPE in DFS FileSystem rename | Blocker | . | Tom White | Hairong Kuang |
| [HADOOP-3517](https://issues.apache.org/jira/browse/HADOOP-3517) | The last InMemory merge may be missed | Blocker | . | Devaraj Das | Arun C Murthy |
| [HADOOP-3516](https://issues.apache.org/jira/browse/HADOOP-3516) | TestHarFileSystem.testArchives fails with NullPointerException | Blocker | test | Nigel Daley | Subru Krishnan |
| [HADOOP-3513](https://issues.apache.org/jira/browse/HADOOP-3513) | Improve NNThroughputBenchmark log messages. | Major | test | Konstantin Shvachko | Konstantin Shvachko |
| [HADOOP-3511](https://issues.apache.org/jira/browse/HADOOP-3511) | Namenode should not restore the root's quota if the quota was not in the image | Blocker | . | Hairong Kuang | Hairong Kuang |
| [HADOOP-3509](https://issues.apache.org/jira/browse/HADOOP-3509) | FSNamesystem.close() throws NullPointerException | Major | . | Tsz Wo Nicholas Sze | Tsz Wo Nicholas Sze |
| [HADOOP-3505](https://issues.apache.org/jira/browse/HADOOP-3505) | omissions in HOD documentation | Blocker | contrib/hod, documentation | Ari Rabkin | Vinod Kumar Vavilapalli |
| [HADOOP-3503](https://issues.apache.org/jira/browse/HADOOP-3503) | Race condition when client and namenode start block recovery simultaneously | Major | . | dhruba borthakur | dhruba borthakur |
| [HADOOP-3496](https://issues.apache.org/jira/browse/HADOOP-3496) | TestHarFileSystem.testArchives fails | Blocker | fs | Amareshwari Sriramadasu | Tom White |
| [HADOOP-3493](https://issues.apache.org/jira/browse/HADOOP-3493) | TestStreamingFailure fails. | Major | . | Amareshwari Sriramadasu | Lohit Vijayarenu |
| [HADOOP-3491](https://issues.apache.org/jira/browse/HADOOP-3491) | Name-node shutdown causes InterruptedException in ResolutionMonitor | Major | . | Konstantin Shvachko | Lohit Vijayarenu |
| [HADOOP-3489](https://issues.apache.org/jira/browse/HADOOP-3489) | NPE in SafeModeMonitor | Major | . | Konstantin Shvachko | Lohit Vijayarenu |
| [HADOOP-3487](https://issues.apache.org/jira/browse/HADOOP-3487) | Balancer should not allocate a thread per block move | Blocker | . | Hairong Kuang | Hairong Kuang |
| [HADOOP-3480](https://issues.apache.org/jira/browse/HADOOP-3480) | Need to update Eclipse template to reflect current trunk | Blocker | build | Tsz Wo Nicholas Sze | Brice Arnould |
| [HADOOP-3476](https://issues.apache.org/jira/browse/HADOOP-3476) | Code cleanup needed in fuse-dfs | Major | . | Tsz Wo Nicholas Sze | Pete Wyckoff |
| [HADOOP-3471](https://issues.apache.org/jira/browse/HADOOP-3471) | TestIndexedSort sometimes fails | Major | test | Chris Douglas | Chris Douglas |
| [HADOOP-3468](https://issues.apache.org/jira/browse/HADOOP-3468) | Compile error: FTPFileSystem.java:26: cannot access org.apache.commons.net.ftp.FTP | Blocker | fs | Tsz Wo Nicholas Sze | Ankur |
| [HADOOP-3463](https://issues.apache.org/jira/browse/HADOOP-3463) | hadoop scripts don't change directory to hadoop\_home | Critical | scripts | Owen O'Malley | Owen O'Malley |
| [HADOOP-3454](https://issues.apache.org/jira/browse/HADOOP-3454) | Text.find incorrectly searches beyond the end of the buffer | Major | . | Chad Whipkey | Chad Whipkey |
| [HADOOP-3453](https://issues.apache.org/jira/browse/HADOOP-3453) | ipc.Client.close() throws NullPointerException | Major | ipc | Tsz Wo Nicholas Sze | Hairong Kuang |
| [HADOOP-3451](https://issues.apache.org/jira/browse/HADOOP-3451) | test-libhdfs fails on Linux | Blocker | test | Mukund Madhugiri | Lohit Vijayarenu |
| [HADOOP-3443](https://issues.apache.org/jira/browse/HADOOP-3443) | map outputs should not be renamed between partitions | Critical | . | Owen O'Malley | Owen O'Malley |
| [HADOOP-3440](https://issues.apache.org/jira/browse/HADOOP-3440) | TaskRunner creates a symlink with name 'null' if a file is added to DistributedCache without fragment | Minor | . | Abhijit Bagri | Devaraj Das |
| [HADOOP-3435](https://issues.apache.org/jira/browse/HADOOP-3435) | test-patch fails if sh != bash | Major | . | Brice Arnould | Brice Arnould |
| [HADOOP-3427](https://issues.apache.org/jira/browse/HADOOP-3427) | In ReduceTask::fetchOutputs, wait for result can be improved slightly | Major | . | Devaraj Das | Devaraj Das |
| [HADOOP-3424](https://issues.apache.org/jira/browse/HADOOP-3424) | the value returned from getPartition should be checked to make sure it is in the range 0..#reduces-1 | Major | . | Owen O'Malley | Chris Douglas |
| [HADOOP-3419](https://issues.apache.org/jira/browse/HADOOP-3419) | TestFsck fails once in a while on WINDOWS/LINUX | Major | . | Lohit Vijayarenu | Lohit Vijayarenu |
| [HADOOP-3418](https://issues.apache.org/jira/browse/HADOOP-3418) | NameNode does not restart if parent directory of a "FileUnderConstruction" is deleted. | Blocker | . | Raghu Angadi | Tsz Wo Nicholas Sze |
| [HADOOP-3413](https://issues.apache.org/jira/browse/HADOOP-3413) | SequenceFile.Reader doesn't use the Serialization framework | Critical | io | Arun C Murthy | Tom White |
| [HADOOP-3410](https://issues.apache.org/jira/browse/HADOOP-3410) | KFS implementation needs to return file modification time | Minor | . | Sriram Rao | Sriram Rao |
| [HADOOP-3409](https://issues.apache.org/jira/browse/HADOOP-3409) | NameNode should save the root inode into fsimage | Major | . | Hairong Kuang | Hairong Kuang |
| [HADOOP-3408](https://issues.apache.org/jira/browse/HADOOP-3408) | Change FSNamesystem status metrics to IntValue | Major | metrics | Lohit Vijayarenu | Lohit Vijayarenu |
| [HADOOP-3403](https://issues.apache.org/jira/browse/HADOOP-3403) | Job tracker's ExpireTrackers thread gets NullPointerException if a tasktracker is lost. | Blocker | . | Amareshwari Sriramadasu | Arun C Murthy |
| [HADOOP-3401](https://issues.apache.org/jira/browse/HADOOP-3401) | Update FileBench to use the "work" directory for SequenceFileOutputFormat | Major | test | Chris Douglas | Chris Douglas |
| [HADOOP-3399](https://issues.apache.org/jira/browse/HADOOP-3399) | Debug log not removed in ipc client | Trivial | ipc | Raghu Angadi | Raghu Angadi |
| [HADOOP-3396](https://issues.apache.org/jira/browse/HADOOP-3396) | Unit test TestDatanodeBlockScanner fails on Windows | Critical | . | Mukund Madhugiri | Lohit Vijayarenu |
| [HADOOP-3393](https://issues.apache.org/jira/browse/HADOOP-3393) | TestHDFSServerPorts fails on LINUX (NFS mounted directory) and on WINDOWS | Major | . | Lohit Vijayarenu | Lohit Vijayarenu |
| [HADOOP-3391](https://issues.apache.org/jira/browse/HADOOP-3391) | HADOOP-3248 introduced a findbugs warning. | Minor | . | Raghu Angadi | Raghu Angadi |
| [HADOOP-3388](https://issues.apache.org/jira/browse/HADOOP-3388) | TestDatanodeBlockScanner failed while trying to corrupt replicas | Major | test | dhruba borthakur | dhruba borthakur |
| [HADOOP-3381](https://issues.apache.org/jira/browse/HADOOP-3381) | INode interlinks can multiply effect of memory leaks | Major | . | Raghu Angadi | Raghu Angadi |
| [HADOOP-3376](https://issues.apache.org/jira/browse/HADOOP-3376) | [HOD] HOD should have a way to detect and deal with clusters that violate/exceed resource manager limits | Major | contrib/hod | Vinod Kumar Vavilapalli | Vinod Kumar Vavilapalli |
| [HADOOP-3375](https://issues.apache.org/jira/browse/HADOOP-3375) | Lease paths are sometimes not removed from LeaseManager.sortedLeasesByPath | Blocker | . | Tsz Wo Nicholas Sze | Tsz Wo Nicholas Sze |
| [HADOOP-3371](https://issues.apache.org/jira/browse/HADOOP-3371) | MBeanUtil dumps stacktrace from registerMBean | Minor | metrics | Lohit Vijayarenu | Lohit Vijayarenu |
| [HADOOP-3365](https://issues.apache.org/jira/browse/HADOOP-3365) | SequenceFile.Sorter.MergeQueue.next does an unnecessary copy of the key | Major | io | Arun C Murthy | Devaraj Das |
| [HADOOP-3363](https://issues.apache.org/jira/browse/HADOOP-3363) | HDFS throws an InconsistentFSStateException when the name node starts up on a directory that isn't formatted | Blocker | . | Steve Loughran | Konstantin Shvachko |
| [HADOOP-3351](https://issues.apache.org/jira/browse/HADOOP-3351) | Fix history viewer | Major | . | Amareshwari Sriramadasu | Amareshwari Sriramadasu |
| [HADOOP-3349](https://issues.apache.org/jira/browse/HADOOP-3349) | FSNamesystem.changeLease(src, dst) incorrectly updates the paths inside a lease | Major | . | Tsz Wo Nicholas Sze | Tsz Wo Nicholas Sze |
| [HADOOP-3348](https://issues.apache.org/jira/browse/HADOOP-3348) | TestUrlStreamHandler hangs on LINUX | Major | fs | Lohit Vijayarenu | Lohit Vijayarenu |
| [HADOOP-3340](https://issues.apache.org/jira/browse/HADOOP-3340) | hadoop dfs metrics shows 0 | Major | metrics | Eric Yang | Lohit Vijayarenu |
| [HADOOP-3339](https://issues.apache.org/jira/browse/HADOOP-3339) | DFS Write pipeline does not detect defective datanode correctly if it times out. | Major | . | Raghu Angadi | Raghu Angadi |
| [HADOOP-3338](https://issues.apache.org/jira/browse/HADOOP-3338) | trunk doesn't compile after HADOOP-544 was committed | Blocker | . | Nigel Daley | Christophe Taton |
| [HADOOP-3337](https://issues.apache.org/jira/browse/HADOOP-3337) | Name-node fails to start because DatanodeInfo format changed. | Blocker | . | Konstantin Shvachko | Tsz Wo Nicholas Sze |
| [HADOOP-3335](https://issues.apache.org/jira/browse/HADOOP-3335) | 'make clean' in src/c++/libhdfs does 'rm -rf /\*' | Critical | build | Doug Cutting | Doug Cutting |
| [HADOOP-3333](https://issues.apache.org/jira/browse/HADOOP-3333) | job failing because of reassigning same tasktracker to failing tasks | Blocker | . | Christian Kunz | Jothi Padmanabhan |
| [HADOOP-3326](https://issues.apache.org/jira/browse/HADOOP-3326) | ReduceTask should not sleep for 200 ms while waiting for merge to finish | Major | . | Owen O'Malley | Sharad Agarwal |
| [HADOOP-3320](https://issues.apache.org/jira/browse/HADOOP-3320) | NPE in NetworkTopology.getDistance() | Blocker | . | Konstantin Shvachko | Hairong Kuang |
| [HADOOP-3318](https://issues.apache.org/jira/browse/HADOOP-3318) | Hadoop streaming doesn't recognize "Darwin" as an OS but Soylatte (OpenJDK port to Mac) reports that rather than "Mac OS X" | Major | . | Sam Pullara | Sam Pullara |
| [HADOOP-3313](https://issues.apache.org/jira/browse/HADOOP-3313) | RPC::Invoker makes unnecessary calls to System.currentTimeMillis | Minor | . | Chris Douglas | Chris Douglas |
| [HADOOP-3309](https://issues.apache.org/jira/browse/HADOOP-3309) | Unit test fails on Windows: org.apache.hadoop.mapred.TestMiniMRDFSSort.unknown | Major | . | Mukund Madhugiri | Lohit Vijayarenu |
| [HADOOP-3301](https://issues.apache.org/jira/browse/HADOOP-3301) | Misleading error message when S3 URI contains hostname containing an underscore | Major | fs/s3 | Tom White | Tom White |
| [HADOOP-3300](https://issues.apache.org/jira/browse/HADOOP-3300) | FindBugs warnings in NetworkTopology | Major | . | Tom White | Tom White |
| [HADOOP-3299](https://issues.apache.org/jira/browse/HADOOP-3299) | org.apache.hadoop.mapred.join.CompositeInputFormat does not initialize TextInput format files with the configuration resulting in a NullPointerException | Major | io | Jason | Chris Douglas |
| [HADOOP-3296](https://issues.apache.org/jira/browse/HADOOP-3296) | Some levels are skipped while creating the task cache in JobInProgress | Major | . | Amar Kamat | Amar Kamat |
| [HADOOP-3282](https://issues.apache.org/jira/browse/HADOOP-3282) | TestCheckpoint occasionally fails because of the port issues. | Major | test | Konstantin Shvachko | Konstantin Shvachko |
| [HADOOP-3272](https://issues.apache.org/jira/browse/HADOOP-3272) | Reduce redundant copy of Block object in BlocksMap.map hash map | Major | . | Lohit Vijayarenu | Lohit Vijayarenu |
| [HADOOP-3269](https://issues.apache.org/jira/browse/HADOOP-3269) | NameNode doesn't startup when restarted after running an MR job | Blocker | . | Devaraj Das | Tsz Wo Nicholas Sze |
| [HADOOP-3268](https://issues.apache.org/jira/browse/HADOOP-3268) | TestUrlStreamHandler.testFileUrls fails on Windows | Major | test | Nigel Daley | Christophe Taton |
| [HADOOP-3259](https://issues.apache.org/jira/browse/HADOOP-3259) | Configuration.substituteVars() needs to handle security exceptions | Trivial | conf | Steve Loughran | Edward J. Yoon |
| [HADOOP-3248](https://issues.apache.org/jira/browse/HADOOP-3248) | Improve Namenode startup performance | Major | . | girish vaitheeswaran | dhruba borthakur |
| [HADOOP-3240](https://issues.apache.org/jira/browse/HADOOP-3240) | TestJobShell should not create files in the current directory | Blocker | test | Tsz Wo Nicholas Sze | Mahadev konar |
| [HADOOP-3232](https://issues.apache.org/jira/browse/HADOOP-3232) | Datanodes time out | Critical | . | Johan Oskarsson | Johan Oskarsson |
| [HADOOP-3203](https://issues.apache.org/jira/browse/HADOOP-3203) | TaskTracker::localizeJob doesn't provide the correct size to LocalDirAllocator | Major | . | Chris Douglas | Amareshwari Sriramadasu |
| [HADOOP-3176](https://issues.apache.org/jira/browse/HADOOP-3176) | Change lease record when an open-for-write file gets renamed | Major | . | dhruba borthakur | dhruba borthakur |
| [HADOOP-3160](https://issues.apache.org/jira/browse/HADOOP-3160) | remove exists() from ClientProtocol and NameNode | Major | . | Lohit Vijayarenu | Lohit Vijayarenu |
| [HADOOP-3151](https://issues.apache.org/jira/browse/HADOOP-3151) | Hod should have better error messages. | Major | contrib/hod | Arkady Borkovsky | Vinod Kumar Vavilapalli |
| [HADOOP-3135](https://issues.apache.org/jira/browse/HADOOP-3135) | if the 'mapred.system.dir' in the client jobconf is different from the JobTracker's value job submission fails | Critical | . | Alejandro Abdelnur | Subru Krishnan |
| [HADOOP-3130](https://issues.apache.org/jira/browse/HADOOP-3130) | Shuffling takes too long to get the last map output. | Major | . | Runping Qi | Amar Kamat |
| [HADOOP-3127](https://issues.apache.org/jira/browse/HADOOP-3127) | rm /user/\<username\>/.Trash/\_\_\_\_ only moves it back to .Trash | Minor | fs | Koji Noguchi | Brice Arnould |
| [HADOOP-3119](https://issues.apache.org/jira/browse/HADOOP-3119) | Text.getBytes() | Trivial | . | Andrew Gudkov | Tim Nelson |
| [HADOOP-3109](https://issues.apache.org/jira/browse/HADOOP-3109) | RPC should accept connections even when the RPC queue is full (i.e. undo part of HADOOP-2910) | Blocker | . | Sanjay Radia | Hairong Kuang |
| [HADOOP-3101](https://issues.apache.org/jira/browse/HADOOP-3101) | 'bin/hadoop job' should display the help and silently exit | Minor | . | Amar Kamat | Edward J. Yoon |
| [HADOOP-3085](https://issues.apache.org/jira/browse/HADOOP-3085) | pushMetric() method of various metric util classes should catch exceptions | Major | metrics | Runping Qi | Chris Douglas |
| [HADOOP-3076](https://issues.apache.org/jira/browse/HADOOP-3076) | [HOD] If a cluster directory is specified as a relative path, an existing script.exitcode file will not be deleted. | Blocker | contrib/hod | Hemanth Yamijala | Vinod Kumar Vavilapalli |
| [HADOOP-2961](https://issues.apache.org/jira/browse/HADOOP-2961) | [HOD] Hod expects port info though external host is not mentioned. | Minor | contrib/hod | Hemanth Yamijala | Vinod Kumar Vavilapalli |
| [HADOOP-2930](https://issues.apache.org/jira/browse/HADOOP-2930) | make {start,stop}-balancer.sh work even if hadoop-daemon.sh isn't in the PATH | Trivial | scripts | Spiros Papadimitriou | Spiros Papadimitriou |
| [HADOOP-2928](https://issues.apache.org/jira/browse/HADOOP-2928) | Remove deprecated methods getContentLength() in ClientProtocol, NameNode, FileSystem, DistributedFileSystem and DFSClient | Blocker | . | Tsz Wo Nicholas Sze | Lohit Vijayarenu |
| [HADOOP-2905](https://issues.apache.org/jira/browse/HADOOP-2905) | fsck -move triggers NPE in namenode | Major | . | Michael Bieniosek | Lohit Vijayarenu |
| [HADOOP-2793](https://issues.apache.org/jira/browse/HADOOP-2793) | Links for worst performing shuffle tasks are broken in Analyze Job. | Minor | . | Amar Kamat | Amareshwari Sriramadasu |
| [HADOOP-2669](https://issues.apache.org/jira/browse/HADOOP-2669) | DFS client lost lease during writing into DFS files | Major | . | Runping Qi | dhruba borthakur |
| [HADOOP-2565](https://issues.apache.org/jira/browse/HADOOP-2565) | DFSPath cache of FileStatus can become stale | Major | . | Doug Cutting | Tsz Wo Nicholas Sze |
| [HADOOP-2427](https://issues.apache.org/jira/browse/HADOOP-2427) | Cleanup of mapred.local.dir after maptask is complete | Major | . | Lohit Vijayarenu | Amareshwari Sriramadasu |
| [HADOOP-2393](https://issues.apache.org/jira/browse/HADOOP-2393) | TaskTracker locks up removing job files within a synchronized method | Critical | . | Joydeep Sen Sarma | Amareshwari Sriramadasu |
| [HADOOP-2294](https://issues.apache.org/jira/browse/HADOOP-2294) | In hdfs.h , the comment says you release the result of a hdfsListDirectory with a freehdfsFileInfo, but should say hdfsFreeFileInfo | Trivial | . | Dick King | Craig Macdonald |
| [HADOOP-2132](https://issues.apache.org/jira/browse/HADOOP-2132) | Killing successfully completed jobs moves them to failed | Critical | . | Srikanth Kakani | Jothi Padmanabhan |
| [HADOOP-2095](https://issues.apache.org/jira/browse/HADOOP-2095) | Reducer failed due to Out ofMemory | Major | . | Runping Qi | Arun C Murthy |
| [HADOOP-1318](https://issues.apache.org/jira/browse/HADOOP-1318) | Do not fail completed maps on lost tasktrackers if '-reducer NONE' is specified | Minor | . | Arun C Murthy | Amareshwari Sriramadasu |
| [HADOOP-236](https://issues.apache.org/jira/browse/HADOOP-236) | job tracker should refuse connection from a task tracker with a different version number | Major | . | Hairong Kuang | Sharad Agarwal |
### TESTS:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
| [HADOOP-3100](https://issues.apache.org/jira/browse/HADOOP-3100) | Develop tests to test the DFS command line interface | Major | test | Mukund Madhugiri | Mukund Madhugiri |
### SUB-TASKS:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
### OTHER:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
| [HADOOP-3541](https://issues.apache.org/jira/browse/HADOOP-3541) | Namespace recovery from the secondary image should be documented. | Blocker | documentation | Konstantin Shvachko | Konstantin Shvachko |
| [HADOOP-2984](https://issues.apache.org/jira/browse/HADOOP-2984) | Distcp should have forrest documentation | Blocker | util | Owen O'Malley | Chris Douglas |
| [HADOOP-2632](https://issues.apache.org/jira/browse/HADOOP-2632) | Discussion of fsck operation in the permissions regime | Major | . | Robert Chansler | Robert Chansler |

@ -0,0 +1,545 @@
<!---
# Licensed to the Apache Software Foundation (ASF) under one
# or more contributor license agreements. See the NOTICE file
# distributed with this work for additional information
# regarding copyright ownership. The ASF licenses this file
# to you under the Apache License, Version 2.0 (the
# "License"); you may not use this file except in compliance
# with the License. You may obtain a copy of the License at
#
# http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing, software
# distributed under the License is distributed on an "AS IS" BASIS,
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# limitations under the License.
-->
# Apache Hadoop 0.18.0 Release Notes
These release notes cover new developer and user-facing incompatibilities, features, and major improvements.
---
* [HADOOP-3837](https://issues.apache.org/jira/browse/HADOOP-3837) | *Major* | **hadop streaming does not use progress reporting to detect hung tasks**
Changed streaming tasks to adhere to task timeout value specified in the job configuration.
---
* [HADOOP-3808](https://issues.apache.org/jira/browse/HADOOP-3808) | *Blocker* | **[HOD] Include job tracker RPC in notes attribute after job submission**
Modified HOD to include the RPC port of the JobTracker in the 'notes' attribute of the resource manager. The RPC port is included as the string 'Mapred RPC Port:\<port number\>'. Tools that depend on the value of the notes attribute must change to parse this new value.
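
A tool that reads the notes attribute could pick the port out with a small pattern match. The following is a minimal sketch, not HOD code: the function name and the surrounding text in the sample notes string are assumptions; only the 'Mapred RPC Port:\<port number\>' fragment comes from the note above.

```python
import re

# Matches the fragment described in the release note. Everything else that may
# appear in the notes attribute is unknown here and is simply skipped over.
_RPC_PORT = re.compile(r"Mapred RPC Port:\s*(\d+)")


def extract_mapred_rpc_port(notes):
    """Return the JobTracker RPC port found in a notes string, or None."""
    match = _RPC_PORT.search(notes)
    return int(match.group(1)) if match else None
```

Returning None when the fragment is absent lets a caller keep working against clusters allocated by an older HOD.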
---
* [HADOOP-3703](https://issues.apache.org/jira/browse/HADOOP-3703) | *Blocker* | **[HOD] logcondense needs to use the new pattern of output in hadoop dfs -lsr**
Modified logcondense.py to use the new format of hadoop dfs -lsr output. This version of logcondense would not work with previous versions of Hadoop and hence is incompatible.
---
* [HADOOP-3683](https://issues.apache.org/jira/browse/HADOOP-3683) | *Major* | **Hadoop dfs metric FilesListed shows number of files listed instead of operations**
Changed FileListed to getNumGetListingOps and added CreateFileOps, DeleteFileOps and AddBlockOps metrics.
---
* [HADOOP-3677](https://issues.apache.org/jira/browse/HADOOP-3677) | *Blocker* | **Problems with generation stamp upgrade**
Simplified generation stamp upgrade by making it a local upgrade on datanodes. Deleted distributed upgrade.
---
* [HADOOP-3665](https://issues.apache.org/jira/browse/HADOOP-3665) | *Minor* | **WritableComparator newKey() fails for NullWritable**
**WARNING: No release note provided for this incompatible change.**
---
* [HADOOP-3610](https://issues.apache.org/jira/browse/HADOOP-3610) | *Blocker* | **[HOD] HOD does not automatically create a cluster directory for the script option**
Modified HOD to automatically create a cluster directory if the one specified with the script command does not exist.
---
* [HADOOP-3598](https://issues.apache.org/jira/browse/HADOOP-3598) | *Blocker* | **Map-Reduce framework needlessly creates temporary \_${taskid} directories for Maps**
Changed Map-Reduce framework to no longer create temporary task output directories (${mapred.out.dir}/\_temporary/\_${taskid}) for staging outputs if staging outputs isn't necessary.
---
* [HADOOP-3569](https://issues.apache.org/jira/browse/HADOOP-3569) | *Minor* | **KFS input stream read() returns 4 bytes instead of 1**
Fixed KFS to have read() read and return 1 byte instead of 4.
---
* [HADOOP-3564](https://issues.apache.org/jira/browse/HADOOP-3564) | *Blocker* | **Sometime after successful hod allocation datanode fails to come up with java.net.BindException for dfs.datanode.ipc.address**
Modified HOD to generate the dfs.datanode.ipc.address parameter in the hadoop-site.xml of datanodes that it launches.
---
* [HADOOP-3512](https://issues.apache.org/jira/browse/HADOOP-3512) | *Major* | **Split map/reduce tools into separate jars**
Separated Distcp, Logalyzer and Archiver into a tools jar.
---
* [HADOOP-3486](https://issues.apache.org/jira/browse/HADOOP-3486) | *Major* | **Change default for initial block report to 0 sec and document it in hadoop-defaults.xml**
Changed the default value of dfs.blockreport.initialDelay to be 0 seconds.
---
* [HADOOP-3483](https://issues.apache.org/jira/browse/HADOOP-3483) | *Major* | **[HOD] Improvements with cluster directory handling**
Modified HOD to create a cluster directory if one does not exist and to auto-deallocate a cluster while reallocating it, if it is already dead.
---
* [HADOOP-3464](https://issues.apache.org/jira/browse/HADOOP-3464) | *Major* | **[HOD] HOD can improve error messages by reporting failures on compute nodes back to hod client**
Implemented a mechanism to transfer HOD errors that occur on compute nodes to the submit node running the HOD client, so users have good feedback on why an allocation failed.
---
* [HADOOP-3460](https://issues.apache.org/jira/browse/HADOOP-3460) | *Minor* | **SequenceFileAsBinaryOutputFormat**
Created SequenceFileAsBinaryOutputFormat to write raw bytes as keys and values to a SequenceFile.
---
* [HADOOP-3459](https://issues.apache.org/jira/browse/HADOOP-3459) | *Major* | **Change dfs -ls listing to closely match format on Linux**
Changed the output of the "fs -ls" command to more closely match familiar Linux format. Applications that parse the command output should be reviewed.
---
* [HADOOP-3452](https://issues.apache.org/jira/browse/HADOOP-3452) | *Minor* | **fsck exit code would be better if non-zero when FS corrupt**
Changed exit status of fsck to report whether the file system is healthy or corrupt.
---
* [HADOOP-3429](https://issues.apache.org/jira/browse/HADOOP-3429) | *Major* | **Increase the buffersize for the streaming parent java process's streams**
Increased the size of the buffer used in the communication between the Java task and the Streaming process to 128KB.
---
* [HADOOP-3427](https://issues.apache.org/jira/browse/HADOOP-3427) | *Major* | **In ReduceTask::fetchOutputs, wait for result can be improved slightly**
Changed shuffle scheduler policy to wait for notifications from shuffle threads before scheduling more.
---
* [HADOOP-3417](https://issues.apache.org/jira/browse/HADOOP-3417) | *Major* | **JobClient should not have a static configuration for cli parsing**
Removed the public class org.apache.hadoop.mapred.JobShell.
Command line options -libjars, -files and -archives are moved to GenericCommands. Thus applications have to implement org.apache.hadoop.util.Tool to use the options.
---
* [HADOOP-3405](https://issues.apache.org/jira/browse/HADOOP-3405) | *Major* | **Make mapred internal classes package-local**
Refactored previously public classes MapTaskStatus, ReduceTaskStatus, JobSubmissionProtocol, CompletedJobStatusStore to be package local.
---
* [HADOOP-3390](https://issues.apache.org/jira/browse/HADOOP-3390) | *Major* | **Remove deprecated ClientProtocol.abandonFileInProgress()**
Removed deprecated ClientProtocol.abandonFileInProgress().
---
* [HADOOP-3379](https://issues.apache.org/jira/browse/HADOOP-3379) | *Blocker* | **Document the "stream.non.zero.exit.status.is.failure" knob for streaming**
Set default value for configuration property "stream.non.zero.exit.status.is.failure" to be "true".
---
* [HADOOP-3376](https://issues.apache.org/jira/browse/HADOOP-3376) | *Major* | **[HOD] HOD should have a way to detect and deal with clusters that violate/exceed resource manager limits**
Modified HOD client to look for specific messages related to resource limit overruns and take appropriate actions - such as either failing to allocate the cluster, or issuing a warning to the user. A tool is provided, specific to Maui and Torque, that will set these specific messages.
---
* [HADOOP-3366](https://issues.apache.org/jira/browse/HADOOP-3366) | *Major* | **Shuffle/Merge improvements**
Improved shuffle so that all fetched map-outputs are kept in-memory before being merged by stalling the shuffle so that the in-memory merge executes and frees up memory for the shuffle.
---
* [HADOOP-3355](https://issues.apache.org/jira/browse/HADOOP-3355) | *Major* | **Configuration should accept decimal and hexadecimal values**
Added support for hexadecimal values in Configuration.
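
The idea of accepting both notations can be illustrated with a short sketch. This is not the Configuration implementation: the function name is hypothetical, and the assumption that hexadecimal values carry a "0x" prefix is not stated in the note.

```python
def parse_int_setting(text):
    """Parse a decimal or 0x-prefixed hexadecimal integer setting.

    The 0x prefix convention is an assumption made for this sketch.
    """
    value = text.strip()
    unsigned = value.lstrip("+-")
    base = 16 if unsigned.lower().startswith("0x") else 10
    return int(value, base)
```

For example, `parse_int_setting("0x1F")` and `parse_int_setting("31")` both yield 31.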
---
* [HADOOP-3339](https://issues.apache.org/jira/browse/HADOOP-3339) | *Major* | **DFS Write pipeline does not detect defective datanode correctly if it times out.**
Improved failure handling of last Data Node in write pipeline.
---
* [HADOOP-3336](https://issues.apache.org/jira/browse/HADOOP-3336) | *Major* | **Direct a subset of namenode RPC events for audit logging**
Added a log4j appender that emits events from FSNamesystem for audit logging.
---
* [HADOOP-3329](https://issues.apache.org/jira/browse/HADOOP-3329) | *Major* | **DatanodeDescriptor objects stored in FSImage may be out dated.**
Changed format of file system image to not store locations of last block.
---
* [HADOOP-3326](https://issues.apache.org/jira/browse/HADOOP-3326) | *Major* | **ReduceTask should not sleep for 200 ms while waiting for merge to finish**
Changed fetchOutputs() so that LocalFSMerger and InMemFSMergeThread threads are spawned only once. The thread gets notified when something is ready for merge. The merge happens when thresholds are met.
---
* [HADOOP-3317](https://issues.apache.org/jira/browse/HADOOP-3317) | *Minor* | **add default port for hdfs namenode**
Changed the default port for "hdfs:" URIs to be 8020, so that one may simply use URIs of the form "hdfs\://example.com/dir/file".
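
As an illustration of the defaulting rule, the sketch below resolves the host and port of an "hdfs:" URI and falls back to 8020 when no port is given. It is a standalone model, not Hadoop code; the function and constant names are assumptions.

```python
from urllib.parse import urlsplit

DEFAULT_NAMENODE_PORT = 8020  # default stated in the release note


def namenode_address(uri):
    """Return (host, port) for an hdfs: URI, applying the default port."""
    parts = urlsplit(uri)
    if parts.scheme != "hdfs":
        raise ValueError("not an hdfs URI: {!r}".format(uri))
    port = parts.port if parts.port is not None else DEFAULT_NAMENODE_PORT
    return parts.hostname, port
```

With this rule, "hdfs://example.com/dir/file" resolves to port 8020, while an explicit port such as "hdfs://example.com:9000/dir/file" is left untouched.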
---
* [HADOOP-3310](https://issues.apache.org/jira/browse/HADOOP-3310) | *Major* | **Lease recovery for append**
Implemented Lease Recovery to sync the last block of a file. Added ClientDatanodeProtocol for client-triggered block recovery. Changed DatanodeProtocol to support block synchronization. Changed InterDatanodeProtocol to support block update.
---
* [HADOOP-3307](https://issues.apache.org/jira/browse/HADOOP-3307) | *Major* | **Archives in Hadoop.**
Introduced archive feature to Hadoop. A Map/Reduce job can be run to create an archive with indexes. A FileSystem abstraction is provided over the archive.
---
* [HADOOP-3299](https://issues.apache.org/jira/browse/HADOOP-3299) | *Major* | **org.apache.hadoop.mapred.join.CompositeInputFormat does not initialize TextInput format files with the configuration resulting in an NullPointerException**
Changed the TextInputFormat and KeyValueTextInput classes to initialize the compressionCodecs member variable before dereferencing it.
---
* [HADOOP-3283](https://issues.apache.org/jira/browse/HADOOP-3283) | *Major* | **Need a mechanism for data nodes to update generation stamps.**
Added an IPC server in DataNode and a new IPC protocol InterDatanodeProtocol. Added conf properties dfs.datanode.ipc.address and dfs.datanode.handler.count with defaults "0.0.0.0:50020" and 3, respectively.
Changed the serialization in DatanodeRegistration and DatanodeInfo, and therefore, updated the versionID in ClientProtocol, DatanodeProtocol, NamenodeProtocol.
---
* [HADOOP-3265](https://issues.apache.org/jira/browse/HADOOP-3265) | *Major* | **Remove deprecated API getFileCacheHints**
Removed deprecated API getFileCacheHints.
---
* [HADOOP-3246](https://issues.apache.org/jira/browse/HADOOP-3246) | *Major* | **FTP client over HDFS**
Introduced an FTPFileSystem backed by Apache Commons FTPClient to directly store data into HDFS.
---
* [HADOOP-3232](https://issues.apache.org/jira/browse/HADOOP-3232) | *Critical* | **Datanodes time out**
Changed 'du' command to run in a separate thread so that it does not block the user.
---
* [HADOOP-3230](https://issues.apache.org/jira/browse/HADOOP-3230) | *Major* | **Add command line access to named counters**
Added command line tool "job -counter \<job-id\> \<group-name\> \<counter-name\>" to access counters.
---
* [HADOOP-3226](https://issues.apache.org/jira/browse/HADOOP-3226) | *Major* | **Run combiner when merging spills from map output**
Changed policy for running combiner. The combiner may be run multiple times as the map's output is sorted and merged. Additionally, it may be run on the reduce side as data is merged. The old semantics are available in Hadoop 0.18 if the user calls:
job.setCombineOnlyOnce(true);
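
Because the combiner may now run more than once, a combiner is only safe if re-applying it to its own output does not change the final result. The sketch below is a plain Python simulation, not Hadoop code; all names in it are hypothetical. It contrasts a summing combiner with a record-counting combiner under a reducer that sums its input.

```python
def sum_combiner(values):
    """Safe: summing partial sums gives the same total."""
    return [sum(values)]


def count_combiner(values):
    """Unsafe: only correct if it sees each raw record exactly once."""
    return [len(values)]


def simulate(spills, combiner, combine_again_on_merge):
    """Combine each spill, optionally combine again on merge, then reduce."""
    partials = [v for spill in spills for v in combiner(spill)]
    if combine_again_on_merge:
        partials = combiner(partials)
    return sum(partials)  # the reducer sums whatever reaches it
```

With spills `[[1, 1], [1]]`, the summing combiner gives 3 either way, while the counting combiner gives 3 when run once and 2 when run again on merge. Jobs with a combiner of the second kind are the ones that would need the old semantics.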
---
* [HADOOP-3221](https://issues.apache.org/jira/browse/HADOOP-3221) | *Major* | **Need a "LineBasedTextInputFormat"**
Added org.apache.hadoop.mapred.lib.NLineInputFormat, which splits N lines of input as one split. N can be specified by configuration property "mapred.line.input.format.linespermap", which defaults to 1.
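
The grouping behaviour can be pictured with a few lines of Python. This is a simplified model that works on an in-memory list of lines; the function name is an assumption and the real input format operates on files, not lists.

```python
def split_lines(lines, lines_per_split=1):
    """Group input lines into splits of N lines; the last split may be shorter."""
    if lines_per_split < 1:
        raise ValueError("lines_per_split must be at least 1")
    return [
        lines[start:start + lines_per_split]
        for start in range(0, len(lines), lines_per_split)
    ]
```

With the default of 1, every input line becomes its own split, so each map task handles exactly one line.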
---
* [HADOOP-3193](https://issues.apache.org/jira/browse/HADOOP-3193) | *Minor* | **Discovery of corrupt block reported in name node log**
Added reporter to FSNamesystem stateChangeLog, and a new metric to track the number of corrupted replicas.
---
* [HADOOP-3187](https://issues.apache.org/jira/browse/HADOOP-3187) | *Major* | **Quotas for name space management**
Introduced directory quota as hard limits on the number of names in the tree rooted at that directory. An administrator may set quotas on individual directories explicitly. Newly created directories have no associated quota. File/directory creations fault if the quota would be exceeded. The attempt to set a quota faults if the directory would be in violation of the new quota.
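
The rules above can be modelled in a few lines. This is a toy in-memory sketch, not the NameNode implementation: the class and function names are hypothetical, and it assumes that a directory counts toward its own quota, which the note does not state.

```python
class QuotaExceeded(Exception):
    """Raised when an operation would violate a directory quota."""


def ancestors(path):
    """Return the directories above path, nearest first, ending at '/'."""
    parts = path.strip("/").split("/")
    above = ["/" + "/".join(parts[:i]) for i in range(len(parts) - 1, 0, -1)]
    return above + ["/"]


class NamespaceModel:
    def __init__(self):
        self.names = {"/"}  # every file or directory path that exists
        self.quotas = {}    # directory path -> maximum number of names

    def count(self, directory):
        """Count names in the tree rooted at directory (assumed to include itself)."""
        prefix = directory.rstrip("/") + "/"
        return sum(1 for n in self.names if n == directory or n.startswith(prefix))

    def set_quota(self, directory, limit):
        """Setting a quota faults if the directory already violates it."""
        if self.count(directory) > limit:
            raise QuotaExceeded(directory)
        self.quotas[directory] = limit

    def create(self, path):
        """Creation faults if any ancestor's quota would be exceeded."""
        if path in self.names:
            return
        for directory in ancestors(path):
            limit = self.quotas.get(directory)
            if limit is not None and self.count(directory) + 1 > limit:
                raise QuotaExceeded(directory)
        self.names.add(path)
```

Directories without an entry in `quotas` are unlimited, which mirrors the statement that newly created directories have no associated quota.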
---
* [HADOOP-3184](https://issues.apache.org/jira/browse/HADOOP-3184) | *Major* | **HOD gracefully exclude "bad" nodes during ring formation**
Modified HOD to handle master (NameNode or JobTracker) failures on bad nodes by trying to bring them up on another node in the ring. Introduced new property ringmaster.max-master-failures to specify the maximum number of times a master is allowed to fail.
---
* [HADOOP-3177](https://issues.apache.org/jira/browse/HADOOP-3177) | *Major* | **Expose DFSOutputStream.fsync API though the FileSystem interface**
Added a new public interface Syncable which declares the sync() operation. FSDataOutputStream implements Syncable. If the wrappedStream in FSDataOutputStream is Syncable, calling FSDataOutputStream.sync() is equivalent to calling wrappedStream.sync(). Otherwise, FSDataOutputStream.sync() is a no-op. Both DistributedFileSystem and LocalFileSystem support the sync() operation.
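
The delegation rule can be sketched as follows. This is a Python model of the described behaviour, not the Java classes themselves; every class name below is a stand-in chosen for the sketch.

```python
class Syncable:
    """Stand-in for the interface that declares sync()."""

    def sync(self):
        raise NotImplementedError


class RecordingStream(Syncable):
    """A wrapped stream that supports sync() and counts the calls."""

    def __init__(self):
        self.sync_calls = 0

    def sync(self):
        self.sync_calls += 1


class PlainStream:
    """A wrapped stream with no sync() support."""


class DataOutputStreamModel(Syncable):
    """Delegates sync() to the wrapped stream when it is Syncable."""

    def __init__(self, wrapped):
        self.wrapped = wrapped

    def sync(self):
        if isinstance(self.wrapped, Syncable):
            self.wrapped.sync()
        # Otherwise sync() is deliberately a no-op.
```

Callers can therefore invoke sync() unconditionally; whether data is actually persisted depends on the underlying stream.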
---
* [HADOOP-3164](https://issues.apache.org/jira/browse/HADOOP-3164) | *Major* | **Use FileChannel.transferTo() when data is read from DataNode.**
Changed data node to use FileChannel.transferTo() to transfer block data.
---
* [HADOOP-3135](https://issues.apache.org/jira/browse/HADOOP-3135) | *Critical* | **if the 'mapred.system.dir' in the client jobconf is different from the JobTracker's value job submission fails**
Changed job submission protocol to not allow submission if the client's value of mapred.system.dir does not match the job tracker's. Deprecated JobConf.getSystemDir(); use JobClient.getSystemDir().
---
* [HADOOP-3113](https://issues.apache.org/jira/browse/HADOOP-3113) | *Major* | **DFSOututStream.flush() should flush data to real block file on DataNode.**
Added sync() method to FSDataOutputStream to really, really persist data in HDFS. InterDatanodeProtocol is used to implement this feature.
---
* [HADOOP-3095](https://issues.apache.org/jira/browse/HADOOP-3095) | *Major* | **Validating input paths and creating splits is slow on S3**
Added overloaded method getFileBlockLocations(FileStatus, long, long). This is an incompatible change for FileSystem implementations which override getFileBlockLocations(Path, long, long). They should have the signature of this method changed to getFileBlockLocations(FileStatus, long, long) to work correctly.
---
* [HADOOP-3061](https://issues.apache.org/jira/browse/HADOOP-3061) | *Major* | **Writable for single byte and double**
Introduced ByteWritable and DoubleWritable (implementing WritableComparable) implementations for Byte and Double.
---
* [HADOOP-3058](https://issues.apache.org/jira/browse/HADOOP-3058) | *Minor* | **Hadoop DFS to report more replication metrics**
Added FSNamesystem status metrics.
---
* [HADOOP-3035](https://issues.apache.org/jira/browse/HADOOP-3035) | *Major* | **Data nodes should inform the name-node about block crc errors.**
Changed protocol for transferring blocks between data nodes to report corrupt blocks to data node for re-replication from a good replica.
---
* [HADOOP-3013](https://issues.apache.org/jira/browse/HADOOP-3013) | *Major* | **fsck to show (checksum) corrupted files**
fsck reports corrupt blocks in the system.
---
* [HADOOP-2909](https://issues.apache.org/jira/browse/HADOOP-2909) | *Major* | **Improve IPC idle connection management**
Removed property ipc.client.maxidletime from the default configuration. The allowed idle time is twice ipc.client.connection.maxidletime.
---
* [HADOOP-2867](https://issues.apache.org/jira/browse/HADOOP-2867) | *Major* | **Add a task's cwd to it's LD\_LIBRARY\_PATH**
Added task's cwd to its LD\_LIBRARY\_PATH.
---
* [HADOOP-2865](https://issues.apache.org/jira/browse/HADOOP-2865) | *Major* | **FsShell.ls() should print file attributes first then the path name.**
Changed the output of the "fs -ls" command to more closely match familiar Linux format. Additional changes were made by HADOOP-3459. Applications that parse the command output should be reviewed.
---
* [HADOOP-2797](https://issues.apache.org/jira/browse/HADOOP-2797) | *Critical* | **Withdraw CRC upgrade from HDFS**
Withdrew the upgrade-to-CRC facility. HDFS will no longer support upgrades from versions without CRCs for block data. Users upgrading from version 0.13 or earlier must first upgrade to an intermediate version (0.14, 0.15, 0.16 or 0.17) before upgrading to version 0.18 or later.
---
* [HADOOP-2703](https://issues.apache.org/jira/browse/HADOOP-2703) | *Minor* | **New files under lease (before close) still shows up as MISSING files/blocks in fsck**
Changed fsck to ignore files opened for writing. Introduced new option "-openforwrite" to explicitly show open files.
---
* [HADOOP-2656](https://issues.apache.org/jira/browse/HADOOP-2656) | *Major* | **Support for upgrading existing cluster to facilitate appends to HDFS files**
Associated a generation stamp with each block. On data nodes, the generation stamp is stored as part of the file name of the block's meta-data file.
---
* [HADOOP-2585](https://issues.apache.org/jira/browse/HADOOP-2585) | *Major* | **Automatic namespace recovery from the secondary image.**
Improved management of replicas of the name space image. If all replicas on the Name Node are lost, the latest check point can be loaded from the secondary Name Node. Use parameter "-importCheckpoint" and specify the location with "fs.checkpoint.dir". The directory structure on the secondary Name Node has changed to match the primary Name Node.
---
* [HADOOP-2427](https://issues.apache.org/jira/browse/HADOOP-2427) | *Major* | **Cleanup of mapred.local.dir after maptask is complete**
The current working directory of a task, i.e. ${mapred.local.dir}/taskTracker/jobcache/\<jobid\>/\<task\_dir\>/work, is cleaned up as soon as the task is finished.
---
* [HADOOP-2188](https://issues.apache.org/jira/browse/HADOOP-2188) | *Major* | **RPC should send a ping rather than use client timeouts**
Replaced timeouts with pings to check that client connection is alive. Removed the property ipc.client.timeout from the default Hadoop configuration. Removed the metric RpcOpsDiscardedOPsNum.
---
* [HADOOP-2181](https://issues.apache.org/jira/browse/HADOOP-2181) | *Minor* | **Input Split details for maps should be logged**
Added logging for input splits in job tracker log and job history log. Added web UI for viewing input splits in the job UI and history UI.
---
* [HADOOP-2132](https://issues.apache.org/jira/browse/HADOOP-2132) | *Critical* | **Killing successfully completed jobs moves them to failed**
Changed "job -kill" to only allow a job that is in the RUNNING or PREP state to be killed.
---
* [HADOOP-2095](https://issues.apache.org/jira/browse/HADOOP-2095) | *Major* | **Reducer failed due to Out ofMemory**
Reduced in-memory copies of keys and values as they flow through the Map-Reduce framework. Changed the storage of intermediate map outputs to use new IFile instead of SequenceFile for better compression.
---
* [HADOOP-2065](https://issues.apache.org/jira/browse/HADOOP-2065) | *Major* | **Replication policy for corrupted block**
Added "corrupt" flag to LocatedBlock to indicate that all replicas of the block are thought to be corrupt.
---
* [HADOOP-2019](https://issues.apache.org/jira/browse/HADOOP-2019) | *Major* | **DistributedFileCache should support .tgz files in addition to jars and zip files**
Added support for .tar, .tgz and .tar.gz files in DistributedCache. File sizes are limited to 2GB.
---
* [HADOOP-1915](https://issues.apache.org/jira/browse/HADOOP-1915) | *Minor* | **adding counters methods using String (as opposed to Enum)**
Provided a new method to update counters: "incrCounter(String group, String counter, long amount)".
---
* [HADOOP-1702](https://issues.apache.org/jira/browse/HADOOP-1702) | *Major* | **Reduce buffer copies when data is written to DFS**
Reduced buffer copies as data is written to HDFS. The order of sending data bytes and control information has changed, but this will not be observed by client applications.
---
* [HADOOP-1328](https://issues.apache.org/jira/browse/HADOOP-1328) | *Major* | **Hadoop Streaming needs to provide a way for the stream plugin to update global counters**
Introduced a way for a streaming process to update global counters and status using stderr stream to emit information. Use "reporter:counter:\<group\>,\<counter\>,\<amount\>" to update a counter. Use "reporter:status:\<message\>" to update status.
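
A streaming script could emit these lines with a couple of small helpers. The sketch below is illustrative only: the helper names and the sample group and counter names are assumptions; the two line formats are the ones given in the note.

```python
import sys


def counter_line(group, counter, amount):
    """Build the line that updates a counter."""
    return "reporter:counter:{},{},{}".format(group, counter, amount)


def status_line(message):
    """Build the line that updates the task status."""
    return "reporter:status:{}".format(message)


def report(line, stream=sys.stderr):
    """Write one reporter line to stderr and flush it immediately."""
    stream.write(line + "\n")
    stream.flush()
```

A mapper might call `report(counter_line("WordCount", "InputLines", 1))` once per input record. Flushing after each line avoids losing updates if the process exits abruptly.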
---
* [HADOOP-930](https://issues.apache.org/jira/browse/HADOOP-930) | *Major* | **Add support for reading regular (non-block-based) files from S3 in S3FileSystem**
Added support for reading and writing native S3 files. Native S3 files are referenced using s3n URIs. See http://wiki.apache.org/hadoop/AmazonS3 for more details.
---
* [HADOOP-544](https://issues.apache.org/jira/browse/HADOOP-544) | *Major* | **Replace the job, tip and task ids with objects.**
Introduced new classes JobID, TaskID and TaskAttemptID, which should be used instead of their string counterparts. Deprecated functions in JobClient, TaskReport, RunningJob, jobcontrol.Job and TaskCompletionEvent that use string arguments. Applications can use xxxID.toString() and xxxID.forName() methods to convert/restore objects to/from strings.
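
The round trip between an ID object and its string form can be modelled as below. This is a Python sketch, not the Hadoop classes: the class name, field names, and the "job_\<identifier\>_\<number\>" layout with a four-digit number are assumptions made for illustration, since the note does not spell out the string format.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class JobIdModel:
    jt_identifier: str  # assumed: identifier of the job tracker instance
    number: int         # assumed: sequence number of the job

    def __str__(self):
        return "job_{}_{:04d}".format(self.jt_identifier, self.number)

    @classmethod
    def for_name(cls, text):
        """Rebuild an ID object from its string form."""
        parts = text.split("_")
        if len(parts) != 3 or parts[0] != "job":
            raise ValueError("not a job id: {!r}".format(text))
        return cls(parts[1], int(parts[2]))
```

Comparing ID objects instead of raw strings avoids mismatches caused by formatting differences such as zero padding.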
---
* [HADOOP-236](https://issues.apache.org/jira/browse/HADOOP-236) | *Major* | **job tracker should refuse connection from a task tracker with a different version number**
Changed the connection protocol between job tracker and task tracker so that a task tracker will not connect to a job tracker with a different build version.
---
* [HADOOP-4](https://issues.apache.org/jira/browse/HADOOP-4) | *Major* | **tool to mount dfs on linux**
Introduced FUSE module for HDFS. Module allows mount of HDFS as a Unix filesystem, and optionally the export of that mount point to other machines. Writes are disabled. rmdir, mv, mkdir, rm are supported, but not cp, touch, and the like. Usage information is attached to the Jira record.

@ -0,0 +1,74 @@
<!---
# Licensed to the Apache Software Foundation (ASF) under one
# or more contributor license agreements. See the NOTICE file
# distributed with this work for additional information
# regarding copyright ownership. The ASF licenses this file
# to you under the Apache License, Version 2.0 (the
# "License"); you may not use this file except in compliance
# with the License. You may obtain a copy of the License at
#
# http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing, software
# distributed under the License is distributed on an "AS IS" BASIS,
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# limitations under the License.
-->
# Apache Hadoop Changelog
## Release 0.18.1 - 2008-09-17
### INCOMPATIBLE CHANGES:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
### NEW FEATURES:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
### IMPROVEMENTS:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
| [HADOOP-4145](https://issues.apache.org/jira/browse/HADOOP-4145) | [HOD] Support an accounting plugin script for HOD | Blocker | contrib/hod | Hemanth Yamijala | Hemanth Yamijala |
| [HADOOP-3934](https://issues.apache.org/jira/browse/HADOOP-3934) | Update log4j from 1.2.13 to 1.2.15 | Blocker | . | Don Kim | Owen O'Malley |
### BUG FIXES:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
| [HADOOP-4161](https://issues.apache.org/jira/browse/HADOOP-4161) | [HOD] Uncaught exceptions can potentially hang hod-client. | Major | contrib/hod | Vinod Kumar Vavilapalli | Vinod Kumar Vavilapalli |
| [HADOOP-4060](https://issues.apache.org/jira/browse/HADOOP-4060) | [HOD] Make HOD to roll log files on the client | Blocker | contrib/hod | Hemanth Yamijala | Vinod Kumar Vavilapalli |
| [HADOOP-4046](https://issues.apache.org/jira/browse/HADOOP-4046) | WritableComparator's constructor should be protected instead of private | Major | io | Owen O'Malley | Chris Douglas |
| [HADOOP-4040](https://issues.apache.org/jira/browse/HADOOP-4040) | Remove the hardcoded ipc.client.connection.maxidletime setting from the TaskTracker.Child.main() | Blocker | . | Devaraj Das | Devaraj Das |
| [HADOOP-4037](https://issues.apache.org/jira/browse/HADOOP-4037) | HadoopQA contrib -1 comments due to inconsistency in eclipse plugin declared jars | Blocker | . | Nigel Daley | Nigel Daley |
| [HADOOP-3995](https://issues.apache.org/jira/browse/HADOOP-3995) | renameTo(src, dst) does not restore src name in case of quota failure. | Blocker | . | Raghu Angadi | Raghu Angadi |
| [HADOOP-3940](https://issues.apache.org/jira/browse/HADOOP-3940) | Reduce often attempts in memory merge with no work | Minor | . | Chris Douglas | Chris Douglas |
| [HADOOP-3821](https://issues.apache.org/jira/browse/HADOOP-3821) | SequenceFile's Reader.decompressorPool or Writer.decompressorPool gets into an inconsistent state when calling close() more than once | Major | io | Peter Voss | Arun C Murthy |
### TESTS:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
### SUB-TASKS:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
### OTHER:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |

@ -0,0 +1,38 @@
<!---
# Licensed to the Apache Software Foundation (ASF) under one
# or more contributor license agreements. See the NOTICE file
# distributed with this work for additional information
# regarding copyright ownership. The ASF licenses this file
# to you under the Apache License, Version 2.0 (the
# "License"); you may not use this file except in compliance
# with the License. You may obtain a copy of the License at
#
# http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing, software
# distributed under the License is distributed on an "AS IS" BASIS,
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# limitations under the License.
-->
# Apache Hadoop 0.18.1 Release Notes
These release notes cover new developer and user-facing incompatibilities, features, and major improvements.
---
* [HADOOP-4060](https://issues.apache.org/jira/browse/HADOOP-4060) | *Blocker* | **[HOD] Make HOD to roll log files on the client**
HOD client was modified to roll over client logs being written to the cluster directory. A new configuration parameter, hod.log-rollover-count, was introduced to specify how many rollover log files to retain.
---
* [HADOOP-3995](https://issues.apache.org/jira/browse/HADOOP-3995) | *Blocker* | **renameTo(src, dst) does not restore src name in case of quota failure.**
In case of quota failure on HDFS, rename does not restore source filename.


@ -0,0 +1,88 @@
<!---
# Licensed to the Apache Software Foundation (ASF) under one
# or more contributor license agreements. See the NOTICE file
# distributed with this work for additional information
# regarding copyright ownership. The ASF licenses this file
# to you under the Apache License, Version 2.0 (the
# "License"); you may not use this file except in compliance
# with the License. You may obtain a copy of the License at
#
# http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing, software
# distributed under the License is distributed on an "AS IS" BASIS,
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# limitations under the License.
-->
# Apache Hadoop Changelog
## Release 0.18.2 - 2008-11-03
### INCOMPATIBLE CHANGES:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
| [HADOOP-4228](https://issues.apache.org/jira/browse/HADOOP-4228) | dfs datanode metrics, bytes\_read, bytes\_written overflows due to incorrect type used. | Blocker | metrics | Eric Yang | Eric Yang |
### NEW FEATURES:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
### IMPROVEMENTS:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
| [HADOOP-2421](https://issues.apache.org/jira/browse/HADOOP-2421) | Release JDiff report of changes between different versions of Hadoop | Minor | documentation | Nigel Daley | Doug Cutting |
### BUG FIXES:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
| [HADOOP-4533](https://issues.apache.org/jira/browse/HADOOP-4533) | HDFS client of hadoop 0.18.1 and HDFS server 0.18.2 (0.18 branch) not compatible | Blocker | . | Runping Qi | Hairong Kuang |
| [HADOOP-4526](https://issues.apache.org/jira/browse/HADOOP-4526) | fsck failing with NullPointerException (return value 0) | Major | . | Koji Noguchi | Hairong Kuang |
| [HADOOP-4517](https://issues.apache.org/jira/browse/HADOOP-4517) | unstable dfs when running jobs on 0.18.1 | Blocker | . | Christian Kunz | Tsz Wo Nicholas Sze |
| [HADOOP-4483](https://issues.apache.org/jira/browse/HADOOP-4483) | getBlockArray in DatanodeDescriptor does not honor passed in maxblocks value | Critical | . | Ahad Rana | Ahad Rana |
| [HADOOP-4469](https://issues.apache.org/jira/browse/HADOOP-4469) | ant jar file not being included in tar distribution | Blocker | build | Nigel Daley | Nigel Daley |
| [HADOOP-4467](https://issues.apache.org/jira/browse/HADOOP-4467) | SerializationFactory should use current context ClassLoader | Blocker | . | Chris K Wensel | Chris K Wensel |
| [HADOOP-4407](https://issues.apache.org/jira/browse/HADOOP-4407) | HADOOP-4395 should use a Java 1.5 API for 0.18 | Blocker | . | Tsz Wo Nicholas Sze | Tsz Wo Nicholas Sze |
| [HADOOP-4399](https://issues.apache.org/jira/browse/HADOOP-4399) | fuse-dfs per FD context is not thread safe and can cause segfaults and corruptions | Blocker | . | Pete Wyckoff | Pete Wyckoff |
| [HADOOP-4395](https://issues.apache.org/jira/browse/HADOOP-4395) | Reloading FSImage and FSEditLog may erase user and group information | Blocker | . | Tsz Wo Nicholas Sze | Tsz Wo Nicholas Sze |
| [HADOOP-4369](https://issues.apache.org/jira/browse/HADOOP-4369) | Metric Averages are not averages | Blocker | metrics | Brian Bockelman | Brian Bockelman |
| [HADOOP-4351](https://issues.apache.org/jira/browse/HADOOP-4351) | ArrayIndexOutOfBoundsException during fsck | Blocker | . | Brian Bockelman | Hairong Kuang |
| [HADOOP-4340](https://issues.apache.org/jira/browse/HADOOP-4340) | "hadoop jar" always returns exit code 0 (success) to the shell when jar throws a fatal exception | Major | . | David Litster | Arun C Murthy |
| [HADOOP-4326](https://issues.apache.org/jira/browse/HADOOP-4326) | ChecksumFileSystem does not override all create(...) methods | Blocker | fs | Tsz Wo Nicholas Sze | Tsz Wo Nicholas Sze |
| [HADOOP-4314](https://issues.apache.org/jira/browse/HADOOP-4314) | TestReplication fails quite often | Blocker | test | Raghu Angadi | Raghu Angadi |
| [HADOOP-4292](https://issues.apache.org/jira/browse/HADOOP-4292) | append() does not work for LocalFileSystem | Blocker | fs | Raghu Angadi | Hairong Kuang |
| [HADOOP-4277](https://issues.apache.org/jira/browse/HADOOP-4277) | Checksum verification is disabled for LocalFS | Blocker | . | Raghu Angadi | Raghu Angadi |
| [HADOOP-4271](https://issues.apache.org/jira/browse/HADOOP-4271) | Bug in FSInputChecker makes it possible to read from an invalid buffer | Blocker | fs | Ning Li | Ning Li |
| [HADOOP-3914](https://issues.apache.org/jira/browse/HADOOP-3914) | checksumOk implementation in DFSClient can break applications | Blocker | . | Christian Kunz | Christian Kunz |
| [HADOOP-3786](https://issues.apache.org/jira/browse/HADOOP-3786) | Changes in HOD documentation | Blocker | documentation | Suman Sehgal | Vinod Kumar Vavilapalli |
| [HADOOP-3614](https://issues.apache.org/jira/browse/HADOOP-3614) | TestLeaseRecovery fails when run with assertions enabled. | Blocker | . | Konstantin Shvachko | Tsz Wo Nicholas Sze |
| [HADOOP-3217](https://issues.apache.org/jira/browse/HADOOP-3217) | [HOD] Be less aggressive when querying job status from resource manager. | Blocker | contrib/hod | Hemanth Yamijala | Hemanth Yamijala |
### TESTS:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
| [HADOOP-4403](https://issues.apache.org/jira/browse/HADOOP-4403) | TestLeaseRecovery.testBlockSynchronization failed on trunk | Blocker | test | Hemanth Yamijala | Tsz Wo Nicholas Sze |
### SUB-TASKS:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
### OTHER:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |


@ -0,0 +1,59 @@
<!---
# Licensed to the Apache Software Foundation (ASF) under one
# or more contributor license agreements. See the NOTICE file
# distributed with this work for additional information
# regarding copyright ownership. The ASF licenses this file
# to you under the Apache License, Version 2.0 (the
# "License"); you may not use this file except in compliance
# with the License. You may obtain a copy of the License at
#
# http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing, software
# distributed under the License is distributed on an "AS IS" BASIS,
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# limitations under the License.
-->
# Apache Hadoop 0.18.2 Release Notes
These release notes cover new developer and user-facing incompatibilities, features, and major improvements.
---
* [HADOOP-4467](https://issues.apache.org/jira/browse/HADOOP-4467) | *Blocker* | **SerializationFactory should use current context ClassLoader**
SerializationFactory now uses the current context ClassLoader, allowing for user-supplied Serialization instances.
---
* [HADOOP-4314](https://issues.apache.org/jira/browse/HADOOP-4314) | *Blocker* | **TestReplication fails quite often**
Simulated Datanodes should not include blocks that are still being written in their block report.
---
* [HADOOP-4277](https://issues.apache.org/jira/browse/HADOOP-4277) | *Blocker* | **Checksum verification is disabled for LocalFS**
Checksum verification was mistakenly disabled for LocalFileSystem.
---
* [HADOOP-4271](https://issues.apache.org/jira/browse/HADOOP-4271) | *Blocker* | **Bug in FSInputChecker makes it possible to read from an invalid buffer**
Checksum input stream can sometimes return invalid data to the user.
---
* [HADOOP-4228](https://issues.apache.org/jira/browse/HADOOP-4228) | *Blocker* | **dfs datanode metrics, bytes\_read, bytes\_written overflows due to incorrect type used.**
Changed bytes\_read, bytes\_written to type long to prevent metrics overflow.
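
To illustrate the kind of overflow this change addresses, here is a minimal Java sketch. It assumes the counters were previously 32-bit `int` values (the note only says the type was incorrect); the class and method names are hypothetical and are not taken from the Hadoop source.

```java
public class MetricsOverflowSketch {
    // Hypothetical helper: accumulate a byte count into a 32-bit counter.
    static int addAsInt(int total, int delta) {
        return total + delta; // silently wraps past Integer.MAX_VALUE
    }

    // Hypothetical helper: accumulate the same byte count into a 64-bit counter.
    static long addAsLong(long total, int delta) {
        return total + delta; // delta is widened to long before the addition
    }

    public static void main(String[] args) {
        int chunk = 1 << 30; // 1 GiB read or written per step
        int intTotal = 0;
        long longTotal = 0L;
        for (int i = 0; i < 3; i++) {
            intTotal = addAsInt(intTotal, chunk);
            longTotal = addAsLong(longTotal, chunk);
        }
        System.out.println("int counter:  " + intTotal);  // prints -1073741824
        System.out.println("long counter: " + longTotal); // prints 3221225472
    }
}
```

After 3 GiB of traffic the `int` counter has wrapped to a negative value, while the `long` counter still reports the true total.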


@ -0,0 +1,115 @@
<!---
# Licensed to the Apache Software Foundation (ASF) under one
# or more contributor license agreements. See the NOTICE file
# distributed with this work for additional information
# regarding copyright ownership. The ASF licenses this file
# to you under the Apache License, Version 2.0 (the
# "License"); you may not use this file except in compliance
# with the License. You may obtain a copy of the License at
#
# http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing, software
# distributed under the License is distributed on an "AS IS" BASIS,
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# limitations under the License.
-->
# Apache Hadoop Changelog
## Release 0.18.3 - 2009-01-29
### INCOMPATIBLE CHANGES:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
| [HADOOP-4997](https://issues.apache.org/jira/browse/HADOOP-4997) | workaround for tmp file handling on DataNodes in 0.18 (HADOOP-4663) | Blocker | . | Raghu Angadi | Raghu Angadi |
| [HADOOP-4659](https://issues.apache.org/jira/browse/HADOOP-4659) | Root cause of connection failure is being lost to code that uses it for delaying startup | Blocker | ipc | Steve Loughran | Steve Loughran |
| [HADOOP-4061](https://issues.apache.org/jira/browse/HADOOP-4061) | Large number of decommission freezes the Namenode | Major | . | Koji Noguchi | Tsz Wo Nicholas Sze |
### NEW FEATURES:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
### IMPROVEMENTS:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
| [HADOOP-3780](https://issues.apache.org/jira/browse/HADOOP-3780) | JobTracker should synchronously resolve the tasktracker's network location when the tracker registers | Major | . | Amar Kamat | Amar Kamat |
### BUG FIXES:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
| [HADOOP-5077](https://issues.apache.org/jira/browse/HADOOP-5077) | JavaDoc errors in 0.18.3 | Blocker | util | Raghu Angadi | Raghu Angadi |
| [HADOOP-4983](https://issues.apache.org/jira/browse/HADOOP-4983) | Job counters sometimes go down as tasks run without task failures | Critical | . | Owen O'Malley | Amareshwari Sriramadasu |
| [HADOOP-4971](https://issues.apache.org/jira/browse/HADOOP-4971) | Block report times from datanodes could converge to same time. | Blocker | . | Raghu Angadi | Raghu Angadi |
| [HADOOP-4961](https://issues.apache.org/jira/browse/HADOOP-4961) | ConcurrentModificationException in lease recovery of empty files. | Blocker | . | Konstantin Shvachko | Konstantin Shvachko |
| [HADOOP-4951](https://issues.apache.org/jira/browse/HADOOP-4951) | Lease monitor does not own the LeaseManager lock in changing leases. | Blocker | . | Tsz Wo Nicholas Sze | Tsz Wo Nicholas Sze |
| [HADOOP-4935](https://issues.apache.org/jira/browse/HADOOP-4935) | Manual leaving of safe mode may lead to data lost | Major | . | Hairong Kuang | Konstantin Shvachko |
| [HADOOP-4924](https://issues.apache.org/jira/browse/HADOOP-4924) | Race condition in re-init of TaskTracker | Blocker | . | Devaraj Das | Devaraj Das |
| [HADOOP-4919](https://issues.apache.org/jira/browse/HADOOP-4919) | [HOD] Provide execute access to JT history directory path for group | Major | contrib/hod | Hemanth Yamijala | Peeyush Bishnoi |
| [HADOOP-4910](https://issues.apache.org/jira/browse/HADOOP-4910) | NameNode should exclude corrupt replicas when choosing excessive replicas to delete | Blocker | . | Hairong Kuang | Hairong Kuang |
| [HADOOP-4904](https://issues.apache.org/jira/browse/HADOOP-4904) | Deadlock while leaving safe mode. | Blocker | . | Konstantin Shvachko | Konstantin Shvachko |
| [HADOOP-4840](https://issues.apache.org/jira/browse/HADOOP-4840) | TestNodeCount sometimes fails with NullPointerException | Major | . | Hairong Kuang | Hairong Kuang |
| [HADOOP-4822](https://issues.apache.org/jira/browse/HADOOP-4822) | 0.18 cannot be compiled in Java 5. | Blocker | util | Tsz Wo Nicholas Sze | |
| [HADOOP-4810](https://issues.apache.org/jira/browse/HADOOP-4810) | Data lost at cluster startup time | Blocker | . | Hairong Kuang | Hairong Kuang |
| [HADOOP-4806](https://issues.apache.org/jira/browse/HADOOP-4806) | HDFS rename does not work correctly if src contains Java regular expression special characters | Blocker | . | Tsz Wo Nicholas Sze | Tsz Wo Nicholas Sze |
| [HADOOP-4797](https://issues.apache.org/jira/browse/HADOOP-4797) | RPC Server can leave a lot of direct buffers | Blocker | ipc | Raghu Angadi | Raghu Angadi |
| [HADOOP-4795](https://issues.apache.org/jira/browse/HADOOP-4795) | Lease monitor may get into an infinite loop | Blocker | . | Tsz Wo Nicholas Sze | Tsz Wo Nicholas Sze |
| [HADOOP-4778](https://issues.apache.org/jira/browse/HADOOP-4778) | Check for zero size block meta file when updating a block. | Blocker | . | Tsz Wo Nicholas Sze | Tsz Wo Nicholas Sze |
| [HADOOP-4746](https://issues.apache.org/jira/browse/HADOOP-4746) | Job output directory should be normalized | Blocker | . | Hairong Kuang | Hairong Kuang |
| [HADOOP-4742](https://issues.apache.org/jira/browse/HADOOP-4742) | Mistake delete replica in hadoop 0.18.1 | Blocker | . | Wang Xu | Wang Xu |
| [HADOOP-4734](https://issues.apache.org/jira/browse/HADOOP-4734) | Some lease recovery codes in 0.19 or trunk should also be committed in 0.18. | Blocker | . | Tsz Wo Nicholas Sze | Tsz Wo Nicholas Sze |
| [HADOOP-4726](https://issues.apache.org/jira/browse/HADOOP-4726) | documentation typos: "the the" | Minor | documentation | Tsz Wo Nicholas Sze | Edward J. Yoon |
| [HADOOP-4717](https://issues.apache.org/jira/browse/HADOOP-4717) | Removal of default port# in NameNode.getUri() cause a map/reduce job failed to prompt temporary output | Blocker | . | Hairong Kuang | Hairong Kuang |
| [HADOOP-4714](https://issues.apache.org/jira/browse/HADOOP-4714) | map tasks timing out during merge phase | Major | . | Christian Kunz | Jothi Padmanabhan |
| [HADOOP-4713](https://issues.apache.org/jira/browse/HADOOP-4713) | librecordio does not scale to large records | Blocker | record | Christian Kunz | Christian Kunz |
| [HADOOP-4703](https://issues.apache.org/jira/browse/HADOOP-4703) | DataNode.createInterDataNodeProtocolProxy should not wait for proxy forever while recovering lease | Major | . | Hairong Kuang | Tsz Wo Nicholas Sze |
| [HADOOP-4702](https://issues.apache.org/jira/browse/HADOOP-4702) | Failed block replication leaves an incomplete block in receiver's tmp data directory | Blocker | . | Hairong Kuang | Hairong Kuang |
| [HADOOP-4679](https://issues.apache.org/jira/browse/HADOOP-4679) | Datanode prints tons of log messages: Waiting for threadgroup to exit, active theads is XX | Major | . | Hairong Kuang | Hairong Kuang |
| [HADOOP-4654](https://issues.apache.org/jira/browse/HADOOP-4654) | remove temporary output directory of failed tasks | Major | . | Christian Kunz | Amareshwari Sriramadasu |
| [HADOOP-4647](https://issues.apache.org/jira/browse/HADOOP-4647) | NamenodeFsck creates a new DFSClient but never closes it | Major | . | Tsz Wo Nicholas Sze | Tsz Wo Nicholas Sze |
| [HADOOP-4643](https://issues.apache.org/jira/browse/HADOOP-4643) | NameNode should exclude excessive replicas when counting live replicas for a block | Major | . | Hairong Kuang | Hairong Kuang |
| [HADOOP-4635](https://issues.apache.org/jira/browse/HADOOP-4635) | Memory leak ? | Blocker | . | Marc-Olivier Fleury | Pete Wyckoff |
| [HADOOP-4620](https://issues.apache.org/jira/browse/HADOOP-4620) | Streaming mapper never completes if the mapper does not write to stdout | Major | . | Runping Qi | Ravi Gummadi |
| [HADOOP-4616](https://issues.apache.org/jira/browse/HADOOP-4616) | assertion makes fuse-dfs exit when reading incomplete data | Blocker | . | Marc-Olivier Fleury | Pete Wyckoff |
| [HADOOP-4614](https://issues.apache.org/jira/browse/HADOOP-4614) | "Too many open files" error while processing a large gzip file | Blocker | . | Abdul Qadeer | Yuri Pradkin |
| [HADOOP-4613](https://issues.apache.org/jira/browse/HADOOP-4613) | browseBlock.jsp does not generate "genstamp" property. | Major | . | Konstantin Shvachko | Johan Oskarsson |
| [HADOOP-4610](https://issues.apache.org/jira/browse/HADOOP-4610) | Always calculate mis-replicated blocks when safe-mode is turned off. | Major | . | Konstantin Shvachko | Konstantin Shvachko |
| [HADOOP-4597](https://issues.apache.org/jira/browse/HADOOP-4597) | Under-replicated blocks are not calculated if the name-node is forced out of safe-mode. | Blocker | . | Konstantin Shvachko | Konstantin Shvachko |
| [HADOOP-4556](https://issues.apache.org/jira/browse/HADOOP-4556) | Block went missing | Major | . | Robert Chansler | Hairong Kuang |
| [HADOOP-4542](https://issues.apache.org/jira/browse/HADOOP-4542) | Fault in TestDistributedUpgrade | Minor | test | Robert Chansler | Raghu Angadi |
| [HADOOP-4499](https://issues.apache.org/jira/browse/HADOOP-4499) | DFSClient should invoke checksumOk only once. | Blocker | . | Raghu Angadi | Raghu Angadi |
| [HADOOP-4257](https://issues.apache.org/jira/browse/HADOOP-4257) | TestLeaseRecovery2.testBlockSynchronization failing. | Blocker | test | Vinod Kumar Vavilapalli | Tsz Wo Nicholas Sze |
| [HADOOP-3883](https://issues.apache.org/jira/browse/HADOOP-3883) | TestFileCreation fails once in a while | Blocker | test | Lohit Vijayarenu | Tsz Wo Nicholas Sze |
| [HADOOP-3121](https://issues.apache.org/jira/browse/HADOOP-3121) | dfs -lsr fail with "Could not get listing " | Minor | fs | Koji Noguchi | Tsz Wo Nicholas Sze |
| [HADOOP-1980](https://issues.apache.org/jira/browse/HADOOP-1980) | 'dfsadmin -safemode enter' should prevent the namenode from leaving safemode automatically after startup | Minor | . | Koji Noguchi | Konstantin Shvachko |
### TESTS:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
### SUB-TASKS:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
| [HADOOP-4824](https://issues.apache.org/jira/browse/HADOOP-4824) | Should not use File.setWritable(..) in 0.18 | Major | . | Tsz Wo Nicholas Sze | Hairong Kuang |
| [HADOOP-4823](https://issues.apache.org/jira/browse/HADOOP-4823) | Should not use java.util.NavigableMap in 0.18 | Major | util | Tsz Wo Nicholas Sze | Tsz Wo Nicholas Sze |
### OTHER:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
| [HADOOP-4150](https://issues.apache.org/jira/browse/HADOOP-4150) | Include librecordio as part of the release | Blocker | build | Koji Noguchi | Giridharan Kesavan |


@ -0,0 +1,104 @@
<!---
# Licensed to the Apache Software Foundation (ASF) under one
# or more contributor license agreements. See the NOTICE file
# distributed with this work for additional information
# regarding copyright ownership. The ASF licenses this file
# to you under the Apache License, Version 2.0 (the
# "License"); you may not use this file except in compliance
# with the License. You may obtain a copy of the License at
#
# http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing, software
# distributed under the License is distributed on an "AS IS" BASIS,
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# limitations under the License.
-->
# Apache Hadoop 0.18.3 Release Notes
These release notes cover new developer and user-facing incompatibilities, features, and major improvements.
---
* [HADOOP-5077](https://issues.apache.org/jira/browse/HADOOP-5077) | *Blocker* | **JavaDoc errors in 0.18.3**
Fixed a couple of JavaDoc warnings.
---
* [HADOOP-4997](https://issues.apache.org/jira/browse/HADOOP-4997) | *Blocker* | **workaround for tmp file handling on DataNodes in 0.18 (HADOOP-4663)**
Reverted tmp file handling on DataNodes to the 0.17 behavior. sync() introduced in 0.18 has fewer guarantees.
---
* [HADOOP-4971](https://issues.apache.org/jira/browse/HADOOP-4971) | *Blocker* | **Block report times from datanodes could converge to same time.**
A long (unexpected) delay at datanodes could cause subsequent block reports from many datanodes to arrive at the same time.
---
* [HADOOP-4797](https://issues.apache.org/jira/browse/HADOOP-4797) | *Blocker* | **RPC Server can leave a lot of direct buffers**
Improve how RPC server reads and writes large buffers. Avoids soft-leak of direct buffers and excess copies in NIO layer.
---
* [HADOOP-4679](https://issues.apache.org/jira/browse/HADOOP-4679) | *Major* | **Datanode prints tons of log messages: Waiting for threadgroup to exit, active theads is XX**
1. Only the datanode's offerService thread shuts down the datanode, to avoid deadlock;
2. Datanode checks disk in case of failure on creating a block file.
---
* [HADOOP-4659](https://issues.apache.org/jira/browse/HADOOP-4659) | *Blocker* | **Root cause of connection failure is being lost to code that uses it for delaying startup**
**WARNING: No release note provided for this incompatible change.**
---
* [HADOOP-4635](https://issues.apache.org/jira/browse/HADOOP-4635) | *Blocker* | **Memory leak ?**
Fixed a memory leak of user/group information in fuse-dfs.
---
* [HADOOP-4620](https://issues.apache.org/jira/browse/HADOOP-4620) | *Major* | **Streaming mapper never completes if the mapper does not write to stdout**
The patch (HADOOP-4620.patch):
(1) solves the hanging problem on the map side with empty input and nonempty output: this map task now writes its output to intermediate files properly, like other map tasks.
(2) solves the problem of a hanging reducer with empty input to the reduce task and nonempty output: this reduce task doesn't generate output if its input is empty.
---
* [HADOOP-4542](https://issues.apache.org/jira/browse/HADOOP-4542) | *Minor* | **Fault in TestDistributedUpgrade**
TestDistributedUpgrade used to succeed for the wrong reasons.
---
* [HADOOP-4150](https://issues.apache.org/jira/browse/HADOOP-4150) | *Blocker* | **Include librecordio as part of the release**
Included librecordio in release for use by xerces-c (ant -Dlibrecordio=true -Dxercescroot=\<path to the xerces-c root\>)
---
* [HADOOP-4061](https://issues.apache.org/jira/browse/HADOOP-4061) | *Major* | **Large number of decommission freezes the Namenode**
Added a new conf property dfs.namenode.decommission.nodes.per.interval so that the NameNode checks the decommission status of x nodes every y seconds, where x is the value of dfs.namenode.decommission.nodes.per.interval and y is the value of dfs.namenode.decommission.interval.
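
The "x nodes every y seconds" behavior can be pictured with a small Java sketch. This is not the NameNode implementation: the class name, the round-robin cursor, and the string node IDs are assumptions made for illustration. `nodesPerInterval` stands in for dfs.namenode.decommission.nodes.per.interval, and each call to `nextBatch()` represents one tick of dfs.namenode.decommission.interval.

```java
import java.util.ArrayList;
import java.util.List;

public class DecommissionBatchSketch {
    private final List<String> nodes;   // hypothetical node identifiers
    private final int nodesPerInterval; // stands in for x
    private int cursor = 0;             // where the next interval resumes

    DecommissionBatchSketch(List<String> nodes, int nodesPerInterval) {
        this.nodes = new ArrayList<>(nodes);
        this.nodesPerInterval = nodesPerInterval;
    }

    /** Returns the nodes to examine during one interval, wrapping around the list. */
    List<String> nextBatch() {
        List<String> batch = new ArrayList<>();
        // Never check a node twice within the same interval.
        int count = Math.min(nodesPerInterval, nodes.size());
        for (int i = 0; i < count; i++) {
            batch.add(nodes.get(cursor));
            cursor = (cursor + 1) % nodes.size();
        }
        return batch;
    }

    public static void main(String[] args) {
        DecommissionBatchSketch checker =
            new DecommissionBatchSketch(List.of("dn1", "dn2", "dn3", "dn4", "dn5"), 2);
        // Each call models one interval; a real monitor would sleep y seconds between calls.
        for (int tick = 0; tick < 3; tick++) {
            System.out.println("interval " + tick + ": " + checker.nextBatch());
        }
    }
}
```

Bounding the work done per interval is what keeps a large decommission from monopolizing the NameNode, at the cost of taking more intervals to cover every node.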


@ -0,0 +1,73 @@
<!---
# Licensed to the Apache Software Foundation (ASF) under one
# or more contributor license agreements. See the NOTICE file
# distributed with this work for additional information
# regarding copyright ownership. The ASF licenses this file
# to you under the Apache License, Version 2.0 (the
# "License"); you may not use this file except in compliance
# with the License. You may obtain a copy of the License at
#
# http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing, software
# distributed under the License is distributed on an "AS IS" BASIS,
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# limitations under the License.
-->
# Apache Hadoop Changelog
## Release 0.18.4 - Unreleased
### INCOMPATIBLE CHANGES:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
### NEW FEATURES:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
### IMPROVEMENTS:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
### BUG FIXES:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
| [HADOOP-6017](https://issues.apache.org/jira/browse/HADOOP-6017) | NameNode and SecondaryNameNode fail to restart because of abnormal filenames. | Blocker | . | Raghu Angadi | Tsz Wo Nicholas Sze |
| [HADOOP-5644](https://issues.apache.org/jira/browse/HADOOP-5644) | Namenode is stuck in safe mode | Major | . | Suresh Srinivas | Suresh Srinivas |
| [HADOOP-5557](https://issues.apache.org/jira/browse/HADOOP-5557) | Two minor problems in TestOverReplicatedBlocks | Minor | test | Tsz Wo Nicholas Sze | Tsz Wo Nicholas Sze |
| [HADOOP-5465](https://issues.apache.org/jira/browse/HADOOP-5465) | Blocks remain under-replicated | Blocker | . | Hairong Kuang | Hairong Kuang |
| [HADOOP-5412](https://issues.apache.org/jira/browse/HADOOP-5412) | TestInjectionForSimulatedStorage occasionally fails on timeout | Major | . | Hairong Kuang | Hairong Kuang |
| [HADOOP-5311](https://issues.apache.org/jira/browse/HADOOP-5311) | Write pipeline recovery fails | Blocker | . | Hairong Kuang | dhruba borthakur |
| [HADOOP-5192](https://issues.apache.org/jira/browse/HADOOP-5192) | Block receiver should not remove a finalized block when block replication fails | Blocker | . | Hairong Kuang | Hairong Kuang |
| [HADOOP-5134](https://issues.apache.org/jira/browse/HADOOP-5134) | FSNamesystem#commitBlockSynchronization adds under-construction block locations to blocksMap | Blocker | . | Hairong Kuang | dhruba borthakur |
### TESTS:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
| [HADOOP-5114](https://issues.apache.org/jira/browse/HADOOP-5114) | A bunch of mapred unit tests are failing on Windows | Minor | test | Ramya Sunil | Raghu Angadi |
### SUB-TASKS:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
### OTHER:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |


@ -0,0 +1,31 @@
<!---
# Licensed to the Apache Software Foundation (ASF) under one
# or more contributor license agreements. See the NOTICE file
# distributed with this work for additional information
# regarding copyright ownership. The ASF licenses this file
# to you under the Apache License, Version 2.0 (the
# "License"); you may not use this file except in compliance
# with the License. You may obtain a copy of the License at
#
# http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing, software
# distributed under the License is distributed on an "AS IS" BASIS,
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# limitations under the License.
-->
# Apache Hadoop 0.18.4 Release Notes
These release notes cover new developer and user-facing incompatibilities, features, and major improvements.
---
* [HADOOP-5114](https://issues.apache.org/jira/browse/HADOOP-5114) | *Minor* | **A bunch of mapred unit tests are failing on Windows**
Removed the timeout for accept() in DataNode. The timeout made accept() fail in the JDK on Windows and caused many tests to fail.


@ -0,0 +1,422 @@
<!---
# Licensed to the Apache Software Foundation (ASF) under one
# or more contributor license agreements. See the NOTICE file
# distributed with this work for additional information
# regarding copyright ownership. The ASF licenses this file
# to you under the Apache License, Version 2.0 (the
# "License"); you may not use this file except in compliance
# with the License. You may obtain a copy of the License at
#
# http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing, software
# distributed under the License is distributed on an "AS IS" BASIS,
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# limitations under the License.
-->
# Apache Hadoop Changelog
## Release 0.19.0 - 2008-11-20
### INCOMPATIBLE CHANGES:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
| [HADOOP-4430](https://issues.apache.org/jira/browse/HADOOP-4430) | Namenode Web UI capacity report is inconsistent with Balancer | Blocker | . | Suresh Srinivas | Suresh Srinivas |
| [HADOOP-4293](https://issues.apache.org/jira/browse/HADOOP-4293) | Remove WritableJobConf | Major | . | Owen O'Malley | Owen O'Malley |
| [HADOOP-4281](https://issues.apache.org/jira/browse/HADOOP-4281) | Capacity reported in some of the commands is not consistent with the Web UI reported data | Blocker | . | Suresh Srinivas | Suresh Srinivas |
| [HADOOP-4227](https://issues.apache.org/jira/browse/HADOOP-4227) | Remove the deprecated, unused class ShellCommand. | Minor | fs | Tsz Wo Nicholas Sze | Tsz Wo Nicholas Sze |
| [HADOOP-4190](https://issues.apache.org/jira/browse/HADOOP-4190) | Changes to JobHistory makes it backward incompatible | Blocker | . | Amar Kamat | Amar Kamat |
| [HADOOP-4116](https://issues.apache.org/jira/browse/HADOOP-4116) | Balancer should provide better resource management | Blocker | . | Raghu Angadi | Hairong Kuang |
| [HADOOP-3981](https://issues.apache.org/jira/browse/HADOOP-3981) | Need a distributed file checksum algorithm for HDFS | Major | . | Tsz Wo Nicholas Sze | Tsz Wo Nicholas Sze |
| [HADOOP-3963](https://issues.apache.org/jira/browse/HADOOP-3963) | libhdfs should never exit on its own but rather return errors to the calling application | Minor | . | Pete Wyckoff | Pete Wyckoff |
| [HADOOP-3938](https://issues.apache.org/jira/browse/HADOOP-3938) | Quotas for disk space management | Major | . | Robert Chansler | Raghu Angadi |
| [HADOOP-3911](https://issues.apache.org/jira/browse/HADOOP-3911) | ' -blocks ' option not being recognized | Minor | fs, util | Koji Noguchi | Lohit Vijayarenu |
| [HADOOP-3889](https://issues.apache.org/jira/browse/HADOOP-3889) | distcp: Better Error Message should be thrown when accessing source files/directory with no read permission | Minor | . | Peeyush Bishnoi | Tsz Wo Nicholas Sze |
| [HADOOP-3837](https://issues.apache.org/jira/browse/HADOOP-3837) | hadoop streaming does not use progress reporting to detect hung tasks | Major | . | dhruba borthakur | dhruba borthakur |
| [HADOOP-3796](https://issues.apache.org/jira/browse/HADOOP-3796) | fuse-dfs should take rw,ro,trashon,trashoff,protected=blah mount arguments rather than them being compiled in | Major | . | Pete Wyckoff | Pete Wyckoff |
| [HADOOP-3792](https://issues.apache.org/jira/browse/HADOOP-3792) | exit code from "hadoop dfs -test ..." is wrong for Unix shell | Minor | fs | Ben Slusky | Ben Slusky |
| [HADOOP-3722](https://issues.apache.org/jira/browse/HADOOP-3722) | Provide a unified way to pass jobconf options from bin/hadoop | Minor | conf | Matei Zaharia | Enis Soztutar |
| [HADOOP-3667](https://issues.apache.org/jira/browse/HADOOP-3667) | Remove deprecated methods in JobConf | Major | . | Amareshwari Sriramadasu | Amareshwari Sriramadasu |
| [HADOOP-3652](https://issues.apache.org/jira/browse/HADOOP-3652) | Remove deprecated class OutputFormatBase | Major | . | Amareshwari Sriramadasu | Amareshwari Sriramadasu |
| [HADOOP-3595](https://issues.apache.org/jira/browse/HADOOP-3595) | Remove deprecated mapred.combine.once functionality | Major | . | Chris Douglas | Chris Douglas |
| [HADOOP-3245](https://issues.apache.org/jira/browse/HADOOP-3245) | Provide ability to persist running jobs (extend HADOOP-1876) | Major | . | Devaraj Das | Amar Kamat |
| [HADOOP-3150](https://issues.apache.org/jira/browse/HADOOP-3150) | Move task file promotion into the task | Major | . | Owen O'Malley | Amareshwari Sriramadasu |
| [HADOOP-3062](https://issues.apache.org/jira/browse/HADOOP-3062) | Need to capture the metrics for the network ios generated by dfs reads/writes and map/reduce shuffling and break them down by racks | Major | metrics | Runping Qi | Chris Douglas |
| [HADOOP-2816](https://issues.apache.org/jira/browse/HADOOP-2816) | Cluster summary at name node web has confusing report for space utilization | Major | . | Robert Chansler | Suresh Srinivas |
| [HADOOP-2325](https://issues.apache.org/jira/browse/HADOOP-2325) | Require Java 6 | Major | build | Doug Cutting | Doug Cutting |
| [HADOOP-1869](https://issues.apache.org/jira/browse/HADOOP-1869) | access times of HDFS files | Major | . | dhruba borthakur | dhruba borthakur |
| [HADOOP-1700](https://issues.apache.org/jira/browse/HADOOP-1700) | Append to files in HDFS | Major | . | stack | dhruba borthakur |
### NEW FEATURES:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
| [HADOOP-4406](https://issues.apache.org/jira/browse/HADOOP-4406) | Make TCTLSeparatedProtocol configurable and have DynamicSerDe initialize, initialize the SerDe | Major | . | Pete Wyckoff | Pete Wyckoff |
| [HADOOP-4301](https://issues.apache.org/jira/browse/HADOOP-4301) | Forrest doc for skip bad records feature | Blocker | documentation | Sharad Agarwal | Sharad Agarwal |
| [HADOOP-4260](https://issues.apache.org/jira/browse/HADOOP-4260) | support show partitions in hive | Major | . | Ashish Thusoo | Ashish Thusoo |
| [HADOOP-4176](https://issues.apache.org/jira/browse/HADOOP-4176) | Implement getFileChecksum(Path) in HftpFileSystem | Major | . | Tsz Wo Nicholas Sze | Tsz Wo Nicholas Sze |
| [HADOOP-4120](https://issues.apache.org/jira/browse/HADOOP-4120) | [Hive] print time taken by query in interactive shell | Minor | . | Raghotham Murthy | Raghotham Murthy |
| [HADOOP-4106](https://issues.apache.org/jira/browse/HADOOP-4106) | add time, permission and user attribute support to fuse-dfs | Major | . | Pete Wyckoff | Pete Wyckoff |
| [HADOOP-4104](https://issues.apache.org/jira/browse/HADOOP-4104) | add time, permission and user attribute support to libhdfs | Major | . | Pete Wyckoff | Pete Wyckoff |
| [HADOOP-4095](https://issues.apache.org/jira/browse/HADOOP-4095) | [Hive] enhance describe table & partition | Major | . | Prasad Chakka | Namit Jain |
| [HADOOP-4086](https://issues.apache.org/jira/browse/HADOOP-4086) | Add limit to Hive QL | Major | . | Ashish Thusoo | Namit Jain |
| [HADOOP-4084](https://issues.apache.org/jira/browse/HADOOP-4084) | Add explain plan capabilities to Hive QL | Major | . | Ashish Thusoo | Ashish Thusoo |
| [HADOOP-4070](https://issues.apache.org/jira/browse/HADOOP-4070) | [Hive] Provide a mechanism for registering UDFs from the query language | Major | . | Tom White | Tom White |
| [HADOOP-3992](https://issues.apache.org/jira/browse/HADOOP-3992) | Synthetic Load Generator for NameNode testing | Major | . | Robert Chansler | Hairong Kuang |
| [HADOOP-3941](https://issues.apache.org/jira/browse/HADOOP-3941) | Extend FileSystem API to return file-checksums/file-digests | Major | fs | Tsz Wo Nicholas Sze | Tsz Wo Nicholas Sze |
| [HADOOP-3939](https://issues.apache.org/jira/browse/HADOOP-3939) | DistCp should support an option for deleting non-existing files. | Major | . | Tsz Wo Nicholas Sze | Tsz Wo Nicholas Sze |
| [HADOOP-3924](https://issues.apache.org/jira/browse/HADOOP-3924) | Add a 'Killed' job status | Critical | . | Alejandro Abdelnur | Subru Krishnan |
| [HADOOP-3873](https://issues.apache.org/jira/browse/HADOOP-3873) | DistCp should have an option for limiting the number of files/bytes being copied | Major | . | Tsz Wo Nicholas Sze | Tsz Wo Nicholas Sze |
| [HADOOP-3854](https://issues.apache.org/jira/browse/HADOOP-3854) | org.apache.hadoop.http.HttpServer should support user configurable filter | Major | util | Tsz Wo Nicholas Sze | Tsz Wo Nicholas Sze |
| [HADOOP-3829](https://issues.apache.org/jira/browse/HADOOP-3829) | Narrow down skipped records based on user acceptable value | Major | . | Sharad Agarwal | Sharad Agarwal |
| [HADOOP-3828](https://issues.apache.org/jira/browse/HADOOP-3828) | Write skipped records' bytes to DFS | Major | . | Sharad Agarwal | Sharad Agarwal |
| [HADOOP-3754](https://issues.apache.org/jira/browse/HADOOP-3754) | Support a Thrift Interface to access files/directories in HDFS | Major | . | dhruba borthakur | dhruba borthakur |
| [HADOOP-3746](https://issues.apache.org/jira/browse/HADOOP-3746) | A fair sharing job scheduler | Minor | . | Matei Zaharia | Matei Zaharia |
| [HADOOP-3730](https://issues.apache.org/jira/browse/HADOOP-3730) | add new JobConf constructor that disables loading default configurations | Major | conf | Alejandro Abdelnur | Alejandro Abdelnur |
| [HADOOP-3714](https://issues.apache.org/jira/browse/HADOOP-3714) | Bash tab completion support | Trivial | scripts | Chris Smith | Chris Smith |
| [HADOOP-3702](https://issues.apache.org/jira/browse/HADOOP-3702) | add support for chaining Maps in a single Map and after a Reduce [M\*/RM\*] | Major | . | Alejandro Abdelnur | Alejandro Abdelnur |
| [HADOOP-3698](https://issues.apache.org/jira/browse/HADOOP-3698) | Implement access control for submitting jobs to queues in the JobTracker | Major | . | Hemanth Yamijala | Hemanth Yamijala |
| [HADOOP-3695](https://issues.apache.org/jira/browse/HADOOP-3695) | [HOD] Have an ability to run multiple slaves per node | Major | contrib/hod | Hemanth Yamijala | Vinod Kumar Vavilapalli |
| [HADOOP-3585](https://issues.apache.org/jira/browse/HADOOP-3585) | Hardware Failure Monitoring in large clusters running Hadoop/HDFS | Minor | metrics | Ioannis Koltsidas | Ioannis Koltsidas |
| [HADOOP-3485](https://issues.apache.org/jira/browse/HADOOP-3485) | fix writes | Minor | . | Pete Wyckoff | Pete Wyckoff |
| [HADOOP-3479](https://issues.apache.org/jira/browse/HADOOP-3479) | Implement configuration items useful for Hadoop resource manager (v1) | Major | conf | Hemanth Yamijala | Hemanth Yamijala |
| [HADOOP-3445](https://issues.apache.org/jira/browse/HADOOP-3445) | Implementing core scheduler functionality in Resource Manager (V1) for Hadoop | Major | . | Vivek Ratan | Vivek Ratan |
| [HADOOP-3402](https://issues.apache.org/jira/browse/HADOOP-3402) | Add example code to support run terasort on hadoop | Major | . | Owen O'Malley | Owen O'Malley |
| [HADOOP-3361](https://issues.apache.org/jira/browse/HADOOP-3361) | Implement renames for NativeS3FileSystem | Major | fs/s3 | Tom White | Tom White |
| [HADOOP-3149](https://issues.apache.org/jira/browse/HADOOP-3149) | supporting multiple outputs for M/R jobs | Major | . | Alejandro Abdelnur | Alejandro Abdelnur |
| [HADOOP-3019](https://issues.apache.org/jira/browse/HADOOP-3019) | want input sampler & sorted partitioner | Major | . | Doug Cutting | Chris Douglas |
| [HADOOP-2664](https://issues.apache.org/jira/browse/HADOOP-2664) | lzop-compatible CompressionCodec | Major | io | Chris Douglas | Chris Douglas |
| [HADOOP-2658](https://issues.apache.org/jira/browse/HADOOP-2658) | Design and Implement a Test Plan to support appends to HDFS files | Blocker | test | dhruba borthakur | dhruba borthakur |
| [HADOOP-2536](https://issues.apache.org/jira/browse/HADOOP-2536) | MapReduce for MySQL | Minor | . | Fredrik Hedberg | Fredrik Hedberg |
| [HADOOP-1823](https://issues.apache.org/jira/browse/HADOOP-1823) | want InputFormat for bzip2 files | Major | . | Doug Cutting | |
| [HADOOP-1480](https://issues.apache.org/jira/browse/HADOOP-1480) | pipes should be able to set user counters | Major | . | Owen O'Malley | Arun C Murthy |
| [HADOOP-372](https://issues.apache.org/jira/browse/HADOOP-372) | should allow to specify different inputformat classes for different input dirs for Map/Reduce jobs | Major | . | Runping Qi | Chris Smith |
| [HADOOP-153](https://issues.apache.org/jira/browse/HADOOP-153) | skip records that fail Task | Major | . | Doug Cutting | Sharad Agarwal |
### IMPROVEMENTS:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
| [HADOOP-4433](https://issues.apache.org/jira/browse/HADOOP-4433) | Improve data loader for collecting metrics and log files from hadoop and system | Major | . | Eric Yang | Eric Yang |
| [HADOOP-4431](https://issues.apache.org/jira/browse/HADOOP-4431) | Add versioning/tags to Chukwa Chunk | Major | . | Jerome Boulon | Jerome Boulon |
| [HADOOP-4424](https://issues.apache.org/jira/browse/HADOOP-4424) | menu layout change for Hadoop documentation | Blocker | documentation | Boris Shkolnik | Boris Shkolnik |
| [HADOOP-4353](https://issues.apache.org/jira/browse/HADOOP-4353) | enable multi-line query from Hive CLI | Minor | . | Prasad Chakka | Prasad Chakka |
| [HADOOP-4345](https://issues.apache.org/jira/browse/HADOOP-4345) | Hive: Check that partitioning predicate is present when hive.partition.pruning = strict | Major | . | Ashish Thusoo | Ashish Thusoo |
| [HADOOP-4307](https://issues.apache.org/jira/browse/HADOOP-4307) | add an option to describe table to show extended properties of the table such as serialization/deserialization properties | Major | . | Prasad Chakka | Prasad Chakka |
| [HADOOP-4279](https://issues.apache.org/jira/browse/HADOOP-4279) | write the random number generator seed to log in the append-related tests | Blocker | test | Tsz Wo Nicholas Sze | Tsz Wo Nicholas Sze |
| [HADOOP-4252](https://issues.apache.org/jira/browse/HADOOP-4252) | Catch Ctrl-C in Hive CLI so that corresponding hadoop jobs can be killed | Minor | . | Prasad Chakka | Pete Wyckoff |
| [HADOOP-4231](https://issues.apache.org/jira/browse/HADOOP-4231) | Hive: converting complex objects to JSON failed. | Minor | . | Zheng Shao | Zheng Shao |
| [HADOOP-4230](https://issues.apache.org/jira/browse/HADOOP-4230) | Hive: GroupBy should not pass the whole row from mapper to reducer | Blocker | . | Zheng Shao | Ashish Thusoo |
| [HADOOP-4205](https://issues.apache.org/jira/browse/HADOOP-4205) | [Hive] metastore and ql to use the refactored SerDe library | Major | . | Zheng Shao | Zheng Shao |
| [HADOOP-4194](https://issues.apache.org/jira/browse/HADOOP-4194) | Add JobConf and JobID to job related methods in JobTrackerInstrumentation | Major | . | Mac Yang | Mac Yang |
| [HADOOP-4181](https://issues.apache.org/jira/browse/HADOOP-4181) | some minor things to make Hadoop friendlier to git | Major | build | Owen O'Malley | Owen O'Malley |
| [HADOOP-4174](https://issues.apache.org/jira/browse/HADOOP-4174) | Move non-client methods out of ClientProtocol | Major | . | Konstantin Shvachko | Konstantin Shvachko |
| [HADOOP-4151](https://issues.apache.org/jira/browse/HADOOP-4151) | Add a memcmp-compatible interface for key types | Minor | . | Chris Douglas | Chris Douglas |
| [HADOOP-4138](https://issues.apache.org/jira/browse/HADOOP-4138) | [Hive] refactor the SerDe library | Major | . | Zheng Shao | Zheng Shao |
| [HADOOP-4124](https://issues.apache.org/jira/browse/HADOOP-4124) | Changing priority of a job should be available in CLI and available on the web UI only along with the Kill Job actions | Major | . | Hemanth Yamijala | Hemanth Yamijala |
| [HADOOP-4117](https://issues.apache.org/jira/browse/HADOOP-4117) | Improve configurability of Hadoop EC2 instances | Major | contrib/cloud | Tom White | Tom White |
| [HADOOP-4113](https://issues.apache.org/jira/browse/HADOOP-4113) | libhdfs should never exit on its own but rather return errors to the calling application - missing diff files | Minor | . | Pete Wyckoff | Pete Wyckoff |
| [HADOOP-4094](https://issues.apache.org/jira/browse/HADOOP-4094) | [Hive] implement hive-site.xml similar to hadoop-site.xml | Minor | . | Prasad Chakka | Prasad Chakka |
| [HADOOP-4090](https://issues.apache.org/jira/browse/HADOOP-4090) | The configuration file lists two paths to hadoop directories (bin and conf). Startup should check that these are valid directories and give appropriate messages. | Minor | . | Ashish Thusoo | Raghotham Murthy |
| [HADOOP-4083](https://issues.apache.org/jira/browse/HADOOP-4083) | change new config attribute queue.name to mapred.job.queue.name | Major | . | Owen O'Malley | Hemanth Yamijala |
| [HADOOP-4075](https://issues.apache.org/jira/browse/HADOOP-4075) | test-patch.sh should output the ant commands that it runs | Major | build | Nigel Daley | Ramya Sunil |
| [HADOOP-4062](https://issues.apache.org/jira/browse/HADOOP-4062) | IPC client does not need to be synchronized on the output stream when a connection is closed | Major | ipc | Hairong Kuang | Hairong Kuang |
| [HADOOP-4053](https://issues.apache.org/jira/browse/HADOOP-4053) | Schedulers need to know when a job has completed | Blocker | . | Vivek Ratan | Amar Kamat |
| [HADOOP-3975](https://issues.apache.org/jira/browse/HADOOP-3975) | test-patch can report the modifications found in the workspace along with the error message | Minor | test | Hemanth Yamijala | Ramya Sunil |
| [HADOOP-3965](https://issues.apache.org/jira/browse/HADOOP-3965) | Make DataBlockScanner package private | Major | . | Konstantin Shvachko | Konstantin Shvachko |
| [HADOOP-3948](https://issues.apache.org/jira/browse/HADOOP-3948) | Separate Namenodes edits and fsimage | Major | . | Lohit Vijayarenu | Lohit Vijayarenu |
| [HADOOP-3944](https://issues.apache.org/jira/browse/HADOOP-3944) | TupleWritable listed as public class but cannot be used without methods private to the package | Trivial | documentation | Michael Andrews | Chris Douglas |
| [HADOOP-3943](https://issues.apache.org/jira/browse/HADOOP-3943) | NetworkTopology.pseudoSortByDistance does not need to be a synchronized method | Major | . | Hairong Kuang | Hairong Kuang |
| [HADOOP-3935](https://issues.apache.org/jira/browse/HADOOP-3935) | Extract classes from DataNode.java | Trivial | . | Johan Oskarsson | Johan Oskarsson |
| [HADOOP-3930](https://issues.apache.org/jira/browse/HADOOP-3930) | Decide how to integrate scheduler info into CLI and job tracker web page | Major | . | Matei Zaharia | Sreekanth Ramakrishnan |
| [HADOOP-3908](https://issues.apache.org/jira/browse/HADOOP-3908) | Better error message if libhdfs.so doesn't exist | Minor | . | Pete Wyckoff | Pete Wyckoff |
| [HADOOP-3905](https://issues.apache.org/jira/browse/HADOOP-3905) | Create a generic interface for edits log. | Major | . | Konstantin Shvachko | Konstantin Shvachko |
| [HADOOP-3892](https://issues.apache.org/jira/browse/HADOOP-3892) | Include Unix group name in JobConf | Trivial | conf | Matei Zaharia | Matei Zaharia |
| [HADOOP-3866](https://issues.apache.org/jira/browse/HADOOP-3866) | Improve Hadoop Jobtracker Admin | Major | scripts | craig weisenfluh | craig weisenfluh |
| [HADOOP-3861](https://issues.apache.org/jira/browse/HADOOP-3861) | Make MapFile.Reader and Writer implement java.io.Closeable | Major | io | Tom White | Tom White |
| [HADOOP-3860](https://issues.apache.org/jira/browse/HADOOP-3860) | Compare name-node performance when journaling is performed into local hard-drives or nfs. | Major | benchmarks | Konstantin Shvachko | Konstantin Shvachko |
| [HADOOP-3853](https://issues.apache.org/jira/browse/HADOOP-3853) | Move multiple input format extension to library package | Major | . | Tom White | Tom White |
| [HADOOP-3852](https://issues.apache.org/jira/browse/HADOOP-3852) | If ShellCommandExecutor had a toString() operator that listed the command run, its error messages may be more meaningful | Minor | util | Steve Loughran | Steve Loughran |
| [HADOOP-3844](https://issues.apache.org/jira/browse/HADOOP-3844) | include message of local exception in Client call failures | Minor | ipc | Steve Loughran | Steve Loughran |
| [HADOOP-3805](https://issues.apache.org/jira/browse/HADOOP-3805) | improve fuse-dfs write performance which is 33% slower than hadoop dfs -copyFromLocal | Minor | . | Pete Wyckoff | |
| [HADOOP-3780](https://issues.apache.org/jira/browse/HADOOP-3780) | JobTracker should synchronously resolve the tasktracker's network location when the tracker registers | Major | . | Amar Kamat | Amar Kamat |
| [HADOOP-3769](https://issues.apache.org/jira/browse/HADOOP-3769) | expose static SampleMapper and SampleReducer classes of GenericMRLoadGenerator class for gridmix reuse | Major | test | Lingyun Yang | Lingyun Yang |
| [HADOOP-3759](https://issues.apache.org/jira/browse/HADOOP-3759) | Provide ability to run memory intensive jobs without affecting other running tasks on the nodes | Major | . | Hemanth Yamijala | Hemanth Yamijala |
| [HADOOP-3747](https://issues.apache.org/jira/browse/HADOOP-3747) | Add counter support to MultipleOutputs | Minor | . | Alejandro Abdelnur | Alejandro Abdelnur |
| [HADOOP-3721](https://issues.apache.org/jira/browse/HADOOP-3721) | CompositeRecordReader::next is unnecessarily complex | Major | . | Chris Douglas | Chris Douglas |
| [HADOOP-3719](https://issues.apache.org/jira/browse/HADOOP-3719) | Chukwa | Major | . | Ari Rabkin | Ari Rabkin |
| [HADOOP-3694](https://issues.apache.org/jira/browse/HADOOP-3694) | if MiniDFS startup time could be improved, testing time would be reduced | Major | test | Steve Loughran | Doug Cutting |
| [HADOOP-3684](https://issues.apache.org/jira/browse/HADOOP-3684) | The data\_join should allow the user to implement a customer cloning function | Major | . | Runping Qi | Runping Qi |
| [HADOOP-3664](https://issues.apache.org/jira/browse/HADOOP-3664) | Remove deprecated methods introduced in changes to validating input paths (HADOOP-3095) | Major | . | Tom White | Tom White |
| [HADOOP-3661](https://issues.apache.org/jira/browse/HADOOP-3661) | Normalize fuse-dfs handling of moving things to trash wrt the way hadoop dfs does it (only when non posix trash flag is enabled in compile) | Major | . | Pete Wyckoff | Pete Wyckoff |
| [HADOOP-3660](https://issues.apache.org/jira/browse/HADOOP-3660) | Add replication factor for injecting blocks in the data node cluster | Major | benchmarks | Sanjay Radia | Sanjay Radia |
| [HADOOP-3655](https://issues.apache.org/jira/browse/HADOOP-3655) | provide more control options for the junit run | Minor | build | Steve Loughran | Steve Loughran |
| [HADOOP-3646](https://issues.apache.org/jira/browse/HADOOP-3646) | Providing bzip2 as codec | Major | conf, io | Abdul Qadeer | Abdul Qadeer |
| [HADOOP-3638](https://issues.apache.org/jira/browse/HADOOP-3638) | Cache the iFile index files in memory to reduce seeks during map output serving | Major | . | Devaraj Das | Jothi Padmanabhan |
| [HADOOP-3624](https://issues.apache.org/jira/browse/HADOOP-3624) | CreateEditsLog could be improved to create tree directory structure | Minor | test | Lohit Vijayarenu | Lohit Vijayarenu |
| [HADOOP-3620](https://issues.apache.org/jira/browse/HADOOP-3620) | Namenode should synchronously resolve a datanode's network location when the datanode registers | Major | . | Hairong Kuang | Hairong Kuang |
| [HADOOP-3617](https://issues.apache.org/jira/browse/HADOOP-3617) | Writes from map serialization include redundant checks for accounting space | Major | . | Chris Douglas | Chris Douglas |
| [HADOOP-3605](https://issues.apache.org/jira/browse/HADOOP-3605) | Added an abort on unset AWS\_ACCOUNT\_ID to launch-hadoop-master | Minor | contrib/cloud | Al Hoang | Al Hoang |
| [HADOOP-3581](https://issues.apache.org/jira/browse/HADOOP-3581) | Prevent memory intensive user tasks from taking down nodes | Major | . | Hemanth Yamijala | Vinod Kumar Vavilapalli |
| [HADOOP-3577](https://issues.apache.org/jira/browse/HADOOP-3577) | Tools to inject blocks into name node and simulated data nodes for testing | Minor | . | Sanjay Radia | Sanjay Radia |
| [HADOOP-3563](https://issues.apache.org/jira/browse/HADOOP-3563) | Separate out datanode and namenode functionality of generation stamp upgrade process | Major | . | dhruba borthakur | dhruba borthakur |
| [HADOOP-3556](https://issues.apache.org/jira/browse/HADOOP-3556) | Substitute the synchronized code in MD5Hash to avoid lock contention. Use ThreadLocal instead. | Major | io | Iván de Prado | Iván de Prado |
| [HADOOP-3549](https://issues.apache.org/jira/browse/HADOOP-3549) | meaningful errno values in libhdfs | Major | . | Ben Slusky | Ben Slusky |
| [HADOOP-3514](https://issues.apache.org/jira/browse/HADOOP-3514) | Reduce seeks during shuffle, by inline crcs | Major | . | Devaraj Das | Jothi Padmanabhan |
| [HADOOP-3498](https://issues.apache.org/jira/browse/HADOOP-3498) | File globbing alternation should be able to span path components | Major | fs | Tom White | Tom White |
| [HADOOP-3478](https://issues.apache.org/jira/browse/HADOOP-3478) | The algorithm to decide map re-execution on fetch failures can be improved | Major | . | Jothi Padmanabhan | Jothi Padmanabhan |
| [HADOOP-3446](https://issues.apache.org/jira/browse/HADOOP-3446) | The reduce task should not flush the in memory file system before starting the reducer | Critical | . | Owen O'Malley | Chris Douglas |
| [HADOOP-3412](https://issues.apache.org/jira/browse/HADOOP-3412) | Refactor the scheduler out of the JobTracker | Minor | . | Brice Arnould | Brice Arnould |
| [HADOOP-3368](https://issues.apache.org/jira/browse/HADOOP-3368) | Can commons-logging.properties be pulled from hadoop-core? | Major | build | Steve Loughran | Steve Loughran |
| [HADOOP-3342](https://issues.apache.org/jira/browse/HADOOP-3342) | Better safety of killing jobs via web interface | Minor | . | Daniel Naber | Enis Soztutar |
| [HADOOP-3341](https://issues.apache.org/jira/browse/HADOOP-3341) | make key-value separators in hadoop streaming fully configurable | Major | . | Zheng Shao | Zheng Shao |
| [HADOOP-3328](https://issues.apache.org/jira/browse/HADOOP-3328) | DFS write pipeline : only the last datanode needs to verify checksum | Major | . | Raghu Angadi | Raghu Angadi |
| [HADOOP-3202](https://issues.apache.org/jira/browse/HADOOP-3202) | Deprecate org.apache.hadoop.fs.FileUtil.fullyDelete(FileSystem fs, Path dir) | Major | fs | Tsz Wo Nicholas Sze | Amareshwari Sriramadasu |
| [HADOOP-3169](https://issues.apache.org/jira/browse/HADOOP-3169) | LeaseChecker daemon should not be started in DFSClient constructor | Major | . | Tsz Wo Nicholas Sze | Tsz Wo Nicholas Sze |
| [HADOOP-2411](https://issues.apache.org/jira/browse/HADOOP-2411) | Add support for larger EC2 instance types | Major | contrib/cloud | Tom White | Chris K Wensel |
| [HADOOP-2330](https://issues.apache.org/jira/browse/HADOOP-2330) | Preallocate transaction log to improve namenode transaction logging performance | Major | . | dhruba borthakur | dhruba borthakur |
| [HADOOP-2302](https://issues.apache.org/jira/browse/HADOOP-2302) | Streaming should provide an option for numerical sort of keys | Major | . | Lohit Vijayarenu | Devaraj Das |
| [HADOOP-2165](https://issues.apache.org/jira/browse/HADOOP-2165) | Augment JobHistory to store tasks' userlogs | Major | . | Arun C Murthy | Vinod Kumar Vavilapalli |
| [HADOOP-2130](https://issues.apache.org/jira/browse/HADOOP-2130) | Pipes submit job should be Non-blocking | Critical | . | Srikanth Kakani | Arun C Murthy |
| [HADOOP-1627](https://issues.apache.org/jira/browse/HADOOP-1627) | DFSAdmin incorrectly reports cluster data. | Minor | . | Konstantin Shvachko | Raghu Angadi |
| [HADOOP-657](https://issues.apache.org/jira/browse/HADOOP-657) | Free temporary space should be modelled better | Major | . | Owen O'Malley | Ari Rabkin |
| [HADOOP-249](https://issues.apache.org/jira/browse/HADOOP-249) | Improving Map -\> Reduce performance and Task JVM reuse | Major | . | Benjamin Reed | Devaraj Das |
### BUG FIXES:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
| [HADOOP-4595](https://issues.apache.org/jira/browse/HADOOP-4595) | JVM Reuse triggers RuntimeException("Invalid state") | Major | . | Aaron Kimball | Devaraj Das |
| [HADOOP-4552](https://issues.apache.org/jira/browse/HADOOP-4552) | Deadlock in RPC Server | Major | ipc | Raghu Angadi | Raghu Angadi |
| [HADOOP-4525](https://issues.apache.org/jira/browse/HADOOP-4525) | config ipc.server.tcpnodelay is no longer being respected | Major | ipc | Clint Morgan | Clint Morgan |
| [HADOOP-4510](https://issues.apache.org/jira/browse/HADOOP-4510) | FileOutputFormat protects getTaskOutputPath | Blocker | . | Chris K Wensel | Chris K Wensel |
| [HADOOP-4500](https://issues.apache.org/jira/browse/HADOOP-4500) | multifilesplit is using job default filesystem incorrectly | Major | . | Joydeep Sen Sarma | Joydeep Sen Sarma |
| [HADOOP-4498](https://issues.apache.org/jira/browse/HADOOP-4498) | JobHistory does not escape literal jobName when used in a regex pattern | Blocker | . | Chris K Wensel | Chris K Wensel |
| [HADOOP-4471](https://issues.apache.org/jira/browse/HADOOP-4471) | Capacity Scheduler should maintain the right ordering of jobs in its running queue | Blocker | . | Vivek Ratan | Amar Kamat |
| [HADOOP-4466](https://issues.apache.org/jira/browse/HADOOP-4466) | SequenceFileOutputFormat is coupled to WritableComparable and Writable | Blocker | io | Chris K Wensel | Chris K Wensel |
| [HADOOP-4457](https://issues.apache.org/jira/browse/HADOOP-4457) | Input split logging in history is broken in 0.19 | Blocker | . | Amareshwari Sriramadasu | Amareshwari Sriramadasu |
| [HADOOP-4455](https://issues.apache.org/jira/browse/HADOOP-4455) | Upload the derby.jar and TestSeDe.jar needed for fixes to 0.19 bugs | Blocker | . | Ashish Thusoo | Ashish Thusoo |
| [HADOOP-4449](https://issues.apache.org/jira/browse/HADOOP-4449) | Minor formatting changes to quota related commands | Trivial | . | Raghu Angadi | Raghu Angadi |
| [HADOOP-4446](https://issues.apache.org/jira/browse/HADOOP-4446) | Update Scheduling Information display in Web UI | Major | . | Karam Singh | Sreekanth Ramakrishnan |
| [HADOOP-4439](https://issues.apache.org/jira/browse/HADOOP-4439) | Cleanup memory related resource management | Blocker | . | Hemanth Yamijala | Hemanth Yamijala |
| [HADOOP-4438](https://issues.apache.org/jira/browse/HADOOP-4438) | Add new/missing dfs commands in forrest | Blocker | documentation | Hemanth Yamijala | Suresh Srinivas |
| [HADOOP-4427](https://issues.apache.org/jira/browse/HADOOP-4427) | Add new/missing commands in forrest | Blocker | documentation | Sharad Agarwal | Sreekanth Ramakrishnan |
| [HADOOP-4425](https://issues.apache.org/jira/browse/HADOOP-4425) | Edits log takes much longer to load | Blocker | . | Chris Douglas | Chris Douglas |
| [HADOOP-4423](https://issues.apache.org/jira/browse/HADOOP-4423) | FSDataset.getStoredBlock(id) should not return corrupted information | Blocker | . | Tsz Wo Nicholas Sze | Tsz Wo Nicholas Sze |
| [HADOOP-4418](https://issues.apache.org/jira/browse/HADOOP-4418) | Update documentation in forrest for Mapred, streaming and pipes | Blocker | documentation | Amareshwari Sriramadasu | Amareshwari Sriramadasu |
| [HADOOP-4410](https://issues.apache.org/jira/browse/HADOOP-4410) | TestMiniMRDebugScript fails on trunk | Blocker | . | Amareshwari Sriramadasu | Amareshwari Sriramadasu |
| [HADOOP-4405](https://issues.apache.org/jira/browse/HADOOP-4405) | all creation of hadoop dfs queries from with in hive shell | Minor | . | Prasad Chakka | Prasad Chakka |
| [HADOOP-4404](https://issues.apache.org/jira/browse/HADOOP-4404) | saveFSImage() should remove files from a storage directory that do not correspond to its type. | Blocker | . | Konstantin Shvachko | Konstantin Shvachko |
| [HADOOP-4400](https://issues.apache.org/jira/browse/HADOOP-4400) | Add "hdfs://" to fs.default.name on quickstart.html | Trivial | documentation | Jeff Hammerbacher | Jeff Hammerbacher |
| [HADOOP-4393](https://issues.apache.org/jira/browse/HADOOP-4393) | Merge AccessControlException and AccessControlIOException into one exception class | Blocker | fs | Owen O'Malley | Owen O'Malley |
| [HADOOP-4387](https://issues.apache.org/jira/browse/HADOOP-4387) | TestHDFSFileSystemContract fails on windows | Blocker | test | Raghu Angadi | Raghu Angadi |
| [HADOOP-4380](https://issues.apache.org/jira/browse/HADOOP-4380) | Make new classes in mapred package private instead of public | Major | . | Owen O'Malley | Owen O'Malley |
| [HADOOP-4378](https://issues.apache.org/jira/browse/HADOOP-4378) | TestJobQueueInformation fails regularly | Blocker | test | Tsz Wo Nicholas Sze | Sreekanth Ramakrishnan |
| [HADOOP-4376](https://issues.apache.org/jira/browse/HADOOP-4376) | Fix line formatting in hadoop-default.xml for hadoop.http.filter.initializers | Blocker | conf | Enis Soztutar | Enis Soztutar |
| [HADOOP-4373](https://issues.apache.org/jira/browse/HADOOP-4373) | Guaranteed Capacity calculation is not calculated correctly | Blocker | . | Karam Singh | Hemanth Yamijala |
| [HADOOP-4367](https://issues.apache.org/jira/browse/HADOOP-4367) | Hive: UDAF functions cannot handle NULL values | Major | . | Zheng Shao | Zheng Shao |
| [HADOOP-4366](https://issues.apache.org/jira/browse/HADOOP-4366) | Provide way to replace existing column names for columnSet tables | Major | . | Prasad Chakka | Prasad Chakka |
| [HADOOP-4361](https://issues.apache.org/jira/browse/HADOOP-4361) | Corner cases in killJob from command line | Blocker | . | Amareshwari Sriramadasu | Amareshwari Sriramadasu |
| [HADOOP-4358](https://issues.apache.org/jira/browse/HADOOP-4358) | NPE from CreateEditsLog | Blocker | test | Chris Douglas | Raghu Angadi |
| [HADOOP-4356](https://issues.apache.org/jira/browse/HADOOP-4356) | [Hive] for a 2-stage map-reduce job, number of reducers not set correctly | Major | . | Namit Jain | Namit Jain |
| [HADOOP-4355](https://issues.apache.org/jira/browse/HADOOP-4355) | hive 2 case sensitivity issues | Major | . | Zheng Shao | |
| [HADOOP-4344](https://issues.apache.org/jira/browse/HADOOP-4344) | Hive: Partition pruning causes semantic exception with joins | Major | . | Ashish Thusoo | Ashish Thusoo |
| [HADOOP-4342](https://issues.apache.org/jira/browse/HADOOP-4342) | [hive] bug in partition pruning | Major | . | Namit Jain | Namit Jain |
| [HADOOP-4336](https://issues.apache.org/jira/browse/HADOOP-4336) | fix sampling bug in fractional bucket case | Minor | . | Prasad Chakka | Prasad Chakka |
| [HADOOP-4335](https://issues.apache.org/jira/browse/HADOOP-4335) | FsShell -ls fails for file systems without owners or groups | Major | scripts | David Phillips | David Phillips |
| [HADOOP-4333](https://issues.apache.org/jira/browse/HADOOP-4333) | add ability to drop partitions through DDL | Minor | . | Prasad Chakka | Prasad Chakka |
| [HADOOP-4330](https://issues.apache.org/jira/browse/HADOOP-4330) | Hive: AS clause with subqueries having group bys is not propagated to the outer query block | Major | . | Ashish Thusoo | Ashish Thusoo |
| [HADOOP-4329](https://issues.apache.org/jira/browse/HADOOP-4329) | Hive: [] operator with maps does not work | Major | . | Ashish Thusoo | Ashish Thusoo |
| [HADOOP-4327](https://issues.apache.org/jira/browse/HADOOP-4327) | Create table hive does not set delimiters | Major | . | Edward Capriolo | Namit Jain |
| [HADOOP-4321](https://issues.apache.org/jira/browse/HADOOP-4321) | Document the capacity scheduler in Forrest | Blocker | documentation | Hemanth Yamijala | Hemanth Yamijala |
| [HADOOP-4320](https://issues.apache.org/jira/browse/HADOOP-4320) | [Hive] TCTLSeparatedProtocol implement maps/lists/sets read/writes | Major | . | Pete Wyckoff | |
| [HADOOP-4319](https://issues.apache.org/jira/browse/HADOOP-4319) | fuse-dfs dfs\_read function may return less than the requested # of bytes even if EOF not reached | Blocker | . | Pete Wyckoff | Pete Wyckoff |
| [HADOOP-4316](https://issues.apache.org/jira/browse/HADOOP-4316) | [Hive] extra new lines at output | Major | . | Namit Jain | Namit Jain |
| [HADOOP-4315](https://issues.apache.org/jira/browse/HADOOP-4315) | Hive: Cleanup temporary files once the job is done | Major | . | Ashish Thusoo | Ashish Thusoo |
| [HADOOP-4309](https://issues.apache.org/jira/browse/HADOOP-4309) | eclipse-plugin no longer compiles on trunk | Blocker | contrib/eclipse-plugin | Chris Douglas | Chris Douglas |
| [HADOOP-4303](https://issues.apache.org/jira/browse/HADOOP-4303) | Hive: trim and rtrim UDFs behaviors are reversed | Major | . | Ashish Thusoo | Ashish Thusoo |
| [HADOOP-4302](https://issues.apache.org/jira/browse/HADOOP-4302) | TestReduceFetch fails intermittently | Blocker | . | Devaraj Das | Chris Douglas |
| [HADOOP-4299](https://issues.apache.org/jira/browse/HADOOP-4299) | Unable to access a file by a different user in the same group when permissions is set to 770 or when permissions is turned OFF | Blocker | . | Ramya Sunil | Hairong Kuang |
| [HADOOP-4296](https://issues.apache.org/jira/browse/HADOOP-4296) | Spasm of JobClient failures on successful jobs every once in a while | Blocker | . | Joydeep Sen Sarma | dhruba borthakur |
| [HADOOP-4294](https://issues.apache.org/jira/browse/HADOOP-4294) | Hive: Parser should pass field schema to SerDe | Major | . | Zheng Shao | |
| [HADOOP-4288](https://issues.apache.org/jira/browse/HADOOP-4288) | java.lang.NullPointerException is observed in Jobtracker log while call heartbeat | Blocker | . | Karam Singh | Amar Kamat |
| [HADOOP-4287](https://issues.apache.org/jira/browse/HADOOP-4287) | [mapred] jobqueue\_details.jsp shows negative count of running and waiting reduces with CapacityTaskScheduler. | Blocker | . | Vinod Kumar Vavilapalli | Sreekanth Ramakrishnan |
| [HADOOP-4282](https://issues.apache.org/jira/browse/HADOOP-4282) | User configurable filter fails to filter accesses to certain directories | Blocker | . | Kan Zhang | Tsz Wo Nicholas Sze |
| [HADOOP-4280](https://issues.apache.org/jira/browse/HADOOP-4280) | test-libhdfs consistently fails on trunk | Blocker | . | Raghu Angadi | Pete Wyckoff |
| [HADOOP-4278](https://issues.apache.org/jira/browse/HADOOP-4278) | TestDatanodeDeath failed occasionally | Blocker | . | Tsz Wo Nicholas Sze | dhruba borthakur |
| [HADOOP-4275](https://issues.apache.org/jira/browse/HADOOP-4275) | New public methods added to the \*ID classes | Major | . | Owen O'Malley | Owen O'Malley |
| [HADOOP-4274](https://issues.apache.org/jira/browse/HADOOP-4274) | Capacity scheduler's implementation of getJobs modifies the list of running jobs inadvertently | Blocker | . | Hemanth Yamijala | Hemanth Yamijala |
| [HADOOP-4273](https://issues.apache.org/jira/browse/HADOOP-4273) | [Hive] job submission exception if input is null | Major | . | Namit Jain | Namit Jain |
| [HADOOP-4272](https://issues.apache.org/jira/browse/HADOOP-4272) | Hive: metadataTypedColumnsetSerDe should check if SERIALIZATION.LIB is old columnsetSerDe | Major | . | Zheng Shao | Prasad Chakka |
| [HADOOP-4269](https://issues.apache.org/jira/browse/HADOOP-4269) | LineRecordReader.LineReader should use util.LineReader | Major | util | Chris Douglas | Chris Douglas |
| [HADOOP-4267](https://issues.apache.org/jira/browse/HADOOP-4267) | TestDBJob failed on Linux | Blocker | . | Raghu Angadi | Enis Soztutar |
| [HADOOP-4266](https://issues.apache.org/jira/browse/HADOOP-4266) | Hive: Support "IS NULL", "IS NOT NULL", and size(x) for map and list | Major | . | Zheng Shao | Zheng Shao |
| [HADOOP-4265](https://issues.apache.org/jira/browse/HADOOP-4265) | [Hive] error when user specifies the delimiter | Major | . | Namit Jain | Namit Jain |
| [HADOOP-4261](https://issues.apache.org/jira/browse/HADOOP-4261) | Jobs failing in the init stage will never cleanup | Blocker | . | Amar Kamat | Amareshwari Sriramadasu |
| [HADOOP-4256](https://issues.apache.org/jira/browse/HADOOP-4256) | Remove Completed and Failed Job tables from jobqueue\_details.jsp | Blocker | . | Sreekanth Ramakrishnan | Sreekanth Ramakrishnan |
| [HADOOP-4254](https://issues.apache.org/jira/browse/HADOOP-4254) | Cannot setSpaceQuota to 1TB | Blocker | . | Tsz Wo Nicholas Sze | Raghu Angadi |
| [HADOOP-4250](https://issues.apache.org/jira/browse/HADOOP-4250) | Remove short names of serdes from Deserializer, Serializer & SerDe interface and relevant code. | Major | . | Prasad Chakka | Prasad Chakka |
| [HADOOP-4249](https://issues.apache.org/jira/browse/HADOOP-4249) | Declare hsqldb.jar in eclipse plugin | Blocker | contrib/eclipse-plugin | Tsz Wo Nicholas Sze | Tsz Wo Nicholas Sze |
| [HADOOP-4248](https://issues.apache.org/jira/browse/HADOOP-4248) | Remove HADOOP-1230 API from 0.19 | Major | . | Owen O'Malley | Owen O'Malley |
| [HADOOP-4247](https://issues.apache.org/jira/browse/HADOOP-4247) | hadoop jar throwing exception when running examples | Blocker | . | Hemanth Yamijala | Owen O'Malley |
| [HADOOP-4246](https://issues.apache.org/jira/browse/HADOOP-4246) | Reduce task copy errors may not kill it eventually | Blocker | . | Amareshwari Sriramadasu | Amareshwari Sriramadasu |
| [HADOOP-4242](https://issues.apache.org/jira/browse/HADOOP-4242) | Remove an extra ";" in FSDirectory | Blocker | . | Tsz Wo Nicholas Sze | Tsz Wo Nicholas Sze |
| [HADOOP-4241](https://issues.apache.org/jira/browse/HADOOP-4241) | -hiveconf config parameters in hive cli should override all config variables | Major | . | Joydeep Sen Sarma | Joydeep Sen Sarma |
| [HADOOP-4236](https://issues.apache.org/jira/browse/HADOOP-4236) | JobTracker.killJob() fails to kill a job if the job is not yet initialized | Blocker | . | Amar Kamat | Sharad Agarwal |
| [HADOOP-4232](https://issues.apache.org/jira/browse/HADOOP-4232) | Race condition in JVM reuse when more than one slot becomes free | Blocker | . | Devaraj Das | Devaraj Das |
| [HADOOP-4225](https://issues.apache.org/jira/browse/HADOOP-4225) | FSEditLog logs modification time instead of access time. | Blocker | . | Konstantin Shvachko | Konstantin Shvachko |
| [HADOOP-4213](https://issues.apache.org/jira/browse/HADOOP-4213) | NPE in TestLimitTasksPerJobTaskScheduler | Major | test | Tsz Wo Nicholas Sze | Sreekanth Ramakrishnan |
| [HADOOP-4209](https://issues.apache.org/jira/browse/HADOOP-4209) | The TaskAttemptID should not have the JobTracker start time | Blocker | . | Owen O'Malley | Amar Kamat |
| [HADOOP-4200](https://issues.apache.org/jira/browse/HADOOP-4200) | Hadoop-Patch build is failing | Major | build | Ramya Sunil | Ramya Sunil |
| [HADOOP-4197](https://issues.apache.org/jira/browse/HADOOP-4197) | Need to update DATA\_TRANSFER\_VERSION | Major | . | Tsz Wo Nicholas Sze | Tsz Wo Nicholas Sze |
| [HADOOP-4195](https://issues.apache.org/jira/browse/HADOOP-4195) | SequenceFile.Writer close() uses compressor after returning it to CodecPool. | Major | io | Hong Tang | Arun C Murthy |
| [HADOOP-4189](https://issues.apache.org/jira/browse/HADOOP-4189) | HADOOP-3245 is incomplete | Blocker | . | Amar Kamat | Amar Kamat |
| [HADOOP-4183](https://issues.apache.org/jira/browse/HADOOP-4183) | select \* to console issues in Hive | Major | . | Joydeep Sen Sarma | |
| [HADOOP-4175](https://issues.apache.org/jira/browse/HADOOP-4175) | Incorporate metastore server review comments | Major | . | Prasad Chakka | Prasad Chakka |
| [HADOOP-4173](https://issues.apache.org/jira/browse/HADOOP-4173) | TestProcfsBasedProcessTree failing on Windows machine | Major | test, util | Ramya Sunil | Vinod Kumar Vavilapalli |
| [HADOOP-4169](https://issues.apache.org/jira/browse/HADOOP-4169) | 'compressed' keyword in DDL syntax misleading and does not compress | Major | . | Joydeep Sen Sarma | Joydeep Sen Sarma |
| [HADOOP-4163](https://issues.apache.org/jira/browse/HADOOP-4163) | If a reducer failed at shuffling stage, the task should fail, not just logging an exception | Blocker | . | Runping Qi | Sharad Agarwal |
| [HADOOP-4155](https://issues.apache.org/jira/browse/HADOOP-4155) | JobHistory::JOBTRACKER\_START\_TIME is not initialized properly | Major | . | Lohit Vijayarenu | Lohit Vijayarenu |
| [HADOOP-4154](https://issues.apache.org/jira/browse/HADOOP-4154) | Fix javac warning in WritableUtils | Minor | io | Tsz Wo Nicholas Sze | Tsz Wo Nicholas Sze |
| [HADOOP-4149](https://issues.apache.org/jira/browse/HADOOP-4149) | JobQueueJobInProgressListener.jobUpdated() might not work as expected | Blocker | . | Amar Kamat | Amar Kamat |
| [HADOOP-4147](https://issues.apache.org/jira/browse/HADOOP-4147) | Remove JobWithTaskContext from JobInProgress | Trivial | . | Amar Kamat | Amareshwari Sriramadasu |
| [HADOOP-4146](https://issues.apache.org/jira/browse/HADOOP-4146) | [Hive] null pointer exception on a join | Major | . | Namit Jain | Namit Jain |
| [HADOOP-4139](https://issues.apache.org/jira/browse/HADOOP-4139) | [Hive] multi group by statement is not optimized | Major | . | Namit Jain | Namit Jain |
| [HADOOP-4135](https://issues.apache.org/jira/browse/HADOOP-4135) | change max length of database columns for metastore to 767 | Minor | . | Prasad Chakka | Prasad Chakka |
| [HADOOP-4133](https://issues.apache.org/jira/browse/HADOOP-4133) | remove derby.log files from repository and also change the location where these files get created | Minor | . | Prasad Chakka | Prasad Chakka |
| [HADOOP-4129](https://issues.apache.org/jira/browse/HADOOP-4129) | Memory limits of TaskTracker and Tasks should be in kiloBytes. | Blocker | . | Vinod Kumar Vavilapalli | Vinod Kumar Vavilapalli |
| [HADOOP-4125](https://issues.apache.org/jira/browse/HADOOP-4125) | Reduce cleanup tip web ui does not show attempts | Major | . | Amareshwari Sriramadasu | Amareshwari Sriramadasu |
| [HADOOP-4121](https://issues.apache.org/jira/browse/HADOOP-4121) | HistoryViewer initialization failure should log exception trace | Trivial | . | Amareshwari Sriramadasu | Amareshwari Sriramadasu |
| [HADOOP-4112](https://issues.apache.org/jira/browse/HADOOP-4112) | Got ArrayOutOfBound exception while analyzing the job history | Major | . | Amar Kamat | Amareshwari Sriramadasu |
| [HADOOP-4100](https://issues.apache.org/jira/browse/HADOOP-4100) | Scheduler.assignTasks should not be dealing with cleanupTask | Major | . | Devaraj Das | Amareshwari Sriramadasu |
| [HADOOP-4099](https://issues.apache.org/jira/browse/HADOOP-4099) | HFTP interface compatibility with older releases broken | Blocker | fs | Kan Zhang | dhruba borthakur |
| [HADOOP-4097](https://issues.apache.org/jira/browse/HADOOP-4097) | Hive interaction with speculative execution is broken | Critical | . | Joydeep Sen Sarma | Joydeep Sen Sarma |
| [HADOOP-4093](https://issues.apache.org/jira/browse/HADOOP-4093) | [Hive]unify Table.getCols() & get\_fields() | Major | . | Prasad Chakka | Prasad Chakka |
| [HADOOP-4089](https://issues.apache.org/jira/browse/HADOOP-4089) | Check if the tmp file used in the CLI exists before using it. | Major | . | Ashish Thusoo | |
| [HADOOP-4087](https://issues.apache.org/jira/browse/HADOOP-4087) | Make Hive metastore server to work for PHP & Python clients | Major | . | Prasad Chakka | Prasad Chakka |
| [HADOOP-4078](https://issues.apache.org/jira/browse/HADOOP-4078) | TestKosmosFileSystem fails on trunk | Blocker | fs | Amareshwari Sriramadasu | Lohit Vijayarenu |
| [HADOOP-4077](https://issues.apache.org/jira/browse/HADOOP-4077) | Access permissions for setting access times and modification times for files | Blocker | . | dhruba borthakur | dhruba borthakur |
| [HADOOP-4071](https://issues.apache.org/jira/browse/HADOOP-4071) | FSNameSystem.isReplicationInProgress should add an underReplicated block to the neededReplication queue using method "add" not "update" | Major | . | Hairong Kuang | Hairong Kuang |
| [HADOOP-4054](https://issues.apache.org/jira/browse/HADOOP-4054) | During edit log loading, an underconstruction file's lease gets removed twice | Major | . | Hairong Kuang | Hairong Kuang |
| [HADOOP-4050](https://issues.apache.org/jira/browse/HADOOP-4050) | TestFairScheduler failed on Linux | Major | . | Tsz Wo Nicholas Sze | Matei Zaharia |
| [HADOOP-4036](https://issues.apache.org/jira/browse/HADOOP-4036) | Increment InterTrackerProtocol version number due to changes in HADOOP-3759 | Major | . | Hemanth Yamijala | Hemanth Yamijala |
| [HADOOP-4030](https://issues.apache.org/jira/browse/HADOOP-4030) | LzopCodec shouldn't be in the default list of codecs i.e. io.compression.codecs | Major | io | Arun C Murthy | Arun C Murthy |
| [HADOOP-4027](https://issues.apache.org/jira/browse/HADOOP-4027) | When streaming utility is run without specifying mapper/reducer/input/output options, it returns 0. | Major | . | Ramya Sunil | |
| [HADOOP-4023](https://issues.apache.org/jira/browse/HADOOP-4023) | javadoc warnings: incorrect references | Major | documentation | Tsz Wo Nicholas Sze | Owen O'Malley |
| [HADOOP-4018](https://issues.apache.org/jira/browse/HADOOP-4018) | limit memory usage in jobtracker | Major | . | dhruba borthakur | dhruba borthakur |
| [HADOOP-4014](https://issues.apache.org/jira/browse/HADOOP-4014) | DFS upgrade fails on Windows | Blocker | fs | NOMURA Yoshihide | Konstantin Shvachko |
| [HADOOP-3991](https://issues.apache.org/jira/browse/HADOOP-3991) | updates to hadoop-ec2-env.sh for 0.18.0 | Minor | contrib/cloud | Karl Anderson | Tom White |
| [HADOOP-3985](https://issues.apache.org/jira/browse/HADOOP-3985) | TestHDFSServerPorts fails on trunk | Major | . | Amar Kamat | Hairong Kuang |
| [HADOOP-3970](https://issues.apache.org/jira/browse/HADOOP-3970) | Counters written to the job history cannot be recovered back | Major | . | Amar Kamat | Amar Kamat |
| [HADOOP-3968](https://issues.apache.org/jira/browse/HADOOP-3968) | test-libhdfs fails on trunk | Major | . | Lohit Vijayarenu | Pete Wyckoff |
| [HADOOP-3964](https://issues.apache.org/jira/browse/HADOOP-3964) | javadoc warnings by failmon | Major | build | Tsz Wo Nicholas Sze | dhruba borthakur |
| [HADOOP-3962](https://issues.apache.org/jira/browse/HADOOP-3962) | Shell command "fs -count" should support paths with different file systems | Major | fs | Tsz Wo Nicholas Sze | Tsz Wo Nicholas Sze |
| [HADOOP-3961](https://issues.apache.org/jira/browse/HADOOP-3961) | resource estimation works badly in some cases | Blocker | . | Ari Rabkin | Ari Rabkin |
| [HADOOP-3959](https://issues.apache.org/jira/browse/HADOOP-3959) | [HOD] --resource\_manager.options is not passed to qsub | Major | contrib/hod | Craig Macdonald | Vinod Kumar Vavilapalli |
| [HADOOP-3958](https://issues.apache.org/jira/browse/HADOOP-3958) | TestMapRed ignores failures of the test case | Major | test | Owen O'Malley | Owen O'Malley |
| [HADOOP-3957](https://issues.apache.org/jira/browse/HADOOP-3957) | Fix javac warnings in DistCp and the corresponding tests | Minor | . | Tsz Wo Nicholas Sze | Tsz Wo Nicholas Sze |
| [HADOOP-3954](https://issues.apache.org/jira/browse/HADOOP-3954) | Skip records enabled as default. | Critical | . | Koji Noguchi | Sharad Agarwal |
| [HADOOP-3952](https://issues.apache.org/jira/browse/HADOOP-3952) | TestDataJoin references dfs.MiniDFSCluster instead of hdfs.MiniDFSCluster | Major | test | Owen O'Malley | Owen O'Malley |
| [HADOOP-3951](https://issues.apache.org/jira/browse/HADOOP-3951) | The package name used in FSNamesystem is incorrect | Trivial | . | Tsz Wo Nicholas Sze | Chris Douglas |
| [HADOOP-3950](https://issues.apache.org/jira/browse/HADOOP-3950) | TestMapRed and TestMiniMRDFSSort failed on trunk | Major | test | Tsz Wo Nicholas Sze | Enis Soztutar |
| [HADOOP-3949](https://issues.apache.org/jira/browse/HADOOP-3949) | javadoc warnings: Multiple sources of package comments found for package | Major | build, documentation | Tsz Wo Nicholas Sze | Jerome Boulon |
| [HADOOP-3946](https://issues.apache.org/jira/browse/HADOOP-3946) | TestMapRed fails on trunk | Blocker | test | Amareshwari Sriramadasu | Tom White |
| [HADOOP-3937](https://issues.apache.org/jira/browse/HADOOP-3937) | Job history may get disabled due to overly long job names | Major | . | Matei Zaharia | Matei Zaharia |
| [HADOOP-3933](https://issues.apache.org/jira/browse/HADOOP-3933) | DataNode's BlockSender sends more data than necessary | Minor | . | Ning Li | Ning Li |
| [HADOOP-3919](https://issues.apache.org/jira/browse/HADOOP-3919) | hadoop conf got slightly mangled by 3772 | Minor | . | Ari Rabkin | Ari Rabkin |
| [HADOOP-3910](https://issues.apache.org/jira/browse/HADOOP-3910) | Are ClusterTestDFSNamespaceLogging and ClusterTestDFS still valid tests? | Minor | test | Tsz Wo Nicholas Sze | Tsz Wo Nicholas Sze |
| [HADOOP-3907](https://issues.apache.org/jira/browse/HADOOP-3907) | INodeDirectoryWithQuota should be in its own .java file | Minor | . | Steve Loughran | Tsz Wo Nicholas Sze |
| [HADOOP-3904](https://issues.apache.org/jira/browse/HADOOP-3904) | A few tests still using old hdfs package name | Minor | test | Tsz Wo Nicholas Sze | Tsz Wo Nicholas Sze |
| [HADOOP-3875](https://issues.apache.org/jira/browse/HADOOP-3875) | Fix TaskTracker's heartbeat timer to note the time the heartbeat RPC returned to decide next heartbeat time | Major | . | Arun C Murthy | Arun C Murthy |
| [HADOOP-3864](https://issues.apache.org/jira/browse/HADOOP-3864) | JobTracker lockup due to JobInProgress.initTasks taking significant time for large jobs on large clusters | Critical | . | Arun C Murthy | Arun C Murthy |
| [HADOOP-3863](https://issues.apache.org/jira/browse/HADOOP-3863) | Use a thread-local rather than static ENCODER/DECODER variables in Text for synchronization | Critical | . | Arun C Murthy | Arun C Murthy |
| [HADOOP-3851](https://issues.apache.org/jira/browse/HADOOP-3851) | spelling error in FSNamesystemMetrics log message | Trivial | . | Steve Loughran | Steve Loughran |
| [HADOOP-3848](https://issues.apache.org/jira/browse/HADOOP-3848) | TaskTracker.localizeJob calls getSystemDir for each task rather than caching it | Major | . | Arun C Murthy | Arun C Murthy |
| [HADOOP-3846](https://issues.apache.org/jira/browse/HADOOP-3846) | CreateEditsLog used for benchmark misses creating parent directories | Minor | benchmarks | Lohit Vijayarenu | Lohit Vijayarenu |
| [HADOOP-3836](https://issues.apache.org/jira/browse/HADOOP-3836) | TestMultipleOutputs will fail if it is run more than once | Major | test | Tsz Wo Nicholas Sze | Alejandro Abdelnur |
| [HADOOP-3831](https://issues.apache.org/jira/browse/HADOOP-3831) | slow-reading dfs clients do not recover from datanode-write-timeouts | Major | . | Christian Kunz | Raghu Angadi |
| [HADOOP-3820](https://issues.apache.org/jira/browse/HADOOP-3820) | gridmix-env has a syntax error, and wrongly defines USE\_REAL\_DATASET by default | Major | benchmarks | Arun C Murthy | Arun C Murthy |
| [HADOOP-3819](https://issues.apache.org/jira/browse/HADOOP-3819) | can not get svn revision # at build time if locale is not english | Minor | build | Rong-En Fan | Rong-En Fan |
| [HADOOP-3816](https://issues.apache.org/jira/browse/HADOOP-3816) | KFS changes for faster directory listing | Minor | fs | Sriram Rao | Sriram Rao |
| [HADOOP-3814](https://issues.apache.org/jira/browse/HADOOP-3814) | [HOD] Remove dfs.client.buffer.dir generation, as this is removed in Hadoop 0.19. | Blocker | contrib/hod | Hemanth Yamijala | Vinod Kumar Vavilapalli |
| [HADOOP-3795](https://issues.apache.org/jira/browse/HADOOP-3795) | NameNode does not save image if different dfs.name.dir have different checkpoint stamps | Major | . | Lohit Vijayarenu | Lohit Vijayarenu |
| [HADOOP-3785](https://issues.apache.org/jira/browse/HADOOP-3785) | FileSystem cache should be case-insensitive | Major | fs | Doug Cutting | Bill de hOra |
| [HADOOP-3783](https://issues.apache.org/jira/browse/HADOOP-3783) | "deprecated filesystem name" warning on EC2 | Minor | contrib/cloud | Stuart Sierra | Tom White |
| [HADOOP-3778](https://issues.apache.org/jira/browse/HADOOP-3778) | seek(long) in DFSInputStream should catch socket exception for retry later | Minor | . | Luo Ning | Luo Ning |
| [HADOOP-3777](https://issues.apache.org/jira/browse/HADOOP-3777) | Failure to load native lzo libraries causes job failure | Major | . | Chris Douglas | Chris Douglas |
| [HADOOP-3776](https://issues.apache.org/jira/browse/HADOOP-3776) | NPE in NameNode with unknown blocks | Blocker | . | Raghu Angadi | Raghu Angadi |
| [HADOOP-3773](https://issues.apache.org/jira/browse/HADOOP-3773) | Setting the conf twice in Pipes Submitter | Trivial | . | Koji Noguchi | Koji Noguchi |
| [HADOOP-3756](https://issues.apache.org/jira/browse/HADOOP-3756) | dfs.client.buffer.dir isn't used in hdfs, but it's still in conf/hadoop-default.xml | Trivial | . | Michael Bieniosek | Raghu Angadi |
| [HADOOP-3732](https://issues.apache.org/jira/browse/HADOOP-3732) | Block scanner should read block information during initialization. | Blocker | . | Konstantin Shvachko | Raghu Angadi |
| [HADOOP-3728](https://issues.apache.org/jira/browse/HADOOP-3728) | Cannot run more than one instance of examples.SleepJob at the same time. | Minor | . | Brice Arnould | Brice Arnould |
| [HADOOP-3726](https://issues.apache.org/jira/browse/HADOOP-3726) | TestCLI loses exception details on setup/teardown | Minor | test | Steve Loughran | Steve Loughran |
| [HADOOP-3725](https://issues.apache.org/jira/browse/HADOOP-3725) | TestMiniMRMapRedDebugScript loses exception details | Minor | test | Steve Loughran | Steve Loughran |
| [HADOOP-3723](https://issues.apache.org/jira/browse/HADOOP-3723) | libhdfs only accepts O\_WRONLY and O\_RDONLY so does not accept things like O\_WRONLY \| O\_CREAT | Minor | . | Pete Wyckoff | Pi Song |
| [HADOOP-3720](https://issues.apache.org/jira/browse/HADOOP-3720) | dfsadmin -refreshNodes should re-read the config file. | Major | . | Lohit Vijayarenu | Lohit Vijayarenu |
| [HADOOP-3711](https://issues.apache.org/jira/browse/HADOOP-3711) | Streaming input is not parsed properly to find the separator | Major | . | Amareshwari Sriramadasu | Amareshwari Sriramadasu |
| [HADOOP-3705](https://issues.apache.org/jira/browse/HADOOP-3705) | CompositeInputFormat is unable to parse InputFormat classes with names containing '\_' or '$' | Major | . | Jingkei Ly | Chris Douglas |
| [HADOOP-3658](https://issues.apache.org/jira/browse/HADOOP-3658) | Incorrect destination IP logged for receiving blocks | Minor | . | Koji Noguchi | Chris Douglas |
| [HADOOP-3643](https://issues.apache.org/jira/browse/HADOOP-3643) | jobtasks.jsp when called for running tasks should ignore completed TIPs | Major | . | Amar Kamat | Amar Kamat |
| [HADOOP-3640](https://issues.apache.org/jira/browse/HADOOP-3640) | NativeS3FsInputStream read() method for reading a single byte is incorrect | Major | fs/s3 | Tom White | Tom White |
| [HADOOP-3623](https://issues.apache.org/jira/browse/HADOOP-3623) | LeaseManager needs refactoring. | Major | . | Konstantin Shvachko | Tsz Wo Nicholas Sze |
| [HADOOP-3592](https://issues.apache.org/jira/browse/HADOOP-3592) | org.apache.hadoop.fs.FileUtil.copy() will leak input streams if the destination can't be opened | Minor | fs | Steve Loughran | Bill de hOra |
| [HADOOP-3570](https://issues.apache.org/jira/browse/HADOOP-3570) | Including user specified jar files in the client side classpath path in Hadoop 0.17 streaming | Major | . | Suhas Gogate | Sharad Agarwal |
| [HADOOP-3560](https://issues.apache.org/jira/browse/HADOOP-3560) | Archives sometimes create empty part files. | Blocker | . | Mahadev konar | Mahadev konar |
| [HADOOP-3543](https://issues.apache.org/jira/browse/HADOOP-3543) | Need to increment the year field for the copyright notice | Trivial | documentation | Chris Douglas | Chris Douglas |
| [HADOOP-3542](https://issues.apache.org/jira/browse/HADOOP-3542) | Hadoop archives should not create \_logs file in the final archive directory. | Blocker | . | Mahadev konar | Mahadev konar |
| [HADOOP-3528](https://issues.apache.org/jira/browse/HADOOP-3528) | Metrics FilesCreated and files\_deleted metrics do not match. | Blocker | metrics | Lohit Vijayarenu | Lohit Vijayarenu |
| [HADOOP-3506](https://issues.apache.org/jira/browse/HADOOP-3506) | Occasional NPE in Jets3tFileSystemStore | Major | fs/s3 | Robert | Tom White |
| [HADOOP-3488](https://issues.apache.org/jira/browse/HADOOP-3488) | the rsync command in hadoop-daemon.sh also rsyncs the logs folder from the master, which deletes the datanode / tasktracker log files. | Critical | scripts | Stefan Groschupf | Craig Macdonald |
| [HADOOP-3319](https://issues.apache.org/jira/browse/HADOOP-3319) | [HOD]checknodes prints errors messages on stdout | Major | contrib/hod | Vinod Kumar Vavilapalli | Vinod Kumar Vavilapalli |
| [HADOOP-3155](https://issues.apache.org/jira/browse/HADOOP-3155) | reducers stuck at shuffling | Blocker | . | Runping Qi | dhruba borthakur |
| [HADOOP-3131](https://issues.apache.org/jira/browse/HADOOP-3131) | enabling BLOCK compression for map outputs breaks the reduce progress counters | Major | . | Colin Evans | Matei Zaharia |
| [HADOOP-3076](https://issues.apache.org/jira/browse/HADOOP-3076) | [HOD] If a cluster directory is specified as a relative path, an existing script.exitcode file will not be deleted. | Blocker | contrib/hod | Hemanth Yamijala | Vinod Kumar Vavilapalli |
| [HADOOP-2403](https://issues.apache.org/jira/browse/HADOOP-2403) | JobHistory log files contain data that cannot be parsed by org.apache.hadoop.mapred.JobHistory | Critical | . | Runping Qi | Amareshwari Sriramadasu |
| [HADOOP-2168](https://issues.apache.org/jira/browse/HADOOP-2168) | Pipes with a C++ record reader does not update progress in the map until it is 100% | Major | . | Owen O'Malley | Arun C Murthy |
| [HADOOP-1945](https://issues.apache.org/jira/browse/HADOOP-1945) | pipes examples aren't in the release | Major | . | Owen O'Malley | Owen O'Malley |
| [HADOOP-9](https://issues.apache.org/jira/browse/HADOOP-9) | mapred.local.dir temp dir. space allocation limited by smallest area | Minor | . | Paul Baclace | Ari Rabkin |
### TESTS:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
| [HADOOP-4464](https://issues.apache.org/jira/browse/HADOOP-4464) | Separate testClientTriggeredLeaseRecovery() out from TestFileCreation | Blocker | test | Tsz Wo Nicholas Sze | Tsz Wo Nicholas Sze |
| [HADOOP-4426](https://issues.apache.org/jira/browse/HADOOP-4426) | TestCapacityScheduler is broken | Blocker | . | Hemanth Yamijala | Hemanth Yamijala |
| [HADOOP-4390](https://issues.apache.org/jira/browse/HADOOP-4390) | Hive: test for case sensitivity in serde2 thrift serde | Minor | . | Zheng Shao | |
| [HADOOP-4259](https://issues.apache.org/jira/browse/HADOOP-4259) | findbugs should run over the tools.jar also | Minor | test | Owen O'Malley | Chris Douglas |
| [HADOOP-4237](https://issues.apache.org/jira/browse/HADOOP-4237) | TestStreamingBadRecords.testNarrowDown fails intermittently | Minor | test | Sharad Agarwal | Sharad Agarwal |
| [HADOOP-4069](https://issues.apache.org/jira/browse/HADOOP-4069) | TestKosmosFileSystem can fail when run through ant test on systems shared by users | Minor | fs | Hemanth Yamijala | Lohit Vijayarenu |
| [HADOOP-4056](https://issues.apache.org/jira/browse/HADOOP-4056) | Unit test for DynamicSerDe | Minor | . | Pete Wyckoff | Pete Wyckoff |
| [HADOOP-3966](https://issues.apache.org/jira/browse/HADOOP-3966) | Place the new findbugs warnings introduced by the patch in the /tmp directory when "ant test-patch" is run. | Minor | test | Ramya Sunil | Ramya Sunil |
| [HADOOP-3790](https://issues.apache.org/jira/browse/HADOOP-3790) | Add more unit tests to test appending to files in HDFS | Blocker | test | dhruba borthakur | Tsz Wo Nicholas Sze |
| [HADOOP-3587](https://issues.apache.org/jira/browse/HADOOP-3587) | contrib/data\_join needs unit tests | Major | test | Chris Douglas | Chris Douglas |
### SUB-TASKS:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
| [HADOOP-4354](https://issues.apache.org/jira/browse/HADOOP-4354) | Separate TestDatanodeDeath.testDatanodeDeath() into 4 tests | Blocker | test | Tsz Wo Nicholas Sze | Tsz Wo Nicholas Sze |
| [HADOOP-4186](https://issues.apache.org/jira/browse/HADOOP-4186) | Move LineRecordReader.LineReader class to util package | Major | . | Tom White | Tom White |
| [HADOOP-4184](https://issues.apache.org/jira/browse/HADOOP-4184) | Fix simple module dependencies between core, hdfs and mapred | Major | . | Tom White | Tom White |
| [HADOOP-3824](https://issues.apache.org/jira/browse/HADOOP-3824) | Refactor org.apache.hadoop.mapred.StatusHttpServer | Major | . | Tsz Wo Nicholas Sze | Tsz Wo Nicholas Sze |
### OTHER:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
| [HADOOP-3601](https://issues.apache.org/jira/browse/HADOOP-3601) | Hive as a contrib project | Minor | . | Joydeep Sen Sarma | Ashish Thusoo |
| [HADOOP-4105](https://issues.apache.org/jira/browse/HADOOP-4105) | libhdfs wiki is very out-of-date and contains mostly broken links | Minor | documentation | Pete Wyckoff | Pete Wyckoff |
| [HADOOP-4076](https://issues.apache.org/jira/browse/HADOOP-4076) | fuse-dfs README lists wrong ant flags and is not specific in some places | Major | . | Pete Wyckoff | Pete Wyckoff |
| [HADOOP-3942](https://issues.apache.org/jira/browse/HADOOP-3942) | Update DistCp documentation | Blocker | documentation | Tsz Wo Nicholas Sze | Tsz Wo Nicholas Sze |
| [HADOOP-3791](https://issues.apache.org/jira/browse/HADOOP-3791) | Use generics in ReflectionUtils | Trivial | . | Chris Smith | Chris Smith |

View File

@ -0,0 +1,529 @@
<!---
# Licensed to the Apache Software Foundation (ASF) under one
# or more contributor license agreements. See the NOTICE file
# distributed with this work for additional information
# regarding copyright ownership. The ASF licenses this file
# to you under the Apache License, Version 2.0 (the
# "License"); you may not use this file except in compliance
# with the License. You may obtain a copy of the License at
#
# http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing, software
# distributed under the License is distributed on an "AS IS" BASIS,
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# limitations under the License.
-->
# Apache Hadoop 0.19.0 Release Notes
These release notes cover new developer and user-facing incompatibilities, features, and major improvements.
---
* [HADOOP-4466](https://issues.apache.org/jira/browse/HADOOP-4466) | *Blocker* | **SequenceFileOutputFormat is coupled to WritableComparable and Writable**
Ensure that SequenceFileOutputFormat isn't tied to Writables and can be used with other Serialization frameworks.
---
* [HADOOP-4433](https://issues.apache.org/jira/browse/HADOOP-4433) | *Major* | **Improve data loader for collecting metrics and log files from hadoop and system**
- Added startup and shutdown script
- Added torque metrics data loader
- Improved handling of Exec Plugin
- Added test cases for File Tailing Adaptors
- Added test cases for starting streaming at a specific offset
---
* [HADOOP-4430](https://issues.apache.org/jira/browse/HADOOP-4430) | *Blocker* | **Namenode Web UI capacity report is inconsistent with Balancer**
Changed reporting in the NameNode Web UI to more closely reflect the behavior of the re-balancer. Removed the no-longer-used config parameter dfs.datanode.du.pct from hadoop-default.xml.
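
Since dfs.datanode.du.pct is gone from hadoop-default.xml, a site configuration that still sets it may be worth cleaning up. The following is a minimal sketch, not part of Hadoop, that scans a configuration document for that property. It assumes the usual `<configuration>`/`<property>`/`<name>` layout of Hadoop config files; the function name and the sample document are hypothetical.

```python
import xml.etree.ElementTree as ET

# Property named in the note above as removed from hadoop-default.xml.
REMOVED_PROPERTIES = {"dfs.datanode.du.pct"}


def find_removed_properties(config_xml):
    """Return names of removed properties still set in a Hadoop config document."""
    root = ET.fromstring(config_xml)
    found = []
    for prop in root.iter("property"):
        # findtext returns None when <name> is missing, so fall back to "".
        name = (prop.findtext("name") or "").strip()
        if name in REMOVED_PROPERTIES:
            found.append(name)
    return found


# Hypothetical site configuration used only for illustration.
sample = """<configuration>
  <property>
    <name>dfs.datanode.du.pct</name>
    <value>0.98f</value>
  </property>
  <property>
    <name>dfs.replication</name>
    <value>3</value>
  </property>
</configuration>"""

print(find_removed_properties(sample))
```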
---
* [HADOOP-4293](https://issues.apache.org/jira/browse/HADOOP-4293) | *Major* | **Remove WritableJobConf**
Made Configuration Writable and renamed the old write method to writeXml.
---
* [HADOOP-4281](https://issues.apache.org/jira/browse/HADOOP-4281) | *Blocker* | **Capacity reported in some of the commands is not consistent with the Web UI reported data**
Changed command "hadoop dfsadmin -report" to be consistent with Web UI for both Namenode and Datanode reports. "Total raw bytes" is changed to "Configured Capacity". "Present Capacity" is newly added to indicate the present capacity of the DFS. "Remaining raw bytes" is changed to "DFS Remaining". "Used raw bytes" is changed to "DFS Used". "% used" is changed to "DFS Used%". Applications that parse command output should be reviewed.
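
For scripts that parse this report, the renames listed above can be captured in a small lookup table. The following is a minimal Python sketch, assuming each report line has a `Label: value` shape; that line layout and the helper name `normalize_report_line` are illustrative assumptions and are not taken from the Hadoop source.

```python
# Old -> new labels, taken from the renames listed in this release note.
RENAMED_LABELS = {
    "Total raw bytes": "Configured Capacity",
    "Remaining raw bytes": "DFS Remaining",
    "Used raw bytes": "DFS Used",
    "% used": "DFS Used%",
}


def normalize_report_line(line):
    """Return (label, value) with pre-0.19 labels mapped to their new names.

    Assumes a hypothetical "Label: value" layout; lines without a colon
    yield None so callers can skip them.
    """
    if ":" not in line:
        return None
    label, value = line.split(":", 1)
    label = label.strip()
    return RENAMED_LABELS.get(label, label), value.strip()
```

A parser written this way accepts output from either side of the change, which may help while clusters are being upgraded.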
---
* [HADOOP-4227](https://issues.apache.org/jira/browse/HADOOP-4227) | *Minor* | **Remove the deprecated, unused class ShellCommand.**
Removed the deprecated class org.apache.hadoop.fs.ShellCommand.
---
* [HADOOP-4205](https://issues.apache.org/jira/browse/HADOOP-4205) | *Major* | **[Hive] metastore and ql to use the refactored SerDe library**
Improved Hive metastore and ql to use the refactored SerDe library.
---
* [HADOOP-4190](https://issues.apache.org/jira/browse/HADOOP-4190) | *Blocker* | **Changes to JobHistory makes it backward incompatible**
Changed job history format to add a dot at end of each line.
---
* [HADOOP-4176](https://issues.apache.org/jira/browse/HADOOP-4176) | *Major* | **Implement getFileChecksum(Path) in HftpFileSystem**
Implemented getFileChecksum(Path) in HftpFileSystem for distcp support.
---
* [HADOOP-4138](https://issues.apache.org/jira/browse/HADOOP-4138) | *Major* | **[Hive] refactor the SerDe library**
Introduced new SerDe library for src/contrib/hive.
---
* [HADOOP-4117](https://issues.apache.org/jira/browse/HADOOP-4117) | *Major* | **Improve configurability of Hadoop EC2 instances**
Changed scripts to pass initialization script for EC2 instances at boot time (as EC2 user data) rather than embedding initialization information in the EC2 image. This change makes it easy to customize the hadoop-site.xml file for your cluster before launch, by editing the hadoop-ec2-init-remote.sh script, or by setting the environment variable USER\_DATA\_FILE in hadoop-ec2-env.sh to run a script of your choice.
---
* [HADOOP-4116](https://issues.apache.org/jira/browse/HADOOP-4116) | *Blocker* | **Balancer should provide better resource management**
Changed the DataNode protocol version. Clients are not otherwise affected, but must use the current version of the client application.
---
* [HADOOP-4106](https://issues.apache.org/jira/browse/HADOOP-4106) | *Major* | **add time, permission and user attribute support to fuse-dfs**
Added time, permission and user attribute support to libhdfs.
---
* [HADOOP-4086](https://issues.apache.org/jira/browse/HADOOP-4086) | *Major* | **Add limit to Hive QL**
Added LIMIT to Hive query language.
---
* [HADOOP-4084](https://issues.apache.org/jira/browse/HADOOP-4084) | *Major* | **Add explain plan capabilities to Hive QL**
Introduced "EXPLAIN" plan for Hive.
---
* [HADOOP-4018](https://issues.apache.org/jira/browse/HADOOP-4018) | *Major* | **limit memory usage in jobtracker**
Introduced new configuration parameter mapred.max.tasks.per.job to specify the maximum number of tasks per job.
---
* [HADOOP-3992](https://issues.apache.org/jira/browse/HADOOP-3992) | *Major* | **Synthetic Load Generator for NameNode testing**
Added a synthetic load generation facility to the test directory.
---
* [HADOOP-3981](https://issues.apache.org/jira/browse/HADOOP-3981) | *Major* | **Need a distributed file checksum algorithm for HDFS**
Implemented MD5-of-xxxMD5-of-yyyCRC32 which is a distributed file checksum algorithm for HDFS, where xxx is the number of CRCs per block and yyy is the number of bytes per CRC.
Changed DistCp to use file checksum for comparing files if both source and destination FileSystem(s) support getFileChecksum(...).
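
The layering in the algorithm name can be illustrated with a short Python sketch: a CRC32 per chunk, an MD5 over each block's CRCs, and an MD5 over the block digests. This is a sketch of the idea only, not the HDFS implementation; the function names, the default sizes, and the big-endian packing are assumptions, and the resulting digests should not be expected to match those produced by HDFS.

```python
import hashlib
import struct
import zlib


def composite_checksum(data, bytes_per_crc=512, crcs_per_block=4):
    """MD5 over per-block MD5s, each computed over that block's CRC32 values.

    The default sizes are toy values chosen for illustration.
    """
    block_size = bytes_per_crc * crcs_per_block
    block_digests = []
    for block_start in range(0, len(data), block_size):
        block = data[block_start:block_start + block_size]
        # One CRC32 per bytes_per_crc chunk, packed as big-endian 32-bit ints.
        crcs = b"".join(
            struct.pack(">I", zlib.crc32(block[i:i + bytes_per_crc]) & 0xFFFFFFFF)
            for i in range(0, len(block), bytes_per_crc)
        )
        block_digests.append(hashlib.md5(crcs).digest())
    return hashlib.md5(b"".join(block_digests)).hexdigest()


def algorithm_name(bytes_per_crc=512, crcs_per_block=4):
    """Name in the MD5-of-xxxMD5-of-yyyCRC32 pattern from the release note."""
    return "MD5-of-%dMD5-of-%dCRC32" % (crcs_per_block, bytes_per_crc)
```

Because each block digest depends only on that block's CRCs, the per-block step can in principle be computed wherever the block is stored, leaving only the final combination to the caller.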
---
* [HADOOP-3970](https://issues.apache.org/jira/browse/HADOOP-3970) | *Major* | **Counters written to the job history cannot be recovered back**
Added getEscapedCompactString() and fromEscapedCompactString() to Counters.java to represent counters as Strings and to reconstruct the counters from the Strings.
---
* [HADOOP-3963](https://issues.apache.org/jira/browse/HADOOP-3963) | *Minor* | **libhdfs should never exit on its own but rather return errors to the calling application**
Modified libhdfs to return NULL or error code when unrecoverable error occurs rather than exiting itself.
---
* [HADOOP-3941](https://issues.apache.org/jira/browse/HADOOP-3941) | *Major* | **Extend FileSystem API to return file-checksums/file-digests**
Added new FileSystem APIs: FileChecksum and FileSystem.getFileChecksum(Path).
---
* [HADOOP-3939](https://issues.apache.org/jira/browse/HADOOP-3939) | *Major* | **DistCp should support an option for deleting non-existing files.**
Added a new option -delete to DistCp so that files/directories that exist in dst but not in src are deleted. It uses FsShell to perform the delete, so trash is used if it is enabled.
---
* [HADOOP-3938](https://issues.apache.org/jira/browse/HADOOP-3938) | *Major* | **Quotas for disk space management**
Introduced byte space quotas for directories. The count shell command was modified to report both name and byte quotas.
---
* [HADOOP-3930](https://issues.apache.org/jira/browse/HADOOP-3930) | *Major* | **Decide how to integrate scheduler info into CLI and job tracker web page**
Changed TaskScheduler to expose API for Web UI and Command Line Tool.
---
* [HADOOP-3911](https://issues.apache.org/jira/browse/HADOOP-3911) | *Minor* | **' -blocks ' option not being recognized**
Added a check to fsck options to make sure -files is not the first option so as to resolve conflicts with GenericOptionsParser.
---
* [HADOOP-3908](https://issues.apache.org/jira/browse/HADOOP-3908) | *Minor* | **Better error message if llibhdfs.so doesn't exist**
Improved the fuse-dfs error message shown when libhdfs.so doesn't exist.
---
* [HADOOP-3889](https://issues.apache.org/jira/browse/HADOOP-3889) | *Minor* | **distcp: Better Error Message should be thrown when accessing source files/directory with no read permission**
Changed DistCp error messages when there is a RemoteException. Changed the corresponding return value from -999 to -3.
---
* [HADOOP-3873](https://issues.apache.org/jira/browse/HADOOP-3873) | *Major* | **DistCp should have an option for limiting the number of files/bytes being copied**
Added two new options -filelimit \<n\> and -sizelimit \<n\> to DistCp for limiting the total number of files and the total size in bytes, respectively.
---
* [HADOOP-3854](https://issues.apache.org/jira/browse/HADOOP-3854) | *Major* | **org.apache.hadoop.http.HttpServer should support user configurable filter**
Added a configuration property hadoop.http.filter.initializers and a class org.apache.hadoop.http.FilterInitializer to support servlet filters. Cluster administrators can configure customized filters for their web site.
---
* [HADOOP-3837](https://issues.apache.org/jira/browse/HADOOP-3837) | *Major* | **hadop streaming does not use progress reporting to detect hung tasks**
Changed streaming tasks to adhere to task timeout value specified in the job configuration.
---
* [HADOOP-3829](https://issues.apache.org/jira/browse/HADOOP-3829) | *Major* | **Narrown down skipped records based on user acceptable value**
Introduced a new configuration setting, set via org.apache.hadoop.mapred.SkipBadRecords.setMapperMaxSkipRecords, for the range of records to be skipped in the neighborhood of a failed record.
---
* [HADOOP-3828](https://issues.apache.org/jira/browse/HADOOP-3828) | *Major* | **Write skipped records' bytes to DFS**
Skipped records can optionally be written to HDFS. Refer to org.apache.hadoop.mapred.SkipBadRecords.setSkipOutputPath for setting the output path.
---
* [HADOOP-3796](https://issues.apache.org/jira/browse/HADOOP-3796) | *Major* | **fuse-dfs should take rw,ro,trashon,trashoff,protected=blah mount arguments rather than them being compiled in**
Changed Fuse configuration to use mount options.
---
* [HADOOP-3792](https://issues.apache.org/jira/browse/HADOOP-3792) | *Minor* | **exit code from "hadoop dfs -test ..." is wrong for Unix shell**
Changed exit code from hadoop.fs.FsShell -test to match the usual Unix convention.
---
* [HADOOP-3746](https://issues.apache.org/jira/browse/HADOOP-3746) | *Minor* | **A fair sharing job scheduler**
Introduced Fair Scheduler.
---
* [HADOOP-3730](https://issues.apache.org/jira/browse/HADOOP-3730) | *Major* | **add new JobConf constructor that disables loading default configurations**
Added a JobConf constructor that disables loading default configurations so as to take all default values from the JobTracker's configuration.
---
* [HADOOP-3722](https://issues.apache.org/jira/browse/HADOOP-3722) | *Minor* | **Provide a unified way to pass jobconf options from bin/hadoop**
Changed streaming StreamJob and Submitter to implement Tool and Configurable, and to use GenericOptionsParser arguments -fs, -jt, -conf, -D, -libjars, -files, and -archives. Deprecated -jobconf, -cacheArchive, -dfs, and -additionalconfspec from streaming and pipes in favor of the generic options. Removed -config, -mapred.job.tracker, and -cluster from streaming.
---
* [HADOOP-3719](https://issues.apache.org/jira/browse/HADOOP-3719) | *Major* | **Chukwa**
Introduced Chukwa data collection and analysis framework.
---
* [HADOOP-3714](https://issues.apache.org/jira/browse/HADOOP-3714) | *Trivial* | **Bash tab completion support**
Added a new contrib, bash-tab-completion, which enables bash tab completion for the bin/hadoop script. See the README file in the contrib directory for installation instructions.
---
* [HADOOP-3702](https://issues.apache.org/jira/browse/HADOOP-3702) | *Major* | **add support for chaining Maps in a single Map and after a Reduce [M\*/RM\*]**
Introduced ChainMapper and the ChainReducer classes to allow composing chains of Maps and Reduces in a single Map/Reduce job, something like MAP+ REDUCE MAP*.
---
* [HADOOP-3695](https://issues.apache.org/jira/browse/HADOOP-3695) | *Major* | **[HOD] Have an ability to run multiple slaves per node**
Added an ability in HOD to start multiple workers (TaskTrackers and/or DataNodes) per node to assist testing and simulation of scale. A configuration variable ringmaster.workers\_per\_ring was added to specify the number of workers to start.
---
* [HADOOP-3684](https://issues.apache.org/jira/browse/HADOOP-3684) | *Major* | **The data\_join should allow the user to implement a customer cloning function**
Allowed users to override the clone function in a subclass of the TaggedMapOutput class.
---
* [HADOOP-3667](https://issues.apache.org/jira/browse/HADOOP-3667) | *Major* | **Remove deprecated methods in JobConf**
Removed the following deprecated methods from JobConf:
- addInputPath(Path)
- getInputPaths()
- getMapOutputCompressionType()
- getOutputPath()
- getSystemDir()
- setInputPath(Path)
- setMapOutputCompressionType(CompressionType style)
- setOutputPath(Path)
---
* [HADOOP-3652](https://issues.apache.org/jira/browse/HADOOP-3652) | *Major* | **Remove deprecated class OutputFormatBase**
Removed deprecated org.apache.hadoop.mapred.OutputFormatBase.
---
* [HADOOP-3646](https://issues.apache.org/jira/browse/HADOOP-3646) | *Major* | **Providing bzip2 as codec**
Introduced support for bzip2 compressed files.
---
* [HADOOP-3601](https://issues.apache.org/jira/browse/HADOOP-3601) | *Minor* | **Hive as a contrib project**
Introduced Hive Data Warehouse built on top of Hadoop that enables structuring Hadoop files as tables and partitions and allows users to query this data through a SQL like language using a command line interface.
---
* [HADOOP-3595](https://issues.apache.org/jira/browse/HADOOP-3595) | *Major* | **Remove deprecated mapred.combine.once functionality**
Removed deprecated methods for mapred.combine.once functionality.
---
* [HADOOP-3585](https://issues.apache.org/jira/browse/HADOOP-3585) | *Minor* | **Hardware Failure Monitoring in large clusters running Hadoop/HDFS**
Added FailMon as a contrib project for hardware failure monitoring and analysis, under /src/contrib/failmon. Created User Manual and Quick Start Guide.
---
* [HADOOP-3581](https://issues.apache.org/jira/browse/HADOOP-3581) | *Major* | **Prevent memory intensive user tasks from taking down nodes**
Added the ability to kill process trees transgressing memory limits. TaskTracker (TT) uses the configuration parameters introduced in HADOOP-3759. In addition, mapred.tasktracker.taskmemorymanager.monitoring-interval specifies how long the TT waits between cycles of monitoring tasks' memory usage, and mapred.tasktracker.procfsbasedprocesstree.sleeptime-before-sigkill specifies how long the TT waits before sending a SIGKILL to a process tree that has overrun memory limits, after it has been sent a SIGTERM.
---
* [HADOOP-3549](https://issues.apache.org/jira/browse/HADOOP-3549) | *Major* | **meaningful errno values in libhdfs**
Improved error reporting for libhdfs so permission problems now return EACCES.
---
* [HADOOP-3498](https://issues.apache.org/jira/browse/HADOOP-3498) | *Major* | **File globbing alternation should be able to span path components**
Extended file globbing alternation to cross path components. For example, {/a/b,/c/d} expands to a path that matches the files /a/b and /c/d.
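
The expansion described above can be sketched in a few lines of Python. This is an illustrative stand-in rather than Hadoop's glob code; the function name `expand_alternation` is hypothetical, and treating unbalanced braces as literal text is an assumption made for the sketch.

```python
def expand_alternation(pattern):
    """Expand the first {a,b,...} group, then recurse on the results."""
    start = pattern.find("{")
    if start == -1:
        return [pattern]
    depth = 0
    options = []
    option_start = start + 1
    for i in range(start, len(pattern)):
        ch = pattern[i]
        if ch == "{":
            depth += 1
        elif ch == "," and depth == 1:
            # A top-level comma ends the current alternative.
            options.append(pattern[option_start:i])
            option_start = i + 1
        elif ch == "}":
            depth -= 1
            if depth == 0:
                options.append(pattern[option_start:i])
                prefix, suffix = pattern[:start], pattern[i + 1:]
                expanded = []
                for option in options:
                    expanded.extend(expand_alternation(prefix + option + suffix))
                return expanded
    # Unbalanced braces: treat the pattern as a literal (sketch assumption).
    return [pattern]
```

Since each alternative is substituted as plain text, it can contain `/` and so span several path components, which is the behavior this note introduces.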
---
* [HADOOP-3485](https://issues.apache.org/jira/browse/HADOOP-3485) | *Minor* | **fix writes**
Introduced write support for Fuse; requires Linux kernel 2.6.15 or later.
---
* [HADOOP-3478](https://issues.apache.org/jira/browse/HADOOP-3478) | *Major* | **The algorithm to decide map re-execution on fetch failures can be improved**
Changed reducers to fetch maps in the same order for a given host to speed up identification of the faulty maps; reducers still randomize the host selection to distribute load.
---
* [HADOOP-3445](https://issues.apache.org/jira/browse/HADOOP-3445) | *Major* | **Implementing core scheduler functionality in Resource Manager (V1) for Hadoop**
Introduced Capacity Task Scheduler.
---
* [HADOOP-3412](https://issues.apache.org/jira/browse/HADOOP-3412) | *Minor* | **Refactor the scheduler out of the JobTracker**
Added the ability to choose between many schedulers, and to limit the number of running tasks per job.
---
* [HADOOP-3245](https://issues.apache.org/jira/browse/HADOOP-3245) | *Major* | **Provide ability to persist running jobs (extend HADOOP-1876)**
Introduced recovery of jobs when JobTracker restarts. This facility is off by default. Introduced config parameters mapred.jobtracker.restart.recover, mapred.jobtracker.job.history.block.size, and mapred.jobtracker.job.history.buffer.size.
---
* [HADOOP-3150](https://issues.apache.org/jira/browse/HADOOP-3150) | *Major* | **Move task file promotion into the task**
Moved task file promotion to the Task. When the task has finished, it will do a commit and is declared SUCCEEDED. Job cleanup is done by a separate task. Job is declared SUCCEEDED/FAILED after the cleanup task has finished. Added public classes org.apache.hadoop.mapred.JobContext, TaskAttemptContext, OutputCommitter and FileOutputCommitter. Added public APIs: public OutputCommitter getOutputCommitter() and public void setOutputCommitter(Class\<? extends OutputCommitter\> theClass) in org.apache.hadoop.mapred.JobConf.
---
* [HADOOP-3149](https://issues.apache.org/jira/browse/HADOOP-3149) | *Major* | **supporting multiple outputs for M/R jobs**
Introduced MultipleOutputs class so Map/Reduce jobs can write data to different output files. Each output can use a different OutputFormat. Output files are created within the job output directory. FileOutputFormat.getPathForCustomFile() creates a filename under the outputdir that is named with the task ID and task type (e.g. myfile-r-00001).
---
* [HADOOP-3062](https://issues.apache.org/jira/browse/HADOOP-3062) | *Major* | **Need to capture the metrics for the network ios generate by dfs reads/writes and map/reduce shuffling and break them down by racks**
Introduced additional log records for data transfers.
---
* [HADOOP-3019](https://issues.apache.org/jira/browse/HADOOP-3019) | *Major* | **want input sampler & sorted partitioner**
Added a partitioner that effects a total order of output data, and an input sampler for generating the partition keyset for TotalOrderPartitioner for when the map's input keytype and distribution approximate its output.
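
The relationship between a key sample and a total-order partitioning can be sketched as follows. This Python sketch is not the TotalOrderPartitioner or InputSampler code; the function names are hypothetical, and it assumes a non-empty sample of mutually comparable keys.

```python
import bisect


def pick_split_points(sampled_keys, num_partitions):
    """Choose num_partitions - 1 evenly spaced keys from the sorted sample."""
    ordered = sorted(sampled_keys)
    step = len(ordered) / num_partitions
    return [ordered[int(step * i)] for i in range(1, num_partitions)]


def partition_for(key, split_points):
    """Index of the partition whose key range contains key."""
    return bisect.bisect_right(split_points, key)
```

Every key in partition i sorts no later than every key in partition i + 1, so concatenating the sorted partitions in index order gives a globally sorted result. How evenly the partitions fill depends on how well the sample reflects the real key distribution.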
---
* [HADOOP-2816](https://issues.apache.org/jira/browse/HADOOP-2816) | *Major* | **Cluster summary at name node web has confusing report for space utilization**
Improved space reporting for NameNode Web UI. Applications that parse the Web UI output should be reviewed.
---
* [HADOOP-2664](https://issues.apache.org/jira/browse/HADOOP-2664) | *Major* | **lzop-compatible CompresionCodec**
Introduced LZOP codec.
---
* [HADOOP-2411](https://issues.apache.org/jira/browse/HADOOP-2411) | *Major* | **Add support for larger EC2 instance types**
Added support for c1.* instance types and associated kernels for EC2.
---
* [HADOOP-2325](https://issues.apache.org/jira/browse/HADOOP-2325) | *Major* | **Require Java 6**
Hadoop now requires Java 6.
---
* [HADOOP-2302](https://issues.apache.org/jira/browse/HADOOP-2302) | *Major* | ** Streaming should provide an option for numerical sort of keys**
Introduced numerical key comparison for streaming.
---
* [HADOOP-1869](https://issues.apache.org/jira/browse/HADOOP-1869) | *Major* | **access times of HDFS files**
Added HDFS file access times. By default, access times will be precise to the most recent hour boundary. A configuration parameter dfs.access.time.precision (milliseconds) is used to control this precision. Setting a value of 0 will disable persisting access times for HDFS files.
---
* [HADOOP-1823](https://issues.apache.org/jira/browse/HADOOP-1823) | *Major* | **want InputFormat for bzip2 files**
bzip2 is provided as a codec in 0.19.0; see [HADOOP-3646](https://issues.apache.org/jira/browse/HADOOP-3646).
---
* [HADOOP-1700](https://issues.apache.org/jira/browse/HADOOP-1700) | *Major* | **Append to files in HDFS**
Introduced append operation for HDFS files.
---
* [HADOOP-249](https://issues.apache.org/jira/browse/HADOOP-249) | *Major* | **Improving Map -\> Reduce performance and Task JVM reuse**
Enabled task JVMs to be reused via the job config mapred.job.reuse.jvm.num.tasks.
---
* [HADOOP-153](https://issues.apache.org/jira/browse/HADOOP-153) | *Major* | **skip records that fail Task**
Introduced record skipping where tasks fail on certain records. (org.apache.hadoop.mapred.SkipBadRecords)


@ -0,0 +1,112 @@
<!---
# Licensed to the Apache Software Foundation (ASF) under one
# or more contributor license agreements. See the NOTICE file
# distributed with this work for additional information
# regarding copyright ownership. The ASF licenses this file
# to you under the Apache License, Version 2.0 (the
# "License"); you may not use this file except in compliance
# with the License. You may obtain a copy of the License at
#
# http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing, software
# distributed under the License is distributed on an "AS IS" BASIS,
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# limitations under the License.
-->
# Apache Hadoop Changelog
## Release 0.19.1 - 2009-02-24
### INCOMPATIBLE CHANGES:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
| [HADOOP-5225](https://issues.apache.org/jira/browse/HADOOP-5225) | workaround for tmp file handling on DataNodes in 0.19.1 (HADOOP-4663) | Blocker | . | Nigel Daley | Raghu Angadi |
| [HADOOP-5224](https://issues.apache.org/jira/browse/HADOOP-5224) | Disable append | Blocker | . | Nigel Daley | |
| [HADOOP-4061](https://issues.apache.org/jira/browse/HADOOP-4061) | Large number of decommission freezes the Namenode | Major | . | Koji Noguchi | Tsz Wo Nicholas Sze |
### NEW FEATURES:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
| [HADOOP-5034](https://issues.apache.org/jira/browse/HADOOP-5034) | NameNode should send both replication and deletion requests to DataNode in one reply to a heartbeat | Major | . | Hairong Kuang | Hairong Kuang |
### IMPROVEMENTS:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
| [HADOOP-5127](https://issues.apache.org/jira/browse/HADOOP-5127) | FSDirectory should not have public methods. | Major | . | Konstantin Shvachko | Jakob Homan |
| [HADOOP-5086](https://issues.apache.org/jira/browse/HADOOP-5086) | Trash URI semantics can be relaxed | Minor | fs | Chris Douglas | Chris Douglas |
| [HADOOP-4739](https://issues.apache.org/jira/browse/HADOOP-4739) | Minor enhancements to some sections of the Map/Reduce tutorial | Trivial | . | Vivek Ratan | Vivek Ratan |
| [HADOOP-3894](https://issues.apache.org/jira/browse/HADOOP-3894) | DFSClient chould log errors better, and provide better diagnostics | Trivial | . | Steve Loughran | Steve Loughran |
### BUG FIXES:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
| [HADOOP-5665](https://issues.apache.org/jira/browse/HADOOP-5665) | Namenode could not be formatted because the "whoami" program could not be run. | Major | . | Evelyn Sylvia | |
| [HADOOP-5268](https://issues.apache.org/jira/browse/HADOOP-5268) | Using MultipleOutputFormat and setting reducers to 0 causes org.apache.hadoop.hdfs.protocol.AlreadyBeingCreatedException and job to fail | Major | . | Thibaut | |
| [HADOOP-5193](https://issues.apache.org/jira/browse/HADOOP-5193) | SecondaryNameNode does not rollImage because of incorrect calculation of edits modification time. | Major | . | Konstantin Shvachko | Konstantin Shvachko |
| [HADOOP-5192](https://issues.apache.org/jira/browse/HADOOP-5192) | Block reciever should not remove a finalized block when block replication fails | Blocker | . | Hairong Kuang | Hairong Kuang |
| [HADOOP-5166](https://issues.apache.org/jira/browse/HADOOP-5166) | JobTracker fails to restart if recovery and ACLs are enabled | Blocker | . | Karam Singh | Amar Kamat |
| [HADOOP-5161](https://issues.apache.org/jira/browse/HADOOP-5161) | Accepted sockets do not get placed in DataXceiverServer#childSockets | Major | . | Hairong Kuang | Hairong Kuang |
| [HADOOP-5156](https://issues.apache.org/jira/browse/HADOOP-5156) | TestHeartbeatHandling uses MiniDFSCluster.getNamesystem() which does not exist in branch 0.20 | Major | test | Konstantin Shvachko | Hairong Kuang |
| [HADOOP-5134](https://issues.apache.org/jira/browse/HADOOP-5134) | FSNamesystem#commitBlockSynchronization adds under-construction block locations to blocksMap | Blocker | . | Hairong Kuang | dhruba borthakur |
| [HADOOP-5067](https://issues.apache.org/jira/browse/HADOOP-5067) | Failed/Killed attempts column in jobdetails.jsp does not show the number of failed/killed attempts correctly | Major | . | Amareshwari Sriramadasu | Amareshwari Sriramadasu |
| [HADOOP-5009](https://issues.apache.org/jira/browse/HADOOP-5009) | DataNode#shutdown sometimes leaves data block scanner verification log unclosed | Major | . | Hairong Kuang | Hairong Kuang |
| [HADOOP-5008](https://issues.apache.org/jira/browse/HADOOP-5008) | TestReplication#testPendingReplicationRetry leaves an opened fd unclosed | Major | test | Hairong Kuang | Hairong Kuang |
| [HADOOP-5002](https://issues.apache.org/jira/browse/HADOOP-5002) | 2 core tests TestFileOutputFormat and TestHarFileSystem are failing in branch 19 | Blocker | . | Ravi Gummadi | Amareshwari Sriramadasu |
| [HADOOP-4992](https://issues.apache.org/jira/browse/HADOOP-4992) | TestCustomOutputCommitter fails on hadoop-0.19 | Blocker | . | Amar Kamat | Amareshwari Sriramadasu |
| [HADOOP-4983](https://issues.apache.org/jira/browse/HADOOP-4983) | Job counters sometimes go down as tasks run without task failures | Critical | . | Owen O'Malley | Amareshwari Sriramadasu |
| [HADOOP-4982](https://issues.apache.org/jira/browse/HADOOP-4982) | TestFsck does not run in Eclipse. | Major | test | Konstantin Shvachko | Konstantin Shvachko |
| [HADOOP-4967](https://issues.apache.org/jira/browse/HADOOP-4967) | Inconsistent state in JVM manager | Major | . | Amareshwari Sriramadasu | Devaraj Das |
| [HADOOP-4966](https://issues.apache.org/jira/browse/HADOOP-4966) | Setup tasks are not removed from JobTracker's taskIdToTIPMap even after the job completes | Major | . | Amar Kamat | Amareshwari Sriramadasu |
| [HADOOP-4965](https://issues.apache.org/jira/browse/HADOOP-4965) | DFSClient should log instead of printing into std err. | Major | test | Konstantin Shvachko | Konstantin Shvachko |
| [HADOOP-4955](https://issues.apache.org/jira/browse/HADOOP-4955) | Make DBOutputFormat us column names from setOutput(...) | Major | . | Kevin Peterson | Kevin Peterson |
| [HADOOP-4943](https://issues.apache.org/jira/browse/HADOOP-4943) | fair share scheduler does not utilize all slots if the task trackers are configured heterogeneously | Major | . | Zheng Shao | Zheng Shao |
| [HADOOP-4924](https://issues.apache.org/jira/browse/HADOOP-4924) | Race condition in re-init of TaskTracker | Blocker | . | Devaraj Das | Devaraj Das |
| [HADOOP-4918](https://issues.apache.org/jira/browse/HADOOP-4918) | Fix bzip2 work with SequenceFile | Major | io | Zheng Shao | Zheng Shao |
| [HADOOP-4906](https://issues.apache.org/jira/browse/HADOOP-4906) | TaskTracker running out of memory after running several tasks | Blocker | . | Arun C Murthy | Sharad Agarwal |
| [HADOOP-4862](https://issues.apache.org/jira/browse/HADOOP-4862) | A spurious IOException log on DataNode is not completely removed | Blocker | . | Raghu Angadi | Raghu Angadi |
| [HADOOP-4847](https://issues.apache.org/jira/browse/HADOOP-4847) | OutputCommitter is loaded in the TaskTracker in localizeConfiguration | Blocker | . | Owen O'Malley | Amareshwari Sriramadasu |
| [HADOOP-4836](https://issues.apache.org/jira/browse/HADOOP-4836) | Minor typos in documentation and comments | Trivial | documentation | Jordà Polo | Jordà Polo |
| [HADOOP-4821](https://issues.apache.org/jira/browse/HADOOP-4821) | Usage description in the Quotas guide documentations are incorrect | Minor | documentation | Boris Shkolnik | Boris Shkolnik |
| [HADOOP-4797](https://issues.apache.org/jira/browse/HADOOP-4797) | RPC Server can leave a lot of direct buffers | Blocker | ipc | Raghu Angadi | Raghu Angadi |
| [HADOOP-4760](https://issues.apache.org/jira/browse/HADOOP-4760) | HDFS streams should not throw exceptions when closed twice | Major | fs, fs/s3 | Alejandro Abdelnur | Enis Soztutar |
| [HADOOP-4759](https://issues.apache.org/jira/browse/HADOOP-4759) | HADOOP-4654 to be fixed for branches \>= 0.19 | Major | . | Amareshwari Sriramadasu | Amareshwari Sriramadasu |
| [HADOOP-4731](https://issues.apache.org/jira/browse/HADOOP-4731) | Job is not removed from the waiting jobs queue upon completion. | Major | . | Hemanth Yamijala | Amar Kamat |
| [HADOOP-4727](https://issues.apache.org/jira/browse/HADOOP-4727) | Groups do not work for fuse-dfs out of the box on 0.19.0 | Blocker | . | Brian Bockelman | Brian Bockelman |
| [HADOOP-4720](https://issues.apache.org/jira/browse/HADOOP-4720) | docs/api does not contain the hdfs directory after building | Major | build | Ramya Sunil | |
| [HADOOP-4697](https://issues.apache.org/jira/browse/HADOOP-4697) | KFS::getBlockLocations() fails with files having multiple blocks | Major | fs | Lohit Vijayarenu | Sriram Rao |
| [HADOOP-4635](https://issues.apache.org/jira/browse/HADOOP-4635) | Memory leak ? | Blocker | . | Marc-Olivier Fleury | Pete Wyckoff |
| [HADOOP-4632](https://issues.apache.org/jira/browse/HADOOP-4632) | TestJobHistoryVersion should not create directory in current dir. | Major | . | Amareshwari Sriramadasu | Amar Kamat |
| [HADOOP-4616](https://issues.apache.org/jira/browse/HADOOP-4616) | assertion makes fuse-dfs exit when reading incomplete data | Blocker | . | Marc-Olivier Fleury | Pete Wyckoff |
| [HADOOP-4508](https://issues.apache.org/jira/browse/HADOOP-4508) | FSDataOutputStream.getPos() == 0when appending to existing file and should be file length | Major | fs | Pete Wyckoff | dhruba borthakur |
| [HADOOP-4494](https://issues.apache.org/jira/browse/HADOOP-4494) | libhdfs does not call FileSystem.append when O\_APPEND passed to hdfsOpenFile | Major | . | Pete Wyckoff | Pete Wyckoff |
| [HADOOP-4420](https://issues.apache.org/jira/browse/HADOOP-4420) | JobTracker.killJob() doesn't check for the JobID being valid | Minor | . | Steve Loughran | Aaron Kimball |
### TESTS:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
### SUB-TASKS:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
### OTHER:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |


@ -0,0 +1,101 @@
<!---
# Licensed to the Apache Software Foundation (ASF) under one
# or more contributor license agreements. See the NOTICE file
# distributed with this work for additional information
# regarding copyright ownership. The ASF licenses this file
# to you under the Apache License, Version 2.0 (the
# "License"); you may not use this file except in compliance
# with the License. You may obtain a copy of the License at
#
# http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing, software
# distributed under the License is distributed on an "AS IS" BASIS,
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# limitations under the License.
-->
# Apache Hadoop 0.19.1 Release Notes
These release notes cover new developer and user-facing incompatibilities, features, and major improvements.
---
* [HADOOP-5225](https://issues.apache.org/jira/browse/HADOOP-5225) | *Blocker* | **workaround for tmp file handling on DataNodes in 0.19.1 (HADOOP-4663)**
Workaround for tmp file handling. As a result, sync() does not work.
---
* [HADOOP-5224](https://issues.apache.org/jira/browse/HADOOP-5224) | *Blocker* | **Disable append**
HDFS append() is disabled. It throws UnsupportedOperationException.
---
* [HADOOP-5034](https://issues.apache.org/jira/browse/HADOOP-5034) | *Major* | **NameNode should send both replication and deletion requests to DataNode in one reply to a heartbeat**
This patch changes the DatanodeProtocol version number from 18 to 19. The patch allows NameNode to send both block replication and deletion requests to a DataNode in response to a heartbeat.
---
* [HADOOP-5002](https://issues.apache.org/jira/browse/HADOOP-5002) | *Blocker* | **2 core tests TestFileOutputFormat and TestHarFileSystem are failing in branch 19**
This patch solves the null pointer exception issue in the 2 core tests TestFileOutputFormat and TestHarFileSystem in branch 19.
---
* [HADOOP-4943](https://issues.apache.org/jira/browse/HADOOP-4943) | *Major* | **fair share scheduler does not utilize all slots if the task trackers are configured heterogeneously**
Fixed fair share scheduler to utilize all slots when the task trackers are configured heterogeneously.
---
* [HADOOP-4906](https://issues.apache.org/jira/browse/HADOOP-4906) | *Blocker* | **TaskTracker running out of memory after running several tasks**
Fixed TaskTracker out-of-memory (OOM) exceptions by sharing the job-level configuration properties across tasks of the same job. Previously, a new instance was held for each task.
---
* [HADOOP-4862](https://issues.apache.org/jira/browse/HADOOP-4862) | *Blocker* | **A spurious IOException log on DataNode is not completely removed**
Minor: HADOOP-3678 did not remove all cases of spurious IOExceptions logged by DataNode.
---
* [HADOOP-4797](https://issues.apache.org/jira/browse/HADOOP-4797) | *Blocker* | **RPC Server can leave a lot of direct buffers**
Improve how RPC server reads and writes large buffers. Avoids soft-leak of direct buffers and excess copies in NIO layer.
---
* [HADOOP-4635](https://issues.apache.org/jira/browse/HADOOP-4635) | *Blocker* | **Memory leak ?**
Fixed a memory leak of user/group information in fuse-dfs.
---
* [HADOOP-4494](https://issues.apache.org/jira/browse/HADOOP-4494) | *Major* | **libhdfs does not call FileSystem.append when O\_APPEND passed to hdfsOpenFile**
libhdfs supports the O\_APPEND flag.
---
* [HADOOP-4061](https://issues.apache.org/jira/browse/HADOOP-4061) | *Major* | **Large number of decommission freezes the Namenode**
Added a new conf property, dfs.namenode.decommission.nodes.per.interval, so that the NameNode checks the decommission status of x nodes every y seconds, where x is the value of dfs.namenode.decommission.nodes.per.interval and y is the value of dfs.namenode.decommission.interval.
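
The batching described above can be illustrated with a small Python sketch. This is not NameNode code: the function name `decommission_check_rounds` and the node names are hypothetical, and the sketch ignores the fact that a node still decommissioning would need to be rechecked later. It only shows how a large set of nodes is split into rounds of at most x nodes, with one round per interval of y seconds.

```python
from collections import deque


def decommission_check_rounds(nodes, nodes_per_interval):
    """Yield batches of at most `nodes_per_interval` nodes.

    `nodes_per_interval` plays the role of
    dfs.namenode.decommission.nodes.per.interval (x). One batch would be
    checked per dfs.namenode.decommission.interval (y) seconds.
    Hypothetical helper for illustration only.
    """
    if nodes_per_interval <= 0:
        raise ValueError("nodes_per_interval must be positive")
    pending = deque(nodes)
    while pending:
        batch_size = min(nodes_per_interval, len(pending))
        # Take the next slice of nodes; the remaining ones wait for a later round.
        yield [pending.popleft() for _ in range(batch_size)]


if __name__ == "__main__":
    for round_no, batch in enumerate(
        decommission_check_rounds(["dn1", "dn2", "dn3", "dn4", "dn5"], 2), start=1
    ):
        print(round_no, batch)
```

Under these assumptions, five decommissioning nodes with x = 2 are checked over three rounds, so no single pass has to examine every node at once.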

View File

@ -0,0 +1,104 @@
<!---
# Licensed to the Apache Software Foundation (ASF) under one
# or more contributor license agreements. See the NOTICE file
# distributed with this work for additional information
# regarding copyright ownership. The ASF licenses this file
# to you under the Apache License, Version 2.0 (the
# "License"); you may not use this file except in compliance
# with the License. You may obtain a copy of the License at
#
# http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing, software
# distributed under the License is distributed on an "AS IS" BASIS,
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# limitations under the License.
-->
# Apache Hadoop Changelog
## Release 0.19.2 - 2009-07-23
### INCOMPATIBLE CHANGES:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
| [HADOOP-5332](https://issues.apache.org/jira/browse/HADOOP-5332) | Make support for file append API configurable | Blocker | . | Nigel Daley | dhruba borthakur |
### NEW FEATURES:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
### IMPROVEMENTS:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
| [HADOOP-5379](https://issues.apache.org/jira/browse/HADOOP-5379) | Throw exception instead of writing to System.err when there is a CRC error on CBZip2InputStream | Minor | io | Rodrigo Schmidt | Rodrigo Schmidt |
### BUG FIXES:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
| [HADOOP-6017](https://issues.apache.org/jira/browse/HADOOP-6017) | NameNode and SecondaryNameNode fail to restart because of abnormal filenames. | Blocker | . | Raghu Angadi | Tsz Wo Nicholas Sze |
| [HADOOP-5951](https://issues.apache.org/jira/browse/HADOOP-5951) | StorageInfo needs Apache license header. | Major | . | Suresh Srinivas | Suresh Srinivas |
| [HADOOP-5816](https://issues.apache.org/jira/browse/HADOOP-5816) | ArrayIndexOutOfBoundsException when using KeyFieldBasedComparator | Minor | . | Min Zhou | He Yongqiang |
| [HADOOP-5728](https://issues.apache.org/jira/browse/HADOOP-5728) | FSEditLog.printStatistics may cause IndexOutOfBoundsException | Major | . | Wang Xu | Wang Xu |
| [HADOOP-5671](https://issues.apache.org/jira/browse/HADOOP-5671) | DistCp.sameFile(..) should return true if src fs does not support checksum | Major | . | Tsz Wo Nicholas Sze | Tsz Wo Nicholas Sze |
| [HADOOP-5644](https://issues.apache.org/jira/browse/HADOOP-5644) | Namenode is stuck in safe mode | Major | . | Suresh Srinivas | Suresh Srinivas |
| [HADOOP-5579](https://issues.apache.org/jira/browse/HADOOP-5579) | libhdfs does not set errno correctly | Major | . | Brian Bockelman | Brian Bockelman |
| [HADOOP-5557](https://issues.apache.org/jira/browse/HADOOP-5557) | Two minor problems in TestOverReplicatedBlocks | Minor | test | Tsz Wo Nicholas Sze | Tsz Wo Nicholas Sze |
| [HADOOP-5554](https://issues.apache.org/jira/browse/HADOOP-5554) | DataNodeCluster should create blocks with the same generation stamp as the blocks created in CreateEditsLog | Major | test | Hairong Kuang | Hairong Kuang |
| [HADOOP-5551](https://issues.apache.org/jira/browse/HADOOP-5551) | Namenode permits directory destruction on overwrite | Critical | . | Brian Bockelman | Brian Bockelman |
| [HADOOP-5549](https://issues.apache.org/jira/browse/HADOOP-5549) | ReplicationMonitor should schedule both replication and deletion work in one iteration | Major | . | Hairong Kuang | Hairong Kuang |
| [HADOOP-5522](https://issues.apache.org/jira/browse/HADOOP-5522) | Document job setup/cleanup tasks and task cleanup tasks in mapred tutorial | Blocker | . | Amareshwari Sriramadasu | Amareshwari Sriramadasu |
| [HADOOP-5479](https://issues.apache.org/jira/browse/HADOOP-5479) | NameNode should not send empty block replication request to DataNode | Critical | . | Hairong Kuang | Hairong Kuang |
| [HADOOP-5465](https://issues.apache.org/jira/browse/HADOOP-5465) | Blocks remain under-replicated | Blocker | . | Hairong Kuang | Hairong Kuang |
| [HADOOP-5449](https://issues.apache.org/jira/browse/HADOOP-5449) | Verify if JobHistory.HistoryCleaner works as expected | Blocker | . | Amar Kamat | Amareshwari Sriramadasu |
| [HADOOP-5446](https://issues.apache.org/jira/browse/HADOOP-5446) | TaskTracker metrics are disabled | Major | metrics | Chris Douglas | Chris Douglas |
| [HADOOP-5440](https://issues.apache.org/jira/browse/HADOOP-5440) | Successful taskid are not removed from TaskMemoryManager | Blocker | . | Amareshwari Sriramadasu | Amareshwari Sriramadasu |
| [HADOOP-5421](https://issues.apache.org/jira/browse/HADOOP-5421) | HADOOP-4638 has broken 0.19 compilation | Blocker | . | Amar Kamat | Devaraj Das |
| [HADOOP-5392](https://issues.apache.org/jira/browse/HADOOP-5392) | JobTracker crashes during recovery if job files are garbled | Blocker | . | Amar Kamat | Amar Kamat |
| [HADOOP-5384](https://issues.apache.org/jira/browse/HADOOP-5384) | DataNodeCluster should not create blocks with generationStamp == 1 | Blocker | test | Tsz Wo Nicholas Sze | Tsz Wo Nicholas Sze |
| [HADOOP-5376](https://issues.apache.org/jira/browse/HADOOP-5376) | JobInProgress.obtainTaskCleanupTask() throws an ArrayIndexOutOfBoundsException | Blocker | . | Vinod Kumar Vavilapalli | Amareshwari Sriramadasu |
| [HADOOP-5374](https://issues.apache.org/jira/browse/HADOOP-5374) | NPE in JobTracker.getTasksToSave() method | Major | . | Vinod Kumar Vavilapalli | Amareshwari Sriramadasu |
| [HADOOP-5333](https://issues.apache.org/jira/browse/HADOOP-5333) | The libhdfs append API is not coded correctly | Major | . | dhruba borthakur | dhruba borthakur |
| [HADOOP-5326](https://issues.apache.org/jira/browse/HADOOP-5326) | bzip2 codec (CBZip2OutputStream) creates corrupted output file for some inputs | Major | io | Rodrigo Schmidt | Rodrigo Schmidt |
| [HADOOP-5285](https://issues.apache.org/jira/browse/HADOOP-5285) | JobTracker hangs for long periods of time | Blocker | . | Vinod Kumar Vavilapalli | Devaraj Das |
| [HADOOP-5280](https://issues.apache.org/jira/browse/HADOOP-5280) | When expiring a lost launched task, JT doesn't remove the attempt from the taskidToTIPMap. | Blocker | . | Vinod Kumar Vavilapalli | Devaraj Das |
| [HADOOP-5269](https://issues.apache.org/jira/browse/HADOOP-5269) | TaskTracker.runningTasks holding FAILED\_UNCLEAN and KILLED\_UNCLEAN taskStatuses forever in some cases. | Blocker | . | Amareshwari Sriramadasu | Amareshwari Sriramadasu |
| [HADOOP-5247](https://issues.apache.org/jira/browse/HADOOP-5247) | NPEs in JobTracker and JobClient when mapred.jobtracker.completeuserjobs.maximum is set to zero. | Blocker | . | Vinod Kumar Vavilapalli | Amar Kamat |
| [HADOOP-5241](https://issues.apache.org/jira/browse/HADOOP-5241) | Reduce tasks get stuck because of over-estimated task size (regression from 0.18) | Blocker | . | Andy Pavlo | Sharad Agarwal |
| [HADOOP-5233](https://issues.apache.org/jira/browse/HADOOP-5233) | Reducer not Succeded after 100% | Blocker | . | Amareshwari Sriramadasu | Amareshwari Sriramadasu |
| [HADOOP-5231](https://issues.apache.org/jira/browse/HADOOP-5231) | Negative number of maps in cluster summary | Major | . | Amareshwari Sriramadasu | Amareshwari Sriramadasu |
| [HADOOP-5213](https://issues.apache.org/jira/browse/HADOOP-5213) | BZip2CompressionOutputStream NullPointerException | Blocker | io | Zheng Shao | Zheng Shao |
| [HADOOP-5154](https://issues.apache.org/jira/browse/HADOOP-5154) | 4-way deadlock in FairShare scheduler | Blocker | . | Vinod Kumar Vavilapalli | Matei Zaharia |
| [HADOOP-5146](https://issues.apache.org/jira/browse/HADOOP-5146) | LocalDirAllocator misses files on the local filesystem | Blocker | . | Arun C Murthy | Devaraj Das |
| [HADOOP-4780](https://issues.apache.org/jira/browse/HADOOP-4780) | Task Tracker burns a lot of cpu in calling getLocalCache | Major | . | Runping Qi | He Yongqiang |
| [HADOOP-4719](https://issues.apache.org/jira/browse/HADOOP-4719) | The ls shell command documentation is out-dated | Major | documentation | Tsz Wo Nicholas Sze | Ravi Phulari |
| [HADOOP-4638](https://issues.apache.org/jira/browse/HADOOP-4638) | Exception thrown in/from RecoveryManager.recover() should be caught and handled | Blocker | . | Amar Kamat | Amar Kamat |
| [HADOOP-3998](https://issues.apache.org/jira/browse/HADOOP-3998) | Got an exception from ClientFinalizer when the JT is terminated | Blocker | . | Amar Kamat | dhruba borthakur |
### TESTS:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
### SUB-TASKS:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
### OTHER:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |

View File

@ -0,0 +1,38 @@
<!---
# Licensed to the Apache Software Foundation (ASF) under one
# or more contributor license agreements. See the NOTICE file
# distributed with this work for additional information
# regarding copyright ownership. The ASF licenses this file
# to you under the Apache License, Version 2.0 (the
# "License"); you may not use this file except in compliance
# with the License. You may obtain a copy of the License at
#
# http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing, software
# distributed under the License is distributed on an "AS IS" BASIS,
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# limitations under the License.
-->
# Apache Hadoop 0.19.2 Release Notes
These release notes cover new developer and user-facing incompatibilities, features, and major improvements.
---
* [HADOOP-5332](https://issues.apache.org/jira/browse/HADOOP-5332) | *Blocker* | **Make support for file append API configurable**
Appending to HDFS files is supported only if dfs.support.append is set to true.
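
As a rough illustration of a configuration-gated feature like this one, the Python sketch below treats the configuration as a plain dictionary. The helper names (`append_enabled`, `open_for_append`, `AppendNotSupported`) are hypothetical and are not part of any Hadoop API. Treating a missing key as false is an assumption of the sketch, not something the note states.

```python
class AppendNotSupported(Exception):
    """Raised by this sketch when append is requested but not enabled."""


def append_enabled(conf):
    """Return True only if dfs.support.append is set to true.

    `conf` is a plain dict standing in for a Hadoop configuration.
    Assumption: a missing key is treated as "false".
    """
    return str(conf.get("dfs.support.append", "false")).strip().lower() == "true"


def open_for_append(conf, path):
    """Hypothetical guard: refuse to append unless the property is enabled."""
    if not append_enabled(conf):
        raise AppendNotSupported(
            "append to %s refused: dfs.support.append is not true" % path
        )
    return "append stream for %s" % path


if __name__ == "__main__":
    print(open_for_append({"dfs.support.append": "true"}, "/logs/app.log"))
```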
---
* [HADOOP-4780](https://issues.apache.org/jira/browse/HADOOP-4780) | *Major* | **Task Tracker burns a lot of cpu in calling getLocalCache**
Made DistributedCache remember the size of each cache directory.
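
The idea of remembering directory sizes, rather than rescanning the disk on every lookup, can be sketched in Python as follows. This is an illustration of the general technique only, not the DistributedCache implementation; the class `CacheDirSizes` and its methods are hypothetical names.

```python
import os


class CacheDirSizes:
    """Remember the size of each cache directory after the first scan."""

    def __init__(self):
        self._sizes = {}  # directory path -> total size in bytes
        self.scans = 0    # number of times the filesystem was actually walked

    def size_of(self, path):
        # Walk the directory only on the first request for this path.
        if path not in self._sizes:
            self._sizes[path] = self._scan(path)
        return self._sizes[path]

    def forget(self, path):
        # Drop the remembered size, e.g. after the directory is deleted.
        self._sizes.pop(path, None)

    def _scan(self, path):
        self.scans += 1
        total = 0
        for root, _dirs, files in os.walk(path):
            for name in files:
                total += os.path.getsize(os.path.join(root, name))
        return total
```

Repeated calls to `size_of` for the same directory then cost a dictionary lookup instead of a filesystem walk. The trade-off is that the remembered value must be invalidated (here via `forget`) whenever the directory contents change.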

View File

@ -0,0 +1,119 @@
<!---
# Licensed to the Apache Software Foundation (ASF) under one
# or more contributor license agreements. See the NOTICE file
# distributed with this work for additional information
# regarding copyright ownership. The ASF licenses this file
# to you under the Apache License, Version 2.0 (the
# "License"); you may not use this file except in compliance
# with the License. You may obtain a copy of the License at
#
# http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing, software
# distributed under the License is distributed on an "AS IS" BASIS,
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# limitations under the License.
-->
# Apache Hadoop Changelog
## Release 0.2.0 - 2006-05-05
### INCOMPATIBLE CHANGES:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
### NEW FEATURES:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
| [HADOOP-191](https://issues.apache.org/jira/browse/HADOOP-191) | add hadoopStreaming to src/contrib | Major | . | Michel Tourn | Doug Cutting |
| [HADOOP-189](https://issues.apache.org/jira/browse/HADOOP-189) | Add job jar lib, classes, etc. to CLASSPATH when in standalone mode | Major | . | stack | Doug Cutting |
| [HADOOP-148](https://issues.apache.org/jira/browse/HADOOP-148) | add a failure count to task trackers | Major | . | Owen O'Malley | Owen O'Malley |
| [HADOOP-132](https://issues.apache.org/jira/browse/HADOOP-132) | An API for reporting performance metrics | Major | . | David Bowen | |
| [HADOOP-65](https://issues.apache.org/jira/browse/HADOOP-65) | add a record I/O framework to hadoop | Minor | io, ipc | Sameer Paranjpye | |
| [HADOOP-51](https://issues.apache.org/jira/browse/HADOOP-51) | per-file replication counts | Major | . | Doug Cutting | Konstantin Shvachko |
### IMPROVEMENTS:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
| [HADOOP-198](https://issues.apache.org/jira/browse/HADOOP-198) | adding owen's examples to exampledriver | Minor | . | Mahadev konar | Mahadev konar |
| [HADOOP-178](https://issues.apache.org/jira/browse/HADOOP-178) | piggyback block work requests to heartbeats and move block replication/deletion startup delay from datanodes to namenode | Major | . | Hairong Kuang | Hairong Kuang |
| [HADOOP-177](https://issues.apache.org/jira/browse/HADOOP-177) | improvement to browse through the map/reduce tasks | Minor | . | Mahadev konar | Mahadev konar |
| [HADOOP-173](https://issues.apache.org/jira/browse/HADOOP-173) | optimize allocation of tasks w/ local data | Major | . | Doug Cutting | Doug Cutting |
| [HADOOP-170](https://issues.apache.org/jira/browse/HADOOP-170) | setReplication and related bug fixes | Major | fs | Konstantin Shvachko | Konstantin Shvachko |
| [HADOOP-167](https://issues.apache.org/jira/browse/HADOOP-167) | reducing the number of Configuration & JobConf objects created | Major | conf | Owen O'Malley | Owen O'Malley |
| [HADOOP-166](https://issues.apache.org/jira/browse/HADOOP-166) | IPC is unable to invoke methods that use interfaces as parameter | Minor | ipc | Stefan Groschupf | Doug Cutting |
| [HADOOP-150](https://issues.apache.org/jira/browse/HADOOP-150) | tip and task names should reflect the job name | Major | . | Owen O'Malley | Owen O'Malley |
| [HADOOP-144](https://issues.apache.org/jira/browse/HADOOP-144) | the dfs client id isn't relatable to the map/reduce task ids | Major | . | Owen O'Malley | Owen O'Malley |
| [HADOOP-142](https://issues.apache.org/jira/browse/HADOOP-142) | failed tasks should be rescheduled on different hosts after other jobs | Major | . | Owen O'Malley | Owen O'Malley |
| [HADOOP-138](https://issues.apache.org/jira/browse/HADOOP-138) | stop all tasks | Trivial | . | Stefan Groschupf | Doug Cutting |
| [HADOOP-131](https://issues.apache.org/jira/browse/HADOOP-131) | Separate start/stop-dfs.sh and start/stop-mapred.sh scripts | Minor | . | Chris A. Mattmann | Doug Cutting |
| [HADOOP-129](https://issues.apache.org/jira/browse/HADOOP-129) | FileSystem should not name files with java.io.File | Major | fs | Doug Cutting | Doug Cutting |
| [HADOOP-116](https://issues.apache.org/jira/browse/HADOOP-116) | cleaning up /tmp/hadoop/mapred/system | Major | . | raghavendra prabhu | Doug Cutting |
| [HADOOP-114](https://issues.apache.org/jira/browse/HADOOP-114) | Non-informative error message | Trivial | . | Rod Taylor | Doug Cutting |
| [HADOOP-96](https://issues.apache.org/jira/browse/HADOOP-96) | name server should log decisions that affect data: block creation, removal, replication | Critical | . | Yoram Arnon | Hairong Kuang |
### BUG FIXES:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
| [HADOOP-192](https://issues.apache.org/jira/browse/HADOOP-192) | Trivial JRE 1.5 versus 1.4 bug | Blocker | . | David Bowen | |
| [HADOOP-190](https://issues.apache.org/jira/browse/HADOOP-190) | Job fails though task succeeded if we fail to exit | Major | . | stack | |
| [HADOOP-188](https://issues.apache.org/jira/browse/HADOOP-188) | more unprotected RPC calls in JobClient.runJob allow loss of job due to timeout | Major | . | Owen O'Malley | Owen O'Malley |
| [HADOOP-186](https://issues.apache.org/jira/browse/HADOOP-186) | communication problems in the task tracker cause long latency | Major | . | Owen O'Malley | Owen O'Malley |
| [HADOOP-185](https://issues.apache.org/jira/browse/HADOOP-185) | tasks are lost during pollForNewTask | Critical | . | Owen O'Malley | Owen O'Malley |
| [HADOOP-183](https://issues.apache.org/jira/browse/HADOOP-183) | adjust file replication factor when loading image and edits according to replication.min and replication.max | Minor | . | Hairong Kuang | Hairong Kuang |
| [HADOOP-182](https://issues.apache.org/jira/browse/HADOOP-182) | lost task trackers should not update status of completed jobs | Major | . | Owen O'Malley | Owen O'Malley |
| [HADOOP-174](https://issues.apache.org/jira/browse/HADOOP-174) | jobclient kills job for one timeout | Major | . | Owen O'Malley | Owen O'Malley |
| [HADOOP-172](https://issues.apache.org/jira/browse/HADOOP-172) | rpc doesn't handle returning null for a String[] | Blocker | ipc | Owen O'Malley | Owen O'Malley |
| [HADOOP-169](https://issues.apache.org/jira/browse/HADOOP-169) | a single failure from locateMapOutputs kills the entire job | Critical | . | Owen O'Malley | Owen O'Malley |
| [HADOOP-168](https://issues.apache.org/jira/browse/HADOOP-168) | JobSubmissionProtocol and InterTrackerProtocol don't include "throws IOException" on all methods | Minor | . | Owen O'Malley | Owen O'Malley |
| [HADOOP-162](https://issues.apache.org/jira/browse/HADOOP-162) | concurrent modification exception in FSNamesystem.Lease.releaseLocks | Major | . | Owen O'Malley | Owen O'Malley |
| [HADOOP-160](https://issues.apache.org/jira/browse/HADOOP-160) | sleeping with locks held | Major | . | Owen O'Malley | Owen O'Malley |
| [HADOOP-157](https://issues.apache.org/jira/browse/HADOOP-157) | job fails because pendingCreates is not cleaned up after a task fails | Major | . | Owen O'Malley | Owen O'Malley |
| [HADOOP-151](https://issues.apache.org/jira/browse/HADOOP-151) | RPC code has socket leak? | Major | ipc | p sutter | Doug Cutting |
| [HADOOP-143](https://issues.apache.org/jira/browse/HADOOP-143) | exception call stacks are word wrapped in webapp | Major | . | Owen O'Malley | Owen O'Malley |
| [HADOOP-139](https://issues.apache.org/jira/browse/HADOOP-139) | Deadlock in LocalFileSystem lock/release | Major | fs | Igor Bolotin | Doug Cutting |
| [HADOOP-137](https://issues.apache.org/jira/browse/HADOOP-137) | Different TaskTrackers may get the same task tracker id, thus cause many problems. | Critical | . | Runping Qi | Owen O'Malley |
| [HADOOP-134](https://issues.apache.org/jira/browse/HADOOP-134) | JobTracker trapped in a loop if it fails to localize a task | Major | . | Runping Qi | Owen O'Malley |
| [HADOOP-133](https://issues.apache.org/jira/browse/HADOOP-133) | the TaskTracker.Child.ping thread calls exit | Major | . | Owen O'Malley | Owen O'Malley |
| [HADOOP-128](https://issues.apache.org/jira/browse/HADOOP-128) | Failure to replicate dfs block kills client | Major | . | Owen O'Malley | Owen O'Malley |
| [HADOOP-126](https://issues.apache.org/jira/browse/HADOOP-126) | "hadoop dfs -cp" does not copy crc files | Major | . | Konstantin Shvachko | Konstantin Shvachko |
| [HADOOP-125](https://issues.apache.org/jira/browse/HADOOP-125) | LocalFileSystem.makeAbsolute bug on Windows | Minor | fs | p sutter | Doug Cutting |
| [HADOOP-118](https://issues.apache.org/jira/browse/HADOOP-118) | Namenode does not always clean up pendingCreates | Critical | . | Owen O'Malley | Owen O'Malley |
| [HADOOP-117](https://issues.apache.org/jira/browse/HADOOP-117) | mapred temporary files not deleted | Blocker | . | raghavendra prabhu | Doug Cutting |
| [HADOOP-92](https://issues.apache.org/jira/browse/HADOOP-92) | Error Reporting/logging in MapReduce | Minor | . | Mahadev konar | Mahadev konar |
| [HADOOP-69](https://issues.apache.org/jira/browse/HADOOP-69) | Unchecked lookup value causes NPE in FSNamesystemgetDatanodeHints | Major | . | Bryan Pendleton | |
| [HADOOP-68](https://issues.apache.org/jira/browse/HADOOP-68) | "Cannot abandon block during write to \<file\>" and "Cannot obtain additional block for file \<file\>" errors during dfs write test | Major | . | Owen O'Malley | Owen O'Malley |
| [HADOOP-63](https://issues.apache.org/jira/browse/HADOOP-63) | problem with webapp when start a jobtracker | Minor | . | Hairong Kuang | Hairong Kuang |
### TESTS:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
| [HADOOP-194](https://issues.apache.org/jira/browse/HADOOP-194) | Distributed checkup of the file system consistency. | Major | . | Konstantin Shvachko | Konstantin Shvachko |
| [HADOOP-193](https://issues.apache.org/jira/browse/HADOOP-193) | DFS i/o benchmark. | Major | fs | Konstantin Shvachko | |
| [HADOOP-187](https://issues.apache.org/jira/browse/HADOOP-187) | simple distributed dfs random data writer & sort example applications | Major | . | Owen O'Malley | Owen O'Malley |
| [HADOOP-184](https://issues.apache.org/jira/browse/HADOOP-184) | hadoop nightly build and regression test on a cluster | Minor | . | Mahadev konar | Mahadev konar |
### SUB-TASKS:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
### OTHER:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |

View File

@ -0,0 +1,24 @@
<!---
# Licensed to the Apache Software Foundation (ASF) under one
# or more contributor license agreements. See the NOTICE file
# distributed with this work for additional information
# regarding copyright ownership. The ASF licenses this file
# to you under the Apache License, Version 2.0 (the
# "License"); you may not use this file except in compliance
# with the License. You may obtain a copy of the License at
#
# http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing, software
# distributed under the License is distributed on an "AS IS" BASIS,
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# limitations under the License.
-->
# Apache Hadoop 0.2.0 Release Notes
These release notes cover new developer and user-facing incompatibilities, features, and major improvements.

View File

@ -0,0 +1,67 @@
<!---
# Licensed to the Apache Software Foundation (ASF) under one
# or more contributor license agreements. See the NOTICE file
# distributed with this work for additional information
# regarding copyright ownership. The ASF licenses this file
# to you under the Apache License, Version 2.0 (the
# "License"); you may not use this file except in compliance
# with the License. You may obtain a copy of the License at
#
# http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing, software
# distributed under the License is distributed on an "AS IS" BASIS,
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# limitations under the License.
-->
# Apache Hadoop Changelog
## Release 0.2.1 - 2006-05-12
### INCOMPATIBLE CHANGES:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
### NEW FEATURES:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
### IMPROVEMENTS:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
### BUG FIXES:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
| [HADOOP-207](https://issues.apache.org/jira/browse/HADOOP-207) | Patch to HADOOP-96 uses long deprecated call | Critical | . | Bryan Pendleton | Hairong Kuang |
| [HADOOP-201](https://issues.apache.org/jira/browse/HADOOP-201) | hadoop dfs -report throws exception | Minor | . | Johan Oskarsson | Doug Cutting |
| [HADOOP-199](https://issues.apache.org/jira/browse/HADOOP-199) | reduce copy progress not updating | Minor | . | Owen O'Malley | Owen O'Malley |
### TESTS:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
### SUB-TASKS:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
### OTHER:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |

View File

@ -0,0 +1,24 @@
<!---
# Licensed to the Apache Software Foundation (ASF) under one
# or more contributor license agreements. See the NOTICE file
# distributed with this work for additional information
# regarding copyright ownership. The ASF licenses this file
# to you under the Apache License, Version 2.0 (the
# "License"); you may not use this file except in compliance
# with the License. You may obtain a copy of the License at
#
# http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing, software
# distributed under the License is distributed on an "AS IS" BASIS,
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# limitations under the License.
-->
# Apache Hadoop 0.2.1 Release Notes
These release notes cover new developer and user-facing incompatibilities, features, and major improvements.

View File

@ -0,0 +1,347 @@
<!---
# Licensed to the Apache Software Foundation (ASF) under one
# or more contributor license agreements. See the NOTICE file
# distributed with this work for additional information
# regarding copyright ownership. The ASF licenses this file
# to you under the Apache License, Version 2.0 (the
# "License"); you may not use this file except in compliance
# with the License. You may obtain a copy of the License at
#
# http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing, software
# distributed under the License is distributed on an "AS IS" BASIS,
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# limitations under the License.
-->
# Apache Hadoop Changelog
## Release 0.20.0 - 2009-04-22
### INCOMPATIBLE CHANGES:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
| [HADOOP-5531](https://issues.apache.org/jira/browse/HADOOP-5531) | Remove Chukwa on branch-0.20 | Blocker | . | Nigel Daley | Nigel Daley |
| [HADOOP-4970](https://issues.apache.org/jira/browse/HADOOP-4970) | Use the full path when move files to .Trash/Current | Major | . | Prasad Chakka | Prasad Chakka |
| [HADOOP-4826](https://issues.apache.org/jira/browse/HADOOP-4826) | Admin command saveNamespace. | Major | . | Konstantin Shvachko | Konstantin Shvachko |
| [HADOOP-4789](https://issues.apache.org/jira/browse/HADOOP-4789) | Change fair scheduler to share between pools by default, not between individual jobs | Minor | . | Matei Zaharia | Matei Zaharia |
| [HADOOP-4783](https://issues.apache.org/jira/browse/HADOOP-4783) | History files are given world readable permissions. | Blocker | . | Hemanth Yamijala | Amareshwari Sriramadasu |
| [HADOOP-4631](https://issues.apache.org/jira/browse/HADOOP-4631) | Split the default configurations into 3 parts | Major | conf | Owen O'Malley | Sharad Agarwal |
| [HADOOP-4618](https://issues.apache.org/jira/browse/HADOOP-4618) | Move http server from FSNamesystem into NameNode. | Major | . | Konstantin Shvachko | Konstantin Shvachko |
| [HADOOP-4576](https://issues.apache.org/jira/browse/HADOOP-4576) | Modify pending tasks count in the UI to pending jobs count in the UI | Major | . | Hemanth Yamijala | Sreekanth Ramakrishnan |
| [HADOOP-4572](https://issues.apache.org/jira/browse/HADOOP-4572) | INode and its sub-classes should be package private | Major | . | Tsz Wo Nicholas Sze | Tsz Wo Nicholas Sze |
| [HADOOP-4567](https://issues.apache.org/jira/browse/HADOOP-4567) | GetFileBlockLocations should return the NetworkTopology information of the machines that hosts those blocks | Major | . | dhruba borthakur | dhruba borthakur |
| [HADOOP-4445](https://issues.apache.org/jira/browse/HADOOP-4445) | Wrong number of running map/reduce tasks are displayed in queue information. | Major | . | Karam Singh | Sreekanth Ramakrishnan |
| [HADOOP-4435](https://issues.apache.org/jira/browse/HADOOP-4435) | The JobTracker should display the amount of heap memory used | Minor | . | dhruba borthakur | dhruba borthakur |
| [HADOOP-4422](https://issues.apache.org/jira/browse/HADOOP-4422) | S3 file systems should not create bucket | Major | fs/s3 | David Phillips | David Phillips |
| [HADOOP-4253](https://issues.apache.org/jira/browse/HADOOP-4253) | Fix warnings generated by FindBugs | Major | conf, fs, record | Suresh Srinivas | Suresh Srinivas |
| [HADOOP-4210](https://issues.apache.org/jira/browse/HADOOP-4210) | Findbugs warnings are printed related to equals implementation of several classes | Major | . | Suresh Srinivas | Suresh Srinivas |
| [HADOOP-4188](https://issues.apache.org/jira/browse/HADOOP-4188) | Remove Task's dependency on concrete file systems | Major | . | Tom White | Sharad Agarwal |
| [HADOOP-4103](https://issues.apache.org/jira/browse/HADOOP-4103) | Alert for missing blocks | Major | . | Christian Kunz | Raghu Angadi |
| [HADOOP-4035](https://issues.apache.org/jira/browse/HADOOP-4035) | Modify the capacity scheduler (HADOOP-3445) to schedule tasks based on memory requirements and task trackers free memory | Blocker | . | Hemanth Yamijala | Vinod Kumar Vavilapalli |
| [HADOOP-4029](https://issues.apache.org/jira/browse/HADOOP-4029) | NameNode should report status and performance for each replica of image and log | Major | . | Robert Chansler | Boris Shkolnik |
| [HADOOP-3986](https://issues.apache.org/jira/browse/HADOOP-3986) | JobClient should not have a static configuration | Major | . | Amareshwari Sriramadasu | Amareshwari Sriramadasu |
| [HADOOP-3923](https://issues.apache.org/jira/browse/HADOOP-3923) | Deprecate org.apache.hadoop.mapred.StatusHttpServer | Minor | . | Tsz Wo Nicholas Sze | Tsz Wo Nicholas Sze |
| [HADOOP-3750](https://issues.apache.org/jira/browse/HADOOP-3750) | Fix and enforce module dependencies | Major | . | Tom White | Sharad Agarwal |
| [HADOOP-3497](https://issues.apache.org/jira/browse/HADOOP-3497) | File globbing with a PathFilter is too restrictive | Major | fs | Tom White | Tom White |
| [HADOOP-3422](https://issues.apache.org/jira/browse/HADOOP-3422) | Ganglia counter metrics are all reported with the metric name "value", so the counter values can not be seen | Major | metrics | Jason | stack |
| [HADOOP-3344](https://issues.apache.org/jira/browse/HADOOP-3344) | libhdfs: always builds 32bit, even when x86\_64 Java used | Major | build | Craig Macdonald | Giridharan Kesavan |
| [HADOOP-1650](https://issues.apache.org/jira/browse/HADOOP-1650) | Upgrade Jetty to 6.x | Major | . | Devaraj Das | Chris Douglas |

### NEW FEATURES:

| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
| [HADOOP-4937](https://issues.apache.org/jira/browse/HADOOP-4937) | [HOD] Include ringmaster RPC port information in the notes attribute | Major | contrib/hod | Hemanth Yamijala | Peeyush Bishnoi |
| [HADOOP-4804](https://issues.apache.org/jira/browse/HADOOP-4804) | Create Forrest documentation for the fair scheduler | Minor | . | Matei Zaharia | Sreekanth Ramakrishnan |
| [HADOOP-4661](https://issues.apache.org/jira/browse/HADOOP-4661) | distch: a tool for distributed ch{mod,own} | Major | . | Tsz Wo Nicholas Sze | Tsz Wo Nicholas Sze |
| [HADOOP-4622](https://issues.apache.org/jira/browse/HADOOP-4622) | Explicitly specify interpreter for non-native binaries | Minor | . | Fredrik Hedberg | Fredrik Hedberg |
| [HADOOP-4575](https://issues.apache.org/jira/browse/HADOOP-4575) | An independent HTTPS proxy for HDFS | Major | security | Kan Zhang | Kan Zhang |
| [HADOOP-4348](https://issues.apache.org/jira/browse/HADOOP-4348) | Adding service-level authorization to Hadoop | Major | security | Kan Zhang | Arun C Murthy |
| [HADOOP-4284](https://issues.apache.org/jira/browse/HADOOP-4284) | Support for user configurable global filters on HttpServer | Major | security | Kan Zhang | Kan Zhang |
| [HADOOP-4179](https://issues.apache.org/jira/browse/HADOOP-4179) | Hadoop-Vaidya : Rule based performance diagnostic tool for Map/Reduce jobs | Major | . | Suhas Gogate | Suhas Gogate |

### IMPROVEMENTS:

| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
| [HADOOP-6150](https://issues.apache.org/jira/browse/HADOOP-6150) | Need to be able to instantiate a comparator instance from a comparator string without creating a TFile.Reader object | Minor | io | Hong Tang | Hong Tang |
| [HADOOP-5521](https://issues.apache.org/jira/browse/HADOOP-5521) | Remove dependency of testcases on RESTART\_COUNT | Major | test | Amar Kamat | Ravi Gummadi |
| [HADOOP-5520](https://issues.apache.org/jira/browse/HADOOP-5520) | Typo in diskQuota help documentation | Trivial | documentation | Ravi Phulari | Ravi Phulari |
| [HADOOP-5519](https://issues.apache.org/jira/browse/HADOOP-5519) | Remove claims from mapred-default.xml that prime numbers of tasks are helpful. | Major | . | Owen O'Malley | Owen O'Malley |
| [HADOOP-5514](https://issues.apache.org/jira/browse/HADOOP-5514) | Add waiting/failed tasks to JobTracker metrics | Major | metrics | Chris Douglas | Chris Douglas |
| [HADOOP-5468](https://issues.apache.org/jira/browse/HADOOP-5468) | Change Hadoop doc menu to sub-menus | Major | documentation | Corinne Chandel | Corinne Chandel |
| [HADOOP-5466](https://issues.apache.org/jira/browse/HADOOP-5466) | CSS Style Changes for Hadoop Doc Headers and Code | Major | documentation | Corinne Chandel | Corinne Chandel |
| [HADOOP-5395](https://issues.apache.org/jira/browse/HADOOP-5395) | When queue ACLs are enabled, the error message shown when a job is submitted to a non-existent queue is misleading | Major | . | Hemanth Yamijala | rahul k singh |
| [HADOOP-5379](https://issues.apache.org/jira/browse/HADOOP-5379) | Throw exception instead of writing to System.err when there is a CRC error on CBZip2InputStream | Minor | io | Rodrigo Schmidt | Rodrigo Schmidt |
| [HADOOP-5275](https://issues.apache.org/jira/browse/HADOOP-5275) | ivy directory should be there in hadoop tar ball | Major | build | Suman Sehgal | Giridharan Kesavan |
| [HADOOP-5122](https://issues.apache.org/jira/browse/HADOOP-5122) | libhdfs test conf uses deprecated fs.default.name value | Trivial | test | Craig Macdonald | Craig Macdonald |
| [HADOOP-5030](https://issues.apache.org/jira/browse/HADOOP-5030) | Chukwa RPM build improvements | Major | . | Eric Yang | Eric Yang |
| [HADOOP-4980](https://issues.apache.org/jira/browse/HADOOP-4980) | Cleanup the Capacity Scheduler code | Major | . | Vivek Ratan | Vivek Ratan |
| [HADOOP-4950](https://issues.apache.org/jira/browse/HADOOP-4950) | CompressorStream and BlockCompressorStream should be public | Major | io | Owen O'Malley | Owen O'Malley |
| [HADOOP-4944](https://issues.apache.org/jira/browse/HADOOP-4944) | Allow Xinclude in hadoop config file | Minor | conf | Rama, Ramasamy | Rama, Ramasamy |
| [HADOOP-4920](https://issues.apache.org/jira/browse/HADOOP-4920) | do not keep forrest output in subversion | Major | documentation | Doug Cutting | Doug Cutting |
| [HADOOP-4916](https://issues.apache.org/jira/browse/HADOOP-4916) | adjust build.xml to reference external properties when building the "init.d" for the rpm | Major | . | Eric Yang | Eric Yang |
| [HADOOP-4909](https://issues.apache.org/jira/browse/HADOOP-4909) | Clean up map/reduce api to take JobContext in the static methods | Major | . | Owen O'Malley | Owen O'Malley |
| [HADOOP-4880](https://issues.apache.org/jira/browse/HADOOP-4880) | Improvements to TestJobTrackerRestart | Major | test | Arun C Murthy | Amar Kamat |
| [HADOOP-4843](https://issues.apache.org/jira/browse/HADOOP-4843) | Collect Job History log file and Job Conf file into Chukwa | Major | . | Eric Yang | Eric Yang |
| [HADOOP-4838](https://issues.apache.org/jira/browse/HADOOP-4838) | Cleanup Metrics and the MBeans | Major | . | Sanjay Radia | Sanjay Radia |
| [HADOOP-4837](https://issues.apache.org/jira/browse/HADOOP-4837) | Move HADOOP\_CONF\_DIR to chukwa-env.sh | Major | . | Jerome Boulon | Jerome Boulon |
| [HADOOP-4828](https://issues.apache.org/jira/browse/HADOOP-4828) | Update documentation for default configuration | Major | documentation | Sharad Agarwal | Sharad Agarwal |
| [HADOOP-4827](https://issues.apache.org/jira/browse/HADOOP-4827) | Improve data aggregation in database | Major | . | Eric Yang | Eric Yang |
| [HADOOP-4758](https://issues.apache.org/jira/browse/HADOOP-4758) | Add a splitter for metrics contexts | Minor | metrics | Chris Douglas | Chris Douglas |
| [HADOOP-4753](https://issues.apache.org/jira/browse/HADOOP-4753) | gridmix2 code can be condensed | Minor | benchmarks | Chris Douglas | Chris Douglas |
| [HADOOP-4749](https://issues.apache.org/jira/browse/HADOOP-4749) | reducer should output input data size when shuffling is done | Major | . | Zheng Shao | He Yongqiang |
| [HADOOP-4747](https://issues.apache.org/jira/browse/HADOOP-4747) | Reuse FileStatus in FsShell where possible | Minor | fs | David Phillips | David Phillips |
| [HADOOP-4708](https://issues.apache.org/jira/browse/HADOOP-4708) | Add support for dfsadmin commands for test TestCLI unit test | Major | test | Boris Shkolnik | Boris Shkolnik |
| [HADOOP-4705](https://issues.apache.org/jira/browse/HADOOP-4705) | [HOD] Grant read permissions for files/directories created by hod as part of provisioning hadoop clusters | Major | contrib/hod | Hemanth Yamijala | Peeyush Bishnoi |
| [HADOOP-4699](https://issues.apache.org/jira/browse/HADOOP-4699) | Change TaskTracker.MapOutputServlet to send only the IFile segment, validate checksum in Reduce | Major | . | Chris Douglas | Chris Douglas |
| [HADOOP-4690](https://issues.apache.org/jira/browse/HADOOP-4690) | fuse-dfs - create source file/function + utils + config + main source files | Minor | . | Pete Wyckoff | Pete Wyckoff |
| [HADOOP-4668](https://issues.apache.org/jira/browse/HADOOP-4668) | Improve JavaDoc on JobConf.setCombinerClass to better document restrictions on combiners | Major | documentation | Owen O'Malley | Owen O'Malley |
| [HADOOP-4666](https://issues.apache.org/jira/browse/HADOOP-4666) | Launch reduces only after a few maps have run in the Fair Scheduler | Major | . | Matei Zaharia | Matei Zaharia |
| [HADOOP-4664](https://issues.apache.org/jira/browse/HADOOP-4664) | Parallelize job initialization | Blocker | . | Matei Zaharia | Jothi Padmanabhan |
| [HADOOP-4649](https://issues.apache.org/jira/browse/HADOOP-4649) | Improve abstraction for spill indices | Minor | . | Chris Douglas | Chris Douglas |
| [HADOOP-4640](https://issues.apache.org/jira/browse/HADOOP-4640) | Add ability to split text files compressed with lzo | Trivial | io | Johan Oskarsson | Johan Oskarsson |
| [HADOOP-4628](https://issues.apache.org/jira/browse/HADOOP-4628) | Move Hive out of Core | Major | . | Owen O'Malley | |
| [HADOOP-4599](https://issues.apache.org/jira/browse/HADOOP-4599) | BlocksMap and BlockInfo should be package private. | Major | . | Konstantin Shvachko | Konstantin Shvachko |
| [HADOOP-4565](https://issues.apache.org/jira/browse/HADOOP-4565) | MultiFileInputSplit can use data locality information to create splits | Major | . | dhruba borthakur | dhruba borthakur |
| [HADOOP-4545](https://issues.apache.org/jira/browse/HADOOP-4545) | Add an example of a secondary sort | Major | . | Owen O'Malley | Owen O'Malley |
| [HADOOP-4523](https://issues.apache.org/jira/browse/HADOOP-4523) | Enhance how memory-intensive user tasks are handled | Major | . | Vivek Ratan | Vinod Kumar Vavilapalli |
| [HADOOP-4485](https://issues.apache.org/jira/browse/HADOOP-4485) | ant compile-native shorthand | Trivial | build | Enis Soztutar | Enis Soztutar |
| [HADOOP-4454](https://issues.apache.org/jira/browse/HADOOP-4454) | Support comments in 'slaves' file | Minor | conf, scripts | Rama, Ramasamy | Rama, Ramasamy |
| [HADOOP-4453](https://issues.apache.org/jira/browse/HADOOP-4453) | Improve ssl handling for distcp | Major | . | Chris Douglas | Kan Zhang |
| [HADOOP-4440](https://issues.apache.org/jira/browse/HADOOP-4440) | TestJobInProgressListener should also test for jobs killed in queued state | Minor | test | Amar Kamat | Amar Kamat |
| [HADOOP-4437](https://issues.apache.org/jira/browse/HADOOP-4437) | Use qMC sequence to improve the accuracy of PiEstimator | Minor | . | Tsz Wo Nicholas Sze | Tsz Wo Nicholas Sze |
| [HADOOP-4339](https://issues.apache.org/jira/browse/HADOOP-4339) | Improve FsShell -du/-dus and FileSystem.getContentSummary efficiency | Major | fs | David Phillips | David Phillips |
| [HADOOP-4305](https://issues.apache.org/jira/browse/HADOOP-4305) | repeatedly blacklisted tasktrackers should get declared dead | Major | . | Christian Kunz | Amareshwari Sriramadasu |
| [HADOOP-4276](https://issues.apache.org/jira/browse/HADOOP-4276) | The mapred.\*ID classes are inefficient for hashCode and serialization | Major | . | Owen O'Malley | Owen O'Malley |
| [HADOOP-4244](https://issues.apache.org/jira/browse/HADOOP-4244) | In "ant test-patch", runContribTestOnEclipseFiles should not be run on Hudson only | Major | build, test | Tsz Wo Nicholas Sze | Tsz Wo Nicholas Sze |
| [HADOOP-4234](https://issues.apache.org/jira/browse/HADOOP-4234) | KFS: Allow KFS layer to interface with multiple KFS namenodes | Minor | fs | Sriram Rao | Sriram Rao |
| [HADOOP-4226](https://issues.apache.org/jira/browse/HADOOP-4226) | LineReader::readLine cleanup | Minor | . | Yuri Pradkin | Yuri Pradkin |
| [HADOOP-4178](https://issues.apache.org/jira/browse/HADOOP-4178) | The capacity scheduler's defaults for queues should be configurable. | Blocker | . | Owen O'Malley | Sreekanth Ramakrishnan |
| [HADOOP-4126](https://issues.apache.org/jira/browse/HADOOP-4126) | Allow access to HDFS web UI on EC2 | Major | contrib/cloud | Tom White | Tom White |
| [HADOOP-3770](https://issues.apache.org/jira/browse/HADOOP-3770) | improve composition, submission and result collection of gridmix | Major | benchmarks | Lingyun Yang | Runping Qi |
| [HADOOP-3461](https://issues.apache.org/jira/browse/HADOOP-3461) | Remove dfs.StringBytesWritable | Major | . | Tsz Wo Nicholas Sze | Tsz Wo Nicholas Sze |
| [HADOOP-3180](https://issues.apache.org/jira/browse/HADOOP-3180) | add name of faulty class to WritableName.getClass IOException upon ClassNotFoundException | Trivial | io | Pete Wyckoff | Pete Wyckoff |
| [HADOOP-3136](https://issues.apache.org/jira/browse/HADOOP-3136) | Assign multiple tasks per TaskTracker heartbeat | Major | . | Devaraj Das | Arun C Murthy |
| [HADOOP-3063](https://issues.apache.org/jira/browse/HADOOP-3063) | BloomMapFile - fail-fast version of MapFile for sparsely populated key space | Major | io | Andrzej Bialecki | Andrzej Bialecki |
| [HADOOP-2774](https://issues.apache.org/jira/browse/HADOOP-2774) | Add counters to show number of key/values that have been sorted and merged in the maps and reduces | Major | . | Owen O'Malley | Ravi Gummadi |
| [HADOOP-1230](https://issues.apache.org/jira/browse/HADOOP-1230) | Replace parameters with context objects in Mapper, Reducer, Partitioner, InputFormat, and OutputFormat classes | Major | . | Owen O'Malley | Owen O'Malley |
| [HADOOP-466](https://issues.apache.org/jira/browse/HADOOP-466) | Startup scripts will not start instances of Hadoop daemons w/different configs w/o setting separate PID directories | Major | conf | Vetle Roeim | |

### BUG FIXES:

| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
| [HADOOP-5645](https://issues.apache.org/jira/browse/HADOOP-5645) | After HADOOP-4920 we need a place to checkin releasenotes.html | Major | build | Nigel Daley | Nigel Daley |
| [HADOOP-5644](https://issues.apache.org/jira/browse/HADOOP-5644) | Namenode is stuck in safe mode | Major | . | Suresh Srinivas | Suresh Srinivas |
| [HADOOP-5607](https://issues.apache.org/jira/browse/HADOOP-5607) | TestCapacityScheduler fails with NPE | Major | test | Chris Douglas | Chris Douglas |
| [HADOOP-5605](https://issues.apache.org/jira/browse/HADOOP-5605) | All the replicas incorrectly got marked as corrupt. | Blocker | . | Raghu Angadi | Hairong Kuang |
| [HADOOP-5588](https://issues.apache.org/jira/browse/HADOOP-5588) | hadoop commands seem extremely slow in 0.20 branch | Blocker | fs | Koji Noguchi | Hairong Kuang |
| [HADOOP-5585](https://issues.apache.org/jira/browse/HADOOP-5585) | FileSystem statistic counters are too high when JVM reuse is enabled. | Blocker | . | Owen O'Malley | Owen O'Malley |
| [HADOOP-5577](https://issues.apache.org/jira/browse/HADOOP-5577) | The Job.waitForCompletion doesn't print status as the job runs | Blocker | . | Owen O'Malley | Owen O'Malley |
| [HADOOP-5576](https://issues.apache.org/jira/browse/HADOOP-5576) | LocalJobRunner does not run jobs using new MapReduce API | Blocker | . | Tom White | Tom White |
| [HADOOP-5571](https://issues.apache.org/jira/browse/HADOOP-5571) | TupleWritable can return incorrect results if it contains more than 32 values | Major | . | Jingkei Ly | Jingkei Ly |
| [HADOOP-5565](https://issues.apache.org/jira/browse/HADOOP-5565) | The job instrumentation API needs to have a method for finalizeJob | Major | . | Jerome Boulon | Jerome Boulon |
| [HADOOP-5564](https://issues.apache.org/jira/browse/HADOOP-5564) | hadoop command uses large JVM heap size | Blocker | build | Suresh Srinivas | Suresh Srinivas |
| [HADOOP-5557](https://issues.apache.org/jira/browse/HADOOP-5557) | Two minor problems in TestOverReplicatedBlocks | Minor | test | Tsz Wo Nicholas Sze | Tsz Wo Nicholas Sze |
| [HADOOP-5554](https://issues.apache.org/jira/browse/HADOOP-5554) | DataNodeCluster should create blocks with the same generation stamp as the blocks created in CreateEditsLog | Major | test | Hairong Kuang | Hairong Kuang |
| [HADOOP-5551](https://issues.apache.org/jira/browse/HADOOP-5551) | Namenode permits directory destruction on overwrite | Critical | . | Brian Bockelman | Brian Bockelman |
| [HADOOP-5549](https://issues.apache.org/jira/browse/HADOOP-5549) | ReplicationMonitor should schedule both replication and deletion work in one iteration | Major | . | Hairong Kuang | Hairong Kuang |
| [HADOOP-5548](https://issues.apache.org/jira/browse/HADOOP-5548) | Observed negative running maps on the job tracker | Blocker | . | Owen O'Malley | Amareshwari Sriramadasu |
| [HADOOP-5534](https://issues.apache.org/jira/browse/HADOOP-5534) | Deadlock triggered by FairScheduler scheduler's servlet due to changes from HADOOP-5214. | Blocker | . | Vinod Kumar Vavilapalli | rahul k singh |
| [HADOOP-5522](https://issues.apache.org/jira/browse/HADOOP-5522) | Document job setup/cleanup tasks and task cleanup tasks in mapred tutorial | Blocker | . | Amareshwari Sriramadasu | Amareshwari Sriramadasu |
| [HADOOP-5516](https://issues.apache.org/jira/browse/HADOOP-5516) | TaskMemoryManagerThread crashes in a corner case | Blocker | . | Vinod Kumar Vavilapalli | Vinod Kumar Vavilapalli |
| [HADOOP-5505](https://issues.apache.org/jira/browse/HADOOP-5505) | TestMissingBlocksAlert fails on 0.20. | Major | . | Raghu Angadi | Raghu Angadi |
| [HADOOP-5499](https://issues.apache.org/jira/browse/HADOOP-5499) | JobTracker metrics do not match job counters | Major | metrics | Chris Douglas | |
| [HADOOP-5493](https://issues.apache.org/jira/browse/HADOOP-5493) | Shuffle copiers do not return Codecs back to the pool at the end of shuffling | Blocker | . | Devaraj Das | Jothi Padmanabhan |
| [HADOOP-5490](https://issues.apache.org/jira/browse/HADOOP-5490) | TestParallelInitialization failed on NoSuchElementException | Blocker | . | Hairong Kuang | Jothi Padmanabhan |
| [HADOOP-5484](https://issues.apache.org/jira/browse/HADOOP-5484) | TestRecoveryManager fails with FileAlreadyExistsException | Major | test | Hairong Kuang | Amar Kamat |
| [HADOOP-5483](https://issues.apache.org/jira/browse/HADOOP-5483) | Directory/file cleanup thread throws IllegalStateException | Major | . | Hairong Kuang | |
| [HADOOP-5479](https://issues.apache.org/jira/browse/HADOOP-5479) | NameNode should not send empty block replication request to DataNode | Critical | . | Hairong Kuang | Hairong Kuang |
| [HADOOP-5473](https://issues.apache.org/jira/browse/HADOOP-5473) | Race condition in command-line kill for a task | Major | . | Amareshwari Sriramadasu | Amareshwari Sriramadasu |
| [HADOOP-5471](https://issues.apache.org/jira/browse/HADOOP-5471) | SyncLogs thread in Child.java would update wrong file for a cleanup attempt, in some cases. | Major | . | Amareshwari Sriramadasu | Amareshwari Sriramadasu |
| [HADOOP-5465](https://issues.apache.org/jira/browse/HADOOP-5465) | Blocks remain under-replicated | Blocker | . | Hairong Kuang | Hairong Kuang |
| [HADOOP-5463](https://issues.apache.org/jira/browse/HADOOP-5463) | Balancer throws "Not a host:port pair" unless port is specified in fs.default.name | Minor | . | Stuart White | Stuart White |
| [HADOOP-5459](https://issues.apache.org/jira/browse/HADOOP-5459) | CRC errors not detected reading intermediate output into memory with problematic length | Major | . | Chris Douglas | Chris Douglas |
| [HADOOP-5440](https://issues.apache.org/jira/browse/HADOOP-5440) | Successful taskid are not removed from TaskMemoryManager | Blocker | . | Amareshwari Sriramadasu | Amareshwari Sriramadasu |
| [HADOOP-5437](https://issues.apache.org/jira/browse/HADOOP-5437) | Unit test for jvm -reuse | Blocker | . | Amareshwari Sriramadasu | Owen O'Malley |
| [HADOOP-5432](https://issues.apache.org/jira/browse/HADOOP-5432) | TestHdfsProxy fails on 0.20 | Major | . | Chris Douglas | Chris Douglas |
| [HADOOP-5417](https://issues.apache.org/jira/browse/HADOOP-5417) | IPC client drops interrupted exceptions | Blocker | ipc | Owen O'Malley | Owen O'Malley |
| [HADOOP-5416](https://issues.apache.org/jira/browse/HADOOP-5416) | Wrong description of "hadoop fs -test" in FS Shell guide. | Trivial | documentation | Ravi Phulari | Ravi Phulari |
| [HADOOP-5414](https://issues.apache.org/jira/browse/HADOOP-5414) | IO exception while executing hadoop fs -touchz fileName | Major | fs | Ravi Phulari | Hairong Kuang |
| [HADOOP-5394](https://issues.apache.org/jira/browse/HADOOP-5394) | JobTracker might schedule 2 attempts of the same task with the same attempt id across restarts | Critical | . | Amar Kamat | Amar Kamat |
| [HADOOP-5390](https://issues.apache.org/jira/browse/HADOOP-5390) | hdfsproxy includes duplicate jars in tarball, source in binary tarball | Major | build | Chris Douglas | zhiyong zhang |
| [HADOOP-5384](https://issues.apache.org/jira/browse/HADOOP-5384) | DataNodeCluster should not create blocks with generationStamp == 1 | Blocker | test | Tsz Wo Nicholas Sze | Tsz Wo Nicholas Sze |
| [HADOOP-5382](https://issues.apache.org/jira/browse/HADOOP-5382) | The new map/reduce api doesn't support combiners | Blocker | . | Owen O'Malley | Owen O'Malley |
| [HADOOP-5376](https://issues.apache.org/jira/browse/HADOOP-5376) | JobInProgress.obtainTaskCleanupTask() throws an ArrayIndexOutOfBoundsException | Blocker | . | Vinod Kumar Vavilapalli | Amareshwari Sriramadasu |
| [HADOOP-5338](https://issues.apache.org/jira/browse/HADOOP-5338) | Reduce tasks are stuck waiting for map outputs when none are in progress | Blocker | . | Ramya Sunil | Amar Kamat |
| [HADOOP-5337](https://issues.apache.org/jira/browse/HADOOP-5337) | JobTracker greedily schedules tasks without running tasks to join | Major | . | Karam Singh | Amar Kamat |
| [HADOOP-5328](https://issues.apache.org/jira/browse/HADOOP-5328) | Renaming of Job history file is incorrect if JobTracker is restarted multiple times | Major | . | Karam Singh | Amar Kamat |
| [HADOOP-5327](https://issues.apache.org/jira/browse/HADOOP-5327) | Job files for a job failing because of ACLs are not cleaned from the system directory | Blocker | . | Karam Singh | Amar Kamat |
| [HADOOP-5326](https://issues.apache.org/jira/browse/HADOOP-5326) | bzip2 codec (CBZip2OutputStream) creates corrupted output file for some inputs | Major | io | Rodrigo Schmidt | Rodrigo Schmidt |
| [HADOOP-5320](https://issues.apache.org/jira/browse/HADOOP-5320) | TestMapReduceLocal is missing a close() that is causing it to fail while running the test on NFS | Minor | test | Jothi Padmanabhan | Jothi Padmanabhan |
| [HADOOP-5306](https://issues.apache.org/jira/browse/HADOOP-5306) | Job History file can have empty string as http port after JobTracker Restart in case of lost TT, which can result in NumberFormatException when JT is restarted 2nd time | Blocker | . | Ravi Gummadi | Amar Kamat |
| [HADOOP-5305](https://issues.apache.org/jira/browse/HADOOP-5305) | Unit test org.apache.hadoop.fs.TestCopyFiles.testMapCount fails on trunk | Minor | test | Ramya Sunil | Tsz Wo Nicholas Sze |
| [HADOOP-5298](https://issues.apache.org/jira/browse/HADOOP-5298) | Unit test fails out on trunk org.apache.hadoop.http.TestServletFilter.testServletFilter | Major | test | Lee Tucker | Tsz Wo Nicholas Sze |
| [HADOOP-5292](https://issues.apache.org/jira/browse/HADOOP-5292) | KFS: calling getFileBlockLocations() on 0-length file causes a NPE | Major | . | Sriram Rao | Sriram Rao |
| [HADOOP-5285](https://issues.apache.org/jira/browse/HADOOP-5285) | JobTracker hangs for long periods of time | Blocker | . | Vinod Kumar Vavilapalli | Devaraj Das |
| [HADOOP-5282](https://issues.apache.org/jira/browse/HADOOP-5282) | Running tasks on lost trackers are incorrectly logging the attempt and task failures | Blocker | . | Amar Kamat | Amar Kamat |
| [HADOOP-5281](https://issues.apache.org/jira/browse/HADOOP-5281) | GzipCodec fails second time it is used in a process | Blocker | . | Ben Maurer | Chris Douglas |
| [HADOOP-5280](https://issues.apache.org/jira/browse/HADOOP-5280) | When expiring a lost launched task, JT doesn't remove the attempt from the taskidToTIPMap. | Blocker | . | Vinod Kumar Vavilapalli | Devaraj Das |
| [HADOOP-5274](https://issues.apache.org/jira/browse/HADOOP-5274) | gridmix2 is not getting compiled to generate gridmix.jar | Critical | benchmarks | Suman Sehgal | Chris Douglas |
| [HADOOP-5272](https://issues.apache.org/jira/browse/HADOOP-5272) | JobTracker does not log TIP start information after restart | Critical | . | Amar Kamat | Amar Kamat |
| [HADOOP-5269](https://issues.apache.org/jira/browse/HADOOP-5269) | TaskTracker.runningTasks holding FAILED\_UNCLEAN and KILLED\_UNCLEAN taskStatuses forever in some cases. | Blocker | . | Amareshwari Sriramadasu | Amareshwari Sriramadasu |
| [HADOOP-5259](https://issues.apache.org/jira/browse/HADOOP-5259) | Job with output hdfs:/user/\<username\>/outputpath (no authority) fails with Wrong FS | Major | fs | Koji Noguchi | Doug Cutting |
| [HADOOP-5255](https://issues.apache.org/jira/browse/HADOOP-5255) | Fix for HADOOP-5079 HashFunction inadvertently destroys some randomness | Minor | io | stack | Jonathan Ellis |
| [HADOOP-5248](https://issues.apache.org/jira/browse/HADOOP-5248) | Job directories could remain undeleted in some scenarios after job completes. | Blocker | . | Hemanth Yamijala | Devaraj Das |
| [HADOOP-5247](https://issues.apache.org/jira/browse/HADOOP-5247) | NPEs in JobTracker and JobClient when mapred.jobtracker.completeuserjobs.maximum is set to zero. | Blocker | . | Vinod Kumar Vavilapalli | Amar Kamat |
| [HADOOP-5241](https://issues.apache.org/jira/browse/HADOOP-5241) | Reduce tasks get stuck because of over-estimated task size (regression from 0.18) | Blocker | . | Andy Pavlo | Sharad Agarwal |
| [HADOOP-5233](https://issues.apache.org/jira/browse/HADOOP-5233) | Reducer not Succeeded after 100% | Blocker | . | Amareshwari Sriramadasu | Amareshwari Sriramadasu |
| [HADOOP-5227](https://issues.apache.org/jira/browse/HADOOP-5227) | distcp -delete option deletes all files from the destination directory | Major | . | Suhas Gogate | Tsz Wo Nicholas Sze |
| [HADOOP-5214](https://issues.apache.org/jira/browse/HADOOP-5214) | ConcurrentModificationException in FairScheduler.getTotalSlots | Blocker | . | Vinod Kumar Vavilapalli | rahul k singh |
| [HADOOP-5211](https://issues.apache.org/jira/browse/HADOOP-5211) | TestSetupAndCleanupFailure fails with timeout | Major | . | Enis Soztutar | Enis Soztutar |
| [HADOOP-5154](https://issues.apache.org/jira/browse/HADOOP-5154) | 4-way deadlock in FairShare scheduler | Blocker | . | Vinod Kumar Vavilapalli | Matei Zaharia |
| [HADOOP-5146](https://issues.apache.org/jira/browse/HADOOP-5146) | LocalDirAllocator misses files on the local filesystem | Blocker | . | Arun C Murthy | Devaraj Das |
| [HADOOP-5145](https://issues.apache.org/jira/browse/HADOOP-5145) | Balancer sometimes runs out of memory after days or weeks running | Major | . | Hairong Kuang | Hairong Kuang |
| [HADOOP-5142](https://issues.apache.org/jira/browse/HADOOP-5142) | MapWritable#putAll does not store classes | Minor | io | Doğacan Güney | |
| [HADOOP-5139](https://issues.apache.org/jira/browse/HADOOP-5139) | RPC call throws IllegalArgumentException complaining duplicate metrics registration | Major | ipc | Hairong Kuang | Hairong Kuang |
| [HADOOP-5111](https://issues.apache.org/jira/browse/HADOOP-5111) | Generic mapreduce classes cannot be used with Job::set\* methods | Major | . | Chris Douglas | Chris Douglas |
| [HADOOP-5085](https://issues.apache.org/jira/browse/HADOOP-5085) | Copying a file to local with Crc throws an exception | Major | fs | Hairong Kuang | Hairong Kuang |
| [HADOOP-5079](https://issues.apache.org/jira/browse/HADOOP-5079) | HashFunction inadvertently destroys some randomness | Major | util | Jonathan Ellis | Jonathan Ellis |
| [HADOOP-5075](https://issues.apache.org/jira/browse/HADOOP-5075) | Potential infinite loop in updateMinSlots | Blocker | . | Matei Zaharia | Matei Zaharia |
| [HADOOP-5068](https://issues.apache.org/jira/browse/HADOOP-5068) | testClusterBlockingForLackOfMemory in TestCapacityScheduler fails randomly | Major | . | Sreekanth Ramakrishnan | Vinod Kumar Vavilapalli |
| [HADOOP-5066](https://issues.apache.org/jira/browse/HADOOP-5066) | ant binary should not compile docs | Major | build | Amareshwari Sriramadasu | Giridharan Kesavan |
| [HADOOP-5065](https://issues.apache.org/jira/browse/HADOOP-5065) | setOutputFormatClass in mapreduce.Job fails for SequenceFileOutputFormat | Major | . | Chris Douglas | Chris Douglas |
| [HADOOP-5048](https://issues.apache.org/jira/browse/HADOOP-5048) | Sometimes job is still displayed in jobqueue\_details page for long time after job was killed. | Major | . | Karam Singh | Sreekanth Ramakrishnan |
| [HADOOP-5026](https://issues.apache.org/jira/browse/HADOOP-5026) | Startup scripts should be svn-executable | Minor | . | Andy Konwinski | Andy Konwinski |
| [HADOOP-4999](https://issues.apache.org/jira/browse/HADOOP-4999) | IndexOutOfBoundsException in FSEditLog | Major | . | Boris Shkolnik | Boris Shkolnik |
| [HADOOP-4993](https://issues.apache.org/jira/browse/HADOOP-4993) | chukwa agent startup should be more modular | Major | . | Ari Rabkin | Ari Rabkin |
| [HADOOP-4988](https://issues.apache.org/jira/browse/HADOOP-4988) | An earlier fix, for HADOOP-4373, results in a problem with reclaiming capacity when one or more queues have a capacity equal to zero | Blocker | . | Vivek Ratan | Vivek Ratan |
| [HADOOP-4979](https://issues.apache.org/jira/browse/HADOOP-4979) | Capacity Scheduler does not always return no task to a TT if a job's memory requirements are not met | Major | . | Vivek Ratan | Vivek Ratan |
| [HADOOP-4977](https://issues.apache.org/jira/browse/HADOOP-4977) | Deadlock between reclaimCapacity and assignTasks | Blocker | . | Matei Zaharia | Vivek Ratan |
| [HADOOP-4963](https://issues.apache.org/jira/browse/HADOOP-4963) | Logs saying org.apache.hadoop.util.DiskChecker$DiskErrorException in TaskTracker are not relevant | Minor | . | Amareshwari Sriramadasu | Amareshwari Sriramadasu |
| [HADOOP-4956](https://issues.apache.org/jira/browse/HADOOP-4956) | NPE when jobdetails.jsp | Blocker | . | Devaraj Das | Devaraj Das |
| [HADOOP-4949](https://issues.apache.org/jira/browse/HADOOP-4949) | Native compilation is broken | Blocker | . | Chris Douglas | Chris Douglas |
| [HADOOP-4947](https://issues.apache.org/jira/browse/HADOOP-4947) | use regex to parse chukwa commands | Minor | . | Ari Rabkin | Ari Rabkin |
| [HADOOP-4925](https://issues.apache.org/jira/browse/HADOOP-4925) | Chukwa HTTP connection policies are too hard to configure | Major | . | Ari Rabkin | Ari Rabkin |
| [HADOOP-4914](https://issues.apache.org/jira/browse/HADOOP-4914) | Support chkconfig for chukwa start up scripts | Trivial | . | Eric Yang | Eric Yang |
| [HADOOP-4896](https://issues.apache.org/jira/browse/HADOOP-4896) | hdfs fsck does not load hdfs configuration. | Blocker | . | Raghu Angadi | Raghu Angadi |
| [HADOOP-4894](https://issues.apache.org/jira/browse/HADOOP-4894) | fix Chukwa's jettyCollector.sh | Minor | . | Ari Rabkin | Ari Rabkin |
| [HADOOP-4892](https://issues.apache.org/jira/browse/HADOOP-4892) | File descriptor leak in Chukwa's ExecPlugin | Major | . | Ari Rabkin | Ari Rabkin |
| [HADOOP-4889](https://issues.apache.org/jira/browse/HADOOP-4889) | Chukwa RPM needs to chown user/group in build stage instead of post install stage. | Trivial | . | Eric Yang | Eric Yang |
| [HADOOP-4884](https://issues.apache.org/jira/browse/HADOOP-4884) | Change Date format pattern for Time Series graph | Trivial | . | Eric Yang | Eric Yang |
| [HADOOP-4879](https://issues.apache.org/jira/browse/HADOOP-4879) | TestJobTrackerRestart fails on trunk | Blocker | test | Arun C Murthy | Owen O'Malley |
| [HADOOP-4878](https://issues.apache.org/jira/browse/HADOOP-4878) | After introduction of ivy ant test-patch always returns -1 score | Blocker | build | Sreekanth Ramakrishnan | Giridharan Kesavan |
| [HADOOP-4876](https://issues.apache.org/jira/browse/HADOOP-4876) | Capacity reclamation for queue would not work as intended | Blocker | . | Sreekanth Ramakrishnan | Sreekanth Ramakrishnan |
| [HADOOP-4874](https://issues.apache.org/jira/browse/HADOOP-4874) | Remove bindings to lzo | Blocker | io | Owen O'Malley | Owen O'Malley |
| [HADOOP-4873](https://issues.apache.org/jira/browse/HADOOP-4873) | display minMaps/Reduces on advanced scheduler page | Major | . | Joydeep Sen Sarma | Joydeep Sen Sarma |
| [HADOOP-4869](https://issues.apache.org/jira/browse/HADOOP-4869) | Lost Trackers may not be able to join back | Blocker | . | Devaraj Das | Amareshwari Sriramadasu |
| [HADOOP-4860](https://issues.apache.org/jira/browse/HADOOP-4860) | File Tailing Adaptor test cases cause random failures | Major | . | Eric Yang | Eric Yang |
| [HADOOP-4855](https://issues.apache.org/jira/browse/HADOOP-4855) | Fix help message in MRAdmin | Minor | fs, security | Arun C Murthy | Arun C Murthy |
| [HADOOP-4854](https://issues.apache.org/jira/browse/HADOOP-4854) | Capacity Scheduler should read the reclaim-capacity-interval config value from its own config file | Minor | . | Vivek Ratan | Sreekanth Ramakrishnan |
| [HADOOP-4849](https://issues.apache.org/jira/browse/HADOOP-4849) | Document service level authorization - HADOOP-4348 | Major | documentation, scripts | Arun C Murthy | Arun C Murthy |
| [HADOOP-4845](https://issues.apache.org/jira/browse/HADOOP-4845) | Shuffle counter issues | Major | . | Chris Douglas | He Yongqiang |
| [HADOOP-4844](https://issues.apache.org/jira/browse/HADOOP-4844) | Document deprecation of o.a.h.fs.permission.AccessControlException better | Minor | documentation, fs | Arun C Murthy | Arun C Murthy |
| [HADOOP-4830](https://issues.apache.org/jira/browse/HADOOP-4830) | Have end to end tests based on MiniMRCluster to verify that queue capacities are honoured. | Major | . | Vinod Kumar Vavilapalli | Vinod Kumar Vavilapalli |
| [HADOOP-4825](https://issues.apache.org/jira/browse/HADOOP-4825) | Chukwa - change jps to ps for process status detection | Major | . | Eric Yang | Eric Yang |
| [HADOOP-4820](https://issues.apache.org/jira/browse/HADOOP-4820) | Remove unused method FSNamesystem.deleteInSafeMode | Minor | . | Suresh Srinivas | Suresh Srinivas |
| [HADOOP-4818](https://issues.apache.org/jira/browse/HADOOP-4818) | Enable JobTracker Instrumentation API with user job configuration | Major | . | Eric Yang | Eric Yang |
| [HADOOP-4811](https://issues.apache.org/jira/browse/HADOOP-4811) | Import style sheet images for Chukwa | Trivial | . | Eric Yang | Eric Yang |
| [HADOOP-4805](https://issues.apache.org/jira/browse/HADOOP-4805) | Remove black list feature from Chukwa Agent to Chukwa Collector communication | Major | . | Eric Yang | Eric Yang |
| [HADOOP-4796](https://issues.apache.org/jira/browse/HADOOP-4796) | Test target for chukwa build.xml needs to comply to hadoop build.xml test suites | Major | build | Eric Yang | Eric Yang |
| [HADOOP-4792](https://issues.apache.org/jira/browse/HADOOP-4792) | Chukwa build process generates files not tracked by svn | Major | build | Chris Douglas | Chris Douglas |
| [HADOOP-4791](https://issues.apache.org/jira/browse/HADOOP-4791) | Improve rpm build process for Chukwa | Major | . | Eric Yang | Eric Yang |
| [HADOOP-4787](https://issues.apache.org/jira/browse/HADOOP-4787) | TestTrackerBlacklistAcrossJobs fails randomly | Blocker | test | Hemanth Yamijala | Amareshwari Sriramadasu |
| [HADOOP-4786](https://issues.apache.org/jira/browse/HADOOP-4786) | TestTrackerBlacklistAcrossJobs compilation is broken | Blocker | . | Hemanth Yamijala | Hemanth Yamijala |
| [HADOOP-4785](https://issues.apache.org/jira/browse/HADOOP-4785) | avoid two calls to System.currentTimeMillis() in heartbeat | Blocker | . | Amareshwari Sriramadasu | Amareshwari Sriramadasu |
| [HADOOP-4782](https://issues.apache.org/jira/browse/HADOOP-4782) | [HOD] HOD opens up directory permissions more than required | Blocker | contrib/hod | Hemanth Yamijala | Peeyush Bishnoi |
| [HADOOP-4774](https://issues.apache.org/jira/browse/HADOOP-4774) | CapacityScheduler should work even if capacity-scheduler.xml is not in the classpath, i.e. with the code defaults | Major | . | Amar Kamat | Sreekanth Ramakrishnan |
| [HADOOP-4771](https://issues.apache.org/jira/browse/HADOOP-4771) | FSImage saveFSImage() will have problem. | Major | . | Ruyue Ma | Ruyue Ma |
| [HADOOP-4770](https://issues.apache.org/jira/browse/HADOOP-4770) | gridmix2 run script doesn't work on trunk | Trivial | benchmarks | Chris Douglas | Chris Douglas |
| [HADOOP-4744](https://issues.apache.org/jira/browse/HADOOP-4744) | Wrong resolution of hostname and port | Blocker | . | Amareshwari Sriramadasu | Jothi Padmanabhan |
| [HADOOP-4738](https://issues.apache.org/jira/browse/HADOOP-4738) | saveVersion.sh could write a package-info.java that cannot be compiled when used with git | Trivial | build | Hemanth Yamijala | Chris Douglas |
| [HADOOP-4737](https://issues.apache.org/jira/browse/HADOOP-4737) | Job end notification should give status as "KILLED" when job gets killed. | Major | . | Ruchir Shah | Amareshwari Sriramadasu |
| [HADOOP-4732](https://issues.apache.org/jira/browse/HADOOP-4732) | connectionTimeout and readTimeout are passed in wrong order in ReduceTask.java | Minor | . | Amareshwari Sriramadasu | Amareshwari Sriramadasu |
| [HADOOP-4716](https://issues.apache.org/jira/browse/HADOOP-4716) | testRestartWithLostTracker frequently times out | Blocker | . | Johan Oskarsson | Amar Kamat |
| [HADOOP-4706](https://issues.apache.org/jira/browse/HADOOP-4706) | IFileOutputStream.close should call close of the underlying stream | Minor | . | Jothi Padmanabhan | Jothi Padmanabhan |
| [HADOOP-4704](https://issues.apache.org/jira/browse/HADOOP-4704) | javadoc: "the the" =\> "the" | Trivial | documentation | Tsz Wo Nicholas Sze | Tsz Wo Nicholas Sze |
| [HADOOP-4698](https://issues.apache.org/jira/browse/HADOOP-4698) | TestMapRed fails with 64bit JDK | Major | test | Nigel Daley | Nigel Daley |
| [HADOOP-4695](https://issues.apache.org/jira/browse/HADOOP-4695) | TestGlobalFilter.testServletFilter fails | Blocker | test | Tsz Wo Nicholas Sze | Kan Zhang |
| [HADOOP-4692](https://issues.apache.org/jira/browse/HADOOP-4692) | Namenode in infinite loop for replicating/deleting corrupted block | Major | . | Hairong Kuang | Hairong Kuang |
| [HADOOP-4691](https://issues.apache.org/jira/browse/HADOOP-4691) | In the javadoc of IndexedSortable.compare(...), the link is wrong. | Minor | documentation, util | Tsz Wo Nicholas Sze | Tsz Wo Nicholas Sze |
| [HADOOP-4683](https://issues.apache.org/jira/browse/HADOOP-4683) | Move the call to getMapCompletionEvents in ReduceTask.ReduceCopier.fetchOutputs to a separate thread | Major | . | Devaraj Das | Jothi Padmanabhan |
| [HADOOP-4677](https://issues.apache.org/jira/browse/HADOOP-4677) | FileSystem.getFileBlockLocations() (aka default implementation for Local FileSystem) incorrect. | Major | fs | Hong Tang | Hong Tang |
| [HADOOP-4676](https://issues.apache.org/jira/browse/HADOOP-4676) | Go back to Job has a wrong URL on blacklisted trackers page | Minor | . | Amareshwari Sriramadasu | Amareshwari Sriramadasu |
| [HADOOP-4671](https://issues.apache.org/jira/browse/HADOOP-4671) | Make some "flag" booleans volatile because they are set/read in different threads | Minor | . | Owen O'Malley | Chris Douglas |
| [HADOOP-4669](https://issues.apache.org/jira/browse/HADOOP-4669) | A file-open sometimes may not set the access time of a file | Major | . | dhruba borthakur | dhruba borthakur |
| [HADOOP-4650](https://issues.apache.org/jira/browse/HADOOP-4650) | local.cache.size is set to 10 GB, while DEFAULT\_CACHE\_SIZE is set to 1 MB | Trivial | filecache | Jeff Hammerbacher | Jeff Hammerbacher |
| [HADOOP-4645](https://issues.apache.org/jira/browse/HADOOP-4645) | hdfs proxy doesn't package itself right. | Major | . | Owen O'Malley | Kan Zhang |
| [HADOOP-4638](https://issues.apache.org/jira/browse/HADOOP-4638) | Exception thrown in/from RecoveryManager.recover() should be caught and handled | Blocker | . | Amar Kamat | Amar Kamat |
| [HADOOP-4623](https://issues.apache.org/jira/browse/HADOOP-4623) | Running tasks are not maintained by JobInProgress if speculation is off | Major | . | Amar Kamat | Amar Kamat |
| [HADOOP-4621](https://issues.apache.org/jira/browse/HADOOP-4621) | javadoc: warning - Multiple sources of package comments found for some packages | Major | documentation | Tsz Wo Nicholas Sze | Kan Zhang |
| [HADOOP-4608](https://issues.apache.org/jira/browse/HADOOP-4608) | Examples -Driver does not check first argument. | Major | . | Edward J. Yoon | Edward J. Yoon |
| [HADOOP-4606](https://issues.apache.org/jira/browse/HADOOP-4606) | There is a cygpath error if log directory does not exist | Minor | scripts | Tsz Wo Nicholas Sze | Tsz Wo Nicholas Sze |
| [HADOOP-4598](https://issues.apache.org/jira/browse/HADOOP-4598) | '-setrep' command skips under-replicated blocks | Minor | . | Koji Noguchi | Hairong Kuang |
| [HADOOP-4589](https://issues.apache.org/jira/browse/HADOOP-4589) | Fix the PiEstimator output messages and code comments | Minor | documentation | Tsz Wo Nicholas Sze | Tsz Wo Nicholas Sze |
| [HADOOP-4587](https://issues.apache.org/jira/browse/HADOOP-4587) | typo in javadoc for map.input.file | Trivial | documentation | Koji Noguchi | Koji Noguchi |
| [HADOOP-4583](https://issues.apache.org/jira/browse/HADOOP-4583) | Code optimization/cleanup in HDFS | Major | . | Suresh Srinivas | Suresh Srinivas |
| [HADOOP-4579](https://issues.apache.org/jira/browse/HADOOP-4579) | StorageDirectory is not removed from the storage directories list when writing to an edits fails | Major | . | Boris Shkolnik | Boris Shkolnik |
| [HADOOP-4571](https://issues.apache.org/jira/browse/HADOOP-4571) | chukwa conf files are not cleaned up | Minor | . | Tsz Wo Nicholas Sze | Eric Yang |
| [HADOOP-4558](https://issues.apache.org/jira/browse/HADOOP-4558) | Scheduler fails to reclaim capacity if Jobs are submitted to queue one after the other | Major | . | Karam Singh | Amar Kamat |
| [HADOOP-4530](https://issues.apache.org/jira/browse/HADOOP-4530) | In fsck, HttpServletResponse sendError fails with IllegalStateException | Major | . | Koji Noguchi | Hairong Kuang |
| [HADOOP-4513](https://issues.apache.org/jira/browse/HADOOP-4513) | Capacity scheduler should initialize tasks asynchronously | Major | . | Hemanth Yamijala | Sreekanth Ramakrishnan |
| [HADOOP-4429](https://issues.apache.org/jira/browse/HADOOP-4429) | Misconfigured UNIX Groups Break Hadoop | Minor | . | Alex Loddengaard | Alex Loddengaard |
| [HADOOP-4408](https://issues.apache.org/jira/browse/HADOOP-4408) | FsAction operations shouldn't create new Objects | Minor | fs | Chris Douglas | Chris Douglas |
| [HADOOP-4388](https://issues.apache.org/jira/browse/HADOOP-4388) | Bug in Datanode transferBlocks code | Major | . | Suresh Srinivas | Suresh Srinivas |
| [HADOOP-4377](https://issues.apache.org/jira/browse/HADOOP-4377) | Race condition creating S3 buffer directory for NativeS3FileSystem | Major | fs/s3 | David Phillips | David Phillips |
| [HADOOP-4374](https://issues.apache.org/jira/browse/HADOOP-4374) | JVM should not be killed but given an opportunity to exit gracefully | Major | . | Devaraj Das | Ravi Gummadi |
| [HADOOP-4346](https://issues.apache.org/jira/browse/HADOOP-4346) | Hadoop triggers a "soft" fd leak. | Major | io | Raghu Angadi | Raghu Angadi |
| [HADOOP-4325](https://issues.apache.org/jira/browse/HADOOP-4325) | Hadoop SocketInputStream.read() should return -1 in case of EOF. | Minor | io | Raghu Angadi | Raghu Angadi |
| [HADOOP-4262](https://issues.apache.org/jira/browse/HADOOP-4262) | message generated when the client exception has a null message is not useful | Minor | ipc | Steve Loughran | |
| [HADOOP-4245](https://issues.apache.org/jira/browse/HADOOP-4245) | KFS: Update the kfs jar file | Major | fs | Sriram Rao | Sriram Rao |
| [HADOOP-4238](https://issues.apache.org/jira/browse/HADOOP-4238) | [mapred] Unavailable schedulingInfo in "hadoop list [all]" output should be marked "N/A" | Major | . | Vinod Kumar Vavilapalli | Sreekanth Ramakrishnan |
| [HADOOP-4207](https://issues.apache.org/jira/browse/HADOOP-4207) | update derby.jar to eliminate the one time wait of 2-3 minutes while running junit tests | Minor | . | Prasad Chakka | Prasad Chakka |
| [HADOOP-4204](https://issues.apache.org/jira/browse/HADOOP-4204) | Fix warnings generated by FindBugs | Blocker | . | Suresh Srinivas | Suresh Srinivas |
| [HADOOP-4141](https://issues.apache.org/jira/browse/HADOOP-4141) | ScriptBasedMapping has infinite loop with invalid jobconf parameter | Minor | . | Aaron Kimball | Aaron Kimball |
| [HADOOP-3921](https://issues.apache.org/jira/browse/HADOOP-3921) | Clover breaks nightly builds | Major | build | Tom White | Tom White |
| [HADOOP-3842](https://issues.apache.org/jira/browse/HADOOP-3842) | There is a window where the JobTracker is in the RUNNING state (i.e. ready to accept jobs) and never executes them. | Major | . | Amar Kamat | Hemanth Yamijala |
| [HADOOP-3810](https://issues.apache.org/jira/browse/HADOOP-3810) | NameNode seems unstable on a cluster with little space left | Major | . | Raghu Angadi | Hairong Kuang |
| [HADOOP-3293](https://issues.apache.org/jira/browse/HADOOP-3293) | When an input split spans cross block boundary, the split location should be the host having most of bytes on it. | Major | . | Runping Qi | Jothi Padmanabhan |
| [HADOOP-2771](https://issues.apache.org/jira/browse/HADOOP-2771) | changing the number of reduces dramatically changes the time of the map time | Major | . | Owen O'Malley | |
### TESTS:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
| [HADOOP-4857](https://issues.apache.org/jira/browse/HADOOP-4857) | TestUlimit is failing after HADOOP-4620 | Blocker | test | Sharad Agarwal | Ravi Gummadi |
| [HADOOP-4728](https://issues.apache.org/jira/browse/HADOOP-4728) | Tests for NameNode -checkpoint option with different configurations | Minor | . | Boris Shkolnik | Boris Shkolnik |
| [HADOOP-4722](https://issues.apache.org/jira/browse/HADOOP-4722) | adding tests for quotas command line error messages | Minor | test | Boris Shkolnik | Boris Shkolnik |
| [HADOOP-4688](https://issues.apache.org/jira/browse/HADOOP-4688) | multiple spills/reducers test case for mergeParts() needed | Major | . | Yuri Pradkin | Chris Douglas |
| [HADOOP-4505](https://issues.apache.org/jira/browse/HADOOP-4505) | Add a unit test to test faulty setup task and cleanup task killing the job | Major | . | Amareshwari Sriramadasu | Amareshwari Sriramadasu |
| [HADOOP-4458](https://issues.apache.org/jira/browse/HADOOP-4458) | Add a unit test for applications creating symlinks in working directory | Major | test | Amareshwari Sriramadasu | Amareshwari Sriramadasu |
| [HADOOP-3192](https://issues.apache.org/jira/browse/HADOOP-3192) | Test behavior of very large (32GB) name nodes | Major | . | Robert Chansler | |
### SUB-TASKS:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
| [HADOOP-4612](https://issues.apache.org/jira/browse/HADOOP-4612) | Remove RunJar's dependency on JobClient | Major | . | Sharad Agarwal | Sharad Agarwal |
| [HADOOP-4187](https://issues.apache.org/jira/browse/HADOOP-4187) | Create a MapReduce-specific ReflectionUtils that handles JobConf and JobConfigurable | Major | . | Tom White | Sharad Agarwal |
| [HADOOP-4185](https://issues.apache.org/jira/browse/HADOOP-4185) | Add setVerifyChecksum() method to FileSystem | Major | fs | Tom White | Sharad Agarwal |
### OTHER:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |


@ -0,0 +1,347 @@
<!---
# Licensed to the Apache Software Foundation (ASF) under one
# or more contributor license agreements. See the NOTICE file
# distributed with this work for additional information
# regarding copyright ownership. The ASF licenses this file
# to you under the Apache License, Version 2.0 (the
# "License"); you may not use this file except in compliance
# with the License. You may obtain a copy of the License at
#
# http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing, software
# distributed under the License is distributed on an "AS IS" BASIS,
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# limitations under the License.
-->
# Apache Hadoop 0.20.0 Release Notes
These release notes cover new developer and user-facing incompatibilities, features, and major improvements.
---
* [HADOOP-5565](https://issues.apache.org/jira/browse/HADOOP-5565) | *Major* | **The job instrumentation API needs to have a method for finalizeJob,**
Added finalizeJob and terminateJob methods to the JobTrackerInstrumentation class.
---
* [HADOOP-5548](https://issues.apache.org/jira/browse/HADOOP-5548) | *Blocker* | **Observed negative running maps on the job tracker**
Adds synchronization for JobTracker methods in RecoveryManager.
---
* [HADOOP-5531](https://issues.apache.org/jira/browse/HADOOP-5531) | *Blocker* | **Remove Chukwa on branch-0.20**
Disabled Chukwa unit tests for 0.20 branch only.
---
* [HADOOP-5521](https://issues.apache.org/jira/browse/HADOOP-5521) | *Major* | **Remove dependency of testcases on RESTART\_COUNT**
This patch makes TestJobHistory and its dependent testcases independent of RESTART\_COUNT.
---
* [HADOOP-5468](https://issues.apache.org/jira/browse/HADOOP-5468) | *Major* | **Change Hadoop doc menu to sub-menus**
Reformatted HTML documentation for Hadoop to use submenus at the left column.
---
* [HADOOP-5030](https://issues.apache.org/jira/browse/HADOOP-5030) | *Major* | **Chukwa RPM build improvements**
Changed RPM install location to the value specified by build.properties file.
---
* [HADOOP-4970](https://issues.apache.org/jira/browse/HADOOP-4970) | *Major* | **Use the full path when move files to .Trash/Current**
Changed trash facility to use absolute path of the deleted file.
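
The effect of keeping the absolute path can be sketched as follows. This is a minimal Python illustration, not Hadoop's implementation: the function name `trash_destination` and the home directory `/user/alice` are assumptions, and only the `.Trash/Current` location comes from the issue title. Preserving every parent component means two deleted files with the same base name no longer collide in the trash.

```python
import posixpath


def trash_destination(home_dir: str, deleted_path: str) -> str:
    """Return where `deleted_path` would land under the user's trash directory.

    Hypothetical helper: the layout below is an assumption for illustration.
    """
    if not posixpath.isabs(deleted_path):
        raise ValueError("expected an absolute path: " + deleted_path)
    # Keep every parent component so equal base names cannot collide.
    relative = deleted_path.lstrip("/")
    return posixpath.join(home_dir, ".Trash", "Current", relative)
```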
---
* [HADOOP-4873](https://issues.apache.org/jira/browse/HADOOP-4873) | *Major* | **display minMaps/Reduces on advanced scheduler page**
Changed fair scheduler UI to display minMaps and minReduces variables.
---
* [HADOOP-4843](https://issues.apache.org/jira/browse/HADOOP-4843) | *Major* | **Collect Job History log file and Job Conf file into Chukwa**
Introduced Chukwa collection of job history.
---
* [HADOOP-4827](https://issues.apache.org/jira/browse/HADOOP-4827) | *Major* | **Improve data aggregation in database**
Improved framework for data aggregation in Chukwa.
---
* [HADOOP-4826](https://issues.apache.org/jira/browse/HADOOP-4826) | *Major* | **Admin command saveNamespace.**
Introduced new dfsadmin command saveNamespace to command the name service to do an immediate save of the file system image.
---
* [HADOOP-4789](https://issues.apache.org/jira/browse/HADOOP-4789) | *Minor* | **Change fair scheduler to share between pools by default, not between individual jobs**
Changed fair scheduler to divide resources equally between pools, not jobs.
---
* [HADOOP-4783](https://issues.apache.org/jira/browse/HADOOP-4783) | *Blocker* | **History files are given world readable permissions.**
Changed history directory permissions to 750 and history file permissions to 740.
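
To make the octal modes concrete, the sketch below decodes them with Python's standard library. It is independent of Hadoop's own code, and the constant names are illustrative assumptions rather than Hadoop identifiers. Neither mode sets any bit for "others", which is what removes world readability.

```python
import stat

# Illustrative constants; these names are assumptions, not Hadoop identifiers.
HISTORY_DIR_MODE = 0o750   # owner rwx, group r-x, others none
HISTORY_FILE_MODE = 0o740  # owner rwx, group r--, others none


def describe(mode: int, is_dir: bool) -> str:
    """Render permission bits in `ls -l` style."""
    file_type = stat.S_IFDIR if is_dir else stat.S_IFREG
    return stat.filemode(file_type | mode)


def world_readable(mode: int) -> bool:
    """Return True when the read bit for 'others' is set."""
    return bool(mode & stat.S_IROTH)
```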
---
* [HADOOP-4749](https://issues.apache.org/jira/browse/HADOOP-4749) | *Major* | **reducer should output input data size when shuffling is done**
Added a new counter REDUCE\_INPUT\_BYTES.
---
* [HADOOP-4661](https://issues.apache.org/jira/browse/HADOOP-4661) | *Major* | **distch: a tool for distributed ch{mod,own}**
Introduced distch tool for parallel ch{mod, own, grp}.
---
* [HADOOP-4631](https://issues.apache.org/jira/browse/HADOOP-4631) | *Major* | **Split the default configurations into 3 parts**
Split hadoop-default.xml into core-default.xml, hdfs-default.xml and mapred-default.xml.
---
* [HADOOP-4618](https://issues.apache.org/jira/browse/HADOOP-4618) | *Major* | **Move http server from FSNamesystem into NameNode.**
Moved HTTP server from FSNamesystem to NameNode. Removed FSNamesystem.getNameNodeInfoPort(). Replaced FSNamesystem.getDFSNameNodeMachine() and FSNamesystem.getDFSNameNodePort() with new method FSNamesystem.getDFSNameNodeAddress(). Removed constructor NameNode(bindAddress, conf).
---
* [HADOOP-4576](https://issues.apache.org/jira/browse/HADOOP-4576) | *Major* | **Modify pending tasks count in the UI to pending jobs count in the UI**
Changed capacity scheduler UI to better present number of running and pending tasks.
---
* [HADOOP-4575](https://issues.apache.org/jira/browse/HADOOP-4575) | *Major* | **An independent HTTPS proxy for HDFS**
Introduced independent HSFTP proxy server for authenticated access to clusters.
---
* [HADOOP-4572](https://issues.apache.org/jira/browse/HADOOP-4572) | *Major* | **INode and its sub-classes should be package private**
Moved org.apache.hadoop.hdfs.{CreateEditsLog, NNThroughputBenchmark} to org.apache.hadoop.hdfs.server.namenode.
---
* [HADOOP-4567](https://issues.apache.org/jira/browse/HADOOP-4567) | *Major* | **GetFileBlockLocations should return the NetworkTopology information of the machines that hosts those blocks**
Changed GetFileBlockLocations to return topology information for nodes that host the block replicas.
---
* [HADOOP-4565](https://issues.apache.org/jira/browse/HADOOP-4565) | *Major* | **MultiFileInputSplit can use data locality information to create splits**
Improved MultiFileInputFormat so that multiple blocks from the same node or same rack can be combined into a single split.
---
* [HADOOP-4454](https://issues.apache.org/jira/browse/HADOOP-4454) | *Minor* | **Support comments in 'slaves' file**
Changed processing of conf/slaves file to allow # to begin a comment.
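
The behavior described above can be sketched in Python as follows. Hadoop reads this file from its shell scripts, so this is only an illustration of the parsing rule, not the actual implementation. The function name `parse_hosts` is hypothetical, and the sketch assumes that a `#` also ends the useful part of a line that starts with a host name.

```python
from typing import List


def parse_hosts(text: str) -> List[str]:
    """Return host names from a slaves-style file, ignoring comments.

    Assumption: everything from the first '#' to the end of a line is a
    comment, and lines that are empty after stripping are skipped.
    """
    hosts = []
    for line in text.splitlines():
        entry = line.split("#", 1)[0].strip()
        if entry:
            hosts.append(entry)
    return hosts
```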
---
* [HADOOP-4445](https://issues.apache.org/jira/browse/HADOOP-4445) | *Major* | **Wrong number of running map/reduce tasks are displayed in queue information.**
Changed JobTracker UI to better present the number of active tasks.
---
* [HADOOP-4435](https://issues.apache.org/jira/browse/HADOOP-4435) | *Minor* | **The JobTracker should display the amount of heap memory used**
Changed JobTracker web status page to display the amount of heap memory in use. This changes the JobSubmissionProtocol.
---
* [HADOOP-4422](https://issues.apache.org/jira/browse/HADOOP-4422) | *Major* | **S3 file systems should not create bucket**
Modified Hadoop file system to no longer create S3 buckets. Applications can create buckets for their S3 file systems by other means, for example, using the JetS3t API.
---
* [HADOOP-4374](https://issues.apache.org/jira/browse/HADOOP-4374) | *Major* | **JVM should not be killed but given an opportunity to exit gracefully**
This patch (1) adds a shutdownHook that calls syncLogs so that the logs of the current task are flushed and log.index is up to date in cases such as System.exit() or termination by a signal (other than SIGKILL);
(2) changes writeToIndexFile() to write to a temporary index file first and then rename it to log.index, so that updates to the log.index file are atomic.
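
The write-then-rename pattern from item (2) can be sketched in Python as shown below. This is a generic illustration of the technique, not the Hadoop code: the function name `write_index_atomically` and the temporary file naming are assumptions. Readers see either the old file or the new one, never a partially written file, because the rename replaces the target in a single step on POSIX file systems.

```python
import os
import tempfile


def write_index_atomically(path: str, content: str) -> None:
    """Write `content` to `path` via a temporary file and an atomic rename."""
    directory = os.path.dirname(os.path.abspath(path))
    # Create the temporary file in the same directory so the rename stays
    # on one file system.
    fd, tmp_path = tempfile.mkstemp(dir=directory, prefix=".index.", suffix=".tmp")
    try:
        with os.fdopen(fd, "w") as tmp:
            tmp.write(content)
            tmp.flush()
            os.fsync(tmp.fileno())  # push the data to disk before renaming
        os.replace(tmp_path, path)  # atomically swap in the new file
    except BaseException:
        # Clean up the temporary file if anything went wrong before the rename.
        if os.path.exists(tmp_path):
            os.remove(tmp_path)
        raise
```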
---
* [HADOOP-4305](https://issues.apache.org/jira/browse/HADOOP-4305) | *Major* | **repeatedly blacklisted tasktrackers should get declared dead**
Improved TaskTracker blacklisting strategy to better exclude faulty trackers from executing tasks.
---
* [HADOOP-4284](https://issues.apache.org/jira/browse/HADOOP-4284) | *Major* | **Support for user configurable global filters on HttpServer**
Introduced HttpServer method to support global filters.
---
* [HADOOP-4253](https://issues.apache.org/jira/browse/HADOOP-4253) | *Major* | **Fix warnings generated by FindBugs**
Removed from class org.apache.hadoop.fs.RawLocalFileSystem deprecated methods public String getName(), public void lock(Path p, boolean shared) and public void release(Path p).
---
* [HADOOP-4234](https://issues.apache.org/jira/browse/HADOOP-4234) | *Minor* | **KFS: Allow KFS layer to interface with multiple KFS namenodes**
Changed KFS glue layer to allow applications to interface with multiple KFS metaservers.
---
* [HADOOP-4210](https://issues.apache.org/jira/browse/HADOOP-4210) | *Major* | **Findbugs warnings are printed related to equals implementation of several classes**
Changed public class org.apache.hadoop.mapreduce.ID to be an abstract class. Removed from class org.apache.hadoop.mapreduce.ID the methods public static ID read(DataInput in) and public static ID forName(String str).
---
* [HADOOP-4188](https://issues.apache.org/jira/browse/HADOOP-4188) | *Major* | **Remove Task's dependency on concrete file systems**
Removed Task's dependency on concrete file systems by taking list from FileSystem class. Added statistics table to FileSystem class. Deprecated FileSystem method getStatistics(Class\<? extends FileSystem\> cls).
---
* [HADOOP-4179](https://issues.apache.org/jira/browse/HADOOP-4179) | *Major* | **Hadoop-Vaidya : Rule based performance diagnostic tool for Map/Reduce jobs**
Introduced Vaidya rule based performance diagnostic tool for Map/Reduce jobs.
---
* [HADOOP-4103](https://issues.apache.org/jira/browse/HADOOP-4103) | *Major* | **Alert for missing blocks**
Modified dfsadmin -report to report under-replicated blocks, blocks with corrupt replicas, and missing blocks.
---
* [HADOOP-4035](https://issues.apache.org/jira/browse/HADOOP-4035) | *Blocker* | **Modify the capacity scheduler (HADOOP-3445) to schedule tasks based on memory requirements and task trackers free memory**
Changed capacity scheduler policy to take note of task memory requirements and task tracker memory availability.
---
* [HADOOP-4029](https://issues.apache.org/jira/browse/HADOOP-4029) | *Major* | **NameNode should report status and performance for each replica of image and log**
Added name node storage information to the dfshealth page, and moved data node information to a separate page.
---
* [HADOOP-3986](https://issues.apache.org/jira/browse/HADOOP-3986) | *Major* | **JobClient should not have a static configuration**
Removed classes org.apache.hadoop.mapred.JobShell and org.apache.hadoop.mapred.TestJobShell. Removed from JobClient methods static void setCommandLineConfig(Configuration conf) and public static Configuration getCommandLineConfig().
---
* [HADOOP-3923](https://issues.apache.org/jira/browse/HADOOP-3923) | *Minor* | **Deprecate org.apache.hadoop.mapred.StatusHttpServer**
Moved class org.apache.hadoop.mapred.StatusHttpServer to org.apache.hadoop.http.HttpServer.
---
* [HADOOP-3750](https://issues.apache.org/jira/browse/HADOOP-3750) | *Major* | **Fix and enforce module dependencies**
Removed deprecated method parseArgs from org.apache.hadoop.fs.FileSystem.
---
* [HADOOP-3497](https://issues.apache.org/jira/browse/HADOOP-3497) | *Major* | **File globbing with a PathFilter is too restrictive**
Changed the semantics of file globbing with a PathFilter (using the globStatus method of FileSystem). Previously, the filtering was too restrictive, so that a glob of /\*/\* and a filter that only accepts /a/b would not have matched /a/b. With this change, /a/b does match.
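
The difference can be illustrated with a small Python model. It is not the Hadoop implementation: `glob_paths` and its `filter_intermediate` flag are hypothetical, and the sketch assumes that the old restrictiveness came from applying the filter to intermediate path components as well as to the final match.

```python
from fnmatch import fnmatchcase
from typing import Callable, Iterable, List


def glob_paths(paths: Iterable[str], pattern: str,
               accept: Callable[[str], bool],
               filter_intermediate: bool) -> List[str]:
    """Match `paths` against `pattern` component by component, then filter.

    filter_intermediate=True models the assumed old behavior (every prefix
    of the path must pass the filter); False models the new behavior (only
    the fully matched path is filtered).
    """
    pattern_parts = pattern.strip("/").split("/")
    matches = []
    for path in paths:
        parts = path.strip("/").split("/")
        if len(parts) != len(pattern_parts):
            continue
        if not all(fnmatchcase(p, pat) for p, pat in zip(parts, pattern_parts)):
            continue
        # Decide which prefixes of the path the filter must accept.
        depths = range(1, len(parts) + 1) if filter_intermediate else [len(parts)]
        if all(accept("/" + "/".join(parts[:d])) for d in depths):
            matches.append(path)
    return matches
```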
---
* [HADOOP-3422](https://issues.apache.org/jira/browse/HADOOP-3422) | *Major* | **Ganglia counter metrics are all reported with the metric name "value", so the counter values can not be seen**
Changed names of ganglia metrics to avoid conflicts and to better identify source function.
---
* [HADOOP-3344](https://issues.apache.org/jira/browse/HADOOP-3344) | *Major* | **libhdfs: always builds 32bit, even when x86\_64 Java used**
Changed build procedure for libhdfs to build correctly for different platforms. Build instructions are in the Jira item.
---
* [HADOOP-3063](https://issues.apache.org/jira/browse/HADOOP-3063) | *Major* | **BloomMapFile - fail-fast version of MapFile for sparsely populated key space**
Introduced BloomMapFile subclass of MapFile that creates a Bloom filter from all keys.
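
The fail-fast idea can be sketched in Python as follows. This is a generic Bloom filter in front of a dictionary, not the BloomMapFile code: the class names, the bit-array size, and the SHA-256-based hashing are all assumptions chosen for illustration. A Bloom filter never reports a stored key as absent, so a negative answer lets the lookup return immediately without touching the underlying data; a positive answer may occasionally be a false positive and still needs the real lookup.

```python
import hashlib
from typing import Dict, Iterable, Optional


class BloomFilter:
    """Minimal Bloom filter; sizes and hashing scheme are illustrative."""

    def __init__(self, num_bits: int = 1024, num_hashes: int = 3) -> None:
        self.num_bits = num_bits
        self.num_hashes = num_hashes
        self.bits = bytearray((num_bits + 7) // 8)

    def _positions(self, key: str) -> Iterable[int]:
        # Derive several bit positions by hashing the key with different seeds.
        for seed in range(self.num_hashes):
            digest = hashlib.sha256(f"{seed}:{key}".encode("utf-8")).digest()
            yield int.from_bytes(digest[:8], "big") % self.num_bits

    def add(self, key: str) -> None:
        for pos in self._positions(key):
            self.bits[pos // 8] |= 1 << (pos % 8)

    def might_contain(self, key: str) -> bool:
        return all(self.bits[pos // 8] & (1 << (pos % 8))
                   for pos in self._positions(key))


class BloomBackedMap:
    """Hypothetical stand-in for a map file with a Bloom filter over its keys."""

    def __init__(self, entries: Dict[str, str]) -> None:
        self._entries = dict(entries)
        self._filter = BloomFilter()
        for key in self._entries:
            self._filter.add(key)
        self.lookups = 0  # counts the "expensive" lookups actually performed

    def get(self, key: str) -> Optional[str]:
        if not self._filter.might_contain(key):
            return None  # definitely absent: skip the expensive lookup
        self.lookups += 1
        return self._entries.get(key)
```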
---
* [HADOOP-1650](https://issues.apache.org/jira/browse/HADOOP-1650) | *Major* | **Upgrade Jetty to 6.x**
Upgraded all core servers to use Jetty 6.
---
* [HADOOP-1230](https://issues.apache.org/jira/browse/HADOOP-1230) | *Major* | **Replace parameters with context objects in Mapper, Reducer, Partitioner, InputFormat, and OutputFormat classes**
Replaced parameters with context objects in Mapper, Reducer, Partitioner, InputFormat, and OutputFormat classes.


@ -0,0 +1,139 @@
<!---
# Licensed to the Apache Software Foundation (ASF) under one
# or more contributor license agreements. See the NOTICE file
# distributed with this work for additional information
# regarding copyright ownership. The ASF licenses this file
# to you under the Apache License, Version 2.0 (the
# "License"); you may not use this file except in compliance
# with the License. You may obtain a copy of the License at
#
# http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing, software
# distributed under the License is distributed on an "AS IS" BASIS,
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# limitations under the License.
-->
# Apache Hadoop Changelog
## Release 0.20.1 - 2009-09-01
### INCOMPATIBLE CHANGES:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
| [HADOOP-5881](https://issues.apache.org/jira/browse/HADOOP-5881) | Simplify configuration related to task-memory-monitoring and memory-based scheduling | Major | . | Vinod Kumar Vavilapalli | Vinod Kumar Vavilapalli |
| [HADOOP-5726](https://issues.apache.org/jira/browse/HADOOP-5726) | Remove pre-emption from the capacity scheduler code base | Major | . | Hemanth Yamijala | rahul k singh |
### NEW FEATURES:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
| [HADOOP-6080](https://issues.apache.org/jira/browse/HADOOP-6080) | Handling of Trash with quota | Major | fs | Koji Noguchi | Jakob Homan |
| [HADOOP-5714](https://issues.apache.org/jira/browse/HADOOP-5714) | Metric to show number of fs.exists (or number of getFileInfo) calls | Minor | metrics | Koji Noguchi | Jakob Homan |
| [HADOOP-3315](https://issues.apache.org/jira/browse/HADOOP-3315) | New binary file format | Major | io | Owen O'Malley | Hong Tang |
### IMPROVEMENTS:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
| [HDFS-635](https://issues.apache.org/jira/browse/HDFS-635) | HDFS Project page does not show 0.20.1 documentation/release information. | Major | documentation | Andy Sautins | |
| [HDFS-527](https://issues.apache.org/jira/browse/HDFS-527) | Refactor DFSClient constructors | Major | hdfs-client | Tsz Wo Nicholas Sze | Tsz Wo Nicholas Sze |
| [MAPREDUCE-767](https://issues.apache.org/jira/browse/MAPREDUCE-767) | to remove mapreduce dependency on commons-cli2 | Major | contrib/streaming | Giridharan Kesavan | Amar Kamat |
| [MAPREDUCE-465](https://issues.apache.org/jira/browse/MAPREDUCE-465) | Deprecate org.apache.hadoop.mapred.lib.MultithreadedMapRunner | Minor | . | Amareshwari Sriramadasu | Amareshwari Sriramadasu |
### BUG FIXES:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
| [HADOOP-6215](https://issues.apache.org/jira/browse/HADOOP-6215) | fix GenericOptionParser to deal with -D with '=' in the value | Major | . | Owen O'Malley | Amar Kamat |
| [HADOOP-6145](https://issues.apache.org/jira/browse/HADOOP-6145) | No error message for deleting nonexistent file or directory. | Major | fs | Suman Sehgal | Jakob Homan |
| [HADOOP-6141](https://issues.apache.org/jira/browse/HADOOP-6141) | hadoop 0.20 branch "test-patch" is broken | Major | build | Hong Tang | Hong Tang |
| [HADOOP-6139](https://issues.apache.org/jira/browse/HADOOP-6139) | Incomplete help message is displayed for rm and rmr options. | Minor | . | Suman Sehgal | Jakob Homan |
| [HADOOP-6017](https://issues.apache.org/jira/browse/HADOOP-6017) | NameNode and SecondaryNameNode fail to restart because of abnormal filenames. | Blocker | . | Raghu Angadi | Tsz Wo Nicholas Sze |
| [HADOOP-5951](https://issues.apache.org/jira/browse/HADOOP-5951) | StorageInfo needs Apache license header. | Major | . | Suresh Srinivas | Suresh Srinivas |
| [HADOOP-5937](https://issues.apache.org/jira/browse/HADOOP-5937) | Correct info message "Use hadoop dfs -safemode option" to "Use hdfs dfsadmin -safemode option". | Minor | . | Ravi Phulari | Ravi Phulari |
| [HADOOP-5932](https://issues.apache.org/jira/browse/HADOOP-5932) | MemoryMatcher logs 0 as freeMemOnTT even though there are free slots available on TaskTracker | Major | . | Karam Singh | Vinod Kumar Vavilapalli |
| [HADOOP-5924](https://issues.apache.org/jira/browse/HADOOP-5924) | JT fails to recover the jobs after restart after HADOOP:4372 | Major | . | Ramya Sunil | Amar Kamat |
| [HADOOP-5921](https://issues.apache.org/jira/browse/HADOOP-5921) | JobTracker does not come up because of NotReplicatedYetException | Major | . | Amareshwari Sriramadasu | Amar Kamat |
| [HADOOP-5920](https://issues.apache.org/jira/browse/HADOOP-5920) | TestJobHistory fails some times. | Major | . | Amareshwari Sriramadasu | Amar Kamat |
| [HADOOP-5908](https://issues.apache.org/jira/browse/HADOOP-5908) | ArithmeticException in heartbeats with zero map jobs | Major | . | Vinod Kumar Vavilapalli | Amar Kamat |
| [HADOOP-5884](https://issues.apache.org/jira/browse/HADOOP-5884) | Capacity scheduler should account high memory jobs as using more capacity of the queue | Major | . | Hemanth Yamijala | Vinod Kumar Vavilapalli |
| [HADOOP-5883](https://issues.apache.org/jira/browse/HADOOP-5883) | TaskMemoryMonitorThread might shoot down tasks even if their processes momentarily exceed the requested memory | Major | . | Hemanth Yamijala | Hemanth Yamijala |
| [HADOOP-5882](https://issues.apache.org/jira/browse/HADOOP-5882) | Progress is not updated when the New Reducer is running reduce phase | Blocker | . | Jothi Padmanabhan | Amareshwari Sriramadasu |
| [HADOOP-5850](https://issues.apache.org/jira/browse/HADOOP-5850) | map/reduce doesn't run jobs with 0 maps | Critical | . | Owen O'Malley | Vinod Kumar Vavilapalli |
| [HADOOP-5828](https://issues.apache.org/jira/browse/HADOOP-5828) | Use absolute path for JobTracker's mapred.local.dir in MiniMRCluster | Major | test | Hemanth Yamijala | Hemanth Yamijala |
| [HADOOP-5746](https://issues.apache.org/jira/browse/HADOOP-5746) | Errors encountered in MROutputThread after the last map/reduce call can go undetected | Major | . | Devaraj Das | Amar Kamat |
| [HADOOP-5736](https://issues.apache.org/jira/browse/HADOOP-5736) | Update CapacityScheduler documentation to reflect latest changes | Major | . | Sreekanth Ramakrishnan | Sreekanth Ramakrishnan |
| [HADOOP-5719](https://issues.apache.org/jira/browse/HADOOP-5719) | Jobs failed during job initalization are never removed from Capacity Schedulers waiting list | Major | . | Sreekanth Ramakrishnan | Sreekanth Ramakrishnan |
| [HADOOP-5718](https://issues.apache.org/jira/browse/HADOOP-5718) | Capacity Scheduler should not check for presence of default queue while starting up. | Major | . | Sreekanth Ramakrishnan | Sreekanth Ramakrishnan |
| [HADOOP-5711](https://issues.apache.org/jira/browse/HADOOP-5711) | Change Namenode file close log to info | Minor | . | Tsz Wo Nicholas Sze | Tsz Wo Nicholas Sze |
| [HADOOP-5691](https://issues.apache.org/jira/browse/HADOOP-5691) | org.apache.hadoop.mapreduce.Reducer should not be abstract. | Major | . | Amareshwari Sriramadasu | Amareshwari Sriramadasu |
| [HADOOP-5688](https://issues.apache.org/jira/browse/HADOOP-5688) | HftpFileSystem.getChecksum(..) does not work for the paths with scheme and authority | Major | . | Tsz Wo Nicholas Sze | Tsz Wo Nicholas Sze |
| [HADOOP-5655](https://issues.apache.org/jira/browse/HADOOP-5655) | TestMRServerPorts fails on java.net.BindException | Major | . | Hairong Kuang | Devaraj Das |
| [HADOOP-5654](https://issues.apache.org/jira/browse/HADOOP-5654) | TestReplicationPolicy.\<init\> fails on java.net.BindException | Major | test | Hairong Kuang | Hairong Kuang |
| [HADOOP-5648](https://issues.apache.org/jira/browse/HADOOP-5648) | Not able to generate gridmix.jar on already compiled version of hadoop | Major | benchmarks | Suman Sehgal | Giridharan Kesavan |
| [HADOOP-5646](https://issues.apache.org/jira/browse/HADOOP-5646) | TestQueueCapacities is failing Hudson tests for the last few builds | Major | . | Jothi Padmanabhan | Vinod Kumar Vavilapalli |
| [HADOOP-5641](https://issues.apache.org/jira/browse/HADOOP-5641) | Possible NPE in CapacityScheduler's MemoryMatcher | Major | . | Vinod Kumar Vavilapalli | Hemanth Yamijala |
| [HADOOP-5636](https://issues.apache.org/jira/browse/HADOOP-5636) | Job is left in Running state after a killJob | Critical | . | Amareshwari Sriramadasu | Amar Kamat |
| [HADOOP-5539](https://issues.apache.org/jira/browse/HADOOP-5539) | o.a.h.mapred.Merger not maintaining map out compression on intermediate files | Blocker | . | Billy Pearson | Jothi Padmanabhan |
| [HADOOP-5533](https://issues.apache.org/jira/browse/HADOOP-5533) | Recovery duration shown on the jobtracker webpage is inaccurate | Major | . | Amar Kamat | Amar Kamat |
| [HADOOP-5349](https://issues.apache.org/jira/browse/HADOOP-5349) | When the size required for a path is -1, LocalDirAllocator.getLocalPathForWrite fails with a DiskCheckerException when the disk it selects is bad. | Major | . | Vinod Kumar Vavilapalli | Vinod Kumar Vavilapalli |
| [HADOOP-5213](https://issues.apache.org/jira/browse/HADOOP-5213) | BZip2CompressionOutputStream NullPointerException | Blocker | io | Zheng Shao | Zheng Shao |
| [HADOOP-5210](https://issues.apache.org/jira/browse/HADOOP-5210) | Reduce Task Progress shows \> 100% when the total size of map outputs (for a single reducer) is high | Minor | . | Jothi Padmanabhan | Ravi Gummadi |
| [HADOOP-4674](https://issues.apache.org/jira/browse/HADOOP-4674) | hadoop fs -help should list detailed help info for the following commands: test, text, tail, stat & touchz | Trivial | fs | David NeSmith | Ravi Phulari |
| [HADOOP-4626](https://issues.apache.org/jira/browse/HADOOP-4626) | API link in forrest doc should point to the same version of hadoop. | Minor | documentation | Tsz Wo Nicholas Sze | Tsz Wo Nicholas Sze |
| [HDFS-1022](https://issues.apache.org/jira/browse/HDFS-1022) | Merge under-10-min tests specs into one file | Major | test | Erik Steffl | Erik Steffl |
| [HDFS-525](https://issues.apache.org/jira/browse/HDFS-525) | ListPathsServlet.java uses static SimpleDateFormat that has threading issues | Major | namenode | Suresh Srinivas | Suresh Srinivas |
| [HDFS-438](https://issues.apache.org/jira/browse/HDFS-438) | Improve help message for quotas | Minor | . | Raghu Angadi | Raghu Angadi |
| [HDFS-167](https://issues.apache.org/jira/browse/HDFS-167) | DFSClient continues to retry indefinitely | Minor | hdfs-client | Derek Wollenstein | Bill Zeller |
| [HDFS-26](https://issues.apache.org/jira/browse/HDFS-26) | HADOOP-5862 for version .20 (Namespace quota exceeded message unclear) | Major | . | Boris Shkolnik | Boris Shkolnik |
| [MAPREDUCE-924](https://issues.apache.org/jira/browse/MAPREDUCE-924) | TestPipes must not directly invoke 'main' of pipes as an exit from main could cause the testcase to crash. | Major | pipes | Amareshwari Sriramadasu | Amareshwari Sriramadasu |
| [MAPREDUCE-911](https://issues.apache.org/jira/browse/MAPREDUCE-911) | TestTaskFail fail sometimes | Major | test | Amareshwari Sriramadasu | Amareshwari Sriramadasu |
| [MAPREDUCE-838](https://issues.apache.org/jira/browse/MAPREDUCE-838) | Task succeeds even when committer.commitTask fails with IOException | Blocker | task | Koji Noguchi | Amareshwari Sriramadasu |
| [MAPREDUCE-834](https://issues.apache.org/jira/browse/MAPREDUCE-834) | When TaskTracker config use old memory management values its memory monitoring is diabled. | Major | . | Karam Singh | Sreekanth Ramakrishnan |
| [MAPREDUCE-832](https://issues.apache.org/jira/browse/MAPREDUCE-832) | Too many WARN messages about deprecated memorty config variables in JobTacker log | Major | . | Karam Singh | rahul k singh |
| [MAPREDUCE-818](https://issues.apache.org/jira/browse/MAPREDUCE-818) | org.apache.hadoop.mapreduce.Counters.getGroup returns null if the group name doesnt exist. | Minor | . | Amareshwari Sriramadasu | Amareshwari Sriramadasu |
| [MAPREDUCE-807](https://issues.apache.org/jira/browse/MAPREDUCE-807) | Stray user files in mapred.system.dir with permissions other than 777 can prevent the jobtracker from starting up. | Blocker | jobtracker | Amar Kamat | Amar Kamat |
| [MAPREDUCE-805](https://issues.apache.org/jira/browse/MAPREDUCE-805) | Deadlock in Jobtracker | Major | . | Michael Tamm | Amar Kamat |
| [MAPREDUCE-796](https://issues.apache.org/jira/browse/MAPREDUCE-796) | Encountered "ClassCastException" on tasktracker while running wordcount with MultithreadedMapRunner | Major | examples | Suman Sehgal | Amar Kamat |
| [MAPREDUCE-745](https://issues.apache.org/jira/browse/MAPREDUCE-745) | TestRecoveryManager fails sometimes | Major | jobtracker | Amareshwari Sriramadasu | Amar Kamat |
| [MAPREDUCE-735](https://issues.apache.org/jira/browse/MAPREDUCE-735) | ArrayIndexOutOfBoundsException is thrown by KeyFieldBasedPartitioner | Major | . | Suman Sehgal | Amar Kamat |
| [MAPREDUCE-687](https://issues.apache.org/jira/browse/MAPREDUCE-687) | TestMiniMRMapRedDebugScript fails sometimes | Major | test | Amar Kamat | Amareshwari Sriramadasu |
| [MAPREDUCE-657](https://issues.apache.org/jira/browse/MAPREDUCE-657) | CompletedJobStatusStore hardcodes filesystem to hdfs | Major | jobtracker | Amar Kamat | Amar Kamat |
| [MAPREDUCE-565](https://issues.apache.org/jira/browse/MAPREDUCE-565) | Partitioner does not work with new API | Blocker | task | Jothi Padmanabhan | Owen O'Malley |
| [MAPREDUCE-430](https://issues.apache.org/jira/browse/MAPREDUCE-430) | Task stuck in cleanup with OutOfMemoryErrors | Major | . | Amareshwari Sriramadasu | Amar Kamat |
| [MAPREDUCE-421](https://issues.apache.org/jira/browse/MAPREDUCE-421) | mapred pipes might return exit code 0 even when failing | Major | pipes | Christian Kunz | Christian Kunz |
| [MAPREDUCE-383](https://issues.apache.org/jira/browse/MAPREDUCE-383) | pipes combiner does not reset properly after a spill | Major | . | Christian Kunz | Christian Kunz |
| [MAPREDUCE-179](https://issues.apache.org/jira/browse/MAPREDUCE-179) | setProgress not called for new RecordReaders | Blocker | . | Chris Douglas | Chris Douglas |
| [MAPREDUCE-130](https://issues.apache.org/jira/browse/MAPREDUCE-130) | Delete the jobconf copy from the log directory of the JobTracker when the job is retired | Major | . | Devaraj Das | Amar Kamat |
| [MAPREDUCE-124](https://issues.apache.org/jira/browse/MAPREDUCE-124) | When abortTask of OutputCommitter fails with an Exception for a map-only job, the task is marked as success | Major | . | Jothi Padmanabhan | Amareshwari Sriramadasu |
| [MAPREDUCE-40](https://issues.apache.org/jira/browse/MAPREDUCE-40) | Memory management variables need a backwards compatibility option after HADOOP-5881 | Blocker | . | Hemanth Yamijala | rahul k singh |
| [MAPREDUCE-18](https://issues.apache.org/jira/browse/MAPREDUCE-18) | Under load the shuffle sometimes gets incorrect data | Blocker | . | Owen O'Malley | Ravi Gummadi |
| [MAPREDUCE-2](https://issues.apache.org/jira/browse/MAPREDUCE-2) | ArrayOutOfIndex error in KeyFieldBasedPartitioner on empty key | Major | . | Amar Kamat | Amar Kamat |
### TESTS:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
### SUB-TASKS:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
| [HADOOP-6213](https://issues.apache.org/jira/browse/HADOOP-6213) | Remove commons dependency on commons-cli2 | Blocker | util | Amar Kamat | Amar Kamat |
### OTHER:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |


@ -0,0 +1,218 @@
<!---
# Licensed to the Apache Software Foundation (ASF) under one
# or more contributor license agreements. See the NOTICE file
# distributed with this work for additional information
# regarding copyright ownership. The ASF licenses this file
# to you under the Apache License, Version 2.0 (the
# "License"); you may not use this file except in compliance
# with the License. You may obtain a copy of the License at
#
# http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing, software
# distributed under the License is distributed on an "AS IS" BASIS,
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# limitations under the License.
-->
# Apache Hadoop 0.20.1 Release Notes
These release notes cover new developer and user-facing incompatibilities, features, and major improvements.
---
* [HADOOP-6213](https://issues.apache.org/jira/browse/HADOOP-6213) | *Blocker* | **Remove commons dependency on commons-cli2**
GenericOptionsParser in branch 0.20 depends on commons-cli2. This jira removes the dependency of branch 0.20 on commons-cli2 completely. The problem is seen after 'ant binary' where all the library files are copied to '$hadoop-home/lib' which already has commons-cli2.
---
* [HADOOP-6080](https://issues.apache.org/jira/browse/HADOOP-6080) | *Major* | **Handling of Trash with quota**
Provide a new option to rm and rmr, -skipTrash, which will immediately delete the files specified, rather than moving them to the trash.
---
* [HADOOP-5924](https://issues.apache.org/jira/browse/HADOOP-5924) | *Major* | **JT fails to recover the jobs after restart after HADOOP:4372**
Post HADOOP-4372, empty job history files caused an NPE. This issue fixes that by creating new files if no old file is found.
---
* [HADOOP-5921](https://issues.apache.org/jira/browse/HADOOP-5921) | *Major* | **JobTracker does not come up because of NotReplicatedYetException**
The JobTracker crashes if it fails to create the jobtracker.info file (i.e., if sufficient datanodes are not up). With this patch it keeps retrying on IOExceptions, assuming that an IOException during jobtracker.info creation implies that HDFS is not in the *ready* state.
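
The retry behaviour described in this note can be sketched roughly as follows. This is a minimal illustration, not the JobTracker code: `RetryOnIOException`, `IOAction`, and `runWithRetry` are hypothetical names, and the fixed pause between attempts is an assumption.

```java
import java.io.IOException;

// Hypothetical sketch; none of these names come from Hadoop.
class RetryOnIOException {
    /** An action that may fail with an IOException, e.g. creating a file on a filesystem that is not ready. */
    interface IOAction {
        void run() throws IOException;
    }

    /** Runs the action until it succeeds and returns the number of attempts made. */
    static int runWithRetry(IOAction action, long sleepMillis) {
        int attempts = 0;
        while (true) {
            attempts++;
            try {
                action.run();
                return attempts;
            } catch (IOException e) {
                // Treat the failure as "filesystem not ready yet" and try again after a pause.
                try {
                    Thread.sleep(sleepMillis);
                } catch (InterruptedException ie) {
                    Thread.currentThread().interrupt();
                    throw new RuntimeException("interrupted while waiting to retry", ie);
                }
            }
        }
    }

    public static void main(String[] args) {
        int[] calls = {0};
        int attempts = runWithRetry(() -> {
            // Simulated file creation that fails on the first two calls.
            if (++calls[0] < 3) {
                throw new IOException("not ready");
            }
        }, 10L);
        System.out.println("succeeded after " + attempts + " attempts");
    }
}
```

Only IOException is retried in this sketch; any other failure still propagates to the caller.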
---
* [HADOOP-5920](https://issues.apache.org/jira/browse/HADOOP-5920) | *Major* | **TestJobHistory fails some times.**
TestJobHistory fails as jobtracker is restarted very fast (within a minute) and history files from earlier testcases were not cleaned up. This patch cleans up the history-dir and mapred-system-dir after every test.
---
* [HADOOP-5884](https://issues.apache.org/jira/browse/HADOOP-5884) | *Major* | **Capacity scheduler should account high memory jobs as using more capacity of the queue**
Fixes the Capacity scheduler to account more capacity of a queue for a high memory job. This is done by considering these jobs to take more slots, in proportion to a slot's default memory size.
---
* [HADOOP-5881](https://issues.apache.org/jira/browse/HADOOP-5881) | *Major* | **Simplify configuration related to task-memory-monitoring and memory-based scheduling**
**WARNING: No release note provided for this incompatible change.**
---
* [HADOOP-5746](https://issues.apache.org/jira/browse/HADOOP-5746) | *Major* | **Errors encountered in MROutputThread after the last map/reduce call can go undetected**
If the child (streaming) process returned successfully and the MROutputThread threw an error, there was no way to detect that, as all the IOExceptions were ignored. Such issues can occur when DFS clients are closed, for example. Now a check for errors (in threads) is made before finishing off the task, and an exception is thrown that fails the task.
---
* [HADOOP-5726](https://issues.apache.org/jira/browse/HADOOP-5726) | *Major* | **Remove pre-emption from the capacity scheduler code base**
Removed pre-emption from capacity scheduler. The impact of this change is that capacities for queues can no longer be guaranteed within a given span of time. Also changed configuration variables to remove pre-emption related variables and better reflect the absence of guarantees.
---
* [HADOOP-5210](https://issues.apache.org/jira/browse/HADOOP-5210) | *Minor* | **Reduce Task Progress shows \> 100% when the total size of map outputs (for a single reducer) is high**
This patch resets the variable totalBytesProcessed before the final merge so that it is used to calculate the progress of reducePhase (the 3rd phase of a reduce task) correctly.
---
* [HADOOP-3315](https://issues.apache.org/jira/browse/HADOOP-3315) | *Major* | **New binary file format**
Add a new, binary file format TFile.
---
* [MAPREDUCE-838](https://issues.apache.org/jira/browse/MAPREDUCE-838) | *Blocker* | **Task succeeds even when committer.commitTask fails with IOException**
Fixed a bug in the way commit of task outputs happens. The bug was that if commit fails with IOException, the task would be declared as successful.
---
* [MAPREDUCE-834](https://issues.apache.org/jira/browse/MAPREDUCE-834) | *Major* | **When TaskTracker config use old memory management values its memory monitoring is diabled.**
The tasktracker's startup code was modified to use deprecated memory management configuration variables, when specified, and enable memory monitoring of tasks.
---
* [MAPREDUCE-832](https://issues.apache.org/jira/browse/MAPREDUCE-832) | *Major* | **Too many WARN messages about deprecated memorty config variables in JobTacker log**
Reduced the frequency of log messages printed when a deprecated memory management variable is found in configuration of a job.
---
* [MAPREDUCE-818](https://issues.apache.org/jira/browse/MAPREDUCE-818) | *Minor* | **org.apache.hadoop.mapreduce.Counters.getGroup returns null if the group name doesnt exist.**
Fixed a bug in the new org.apache.hadoop.mapreduce.Counters.getGroup() method to return an empty group if group name doesn't exist, instead of null, thus making sure that it is in sync with the Javadoc.
---
* [MAPREDUCE-807](https://issues.apache.org/jira/browse/MAPREDUCE-807) | *Blocker* | **Stray user files in mapred.system.dir with permissions other than 777 can prevent the jobtracker from starting up.**
The JobTracker tries to delete the mapred.system.dir when it is starting up (with the job recovery disabled). The fix provided by this jira is that JobTracker will fail (bail out) with AccessControlException if it fails to delete files/directories in mapred.system.dir due to access control issues.
---
* [MAPREDUCE-805](https://issues.apache.org/jira/browse/MAPREDUCE-805) | *Major* | **Deadlock in Jobtracker**
The job initialization process was changed to not change (run) states during initialization. The reason is twofold:
- this can lead to deadlock, as state changes require circular locking (i.e., JobInProgress requires the JobTracker lock)
- events were not raised, as these state changes were not propagated back to the JobTracker
Now the JobTracker takes care of initializing/failing/killing the job and raising appropriate events. The simple rule that is enforced is that the JobTracker lock *must* be held before changing the run-state of a job.
---
* [MAPREDUCE-796](https://issues.apache.org/jira/browse/MAPREDUCE-796) | *Major* | **Encountered "ClassCastException" on tasktracker while running wordcount with MultithreadedMapRunner**
The multithreaded mapper was modified to create a new RuntimeException (object) from a Throwable, instead of casting the Throwable to a RuntimeException, once the multithreaded map encounters a fault.
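
The difference between casting and wrapping can be shown with a small sketch. The class and method names here are hypothetical and are not taken from MultithreadedMapRunner; only the standard `RuntimeException(Throwable)` constructor is assumed.

```java
import java.io.IOException;

// Hypothetical illustration; not the Hadoop source.
class ThrowableWrapping {
    /** Unsafe: throws ClassCastException when t is a checked exception or an Error. */
    static RuntimeException byCasting(Throwable t) {
        return (RuntimeException) t;
    }

    /** Safe: always yields a RuntimeException and keeps the original fault as its cause. */
    static RuntimeException byWrapping(Throwable t) {
        return new RuntimeException(t);
    }

    public static void main(String[] args) {
        Throwable fault = new IOException("simulated map failure");
        RuntimeException wrapped = byWrapping(fault);
        System.out.println("cause preserved: " + (wrapped.getCause() == fault));
        try {
            byCasting(fault);
        } catch (ClassCastException e) {
            System.out.println("casting failed with ClassCastException");
        }
    }
}
```

Wrapping keeps the original stack trace reachable through `getCause()`, so the real fault is still visible in logs.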
---
* [MAPREDUCE-767](https://issues.apache.org/jira/browse/MAPREDUCE-767) | *Major* | **to remove mapreduce dependency on commons-cli2**
Removes the dependency of hadoop-mapred on commons-cli2 and uses commons-cli 1.2 for command-line parsing.
---
* [MAPREDUCE-745](https://issues.apache.org/jira/browse/MAPREDUCE-745) | *Major* | **TestRecoveryManager fails sometimes**
JobTracker was changed to take an identifier as an argument. This helps in testcases where the jobtracker/mapred-cluster is (re)started in a short span of time and the chances of jobtracker identifiers clashing are high. The RecoveryManager was also modified to throw an exception if a job fails in init during the recovery process, so that this event triggers a job failure in the recovery process and removes the failed job from further initialization and processing.
---
* [MAPREDUCE-657](https://issues.apache.org/jira/browse/MAPREDUCE-657) | *Major* | **CompletedJobStatusStore hardcodes filesystem to hdfs**
CompletedJobStatusStore was hardcoded to persist to HDFS. This patch allows persisting to the local filesystem: qualify mapred.job.tracker.persist.jobstatus.dir with file://
---
* [MAPREDUCE-430](https://issues.apache.org/jira/browse/MAPREDUCE-430) | *Major* | **Task stuck in cleanup with OutOfMemoryErrors**
Various code paths in the framework caught Throwable and tried to do inline cleanup. In case of OOM errors, such inline cleanups can result in hung JVMs. With this fix, the TaskTracker provides an API to report fatal errors (any Throwable other than FSError and Exceptions). On catching a Throwable, the Mapper/Reducer tries to inform the TT.
---
* [MAPREDUCE-383](https://issues.apache.org/jira/browse/MAPREDUCE-383) | *Major* | **pipes combiner does not reset properly after a spill**
Fixed a bug in Pipes combiner to reset the spilled bytes count after the spill.
---
* [MAPREDUCE-130](https://issues.apache.org/jira/browse/MAPREDUCE-130) | *Major* | **Delete the jobconf copy from the log directory of the JobTracker when the job is retired**
When a job is initialized, it localizes the job conf to the logs dir. Without this patch it never gets deleted. Now when the job retires, the conf is deleted. This local copy is required for display on the web UI.
---
* [MAPREDUCE-40](https://issues.apache.org/jira/browse/MAPREDUCE-40) | *Blocker* | **Memory management variables need a backwards compatibility option after HADOOP-5881**
Fixed backwards compatibility by re-introducing and deprecating removed memory monitoring related configuration options.
---
* [MAPREDUCE-18](https://issues.apache.org/jira/browse/MAPREDUCE-18) | *Blocker* | **Under load the shuffle sometimes gets incorrect data**
This patch adds the mapid and reduceid in the http header of mapoutput when being sent to reduce node. Also validates compressed length, decompressed length, mapid and reduceid from http header at reduce node.
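
A reduce-side check of this kind might look roughly like the sketch below. The header names, the class, and the method are all assumptions made for illustration; the note does not state the actual header names or the exact validation rules.

```java
import java.util.HashMap;
import java.util.Map;

// Hypothetical sketch; header names and the validation rules are assumptions.
class ShuffleHeaderCheck {
    static final String MAP_ID = "map-id";                 // assumed header name
    static final String REDUCE_ID = "reduce-id";           // assumed header name
    static final String RAW_LENGTH = "raw-length";         // assumed header name (decompressed length)
    static final String COMPRESSED_LENGTH = "comp-length"; // assumed header name

    /** Returns true only if the headers identify the expected map output and carry sane lengths. */
    static boolean isValid(Map<String, String> headers, String expectedMapId, int expectedReduceId) {
        if (!expectedMapId.equals(headers.get(MAP_ID))) {
            return false;
        }
        try {
            int reduceId = Integer.parseInt(headers.get(REDUCE_ID));
            long rawLength = Long.parseLong(headers.get(RAW_LENGTH));
            long compressedLength = Long.parseLong(headers.get(COMPRESSED_LENGTH));
            return reduceId == expectedReduceId && rawLength >= 0 && compressedLength >= 0;
        } catch (NumberFormatException e) {
            // A missing or malformed header is treated as invalid data.
            return false;
        }
    }

    public static void main(String[] args) {
        Map<String, String> headers = new HashMap<>();
        headers.put(MAP_ID, "attempt_m_000001_0");
        headers.put(REDUCE_ID, "2");
        headers.put(RAW_LENGTH, "1024");
        headers.put(COMPRESSED_LENGTH, "512");
        System.out.println(isValid(headers, "attempt_m_000001_0", 2));
        System.out.println(isValid(headers, "attempt_m_000002_0", 2));
    }
}
```

Rejecting a response whose identifiers do not match lets the reducer discard the data and fetch again rather than merge output meant for another task.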
---
* [MAPREDUCE-2](https://issues.apache.org/jira/browse/MAPREDUCE-2) | *Major* | **ArrayOutOfIndex error in KeyFieldBasedPartitioner on empty key**
KeyFieldBasedPartitioner throws ArrayOutOfIndex when passed an empty key. This patch hashes empty key to 0 hashcode.
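
The idea of hashing an empty key to 0 can be sketched as follows. This is an illustration only: `EmptyKeyPartitioning`, `hashKey`, and `partitionFor` are hypothetical names, and the hash function is a stand-in rather than the one KeyFieldBasedPartitioner uses.

```java
import java.nio.charset.StandardCharsets;

// Hypothetical sketch; names and hash function are illustrative, not the Hadoop API.
class EmptyKeyPartitioning {
    /** Hashes the key bytes; an empty key never enters the loop, so its hash stays 0. */
    static int hashKey(byte[] key) {
        int hash = 0;
        for (byte b : key) {
            hash = 31 * hash + b;
        }
        return hash;
    }

    /** Maps a key to a partition in the range [0, numReduceTasks). */
    static int partitionFor(byte[] key, int numReduceTasks) {
        // Masking with Integer.MAX_VALUE clears the sign bit so the modulo is never negative.
        return (hashKey(key) & Integer.MAX_VALUE) % numReduceTasks;
    }

    public static void main(String[] args) {
        System.out.println(partitionFor(new byte[0], 4));
        System.out.println(partitionFor("apple".getBytes(StandardCharsets.UTF_8), 4));
    }
}
```

Because no byte of an empty key is ever indexed, the out-of-bounds access cannot occur, and all empty keys land in the same partition.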


@ -0,0 +1,109 @@
<!---
# Licensed to the Apache Software Foundation (ASF) under one
# or more contributor license agreements. See the NOTICE file
# distributed with this work for additional information
# regarding copyright ownership. The ASF licenses this file
# to you under the Apache License, Version 2.0 (the
# "License"); you may not use this file except in compliance
# with the License. You may obtain a copy of the License at
#
# http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing, software
# distributed under the License is distributed on an "AS IS" BASIS,
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# limitations under the License.
-->
# Apache Hadoop Changelog
## Release 0.20.2 - 2010-02-16
### INCOMPATIBLE CHANGES:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
| [HDFS-793](https://issues.apache.org/jira/browse/HDFS-793) | DataNode should first receive the whole packet ack message before it constructs and sends its own ack message for the packet | Blocker | datanode | Hairong Kuang | Hairong Kuang |
### NEW FEATURES:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
### IMPROVEMENTS:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
| [HADOOP-1849](https://issues.apache.org/jira/browse/HADOOP-1849) | IPC server max queue size should be configurable | Major | ipc | Raghu Angadi | Konstantin Shvachko |
| [MAPREDUCE-623](https://issues.apache.org/jira/browse/MAPREDUCE-623) | Resolve javac warnings in mapred | Major | build | Jothi Padmanabhan | Jothi Padmanabhan |
### BUG FIXES:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
| [HADOOP-6576](https://issues.apache.org/jira/browse/HADOOP-6576) | TestStreamingStatus is failing on 0.20 branch | Major | . | Chris Douglas | Todd Lipcon |
| [HADOOP-6575](https://issues.apache.org/jira/browse/HADOOP-6575) | Tests do not run on 0.20 branch | Major | . | Chris Douglas | Chris Douglas |
| [HADOOP-6524](https://issues.apache.org/jira/browse/HADOOP-6524) | Contrib tests are failing Clover'ed build | Major | build | Konstantin Boudnik | Konstantin Boudnik |
| [HADOOP-6506](https://issues.apache.org/jira/browse/HADOOP-6506) | Failing tests prevent the rest of test targets from execution. | Major | build | Konstantin Boudnik | Konstantin Boudnik |
| [HADOOP-6498](https://issues.apache.org/jira/browse/HADOOP-6498) | IPC client bug may cause rpc call hang | Blocker | ipc | Ruyue Ma | Ruyue Ma |
| [HADOOP-6460](https://issues.apache.org/jira/browse/HADOOP-6460) | Namenode runs of out of memory due to memory leak in ipc Server | Blocker | . | Suresh Srinivas | Suresh Srinivas |
| [HADOOP-6428](https://issues.apache.org/jira/browse/HADOOP-6428) | HttpServer sleeps with negative values | Major | . | Tsz Wo Nicholas Sze | Konstantin Boudnik |
| [HADOOP-6315](https://issues.apache.org/jira/browse/HADOOP-6315) | GzipCodec should not represent BuiltInZlibInflater as decompressorType | Major | io | Aaron Kimball | Aaron Kimball |
| [HADOOP-6269](https://issues.apache.org/jira/browse/HADOOP-6269) | Missing synchronization for defaultResources in Configuration.addResource | Major | conf | Todd Lipcon | Sreekanth Ramakrishnan |
| [HADOOP-6231](https://issues.apache.org/jira/browse/HADOOP-6231) | Allow caching of filesystem instances to be disabled on a per-instance basis | Major | fs | Tom White | Ben Slusky |
| [HADOOP-6097](https://issues.apache.org/jira/browse/HADOOP-6097) | Multiple bugs w/ Hadoop archives | Major | fs | Ben Slusky | Ben Slusky |
| [HADOOP-5759](https://issues.apache.org/jira/browse/HADOOP-5759) | IllegalArgumentException when CombineFileInputFormat is used as job InputFormat | Major | . | Amareshwari Sriramadasu | Amareshwari Sriramadasu |
| [HADOOP-5623](https://issues.apache.org/jira/browse/HADOOP-5623) | Streaming: process provided status messages are overwritten every 10 seoncds | Major | . | Rick Cox | Rick Cox |
| [HADOOP-5612](https://issues.apache.org/jira/browse/HADOOP-5612) | Some c++ scripts are not chmodded before ant execution | Major | build | Todd Lipcon | Todd Lipcon |
| [HADOOP-5611](https://issues.apache.org/jira/browse/HADOOP-5611) | C++ libraries do not build on Debian Lenny | Critical | . | Todd Lipcon | Todd Lipcon |
| [HDFS-927](https://issues.apache.org/jira/browse/HDFS-927) | DFSInputStream retries too many times for new block locations | Critical | hdfs-client | Todd Lipcon | Todd Lipcon |
| [HDFS-872](https://issues.apache.org/jira/browse/HDFS-872) | DFSClient 0.20.1 is incompatible with HDFS 0.20.2 | Major | datanode, hdfs-client | Bassam Tabbara | Todd Lipcon |
| [HDFS-781](https://issues.apache.org/jira/browse/HDFS-781) | Metrics PendingDeletionBlocks is not decremented | Blocker | namenode | Suresh Srinivas | Suresh Srinivas |
| [HDFS-761](https://issues.apache.org/jira/browse/HDFS-761) | Failure to process rename operation from edits log due to quota verification | Major | namenode | Suresh Srinivas | Suresh Srinivas |
| [HDFS-745](https://issues.apache.org/jira/browse/HDFS-745) | TestFsck timeout on 0.20. | Major | . | Tsz Wo Nicholas Sze | Tsz Wo Nicholas Sze |
| [HDFS-732](https://issues.apache.org/jira/browse/HDFS-732) | HDFS files are ending up truncated | Blocker | hdfs-client | Christian Kunz | Tsz Wo Nicholas Sze |
| [HDFS-723](https://issues.apache.org/jira/browse/HDFS-723) | Deadlock in DFSClient#DFSOutputStream | Blocker | . | Hairong Kuang | Hairong Kuang |
| [HDFS-677](https://issues.apache.org/jira/browse/HDFS-677) | Rename failure due to quota results in deletion of src directory | Blocker | namenode | Suresh Srinivas | Suresh Srinivas |
| [HDFS-596](https://issues.apache.org/jira/browse/HDFS-596) | Memory leak in libhdfs: hdfsFreeFileInfo() in libhdfs does not free memory for mOwner and mGroup | Blocker | fuse-dfs | Zhang Bingjun | Zhang Bingjun |
| [HDFS-579](https://issues.apache.org/jira/browse/HDFS-579) | HADOOP-3792 update of DfsTask incomplete | Major | hdfs-client | Christian Kunz | Christian Kunz |
| [HDFS-187](https://issues.apache.org/jira/browse/HDFS-187) | TestStartup fails if hdfs is running in the same machine | Major | test | Tsz Wo Nicholas Sze | Todd Lipcon |
| [HDFS-185](https://issues.apache.org/jira/browse/HDFS-185) | Chown , chgrp , chmod operations allowed when namenode is in safemode . | Major | . | Ravi Phulari | Ravi Phulari |
| [HDFS-101](https://issues.apache.org/jira/browse/HDFS-101) | DFS write pipeline : DFSClient sometimes does not detect second datanode failure | Blocker | datanode | Raghu Angadi | Hairong Kuang |
| [MAPREDUCE-1251](https://issues.apache.org/jira/browse/MAPREDUCE-1251) | c++ utils doesn't compile | Major | . | Eli Collins | Eli Collins |
| [MAPREDUCE-1182](https://issues.apache.org/jira/browse/MAPREDUCE-1182) | Reducers fail with OutOfMemoryError while copying Map outputs | Blocker | . | Chandra Prakash Bhagtani | Chandra Prakash Bhagtani |
| [MAPREDUCE-1163](https://issues.apache.org/jira/browse/MAPREDUCE-1163) | hdfsJniHelper.h: Yahoo! specific paths are encoded | Trivial | . | Allen Wittenauer | Allen Wittenauer |
| [MAPREDUCE-1147](https://issues.apache.org/jira/browse/MAPREDUCE-1147) | Map output records counter missing for map-only jobs in new API | Blocker | . | Chris Douglas | Amar Kamat |
| [MAPREDUCE-1070](https://issues.apache.org/jira/browse/MAPREDUCE-1070) | Deadlock in FairSchedulerServlet | Major | . | Todd Lipcon | Todd Lipcon |
| [MAPREDUCE-1068](https://issues.apache.org/jira/browse/MAPREDUCE-1068) | In hadoop-0.20.0 streaming job do not throw proper verbose error message if file is not present | Major | contrib/streaming | Peeyush Bishnoi | Amareshwari Sriramadasu |
| [MAPREDUCE-1010](https://issues.apache.org/jira/browse/MAPREDUCE-1010) | Adding tests for changes in archives. | Minor | harchive | Mahadev konar | Mahadev konar |
| [MAPREDUCE-979](https://issues.apache.org/jira/browse/MAPREDUCE-979) | JobConf.getMemoryFor{Map\|Reduce}Task doesn't fallback to newer config knobs when mapred.taskmaxvmem is set to DISABLED\_MEMORY\_LIMIT of -1 | Blocker | jobtracker, tasktracker | Arun C Murthy | Sreekanth Ramakrishnan |
| [MAPREDUCE-826](https://issues.apache.org/jira/browse/MAPREDUCE-826) | harchive doesn't use ToolRunner / harchive returns 0 even if the job fails with exception | Trivial | harchive | Koji Noguchi | Koji Noguchi |
| [MAPREDUCE-617](https://issues.apache.org/jira/browse/MAPREDUCE-617) | Streaming should not throw java.lang.RuntimeException and ERROR while displaying help | Minor | contrib/streaming | Karam Singh | |
| [MAPREDUCE-433](https://issues.apache.org/jira/browse/MAPREDUCE-433) | TestReduceFetch failed. | Major | . | Tsz Wo Nicholas Sze | Chris Douglas |
| [MAPREDUCE-112](https://issues.apache.org/jira/browse/MAPREDUCE-112) | Reduce Input Records and Reduce Output Records counters are not being set when using the new Mapreduce reducer API | Blocker | . | Jothi Padmanabhan | Jothi Padmanabhan |
### TESTS:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
| [HDFS-919](https://issues.apache.org/jira/browse/HDFS-919) | Create test to validate the BlocksVerified metric | Major | test | gary murry | |
| [HDFS-907](https://issues.apache.org/jira/browse/HDFS-907) | Add tests for getBlockLocations and totalLoad metrics. | Minor | namenode | Ravi Phulari | Ravi Phulari |
### SUB-TASKS:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
### OTHER:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |


@ -0,0 +1,150 @@
<!---
# Licensed to the Apache Software Foundation (ASF) under one
# or more contributor license agreements. See the NOTICE file
# distributed with this work for additional information
# regarding copyright ownership. The ASF licenses this file
# to you under the Apache License, Version 2.0 (the
# "License"); you may not use this file except in compliance
# with the License. You may obtain a copy of the License at
#
# http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing, software
# distributed under the License is distributed on an "AS IS" BASIS,
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# limitations under the License.
-->
# Apache Hadoop 0.20.2 Release Notes
These release notes cover new developer and user-facing incompatibilities, features, and major improvements.
---
* [HADOOP-6498](https://issues.apache.org/jira/browse/HADOOP-6498) | *Blocker* | **IPC client bug may cause rpc call hang**
Correct synchronization error in IPC where handler thread could hang if request reader got an error.
---
* [HADOOP-6460](https://issues.apache.org/jira/browse/HADOOP-6460) | *Blocker* | **Namenode runs of out of memory due to memory leak in ipc Server**
If an IPC server response buffer has grown to more than 1MB, it is replaced by a smaller buffer to free the Java heap it was using. This improves the longevity of the name service.
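The buffer-replacement idea in this note can be sketched in plain Java. This is a minimal illustration, not the actual IPC server code: the class name `ResponseBufferSketch`, the 10KB initial size, and the use of the written size as a stand-in for buffer capacity are all assumptions made for the example.

```java
import java.io.ByteArrayOutputStream;

// Hypothetical sketch; not the real org.apache.hadoop.ipc.Server implementation.
final class ResponseBufferSketch {
    static final int INITIAL_SIZE = 10 * 1024;    // assumed starting capacity
    static final int MAX_RETAINED = 1024 * 1024;  // the 1MB threshold from the note

    private ByteArrayOutputStream buf = new ByteArrayOutputStream(INITIAL_SIZE);
    private int replacements = 0;

    // Writes one response into the reusable buffer and returns its bytes.
    byte[] writeResponse(byte[] payload) {
        buf.reset();
        buf.write(payload, 0, payload.length);
        byte[] out = buf.toByteArray();
        if (buf.size() > MAX_RETAINED) {
            // reset() keeps the large backing array, so swap in a small buffer
            // and let the garbage collector reclaim the large one.
            buf = new ByteArrayOutputStream(INITIAL_SIZE);
            replacements++;
        }
        return out;
    }

    // Number of times the buffer has been swapped for a smaller one.
    int replacements() {
        return replacements;
    }
}
```

The point of the design is that a single oversized response no longer pins a large array for the lifetime of a long-running server thread.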
---
* [HADOOP-6428](https://issues.apache.org/jira/browse/HADOOP-6428) | *Major* | **HttpServer sleeps with negative values**
Corrected arithmetic error that made sleep times less than zero.
---
* [HADOOP-6231](https://issues.apache.org/jira/browse/HADOOP-6231) | *Major* | **Allow caching of filesystem instances to be disabled on a per-instance basis**
Adds a general mechanism to disable the cache on a per-filesystem basis through the property fs.\<schemename\>.impl.disable.cache, e.g. fs.har.impl.disable.cache in core-default.xml.
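As a rough model of how such a per-scheme switch behaves, the sketch below uses `java.util.Properties` in place of the Hadoop configuration and a plain `Object` in place of a filesystem instance. The class `FsCacheSketch` and its methods are hypothetical; only the property name pattern comes from the note.

```java
import java.util.HashMap;
import java.util.Map;
import java.util.Properties;

// Hypothetical model of a per-scheme cache switch; not Hadoop's FileSystem class.
final class FsCacheSketch {
    private final Map<String, Object> cache = new HashMap<>();

    // Builds the property name described in the release note.
    static String disableCacheKey(String scheme) {
        return "fs." + scheme + ".impl.disable.cache";
    }

    // Returns a cached instance unless caching is disabled for this scheme.
    Object get(String scheme, Properties conf) {
        boolean disabled =
            Boolean.parseBoolean(conf.getProperty(disableCacheKey(scheme), "false"));
        if (disabled) {
            return new Object(); // fresh, uncached instance on every call
        }
        return cache.computeIfAbsent(scheme, s -> new Object());
    }
}
```

With `fs.har.impl.disable.cache` set to `true`, each lookup for the `har` scheme yields a new instance, while other schemes keep sharing one cached instance.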
---
* [HADOOP-6097](https://issues.apache.org/jira/browse/HADOOP-6097) | *Major* | **Multiple bugs w/ Hadoop archives**
Bugs fixed for Hadoop archives: character escaping in paths, LineReader and file system caching.
---
* [HDFS-793](https://issues.apache.org/jira/browse/HDFS-793) | *Blocker* | **DataNode should first receive the whole packet ack message before it constructs and sends its own ack message for the packet**
**WARNING: No release note provided for this incompatible change.**
---
* [HDFS-781](https://issues.apache.org/jira/browse/HDFS-781) | *Blocker* | **Metrics PendingDeletionBlocks is not decremented**
Correct PendingDeletionBlocks metric to properly decrement counts.
---
* [HDFS-761](https://issues.apache.org/jira/browse/HDFS-761) | *Major* | **Failure to process rename operation from edits log due to quota verification**
Corrected an error when checking quota policy that resulted in a failure to read the edits log, stopping the primary/secondary name node.
---
* [HDFS-677](https://issues.apache.org/jira/browse/HDFS-677) | *Blocker* | **Rename failure due to quota results in deletion of src directory**
Rename properly considers the case where both source and destination are over quota; operation will fail with error indication.
---
* [HDFS-596](https://issues.apache.org/jira/browse/HDFS-596) | *Blocker* | **Memory leak in libhdfs: hdfsFreeFileInfo() in libhdfs does not free memory for mOwner and mGroup**
Memory leak in function hdfsFreeFileInfo in libhdfs. This bug affects fuse-dfs severely.
---
* [MAPREDUCE-1182](https://issues.apache.org/jira/browse/MAPREDUCE-1182) | *Blocker* | **Reducers fail with OutOfMemoryError while copying Map outputs**
Changes shuffle-related memory parameters from 'int' to 'long' so that sizes greater than the maximum integer value are handled correctly.
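The arithmetic hazard behind this change is easy to reproduce. The sketch below is not the shuffle code itself; `ShuffleSizeSketch` and its two methods are illustrative names showing how a sum of byte counts wraps around in `int` but not in `long`.

```java
// Hypothetical illustration of int overflow when adding sizes in bytes.
final class ShuffleSizeSketch {
    // Wraps around once the true sum exceeds Integer.MAX_VALUE.
    static int totalAsInt(int a, int b) {
        return a + b;
    }

    // Holds the true sum for any realistic in-memory size.
    static long totalAsLong(long a, long b) {
        return a + b;
    }
}
```

Adding two 1,500,000,000-byte sizes gives a negative number with `totalAsInt` and 3,000,000,000 with `totalAsLong`, which is why limits computed in `int` can silently misbehave on large heaps.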
---
* [MAPREDUCE-1147](https://issues.apache.org/jira/browse/MAPREDUCE-1147) | *Blocker* | **Map output records counter missing for map-only jobs in new API**
Adds a counter to track the number of records emitted by maps that write directly to HDFS, i.e. map tasks of a job with 0 reducers.
---
* [MAPREDUCE-1068](https://issues.apache.org/jira/browse/MAPREDUCE-1068) | *Major* | **In hadoop-0.20.0 streaming job do not throw proper verbose error message if file is not present**
Fix streaming job to show a proper message if the file is not present, for the -file option.
---
* [MAPREDUCE-979](https://issues.apache.org/jira/browse/MAPREDUCE-979) | *Blocker* | **JobConf.getMemoryFor{Map\|Reduce}Task doesn't fallback to newer config knobs when mapred.taskmaxvmem is set to DISABLED\_MEMORY\_LIMIT of -1**
Added support to fall back to the new task memory configuration when deprecated memory configuration values are set to disabled.
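The fallback rule can be sketched as follows. This is a simplified model, assuming the newer per-map knob is named `mapred.job.map.memory.mb` and ignoring any unit conversion between the old and new settings; `TaskMemorySketch` is a hypothetical class, and `java.util.Properties` stands in for `JobConf`.

```java
import java.util.Properties;

// Hypothetical sketch of the fallback; not the real JobConf code.
final class TaskMemorySketch {
    static final long DISABLED_MEMORY_LIMIT = -1L;

    // Uses the deprecated value when it is set, otherwise the newer knob.
    // Unit conversion between the two settings is deliberately left out.
    static long memoryForMapTask(Properties conf) {
        long deprecated = Long.parseLong(conf.getProperty("mapred.task.maxvmem", "-1"));
        if (deprecated != DISABLED_MEMORY_LIMIT) {
            return deprecated;
        }
        // Assumed name of the newer knob; treat it as illustrative.
        return Long.parseLong(conf.getProperty("mapred.job.map.memory.mb", "-1"));
    }
}
```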
---
* [MAPREDUCE-826](https://issues.apache.org/jira/browse/MAPREDUCE-826) | *Trivial* | **harchive doesn't use ToolRunner / harchive returns 0 even if the job fails with exception**
Use ToolRunner for the archives job and return a nonzero exit code on failure.
---
* [MAPREDUCE-623](https://issues.apache.org/jira/browse/MAPREDUCE-623) | *Major* | **Resolve javac warnings in mapred**
Removes javac warnings by either resolving them or suppressing them (wherever resolution is not possible).
---
* [MAPREDUCE-433](https://issues.apache.org/jira/browse/MAPREDUCE-433) | *Major* | **TestReduceFetch failed.**
Resolves the test failure by modifying the test to base it on spill counters rather than on bytes read/written. It also introduces a new configuration parameter "mapred.job.shuffle.input.buffer.percent" to provide finer grained control on the memory limit to be used during shuffle.
---
* [MAPREDUCE-112](https://issues.apache.org/jira/browse/MAPREDUCE-112) | *Blocker* | **Reduce Input Records and Reduce Output Records counters are not being set when using the new Mapreduce reducer API**
Updates of counters for reduce input and output records were added in the new API so they are available for jobs using the new API.


@ -0,0 +1,95 @@
<!---
# Licensed to the Apache Software Foundation (ASF) under one
# or more contributor license agreements. See the NOTICE file
# distributed with this work for additional information
# regarding copyright ownership. The ASF licenses this file
# to you under the Apache License, Version 2.0 (the
# "License"); you may not use this file except in compliance
# with the License. You may obtain a copy of the License at
#
# http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing, software
# distributed under the License is distributed on an "AS IS" BASIS,
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# limitations under the License.
-->
# Apache Hadoop Changelog
## Release 0.20.203.0 - 2011-05-11
### INCOMPATIBLE CHANGES:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
### NEW FEATURES:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
| [HADOOP-7108](https://issues.apache.org/jira/browse/HADOOP-7108) | hadoop-0.20.100 | Major | . | Arun C Murthy | Arun C Murthy |
| [HADOOP-4343](https://issues.apache.org/jira/browse/HADOOP-4343) | Adding user and service-to-service authentication to Hadoop | Blocker | . | Kan Zhang | Kan Zhang |
### IMPROVEMENTS:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
| [HADOOP-7247](https://issues.apache.org/jira/browse/HADOOP-7247) | Fix documentation to reflect new jar names | Major | . | Owen O'Malley | Owen O'Malley |
| [HDFS-1626](https://issues.apache.org/jira/browse/HDFS-1626) | Make BLOCK\_INVALIDATE\_LIMIT configurable | Minor | namenode | Arun C Murthy | Tsz Wo Nicholas Sze |
| [HDFS-457](https://issues.apache.org/jira/browse/HDFS-457) | better handling of volume failure in Data Node storage | Major | datanode | Boris Shkolnik | Boris Shkolnik |
| [MAPREDUCE-2355](https://issues.apache.org/jira/browse/MAPREDUCE-2355) | Add an out of band heartbeat damper | Major | jobtracker | Owen O'Malley | Arun C Murthy |
| [MAPREDUCE-2316](https://issues.apache.org/jira/browse/MAPREDUCE-2316) | Update docs for CapacityScheduler | Major | capacity-sched, documentation | Arun C Murthy | Arun C Murthy |
| [MAPREDUCE-1943](https://issues.apache.org/jira/browse/MAPREDUCE-1943) | Implement limits on per-job JobConf, Counters, StatusReport, Split-Sizes | Major | . | Mahadev konar | Mahadev konar |
| [MAPREDUCE-478](https://issues.apache.org/jira/browse/MAPREDUCE-478) | separate jvm param for mapper and reducer | Minor | . | Koji Noguchi | Arun C Murthy |
### BUG FIXES:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
| [HADOOP-7259](https://issues.apache.org/jira/browse/HADOOP-7259) | contrib modules should include build.properties from parent. | Major | build | Owen O'Malley | Owen O'Malley |
| [HADOOP-7258](https://issues.apache.org/jira/browse/HADOOP-7258) | Gzip codec should not return null decompressors | Major | . | Owen O'Malley | Owen O'Malley |
| [HADOOP-7253](https://issues.apache.org/jira/browse/HADOOP-7253) | Fix default config | Major | . | Owen O'Malley | Owen O'Malley |
| [HADOOP-7246](https://issues.apache.org/jira/browse/HADOOP-7246) | The default log4j configuration causes warnings about EventCounter | Major | . | Owen O'Malley | Luke Lu |
| [HADOOP-7243](https://issues.apache.org/jira/browse/HADOOP-7243) | Fix contrib unit tests (fairshare, hdfsproxy, datajoin, streaming) | Major | . | Owen O'Malley | Owen O'Malley |
| [HADOOP-7232](https://issues.apache.org/jira/browse/HADOOP-7232) | Fix javadoc warnings | Blocker | documentation | Owen O'Malley | Owen O'Malley |
| [HADOOP-7215](https://issues.apache.org/jira/browse/HADOOP-7215) | RPC clients must connect over a network interface corresponding to the host name in the client's kerberos principal key | Blocker | security | Suresh Srinivas | Suresh Srinivas |
| [HADOOP-7190](https://issues.apache.org/jira/browse/HADOOP-7190) | Put metrics v1 back into the hadoop-20-security branch | Major | metrics | Owen O'Malley | Owen O'Malley |
| [HADOOP-7163](https://issues.apache.org/jira/browse/HADOOP-7163) | "java.net.SocketTimeoutException: 60000 millis timeout" happens a lot | Major | ipc | Owen O'Malley | Devaraj Das |
| [HADOOP-7143](https://issues.apache.org/jira/browse/HADOOP-7143) | Hive Hadoop20SShims depends on removed HadoopArchives | Major | fs | Joep Rottinghuis | Joep Rottinghuis |
| [HADOOP-7040](https://issues.apache.org/jira/browse/HADOOP-7040) | DiskChecker:mkdirsWithExistsCheck swallows FileNotFoundException. | Major | . | Boris Shkolnik | Boris Shkolnik |
| [HADOOP-6907](https://issues.apache.org/jira/browse/HADOOP-6907) | Rpc client doesn't use the per-connection conf to figure out server's Kerberos principal | Major | ipc, security | Kan Zhang | Kan Zhang |
| [HADOOP-5647](https://issues.apache.org/jira/browse/HADOOP-5647) | TestJobHistory fails if /tmp/\_logs is not writable to. Testcase should not depend on /tmp | Major | test | Ravi Gummadi | Ravi Gummadi |
| [HDFS-1822](https://issues.apache.org/jira/browse/HDFS-1822) | Editlog opcodes overlap between 20 security and later releases | Blocker | namenode | Suresh Srinivas | Suresh Srinivas |
| [HDFS-1022](https://issues.apache.org/jira/browse/HDFS-1022) | Merge under-10-min tests specs into one file | Major | test | Erik Steffl | Erik Steffl |
| [MAPREDUCE-2365](https://issues.apache.org/jira/browse/MAPREDUCE-2365) | Add counters for FileInputFormat (BYTES\_READ) and FileOutputFormat (BYTES\_WRITTEN) | Major | . | Owen O'Malley | Siddharth Seth |
| [MAPREDUCE-2278](https://issues.apache.org/jira/browse/MAPREDUCE-2278) | DistributedCache shouldn't hold a ref to JobConf | Major | distributed-cache, tasktracker | Arun C Murthy | Chris Douglas |
| [MAPREDUCE-1699](https://issues.apache.org/jira/browse/MAPREDUCE-1699) | JobHistory shouldn't be disabled for any reason | Major | jobtracker | Arun C Murthy | Krishna Ramachandran |
| [MAPREDUCE-1280](https://issues.apache.org/jira/browse/MAPREDUCE-1280) | Eclipse Plugin does not work with Eclipse Ganymede (3.4) | Major | . | Aaron Kimball | Alex Kozlov |
| [MAPREDUCE-1233](https://issues.apache.org/jira/browse/MAPREDUCE-1233) | Incorrect Waiting maps/reduces in Jobtracker metrics | Major | jobtracker | V.Karthikeyan | Luke Lu |
| [MAPREDUCE-1118](https://issues.apache.org/jira/browse/MAPREDUCE-1118) | Capacity Scheduler scheduling information is hard to read / should be tabular format | Major | capacity-sched | Allen Wittenauer | Krishna Ramachandran |
| [MAPREDUCE-323](https://issues.apache.org/jira/browse/MAPREDUCE-323) | Improve the way job history files are managed | Critical | jobtracker | Amar Kamat | Dick King |
### TESTS:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
### SUB-TASKS:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
### OTHER:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |


@ -0,0 +1,117 @@
<!---
# Licensed to the Apache Software Foundation (ASF) under one
# or more contributor license agreements. See the NOTICE file
# distributed with this work for additional information
# regarding copyright ownership. The ASF licenses this file
# to you under the Apache License, Version 2.0 (the
# "License"); you may not use this file except in compliance
# with the License. You may obtain a copy of the License at
#
# http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing, software
# distributed under the License is distributed on an "AS IS" BASIS,
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# limitations under the License.
-->
# Apache Hadoop 0.20.203.0 Release Notes
These release notes cover new developer and user-facing incompatibilities, features, and major improvements.
---
* [HADOOP-5647](https://issues.apache.org/jira/browse/HADOOP-5647) | *Major* | **TestJobHistory fails if /tmp/\_logs is not writable to. Testcase should not depend on /tmp**
Removed the test case's dependency on /tmp and made it use the test.build.data directory instead.
---
* [HDFS-1626](https://issues.apache.org/jira/browse/HDFS-1626) | *Minor* | **Make BLOCK\_INVALIDATE\_LIMIT configurable**
Added a new configuration property dfs.block.invalidate.limit for FSNamesystem.blockInvalidateLimit.
---
* [HDFS-457](https://issues.apache.org/jira/browse/HDFS-457) | *Major* | **better handling of volume failure in Data Node storage**
Datanode can continue if a volume for replica storage fails. Previously a datanode shut down if any volume failed.
---
* [MAPREDUCE-1118](https://issues.apache.org/jira/browse/MAPREDUCE-1118) | *Major* | **Capacity Scheduler scheduling information is hard to read / should be tabular format**
Add CapacityScheduler servlet to enhance web UI for queue information.
---
* [MAPREDUCE-478](https://issues.apache.org/jira/browse/MAPREDUCE-478) | *Minor* | **separate jvm param for mapper and reducer**
Allow map and reduce jvm parameters, environment variables and ulimit to be set separately.
Configuration changes:
* add mapred.map.child.java.opts
* add mapred.reduce.child.java.opts
* add mapred.map.child.env
* add mapred.reduce.child.env
* add mapred.map.child.ulimit
* add mapred.reduce.child.ulimit
* deprecated mapred.child.java.opts
* deprecated mapred.child.env
* deprecated mapred.child.ulimit
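One plausible way to resolve the map- and reduce-specific JVM options against the deprecated shared key is sketched below. It assumes the task-specific key takes precedence and the deprecated key serves as the fallback; the default `-Xmx200m` and the class `ChildOptsSketch` are assumptions for the example, and `java.util.Properties` stands in for the job configuration.

```java
import java.util.Properties;

// Hypothetical sketch of option resolution; not the real task runner code.
final class ChildOptsSketch {
    static final String DEFAULT_OPTS = "-Xmx200m"; // assumed default

    // Prefers the task-specific key, then the deprecated shared key, then the default.
    static String javaOpts(Properties conf, boolean isMap) {
        String specificKey = isMap
            ? "mapred.map.child.java.opts"
            : "mapred.reduce.child.java.opts";
        String shared = conf.getProperty("mapred.child.java.opts", DEFAULT_OPTS);
        return conf.getProperty(specificKey, shared);
    }
}
```

Under this rule a job that only sets the deprecated key behaves as before, while a job that sets `mapred.reduce.child.java.opts` can give reducers a larger heap without changing the mappers.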
---
* [MAPREDUCE-323](https://issues.apache.org/jira/browse/MAPREDUCE-323) | *Critical* | **Improve the way job history files are managed**
This patch does four things:
* it changes the directory structure of the done directory that holds history logs for jobs that are completed,
* it builds toy databases for completed jobs, so we no longer have to scan 2N files on DFS to find out facts about the N jobs that have completed since the job tracker started [which can be hundreds of thousands of files in practical cases],
* it changes the job history browser to display more information and allow more filtering criteria, and
* it creates a new programmatic interface for finding files matching user-chosen criteria. This allows users to no longer be concerned with our methods of storing them, in turn allowing us to change those at will.
The new API, which programmatically returns history file Paths matching given search criteria, is as follows:
package org.apache.hadoop.mapreduce.jobhistory;
...
// this interface is within O.A.H.mapreduce.jobhistory.JobHistory:
// holds information about one job history log in the done
// job history logs
public static class JobHistoryJobRecord {
public Path getPath() { ... }
public String getJobIDString() { ... }
public long getSubmitTime() { ... }
public String getUserName() { ... }
public String getJobName() { ... }
}
public class JobHistoryRecordRetriever implements Iterator\<JobHistoryJobRecord\> {
// usual Interface methods -- remove() throws UnsupportedOperationException
// returns the number of calls to next() that will succeed
public int numMatches() { ... }
}
// returns a JobHistoryRecordRetriever that delivers the Paths of all matching job history files,
// in no particular order. Any criterion that is null or the empty string does not constrain.
// All criteria that are specified are applied conjunctively, except that if there's more than
// one date you retrieve all Paths matching ANY date.
// soughtUser and soughtJobid must match exactly.
// soughtJobName can match the entire job name or any substring.
// dates must be exactly in the format MM/DD/YYYY.
// Dates' leading digits must be 2's. We're incubating a Y3K problem.
public JobHistoryRecordRetriever getMatchingJob
(String soughtUser, String soughtJobName, String[] dateStrings, String soughtJobid)
throws IOException
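The matching rules stated in the comments above can be modelled without any Hadoop classes. The sketch below is a toy in-memory filter, not the JobHistory implementation: `HistoryMatchSketch`, its `Record` type, and the choice to compare dates as exact strings are assumptions made to illustrate the semantics.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical in-memory model of the documented matching rules.
final class HistoryMatchSketch {
    static final class Record {
        final String user;
        final String jobName;
        final String date;   // assumed to be stored as MM/DD/YYYY
        final String jobId;

        Record(String user, String jobName, String date, String jobId) {
            this.user = user;
            this.jobName = jobName;
            this.date = date;
            this.jobId = jobId;
        }
    }

    // A null or empty criterion does not constrain the search.
    private static boolean unconstrained(String s) {
        return s == null || s.isEmpty();
    }

    static List<Record> match(List<Record> all, String soughtUser, String soughtJobName,
                              String[] dateStrings, String soughtJobid) {
        List<Record> out = new ArrayList<>();
        for (Record r : all) {
            // User and job id must match exactly.
            if (!unconstrained(soughtUser) && !r.user.equals(soughtUser)) continue;
            if (!unconstrained(soughtJobid) && !r.jobId.equals(soughtJobid)) continue;
            // Job name may match as a substring.
            if (!unconstrained(soughtJobName) && !r.jobName.contains(soughtJobName)) continue;
            // Several dates are combined with OR; everything else with AND.
            if (dateStrings != null && dateStrings.length > 0) {
                boolean anyDate = false;
                for (String d : dateStrings) {
                    if (r.date.equals(d)) anyDate = true;
                }
                if (!anyDate) continue;
            }
            out.add(r);
        }
        return out;
    }
}
```

For example, passing a user, a null job name, two dates, and a null job id would return that user's jobs from either date, which is the "conjunctive except for dates" behaviour the comments describe.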


@ -0,0 +1,66 @@
<!---
# Licensed to the Apache Software Foundation (ASF) under one
# or more contributor license agreements. See the NOTICE file
# distributed with this work for additional information
# regarding copyright ownership. The ASF licenses this file
# to you under the Apache License, Version 2.0 (the
# "License"); you may not use this file except in compliance
# with the License. You may obtain a copy of the License at
#
# http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing, software
# distributed under the License is distributed on an "AS IS" BASIS,
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# limitations under the License.
-->
# Apache Hadoop Changelog
## Release 0.20.203.1 - Unreleased
### INCOMPATIBLE CHANGES:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
### NEW FEATURES:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
### IMPROVEMENTS:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
### BUG FIXES:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
| [HADOOP-7330](https://issues.apache.org/jira/browse/HADOOP-7330) | The metrics source mbean implementation should return the attribute value instead of the object | Major | metrics | Luke Lu | Luke Lu |
| [HADOOP-7297](https://issues.apache.org/jira/browse/HADOOP-7297) | Error in the documentation regarding Checkpoint/Backup Node | Trivial | documentation | arnaud p | Harsh J |
### TESTS:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
### SUB-TASKS:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
### OTHER:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |


@ -0,0 +1,24 @@
<!---
# Licensed to the Apache Software Foundation (ASF) under one
# or more contributor license agreements. See the NOTICE file
# distributed with this work for additional information
# regarding copyright ownership. The ASF licenses this file
# to you under the Apache License, Version 2.0 (the
# "License"); you may not use this file except in compliance
# with the License. You may obtain a copy of the License at
#
# http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing, software
# distributed under the License is distributed on an "AS IS" BASIS,
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# limitations under the License.
-->
# Apache Hadoop 0.20.203.1 Release Notes
These release notes cover new developer and user-facing incompatibilities, features, and major improvements.


@ -0,0 +1,127 @@
<!---
# Licensed to the Apache Software Foundation (ASF) under one
# or more contributor license agreements. See the NOTICE file
# distributed with this work for additional information
# regarding copyright ownership. The ASF licenses this file
# to you under the Apache License, Version 2.0 (the
# "License"); you may not use this file except in compliance
# with the License. You may obtain a copy of the License at
#
# http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing, software
# distributed under the License is distributed on an "AS IS" BASIS,
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# limitations under the License.
-->
# Apache Hadoop Changelog
## Release 0.20.204.0 - 2011-09-02
### INCOMPATIBLE CHANGES:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
| [HADOOP-6255](https://issues.apache.org/jira/browse/HADOOP-6255) | Create an rpm integration project | Major | . | Owen O'Malley | Eric Yang |
### NEW FEATURES:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
| [HADOOP-7398](https://issues.apache.org/jira/browse/HADOOP-7398) | create a mechanism to suppress the HADOOP\_HOME deprecated warning | Major | . | Owen O'Malley | Owen O'Malley |
| [HADOOP-7144](https://issues.apache.org/jira/browse/HADOOP-7144) | Expose JMX with something like JMXProxyServlet | Major | . | Luke Lu | Robert Joseph Evans |
| [MAPREDUCE-2558](https://issues.apache.org/jira/browse/MAPREDUCE-2558) | Add queue-level metrics 0.20-security branch | Major | jobtracker | Jeffrey Naisbitt | Jeffrey Naisbitt |
| [MAPREDUCE-1938](https://issues.apache.org/jira/browse/MAPREDUCE-1938) | Ability for having user's classes take precedence over the system classes for tasks' classpath | Blocker | job submission, task, tasktracker | Devaraj Das | Krishna Ramachandran |
### IMPROVEMENTS:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
| [HADOOP-7277](https://issues.apache.org/jira/browse/HADOOP-7277) | Add Eclipse launch tasks for the 0.20-security branch | Minor | build | Jeffrey Naisbitt | Jeffrey Naisbitt |
| [HADOOP-7248](https://issues.apache.org/jira/browse/HADOOP-7248) | Have a way to automatically update Eclipse .classpath file when new libs are added to the classpath through Ivy for 0.20-\* based sources | Minor | build | Konstantin Boudnik | Thomas Graves |
| [HDFS-1773](https://issues.apache.org/jira/browse/HDFS-1773) | Remove a datanode from cluster if include list is not empty and this datanode is removed from both include and exclude lists | Minor | namenode | Tanping Wang | Tanping Wang |
| [MAPREDUCE-2524](https://issues.apache.org/jira/browse/MAPREDUCE-2524) | Backport trunk heuristics for failing maps when we get fetch failures retrieving map output during shuffle | Minor | tasktracker | Thomas Graves | Thomas Graves |
| [MAPREDUCE-2495](https://issues.apache.org/jira/browse/MAPREDUCE-2495) | The distributed cache cleanup thread has no monitoring to check to see if it has died for some reason | Minor | distributed-cache | Robert Joseph Evans | Robert Joseph Evans |
| [MAPREDUCE-2490](https://issues.apache.org/jira/browse/MAPREDUCE-2490) | Log blacklist debug count | Trivial | jobtracker | Jonathan Eagles | Jonathan Eagles |
| [MAPREDUCE-2479](https://issues.apache.org/jira/browse/MAPREDUCE-2479) | Backport MAPREDUCE-1568 to hadoop security branch | Major | tasktracker | Robert Joseph Evans | Robert Joseph Evans |
| [MAPREDUCE-2456](https://issues.apache.org/jira/browse/MAPREDUCE-2456) | Show the reducer taskid and map/reduce tasktrackers for "Failed fetch notification #\_ for task attempt..." log messages | Trivial | jobtracker | Jeffrey Naisbitt | Jeffrey Naisbitt |
### BUG FIXES:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
| [HADOOP-7475](https://issues.apache.org/jira/browse/HADOOP-7475) | hadoop-setup-single-node.sh is broken | Blocker | . | Eric Yang | Eric Yang |
| [HADOOP-7373](https://issues.apache.org/jira/browse/HADOOP-7373) | Tarball deployment doesn't work with {start,stop}-{dfs,mapred} | Major | . | Owen O'Malley | Owen O'Malley |
| [HADOOP-7364](https://issues.apache.org/jira/browse/HADOOP-7364) | TestMiniMRDFSCaching fails if test.build.dir is set to something other than build/test | Major | test | Thomas Graves | Thomas Graves |
| [HADOOP-7356](https://issues.apache.org/jira/browse/HADOOP-7356) | RPM packages broke bin/hadoop script for hadoop 0.20.205 | Blocker | . | Eric Yang | Eric Yang |
| [HADOOP-7330](https://issues.apache.org/jira/browse/HADOOP-7330) | The metrics source mbean implementation should return the attribute value instead of the object | Major | metrics | Luke Lu | Luke Lu |
| [HADOOP-7324](https://issues.apache.org/jira/browse/HADOOP-7324) | Ganglia plugins for metrics v2 | Blocker | metrics | Luke Lu | Priyo Mustafi |
| [HADOOP-7274](https://issues.apache.org/jira/browse/HADOOP-7274) | CLONE - IOUtils.readFully and IOUtils.skipFully have typo in exception creation's message | Minor | util | Jonathan Eagles | Jonathan Eagles |
| [HADOOP-7232](https://issues.apache.org/jira/browse/HADOOP-7232) | Fix javadoc warnings | Blocker | documentation | Owen O'Malley | Owen O'Malley |
| [HDFS-2057](https://issues.apache.org/jira/browse/HDFS-2057) | Wait time to terminate the threads causing unit tests to take longer time | Major | datanode | Bharath Mundlapudi | Bharath Mundlapudi |
| [HDFS-2023](https://issues.apache.org/jira/browse/HDFS-2023) | Backport of NPE for File.list and File.listFiles | Major | datanode | Bharath Mundlapudi | Bharath Mundlapudi |
| [HDFS-1878](https://issues.apache.org/jira/browse/HDFS-1878) | TestHDFSServerPorts unit test failure - race condition in FSNamesystem.close() causes NullPointerException without serious consequence | Minor | namenode | Matt Foley | Matt Foley |
| [HDFS-1822](https://issues.apache.org/jira/browse/HDFS-1822) | Editlog opcodes overlap between 20 security and later releases | Blocker | namenode | Suresh Srinivas | Suresh Srinivas |
| [HDFS-1758](https://issues.apache.org/jira/browse/HDFS-1758) | Web UI JSP pages thread safety issue | Minor | tools | Tanping Wang | Tanping Wang |
| [HDFS-1750](https://issues.apache.org/jira/browse/HDFS-1750) | fs -ls hftp://file not working | Major | . | Tsz Wo Nicholas Sze | Tsz Wo Nicholas Sze |
| [HDFS-1692](https://issues.apache.org/jira/browse/HDFS-1692) | In secure mode, Datanode process doesn't exit when disks fail. | Major | datanode | Bharath Mundlapudi | Bharath Mundlapudi |
| [HDFS-1592](https://issues.apache.org/jira/browse/HDFS-1592) | Datanode startup doesn't honor volumes.tolerated | Major | . | Bharath Mundlapudi | Bharath Mundlapudi |
| [HDFS-1377](https://issues.apache.org/jira/browse/HDFS-1377) | Quota bug for partial blocks allows quotas to be violated | Blocker | namenode | Eli Collins | Eli Collins |
| [HDFS-1258](https://issues.apache.org/jira/browse/HDFS-1258) | Clearing namespace quota on "/" corrupts FS image | Blocker | namenode | Aaron T. Myers | Aaron T. Myers |
| [HDFS-1189](https://issues.apache.org/jira/browse/HDFS-1189) | Quota counts missed between clear quota and set quota | Major | namenode | Kang Xiao | John George |
| [MAPREDUCE-2846](https://issues.apache.org/jira/browse/MAPREDUCE-2846) | a small % of all tasks fail with DefaultTaskController | Blocker | task, task-controller, tasktracker | Allen Wittenauer | Owen O'Malley |
| [MAPREDUCE-2804](https://issues.apache.org/jira/browse/MAPREDUCE-2804) | "Creation of symlink to attempt log dir failed." message is not useful | Blocker | . | Allen Wittenauer | Owen O'Malley |
| [MAPREDUCE-2651](https://issues.apache.org/jira/browse/MAPREDUCE-2651) | Race condition in Linux Task Controller for job log directory creation | Major | task-controller | Bharath Mundlapudi | Bharath Mundlapudi |
| [MAPREDUCE-2621](https://issues.apache.org/jira/browse/MAPREDUCE-2621) | TestCapacityScheduler fails with "Queue "q1" does not exist" | Minor | . | Sherry Chen | Sherry Chen |
| [MAPREDUCE-2555](https://issues.apache.org/jira/browse/MAPREDUCE-2555) | JvmInvalidate errors in the gridmix TT logs | Minor | tasktracker | Thomas Graves | Thomas Graves |
| [MAPREDUCE-2529](https://issues.apache.org/jira/browse/MAPREDUCE-2529) | Recognize Jetty bug 1342 and handle it | Major | tasktracker | Thomas Graves | Thomas Graves |
| [MAPREDUCE-2514](https://issues.apache.org/jira/browse/MAPREDUCE-2514) | ReinitTrackerAction class name misspelled RenitTrackerAction in task tracker log | Trivial | tasktracker | Jonathan Eagles | Jonathan Eagles |
| [MAPREDUCE-2451](https://issues.apache.org/jira/browse/MAPREDUCE-2451) | Log the reason string of healthcheck script | Trivial | jobtracker | Thomas Graves | Thomas Graves |
| [MAPREDUCE-2447](https://issues.apache.org/jira/browse/MAPREDUCE-2447) | Set JvmContext sooner for a task - MR2429 | Minor | . | Siddharth Seth | Siddharth Seth |
| [MAPREDUCE-2443](https://issues.apache.org/jira/browse/MAPREDUCE-2443) | Fix FI build - broken after MR-2429 | Minor | test | Siddharth Seth | Siddharth Seth |
| [MAPREDUCE-2429](https://issues.apache.org/jira/browse/MAPREDUCE-2429) | Check jvmid during task status report | Major | tasktracker | Arun C Murthy | Siddharth Seth |
| [MAPREDUCE-2418](https://issues.apache.org/jira/browse/MAPREDUCE-2418) | Errors not shown in the JobHistory servlet (specifically Counter Limit Exceeded) | Minor | . | Siddharth Seth | Siddharth Seth |
| [MAPREDUCE-2411](https://issues.apache.org/jira/browse/MAPREDUCE-2411) | When you submit a job to a queue with no ACLs you get an inscrutible NPE | Minor | . | Dick King | Dick King |
| [MAPREDUCE-2409](https://issues.apache.org/jira/browse/MAPREDUCE-2409) | Distributed Cache does not differentiate between file /archive for files with the same path | Major | distributed-cache | Siddharth Seth | Siddharth Seth |
| [MAPREDUCE-2366](https://issues.apache.org/jira/browse/MAPREDUCE-2366) | TaskTracker can't retrieve stdout and stderr from web UI | Major | tasktracker | Owen O'Malley | Dick King |
| [MAPREDUCE-2364](https://issues.apache.org/jira/browse/MAPREDUCE-2364) | Shouldn't hold lock on rjob while localizing resources. | Major | tasktracker | Owen O'Malley | Devaraj Das |
| [MAPREDUCE-2362](https://issues.apache.org/jira/browse/MAPREDUCE-2362) | Unit test failures: TestBadRecords and TestTaskTrackerMemoryManager | Major | test | Owen O'Malley | Greg Roelofs |
| [MAPREDUCE-2360](https://issues.apache.org/jira/browse/MAPREDUCE-2360) | Pig fails when using non-default FileSystem | Major | client | Owen O'Malley | |
| [MAPREDUCE-2359](https://issues.apache.org/jira/browse/MAPREDUCE-2359) | Distributed cache doesn't use non-default FileSystems correctly | Major | . | Owen O'Malley | Krishna Ramachandran |
| [MAPREDUCE-2358](https://issues.apache.org/jira/browse/MAPREDUCE-2358) | MapReduce assumes HDFS as the default filesystem | Major | . | Owen O'Malley | Krishna Ramachandran |
| [MAPREDUCE-2357](https://issues.apache.org/jira/browse/MAPREDUCE-2357) | When extending inputsplit (non-FileSplit), all exceptions are ignored | Major | task | Owen O'Malley | Luke Lu |
| [MAPREDUCE-2356](https://issues.apache.org/jira/browse/MAPREDUCE-2356) | A task succeeded even though there were errors on all attempts. | Major | . | Owen O'Malley | Luke Lu |
| [MAPREDUCE-517](https://issues.apache.org/jira/browse/MAPREDUCE-517) | The capacity-scheduler should assign multiple tasks per heartbeat | Critical | . | Arun C Murthy | Arun C Murthy |
| [MAPREDUCE-118](https://issues.apache.org/jira/browse/MAPREDUCE-118) | Job.getJobID() will always return null | Blocker | client | Amar Kamat | Amareshwari Sriramadasu |
### TESTS:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
| [HDFS-2218](https://issues.apache.org/jira/browse/HDFS-2218) | Disable TestHdfsProxy.testHdfsProxyInterface in 0.20-security and branch-1 until HDFS-2217 is fixed | Blocker | contrib/hdfsproxy, test | Matt Foley | Matt Foley |
| [HDFS-2044](https://issues.apache.org/jira/browse/HDFS-2044) | TestQueueProcessingStatistics failing automatic test due to timing issues | Major | test | Matt Foley | Matt Foley |
### SUB-TASKS:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
| [HDFS-1767](https://issues.apache.org/jira/browse/HDFS-1767) | Namenode should ignore non-initial block reports from datanodes when in safemode during startup | Major | datanode | Matt Foley | Matt Foley |
| [HDFS-1541](https://issues.apache.org/jira/browse/HDFS-1541) | Not marking datanodes dead When namenode in safemode | Major | namenode | Hairong Kuang | Hairong Kuang |
| [HDFS-1445](https://issues.apache.org/jira/browse/HDFS-1445) | Batch the calls in DataStorage to FileUtil.createHardLink(), so we call it once per directory instead of once per file | Major | datanode | Matt Foley | Matt Foley |
| [MAPREDUCE-2415](https://issues.apache.org/jira/browse/MAPREDUCE-2415) | Distribute TaskTracker userlogs onto multiple disks | Major | task-controller, tasktracker | Bharath Mundlapudi | Bharath Mundlapudi |
| [MAPREDUCE-2413](https://issues.apache.org/jira/browse/MAPREDUCE-2413) | TaskTracker should handle disk failures at both startup and runtime | Major | task-controller, tasktracker | Bharath Mundlapudi | Ravi Gummadi |
### OTHER:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |

@ -0,0 +1,83 @@
<!---
# Licensed to the Apache Software Foundation (ASF) under one
# or more contributor license agreements. See the NOTICE file
# distributed with this work for additional information
# regarding copyright ownership. The ASF licenses this file
# to you under the Apache License, Version 2.0 (the
# "License"); you may not use this file except in compliance
# with the License. You may obtain a copy of the License at
#
# http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing, software
# distributed under the License is distributed on an "AS IS" BASIS,
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# limitations under the License.
-->
# Apache Hadoop 0.20.204.0 Release Notes
These release notes cover new developer and user-facing incompatibilities, features, and major improvements.
---
* [HADOOP-6255](https://issues.apache.org/jira/browse/HADOOP-6255) | *Major* | **Create an rpm integration project**
Added RPM/DEB packages to build system.
---
* [HDFS-2218](https://issues.apache.org/jira/browse/HDFS-2218) | *Blocker* | **Disable TestHdfsProxy.testHdfsProxyInterface in 0.20-security and branch-1 until HDFS-2217 is fixed**
Test case TestHdfsProxy.testHdfsProxyInterface has been temporarily disabled for this release, due to failure in the Hudson automated test environment.
---
* [HDFS-1445](https://issues.apache.org/jira/browse/HDFS-1445) | *Major* | **Batch the calls in DataStorage to FileUtil.createHardLink(), so we call it once per directory instead of once per file**
Batch hardlinking during "upgrade" snapshots, cutting time from approximately 8 minutes per volume to approximately 8 seconds. Validated on both Linux and Windows. Depends on prior integration with patch for HADOOP-7133.
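
The per-directory batching described above can be illustrated with a small sketch. It shows only the grouping step: the class and method names (`HardLinkBatcher`, `groupByDirectory`) are hypothetical and are not taken from `DataStorage` or `FileUtil`, and the sketch does not attempt to show how the batched link call itself is issued.

```java
import java.nio.file.Path;
import java.nio.file.Paths;
import java.util.ArrayList;
import java.util.Arrays;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

/** Hypothetical illustration only; not the actual HDFS-1445 patch. */
public class HardLinkBatcher {

    /**
     * Groups file paths by parent directory so that a caller can issue
     * one link operation per directory instead of one per file.
     */
    static Map<Path, List<String>> groupByDirectory(List<Path> files) {
        Map<Path, List<String>> batches = new LinkedHashMap<>();
        for (Path file : files) {
            Path parent = file.getParent();
            if (parent == null) {
                // Bare file name with no directory part: treat as current dir.
                parent = Paths.get(".");
            }
            batches.computeIfAbsent(parent, dir -> new ArrayList<>())
                   .add(file.getFileName().toString());
        }
        return batches;
    }

    public static void main(String[] args) {
        // Sample paths are made up for the example.
        List<Path> files = Arrays.asList(
                Paths.get("/data/current/blk_1"),
                Paths.get("/data/current/blk_2"),
                Paths.get("/data/current/subdir0/blk_3"));
        // Each map entry corresponds to one batched link operation.
        groupByDirectory(files).forEach((dir, names) ->
                System.out.println(dir + " -> " + names));
    }
}
```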
---
* [MAPREDUCE-2846](https://issues.apache.org/jira/browse/MAPREDUCE-2846) | *Blocker* | **a small % of all tasks fail with DefaultTaskController**
Fixed a race condition in writing the log index file that caused tasks to 'fail'.
---
* [MAPREDUCE-2804](https://issues.apache.org/jira/browse/MAPREDUCE-2804) | *Blocker* | **"Creation of symlink to attempt log dir failed." message is not useful**
Removed duplicate chmods of the job log dir that were vulnerable to race conditions between tasks. Also improved the messages shown when symlink creation fails.
---
* [MAPREDUCE-2529](https://issues.apache.org/jira/browse/MAPREDUCE-2529) | *Major* | **Recognize Jetty bug 1342 and handle it**
Added 2 new config parameters:
mapreduce.reduce.shuffle.catch.exception.stack.regex
mapreduce.reduce.shuffle.catch.exception.message.regex
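
The note above does not say how these two parameters are applied. As a rough sketch, assume each holds a Java regular expression that is tested against an exception's message and its printed stack trace, and that a match on either one is enough to "recognize" the exception. The class `ShuffleExceptionMatcher` is hypothetical, `java.util.Properties` stands in for the Hadoop configuration object, and the sample regex values are invented for the example.

```java
import java.io.PrintWriter;
import java.io.StringWriter;
import java.util.Properties;
import java.util.regex.Pattern;

/** Hypothetical illustration only; not the actual MAPREDUCE-2529 code. */
public class ShuffleExceptionMatcher {
    static final String STACK_REGEX_KEY =
            "mapreduce.reduce.shuffle.catch.exception.stack.regex";
    static final String MESSAGE_REGEX_KEY =
            "mapreduce.reduce.shuffle.catch.exception.message.regex";

    private final Pattern stackPattern;   // null when the key is unset
    private final Pattern messagePattern; // null when the key is unset

    ShuffleExceptionMatcher(Properties conf) {
        stackPattern = compileOrNull(conf.getProperty(STACK_REGEX_KEY));
        messagePattern = compileOrNull(conf.getProperty(MESSAGE_REGEX_KEY));
    }

    private static Pattern compileOrNull(String regex) {
        return (regex == null || regex.isEmpty()) ? null : Pattern.compile(regex);
    }

    /** Returns true if either configured regex is found in the exception. */
    boolean matches(Throwable t) {
        if (messagePattern != null && t.getMessage() != null
                && messagePattern.matcher(t.getMessage()).find()) {
            return true;
        }
        if (stackPattern != null) {
            StringWriter trace = new StringWriter();
            t.printStackTrace(new PrintWriter(trace, true));
            return stackPattern.matcher(trace.toString()).find();
        }
        return false;
    }

    public static void main(String[] args) {
        Properties conf = new Properties();
        // Example values only; real deployments would choose their own.
        conf.setProperty(MESSAGE_REGEX_KEY, "Broken pipe");
        conf.setProperty(STACK_REGEX_KEY, "org\\.mortbay\\.jetty");
        ShuffleExceptionMatcher matcher = new ShuffleExceptionMatcher(conf);
        System.out.println(matcher.matches(new RuntimeException("Broken pipe")));
    }
}
```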
---
* [MAPREDUCE-2524](https://issues.apache.org/jira/browse/MAPREDUCE-2524) | *Minor* | **Backport trunk heuristics for failing maps when we get fetch failures retrieving map output during shuffle**
Added a new configuration option, mapreduce.reduce.shuffle.maxfetchfailures, and removed an option that is no longer used, mapred.reduce.copy.backoff.
---
* [MAPREDUCE-2479](https://issues.apache.org/jira/browse/MAPREDUCE-2479) | *Major* | **Backport MAPREDUCE-1568 to hadoop security branch**
Added mapreduce.tasktracker.distributedcache.checkperiod to the task tracker, which defines the period to wait while cleaning up the distributed cache. The default is 1 minute.

View File

@ -0,0 +1,64 @@
<!---
# Licensed to the Apache Software Foundation (ASF) under one
# or more contributor license agreements. See the NOTICE file
# distributed with this work for additional information
# regarding copyright ownership. The ASF licenses this file
# to you under the Apache License, Version 2.0 (the
# "License"); you may not use this file except in compliance
# with the License. You may obtain a copy of the License at
#
# http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing, software
# distributed under the License is distributed on an "AS IS" BASIS,
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# limitations under the License.
-->
# Apache Hadoop Changelog
## Release 0.20.204.1 - Unreleased
### INCOMPATIBLE CHANGES:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
### NEW FEATURES:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
### IMPROVEMENTS:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
### BUG FIXES:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
### TESTS:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
### SUB-TASKS:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
### OTHER:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |

View File

@ -0,0 +1,24 @@
<!---
# Licensed to the Apache Software Foundation (ASF) under one
# or more contributor license agreements. See the NOTICE file
# distributed with this work for additional information
# regarding copyright ownership. The ASF licenses this file
# to you under the Apache License, Version 2.0 (the
# "License"); you may not use this file except in compliance
# with the License. You may obtain a copy of the License at
#
# http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing, software
# distributed under the License is distributed on an "AS IS" BASIS,
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# limitations under the License.
-->
# Apache Hadoop 0.20.204.1 Release Notes
These release notes cover new developer and user-facing incompatibilities, features, and major improvements.

View File

@ -0,0 +1,191 @@
<!---
# Licensed to the Apache Software Foundation (ASF) under one
# or more contributor license agreements. See the NOTICE file
# distributed with this work for additional information
# regarding copyright ownership. The ASF licenses this file
# to you under the Apache License, Version 2.0 (the
# "License"); you may not use this file except in compliance
# with the License. You may obtain a copy of the License at
#
# http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing, software
# distributed under the License is distributed on an "AS IS" BASIS,
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# limitations under the License.
-->
# Apache Hadoop Changelog
## Release 0.20.205.0 - 2011-10-06
### INCOMPATIBLE CHANGES:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
| [HDFS-2202](https://issues.apache.org/jira/browse/HDFS-2202) | Changes to balancer bandwidth should not require datanode restart. | Major | balancer & mover, datanode | Eric Payne | Eric Payne |
| [HDFS-1554](https://issues.apache.org/jira/browse/HDFS-1554) | Append 0.20: New semantics for recoverLease | Major | . | Hairong Kuang | Hairong Kuang |
| [HDFS-630](https://issues.apache.org/jira/browse/HDFS-630) | In DFSOutputStream.nextBlockOutputStream(), the client can exclude specific datanodes when locating the next block. | Major | hdfs-client, namenode | Ruyue Ma | Cosmin Lehene |
### NEW FEATURES:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
| [HADOOP-7594](https://issues.apache.org/jira/browse/HADOOP-7594) | Support HTTP REST in HttpServer | Major | . | Tsz Wo Nicholas Sze | Tsz Wo Nicholas Sze |
| [HADOOP-7119](https://issues.apache.org/jira/browse/HADOOP-7119) | add Kerberos HTTP SPNEGO authentication support to Hadoop JT/NN/DN/TT web-consoles | Major | security | Alejandro Abdelnur | Alejandro Abdelnur |
| [HADOOP-6889](https://issues.apache.org/jira/browse/HADOOP-6889) | Make RPC to have an option to timeout | Major | ipc | Hairong Kuang | John George |
| [HDFS-1520](https://issues.apache.org/jira/browse/HDFS-1520) | HDFS 20 append: Lightweight NameNode operation to trigger lease recovery | Major | namenode | Hairong Kuang | Hairong Kuang |
| [HDFS-200](https://issues.apache.org/jira/browse/HDFS-200) | In HDFS, sync() not yet guarantees data available to the new readers | Blocker | . | Tsz Wo Nicholas Sze | dhruba borthakur |
| [MAPREDUCE-2777](https://issues.apache.org/jira/browse/MAPREDUCE-2777) | Backport MAPREDUCE-220 to Hadoop 20 security branch | Major | . | Jonathan Eagles | Amar Kamat |
### IMPROVEMENTS:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
| [HADOOP-7720](https://issues.apache.org/jira/browse/HADOOP-7720) | improve the hadoop-setup-conf.sh to read in the hbase user and setup the configs | Major | conf | Arpit Gupta | Arpit Gupta |
| [HADOOP-7707](https://issues.apache.org/jira/browse/HADOOP-7707) | improve config generator to allow users to specify proxy user, turn append on or off, turn webhdfs on or off | Major | conf | Arpit Gupta | Arpit Gupta |
| [HADOOP-7655](https://issues.apache.org/jira/browse/HADOOP-7655) | provide a small validation script that smoke tests the installed cluster | Major | . | Arpit Gupta | Arpit Gupta |
| [HADOOP-7472](https://issues.apache.org/jira/browse/HADOOP-7472) | RPC client should deal with the IP address changes | Minor | ipc | Kihwal Lee | Kihwal Lee |
| [HADOOP-7432](https://issues.apache.org/jira/browse/HADOOP-7432) | Back-port HADOOP-7110 to 0.20-security | Major | . | Sherry Chen | Sherry Chen |
| [HADOOP-7343](https://issues.apache.org/jira/browse/HADOOP-7343) | backport HADOOP-7008 and HADOOP-7042 to branch-0.20-security | Minor | test | Thomas Graves | Thomas Graves |
| [HADOOP-7314](https://issues.apache.org/jira/browse/HADOOP-7314) | Add support for throwing UnknownHostException when a host doesn't resolve | Major | . | Jeffrey Naisbitt | Jeffrey Naisbitt |
| [HDFS-1555](https://issues.apache.org/jira/browse/HDFS-1555) | HDFS 20 append: Disallow pipeline recovery if a file is already being lease recovered | Major | . | Hairong Kuang | Hairong Kuang |
| [HDFS-1211](https://issues.apache.org/jira/browse/HDFS-1211) | 0.20 append: Block receiver should not log "rewind" packets at INFO level | Minor | datanode | Todd Lipcon | Todd Lipcon |
| [HDFS-1210](https://issues.apache.org/jira/browse/HDFS-1210) | DFSClient should log exception when block recovery fails | Trivial | hdfs-client | Todd Lipcon | Todd Lipcon |
| [HDFS-1054](https://issues.apache.org/jira/browse/HDFS-1054) | Remove unnecessary sleep after failure in nextBlockOutputStream | Major | hdfs-client | Todd Lipcon | Todd Lipcon |
| [HDFS-895](https://issues.apache.org/jira/browse/HDFS-895) | Allow hflush/sync to occur in parallel with new writes to the file | Major | hdfs-client | dhruba borthakur | Todd Lipcon |
| [HDFS-826](https://issues.apache.org/jira/browse/HDFS-826) | Allow a mechanism for an application to detect that datanode(s) have died in the write pipeline | Major | hdfs-client | dhruba borthakur | dhruba borthakur |
| [MAPREDUCE-2981](https://issues.apache.org/jira/browse/MAPREDUCE-2981) | Backport trunk fairscheduler to 0.20-security branch | Major | contrib/fair-share | Matei Zaharia | Matei Zaharia |
| [MAPREDUCE-2729](https://issues.apache.org/jira/browse/MAPREDUCE-2729) | Reducers are always counted having "pending tasks" even if they can't be scheduled yet because not enough of their mappers have completed | Major | . | Sherry Chen | Sherry Chen |
| [MAPREDUCE-2494](https://issues.apache.org/jira/browse/MAPREDUCE-2494) | Make the distributed cache delete entries using LRU priority | Major | distributed-cache | Robert Joseph Evans | Robert Joseph Evans |
### BUG FIXES:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
| [HADOOP-7724](https://issues.apache.org/jira/browse/HADOOP-7724) | hadoop-setup-conf.sh should put proxy user info into the core-site.xml | Major | . | Giridharan Kesavan | Arpit Gupta |
| [HADOOP-7721](https://issues.apache.org/jira/browse/HADOOP-7721) | dfs.web.authentication.kerberos.principal expects the full hostname and does not replace \_HOST with the hostname | Major | . | Arpit Gupta | Jitendra Nath Pandey |
| [HADOOP-7715](https://issues.apache.org/jira/browse/HADOOP-7715) | see log4j Error when running mr jobs and certain dfs calls | Major | conf | Arpit Gupta | Eric Yang |
| [HADOOP-7711](https://issues.apache.org/jira/browse/HADOOP-7711) | hadoop-env.sh generated from templates has duplicate info | Major | conf | Arpit Gupta | Arpit Gupta |
| [HADOOP-7708](https://issues.apache.org/jira/browse/HADOOP-7708) | config generator does not update the properties file if one exists already | Critical | conf | Arpit Gupta | Eric Yang |
| [HADOOP-7691](https://issues.apache.org/jira/browse/HADOOP-7691) | hadoop deb pkg should take a diff group id | Major | . | Giridharan Kesavan | Eric Yang |
| [HADOOP-7684](https://issues.apache.org/jira/browse/HADOOP-7684) | jobhistory server and secondarynamenode should have init.d script | Major | scripts | Eric Yang | Eric Yang |
| [HADOOP-7683](https://issues.apache.org/jira/browse/HADOOP-7683) | hdfs-site.xml template has properties that are not used in 20 | Minor | . | Arpit Gupta | Arpit Gupta |
| [HADOOP-7681](https://issues.apache.org/jira/browse/HADOOP-7681) | log4j.properties is missing properties for security audit and hdfs audit should be changed to info | Minor | conf | Arpit Gupta | Arpit Gupta |
| [HADOOP-7679](https://issues.apache.org/jira/browse/HADOOP-7679) | log4j.properties templates does not define mapred.jobsummary.logger | Major | conf | Ramya Sunil | Ramya Sunil |
| [HADOOP-7674](https://issues.apache.org/jira/browse/HADOOP-7674) | TestKerberosName fails in 20 branch. | Major | . | Jitendra Nath Pandey | Jitendra Nath Pandey |
| [HADOOP-7658](https://issues.apache.org/jira/browse/HADOOP-7658) | to fix hadoop config template | Major | . | Giridharan Kesavan | Eric Yang |
| [HADOOP-7649](https://issues.apache.org/jira/browse/HADOOP-7649) | TestMapredGroupMappingServiceRefresh and TestRefreshUserMappings fail after HADOOP-7625 | Blocker | security, test | Kihwal Lee | Jitendra Nath Pandey |
| [HADOOP-7645](https://issues.apache.org/jira/browse/HADOOP-7645) | HTTP auth tests requiring Kerberos infrastructure are not disabled on branch-0.20-security | Blocker | security | Aaron T. Myers | Jitendra Nath Pandey |
| [HADOOP-7644](https://issues.apache.org/jira/browse/HADOOP-7644) | Fix the delegation token tests to use the new style renewers | Blocker | security | Owen O'Malley | Owen O'Malley |
| [HADOOP-7637](https://issues.apache.org/jira/browse/HADOOP-7637) | Fair scheduler configuration file is not bundled in RPM | Major | build | Eric Yang | Eric Yang |
| [HADOOP-7633](https://issues.apache.org/jira/browse/HADOOP-7633) | log4j.properties should be added to the hadoop conf on deploy | Major | conf | Arpit Gupta | Eric Yang |
| [HADOOP-7631](https://issues.apache.org/jira/browse/HADOOP-7631) | In mapred-site.xml, stream.tmpdir is mapped to ${mapred.temp.dir} which is undeclared. | Major | conf | Ramya Sunil | Eric Yang |
| [HADOOP-7630](https://issues.apache.org/jira/browse/HADOOP-7630) | hadoop-metrics2.properties should have a property \*.period set to a default value for metrics | Major | conf | Arpit Gupta | Eric Yang |
| [HADOOP-7626](https://issues.apache.org/jira/browse/HADOOP-7626) | Allow overwrite of HADOOP\_CLASSPATH and HADOOP\_OPTS | Major | scripts | Eric Yang | Eric Yang |
| [HADOOP-7625](https://issues.apache.org/jira/browse/HADOOP-7625) | TestDelegationToken is failing in 205 | Major | . | Owen O'Malley | Owen O'Malley |
| [HADOOP-7615](https://issues.apache.org/jira/browse/HADOOP-7615) | Binary layout does not put share/hadoop/contrib/\*.jar into the class path | Major | scripts | Eric Yang | Eric Yang |
| [HADOOP-7610](https://issues.apache.org/jira/browse/HADOOP-7610) | /etc/profile.d does not exist on Debian | Major | scripts | Eric Yang | Eric Yang |
| [HADOOP-7603](https://issues.apache.org/jira/browse/HADOOP-7603) | Set default hdfs, mapred uid, and hadoop group gid for RPM packages | Major | . | Eric Yang | Eric Yang |
| [HADOOP-7602](https://issues.apache.org/jira/browse/HADOOP-7602) | wordcount, sort etc on har files fails with NPE | Major | . | John George | John George |
| [HADOOP-7599](https://issues.apache.org/jira/browse/HADOOP-7599) | Improve hadoop setup conf script to setup secure Hadoop cluster | Major | scripts | Eric Yang | Eric Yang |
| [HADOOP-7596](https://issues.apache.org/jira/browse/HADOOP-7596) | Enable jsvc to work with Hadoop RPM package | Major | build | Eric Yang | Eric Yang |
| [HADOOP-7539](https://issues.apache.org/jira/browse/HADOOP-7539) | merge hadoop archive goodness from trunk to .20 | Major | . | John George | John George |
| [HADOOP-7400](https://issues.apache.org/jira/browse/HADOOP-7400) | HdfsProxyTests fails when the -Dtest.build.dir and -Dbuild.test is set | Major | build | Giridharan Kesavan | Giridharan Kesavan |
| [HADOOP-6833](https://issues.apache.org/jira/browse/HADOOP-6833) | IPC leaks call parameters when exceptions thrown | Blocker | . | Todd Lipcon | Todd Lipcon |
| [HADOOP-6722](https://issues.apache.org/jira/browse/HADOOP-6722) | NetUtils.connect should check that it hasn't connected a socket to itself | Major | util | Todd Lipcon | Todd Lipcon |
| [HDFS-2411](https://issues.apache.org/jira/browse/HDFS-2411) | with webhdfs enabled in secure mode the auth to local mappings are not being respected. | Major | webhdfs | Arpit Gupta | Jitendra Nath Pandey |
| [HDFS-2408](https://issues.apache.org/jira/browse/HDFS-2408) | DFSClient#getNumCurrentReplicas is package private in 205 but public in branch-0.20-append | Blocker | hdfs-client | stack | stack |
| [HDFS-2405](https://issues.apache.org/jira/browse/HDFS-2405) | hadoop dfs command with webhdfs fails on secure hadoop | Critical | webhdfs | Arpit Gupta | Jitendra Nath Pandey |
| [HDFS-2392](https://issues.apache.org/jira/browse/HDFS-2392) | Dist with hftp is failing again | Critical | namenode | Rajit Saha | Daryn Sharp |
| [HDFS-2375](https://issues.apache.org/jira/browse/HDFS-2375) | TestFileAppend4 fails in 0.20.205 branch | Blocker | hdfs-client | Suresh Srinivas | Suresh Srinivas |
| [HDFS-2373](https://issues.apache.org/jira/browse/HDFS-2373) | Commands using webhdfs and hftp print unnecessary debug information on the console with security enabled | Major | webhdfs | Arpit Gupta | Arpit Gupta |
| [HDFS-2368](https://issues.apache.org/jira/browse/HDFS-2368) | defaults created for web keytab and principal, these properties should not have defaults | Major | . | Arpit Gupta | Tsz Wo Nicholas Sze |
| [HDFS-2361](https://issues.apache.org/jira/browse/HDFS-2361) | hftp is broken | Critical | namenode | Rajit Saha | Jitendra Nath Pandey |
| [HDFS-2359](https://issues.apache.org/jira/browse/HDFS-2359) | NPE found in Datanode log while Disk failed during different HDFS operation | Major | datanode | Rajit Saha | Jonathan Eagles |
| [HDFS-2358](https://issues.apache.org/jira/browse/HDFS-2358) | NPE when the default filesystem's uri has no authority | Major | namenode | Rajit Saha | Daryn Sharp |
| [HDFS-2342](https://issues.apache.org/jira/browse/HDFS-2342) | TestSleepJob and TestHdfsProxy broken after HDFS-2284 | Blocker | build | Kihwal Lee | Tsz Wo Nicholas Sze |
| [HDFS-2333](https://issues.apache.org/jira/browse/HDFS-2333) | HDFS-2284 introduced 2 findbugs warnings on trunk | Major | . | Ivan Kelly | Tsz Wo Nicholas Sze |
| [HDFS-2331](https://issues.apache.org/jira/browse/HDFS-2331) | Hdfs compilation fails | Major | hdfs-client | Abhijit Suresh Shingate | Abhijit Suresh Shingate |
| [HDFS-2328](https://issues.apache.org/jira/browse/HDFS-2328) | hftp throws NPE if security is not enabled on remote cluster | Critical | . | Daryn Sharp | Owen O'Malley |
| [HDFS-2325](https://issues.apache.org/jira/browse/HDFS-2325) | Fuse-DFS fails to build on Hadoop 20.203.0 | Blocker | fuse-dfs, libhdfs | Charles Earl | Kihwal Lee |
| [HDFS-2320](https://issues.apache.org/jira/browse/HDFS-2320) | Make merged protocol changes from 0.20-append to 0.20-security compatible with previous releases. | Major | datanode, hdfs-client, namenode | Suresh Srinivas | Suresh Srinivas |
| [HDFS-2309](https://issues.apache.org/jira/browse/HDFS-2309) | TestRenameWhileOpen fails in branch-0.20-security | Major | . | Jitendra Nath Pandey | Jitendra Nath Pandey |
| [HDFS-2300](https://issues.apache.org/jira/browse/HDFS-2300) | TestFileAppend4 and TestMultiThreadedSync fail on 20.append and 20-security. | Major | . | Jitendra Nath Pandey | Jitendra Nath Pandey |
| [HDFS-2259](https://issues.apache.org/jira/browse/HDFS-2259) | DN web-UI doesn't work with paths that contain html | Minor | datanode | Eli Collins | Eli Collins |
| [HDFS-2190](https://issues.apache.org/jira/browse/HDFS-2190) | NN fails to start if it encounters an empty or malformed fstime file | Major | namenode | Aaron T. Myers | Aaron T. Myers |
| [HDFS-2117](https://issues.apache.org/jira/browse/HDFS-2117) | DiskChecker#mkdirsWithExistsAndPermissionCheck may return true even when the dir is not created | Minor | datanode | Eli Collins | Eli Collins |
| [HDFS-2053](https://issues.apache.org/jira/browse/HDFS-2053) | Bug in INodeDirectory#computeContentSummary warning | Minor | namenode | Michael Noll | Michael Noll |
| [HDFS-1836](https://issues.apache.org/jira/browse/HDFS-1836) | Thousand of CLOSE\_WAIT socket | Major | hdfs-client | Dennis Cheung | Bharath Mundlapudi |
| [HDFS-1779](https://issues.apache.org/jira/browse/HDFS-1779) | After NameNode restart , Clients can not read partial files even after client invokes Sync. | Major | datanode, namenode | Uma Maheswara Rao G | Uma Maheswara Rao G |
| [HDFS-1346](https://issues.apache.org/jira/browse/HDFS-1346) | DFSClient receives out of order packet ack | Major | datanode, hdfs-client | Hairong Kuang | Hairong Kuang |
| [HDFS-1260](https://issues.apache.org/jira/browse/HDFS-1260) | 0.20: Block lost when multiple DNs trying to recover it to different genstamps | Critical | . | Todd Lipcon | Todd Lipcon |
| [HDFS-1218](https://issues.apache.org/jira/browse/HDFS-1218) | 20 append: Blocks recovered on startup should be treated with lower priority during block synchronization | Critical | datanode | Todd Lipcon | Todd Lipcon |
| [HDFS-1207](https://issues.apache.org/jira/browse/HDFS-1207) | 0.20-append: stallReplicationWork should be volatile | Major | namenode | Todd Lipcon | Todd Lipcon |
| [HDFS-1204](https://issues.apache.org/jira/browse/HDFS-1204) | 0.20: Lease expiration should recover single files, not entire lease holder | Major | . | Todd Lipcon | sam rash |
| [HDFS-1202](https://issues.apache.org/jira/browse/HDFS-1202) | DataBlockScanner throws NPE when updated before initialized | Major | datanode | Todd Lipcon | Todd Lipcon |
| [HDFS-1197](https://issues.apache.org/jira/browse/HDFS-1197) | Blocks are considered "complete" prematurely after commitBlockSynchronization or DN restart | Major | datanode, hdfs-client, namenode | Todd Lipcon | Todd Lipcon |
| [HDFS-1186](https://issues.apache.org/jira/browse/HDFS-1186) | 0.20: DNs should interrupt writers at start of recovery | Blocker | datanode | Todd Lipcon | Todd Lipcon |
| [HDFS-1164](https://issues.apache.org/jira/browse/HDFS-1164) | TestHdfsProxy is failing | Major | contrib/hdfsproxy | Eli Collins | Todd Lipcon |
| [HDFS-1141](https://issues.apache.org/jira/browse/HDFS-1141) | completeFile does not check lease ownership | Blocker | namenode | Todd Lipcon | Todd Lipcon |
| [HDFS-1118](https://issues.apache.org/jira/browse/HDFS-1118) | DFSOutputStream socket leak when cannot connect to DataNode | Major | . | Zheng Shao | Zheng Shao |
| [HDFS-988](https://issues.apache.org/jira/browse/HDFS-988) | saveNamespace race can corrupt the edits log | Blocker | namenode | dhruba borthakur | Eli Collins |
| [HDFS-724](https://issues.apache.org/jira/browse/HDFS-724) | Pipeline close hangs if one of the datanode is not responsive. | Blocker | datanode, hdfs-client | Tsz Wo Nicholas Sze | Hairong Kuang |
| [HDFS-606](https://issues.apache.org/jira/browse/HDFS-606) | ConcurrentModificationException in invalidateCorruptReplicas() | Major | namenode | Konstantin Shvachko | Konstantin Shvachko |
| [HDFS-142](https://issues.apache.org/jira/browse/HDFS-142) | In 0.20, move blocks being written into a blocksBeingWritten directory | Blocker | . | Raghu Angadi | dhruba borthakur |
| [MAPREDUCE-3112](https://issues.apache.org/jira/browse/MAPREDUCE-3112) | Calling hadoop cli inside mapreduce job leads to errors | Major | contrib/streaming | Eric Yang | Eric Yang |
| [MAPREDUCE-3081](https://issues.apache.org/jira/browse/MAPREDUCE-3081) | Change the name format for hadoop core and vaidya jar to be hadoop-{core/vaidya}-{version}.jar in vaidya.sh | Major | contrib/vaidya | vitthal (Suhas) Gogate | |
| [MAPREDUCE-3076](https://issues.apache.org/jira/browse/MAPREDUCE-3076) | TestSleepJob fails | Blocker | test | Arun C Murthy | Arun C Murthy |
| [MAPREDUCE-2915](https://issues.apache.org/jira/browse/MAPREDUCE-2915) | LinuxTaskController does not work when JniBasedUnixGroupsNetgroupMapping or JniBasedUnixGroupsMapping is enabled | Major | task-controller | Kihwal Lee | Kihwal Lee |
| [MAPREDUCE-2852](https://issues.apache.org/jira/browse/MAPREDUCE-2852) | Jira for YDH bug 2854624 | Major | tasktracker | Eli Collins | Kihwal Lee |
| [MAPREDUCE-2764](https://issues.apache.org/jira/browse/MAPREDUCE-2764) | Fix renewal of dfs delegation tokens | Major | . | Daryn Sharp | Owen O'Malley |
| [MAPREDUCE-2705](https://issues.apache.org/jira/browse/MAPREDUCE-2705) | tasks localized and launched serially by TaskLauncher - causing other tasks to be delayed | Major | tasktracker | Thomas Graves | Thomas Graves |
| [MAPREDUCE-2650](https://issues.apache.org/jira/browse/MAPREDUCE-2650) | back-port MAPREDUCE-2238 to 0.20-security | Major | . | Sherry Chen | Sherry Chen |
| [MAPREDUCE-2610](https://issues.apache.org/jira/browse/MAPREDUCE-2610) | Inconsistent API JobClient.getQueueAclsForCurrentUser | Major | client | Joep Rottinghuis | Joep Rottinghuis |
| [MAPREDUCE-2549](https://issues.apache.org/jira/browse/MAPREDUCE-2549) | Potential resource leaks in HadoopServer.java, RunOnHadoopWizard.java and Environment.java | Major | contrib/eclipse-plugin, contrib/streaming | Devaraj K | Devaraj K |
| [MAPREDUCE-2489](https://issues.apache.org/jira/browse/MAPREDUCE-2489) | Jobsplits with random hostnames can make the queue unusable | Major | jobtracker | Jeffrey Naisbitt | Jeffrey Naisbitt |
| [MAPREDUCE-2324](https://issues.apache.org/jira/browse/MAPREDUCE-2324) | Job should fail if a reduce task can't be scheduled anywhere | Major | . | Todd Lipcon | Robert Joseph Evans |
| [MAPREDUCE-2187](https://issues.apache.org/jira/browse/MAPREDUCE-2187) | map tasks timeout during sorting | Major | . | Gianmarco De Francisci Morales | Anupam Seth |
### TESTS:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
| [HDFS-1252](https://issues.apache.org/jira/browse/HDFS-1252) | TestDFSConcurrentFileOperations broken in 0.20-append | Major | test | Todd Lipcon | Todd Lipcon |
| [HDFS-1242](https://issues.apache.org/jira/browse/HDFS-1242) | 0.20 append: Add test for appendFile() race solved in HDFS-142 | Major | . | Todd Lipcon | Todd Lipcon |
### SUB-TASKS:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
| [HDFS-2404](https://issues.apache.org/jira/browse/HDFS-2404) | webhdfs liststatus json response is not correct | Major | webhdfs | Arpit Gupta | Suresh Srinivas |
| [HDFS-2403](https://issues.apache.org/jira/browse/HDFS-2403) | The renewer in NamenodeWebHdfsMethods.generateDelegationToken(..) is not used | Major | . | Tsz Wo Nicholas Sze | Tsz Wo Nicholas Sze |
| [HDFS-2395](https://issues.apache.org/jira/browse/HDFS-2395) | webhdfs api's should return a root element in the json response | Critical | webhdfs | Arpit Gupta | Tsz Wo Nicholas Sze |
| [HDFS-2385](https://issues.apache.org/jira/browse/HDFS-2385) | Support delegation token renewal in webhdfs | Major | webhdfs | Tsz Wo Nicholas Sze | Tsz Wo Nicholas Sze |
| [HDFS-2366](https://issues.apache.org/jira/browse/HDFS-2366) | webhdfs throws a npe when ugi is null from getDelegationToken | Major | webhdfs | Arpit Gupta | Tsz Wo Nicholas Sze |
| [HDFS-2356](https://issues.apache.org/jira/browse/HDFS-2356) | webhdfs: support case insensitive query parameter names | Major | webhdfs | Tsz Wo Nicholas Sze | Tsz Wo Nicholas Sze |
| [HDFS-2348](https://issues.apache.org/jira/browse/HDFS-2348) | Support getContentSummary and getFileChecksum in webhdfs | Major | webhdfs | Tsz Wo Nicholas Sze | Tsz Wo Nicholas Sze |
| [HDFS-2340](https://issues.apache.org/jira/browse/HDFS-2340) | Support getFileBlockLocations and getDelegationToken in webhdfs | Major | webhdfs | Tsz Wo Nicholas Sze | Tsz Wo Nicholas Sze |
| [HDFS-2338](https://issues.apache.org/jira/browse/HDFS-2338) | Configuration option to enable/disable webhdfs. | Major | webhdfs | Jitendra Nath Pandey | Jitendra Nath Pandey |
| [HDFS-2318](https://issues.apache.org/jira/browse/HDFS-2318) | Provide authentication to webhdfs using SPNEGO | Major | webhdfs | Tsz Wo Nicholas Sze | Tsz Wo Nicholas Sze |
| [HDFS-2317](https://issues.apache.org/jira/browse/HDFS-2317) | Read access to HDFS using HTTP REST | Major | . | Tsz Wo Nicholas Sze | Tsz Wo Nicholas Sze |
| [HDFS-2284](https://issues.apache.org/jira/browse/HDFS-2284) | Write Http access to HDFS | Major | . | Sanjay Radia | Tsz Wo Nicholas Sze |
| [HDFS-1057](https://issues.apache.org/jira/browse/HDFS-1057) | Concurrent readers hit ChecksumExceptions if following a writer to very end of file | Blocker | datanode | Todd Lipcon | sam rash |
| [HDFS-561](https://issues.apache.org/jira/browse/HDFS-561) | Fix write pipeline READ\_TIMEOUT | Major | datanode, hdfs-client | Kan Zhang | Kan Zhang |
| [MAPREDUCE-2928](https://issues.apache.org/jira/browse/MAPREDUCE-2928) | MR-2413 improvements | Major | tasktracker | Eli Collins | Eli Collins |
| [MAPREDUCE-2780](https://issues.apache.org/jira/browse/MAPREDUCE-2780) | Standardize the value of token service | Major | . | Daryn Sharp | Daryn Sharp |
### OTHER:
| JIRA | Summary | Priority | Component | Reporter | Contributor |
|:---- |:---- | :--- |:---- |:---- |:---- |
| [HDFS-1795](https://issues.apache.org/jira/browse/HDFS-1795) | Port 0.20-append changes onto 0.20-security-203 | Major | . | Andrew Purtell | |

@ -0,0 +1,195 @@
<!---
# Licensed to the Apache Software Foundation (ASF) under one
# or more contributor license agreements. See the NOTICE file
# distributed with this work for additional information
# regarding copyright ownership. The ASF licenses this file
# to you under the Apache License, Version 2.0 (the
# "License"); you may not use this file except in compliance
# with the License. You may obtain a copy of the License at
#
# http://www.apache.org/licenses/LICENSE-2.0
#
# Unless required by applicable law or agreed to in writing, software
# distributed under the License is distributed on an "AS IS" BASIS,
# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
# See the License for the specific language governing permissions and
# limitations under the License.
-->
# Apache Hadoop 0.20.205.0 Release Notes
These release notes cover new developer and user-facing incompatibilities, features, and major improvements.
---
* [HADOOP-7724](https://issues.apache.org/jira/browse/HADOOP-7724) | *Major* | **hadoop-setup-conf.sh should put proxy user info into the core-site.xml**
Fixed hadoop-setup-conf.sh to put proxy user in core-site.xml. (Arpit Gupta via Eric Yang)
---
* [HADOOP-7720](https://issues.apache.org/jira/browse/HADOOP-7720) | *Major* | **improve the hadoop-setup-conf.sh to read in the hbase user and setup the configs**
Added parameter for HBase user to setup config script. (Arpit Gupta via Eric Yang)
---
* [HADOOP-7715](https://issues.apache.org/jira/browse/HADOOP-7715) | *Major* | **see log4j Error when running mr jobs and certain dfs calls**
Removed unnecessary security logger configuration. (Eric Yang)
---
* [HADOOP-7711](https://issues.apache.org/jira/browse/HADOOP-7711) | *Major* | **hadoop-env.sh generated from templates has duplicate info**
Fixed recursive sourcing of HADOOP\_OPTS environment variables. (Arpit Gupta via Eric Yang)
---
* [HADOOP-7708](https://issues.apache.org/jira/browse/HADOOP-7708) | *Critical* | **config generator does not update the properties file if one exists already**
Fixed hadoop-setup-conf.sh to handle config file consistently. (Eric Yang)
---
* [HADOOP-7707](https://issues.apache.org/jira/browse/HADOOP-7707) | *Major* | **improve config generator to allow users to specify proxy user, turn append on or off, turn webhdfs on or off**
Added toggle for dfs.support.append, webhdfs and hadoop proxy user to setup config script. (Arpit Gupta via Eric Yang)
---
* [HADOOP-7691](https://issues.apache.org/jira/browse/HADOOP-7691) | *Major* | **hadoop deb pkg should take a diff group id**
Fixed conflicting uid for install packages. (Eric Yang)
---
* [HADOOP-7684](https://issues.apache.org/jira/browse/HADOOP-7684) | *Major* | **jobhistory server and secondarynamenode should have init.d script**
Added init.d script for jobhistory server and secondary namenode. (Eric Yang)
---
* [HADOOP-7681](https://issues.apache.org/jira/browse/HADOOP-7681) | *Minor* | **log4j.properties is missing properties for security audit and hdfs audit should be changed to info**
Fixed security and hdfs audit log4j properties. (Arpit Gupta via Eric Yang)
---
* [HADOOP-7655](https://issues.apache.org/jira/browse/HADOOP-7655) | *Major* | **provide a small validation script that smoke tests the installed cluster**
Committed to trunk and v23, since code reviewed by Eric.
---
* [HADOOP-7603](https://issues.apache.org/jira/browse/HADOOP-7603) | *Major* | **Set default hdfs, mapred uid, and hadoop group gid for RPM packages**
Set hdfs uid, mapred uid, and hadoop gid to fixed numbers (201, 202, and 123, respectively).
---
* [HADOOP-7119](https://issues.apache.org/jira/browse/HADOOP-7119) | *Major* | **add Kerberos HTTP SPNEGO authentication support to Hadoop JT/NN/DN/TT web-consoles**
Added support for Kerberos HTTP SPNEGO authentication to the Hadoop web-consoles.
---
* [HDFS-2358](https://issues.apache.org/jira/browse/HDFS-2358) | *Major* | **NPE when the default filesystem's uri has no authority**
Gives a meaningful error message instead of throwing an NPE.
---
* [HDFS-2338](https://issues.apache.org/jira/browse/HDFS-2338) | *Major* | **Configuration option to enable/disable webhdfs.**
Added a conf property dfs.webhdfs.enabled for enabling/disabling webhdfs.
---
* [HDFS-2318](https://issues.apache.org/jira/browse/HDFS-2318) | *Major* | **Provide authentication to webhdfs using SPNEGO**
Added two new conf properties dfs.web.authentication.kerberos.principal and dfs.web.authentication.kerberos.keytab for the SPNEGO servlet filter.
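
As a rough illustration of how the three WebHDFS properties named in HDFS-2338 and HDFS-2318 might be set together, the sketch below renders them as a Hadoop-style `<configuration>` fragment. The principal and keytab path are hypothetical placeholders, the helper name `build_site_fragment` is made up for this example, and placing the fragment in `hdfs-site.xml` is an assumption rather than something these notes state.

```python
import xml.etree.ElementTree as ET


def build_site_fragment(properties):
    """Render a name -> value mapping as a <configuration> element."""
    configuration = ET.Element("configuration")
    for name, value in properties.items():
        prop = ET.SubElement(configuration, "property")
        ET.SubElement(prop, "name").text = name
        ET.SubElement(prop, "value").text = value
    return configuration


webhdfs_properties = {
    # Property names come from the release notes; the values are examples.
    "dfs.webhdfs.enabled": "true",
    # Hypothetical principal and keytab path, for illustration only.
    "dfs.web.authentication.kerberos.principal": "HTTP/namenode.example.com@EXAMPLE.COM",
    "dfs.web.authentication.kerberos.keytab": "/etc/security/keytabs/spnego.keytab",
}

fragment = build_site_fragment(webhdfs_properties)
xml_text = ET.tostring(fragment, encoding="unicode")
print(xml_text)
```

Generating the fragment from a mapping keeps the property names in one place, which may help when the same settings have to be rolled out to several nodes.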
---
* [HDFS-2202](https://issues.apache.org/jira/browse/HDFS-2202) | *Major* | **Changes to balancer bandwidth should not require datanode restart.**
New dfsadmin command added: [-setBalancerBandwidth \<bandwidth\>] where bandwidth is the maximum network bandwidth, in bytes per second, that the balancer is allowed to use on each datanode during balancing.
This is an incompatible change in 0.23. The versions of ClientProtocol and DatanodeProtocol are changed.
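
Because the bandwidth argument is given in bytes per second, a small conversion helper can reduce unit mistakes. The sketch below builds the argument list for the command; the function name `balancer_bandwidth_args` is hypothetical, and invoking dfsadmin through the `hadoop` launcher script is an assumption about the local installation.

```python
def balancer_bandwidth_args(megabytes_per_second):
    """Build a dfsadmin argument list from a bandwidth given in MB/s.

    The -setBalancerBandwidth option expects bytes per second, so the
    value is converted using 1 MB = 1024 * 1024 bytes.
    """
    bytes_per_second = int(megabytes_per_second * 1024 * 1024)
    if bytes_per_second <= 0:
        raise ValueError("bandwidth must be positive")
    # Assumption: dfsadmin is reached through the `hadoop` launcher script.
    return ["hadoop", "dfsadmin", "-setBalancerBandwidth", str(bytes_per_second)]


print(" ".join(balancer_bandwidth_args(10)))
```

The returned list could be passed to `subprocess.run` on a machine where the Hadoop client is installed.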
---
* [HDFS-1554](https://issues.apache.org/jira/browse/HDFS-1554) | *Major* | **Append 0.20: New semantics for recoverLease**
Changed the recoverLease API to return whether the file is closed. Also changed the semantics of recoverLease to start lease recovery immediately.
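
One way a caller might use the boolean result is to retry until the file is reported closed. The sketch below shows that polling pattern only; `recover_lease` is a stand-in callable supplied by the caller, not the actual Hadoop client API, and the retry count and delay are arbitrary assumptions.

```python
import time


def wait_for_lease_recovery(recover_lease, path, attempts=10,
                            delay_seconds=1.0, sleep=time.sleep):
    """Call recover_lease(path) until it reports the file as closed.

    recover_lease is a caller-supplied stand-in that returns True once
    the file is closed and False while recovery is still in progress.
    Returns True on success, False if every attempt reported False.
    """
    for attempt in range(attempts):
        if recover_lease(path):
            return True
        if attempt < attempts - 1:
            sleep(delay_seconds)
    return False


# Demonstration with a fake client that reports closed on the third call.
responses = iter([False, False, True])
closed = wait_for_lease_recovery(lambda p: next(responses), "/user/example/file",
                                 sleep=lambda seconds: None)
print(closed)
```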
---
* [HDFS-630](https://issues.apache.org/jira/browse/HDFS-630) | *Major* | **In DFSOutputStream.nextBlockOutputStream(), the client can exclude specific datanodes when locating the next block.**
**WARNING: No release note provided for this incompatible change.**
---
* [MAPREDUCE-3112](https://issues.apache.org/jira/browse/MAPREDUCE-3112) | *Major* | **Calling hadoop cli inside mapreduce job leads to errors**
Removed inheritance of certain server environment variables (HADOOP\_OPTS and HADOOP\_ROOT\_LOGGER) in task attempt process.
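
The effect of this change can be pictured as filtering the server's environment before a task attempt process is launched. The sketch below is an illustration of that idea only, not the TaskTracker's actual code; the function name `task_environment` is hypothetical.

```python
# Variables named in the release note as no longer inherited by task attempts.
SERVER_ONLY_VARIABLES = ("HADOOP_OPTS", "HADOOP_ROOT_LOGGER")


def task_environment(server_environment):
    """Return a copy of the environment without the server-only variables."""
    return {
        name: value
        for name, value in server_environment.items()
        if name not in SERVER_ONLY_VARIABLES
    }


server_env = {
    "HADOOP_OPTS": "-Xmx1g",
    "HADOOP_ROOT_LOGGER": "INFO,DRFA",
    "PATH": "/usr/bin",
}
print(task_environment(server_env))
```

With the server-side values removed, a `hadoop` command started from inside a task falls back to its own defaults instead of the daemon's options and logger settings.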
---
* [MAPREDUCE-3081](https://issues.apache.org/jira/browse/MAPREDUCE-3081) | *Major* | **Change the name format for hadoop core and vaidya jar to be hadoop-{core/vaidya}-{version}.jar in vaidya.sh**
Fixed the contrib/vaidya/bin/vaidya.sh script to use the appropriate jars and classpath.
---
* [MAPREDUCE-2777](https://issues.apache.org/jira/browse/MAPREDUCE-2777) | *Major* | **Backport MAPREDUCE-220 to Hadoop 20 security branch**
Adds cumulative cpu usage and total heap usage to task counters. This is a backport of MAPREDUCE-220 and MAPREDUCE-2469.
---
* [MAPREDUCE-2764](https://issues.apache.org/jira/browse/MAPREDUCE-2764) | *Major* | **Fix renewal of dfs delegation tokens**
Generalizes token renewal and canceling to a common interface and provides a plugin interface for adding renewers for new kinds of tokens. Hftp now stores the tokens as HFTP and renews them over HTTP.
---
* [MAPREDUCE-2494](https://issues.apache.org/jira/browse/MAPREDUCE-2494) | *Major* | **Make the distributed cache delete entries using LRU priority**
Added config option mapreduce.tasktracker.cache.local.keep.pct to the TaskTracker. It is the target fraction of the local distributed cache that should be kept between garbage collection runs. In practice, unused distributed cache entries are deleted in LRU order until the size of the cache is less than mapreduce.tasktracker.cache.local.keep.pct of the maximum cache size. This is a floating point value between 0.0 and 1.0. The default is 0.95.
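
The eviction rule described above can be sketched as follows. This is a minimal illustration under stated assumptions, not the TaskTracker implementation: the cache is modeled as an `OrderedDict` of entry name to size ordered from least to most recently used, `in_use` is a hypothetical set of entries that must not be deleted, and what triggers a garbage collection run is left out.

```python
from collections import OrderedDict


def evict_lru(cache, max_cache_size, keep_pct=0.95, in_use=frozenset()):
    """Delete unused entries in LRU order until the cache is small enough.

    cache maps entry name -> size and is ordered least recently used first.
    Eviction stops once the total size is below keep_pct * max_cache_size.
    Returns the names of the evicted entries in eviction order.
    """
    if not 0.0 <= keep_pct <= 1.0:
        raise ValueError("keep_pct must be between 0.0 and 1.0")
    target = keep_pct * max_cache_size
    total = sum(cache.values())
    evicted = []
    for name in list(cache):
        if total < target:
            break
        if name in in_use:
            continue  # entries still referenced by a task are skipped
        total -= cache.pop(name)
        evicted.append(name)
    return evicted


cache = OrderedDict([("a", 40), ("b", 30), ("c", 20), ("d", 10)])
evicted = evict_lru(cache, max_cache_size=100, keep_pct=0.5, in_use={"b"})
print(evicted, list(cache))
```

In the example, entry `b` is older than `c` but is kept because it is marked as in use, so the sweep moves on to the next least recently used entry.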
---
* [MAPREDUCE-2187](https://issues.apache.org/jira/browse/MAPREDUCE-2187) | *Major* | **map tasks timeout during sorting**
I just committed this. Thanks Anupam!
