Apache Hadoop Changelog

Release 0.3.0 - 2006-06-02

INCOMPATIBLE CHANGES:

JIRASummaryPriorityComponentReporterContributor

IMPORTANT ISSUES:

JIRASummaryPriorityComponentReporterContributor

NEW FEATURES:

JIRASummaryPriorityComponentReporterContributor
HADOOP-256Implement a C api for hadoop dfsMajor.Arun C MurthyArun C Murthy
HADOOP-233add a http status server for the task trackersMajor.Owen O'MalleyOwen O'Malley
HADOOP-222Set replication from dfsshellTrivial.Johan OskarssonJohan Oskarsson
HADOOP-209Add a program to recursively copy directories across file systemsMajorfsMilind Bhandarkar
HADOOP-115permit reduce input types to differ from reduce output typesMajor.Runping QiRunping Qi

IMPROVEMENTS:

JIRASummaryPriorityComponentReporterContributor
HADOOP-254use http to shuffle data between the maps and the reducesMajor.Owen O'MalleyOwen O'Malley
HADOOP-218Inefficient calls to get configuration values in TaskInprogressMinor.Mahadev konarMahadev konar
HADOOP-216Task Detail web page missing progressTrivial.Bryan PendletonDoug Cutting
HADOOP-212allow changes to dfs block sizeCritical.Owen O'MalleyOwen O'Malley
HADOOP-211logging improvements for HadoopMinor.Sameer ParanjpyeSameer Paranjpye
HADOOP-208add failure page to webappMajor.Owen O'MalleyOwen O'Malley
HADOOP-204Need to tweak a few things in the metrics package to support the Simon pluginMajormetricsDavid BowenDavid Bowen
HADOOP-202sort should use a smaller number of reducesTrivial.Owen O'MalleyOwen O'Malley
HADOOP-195improve performance of map output transfersMajor.Owen O'MalleyOwen O'Malley
HADOOP-75dfs should check full file availability only at closeMinor.Doug CuttingMilind Bhandarkar

BUG FIXES:

JIRASummaryPriorityComponentReporterContributor
HADOOP-270possible deadlock when shut down a datanode threadMajor.Hairong KuangHairong Kuang
HADOOP-265Abort tasktracker if it can not write to its local directoriesMajor.Hairong KuangHairong Kuang
HADOOP-264WritableFactory has no permissions to create DatanodeRegistrationBlocker.Owen O'MalleyOwen O'Malley
HADOOP-259map output http client does not timeoutMajor.Owen O'MalleyOwen O'Malley
HADOOP-251progress report failures kill taskMajor.Owen O'MalleyOwen O'Malley
HADOOP-247The Reduce Task thread for reporting progress during the sort exits in case of any IOExceptionCritical.Mahadev konarMahadev konar
HADOOP-241TestCopyFiles fails under cygwin due to incorrect pathMinorfsKonstantin ShvachkoMilind Bhandarkar
HADOOP-238map outputs transfers fail with EOFExceptionBlocker.Owen O'MalleyOwen O'Malley
HADOOP-235LocalFileSystem.openRaw() throws the wrong string for FileNotFoundExceptionMajor.Benjamin Reed
HADOOP-229hadoop cp should generate a better number of map tasksMinorfsYoram ArnonMilind Bhandarkar
HADOOP-228hadoop cp should have a -config optionMinorfsYoram ArnonMilind Bhandarkar
HADOOP-219SequenceFile#handleChecksumException NPETrivialiostackDoug Cutting
HADOOP-217IllegalAcessException when creating a Block object via WritableFactoriesMajor.Hairong KuangHairong Kuang
HADOOP-205the job tracker does not schedule enough map on the clusterMajor.Owen O'MalleyMahadev konar
HADOOP-200The map task names are sent to the reducesMajor.Owen O'MalleyOwen O'Malley
HADOOP-180task tracker times out cleaning big jobMajor.Owen O'MalleyOwen O'Malley
HADOOP-179task tracker ghosts remain after 10 minutesMajor.Owen O'MalleyOwen O'Malley
HADOOP-163If a DFS datanode cannot write onto its file system. it should tell the name node not to assign new blocks to it.Major.Runping QiHairong Kuang
HADOOP-161dfs blocks define equal, but not hashcodeMajor.Owen O'MalleyMilind Bhandarkar
HADOOP-146potential conflict in block id's, leading to data corruptionMajor.Yoram ArnonKonstantin Shvachko
HADOOP-141Disk thrashing / task timeouts during map output copy phaseMajor.p sutterOwen O'Malley
HADOOP-124don't permit two datanodes to run from same dfs.data.dirCritical.Bryan PendletonKonstantin Shvachko

TESTS:

JIRASummaryPriorityComponentReporterContributor

SUB-TASKS:

JIRASummaryPriorityComponentReporterContributor
HADOOP-220Add -dfs and -jt command-line parameters to specify namenode and jobtracker.MajorfsMilind BhandarkarMilind Bhandarkar

OTHER:

JIRASummaryPriorityComponentReporterContributor