Apache Hadoop Changelog

Release 0.1.0 - 2006-04-02

INCOMPATIBLE CHANGES:

JIRASummaryPriorityComponentReporterContributor

IMPORTANT ISSUES:

JIRASummaryPriorityComponentReporterContributor

NEW FEATURES:

JIRASummaryPriorityComponentReporterContributor
HADOOP-80binary keyMajorioOwen O'MalleyOwen O'Malley
HADOOP-46user-specified job namesMajor.Doug CuttingOwen O'Malley
HADOOP-44RPC exceptions should include remote stack traceMajoripcDoug CuttingDoug Cutting
HADOOP-37A way to determine the size and overall activity of the clusterMajor.Owen O'Malley

IMPROVEMENTS:

JIRASummaryPriorityComponentReporterContributor
HADOOP-103introduce a common parent class for Mapper and ReducerMinor.Owen O'MalleyOwen O'Malley
HADOOP-87SequenceFile performance degrades substantially compression is on and large values are encounteredMajorioSameer ParanjpyeDoug Cutting
HADOOP-79listFiles optimizationMajor.Konstantin ShvachkoKonstantin Shvachko
HADOOP-67Added statistic/reporting info to DFSTrivial.Barry KaplanDoug Cutting
HADOOP-60Specification of alternate conf. directoryMinor.stack
HADOOP-49JobClient cannot use a non-default server (unlike DFSShell)Major.Michel TournMichel Tourn
HADOOP-45JobTracker should log task errorsMajor.Doug CuttingDoug Cutting
HADOOP-41JAVA_OPTS for the TaskRunner ChildMinorconfstack
HADOOP-38default splitter should incorporate fs block sizeMajor.Doug Cutting
HADOOP-36Adding some uniformity/convenience to environment managementMajorconfBryan Pendleton
HADOOP-33DF enhancement: performance and win XP supportMinorfsKonstantin ShvachkoKonstantin Shvachko
HADOOP-30DFS shell: support for ls -r and catMajor.Michel Tourn
HADOOP-25a new map/reduce example and moving the examples from src/java to src/examplesMinor.Owen O'MalleyOwen O'Malley
HADOOP-20Mapper, Reducer need an occasion to cleanup after the last record is processed.Major.Michel Tourn

BUG FIXES:

JIRASummaryPriorityComponentReporterContributor
HADOOP-112copyFromLocal should exclude .crc filesMinor.Monu OgbeDoug Cutting
HADOOP-110new key and value instances are allocated before each mapMajor.Owen O'MalleyOwen O'Malley
HADOOP-107Namenode errors “Failed to complete filename.crc because dir.getFile()==null and null”Major.Igor BolotinDoug Cutting
HADOOP-102Two identical consecutive loops in FSNamesystem.chooseTarget()Major.Konstantin ShvachkoKonstantin Shvachko
HADOOP-100Inconsistent locking of the JobTracker.taskTrackers fieldMajor.Owen O'MalleyOwen O'Malley
HADOOP-98The JobTracker's count of the number of running maps and reduces is wrongMajor.Owen O'MalleyOwen O'Malley
HADOOP-97DFSShell.cat returns NullPointerException if file does not existMajor.Konstantin ShvachkoKonstantin Shvachko
HADOOP-93allow minimum split size configurableMajor.Hairong KuangDoug Cutting
HADOOP-86If corrupted map outputs, reducers get stuck fetching foreverMajor.stackDoug Cutting
HADOOP-84client should report file name in which IO exception occursMinor.Yoram ArnonKonstantin Shvachko
HADOOP-83infinite retries accessing a missing blockMajor.Yoram ArnonKonstantin Shvachko
HADOOP-82JobTracker loses it: NoSuchElementExceptionMinor.stack
HADOOP-81speculative execution is only controllable from the default configMajor.Owen O'MalleyOwen O'Malley
HADOOP-78rpc commands not bufferedMajoripcOwen O'MalleyOwen O'Malley
HADOOP-77hang / crash when input folder does not exists.Critical.Stefan Groschupf
HADOOP-70the two file system tests TestDFS and TestFileSystem take too longMajor.Owen O'MalleyOwen O'Malley
HADOOP-66dfs client writes all data for a chunk to /tmpMajor.Sameer ParanjpyeDoug Cutting
HADOOP-57hadoop dfs -ls / does not show root of file systemMinor.Yoram Arnon
HADOOP-52mapred input and output dirs must be absoluteMajor.Doug CuttingOwen O'Malley
HADOOP-42PositionCache decrements its position for reads at the end of fileMajorfsKonstantin Shvachko
HADOOP-40bufferSize argument is ignored in FileSystem.create(File, boolean, int)MinorfsKonstantin Shvachko
HADOOP-34Build Paths Relative to PWD in build.xmlTrivial.Jeremy Bensley
HADOOP-28webapps brokenMajor.Owen O'Malley
HADOOP-22remove unused importsTrivial.Sami Siren
HADOOP-21the webapps need to be updated for the move from nutchMinor.Owen O'Malley
HADOOP-19Datanode corruptionCritical.Rod TaylorDoug Cutting
HADOOP-16RPC call times out while indexing map task is computing splitsMajor.Chris SchneiderMike Cafarella
HADOOP-12InputFormat used in job must be in JobTracker classpath (not loaded from job JAR)Minor.Bryan Pendleton
HADOOP-10ndfs.replication is not documented within the nutch-default.xml configuration file.Trivial.Rod Taylor
HADOOP-7MapReduce has a series of problems concerning task-allocation to worker nodesMajor.Mike Cafarella
HADOOP-6missing build directory in classpathMinor.Owen O'Malley
HADOOP-5need commons-logging-api jar fileMinor.Owen O'Malley
HADOOP-3Output directories are not cleaned up before the reduces runMinor.Owen O'MalleyOwen O'Malley
HADOOP-2Reused Keys and Values fail with a CombinerMajor.Owen O'MalleyOwen O'Malley

TESTS:

JIRASummaryPriorityComponentReporterContributor

SUB-TASKS:

JIRASummaryPriorityComponentReporterContributor

OTHER:

JIRASummaryPriorityComponentReporterContributor
HADOOP-1initial import of code from NutchMajor.Doug CuttingDoug Cutting