Apache Hadoop Changelog

Release 0.12.0 - 2007-03-02

INCOMPATIBLE CHANGES:

JIRASummaryPriorityComponentReporterContributor

IMPORTANT ISSUES:

JIRASummaryPriorityComponentReporterContributor

NEW FEATURES:

JIRASummaryPriorityComponentReporterContributor
HADOOP-1032Support for caching Job JARsMinor.Gautam KowshikGautam Kowshik
HADOOP-492Global countersMajor.arkady borkovskyDavid Bowen
HADOOP-491streaming jobs should allow programs that don't do any IO for a long timeMajor.arkady borkovskyArun C Murthy

IMPROVEMENTS:

JIRASummaryPriorityComponentReporterContributor
HADOOP-1043Optimize the shuffle phase (increase the parallelism)Major.Devaraj DasDevaraj Das
HADOOP-1042Improve the handling of failed map output fetchesMajor.Devaraj DasDevaraj Das
HADOOP-1041Counter names are uglyMajor.Owen O'MalleyDavid Bowen
HADOOP-1040Improvement of RandomWriter example to use custom InputFormat, OutputFormat, and CountersMajor.Owen O'MalleyOwen O'Malley
HADOOP-1033Rewrite AmazonEC2 wiki pageMinorscriptsTom WhiteTom White
HADOOP-1030in unit tests, set ipc timeout in one placeMinortestDoug CuttingDoug Cutting
HADOOP-1025remove dead code in Server.javaMinoripcDoug CuttingDoug Cutting
HADOOP-1017Optimization: Reduce Overhead from ReflectionUtils.newInstanceMajorutilRon Bodkin
HADOOP-1007Names used for map, reduce, and shuffle metrics should be uniqueTrivialmetricsNigel DaleyNigel Daley
HADOOP-997Implement S3 retry mechanism for failed block transfersMajorfsTom WhiteTom White
HADOOP-985Namenode should identify DataNodes as ip:port instead of hostname:portMajor.Raghu AngadiRaghu Angadi
HADOOP-982A couple setter functions and toString method for BytesWritable.MajorioOwen O'MalleyOwen O'Malley
HADOOP-977The output from the user's task should be tagged and sent to the resepective console streams.Major.Owen O'MalleyArun C Murthy
HADOOP-975Separation of user tasks' stdout and stderr streamsMajor.Arun C MurthyArun C Murthy
HADOOP-972Improve the rack-aware replica placement performanceMajor.Hairong KuangHairong Kuang
HADOOP-954Metrics should offer complete set of static report methods or none at allMinormetricsNigel DaleyDavid Bowen
HADOOP-952Create a public (shared) Hadoop EC2 AMIMajorscriptsTom WhiteTom White
HADOOP-943fsck to show the filename of the corrupted fileTrivial.Koji Noguchidhruba borthakur
HADOOP-941Enhancements to Hadoop record I/O - Part 1MajorrecordMilind BhandarkarMilind Bhandarkar
HADOOP-940pendingReplications of FSNamesystem is not informativeMajor.Hairong Kuangdhruba borthakur
HADOOP-928make checksums optional per FileSystemMajorfsDoug CuttingHairong Kuang
HADOOP-889DFS unit tests have duplicate codeMinortestDoug CuttingMilind Bhandarkar
HADOOP-882S3FileSystem should retry if there is a communication problem with S3MajorfsTom WhiteTom White
HADOOP-867job client should generate input fragments before the job is submittedMajor.Owen O'Malley
HADOOP-858clean up smallJobsBenchmark and move to src/test/org/apache/hadoop/mapredMinorbuildNigel DaleyNigel Daley
HADOOP-432support undelete, snapshots, or other mechanism to recover lost filesMajor.Yoram ArnonDoug Cutting
HADOOP-333we should have some checks that the sort benchmark generates correct outputsMajor.Owen O'MalleyArun C Murthy
HADOOP-248locating map outputs via random probing is inefficientMajor.Owen O'MalleyDevaraj Das

BUG FIXES:

JIRASummaryPriorityComponentReporterContributor
HADOOP-1056Decommission only recognizes IP addesses in hosts and exclude files on refresh.Major.Wendy ChienWendy Chien
HADOOP-1049race condition in setting up ipc connectionsMajoripcOwen O'MalleyDevaraj Das
HADOOP-1046Datanode should periodically clean up /tmp from partially received (and not completed) block filesMajor.Andrzej BialeckiAndrzej Bialecki
HADOOP-1044TestDecommission fails because it attempts to transfer block to a dead datanodeMajortestWendy ChienWendy Chien
HADOOP-1039Reduce the time taken by TestCheckpointMajor.dhruba borthakurdhruba borthakur
HADOOP-1037bin/slaves.sh not compatible with /bin/dashMajor.Doug Cutting
HADOOP-1036task gets lost during assignmentCritical.Owen O'MalleyArun C Murthy
HADOOP-1034RuntimeException and Error not catched in DataNode.DataXceiver.run()Major.Philippe Gassmann
HADOOP-1029streaming doesn't work with multiple mapsBlocker.Arun C MurthyArun C Murthy
HADOOP-1027Fix the RAM FileSystem/Merge problems (reported in HADOOP-1014)Major.Devaraj DasDevaraj Das
HADOOP-1021TestMiniMRLocalFS and TestMiniMRCaching broken on WindowsMajortestNigel Daley
HADOOP-1020Path class on Windows seems brokenMajor.Nigel DaleyDoug Cutting
HADOOP-1010getReordReader methof of InputFormat class should handle null reporter argumentMajor.Runping QiRunping Qi
HADOOP-1006The “-local” option does work properly with test programsMinortestGautam KowshikDoug Cutting
HADOOP-1000Loggers in the Task framework should not write the the Tasks stderrMajor.Owen O'MalleyArun C Murthy
HADOOP-994DFS Scalability : a BlockReport that returns large number of blocks-to-be-deleted cause datanode to lost connectivity to namenodeMajor.dhruba borthakurdhruba borthakur
HADOOP-992The mini/mr cluster for testing always uses the local file system rather than the namenode that was passed inMajortestOwen O'MalleyOwen O'Malley
HADOOP-990Datanode doesn't retry when write to one (full)drive failMajor.Koji NoguchiRaghu Angadi
HADOOP-973NPE in FSDataset during heavy Namenode loadMajor.Nigel Daleydhruba borthakur
HADOOP-947isReplicationInProgress() is very heavyweightMajor.dhruba borthakurdhruba borthakur
HADOOP-933Application defined InputSplits do not workMajor.Benjamin ReedOwen O'Malley
HADOOP-923DFS Scalability: datanode heartbeat timeouts cause cascading timeouts of other datanodesMajor.dhruba borthakurdhruba borthakur
HADOOP-893dead datanode set should be maintained in the file handle or file system for hdfsMajor.Owen O'MalleyRaghu Angadi
HADOOP-878reducer NONE does not work with multiple mapsMinor.Mahadev konarArun C Murthy
HADOOP-803Reducing memory consumption on Namenode : Part 1Major.Raghu AngadiRaghu Angadi
HADOOP-761Unit tests should cleanup created files in /tmp. It causes tests to fail if more than one users run tests on same machine.MinortestSanjay DahiyaNigel Daley
HADOOP-654jobs fail with some hardware/system failures on a small number of nodesMinor.Yoram ArnonArun C Murthy
HADOOP-649Jobs without any map and reduce operations seems to be lost after their executionMajor.Thomas FriolOwen O'Malley
HADOOP-564we should use hdfs:// in all API URIsMajor.eric baldeschwielerWendy Chien
HADOOP-476Streaming should check for correctness of the taskMajor.arkady borkovskyArun C Murthy
HADOOP-442slaves file should include an ‘exclude’ section, to prevent “bad” datanodes and tasktrackers from disrupting a clusterMajorconfYoram ArnonWendy Chien
HADOOP-338the number of maps in the JobConf does not match realityMajor.Owen O'MalleyOwen O'Malley
HADOOP-109Blocks are not replicated when...Major.Konstantin ShvachkoKonstantin Shvachko

TESTS:

JIRASummaryPriorityComponentReporterContributor
HADOOP-1051Add checkstyle target to ant build fileMajorbuild, testTom WhiteTom White

SUB-TASKS:

JIRASummaryPriorityComponentReporterContributor

OTHER:

JIRASummaryPriorityComponentReporterContributor