Apache Hadoop Changelog

Release 0.19.0 - 2008-11-20

INCOMPATIBLE CHANGES:

JIRASummaryPriorityComponentReporterContributor
HADOOP-4430Namenode Web UI capacity report is inconsistent with BalancerBlocker.Suresh SrinivasSuresh Srinivas
HADOOP-4293Remove WritableJobConfMajor.Owen O'MalleyOwen O'Malley
HADOOP-4281Capacity reported in some of the commands is not consistent with the Web UI reported dataBlocker.Suresh SrinivasSuresh Srinivas
HADOOP-4227Remove the deprecated, unused class ShellCommand.MinorfsTsz Wo Nicholas SzeTsz Wo Nicholas Sze
HADOOP-4190Changes to JobHistory makes it backward incompatibleBlocker.Amar KamatAmar Kamat
HADOOP-4116Balancer should provide better resource managementBlocker.Raghu AngadiHairong Kuang
HADOOP-3981Need a distributed file checksum algorithm for HDFSMajor.Tsz Wo Nicholas SzeTsz Wo Nicholas Sze
HADOOP-3963libhdfs should never exit on its own but rather return errors to the calling applicationMinor.Pete WyckoffPete Wyckoff
HADOOP-3938Quotas for disk space managementMajor.Robert ChanslerRaghu Angadi
HADOOP-3911' -blocks ' option not being recognizedMinorfs, utilKoji NoguchiLohit Vijayarenu
HADOOP-3889distcp: Better Error Message should be thrown when accessing source files/directory with no read permissionMinor.Peeyush BishnoiTsz Wo Nicholas Sze
HADOOP-3837hadop streaming does not use progress reporting to detect hung tasksMajor.dhruba borthakurdhruba borthakur
HADOOP-3796fuse-dfs should take rw,ro,trashon,trashoff,protected=blah mount arguments rather than them being compiled inMajor.Pete WyckoffPete Wyckoff
HADOOP-3792exit code from “hadoop dfs -test ...” is wrong for Unix shellMinorfsBen SluskyBen Slusky
HADOOP-3722Provide a unified way to pass jobconf options from bin/hadoopMinorconfMatei ZahariaEnis Soztutar
HADOOP-3667Remove deprecated methods in JobConfMajor.Amareshwari SriramadasuAmareshwari Sriramadasu
HADOOP-3652Remove deprecated class OutputFormatBaseMajor.Amareshwari SriramadasuAmareshwari Sriramadasu
HADOOP-3595Remove deprecated mapred.combine.once functionalityMajor.Chris DouglasChris Douglas
HADOOP-3245Provide ability to persist running jobs (extend HADOOP-1876)Major.Devaraj DasAmar Kamat
HADOOP-3150Move task file promotion into the taskMajor.Owen O'MalleyAmareshwari Sriramadasu
HADOOP-3062Need to capture the metrics for the network ios generate by dfs reads/writes and map/reduce shuffling and break them down by racksMajormetricsRunping QiChris Douglas
HADOOP-2816Cluster summary at name node web has confusing report for space utilizationMajor.Robert ChanslerSuresh Srinivas
HADOOP-2325Require Java 6MajorbuildDoug CuttingDoug Cutting
HADOOP-1869access times of HDFS filesMajor.dhruba borthakurdhruba borthakur
HADOOP-1700Append to files in HDFSMajor.stackdhruba borthakur

IMPORTANT ISSUES:

JIRASummaryPriorityComponentReporterContributor

NEW FEATURES:

JIRASummaryPriorityComponentReporterContributor
HADOOP-4406Make TCTLSeparatedProtocol configurable and have DynamicSerDe initialize, initialize the SerDeMajor.Pete WyckoffPete Wyckoff
HADOOP-4301Forrest doc for skip bad records featureBlockerdocumentationSharad AgarwalSharad Agarwal
HADOOP-4260support show partitions in hiveMajor.Ashish ThusooAshish Thusoo
HADOOP-4176Implement getFileChecksum(Path) in HftpFileSystemMajor.Tsz Wo Nicholas SzeTsz Wo Nicholas Sze
HADOOP-4120[Hive] print time taken by query in interactive shellMinor.Raghotham MurthyRaghotham Murthy
HADOOP-4106add time, permission and user attribute support to fuse-dfsMajor.Pete WyckoffPete Wyckoff
HADOOP-4104add time, permission and user attribute support to libhdfsMajor.Pete WyckoffPete Wyckoff
HADOOP-4095[Hive] enhance describe table & partitionMajor.Prasad ChakkaNamit Jain
HADOOP-4086Add limit to Hive QLMajor.Ashish ThusooNamit Jain
HADOOP-4084Add explain plan capabilities to Hive QLMajor.Ashish ThusooAshish Thusoo
HADOOP-4070[Hive] Provide a mechanism for registering UDFs from the query languageMajor.Tom WhiteTom White
HADOOP-3992Synthetic Load Generator for NameNode testingMajor.Robert ChanslerHairong Kuang
HADOOP-3941Extend FileSystem API to return file-checksums/file-digestsMajorfsTsz Wo Nicholas SzeTsz Wo Nicholas Sze
HADOOP-3939DistCp should support an option for deleting non-existing files.Major.Tsz Wo Nicholas SzeTsz Wo Nicholas Sze
HADOOP-3924Add a ‘Killed’ job statusCritical.Alejandro AbdelnurSubru Krishnan
HADOOP-3873DistCp should have an option for limiting the number of files/bytes being copiedMajor.Tsz Wo Nicholas SzeTsz Wo Nicholas Sze
HADOOP-3854org.apache.hadoop.http.HttpServer should support user configurable filterMajorutilTsz Wo Nicholas SzeTsz Wo Nicholas Sze
HADOOP-3829Narrown down skipped records based on user acceptable valueMajor.Sharad AgarwalSharad Agarwal
HADOOP-3828Write skipped records' bytes to DFSMajor.Sharad AgarwalSharad Agarwal
HADOOP-3754Support a Thrift Interface to access files/directories in HDFSMajor.dhruba borthakurdhruba borthakur
HADOOP-3746A fair sharing job schedulerMinor.Matei ZahariaMatei Zaharia
HADOOP-3730add new JobConf constructor that disables loading default configurationsMajorconfAlejandro AbdelnurAlejandro Abdelnur
HADOOP-3714Bash tab completion supportTrivialscriptsChris SmithChris Smith
HADOOP-3702add support for chaining Maps in a single Map and after a Reduce [M*/RM*]Major.Alejandro AbdelnurAlejandro Abdelnur
HADOOP-3698Implement access control for submitting jobs to queues in the JobTrackerMajor.Hemanth YamijalaHemanth Yamijala
HADOOP-3695[HOD] Have an ability to run multiple slaves per nodeMajorcontrib/hodHemanth YamijalaVinod Kumar Vavilapalli
HADOOP-3585Hardware Failure Monitoring in large clusters running Hadoop/HDFSMinormetricsIoannis KoltsidasIoannis Koltsidas
HADOOP-3485fix writesMinor.Pete WyckoffPete Wyckoff
HADOOP-3479Implement configuration items useful for Hadoop resource manager (v1)MajorconfHemanth YamijalaHemanth Yamijala
HADOOP-3445Implementing core scheduler functionality in Resource Manager (V1) for HadoopMajor.Vivek RatanVivek Ratan
HADOOP-3402Add example code to support run terasort on hadoopMajor.Owen O'MalleyOwen O'Malley
HADOOP-3361Implement renames for NativeS3FileSystemMajorfs/s3Tom WhiteTom White
HADOOP-3149supporting multiple outputs for M/R jobsMajor.Alejandro AbdelnurAlejandro Abdelnur
HADOOP-3019want input sampler & sorted partitionerMajor.Doug CuttingChris Douglas
HADOOP-2664lzop-compatible CompresionCodecMajorioChris DouglasChris Douglas
HADOOP-2658Design and Implement a Test Plan to support appends to HDFS filesBlockertestdhruba borthakurdhruba borthakur
HADOOP-2536MapReduce for MySQLMinor.Fredrik HedbergFredrik Hedberg
HADOOP-1823want InputFormat for bzip2 filesMajor.Doug Cutting
HADOOP-1480pipes should be able to set user countersMajor.Owen O'MalleyArun C Murthy
HADOOP-372should allow to specify different inputformat classes for different input dirs for Map/Reduce jobsMajor.Runping QiChris Smith
HADOOP-153skip records that fail TaskMajor.Doug CuttingSharad Agarwal

IMPROVEMENTS:

JIRASummaryPriorityComponentReporterContributor
HADOOP-4433Improve data loader for collecting metrics and log files from hadoop and systemMajor.Eric YangEric Yang
HADOOP-4431Add versionning/tags to Chukwa ChunkMajor.Jerome BoulonJerome Boulon
HADOOP-4424menu layout change for Hadoop documentationBlockerdocumentationBoris ShkolnikBoris Shkolnik
HADOOP-4353enable multi-line query from Hive CLIMinor.Prasad ChakkaPrasad Chakka
HADOOP-4345Hive: Check that partitioning predicate is present when hive.partition.pruning = strictMajor.Ashish ThusooAshish Thusoo
HADOOP-4307add an option to describe table to show extended properties of the table such as serialization/deserialization propertiesMajor.Prasad ChakkaPrasad Chakka
HADOOP-4279write the random number generator seed to log in the append-related testsBlockertestTsz Wo Nicholas SzeTsz Wo Nicholas Sze
HADOOP-4252Catch Ctrl-C in Hive CLI so that corresponding hadoop jobs can be killedMinor.Prasad ChakkaPete Wyckoff
HADOOP-4231Hive: converting complex objects to JSON failed.Minor.Zheng ShaoZheng Shao
HADOOP-4230Hive: GroupBy should not pass the whole row from mapper to reducerBlocker.Zheng ShaoAshish Thusoo
HADOOP-4205[Hive] metastore and ql to use the refactored SerDe libraryMajor.Zheng ShaoZheng Shao
HADOOP-4194Add JobConf and JobID to job related methods in JobTrackerInstrumentationMajor.Mac YangMac Yang
HADOOP-4181some minor things to make Hadoop friendlier to gitMajorbuildOwen O'MalleyOwen O'Malley
HADOOP-4174Move non-client methods ou of ClientProtocolMajor.Konstantin ShvachkoKonstantin Shvachko
HADOOP-4151Add a memcmp-compatible interface for key typesMinor.Chris DouglasChris Douglas
HADOOP-4138[Hive] refactor the SerDe libraryMajor.Zheng ShaoZheng Shao
HADOOP-4124Changing priority of a job should be available in CLI and available on the web UI only along with the Kill Job actionsMajor.Hemanth YamijalaHemanth Yamijala
HADOOP-4117Improve configurability of Hadoop EC2 instancesMajorcontrib/cloudTom WhiteTom White
HADOOP-4113libhdfs should never exit on its own but rather return errors to the calling application - missing diff filesMinor.Pete WyckoffPete Wyckoff
HADOOP-4094[Hive]implement hive-site.xml similar to hadoop-site.xmlMinor.Prasad ChakkaPrasad Chakka
HADOOP-4090The configuration file lists two paths to hadoop directories (bin and conf). Startup should check that these are valid directories and give appropriate messages.Minor.Ashish ThusooRaghotham Murthy
HADOOP-4083change new config attribute queue.name to mapred.job.queue.nameMajor.Owen O'MalleyHemanth Yamijala
HADOOP-4075test-patch.sh should output the ant commands that it runsMajorbuildNigel DaleyRamya Sunil
HADOOP-4062IPC client does not need to be synchronized on the output stream when a connection is closedMajoripcHairong KuangHairong Kuang
HADOOP-4053Schedulers need to know when a job has completedBlocker.Vivek RatanAmar Kamat
HADOOP-3975test-patch can report the modifications found in the workspace along with the error messageMinortestHemanth YamijalaRamya Sunil
HADOOP-3965Make DataBlockScanner package privateMajor.Konstantin ShvachkoKonstantin Shvachko
HADOOP-3948Separate Namenodes edits and fsimageMajor.Lohit VijayarenuLohit Vijayarenu
HADOOP-3944TupleWritable listed as public class but cannot be used without methods private to the packageTrivialdocumentationMichael AndrewsChris Douglas
HADOOP-3943NetworkTopology.pseudoSortByDistance does not need to be a synchronized methodMajor.Hairong KuangHairong Kuang
HADOOP-3935Extract classes from DataNode.javaTrivial.Johan OskarssonJohan Oskarsson
HADOOP-3930Decide how to integrate scheduler info into CLI and job tracker web pageMajor.Matei ZahariaSreekanth Ramakrishnan
HADOOP-3908Better error message if llibhdfs.so doesn't existMinor.Pete WyckoffPete Wyckoff
HADOOP-3905Create a generic interface for edits log.Major.Konstantin ShvachkoKonstantin Shvachko
HADOOP-3892Include Unix group name in JobConfTrivialconfMatei ZahariaMatei Zaharia
HADOOP-3866Improve Hadoop Jobtracker AdminMajorscriptscraig weisenfluhcraig weisenfluh
HADOOP-3861Make MapFile.Reader and Writer implement java.io.CloseableMajorioTom WhiteTom White
HADOOP-3860Compare name-node performance when journaling is performed into local hard-drives or nfs.MajorbenchmarksKonstantin ShvachkoKonstantin Shvachko
HADOOP-3853Move multiple input format extension to library packageMajor.Tom WhiteTom White
HADOOP-3852If ShellCommandExecutor had a toString() operator that listed the command run, its error messages may be more meaningfulMinorutilSteve LoughranSteve Loughran
HADOOP-3844include message of local exception in Client call failuresMinoripcSteve LoughranSteve Loughran
HADOOP-3805improve fuse-dfs write performance which is 33% slower than hadoop dfs -copyFromLocalMinor.Pete Wyckoff
HADOOP-3780JobTracker should synchronously resolve the tasktracker's network location when the tracker registersMajor.Amar KamatAmar Kamat
HADOOP-3769expose static SampleMapper and SampleReducer classes of GenericMRLoadGenerator class for gridmix reuseMajortestLingyun YangLingyun Yang
HADOOP-3759Provide ability to run memory intensive jobs without affecting other running tasks on the nodesMajor.Hemanth YamijalaHemanth Yamijala
HADOOP-3747Add counter support to MultipleOutputsMinor.Alejandro AbdelnurAlejandro Abdelnur
HADOOP-3721CompositeRecordReader::next is unnecessarily complexMajor.Chris DouglasChris Douglas
HADOOP-3719ChukwaMajor.Ari RabkinAri Rabkin
HADOOP-3694if MiniDFS startup time could be improved, testing time would be reducedMajortestSteve LoughranDoug Cutting
HADOOP-3684The data_join should allow the user to implement a customer cloning functionMajor.Runping QiRunping Qi
HADOOP-3664Remove deprecated methods introduced in changes to validating input paths (HADOOP-3095)Major.Tom WhiteTom White
HADOOP-3661Normalize fuse-dfs handling of moving things to trash wrt the way hadoop dfs does it (only when non posix trash flag is enabled in compile)Major.Pete WyckoffPete Wyckoff
HADOOP-3660Add replication factor for injecting blocks in the data node clusterMajorbenchmarksSanjay RadiaSanjay Radia
HADOOP-3655provide more control options for the junit runMinorbuildSteve LoughranSteve Loughran
HADOOP-3646Providing bzip2 as codecMajorconf, ioAbdul QadeerAbdul Qadeer
HADOOP-3638Cache the iFile index files in memory to reduce seeks during map output servingMajor.Devaraj DasJothi Padmanabhan
HADOOP-3624CreateEditsLog could be improved to create tree directory structureMinortestLohit VijayarenuLohit Vijayarenu
HADOOP-3620Namenode should synchronously resolve a datanode's network location when the datanode registersMajor.Hairong KuangHairong Kuang
HADOOP-3617Writes from map serialization include redundant checks for accounting spaceMajor.Chris DouglasChris Douglas
HADOOP-3605Added an abort on unset AWS_ACCOUNT_ID to luanch-hadoop-masterMinorcontrib/cloudAl HoangAl Hoang
HADOOP-3581Prevent memory intensive user tasks from taking down nodesMajor.Hemanth YamijalaVinod Kumar Vavilapalli
HADOOP-3577Tools to inject blocks into name node and simulated data nodes for testingMinor.Sanjay RadiaSanjay Radia
HADOOP-3563Seperate out datanode and namenode functionality of generation stamp upgrade processMajor.dhruba borthakurdhruba borthakur
HADOOP-3556Substitute the synchronized code in MD5Hash to avoid lock contention. Use ThreadLocal instead.MajorioIván de PradoIván de Prado
HADOOP-3549meaningful errno values in libhdfsMajor.Ben SluskyBen Slusky
HADOOP-3514Reduce seeks during shuffle, by inline crcsMajor.Devaraj DasJothi Padmanabhan
HADOOP-3498File globbing alternation should be able to span path componentsMajorfsTom WhiteTom White
HADOOP-3478The algorithm to decide map re-execution on fetch failures can be improvedMajor.Jothi PadmanabhanJothi Padmanabhan
HADOOP-3446The reduce task should not flush the in memory file system before starting the reducerCritical.Owen O'MalleyChris Douglas
HADOOP-3412Refactor the scheduler out of the JobTrackerMinor.Brice ArnouldBrice Arnould
HADOOP-3368Can commons-logging.properties be pulled from hadoop-core?MajorbuildSteve LoughranSteve Loughran
HADOOP-3342Better safety of killing jobs via web interfaceMinor.Daniel NaberEnis Soztutar
HADOOP-3341make key-value separators in hadoop streaming fully configurableMajor.Zheng ShaoZheng Shao
HADOOP-3328DFS write pipeline : only the last datanode needs to verify checksumMajor.Raghu AngadiRaghu Angadi
HADOOP-3202Deprecate org.apache.hadoop.fs.FileUtil.fullyDelete(FileSystem fs, Path dir)MajorfsTsz Wo Nicholas SzeAmareshwari Sriramadasu
HADOOP-3169LeaseChecker daemon should not be started in DFSClient constructorMajor.Tsz Wo Nicholas SzeTsz Wo Nicholas Sze
HADOOP-2411Add support for larger EC2 instance typesMajorcontrib/cloudTom WhiteChris K Wensel
HADOOP-2330Preallocate transaction log to improve namenode transaction logging performanceMajor.dhruba borthakurdhruba borthakur
HADOOP-2302Streaming should provide an option for numerical sort of keysMajor.Lohit VijayarenuDevaraj Das
HADOOP-2165Augment JobHistory to store tasks' userlogsMajor.Arun C MurthyVinod Kumar Vavilapalli
HADOOP-2130Pipes submit job should be Non-blockingCritical.Srikanth KakaniArun C Murthy
HADOOP-1627DFSAdmin incorrectly reports cluster data.Minor.Konstantin ShvachkoRaghu Angadi
HADOOP-657Free temporary space should be modelled betterMajor.Owen O'MalleyAri Rabkin
HADOOP-249Improving Map -> Reduce performance and Task JVM reuseMajor.Benjamin ReedDevaraj Das

BUG FIXES:

JIRASummaryPriorityComponentReporterContributor
HADOOP-4595JVM Reuse triggers RuntimeException(“Invalid state”)Major.Aaron KimballDevaraj Das
HADOOP-4552Deadlock in RPC ServerMajoripcRaghu AngadiRaghu Angadi
HADOOP-4525config ipc.server.tcpnodelay is no loger being respectedMajoripcClint MorganClint Morgan
HADOOP-4510FileOutputFormat protects getTaskOutputPathBlocker.Chris K WenselChris K Wensel
HADOOP-4500multifilesplit is using job default filesystem incorrectlyMajor.Joydeep Sen SarmaJoydeep Sen Sarma
HADOOP-4498JobHistory does not escape literal jobName when used in a regex patternBlocker.Chris K WenselChris K Wensel
HADOOP-4471Capacity Scheduler should maintain the right ordering of jobs in its running queueBlocker.Vivek RatanAmar Kamat
HADOOP-4466SequenceFileOutputFormat is coupled to WritableComparable and WritableBlockerioChris K WenselChris K Wensel
HADOOP-4457Input split logging in history is broken in 0.19Blocker.Amareshwari SriramadasuAmareshwari Sriramadasu
HADOOP-4455Upload the derby.jar and TestSeDe.jar needed for fixes to 0.19 bugsBlocker.Ashish ThusooAshish Thusoo
HADOOP-4449Minor formatting changes to quota related commandsTrivial.Raghu AngadiRaghu Angadi
HADOOP-4446Update Scheduling Information display in Web UIMajor.Karam SinghSreekanth Ramakrishnan
HADOOP-4439Cleanup memory related resource managementBlocker.Hemanth YamijalaHemanth Yamijala
HADOOP-4438Add new/missing dfs commands in forrestBlockerdocumentationHemanth YamijalaSuresh Srinivas
HADOOP-4427Add new/missing commands in forrestBlockerdocumentationSharad AgarwalSreekanth Ramakrishnan
HADOOP-4425Edits log takes much longer to loadBlocker.Chris DouglasChris Douglas
HADOOP-4423FSDataset.getStoredBlock(id) should not return corrupted informationBlocker.Tsz Wo Nicholas SzeTsz Wo Nicholas Sze
HADOOP-4418Update documentation in forrest for Mapred, streaming and pipesBlockerdocumentationAmareshwari SriramadasuAmareshwari Sriramadasu
HADOOP-4410TestMiniMRDebugScript fails on trunkBlocker.Amareshwari SriramadasuAmareshwari Sriramadasu
HADOOP-4405all creation of hadoop dfs queries from with in hive shellMinor.Prasad ChakkaPrasad Chakka
HADOOP-4404saveFSImage() should remove files from a storage directory that do not correspond to its type.Blocker.Konstantin ShvachkoKonstantin Shvachko
HADOOP-4400Add “hdfs://” to fs.default.name on quickstart.htmlTrivialdocumentationJeff HammerbacherJeff Hammerbacher
HADOOP-4393Merge AccessControlException and AccessControlIOException into one exception classBlockerfsOwen O'MalleyOwen O'Malley
HADOOP-4387TestHDFSFileSystemContract fails on windowsBlockertestRaghu AngadiRaghu Angadi
HADOOP-4380Make new classes in mapred package private instead of publicMajor.Owen O'MalleyOwen O'Malley
HADOOP-4378TestJobQueueInformation fails regularlyBlockertestTsz Wo Nicholas SzeSreekanth Ramakrishnan
HADOOP-4376Fix line formatting in hadoop-default.xml for hadoop.http.filter.initializersBlockerconfEnis SoztutarEnis Soztutar
HADOOP-4373Guaranteed Capacity calculation is not calculated correctlyBlocker.Karam SinghHemanth Yamijala
HADOOP-4367Hive: UDAF functions cannot handle NULL valuesMajor.Zheng ShaoZheng Shao
HADOOP-4366Provide way to replace existing column names for columnSet tablesMajor.Prasad ChakkaPrasad Chakka
HADOOP-4361Corner cases in killJob from command lineBlocker.Amareshwari SriramadasuAmareshwari Sriramadasu
HADOOP-4358NPE from CreateEditsLogBlockertestChris DouglasRaghu Angadi
HADOOP-4356[Hive] for a 2-stage map-reduce job, number of reducers not set correctlyMajor.Namit JainNamit Jain
HADOOP-4355hive 2 case sensitivity issuesMajor.Zheng Shao
HADOOP-4344Hive: Partition pruning causes semantic exception with joinsMajor.Ashish ThusooAshish Thusoo
HADOOP-4342[hive] bug in partition pruningMajor.Namit JainNamit Jain
HADOOP-4336fix sampling bug in fractional bucket caseMinor.Prasad ChakkaPrasad Chakka
HADOOP-4335FsShell -ls fails for file systems without owners or groupsMajorscriptsDavid PhillipsDavid Phillips
HADOOP-4333add ability to drop partitions through DDLMinor.Prasad ChakkaPrasad Chakka
HADOOP-4330Hive: AS clause with subqueries having group bys is not propogated to the outer query blockMajor.Ashish ThusooAshish Thusoo
HADOOP-4329Hive: [] operator with maps does not workMajor.Ashish ThusooAshish Thusoo
HADOOP-4327Create table hive does not set delimetersMajor.Edward CaprioloNamit Jain
HADOOP-4321Document the capacity scheduler in ForrestBlockerdocumentationHemanth YamijalaHemanth Yamijala
HADOOP-4320[Hive] TCTLSeparatedProtocol implement maps/lists/sets read/writesMajor.Pete Wyckoff
HADOOP-4319fuse-dfs dfs_read function may return less than the requested #of bytes even if EOF not reachedBlocker.Pete WyckoffPete Wyckoff
HADOOP-4316[Hive] extra new lines at outputMajor.Namit JainNamit Jain
HADOOP-4315Hive: Cleanup temporary files once the job is doneMajor.Ashish ThusooAshish Thusoo
HADOOP-4309eclipse-plugin no longer compiles on trunkBlockercontrib/eclipse-pluginChris DouglasChris Douglas
HADOOP-4303Hive: trim and rtrim UDFs behaviors are reversedMajor.Ashish ThusooAshish Thusoo
HADOOP-4302TestReduceFetch fails intermittentlyBlocker.Devaraj DasChris Douglas
HADOOP-4299Unable to access a file by a different user in the same group when permissions is set to 770 or when permissions is turned OFFBlocker.Ramya SunilHairong Kuang
HADOOP-4296Spasm of JobClient failures on successful jobs every once in a whileBlocker.Joydeep Sen Sarmadhruba borthakur
HADOOP-4294Hive: Parser should pass field schema to SerDeMajor.Zheng Shao
HADOOP-4288java.lang.NullPointerException is observed in Jobtracker log while call heartbeatBlocker.Karam SinghAmar Kamat
HADOOP-4287[mapred] jobqueue_details.jsp shows negative count of running and waiting reduces with CapacityTaskScheduler.Blocker.Vinod Kumar VavilapalliSreekanth Ramakrishnan
HADOOP-4282User configurable filter fails to filter accesses to certain directoriesBlocker.Kan ZhangTsz Wo Nicholas Sze
HADOOP-4280test-libhdfs consistently fails on trunkBlocker.Raghu AngadiPete Wyckoff
HADOOP-4278TestDatanodeDeath failed occasionallyBlocker.Tsz Wo Nicholas Szedhruba borthakur
HADOOP-4275New public methods added to the *ID classesMajor.Owen O'MalleyOwen O'Malley
HADOOP-4274Capacity scheduler's implementation of getJobs modifies the list of running jobs inadvertentlyBlocker.Hemanth YamijalaHemanth Yamijala
HADOOP-4273[Hive] job submission exception if input is nullMajor.Namit JainNamit Jain
HADOOP-4272Hive: metadataTypedColumnsetSerDe should check if SERIALIZATION.LIB is old columnsetSerDeMajor.Zheng ShaoPrasad Chakka
HADOOP-4269LineRecordReader.LineReader should use util.LineReaderMajorutilChris DouglasChris Douglas
HADOOP-4267TestDBJob failed on LinuxBlocker.Raghu AngadiEnis Soztutar
HADOOP-4266Hive: Support “IS NULL”, “IS NOT NULL”, and size(x) for map and listMajor.Zheng ShaoZheng Shao
HADOOP-4265[Hive] error when user specifies the delimiterMajor.Namit JainNamit Jain
HADOOP-4261Jobs failing in the init stage will never cleanupBlocker.Amar KamatAmareshwari Sriramadasu
HADOOP-4256Remove Completed and Failed Job tables from jobqueue_details.jspBlocker.Sreekanth RamakrishnanSreekanth Ramakrishnan
HADOOP-4254Cannot setSpaceQuota to 1TBBlocker.Tsz Wo Nicholas SzeRaghu Angadi
HADOOP-4250Remove short names of serdes from Deserializer, Serializer & SerDe interface and relevant code.Major.Prasad ChakkaPrasad Chakka
HADOOP-4249Declare hsqldb.jar in eclipse pluginBlockercontrib/eclipse-pluginTsz Wo Nicholas SzeTsz Wo Nicholas Sze
HADOOP-4248Remove HADOOP-1230 API from 0.19Major.Owen O'MalleyOwen O'Malley
HADOOP-4247hadoop jar throwing exception when running examplesBlocker.Hemanth YamijalaOwen O'Malley
HADOOP-4246Reduce task copy errors may not kill it eventuallyBlocker.Amareshwari SriramadasuAmareshwari Sriramadasu
HADOOP-4242Remove an extra “;” in FSDirectoryBlocker.Tsz Wo Nicholas SzeTsz Wo Nicholas Sze
HADOOP-4241-hiveconf config parameters in hive cli should override all config variablesMajor.Joydeep Sen SarmaJoydeep Sen Sarma
HADOOP-4236JobTracker.killJob() fails to kill a job if the job is not yet initializedBlocker.Amar KamatSharad Agarwal
HADOOP-4232Race condition in JVM reuse when more than one slot becomes freeBlocker.Devaraj DasDevaraj Das
HADOOP-4225FSEditLog logs modification time instead of access time.Blocker.Konstantin ShvachkoKonstantin Shvachko
HADOOP-4213NPE in TestLimitTasksPerJobTaskSchedulerMajortestTsz Wo Nicholas SzeSreekanth Ramakrishnan
HADOOP-4209The TaskAttemptID should not have the JobTracker start timeBlocker.Owen O'MalleyAmar Kamat
HADOOP-4200Hadoop-Patch build is failingMajorbuildRamya SunilRamya Sunil
HADOOP-4197Need to update DATA_TRANSFER_VERSIONMajor.Tsz Wo Nicholas SzeTsz Wo Nicholas Sze
HADOOP-4195SequenceFile.Writer close() uses compressor after returning it to CodecPool.MajorioHong TangArun C Murthy
HADOOP-4189HADOOP-3245 is incompleteBlocker.Amar KamatAmar Kamat
HADOOP-4183select * to console issues in HiveMajor.Joydeep Sen Sarma
HADOOP-4175Incorporate metastore server review commentsMajor.Prasad ChakkaPrasad Chakka
HADOOP-4173TestProcfsBasedProcessTree failing on Windows machineMajortest, utilRamya SunilVinod Kumar Vavilapalli
HADOOP-4169‘compressed’ keyword in DDL syntax misleading and does not compressMajor.Joydeep Sen SarmaJoydeep Sen Sarma
HADOOP-4163If a reducer failed at shuffling stage, the task should fail, not just logging an exceptionBlocker.Runping QiSharad Agarwal
HADOOP-4155JobHisotry::JOBTRACKER_START_TIME is not initialized properlyMajor.Lohit VijayarenuLohit Vijayarenu
HADOOP-4154Fix javac warning in WritableUtilsMinorioTsz Wo Nicholas SzeTsz Wo Nicholas Sze
HADOOP-4149JobQueueJobInProgressListener.jobUpdated() might not work as expectedBlocker.Amar KamatAmar Kamat
HADOOP-4147Remove JobWithTaskContext from JobInProgressTrivial.Amar KamatAmareshwari Sriramadasu
HADOOP-4146[Hive] null pointer exception on a joinMajor.Namit JainNamit Jain
HADOOP-4139[Hive] multi group by statement is not optimizedMajor.Namit JainNamit Jain
HADOOP-4135change max length of database columns for metastore to 767Minor.Prasad ChakkaPrasad Chakka
HADOOP-4133remove derby.log files form repository and also change the location where these files get createdMinor.Prasad ChakkaPrasad Chakka
HADOOP-4129Memory limits of TaskTracker and Tasks should be in kiloBytes.Blocker.Vinod Kumar VavilapalliVinod Kumar Vavilapalli
HADOOP-4125Reduce cleanup tip web ui is does not show attemptsMajor.Amareshwari SriramadasuAmareshwari Sriramadasu
HADOOP-4121HistoryViewer initialization failure should log exception traceTrivial.Amareshwari SriramadasuAmareshwari Sriramadasu
HADOOP-4112Got ArrayOutOfBound exception while analyzing the job historyMajor.Amar KamatAmareshwari Sriramadasu
HADOOP-4100Scheduler.assignTasks should not be dealing with cleanupTaskMajor.Devaraj DasAmareshwari Sriramadasu
HADOOP-4099HFTP interface compatibility with older releases brokenBlockerfsKan Zhangdhruba borthakur
HADOOP-4097Hive interaction with speculative execution is brokenCritical.Joydeep Sen SarmaJoydeep Sen Sarma
HADOOP-4093[Hive]unify Table.getCols() & get_fields()Major.Prasad ChakkaPrasad Chakka
HADOOP-4089Check if the tmp file used in the CLI exists before using it.Major.Ashish Thusoo
HADOOP-4087Make Hive metastore server to work for PHP & Python clientsMajor.Prasad ChakkaPrasad Chakka
HADOOP-4078TestKosmosFileSystem fails on trunkBlockerfsAmareshwari SriramadasuLohit Vijayarenu
HADOOP-4077Access permissions for setting access times and modification times for filesBlocker.dhruba borthakurdhruba borthakur
HADOOP-4071FSNameSystem.isReplicationInProgress should add an underReplicated block to the neededReplication queue using method “add” not “update”Major.Hairong KuangHairong Kuang
HADOOP-4054During edit log loading, an underconstruction file's lease gets removed twiceMajor.Hairong KuangHairong Kuang
HADOOP-4050TestFairScheduler failed on LinuxMajor.Tsz Wo Nicholas SzeMatei Zaharia
HADOOP-4036Increment InterTrackerProtocol version number due to changes in HADOOP-3759Major.Hemanth YamijalaHemanth Yamijala
HADOOP-4030LzopCodec shouldn't be in the default list of codecs i.e. io.compression.codecsMajorioArun C MurthyArun C Murthy
HADOOP-4027When streaming utility is run without specifying mapper/reducer/input/output options, it returns 0.Major.Ramya Sunil
HADOOP-4023javadoc warnings: incorrect referencesMajordocumentationTsz Wo Nicholas SzeOwen O'Malley
HADOOP-4018limit memory usage in jobtrackerMajor.dhruba borthakurdhruba borthakur
HADOOP-4014DFS upgrade fails on WindowsBlockerfsNOMURA YoshihideKonstantin Shvachko
HADOOP-3991updates to hadoop-ec2-env.sh for 0.18.0Minorcontrib/cloudKarl AndersonTom White
HADOOP-3985TestHDFSServerPorts fails on trunkMajor.Amar KamatHairong Kuang
HADOOP-3970Counters written to the job history cannot be recovered backMajor.Amar KamatAmar Kamat
HADOOP-3968test-libhdfs fails on trunkMajor.Lohit VijayarenuPete Wyckoff
HADOOP-3964javadoc warnings by failmonMajorbuildTsz Wo Nicholas Szedhruba borthakur
HADOOP-3962Shell command “fs -count” should support paths with different file systsmsMajorfsTsz Wo Nicholas SzeTsz Wo Nicholas Sze
HADOOP-3961resource estimation works badly in some casesBlocker.Ari RabkinAri Rabkin
HADOOP-3959[HOD] --resource_manager.options is not passed to qsubMajorcontrib/hodCraig MacdonaldVinod Kumar Vavilapalli
HADOOP-3958TestMapRed ignores failures of the test caseMajortestOwen O'MalleyOwen O'Malley
HADOOP-3957Fix javac warnings in DistCp and the corresponding testsMinor.Tsz Wo Nicholas SzeTsz Wo Nicholas Sze
HADOOP-3954Skip records enabled as default.Critical.Koji NoguchiSharad Agarwal
HADOOP-3952TestDataJoin references dfs.MiniDFSCluster instead of hdfs.MiniDFSClusterMajortestOwen O'MalleyOwen O'Malley
HADOOP-3951The package name used in FSNamesystem is incorrectTrivial.Tsz Wo Nicholas SzeChris Douglas
HADOOP-3950TestMapRed and TestMiniMRDFSSort failed on trunkMajortestTsz Wo Nicholas SzeEnis Soztutar
HADOOP-3949javadoc warnings: Multiple sources of package comments found for packageMajorbuild, documentationTsz Wo Nicholas SzeJerome Boulon
HADOOP-3946TestMapRed fails on trunkBlockertestAmareshwari SriramadasuTom White
HADOOP-3937Job history may get disabled due to overly long job namesMajor.Matei ZahariaMatei Zaharia
HADOOP-3933DataNode's BlockSender sends more data than necessaryMinor.Ning LiNing Li
HADOOP-3919hadoop conf got slightly mangled by 3772Minor.Ari RabkinAri Rabkin
HADOOP-3910Are ClusterTestDFSNamespaceLogging and ClusterTestDFS still valid tests?MinortestTsz Wo Nicholas SzeTsz Wo Nicholas Sze
HADOOP-3907INodeDirectoryWithQuota should be in its own .java fileMinor.Steve LoughranTsz Wo Nicholas Sze
HADOOP-3904A few tests still using old hdfs package nameMinortestTsz Wo Nicholas SzeTsz Wo Nicholas Sze
HADOOP-3875Fix TaskTracker's heartbeat timer to note the time the hearbeat RPC returned to decide next heartbeat timeMajor.Arun C MurthyArun C Murthy
HADOOP-3864JobTracker lockup due to JobInProgress.initTasks taking significant time for large jobs on large clustersCritical.Arun C MurthyArun C Murthy
HADOOP-3863Use a thread-local rather than static ENCODER/DECODER variables in Text for synchronizationCritical.Arun C MurthyArun C Murthy
HADOOP-3851spelling error in FSNamesystemMetrics log messageTrivial.Steve LoughranSteve Loughran
HADOOP-3848TaskTracker.localizeJob calls getSystemDir for each task rather than caching itMajor.Arun C MurthyArun C Murthy
HADOOP-3846CreateEditsLog used for benchmark misses creating parent directoriesMinorbenchmarksLohit VijayarenuLohit Vijayarenu
HADOOP-3836TestMultipleOutputs will fail if it is ran more than one timesMajortestTsz Wo Nicholas SzeAlejandro Abdelnur
HADOOP-3831slow-reading dfs clients do not recover from datanode-write-timeoutsMajor.Christian KunzRaghu Angadi
HADOOP-3820gridmix-env has a syntax error, and wrongly defines USE_REAL_DATASET by defaultMajorbenchmarksArun C MurthyArun C Murthy
HADOOP-3819can not get svn revision # at build time if locale is not englishMinorbuildRong-En FanRong-En Fan
HADOOP-3816KFS changes for faster directory listingMinorfsSriram RaoSriram Rao
HADOOP-3814[HOD] Remove dfs.client.buffer.dir generation, as this is removed in Hadoop 0.19.Blockercontrib/hodHemanth YamijalaVinod Kumar Vavilapalli
HADOOP-3795NameNode does not save image if different dfs.name.dir have different checkpoint stampsMajor.Lohit VijayarenuLohit Vijayarenu
HADOOP-3785FileSystem cache should be case-insensitiveMajorfsDoug CuttingBill de hOra
HADOOP-3783“deprecated filesystem name” warning on EC2Minorcontrib/cloudStuart SierraTom White
HADOOP-3778seek(long) in DFSInputStream should catch socket exception for retry laterMinor.Luo NingLuo Ning
HADOOP-3777Failure to load native lzo libraries causes job failureMajor.Chris DouglasChris Douglas
HADOOP-3776NPE in NameNode with unknown blocksBlocker.Raghu AngadiRaghu Angadi
HADOOP-3773Setting the conf twice in Pipes SubmitterTrivial.Koji NoguchiKoji Noguchi
HADOOP-3756dfs.client.buffer.dir isn‘t used in hdfs, but it’s still in conf/hadoop-default.xmlTrivial.Michael BieniosekRaghu Angadi
HADOOP-3732Block scanner should read block information during initialization.Blocker.Konstantin ShvachkoRaghu Angadi
HADOOP-3728Cannot run more than one instance of examples.SleepJob at the same time.Minor.Brice ArnouldBrice Arnould
HADOOP-3726TestCLI loses exception details on setup/teardownMinortestSteve LoughranSteve Loughran
HADOOP-3725TestMiniMRMapRedDebugScript loses exception detailsMinortestSteve LoughranSteve Loughran
HADOOP-3723libhdfs only accepts O_WRONLY and O_RDONLY so does not accept things like O_WRONLY | O_CREATMinor.Pete WyckoffPi Song
HADOOP-3720dfsadmin -refreshNodes should re-read the config file.Major.Lohit VijayarenuLohit Vijayarenu
HADOOP-3711Streaming input is not parsed properly to find the separatorMajor.Amareshwari SriramadasuAmareshwari Sriramadasu
HADOOP-3705CompositeInputFormat is unable to parse InputFormat classes with names containing ‘_’ or ‘$’Major.Jingkei LyChris Douglas
HADOOP-3658Incorrect destination IP logged for receiving blocksMinor.Koji NoguchiChris Douglas
HADOOP-3643jobtasks.jsp when called for running tasks should ignore completed TIPsMajor.Amar KamatAmar Kamat
HADOOP-3640NativeS3FsInputStream read() method for reading a single byte is incorrectMajorfs/s3Tom WhiteTom White
HADOOP-3623LeaseManager needs refactoring.Major.Konstantin ShvachkoTsz Wo Nicholas Sze
HADOOP-3592org.apache.hadoop.fs.FileUtil.copy() will leak input streams if the destination can't be openedMinorfsSteve LoughranBill de hOra
HADOOP-3570Including user specified jar files in the client side classpath path in Hadoop 0.17 streamingMajor.Suhas GogateSharad Agarwal
HADOOP-3560Archvies sometimes create empty part files.Blocker.Mahadev konarMahadev konar
HADOOP-3543Need to increment the year field for the copyright noticeTrivialdocumentationChris DouglasChris Douglas
HADOOP-3542Hadoop archives should not create _logs file in the final archive directory.Blocker.Mahadev konarMahadev konar
HADOOP-3528Metrics FilesCreated and files_deleted metrics do not match.BlockermetricsLohit VijayarenuLohit Vijayarenu
HADOOP-3506Occasional NPE in Jets3tFileSystemStoreMajorfs/s3RobertTom White
HADOOP-3488the rsync command in hadoop-daemon.sh also rsync the logs folder from the master, what deletes the datanode / tasktracker log files.CriticalscriptsStefan GroschupfCraig Macdonald
HADOOP-3319[HOD]checknodes prints errors messages on stdoutMajorcontrib/hodVinod Kumar VavilapalliVinod Kumar Vavilapalli
HADOOP-3155reducers stuck at shufflingBlocker.Runping Qidhruba borthakur
HADOOP-3131enabling BLOCK compression for map outputs breaks the reduce progress countersMajor.Colin EvansMatei Zaharia
HADOOP-3076[HOD] If a cluster directory is specified as a relative path, an existing script.exitcode file will not be deleted.Blockercontrib/hodHemanth YamijalaVinod Kumar Vavilapalli
HADOOP-2403JobHistory log files contain data that cannot be parsed by org.apache.hadoop.mapred.JobHistoryCritical.Runping QiAmareshwari Sriramadasu
HADOOP-2168Pipes with a C++ record reader does not update progress in the map until it is 100%Major.Owen O'MalleyArun C Murthy
HADOOP-1945pipes examples aren't in the releaseMajor.Owen O'MalleyOwen O'Malley
HADOOP-9mapred.local.dir temp dir. space allocation limited by smallest areaMinor.Paul BaclaceAri Rabkin

TESTS:

JIRASummaryPriorityComponentReporterContributor
HADOOP-4464Separate testClientTriggeredLeaseRecovery() out from TestFileCreationBlockertestTsz Wo Nicholas SzeTsz Wo Nicholas Sze
HADOOP-4426TestCapacityScheduler is brokenBlocker.Hemanth YamijalaHemanth Yamijala
HADOOP-4390Hive: test for case sensitivity in serde2 thrift serdeMinor.Zheng Shao
HADOOP-4259findbugs should run over the tools.jar alsoMinortestOwen O'MalleyChris Douglas
HADOOP-4237TestStreamingBadRecords.testNarrowDown fails intermittentlyMinortestSharad AgarwalSharad Agarwal
HADOOP-4069TestKosmosFileSystem can fail when run through ant test on systems shared by usersMinorfsHemanth YamijalaLohit Vijayarenu
HADOOP-4056Unit test for DynamicSerDeMinor.Pete WyckoffPete Wyckoff
HADOOP-3966Place the new findbugs warnings introduced by the patch in the /tmp directory when “ant test-patch” is run.MinortestRamya SunilRamya Sunil
HADOOP-3790Add more unit tests to test appending to files in HDFSBlockertestdhruba borthakurTsz Wo Nicholas Sze
HADOOP-3587contrib/data_join needs unit testsMajortestChris DouglasChris Douglas

SUB-TASKS:

JIRASummaryPriorityComponentReporterContributor
HADOOP-4354Separate TestDatanodeDeath.testDatanodeDeath() into 4 testsBlockertestTsz Wo Nicholas SzeTsz Wo Nicholas Sze
HADOOP-4186Move LineRecordReader.LineReader class to util packageMajor.Tom WhiteTom White
HADOOP-4184Fix simple module dependencies between core, hdfs and mapredMajor.Tom WhiteTom White
HADOOP-3824Refactor org.apache.hadoop.mapred.StatusHttpServerMajor.Tsz Wo Nicholas SzeTsz Wo Nicholas Sze

OTHER:

JIRASummaryPriorityComponentReporterContributor
HADOOP-3601Hive as a contrib projectMinor.Joydeep Sen SarmaAshish Thusoo
HADOOP-4105libhdfs wiki is very out-of-date and contains mostly broken linksMinordocumentationPete WyckoffPete Wyckoff
HADOOP-4076fuse-dfs REAME lists wrong ant flags and is not specific in some placeMajor.Pete WyckoffPete Wyckoff
HADOOP-3942Update DistCp documentationBlockerdocumentationTsz Wo Nicholas SzeTsz Wo Nicholas Sze
HADOOP-3791Use generics in ReflectionUtilsTrivial.Chris SmithChris Smith