blob: db6ad2de82ee1c45e2611815625d6095db06f212 [file] [log] [blame]
#
# This is a readme for hadoop-version-dfs-dir.tgz and hadoop-dfs-dir.txt.
#
# See HADOOP-1629 for more info if needed.
# These two files are used by unit test TestDFSUpgradeFromImage.java
#
# hadoop-14-dfs-dir.tgz :
# ---------------------
# This file contains the HDFS directory structure for one namenode and 4 datanodes.
# The structure is setup similar to the structure used in MiniDFSCluster.
# The directory was created with Hadoo-0.14.x.
#
# In the test, this directory is unpacked and MiniDFSCluster is run with
# "-upgrade" option. The test waits for the upgrade to complete
# (leave safe mode) and then all the files are read. The test checks that the
# directory structure and file checksums exactly match the information
# in this file.
#
# hadoop-dfs-dir.txt :
# ---------------------
# Along with this description this file contains the expected files and
# checksums or the files in the upgraded DFS.
#
# The original DFS directory was created with various types of files and with
# some recoverable errors (i.e. corrupt or missing .crc files).
#
# A similar set of files exist in two different DFS directories.
# For e.g. "top-dir-1Mb-512" contains files created with dfs.block.size of 1Mb
# and io.bytes.per.checksum of 512.
#
# In the future, when Hadoop project no longer supports upgrade from
# Hadoop-0.12, then a new DFS directory image must be created.
#
# To generate checksum info for new files :
# ---------------------------------------
# Uncomment the last coment (starts with "printChecksums") and run the
# test again. When the test sees this line, it prints the checksum
# information that should replace the checksum information in
# this file. When run in this mode, the test will fail with a descriptive IOException.
#
# Next, extract the checksum info from the test log like this:
# sed -n 's/.*CRC info for reference file : //p' test-log.txt >> this_file
# This will append a new list of files and checksums to this file. Be sure to remove the existing checksum info.
#
# For your reference, the format of the checksum info below is "filename whitespace*\twhitespace* checksum\n"
#
# Uncomment the following line to produce checksum info for a new DFS image.
#printChecksums
/1kb-multiple-checksum-blocks-64-16 191893480
/top-dir-120000-60/1Mb-file 4079112547
/top-dir-120000-60/4k-file 3716287280
/top-dir-120000-60/5Mb-file 2563834633
/top-dir-120000-60/directory1/500thousand-file 3036538664
/top-dir-120000-60/directory1/file-with-corrupt-crc 1984689737
/top-dir-120000-60/directory1/file-with-no-crc 4004594475
/top-dir-120000-60/directory1/zero1 0
/top-dir-120000-60/zerolen 0
/top-dir-1Mb-512/1Mb-file 4079112547
/top-dir-1Mb-512/4k-file 3716287280
/top-dir-1Mb-512/5Mb-file 2563834633
/top-dir-1Mb-512/directory1/500thousand-file 3036538664
/top-dir-1Mb-512/directory1/file-with-corrupt-crc 1984689737
/top-dir-1Mb-512/directory1/file-with-no-crc 4004594475
/top-dir-1Mb-512/directory1/zero1 0
/top-dir-1Mb-512/zerolen 0
overallCRC 1419480698