blob: b7fe4bb0674cf65a7f2e4456abceae8157efaf4e [file] [log] [blame]
SmallJobsBenchmark Readme :
Building the benchmark.
to build -
$ cd smallJobsBenchmark
$ ant deploy
Running the benchmark
$ $HADOOP_HOME/bin/hadoop jar <path to MRBenchmark.jar> smallJobsBenchmark -inputLines <numLines> -maps <numMaps> -reduces <numReduces> -jar <path to MRBenchmark.jar>
For running the benchmark multiple times with different input sizes, maps, reduces following helper scripts can be used.
$ cd build/contrib/smallJobsBenchmark
$ bin/run.sh
after successfully running the benchmark see logs/report.txt for consolidated output of all the runs.
change this script to configure options.
Configurable options are -
-inputLines noOfLines
no of lines of input to generate.
-inputType (ascending, descending, random)
type of input to generate.
-jar jarFilePath
Jar file containing Mapper and Reducer implementations in jar file. By default ant build creates MRBenchmark.jar file containing default Mapper and Reducer.
-times numJobs
No of times to run each MapReduce task, time is calculated as average of all runs.
-workDir dfsPath
DFS path to put output of MR tasks.
-maps numMaps
No of maps for wach task
-reduces numReduces
No of reduces for each task
-ignoreOutput
Doesn't copy the output back to local disk. Otherwise it creates the output back to a temp location on local disk.