Mesos VMware Image with Hadoop Pre-installed


Last Updated: August 2011

DOWNLOAD

Note: This VM download is approximately 1.7GB. Feel free to mirror it internally or externally to minimize bandwidth usage. The uncompressed VM requires about 6GB of disk space.

To make it easy for you to get started with Apache Mesos, we created a virtual machine with everything you need. Our VM runs Ubuntu 10.04 LTS 64-bit (Lucid Lynx) and Mesos with Apache Hadoop 0.20.2.

To launch the VMware image, you will need either VMware Player for Windows and Linux, or VMware Fusion for Mac. (Note that VMware Fusion only works on Intel architectures, so older Macs with PowerPC processors cannot run the VM.)

Once you launch the VM, log in with the following account details:

  • username: hadoop
  • password: hadoop

The hadoop account has sudo privileges in the VM.

1. Testing Mesos

  • Run the Mesos test suite:
~/mesos$ bin/tests/all-tests   
[==========] Running 61 tests from 6 test cases.  
[----------] Global test environment set-up.  
[----------] 18 tests from MasterTest  
[ RUN      ] MasterTest.ResourceOfferWithMultipleSlaves  
[       OK ] MasterTest.ResourceOfferWithMultipleSlaves (33 ms)  
[ RUN      ] MasterTest.ResourcesReofferedAfterReject  
[       OK ] MasterTest.ResourcesReofferedAfterReject (3 ms)  
  
[ ... trimmed ... ]  
  
[ RUN      ] MasterTest.MultipleExecutors  
[       OK ] MasterTest.MultipleExecutors (2 ms)  
[----------] 18 tests from MasterTest (38 ms total)  
  
[----------] Global test environment tear-down  
[==========] 61 tests from 6 test cases ran. (17633 ms total)  
[  PASSED  ] 61 tests.   
  YOU HAVE 3 DISABLED TESTS    
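
To re-run only part of the suite, note that the harness is built on Google Test, so its standard test filter flag should work (an illustrative sketch; adjust the test name pattern as needed):

  ~/mesos$ bin/tests/all-tests --gtest_filter='MasterTest.*'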

2. Start all your frameworks!

  • Start Mesos's Master:
    ~/mesos$ bin/mesos-master &
  • Start Mesos's Slave:
    ~/mesos$ bin/mesos-slave --url=mesos://master@localhost:5050 &
  • Start Hadoop's namenode:
    ~/mesos/frameworks/hadoop-0.20.2$ bin/hadoop namenode &
  • Start Hadoop's datanode:
    ~/mesos/frameworks/hadoop-0.20.2$ bin/hadoop datanode &
  • Start Hadoop's jobtracker:
    ~/mesos/frameworks/hadoop-0.20.2$ bin/hadoop jobtracker &
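
Once everything is started, it is worth confirming that the daemons are actually running. A quick sanity check (a sketch; it assumes the JDK's jps tool is on the PATH, which it is whenever Hadoop itself can run):

  ~/mesos$ jps                      # should list NameNode, DataNode, and JobTracker
  ~/mesos$ ps aux | grep mesos      # the Mesos master and slave are native processes, so check them with ps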

3. Run the MapReduce job:

You will now run your first Hadoop MapReduce job. We will use the WordCount example, which reads text files and counts how often each word occurs. The input and output are both text files; each line of the output contains a word and the number of times it occurred, separated by a tab.

  • Run the “wordcount” example MapReduce job:
    ~/mesos/frameworks/hadoop-0.20.2$ bin/hadoop jar build/hadoop-0.20.3-dev-examples.jar wordcount /user/hadoop/g /user/hadoop/output
  • You will see something like the following:

11/07/19 15:34:29 INFO input.FileInputFormat: Total input paths to process : 6
11/07/19 15:34:29 INFO mapred.JobClient: Running job: job_201107191533_0001
11/07/19 15:34:30 INFO mapred.JobClient: map 0% reduce 0%
11/07/19 15:34:43 INFO mapred.JobClient: map 16% reduce 0%

[ ... trimmed ... ]
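
When the job finishes, you can inspect the results directly from HDFS. For example (a sketch; the output directory matches the one passed on the command line above, and the exact part file name may differ):

  ~/mesos/frameworks/hadoop-0.20.2$ bin/hadoop fs -ls /user/hadoop/output
  ~/mesos/frameworks/hadoop-0.20.2$ bin/hadoop fs -cat /user/hadoop/output/part-r-00000

Each line of the output contains a word followed by its count, separated by a tab.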

Click the Firefox Web Browser icon on the panel to view Mesos's documentation. The browser also provides the following bookmarks: