blob: 21fea6da6d52388a670fb4718f9e36b37a271c31 [file] [log] [blame]
Ambari
======
- Ambari provides the GUI frontend that would help users install, manage and monitor the Hadoop cluster.
- It uses CSS, Javascript, and HTML on client side. We use the existing proven
open-source technologies in the backend e.g. PHP, Puppet for deploying and
configuring the Hadoop services, Nagios & Ganglia for monitoring alerts and
metrics graphs respectively.
Getting Ambari
==============
The source can be checked out anonymously from SVN with this command:
$ svn checkout http://svn.apache.org/viewvc/incubator/ambari/branches/ambari-186 ambari
How to build Ambari
===================
You need to build a AMBARI rpm from the source code that you checkout above:
$ yum install rpm-build
$ cd hmc; // The new directory structure
$ cd package/rpm
$ ./create_hmc_rpm.sh
When it succeeds, you will find two RPMs inside the build directory package/rpm/build/rpmbuild/RPMS/noarch/:
1) ambari-1.0.0-1.noarch.rpm
2) ambari-agent-1.0.0-1.noarch.rpm
Steps to run before you use Ambari for installing Hadoop on your cluster
========================================================================
The node on which you will run the Ambari web-server will be henceforth referred
to as Ambari master and the nodes which will be used to run Hadoop software will
be called as cluster nodes.
1) Set up password-less SSH for root to all your cluster nodes.
- Make sure you copy root's ssh public keys to all the cluster hosts.
- You will have to copy the ssh private key to the Ambari node for installing
Hadoop on your cluster nodes from the UI.
Make sure you copy root's ssh public keys to all the cluster hosts.
You will have to copy the ssh private key to your desktop for later use in the UI.
2) Install the above built ambari-agent-1.0.0-1.noarch.rpm on each of the nodes by running
$ sudo rpm -Uvh ambari-agent-1.0.0-1.noarch.rpm
3) Stop ip-tables on your Ambari master:
$ sudo service iptables stop
How to install Ambari
=====================
Ambari has few external runtime dependencies, most important of which are
1) puppet
2) php
3) ruby
To simplify the installation of dependencies, you need to enable the following
yum repos. For that, you can simply install the corresponding RPMs.
1) EPEL repo
2) Puppet passenger
3) Hadoop repo
So, you should do the following at the command line:
$ sudo rpm -Uvh http://download.fedoraproject.org/pub/epel/5/[i386|x86_64]/epel-release-5-4.noarch.rpm
$ sudo rpm -Uvh http://passenger.stealthymonkeys.com/rhel/5/passenger-release.noarch.rpm
$ sudo rpm -Uvh http://public-repo-1.hortonworks.com/HDP-1.0.13/repos/centos5/hdp-release-1.0.13-1.el5.noarch.rpm
Now, to install ambari-rpms
$ sudo yum install php-pecl-json (version 1.2.1 compatible with php-5.1 or php-5.2)
$ sudo rpm -iv ambari-1.0.0-1.noarch.rpm
Hadoop required JDK, so you can download it from the Oracle website
http://download.oracle.com/otn-pub/java/jdk/6u26-b03 . Ambari needs the following
two files to be available on the Ambari master node under /var/run/hmc/downloads/
:
1) jdk-6u26-linux-x64.bin
2) jdk-6u26-linux-i586.bin
How to run Ambari
=================
$ sudo service ambari start
You have installed ambari by now. You can start deploying Apache Hadoop and
ecosystem components on your cluster nodes by visiting:
http://AMBARIMASTER/hmc/html/index.php