blob: 1c3ca86ba5a4258355d5d33a57e56a4ac4a4755b [file] [log] [blame]
<?xml version="1.0" encoding="iso-8859-1"?>
<!--
Licensed to the Apache Software Foundation (ASF) under one or more
contributor license agreements. See the NOTICE file distributed with
this work for additional information regarding copyright ownership.
The ASF licenses this file to You under the Apache License, Version 2.0
(the "License"); you may not use this file except in compliance with
the License. You may obtain a copy of the License at
http://www.apache.org/licenses/LICENSE-2.0
Unless required by applicable law or agreed to in writing, software
distributed under the License is distributed on an "AS IS" BASIS,
WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
See the License for the specific language governing permissions and
limitations under the License.
-->
<document xmlns="http://maven.apache.org/XDOC/2.0"
xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance"
xsi:schemaLocation="http://maven.apache.org/XDOC/2.0 http://maven.apache.org/xsd/xdoc-2.0.xsd">
<properties></properties>
<title>a general BSP framework on top of Hadoop</title>
<body>
<section name="Apache Hama"></section>
<p>
<strong>Apache Hama</strong> is a pure BSP (Bulk Synchronous Parallel) computing framework on top of HDFS (Hadoop Distributed File System) for massive scientific computations
such as matrix, graph and network algorithms.</p>
<div class="green">
<h3 align="center">Recent News</h3>
<ul>
<li>June 26, 2013: release 0.6.2. available</li>
<li>April 01, 2013: release 0.6.1. available</li>
<li>November 28, 2012: release 0.6.0 available</li>
<li>June 31, 2012: release 0.5.0 available</li>
<li>May 17, 2012: Apache Hama graduated as a Top Level Project!</li>
</ul>
</div>
<subsection name="Why Hama and BSP?"></subsection>
<p>Today, many practical data processing applications require a more flexible programming abstraction model that is compatible to run on highly scalable and massive data systems (e.g., HDFS, HBase, etc). A message passing paradigm beyond Map-Reduce framework would increase its flexibility in its communication capability. Bulk Synchronous Parallel (BSP) model fills the bill appropriately. Some of its significant advantages over MapReduce and MPI are:</p>
<ul>
<li>Supports message passing paradigm style of application development</li>
<li>Provides a flexible, simple, and easy-to-use small APIs</li>
<li>Enables to perform better than MPI for communication-intensive applications</li>
<li>Guarantees impossibility of deadlocks or collisions in the communication mechanisms</li>
</ul>
<subsection name="Getting Started"></subsection>
<p>
Start by installing Hama on a Hadoop cluster.
<br/>
</p>
<ul>
<li><a href="downloads.html">Download</a> Hama from the release page.</li>
<li><a href="getting_started_with_hama.html">Getting Started</a> with Hama.</li>
<li><a href="hama_on_clouds.html">Launch a Hama cluster</a> on Clouds.</li>
<li>Hama <a href="hama_bsp_tutorial.html">BSP Tutorial.</a></li>
<li>Hama <a href="hama_graph_tutorial.html">Graph Tutorial.</a></li>
<li><a href="http://wiki.apache.org/hama">Learn about</a> Hama and BSP by reading the documentation.</li>
</ul>
<subsection name="Getting Involved"></subsection>
<p>
Hama is an open source volunteer project under the Apache Software Foundation. We encourage you to learn about the project and contribute your expertise.
Here are some starter links: <br/>
</p>
<ul>
<li>See our <a href="http://wiki.apache.org/hama/HowToContribute">How to Contribute to Hama</a> page</li>
<li><a href="http://wiki.apache.org/hama/Jira">Jira usage guidelines</a></li>
</ul>
</body>
</document>