blob: 3f8c08ebd3f4febf23aea34eeb5797b84734f5d3 [file] [log] [blame]
<!DOCTYPE html>
<html lang="en">
<head>
<meta http-equiv="Content-Type" content="text/html; charset=utf-8"/>
<title>Apache SAMOA</title>
<link rel="icon" type="image/x-icon" href="/favicon.ico" />
<link rel="stylesheet" type="text/css" href="css/pages.css" media="all">
</head>
<body class="home">
<section id="hero-spot" class="hero-spot">
<h1>Apache SAMOA</h1>
<h2>Scalable Advanced Massive Online Analysis</h2>
<h3>Apache SAMOA is currently undergoing incubation at the Apache Software Foundation.
</br>Latest source release: <a href="https://www.apache.org/dist/incubator/samoa/0.3.0-incubating"> 0.3.0-incubating</a>
</br>View on <a href=" https://github.com/apache/incubator-samoa">GitHub</a>.
</h3>
<div id="slideshow">
<img src="images/slideshow/runtime.png" alt="Apache SAMOA" class="slide active" height= "300" />
</div>
</section>
<section id="tutorial" class="next-steps">
<h1><a href="http://incubator.apache.org/"><img style="max-width:55%;border:0px solid black;"src="http://incubator.apache.org/images/egg-logo.png" alt="Apache Incubator">
</a>Apache SAMOA is a distributed streaming machine learning (ML) framework that contains a programing abstraction for distributed streaming ML algorithms.</h1>
<h2>Apache SAMOA enables development of new ML algorithms without directly dealing with the complexity of underlying distributed stream processing engines (DSPEe, such as Apache Storm, Apache S4, and Apache Samza).
Apache SAMOA users can develop distributed streaming ML algorithms once and execute them on multiple DSPEs.</h2>
<ul class="next-steps wrapper">
<li class="jekyll">
<a class="hero-octicon" href="#build">
<span class="mega-octicon octicon-tools"></span>
</a>
<h4><a href="#build">Build Apache SAMOA</a></h4>
<p>Build Apache SAMOA for Apache Storm, Apache S4, Apache Samza, or Local mode.</p>
</li>
<li class="custom-urls">
<a class="hero-octicon" href="#getting-started">
<span class="mega-octicon octicon-key"></span>
</a>
<h4><a href="#getting-started">Getting started!</a></h4>
<p>Hands-on with Apache SAMOA: Getting Started in 5 minutes!</p>
</li>
<li class="guides">
<a class="hero-octicon" href="documentation/Home.html">
<span class="mega-octicon octicon-book"></span>
</a>
<h4><a href="documentation/Home.html">Documentation</a></h4>
<p>Learn how to use Apache SAMOA in the various different ways possible.</p>
</li>
<li class="guides">
<a class="hero-octicon" href="https://cwiki.apache.org/confluence/display/SAMOA/SAMOA+Home">
<span class="mega-octicon octicon-pencil"></span>
</a>
<h4><a href="https://cwiki.apache.org/confluence/display/SAMOA/SAMOA+Home">Wiki</a></h4>
<p>Roadmap and instructions for contributors.</p>
</li>
</ul>
</section>
<section class="tutorial">
<h1>Slides</h1>
<h2><a href="https://speakerdeck.com/gdfm/samoa-a-platform-for-mining-big-data-streams-2"><img src="samoa-slides.jpg" alt="SAMOA Slides" data-canonical-src="samoa-slides.jpg" style="max-width:100%;"></a></h2>
<h2>G. De Francisci Morales, A. Bifet. "<a href="http://jmlr.csail.mit.edu/papers/volume16/morales15a/morales15a.pdf">SAMOA: Scalable Advanced Massive Online Analysis</a>." Journal of Machine Learning Research, 16(Jan):149−153, 2015.</h2>
</section>
<section class="content">
<h1>Video</h1>
<h2><iframe width="640" height="360" src="//www.youtube.com/embed/7WsDKNU1A7o?feature=player_detailpage" frameborder="0" allowfullscreen></iframe></h2>
<h2>NoSQL matters Conference, Barcelona 2013.</h2>
<h1>Apache SAMOA Developer's Guide</h1>
<h2><a href="./documentation/SAMOA-Developers-Guide-0.0.1.pdf"><img style="max-width:95%;border:3px solid black;" src="Manual.png" alt="SAMOA Developer's guide" height="250"> </a></h2>
</section>
<section class="tutorial">
<h1>API Javadoc Reference</h1>
<h2><a href="./documentation/api/current">http://samoa.incubator.apache.org/documentation/api/current</a></h2>
<h1>Mailing list</h1>
<h2>Development mailing list
<a href="mailto:dev@samoa.incubator.apache.org">dev@samoa.incubator.apache.org</a><br/>
[ <a href="mailto:dev-subscribe@samoa.incubator.apache.org">subscribe</a> |
<a href="mailto:dev-unsubscribe@samoa.incubator.apache.org">unsubscribe</a> |
<a href="http://mail-archives.apache.org/mod_mbox/incubator-samoa-dev">archives</a> ]
</h2>
<h1>Contributors</h1>
<h2><a href="documentation/Team.html">List of contributors to the SAMOA project</a>.</h2>
</section>
<section class="next-steps">
<h1>License</h1>
<h2>The use and distribution terms for this software are covered by the Apache License, Version 2.0 (<a href="http://www.apache.org/licenses/LICENSE-2.0.html">http://www.apache.org/licenses/LICENSE-2.0.html</a>).</h2>
</section>
<section id="build" class="tutorial">
<h1>Build Apache SAMOA</h1>
<!-- ### Start of tutorials -->
<ul id="user-site" class="tutorial-list wrapper active">
<li id="terminal-step-1" class="option-terminal">
<h4>Apache Storm</h4>
<p>Go to the folder where you want to store your project, and clone the new repository:</p>
<div class="terminal">
<div class="header"></div>
<div class="shell">
<p><span class="path">~</span><span class="prompt">$</span>git clone http://git.apache.org/incubator-samoa.git</p>
<p><span class="path">~</span><span class="prompt">$</span>cd incubator-samoa</p>
<p><span class="path">~</span><span class="prompt">$</span>mvn -Pstorm package</p>
</div>
</div>
<p>The deployable jar for Apache SAMOA will be in <code>target/SAMOA-Storm-0.3.0-SNAPSHOT.jar</code>.</p>
</li>
<li id="terminal-step-2" class="option-terminal">
<h4>Apache S4</h4>
<p>If you want to compile Apache SAMOA for S4, you will need to install the S4 dependencies
manually as explained in <a href="documentation/Executing-SAMOA-with-Apache-S4.html">Executing Apache SAMOA with Apache S4</a>.</p>
<div class="terminal">
<div class="header"></div>
<div class="shell">
<p><span class="path">~</span><span class="prompt">$</span>git clone http://git.apache.org/incubator-samoa.git</p>
<p><span class="path">~</span><span class="prompt">$</span>cd incubator-samoa</p>
<p><span class="path">~</span><span class="prompt">$</span>mvn -Ps4 package</p>
</div>
</div>
<p>The deployable jar for Apache SAMOA will be in <code>target/SAMOA-S4-0.3.0-SNAPSHOT.jar</code>.</p>
</li>
<li id="terminal-step-3" class="option-terminal">
<h4>Apache Samza</h4>
<p>Go to the folder where you want to store your project, and clone the new repository:</p>
<div class="terminal">
<div class="header"></div>
<div class="shell">
<p><span class="path">~</span><span class="prompt">$</span>git clone http://git.apache.org/incubator-samoa.git</p>
<p><span class="path">~</span><span class="prompt">$</span>cd incubator-samoa</p>
<p><span class="path">~</span><span class="prompt">$</span>mvn -Psamza package</p>
</div>
</div>
<p>The deployable jar for Apache SAMOA will be in <code>target/SAMOA-Samza-0.3.0-SNAPSHOT.jar</code>.</p>
</li>
<li id="terminal-step-4" class="option-terminal">
<h4>Local Test Mode</h4>
<p>If you want to test Apache SAMOA in a local environment, simply clone the repository and install Apache SAMOA.</p>
<div class="terminal">
<div class="header"></div>
<div class="shell">
<p><span class="path">~</span><span class="prompt">$</span>git clone http://git.apache.org/incubator-samoa.git</p>
<p><span class="path">~</span><span class="prompt">$</span>cd incubator-samoa</p>
<p><span class="path">~</span><span class="prompt">$</span>mvn package</p>
</div>
</div>
<p>The deployable jar for Apache SAMOA will be in <code>target/SAMOA-Local-0.3.0-SNAPSHOT.jar</code>.</p>
</li>
</ul>
<!-- End of user site tutorial -->
</section>
<section id="getting-started" class="tutorial">
<h1>Getting Started</h1>
<!-- ### Start of tutorials -->
<ul id="user-site" class="tutorial-list wrapper active">
<li id="terminal-step-1" class="option-terminal">
<h4>Download Apache SAMOA</h4>
<div class="terminal">
<div class="header"></div>
<div class="shell">
<p><span class="path">~</span><span class="prompt">$</span>git clone http://git.apache.org/incubator-samoa.git</p>
<p><span class="path">~</span><span class="prompt">$</span>cd incubator-samoa</p>
<p><span class="path">~</span><span class="prompt">$</span>mvn package</p>
</div>
</div>
</li>
<li id="terminal-step-1" class="option-terminal">
<h4>Download the Forest CoverType dataset </h4>
<p>If you want to compile Apache SAMOA for S4, you will need to install the S4 dependencies manually as explained in <a href="documentation/Executing-SAMOA-with-Apache-S4">Executing Apache SAMOA with Apache S4</a>.</p>
<div class="terminal">
<div class="header"></div>
<div class="shell">
<p><span class="path">~</span><span class="prompt">$</span>wget "http://downloads.sourceforge.net/project/moa-datastream/Datasets/Classification/covtypeNorm.arff.zip"</p>
<p><span class="path">~</span><span class="prompt">$</span>unzip covtypeNorm.arff.zip </p>
</div>
</div>
<p><em>Forest Covertype</em> contains the forest cover type for 30 x 30 meter cells obtained from the US Forest Service (USFS) Region 2 Resource Information System (RIS) data. It contains 581,012 instances and 54 attributes, and it has been used in several articles on data stream classification.</p>
</li>
<li id="terminal-step-1" class="option-terminal">
<h4>Run an example</h4>
<p>Classifying the CoverType dataset with the bagging algorithm</p>
<div class="terminal">
<div class="header"></div>
<div class="shell">
<p><span class="path">~</span><span class="prompt">$</span>bin/samoa local target/SAMOA-Local-0.3.0-SNAPSHOT.jar "PrequentialEvaluation -l classifiers.ensemble.Bagging -s (ArffFileStream -f covtypeNorm.arff) -f 100000"</p>
</div>
</div>
<p>The output will be a list of the evaluation results, plotted each 100,000 instances.</p>
</li>
</ul>
</section>
<section class="tutorial">
<h2><a href="http://incubator.apache.org/"><img style="max-width:95%;border:0px solid black;"src="http://incubator.apache.org/images/egg-logo.png" alt="Apache Incubator"></a></h2>
<h2>
Apache SAMOA is an effort undergoing incubation at The Apache Software Foundation (ASF), sponsored by the Apache Incubator. Incubation is required of all newly accepted projects until a further review indicates that the infrastructure, communications, and decision making process have stabilized in a manner consistent with other successful ASF projects. While incubation status is not necessarily a reflection of the completeness or stability of the code, it does indicate that the project has yet to be fully endorsed by the ASF.
</h2>
<h2>Apache and the Apache feather logo are trademarks of The Apache Software Foundation.</h2>
</section>
<script src="js/jquery.js"></script>
<!--<script>window.slides = {"bower":"Bower","vht":"Vht", "architecture":"Architecture"}</script>-->
<script src="js/application.js"></script>
<footer class="page-footer">
<ul class="site-footer-links right">
<li><a href="https://github.com/apache/incubator-samoa">View On <strong>GitHub</strong></a></li>
</ul>
<a href="/">
<span class="mega-octicon octicon-mark-github"></span>
</a>
<ul class="site-footer-links">
<li>&copy; 2014 <span>Apache SAMOA</span></li>
<li><a href="#build">Build Apache SAMOA</a></h4>
<li><a href="documentation/Getting-Started.html">Getting started!</a></li>
<li><a href="documentation/Home.html">Documentation</a></li>
</ul>
</footer>
<script>
(function(i,s,o,g,r,a,m){i['GoogleAnalyticsObject']=r;i[r]=i[r]||function(){
(i[r].q=i[r].q||[]).push(arguments)},i[r].l=1*new Date();a=s.createElement(o),
m=s.getElementsByTagName(o)[0];a.async=1;a.src=g;m.parentNode.insertBefore(a,m)
})(window,document,'script','//www.google-analytics.com/analytics.js','ga');
ga('create', 'UA-3769691-40', 'github.com');
ga('send', 'pageview');
</script>
</body>
</html>