layout: default title: Books Tutorials and Talks
Intro
This page is a place for info about talks (past and upcoming), tutorials, articles, books, slides, PDFs, discussions, etc. about Mahout. No endorsements are implied or given.
Books
Mahout specific
- Apache Mahout: Beyond MapReduce by Dmitriy Lyubimov and Andrew Palumbo published Feb 2016. Covers new features in Mahout “Samsara” releases (0.10, 0.11+).
- Apache Mahout cookbook- Book by Piero Giacomelli published Dec 2013 by Packtpub.
- Mahout in Action - Book by Sean Owen, Robin Anil, Ted Dunning and Ellen Friedman published Oct 2011 by Manning Publications.
- Taming Text - By Grant Ingersoll and Tom Morton, published by Manning Publications. Will have some Mahout coverage, but by no means as complete as Mahout in Action.
Engineering oriented machine learning books
- Collective Intelligence in Action
- Programming Collective Intelligence
- Algorithms of the Intelligent Web
Scientific background
- Data Mining: Practical Machine Learning Tools and Techniques
- Introduction to Information Retrieval
- Machine Learning
- Pattern Recognition and Machine Learning (Information Science and Statistics)
News, Articles and Tutorials
- Mahout 0.10.x: first Mahout release as a programming environment
- Comparing Document Classification Functions of Lucene and Mahout
- Apache Mahout: Scalable Machine Learning for Everyone
- How to build a spam filter server with Mahout - Applying classification on a live server - April 2011
- Deploying a massively scalable recommender system with Apache Mahout - Blogpost of Sebastian Schelter in April 2011
- Apache Mahout & the commoditization of machine learning - Podcast interview with Grant Ingersoll at ApacheCon 2010
- Apache Mahout 0.4 mit neuen Algorithmen - published after the 0.4 release by heise Open/ Developer, November 2010
- Mahout on InfoQ - Interview with Grant Ingersoll on InfoQ
- Mahout in the Cloudera weblog - published after the Hadoop user group UK.
- Mahout in the Drools weblog - Michael Neale published an article on Mahout in the drools weblog
- Introducing Apache Mahout - Grant Ingersoll - Intro to Apache Mahout focused on clustering, classification and collaborative filtering. Japanese translation available at: http://www.ibm.com/developerworks/jp/java/library/j-mahout/
- Flexible Collaborative Filtering In Java With Mahout Taste - Philippe Adjiman - Quick starting guide on how to use the collaborative filtering package of Mahout (called Taste) to quickly and flexibly create, test and compare tailored recommendation engines.
- Integrating Mahout with Lucene and Solr Three part series on ways to integrate Mahout with Lucene and Solr
- Mahout Item Recommender Tutorial using Java and Eclipse - YouTube video tutorial by Steve Cook
Coursework/Lectures
Talks
In reverse chronological order, so that most recent talks are at the top
- [Distributed Machine Learning with Apache Mahout] Suneel Marthi at Apache Big Data North America, Vancouver, Canada, May 11, 2016 and MapR Washington DC Big Data Everywhere, Tysons, VA, June 2 2016
- Declarative Machine Learning with the Samsara DSL Sebastian Schelter at Flink Forward Conference, Berlin Germany, October 2015.
- Bringing Algebraic Semantics to Mahout Sebastian Schelter at HPI Infolunch, Potsdam Germany, May 2014
- Mahout Spark and Scala bindings: Bringing Algebraic Semantics (slides/video) - Dmitriy Lyubimov at Mahout Meetup, April 17, 2014.
- Mahout Future Directions - Ted Dunning, Suneel Marthi, Sebastian Schelter at Hadoop Summit Europe 2014, Amsterdam, April 3, 2014
- Building Recommender Systems for Mere-Mortals - Sebastian Schelter at Researchgate Developer Day, Berlin, November 2013
- Recommendations with Apache Mahout - Sebastian Schelter at IBM Almaden Research Center, San Jose, September 2013
- Next Directions in Mahout’s Recommenders - Sebastian Schelter at Bay Area Mahout Meetup, Redwood City, August 2013
- New Directions in Mahout’s Recommenders - Sebastian Schelter at Recommender Systems Get Together Berlin, April 2013
- Introduction to Mahout and Machine Learning - Slides by Varad Meru, Software Development Engineer at Orzota. July 27th, 2013.
- An Introduction to Collaborative Filtering with Apache Mahout - Sebastian Schelter at Recommender Systems Challenge Workshop in conjunction with ACM RecSys 2012, Dublin, September 2012
- How to build a recommender system based on Mahout and JavaEE - Slides by Manuel Blechschmidt at Berlin Expert Days March, 2012.
- Apache Mahout for intelligent data analysis - Slides from Isabel Drost at Apache Con NA November, 2011.
- Dr. Mahout: Analyzing clinical data using scalable and distributed computing - Slides from Shannon Quinn at Apache Con NA November, 2011.
- Frank Scholten at Berlin Buzzwords on June 7, 2011.
- Introduction to Collaborative Filtering using Mahout (updated) - Talk by Sean Owen at the London Hadoop User Group on April 14, 2011.
- Cool Tricks with Classifiers - Talk by Ted Dunning at the Los Angeles HUG talking about Mahout classifiers on March 16, 2011.
- First Mahout Hackathon, Berlin, March 2011
- Mahout meetup - there were two talks at the Apache Mahout meetup at JTeam in Amsterdam, February 2011. intro slides
- Mahout clustering - Talk on Mahout clustering at data dev room FOSDEM, February 2011.
- Scaling Data Analysis with Apache Mahout - talk on Mahout at O'Reilly Strata, February 2011.
- Practical Machine Learning - Slides from Biju B and Jaganadh G, FOSSMEET-NITC, Calicut, India, February 2011.
- Mahout at AlphaCSPs The Edge 2010 (pdf) - slideshare - Slides from Ariel Kogan AlphaCSP's The Edge, December 2010.
- Intelligent data analysis with Apache Mahout - Slides from Isabel Drost, Devoxx Antwerp, November 2010.
- Apache Mahout introduction - Slides from Isabel Drost, codebits Lisbon, November 2010.
- Apache Mahout - Making Data Analysis Easy - Slides from Isabel Drost, Apache Con US Atlanta, November 2010.
- Practical Machine Learning - Slides from Jaganadh G, BarCamp Kerala 9, November 2010.
- Mahout and its new classification framework - Slides from Ted Dunning, SDForum, November 2010.
- Distributed Item-based Collaborative Filtering with Apache Mahout - Slides from Sebastian Schelter, Hadoop Get Together Berlin, October 2010.
- Hidden Markov Models for Mahout - Slides from Max Heimel, Hadoop Get Together Berlin, October 2010.
- Apache Mahout Mammoth Scale Machine Learning - Slides from Robin Anil, OSCON 2010.
- Intro to Apache Mahout - Slides from Grant Ingersoll, RTP Semantic Web Group.
- Case study: Biometric Databases and Hadoop - Slides from Jason Trost, Hadoop Summit 2010.
- Spam Fighting at Yahoo
- Web Mining with Ken Krugler
- Keynote on intelligent search - Slides from Grant Ingersoll, Berlin Buzzwords, June 2010.
- Simple co-occurrence-based recommendation on Hadoop - Slides from Sean Owen, Berlin Buzzwords, June, 2010.
- Introduction to Collaborative Filtering using Mahout - Slides from Frank Scholten, Berlin Buzzwords, June, 2010.
- Introduction to Scalable Machine Learning - Slides and demos from Grant Ingersoll, March, 2010.
- Mahout @ India Hadoop Summit - Slides from a 1 hour talk on Mahout at the India Hadoop Summit by Robin Anil, February 2010.
- Mahout in 10 minutes - Slides from a 10 min intro to Mahout at the Map Reduce tutorial by David Zülke at Open Source Expo in Karlsruhe, Isabel Drost, November 2009.
- Mahout at Apache Con US - Slides from a talk on “Going from raw data to information” (with Mahout) at Apache Con US in Oakland, Isabel Drost, November 2009.
- Mahout at FrOSCon - Slides from a talk on Mahout at FrOSCon in Sankt Augustin, Isabel Drost, August 2009.
- Mahout at DAI group TU Berlin - Slides from a talk on Mahout at the DAI Laboratories TU Berlin, Isabel Drost, July 2009.
- Mahout at Machine Learning Group TU Berlin - Slides from a talk on Hadoop with some detour to Mahout at the Machine
- Learning Group of Prof. Dr. Klaus-Robert Müller at TU Berlin, Isabel Drost, June 2009.
- Mahout at Google Zürich - Slides from a Google tech-talk on the past, present and future of Mahout, Isabel Drost, May 2009.
- Hadoop user group UK - Slides from a talk on April 14, 2009 at the Hadoop User Group UK in London, Isabel Drost, April 2009.
- BI Over Petabytes: Meet Apache Mahout - Slides from a talk by Jeff Eastman on April 21, 2009 at the Bay Area SD Forum Business Intelligence SIG meeting at SAP in Palo Alto, CA.
- Lucene Meetup and Apache Barcamp in Amsterdam, March 2009.
- BarCampRDU - (Raleigh) on Aug. 2, 2008
- Introducing Mahout: Apache Machine Learning - Committer Grant Ingersoll gave a gentle introduction to Mahout and Machine Learning at ApacheCon in November (3rd through 7th) in New Orleans, USA.
- Mahout: Scaling Machine Learning - Introduction to Mahout and machine learning at FrOSCon in Sankt Augustin/Germany, Isabel Drost, August 2008. (slides)
- Mahout: Scalable Machine Learning - An introduction to Mahout and machine learning at the first German Hadoop gathering in newthinking store/ Berlin, Isabel Drost, July 2008.
- Apache Mahout: Industrial Strength Machine Learning - Committer Jeff Eastman gave an introduction to Mahout at Yahoo!, May 2008
- Apache Lucene - Mach's wie Google - Bernd Fondermann presented an overview of the Apache Lucene project,
- including Mahout at Open Source Expo 2008 in Karlsruhe, May 2008.
- Apache Mahout: Bringing Machine Learning to Industrial Strength - Committer Isabel Drost gave a Fast Feather introduction the the new project Mahout at Apache Con EU April, 2008