Merge pull request #599 from keith-turner/fluo-597

fixes #597 moved Fluo Formatter functionality to Fluo scan command
tree: 6b907909b6835d65fe8c72aa0ada9a913d4e7e1e
  1. contrib/
  2. docs/
  3. modules/
  4. .gitignore
  5. .travis.yml
  6. AUTHORS
  7. LICENSE
  8. pom.xml
  9. README.md
README.md

Fluo

Build Status

Fluo is transaction layer that enables incremental processing on big data.

Fluo is an implementation of Percolator built on Accumulo than runs in YARN. It is not recommended for production use yet. Check out the Fluo project website for news and general information.

Getting Started

There are several ways to run Fluo (listed in order of increasing difficulty):

  • quickstart - Starts a MiniFluo instance that is configured to run a word count application
  • MiniFluo - Sets up a minimal Fluo instance that writes its data to single directory
  • fluo-dev - Command-line tool for running Fluo and its dependencies on a single machine
  • fluo-deploy - Command-line tool that launches an AWS cluster and deploys Fluo and its dependencies to it
  • Production - Sets up Fluo on a cluster where Accumulo, Hadoop & Zookeeper are running

Except for quickstart, all above will set up a Fluo application that will be idle unless you create client & observer code for your application. You can either create your own application or configure your Fluo application to run an example below:

  • phrasecount - Computes phrase counts for unique documents
  • fluo-stress - Computes the number of unique integers by building bitwise trie

Implementation

  • Architecture - Overview of Fluo's architecture
  • Contributing - Documentation for developers who want to contribute to Fluo
  • Metrics - Fluo metrics are visible via JMX by default but can be configured to send to Graphite or Ganglia