
Gearpump Beam Runner

The Gearpump Beam runner allows users to execute pipelines written using the Apache Beam programming API with Apache Gearpump (incubating) as an execution engine.

##Getting Started

The following shows how to run the WordCount example that is included in the Beam source code.

###Installing Beam

To get the latest version of Beam with the Gearpump runner, first clone the Beam repository and check out the gearpump-runner branch:

    git clone https://github.com/apache/beam
    cd beam
    git checkout gearpump-runner

Then, from the beam directory, run Maven to build Apache Beam:

    mvn clean install -DskipTests

Now Apache Beam and the Gearpump Runner are installed in your local Maven repository.

###Running the WordCount Example

Download something to count:

    curl http://www.gutenberg.org/cache/epub/1128/pg1128.txt > /tmp/kinglear.txt
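If the request is redirected or fails, /tmp/kinglear.txt may end up empty or contain an error page instead of the text. The following is a minimal sketch of a more defensive download, not part of the Beam tooling. It assumes a curl build with the standard -f (fail on HTTP errors) and -L (follow redirects) options; the function name fetch_text is hypothetical.

```shell
# Hypothetical helper: download a URL ($1) to a file ($2) and verify the result.
fetch_text() {
  url=$1
  dest=$2
  # -f: exit non-zero on HTTP errors; -L: follow redirects;
  # -sS: hide the progress meter but still print errors
  curl -fsSL "$url" -o "$dest" || return 1
  # Succeed only if the destination file exists and is not empty
  [ -s "$dest" ]
}

# Usage (not executed here):
#   fetch_text http://www.gutenberg.org/cache/epub/1128/pg1128.txt /tmp/kinglear.txt
```

The function returns a non-zero status when the download fails or produces an empty file, so it can guard the pipeline run in a script.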

Run the pipeline, using the Gearpump runner:

    cd examples/java
    mvn exec:java -Dexec.mainClass=org.apache.beam.examples.WordCount -Dexec.args="--inputFile=/tmp/kinglear.txt --output=/tmp/wordcounts.txt --runner=TestGearpumpRunner" -Pgearpump-runner

Once the pipeline has completed, check the output file: /tmp/wordcounts.txt-00000-of-00001
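
For a quick look at the results, the output can be sorted by count. The following is a minimal sketch that assumes each output line has the form "word: count"; adjust the field separator if your output differs. The function name top_words is hypothetical.

```shell
# Hypothetical helper: print the N most frequent words from a WordCount output file.
# Assumes each line looks like "word: count".
top_words() {
  file=$1
  n=${2:-10}   # default to the top 10 entries
  # Split fields on ':', sort numerically on the count field, highest first
  sort -t: -k2,2nr "$file" | head -n "$n"
}

# Usage (not executed here):
#   top_words /tmp/wordcounts.txt-00000-of-00001 5
```

If the runner writes more than one shard, the same function can be applied to the concatenation of all shard files.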