This Quickstart will walk you through executing your first Beam pipeline to run [WordCount]({{ site.baseurl }}/get-started/wordcount-example), written using Beam's [Go SDK]({{ site.baseurl }}/documentation/sdks/go), on a [runner]({{ site.baseurl }}/documentation#runners) of your choice.
If you're interested in contributing to the Apache Beam Go codebase, see the [Contribution Guide]({{ site.baseurl }}/contribute).
The Beam SDK for Go requires go
version 1.10 or newer. It can be downloaded here. Check that you have version 1.10 by running:
$ go version
The easiest way to obtain the Apache Beam Go SDK is via go get
:
$ go get -u github.com/apache/beam/sdks/go/...
For development of the Go SDK itself, see BUILD.md for details.
The Apache Beam examples directory has many examples. All examples can be run by passing the required arguments described in the examples.
For example, to run wordcount
, run:
{:.runner-direct}
$ go install github.com/apache/beam/sdks/go/examples/wordcount $ wordcount --input <PATH_TO_INPUT_FILE> --output counts
{:.runner-dataflow}
$ go install github.com/apache/beam/sdks/go/examples/wordcount # As part of the initial setup, for non linux users - install package unix before run $ go get -u golang.org/x/sys/unix $ wordcount --input gs://dataflow-samples/shakespeare/kinglear.txt \ --output gs://<your-gcs-bucket>/counts \ --runner dataflow \ --project your-gcp-project \ --temp_location gs://<your-gcs-bucket>/tmp/ \ --staging_location gs://<your-gcs-bucket>/binaries/ \ --worker_harness_container_image=apachebeam/go_sdk:latest
{:.runner-nemo}
This runner is not yet available for the Go SDK.
Please don't hesitate to [reach out]({{ site.baseurl }}/community/contact-us) if you encounter any issues!