title: "Overview"

This documentation is for Apache Flink version {{ site.version }}.

Apache Flink is an open source platform for distributed stream and batch data processing. Flink's core is a streaming dataflow engine that provides data distribution, communication, and fault tolerance for distributed computations over data streams. Flink also builds batch processing on top of the streaming engine, overlaying native iteration support, managed memory, and program optimization.

First Steps

  • Concepts: Start with the [basic concepts]({{ site.baseurl }}/concepts/concepts.html) of Flink. Reading them first is highly recommended, because they help you understand the other parts of the documentation, including the setup and programming guides.

  • Quickstarts: Run an example program on your local machine or write a simple program working on live Wikipedia edits.

  • Setup: The [local]({{ site.baseurl }}/setup/local_setup.html), cluster, and cloud setup guides show you how to deploy Flink.

  • Programming Guides: The guides on basic concepts and on the DataStream API and DataSet API show you how to write your first Flink programs.

  • Migration Guide: Check out the 0.10 to 1.0 migration guide if you are upgrading from Flink 0.10.x.

Stack

This is an overview of Flink's stack. Click on any component to go to the respective documentation page.