Apache Flume

Clone this repo:
  1. 4b74aa2 FLUME-2963. FlumeUserGuide: Fix error in Kafka Source properties table by Denes Arvay · 2 years, 5 months ago trunk
  2. dff1505 Fix various typos by lfzCarlosC · 2 years, 5 months ago
  3. 988ede9 FLUME-2959. Fix issues with flume-checkstyle module by Lior Zeno · 2 years, 5 months ago
  4. 5a083a3 FLUME-2890. Typo in Twitter source warning by Daniel Templeton · 2 years, 5 months ago
  5. 10639e8 FLUME-2761. Move Hive sink out of preview mode by Roshan Naik · 2 years, 5 months ago

Welcome to Apache Flume!

Apache Flume is a distributed, reliable, and available service for efficiently collecting, aggregating, and moving large amounts of log data. It has a simple and flexible architecture based on streaming data flows. It is robust and fault tolerant with tunable reliability mechanisms and many failover and recovery mechanisms. The system is centrally managed and allows for intelligent dynamic management. It uses a simple extensible data model that allows for online analytic application.

The Apache Flume 1.x (NG) code line is a refactoring of the first generation Flume to solve certain known issues and limitations of the original design.

Apache Flume is open-sourced under the Apache Software Foundation License v2.0.


Documentation is included in the binary distribution under the docs directory. In source form, it can be found in the flume-ng-doc directory.

The Flume 1.x guide and FAQ are available here:

Contact us!

Bug and Issue tracker.

Compiling Flume

Compiling Flume requires the following tools:

  • Oracle Java JDK 1.7
  • Apache Maven 3.x

Note: The Apache Flume build requires more memory than the default configuration. We recommend you set the following Maven options:

export MAVEN_OPTS=“-Xms512m -Xmx1024m -XX:PermSize=256m -XX:MaxPermSize=512m”

To compile Flume and build a distribution tarball, run mvn install from the top level directory. The artifacts will be placed under flume-ng-dist/target/.