Apache Flume Hadoop provides various Flume components for the Hadoop ecosystem

Welcome to Apache Flume Hadoop!

Apache Flume is a distributed, reliable, and available service for efficiently collecting, aggregating, and moving large amounts of event data. It has a simple and flexible architecture based on streaming data flows. It is robust and fault tolerant, with tunable reliability mechanisms and many failover and recovery mechanisms. The system is centrally managed and allows for intelligent dynamic management. It uses a simple, extensible data model that supports online analytic applications.

The Apache Flume Hadoop module provides Flume components that leverage Hadoop technologies.

Apache Flume Hadoop is licensed under the Apache License, Version 2.0.

Documentation

Documentation is included in the binary distribution under the docs directory. In source form, it can be found in the flume-ng-doc directory.

The Flume 1.x guide and FAQ are available here:

Contact us!

Bug and Issue tracker.

Compiling Flume Hadoop

Compiling Flume Hadoop requires the following tools:

  • Oracle Java JDK 8 (Note: 3.x does not support compiling with anything but Java 8)
  • Apache Maven 3.x
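
Before starting a build, it can help to confirm that the tools listed above are the ones on your PATH. The following is a minimal sketch, not part of the project: the helper names is_java8, is_maven3, and java_version_from are hypothetical, and the version patterns assume the usual "1.8.0_xxx" or "8.x" strings that Java 8 distributions report.

```shell
#!/bin/sh
# Hypothetical helpers for checking build prerequisites; not shipped with Flume.

# Succeeds if the given version string denotes Java 8,
# e.g. "1.8.0_392" (legacy scheme) or "8.0.1".
is_java8() {
  case "$1" in
    1.8|1.8.*|8|8.*) return 0 ;;
    *) return 1 ;;
  esac
}

# Succeeds if the given version string denotes Maven 3.x, e.g. "3.9.6".
is_maven3() {
  case "$1" in
    3.*) return 0 ;;
    *) return 1 ;;
  esac
}

# Extracts the quoted version from the first line of `java -version` output,
# e.g. 'openjdk version "1.8.0_392"' yields 1.8.0_392.
java_version_from() {
  printf '%s\n' "$1" | sed -n '1s/.*version "\([^"]*\)".*/\1/p'
}
```

One way to use these: capture the output of `java -version 2>&1` (the JDK prints its version to standard error), pass it through java_version_from, and test the result with is_java8. If both checks pass, a standard Maven lifecycle invocation such as `mvn clean install` is the usual starting point; whether this repository needs extra profiles or flags is not stated here, so treat that command as an assumption.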