Documentation

Primary documentation for Apache Apex, including overviews of the product, security, application development, operators, and the command-line tool, is here. Documentation for the Apache Apex Malhar operator library, including a diagrammatic taxonomy and in-depth tutorials for selected operators (such as Kafka Input), is here.

The Java API for both the platform and the operator library is here.

The following sections provide links to additional informative material such as presentations, videos and blogs.

Writing Apache Apex Applications

  • Building Your First Apache Apex Application (video): A hands-on demonstration of how to check out and build the source code repositories, run the Maven archetype command to generate a new Apache Apex project, populate the project with Java source files for a new application, and finally build and run the application, all on a virtual machine running Linux with Apache Hadoop installed.

  • Top N Words Application Tutorial: A detailed step-by-step description of how to build and run a word-counting application with Apache Apex. It starts with setting up your development environment, progresses to building, running, and monitoring the application and visualizing the output, and concludes with some advanced features such as assessing operator memory requirements, partitioning, and debugging.

  • Sales Dimensions Application Tutorial: Similar to the Top N Words tutorial, but covers dimensional computations on a simulated sales data stream.

  • Example Applications: A git repository with sample code for specialized tutorials covering a variety of topics, such as a high-performance key-value store (HDHT) and custom partitioning using stream codecs.

  • Malhar Demos: Part of the source repository for Apache Apex Malhar, containing a number of more substantial sample applications that developers will find especially useful. They include an application that computes statistics (such as moving averages) from a live stream of stock transactions from Yahoo! Finance; one that analyzes a synthetic stream of eruption event data for the Old Faithful geyser; a Twitter stream analyzer; and a simulated fraud detector.

Presentations