DATAFU-52 Added Base64Encode/Decode and unit tests.

https://issues.apache.org/jira/browse/DATAFU-52

Signed-off-by: Matthew Hayes <matthew.terence.hayes@gmail.com>
3 files changed
tree: 0e88f6ff724ec99a9a1602e41a3772f17005f01c
  1. .settings/
  2. build-plugin/
  3. cobertura/
  4. datafu-hourglass/
  5. datafu-pig/
  6. eclipselibs/
  7. examples/
  8. gradle/
  9. licenses/
  10. site/
  11. staticlibs/
  12. tools/
  13. .classpath.template
  14. .factorypath.template
  15. .gitignore
  16. .travis.yml
  17. build.gradle
  18. changes.md
  19. CONTRIBUTORS
  20. gradle.properties
  21. gradlew
  22. HEADER
  23. LICENSE
  24. NOTICE
  25. README.md
  26. releasing.md
  27. settings.gradle
  28. settings.xml.template
  29. test.sh
  30. test_in_background.sh
README.md

Apache DataFu

Apache DataFu is a collection of libraries for working with large-scale data in Hadoop. The project was inspired by the need for stable, well-tested libraries for data mining and statistics.

It consists of two libraries:

  • Apache DataFu Pig: a collection of user-defined functions for Apache Pig
  • Apache DataFu Hourglass: an incremental processing framework for Apache Hadoop in MapReduce

For more information please visit the website:

If you'd like to jump in and get started, check out the corresponding guides for each library:

Blog Posts

Presentations

Papers

Getting Help

Bugs and feature requests can be filed here. For other help please see the discussion group.

Developers

DataFu Pig

Building the Code

The Apache DataFu Pig library can be built by running the command below. More information about working with the source code can be found in the DataFu Pig Contributing Guide.

./gradlew assemble

The built JAR can be found under datafu-pig/build/libs by the name datafu-pig-x.y.z.jar, where x.y.z is the version.

Generating Eclipse Files

This command generates the eclipse project and classpath files:

./gradlew eclipse

To clean up the eclipse files:

./gradlew cleanEclipse

Running the Tests

To run all the tests:

./gradlew test

To run tests for a single class, use the test.single property. For example, to run only the QuantileTests:

/gradlew :datafu-pig:test -Dtest.single=QuantileTests

The tests can also be run from within eclipse.

DataFu Hourglass

Building the Code

The Apache DataFu Pig library can be built by running the commands below. More information about working with the source code can be found in the DataFu Hourglass Contributing Guide.

cd contrib/hourglass
ant jar