commit | faab884c278a3ab5197ec3cd8931f84a664f4904 | [log] [tgz] |
---|---|---|
author | Matt Hayes <mhayes@linkedin.com> | Sun May 25 17:56:44 2014 -0700 |
committer | Matt Hayes <mhayes@linkedin.com> | Sun May 25 18:23:10 2014 -0700 |
tree | b6a8339cc5a31313be1b52a33d1f2ed4b2a59ff0 | |
parent | c4149151e608a27d1e02dadd2f571a034be4f34f [diff] |
Revert "DATAFU-50 SimpleEvalFunc should extend ContextualEvalFunc, have good lifecycle hooks" This reverts commit a36cab397505837113a5193de52e6439bd12c831. Multiple test failures.
Apache DataFu is a collection of libraries for working with large-scale data in Hadoop. The project was inspired by the need for stable, well-tested libraries for data mining and statistics.
It consists of two libraries:
For more information please visit the website:
If you'd like to jump in and get started, check out the corresponding guides for each library:
Bugs and feature requests can be filed here. For other help please see the discussion group.
The Apache DataFu Pig library can be built by running the command below. More information about working with the source code can be found in the DataFu Pig Contributing Guide.
./gradlew assemble
The built JAR can be found under datafu-pig/build/libs
by the name datafu-pig-x.y.z.jar
, where x.y.z is the version.
This command generates the eclipse project and classpath files:
./gradlew eclipse
To clean up the eclipse files:
./gradlew cleanEclipse
To run all the tests:
./gradlew test
To run tests for a single class, use the test.single
property. For example, to run only the QuantileTests:
/gradlew :datafu-pig:test -Dtest.single=QuantileTests
The tests can also be run from within eclipse.
The Apache DataFu Pig library can be built by running the commands below. More information about working with the source code can be found in the DataFu Hourglass Contributing Guide.
cd contrib/hourglass ant jar