commit | e773c2573bba9a1d21a448fc00896b8dec6b58cd | [log] [tgz] |
---|---|---|
author | julien <julien@twitter.com> | Fri Jul 12 22:08:50 2013 -0700 |
committer | julien <julien@twitter.com> | Fri Jul 12 22:08:50 2013 -0700 |
tree | 0eacf761cde6b102259254eec34c9190d96a3f27 | |
parent | 2055a22b8f4755dd0e89a7c7b51b6183e05b0b12 [diff] |
update brennus
Parquet-mr is the java implementation of the Parquet format to be used in Hadoop. It uses the record shredding and assembly algorithm described in the Dremel paper. Integration with Pig and Map/Reduce are provided.
A Loader and a Storer are provided to read and write Parquet files with Apache Pig
Thrift mapping to the parquet schema is provided using a TBase extending class. You can read and write parquet files using Thrift generated classes.
See the APIs:
to run the unit tests: mvn test
The build runs in Travis CI:
Copyright 2012 Twitter, Inc.
Licensed under the Apache License, Version 2.0: http://www.apache.org/licenses/LICENSE-2.0