refs/tags/apache-arrow-0.2.0 - arrow

tag	d996e60f18f9f08370eacc4f0971bbb9005f6479
tagger	Uwe L. Korn <uwelk@xhochy.com>	Wed Feb 15 15:59:46 2017 +0100
object	f6924ad83bc95741f003830892ad4815ca3b70fd

commit	f6924ad83bc95741f003830892ad4815ca3b70fd
author	Uwe L. Korn <uwelk@xhochy.com>	Wed Feb 15 15:59:36 2017 +0100
committer	Uwe L. Korn <uwelk@xhochy.com>	Wed Feb 15 15:59:36 2017 +0100
tree	3147fb80d3fa9845101357a653c79b3c2b312337
parent	fa8d27f314b7c21c611d1c5caaa9b32ae0cb2b06

README.md

Apache Arrow

Powering Columnar In-Memory Analytics

Arrow is a set of technologies that enable big-data systems to process and move data fast.

Initial implementations include:

Arrow is an Apache Software Foundation project. Learn more at arrow.apache.org.

What's in the Arrow libraries?

The reference Arrow implementations contain a number of distinct software components:

Columnar vector and table-like containers (similar to data frames) supporting flat or nested types
Fast, language-agnostic metadata messaging layer (using Google's FlatBuffers library)
Reference-counted off-heap buffer memory management, for zero-copy memory sharing and handling memory-mapped files
Low-overhead IO interfaces to files on disk and HDFS (C++ only)
Self-describing binary wire formats (streaming and batch/file-like) for remote procedure calls (RPC) and interprocess communication (IPC)
Integration tests for verifying binary compatibility between the implementations (e.g. sending data from Java to C++)
Conversions to and from other in-memory data structures (e.g. Python's pandas library)
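
To make the first and third components above more concrete, here is a minimal sketch of a columnar container that shares memory between slices. It is an illustration only and does not use any Arrow library API: the class `IntColumn`, its `slice` and `to_list` methods, and the layout (one contiguous values buffer plus a validity bitmap with one bit per slot) are assumptions chosen for this example.

```python
from array import array


class IntColumn:
    """Hypothetical columnar container; not part of any Arrow library."""

    def __init__(self, items):
        # Contiguous values buffer. Null slots hold a placeholder 0.
        self.values = array("i", [0 if x is None else x for x in items])
        # Validity bitmap: bit i is set when slot i holds a real value.
        self.validity = bytearray((len(items) + 7) // 8)
        for i, x in enumerate(items):
            if x is not None:
                self.validity[i // 8] |= 1 << (i % 8)
        self.offset = 0
        self.length = len(items)

    def slice(self, start, length):
        """Return a view over the same buffers; no values are copied."""
        if start < 0 or length < 0 or start + length > self.length:
            raise IndexError("slice out of range")
        view = object.__new__(IntColumn)
        view.values = self.values      # shared, not copied
        view.validity = self.validity  # shared, not copied
        view.offset = self.offset + start
        view.length = length
        return view

    def __len__(self):
        return self.length

    def __getitem__(self, i):
        if not 0 <= i < self.length:
            raise IndexError(i)
        j = self.offset + i
        # A cleared validity bit means the slot is null.
        if not (self.validity[j // 8] >> (j % 8)) & 1:
            return None
        return self.values[j]

    def to_list(self):
        return [self[i] for i in range(self.length)]


col = IntColumn([1, None, 3, 4])
window = col.slice(1, 2)
print(window.to_list())  # [None, 3]
```

The slice records only an offset and a length, so both objects read from the same underlying buffers. That is the general idea behind zero-copy sharing; the real libraries add reference counting and off-heap allocation on top, which this sketch leaves out.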

Getting involved

Right now, the primary audience for Apache Arrow is developers of data systems; most people will use Apache Arrow indirectly, through systems that use it for internal data handling and for interoperating with other Arrow-enabled systems.

Even if you do not plan to contribute to Apache Arrow itself or Arrow integrations in other projects, we'd be happy to have you involved:

Join the mailing list: send an email to dev-subscribe@arrow.apache.org. Share your ideas and use cases for the project.
Follow our activity on JIRA
Learn the format
Contribute code to one of the reference implementations