
Red Arrow Format - Apache Arrow Format Ruby

Red Arrow Format is a pure Ruby implementation of the Apache Arrow format serializer and deserializer. It provides only serialization and deserialization features. If you want to process Apache Arrow data, not just serialize or deserialize it, you should use Red Arrow instead of Red Arrow Format.

Apache Arrow is an in-memory columnar data format. It's used by many products for data analytics.

Install

If you want to install Red Arrow Format with Bundler, add the following line to your Gemfile:

gem "red-arrow-format"

If you want to install Red Arrow Format with RubyGems, use the following command:

$ gem install red-arrow-format

Usage

require "arrow-format"

File.open("/dev/shm/data.arrow", "rb") do |input|
  reader = ArrowFormat::FileReader.new(input)
  reader.each do |record_batch|
    # Use record_batch
  end
end
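
Since the reader above expects data in the Apache Arrow file format, it can be useful to check an input before handing it over. The Arrow file format begins and ends with the magic bytes "ARROW1". The following is a minimal sketch of such a check, assuming a seekable input such as a File or StringIO. The helper name arrow_file? and the constant ARROW_MAGIC are hypothetical and are not part of Red Arrow Format.

```ruby
require "stringio"

# Magic bytes that open and close an Apache Arrow file.
ARROW_MAGIC = "ARROW1".b

# Hypothetical helper (not part of red-arrow-format): returns true when the
# seekable IO both starts and ends with the Arrow file magic bytes.
# The original read position is restored before returning.
def arrow_file?(io)
  original_pos = io.pos
  io.rewind
  # A too-short input yields nil or a shorter string, which never matches.
  return false unless io.read(ARROW_MAGIC.bytesize) == ARROW_MAGIC
  io.seek(-ARROW_MAGIC.bytesize, IO::SEEK_END)
  io.read(ARROW_MAGIC.bytesize) == ARROW_MAGIC
ensure
  io.seek(original_pos) if original_pos
end

# Example with in-memory data; the middle bytes are placeholders, not a
# valid Arrow payload.
sample = StringIO.new("ARROW1\0\0placeholder".b + ARROW_MAGIC)
puts arrow_file?(sample)
```

This only inspects the magic bytes, so it does not prove that the content in between is valid. It is a cheap pre-check that could run before passing the same input to ArrowFormat::FileReader.new.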

Development

You can run the tests with the following commands:

$ cd ruby/red-arrow-format
$ bundle install
$ bundle exec rake test