commit | 64e0573aca01dfd07b23ec41e20acb307829733e | |
---|---|---|
author | Vinoth Chandar <vinoth@uber.com> | Mon Aug 28 01:28:08 2017 -0700 |
committer | vinoth chandar <vinothchandar@users.noreply.github.com> | Mon Oct 02 20:44:53 2017 -0700 |
tree | 7cb72248968ce86dbb7057d906668fe93014ed6c | |
parent | c98ee057fcd9d2566e5cefcf15bd5d2e5ec9283e |
Adding hoodie-spark to support Spark Datasource for Hoodie

- Write with COW/MOR paths work fully
- Read with RO view works on both storages*
- Incremental view supported on COW
- Refactored out HoodieReadClient methods, to just contain key based access
- HoodieDataSourceHelpers class can be now used to construct inputs to datasource
- Tests in hoodie-client using new helpers and mechanisms
- Basic tests around save modes & insert/upserts (more to follow)
- Bumped up scala to 2.11, since 2.10 is deprecated & complains with scalatest
- Updated documentation to describe usage
- New sample app written using the DataSource API

Hoodie manages storage of large analytical datasets on HDFS and serves them out via two types of tables.
For more, head over here