Reading baseCommitTime from the latest file slice as opposed to the tagged record value
1 file changed
tree: 004dc34451fdd7a1052cb13f958cf47a658fada3
  1. deploy/
  2. docker/
  3. hoodie-cli/
  4. hoodie-client/
  5. hoodie-common/
  6. hoodie-hadoop-mr/
  7. hoodie-hive/
  8. hoodie-integ-test/
  9. hoodie-spark/
  10. hoodie-timeline-service/
  11. hoodie-utilities/
  12. packaging/
  13. release/
  14. style/
  15. .gitignore
  16. .travis.yml
  17. _config.yml
  18. CHANGELOG.md
  19. KEYS
  20. LICENSE.txt
  21. NOTICE.txt
  22. pom.xml
  23. README.md
  24. RELEASE_NOTES.md
README.md

Hudi

Hudi (pronounced Hoodie) stands for Hadoop Upserts anD Incrementals. Hudi manages storage of large analytical datasets on HDFS and serve them out via two types of tables

  • Read Optimized Table - Provides excellent query performance via purely columnar storage (e.g. Parquet)
  • Near-Real time Table (WIP) - Provides queries on real-time data, using a combination of columnar & row based storage (e.g Parquet + Avro)

For more, head over here