tree: f7b8f9e8c9aa5e587c59babf23b614ae873e20f9
  1. api.h
  2. arrow-dataset.pc.in
  3. ArrowDatasetConfig.cmake.in
  4. CMakeLists.txt
  5. dataset.cc
  6. dataset.h
  7. dataset_internal.h
  8. dataset_test.cc
  9. discovery.cc
  10. discovery.h
  11. discovery_test.cc
  12. file_base.cc
  13. file_base.h
  14. file_ipc.cc
  15. file_ipc.h
  16. file_ipc_test.cc
  17. file_parquet.cc
  18. file_parquet.h
  19. file_parquet_test.cc
  20. file_test.cc
  21. filter.cc
  22. filter.h
  23. filter_test.cc
  24. partition.cc
  25. partition.h
  26. partition_test.cc
  27. pch.h
  28. projector.cc
  29. projector.h
  30. README.md
  31. scanner.cc
  32. scanner.h
  33. scanner_internal.h
  34. scanner_test.cc
  35. test_util.h
  36. type_fwd.h
  37. visibility.h
cpp/src/arrow/dataset/README.md

Arrow C++ Datasets

The arrow::dataset subcomponent provides an API to read and write semantic datasets stored in different locations and formats. It facilitates parallel processing of datasets spread across different physical files and serialization formats. It also handles partitioning, filtering (at both the partition level and the column level), and schema normalization.
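
To make the two filtering levels mentioned above concrete, here is a minimal, self-contained C++ sketch. It does not use the arrow::dataset API, which is pre-alpha and may change. `Fragment`, `PrunePartitions`, `ScanGreaterEqual`, and `ExampleFragments` are hypothetical names invented for illustration, and the file paths and values are made-up sample data.

```cpp
#include <cstdint>
#include <map>
#include <string>
#include <vector>

// Hypothetical stand-in for one physical file of a dataset.
// NOT an arrow::dataset type; every name here is an illustrative assumption.
struct Fragment {
  std::string path;
  std::map<std::string, std::string> partition;  // e.g. {"year": "2019"}
  std::vector<int64_t> values;                   // a single column of data
};

// Partition-level filtering: drop whole files whose partition key does not
// match, without looking at any of their rows.
std::vector<Fragment> PrunePartitions(const std::vector<Fragment>& fragments,
                                      const std::string& key,
                                      const std::string& wanted) {
  std::vector<Fragment> kept;
  for (const auto& fragment : fragments) {
    auto it = fragment.partition.find(key);
    if (it != fragment.partition.end() && it->second == wanted) {
      kept.push_back(fragment);
    }
  }
  return kept;
}

// Column-level filtering: visit the rows of the remaining files and keep
// only the values that are >= min_value.
std::vector<int64_t> ScanGreaterEqual(const std::vector<Fragment>& fragments,
                                      int64_t min_value) {
  std::vector<int64_t> out;
  for (const auto& fragment : fragments) {
    for (int64_t value : fragment.values) {
      if (value >= min_value) out.push_back(value);
    }
  }
  return out;
}

// Made-up sample data: three files in two formats, partitioned by year.
std::vector<Fragment> ExampleFragments() {
  return {
      {"year=2018/a.parquet", {{"year", "2018"}}, {1, 5, 9}},
      {"year=2019/b.parquet", {{"year", "2019"}}, {2, 6}},
      {"year=2019/c.arrow", {{"year", "2019"}}, {7, 3}},
  };
}
```

Calling `PrunePartitions(ExampleFragments(), "year", "2019")` and passing the result to `ScanGreaterEqual` means the 2018 file is never scanned. Skipping whole files in this way is the main benefit of pushing a filter down to the partition level.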

Development Status

Pre-alpha as of June 2019. API subject to change without notice.