Apache Arrow DataFusion SQL Query Engine

DataFusion

DataFusion is a very fast, extensible query engine for building high-quality data-centric systems in Rust, using the Apache Arrow in-memory format.

DataFusion offers SQL and DataFrame APIs, excellent performance, built-in support for CSV, Parquet, JSON, and Avro, extensive customization, and a great community.

The project's documentation is available at https://arrow.apache.org/datafusion/.

Using DataFusion

The example usage section in the user guide and the datafusion-examples code in the crate contain information on using DataFusion.

Contributing to DataFusion

The developer’s guide contains information on how to contribute.