Features

General

[x] Type coercion
[x] Projection (SELECT)
[x] Filter (WHERE)
[x] Filter post-aggregate (HAVING)
[x] Sorting (ORDER BY)
[x] Limit (LIMIT)
[x] Aggregate (GROUP BY)
[x] cast / try_cast
[x] VALUES lists
[x] String Functions
[x] Conditional Functions
[x] Time and Date Functions
[x] Math Functions
[x] Aggregate Functions (SUM, MEDIAN, and many more)
[x] Schema Queries
- [x] SHOW TABLES
- [x] SHOW COLUMNS FROM <table/view>
- [x] SHOW CREATE TABLE <view>
- [x] Basic SQL Information Schema (TABLES, VIEWS, COLUMNS)
- [ ] Full SQL Information Schema support
[x] Support for nested types (ARRAY/LIST and STRUCT)
- [x] Read support
- [x] Write support
- [x] Field access (col['field'] and col[1])
- [x] Array Functions
- [x] Struct Functions
  - [x] struct
  - [ ] Postgres JSON operators (->, ->>, etc.)
[x] Subqueries
[x] Common Table Expressions (CTE)
[x] Set Operations (UNION [ALL], INTERSECT [ALL], EXCEPT [ALL])
[x] Joins (INNER, LEFT, RIGHT, FULL, CROSS)
[x] Window Functions
- [x] Empty (OVER())
- [x] Partitioning and ordering: (OVER(PARTITION BY <..> ORDER BY <..>))
- [x] Custom Window (OVER(ORDER BY time ROWS BETWEEN 2 PRECEDING AND 0 FOLLOWING))
- [x] User Defined Window and Aggregate Functions
[x] Catalogs
- [x] Schemas (CREATE / DROP SCHEMA)
- [x] Tables (CREATE / DROP TABLE, CREATE TABLE AS SELECT)
[x] Data Insert
- [x] INSERT INTO
- [x] COPY .. INTO ..
  - [x] CSV
  - [x] JSON
  - [x] Parquet
  - [ ] Avro
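
Several of the constructs listed above (GROUP BY with HAVING, ORDER BY, LIMIT, CTEs, and window functions with partitioning, ordering, and a custom frame) are standard SQL, so a small runnable sketch can show what they look like together. The example below is an illustration only: it uses Python's built-in `sqlite3` module as a stand-in SQL engine, not DataFusion or its API, and the `readings` table and its columns are hypothetical. DataFusion-specific features such as `SHOW TABLES`, nested-type field access, or `COPY` are not covered here.

```python
import sqlite3

# NOTE: SQLite is used purely as a stand-in engine; this is NOT DataFusion's API.
# The table name and columns below are hypothetical sample data.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE readings (sensor TEXT, time INTEGER, value REAL)")
conn.executemany(
    "INSERT INTO readings VALUES (?, ?, ?)",
    [
        ("a", 1, 10.0), ("a", 2, 20.0), ("a", 3, 30.0), ("a", 4, 40.0),
        ("b", 1, 5.0), ("b", 2, 7.0),
    ],
)

# Aggregate (GROUP BY), post-aggregate filter (HAVING), sorting, and limit.
totals = conn.execute(
    """
    SELECT sensor, SUM(value) AS total
    FROM readings
    GROUP BY sensor
    HAVING SUM(value) > 20
    ORDER BY total DESC
    LIMIT 10
    """
).fetchall()

# CTE feeding a window function with partitioning, ordering, and a custom
# frame covering the current row plus the two rows before it.
moving = conn.execute(
    """
    WITH a_only AS (
        SELECT sensor, time, value FROM readings WHERE sensor = 'a'
    )
    SELECT time,
           SUM(value) OVER (
               PARTITION BY sensor
               ORDER BY time
               ROWS BETWEEN 2 PRECEDING AND CURRENT ROW
           ) AS moving_sum
    FROM a_only
    ORDER BY time
    """
).fetchall()

print(totals)
print(moving)
```

With the sample rows above, only sensor `a` passes the `HAVING` filter, and the window query returns a three-row moving sum per timestamp. Dialects differ in details, so the same statements may need small adjustments on another engine.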

In addition to allowing arbitrary data sources via the TableProvider trait, DataFusion includes built-in support for the following formats:

[x] CSV
[x] Parquet
- [x] Primitive and Nested Types
- [x] Row Group and Data Page pruning on min/max statistics
- [x] Row Group pruning on Bloom Filters
- [x] Predicate pushdown (late materialization), not enabled by default
[x] JSON
[x] Avro
[x] Arrow