10.0.0 (2022-07-12)
Full Changelog
Breaking changes:
Implemented enhancements:
- update documentation, fix styling to match main Arrow project #2864
- Update top-level README #2850
- [Question]How to call an async function in
ExecutionPlan::exec method? #2847 - Add
DataFrame::with_column #2844 - Improve ergonomics of physical expr
lit #2827 - Add Python examples for reading CSV and query by SQL in Doc #2824
- eliminate multi limit-offset nodes to EmptyRelation if possible #2822
- Make
LogicalPlan::Union be consistent with other plans #2816 - Use coerced data type from value and list expressions during planning inlist expression #2793
- Add configuration option to enable/disalbe
CoalesceBatchesExec #2790 - Simplify FilterNullJoinKeys rule #2780
- Allow configuration settings to be specified with environment variables #2776
- Automatically update
configs.md in user guide #2770 - Support multiple paths for ListingTableScanNode #2768
- Reduce outer joins #2757
- support data type coerced and decimal in INLIST expr #2755
- Change ExtensionPlanner::plan_extension() to an async function #2749
- Add
IsNotNull filter to join inputs if one side of join condition does not allow null #2739 - Sort preserving MergeJoin #2698
- Improve readability of table scan projections in query plans #2697
- DataFusion 9.0.0 Release #2676
- Improve UX for
UNION vs UNION ALL (introduce a LogicalPlan::Distinct) #2573 [sql] - Implement some way to show the sql used to create a view #2529
- Consider adopting IOx ObjectStore abstraction #2489
- Support
sum0 as a built-in agg function #2067 - implement grouping sets, cubes, and rollups #1327
- Ruby bindings #1114
- Support dates in hash join #2746 (andygrove)
Fixed bugs:
- Docker Error #2851
- Anti join ignores join filters #2842
- Can't test or compile sub-model code after upgrade to arrow-rs 17.0.0 #2835
- Not evaluate the set expr in the InList for the optimization #2820
- CASE When: result type should be coercible to a common type #2818
- IN/NOT IN List: NULL is not equal to NULL #2817
- panic when case statement returns null #2798
- InList: Can't cast the list expr data type to value expr data type directly #2774
- InList Expr: expr and list values must can be converted to a same data type #2759
- tpchgen docker syntax change prevents volume from binding #2751
- Cannot join on date columns (Unsupported data type in hasher: Date32) #2744
rewrite_expression does not properly handle Exists and ScalarSubquery #2736- LocalFileSystem Not sorted by file nameļ¼ As a result, the data lines queried in multiple files are out of order. #2730
- Filter push down need consider alias columns #2725
- Recent API change in
GlobalLimitExec breaks compatibility with Ballista #2720 - Common Subexpression Eliminiation pass errors if run twice on some plans: Schema contains duplicate unqualified field name ‘IsNull-Column-sys.host’ #2712
- The data type is not compatible with other system, for example spark or PG database #1379
Documentation updates:
Closed issues:
- Consider adding a prominent note in the readme about ballista #2853
- support decimal in (NULL) #2800
- InList: Don't treat Null as UTF8(None) #2782
- InList: don't need to treat Null as UTF8 data type #2773
- Implement extensible configuration mechanism #138
Merged pull requests: