1. 933297b feat(unparser): support binary literals (#23001) by Louis Vialar · 6 hours ago main
  2. 6c2221d bugfix: changed return type of spark's width_bucket to i64 (#22811) by Eduardo Aguilar · 6 hours ago
  3. 0838a4d Add StatisticsContext parameter to partition_statistics (#21815) by Alessandro Solimando · 14 hours ago
  4. 54b7dd9 Unify LRU memory-limiting caches into one generic cache (#22613) by Michael Kleen · 14 hours ago
  5. ad8e7b7 refactor: Simplify `approx_distinct` (-200 LoC) (#22921) by Yongting You · 20 hours ago
  6. 0fc55d0 refactor: make scalar distance u64 and overflow aware (#22892) by Florian Müller · 27 hours ago
  7. 1f45d83 fix: parquet limit pruning for row group selections (#22942) by Huaijin · 27 hours ago
  8. 3f4bcf1 fix: Omit NULL values from build side of hash joins (#22893) by Neil Conway · 28 hours ago
  9. 2da8887 chore: attach Diagnostic to unary operator type errors (#21288) by Harrison Crosse · 28 hours ago
  10. 2836480 fix: ProjectionPushdown internal error on NestedLoopJoin mark joins (#22902) by linfeng · 30 hours ago
  11. ddc157d Refactor outer join null-rejection analysis to track join sides directly (#22870) by kosiew · 2 days ago
  12. 7bb6e15 Remove redundant `collect_stat` and `target_partitions` on `ListingOptions` (#22969) by Gabriel · 2 days ago
  13. 8172873 chore(deps): bump the all-other-cargo-deps group across 1 directory with 6 updates (#22975) by dependabot[bot] · 2 days ago
  14. 96a6096 feat: support reading from stdin in datafusion-cli (#22839) by Xin Huang · 3 days ago
  15. a0e6d49 Make LogicalPlan::Unnest expression/rebuild contracts consistent (#22783) by Nathan · 3 days ago
  16. c7e9284 refactor: use raw view access in do_append_val_inner and consolidate duplicated logic (#22907) by EeshanBembi · 3 days ago
  17. 408dad3 Add MERGE INTO types to datafusion-expr (#20763) by Xuanyi Li · 3 days ago
  18. d5f03d9 Fix leaf expression reconciliation (#22971) by Peter L · 3 days ago
  19. 6176a6d Add `.gitignore` for `proto-models` (#22977) by Jeffrey Vo · 3 days ago
  20. fa271ce refactor: Update SortMergeJoin to use async spill abstractions (#22230) by pantShrey · 3 days ago
  21. ae5f3f5 chore(deps): bump prost-build from 0.14.3 to 0.14.4 (#22843) by dependabot[bot] · 3 days ago
  22. fbd64b4 chore(deps): update pydata-sphinx-theme requirement from <1,>=0.18.0 to >=0.19.0,<1 in /docs (#22972) by dependabot[bot] · 3 days ago
  23. 2282d23 chore(deps): bump taiki-e/install-action from 2.81.8 to 2.81.11 (#22973) by dependabot[bot] · 3 days ago
  24. 46d241d chore(deps): update maturin requirement from <2,>=1.13.3 to >=1.14.0,<2 in /docs (#22974) by dependabot[bot] · 3 days ago
  25. 0fb650a chore(deps): bump insta-cmd from 0.6.0 to 0.7.0 (#22976) by dependabot[bot] · 3 days ago
  26. baa497d fix: Disable join dynamic filters for null-equal joins (#22965) by Neil Conway · 3 days ago
  27. 8cda78b Remove orphaned `snowflake_flatten_validation.sql` script (#22938) by Adam Gutglick · 3 days ago
  28. 3c6734e refactor: Simplify heap size estimation for arrays (#22954) by Michael Kleen · 3 days ago
  29. 15bc933 chore(deps): bump cryptography from 46.0.7 to 48.0.1 (#22968) by dependabot[bot] · 3 days ago
  30. 9849513 chore(deps-dev): bump launch-editor from 2.10.0 to 2.14.1 in /datafusion/wasmtest/datafusion-wasm-app (#22970) by dependabot[bot] · 3 days ago
  31. 152d8c4 Add `file_row_index` UDF to query file-level row indexes from Parquet files (#22604) by Adam Gutglick · 3 days ago
  32. a1e88e2 feat: decimal support for gcd and lcm (#22655) by theirix · 3 days ago
  33. a66c898 ci: Setup valid `Cargo.lock` for `depcheck` to unblock CI (#22933) by Adam Gutglick · 3 days ago
  34. 49b99bb chore(deps): bump pyjwt from 2.12.0 to 2.13.0 (#22966) by dependabot[bot] · 3 days ago
  35. 127731b Include `null_aware` status in the relevant Join node display implementations (#22913) by Adam Gutglick · 3 days ago
  36. c14379b refactor: remove `opt_filter` in `GroupsAccumulator::merge_batch` (#22816) by Huaijin · 4 days ago
  37. dede33c refactor(hash-aggr): Migrate existing tests on `GroupsHashAggregateStream` (#22953) by Yongting You · 4 days ago
  38. e20763c refactor(hash-aggr): Migrate the partial aggregation skip optimization to the new hash aggregation impl (#22899) by Yongting You · 4 days ago
  39. 99895e6 refactor: Simplify heap size estimation for types that own no heap allocations (#22918) by Michael Kleen · 4 days ago
  40. 6520315 fix(topk): call attempt_early_completion when filter rejects entire batch (#22852) by ajegou · 4 days ago
  41. d428760 fix: count shared buffers once in hash join build-side memory accounting (#22862) by Jordan Epstein · 5 days ago
  42. cb2542c fix: TRY_CAST returns NULL for timestamp/date overflow (#22897) by fys · 5 days ago
  43. 78033fa refactor: introduce ProbeEnd state in NestedLoopJoinExec (#22865) by Nathan · 6 days ago
  44. a7280b8 FFI: plumb `placement` for `FFI_ScalarUDF` (#22608) by Amogh Ramesh · 6 days ago
  45. 58e37a0 Clearly gate sliding SUM(DISTINCT) type support (#22866) by Kumar Ujjawal · 6 days ago
  46. 3bece3d Upgrade minimal tokio-postgres version to address security advisory (#22937) by Adam Gutglick · 6 days ago
  47. 574a1e6 fix: preserve Spark next_day whitespace validation (#22720) by Ahmed EL. · 6 days ago
  48. e5f7af1 feat(spark): add `concat_ws` with array support (#20928) by David López · 6 days ago
  49. f931728 fix: handle `date_bin` negative subsecond and overflow cases (#22610) by Kumar Ujjawal · 6 days ago
  50. 37768b8 fix: Enable sliding window execution for covar_pop, covar_samp, and corr (#22764) by pchintar · 6 days ago
  51. b8998c7 perf: Convert inner joins to semi joins when equivalent (#22652) by Neil Conway · 8 days ago
  52. 3b321a2 docs: link to 2026 Q3-Q4 roadmap discussion (#22884) by Andrew Lamb · 8 days ago
  53. db2d21e fix: preserve async UDF return field metadata (#22663) by Kristin Cowalcijk · 8 days ago
  54. dae03ee chore: Update to arrow/parquet 59.0.0 (#22744) by Andrew Lamb · 9 days ago
  55. 8bbc460 fix: Avoid incorrectly rounding large integers in `nanvl` (#22575) by Neil Conway · 9 days ago
  56. 656dc47 Add sort tpch SQL benchmark (#22814) by Bruce Ritchie · 9 days ago
  57. 8f06354 minor: handle NULL array input in array_remove and array_replace (#22790) by linfeng · 9 days ago
  58. 68cbe60 fix: map() fails when keys are literals and values are column expressions (#22784) by Nathan · 9 days ago
  59. 5615f25 fix: Correct output-count stats for partitioned partial aggs (#22780) by Neil Conway · 9 days ago
  60. 6d4cb31 feat: implement Spark-compatible weekday function (#22740) by 7. Sun · 9 days ago
  61. 92820c8 docs: Update/improve `SELECT` reference (#22672) by Neil Conway · 9 days ago
  62. 5e3bc99 perf: fast-path inline strings in ByteViewGroupValueBuilder::vectorized_append (#21794) by EeshanBembi · 9 days ago
  63. 3f52deb fix: add backtrace for `assert_*_or_internal_err` helpers (#18910) by Raz Luvaton · 9 days ago
  64. 666f862 Revert custom allocator auditing of MemoryPool tracking in SLTs (#22860) by Brent Gardner · 9 days ago
  65. a851dfd refactor: centralize SQL dialect metadata (#22840) by Kumar Ujjawal · 9 days ago
  66. fb3beba refactor(physical-plan): extract make_group_column factory + eager init at try_new + tighten Time variants (#22751) by Qi Zhu · 9 days ago
  67. d23321d Add logical range partitioning representation (#22777) by Gene Bordegaray · 9 days ago
  68. d77a02d feat: Add From<Option<T>> trait for Precision enum (#22792) by WeblWabl · 9 days ago
  69. 40a6454 refactor(hash-aggr): Forward port the soft limit optimization to the new hash aggregation impl (#22824) by Yongting You · 9 days ago
  70. 8f6876d chore(deps-dev): bump shell-quote from 1.8.3 to 1.8.4 in /datafusion/wasmtest/datafusion-wasm-app (#22856) by dependabot[bot] · 9 days ago
  71. d0ee6b5 chore: Define backport criteria (#22766) by Oleks V · 9 days ago
  72. bf71ea6 docs: link release tracking issue to release management page (#22822) by Andrew Lamb · 9 days ago
  73. 7dd1c6a feat: Support IEEE 754 negative zero semantics (#22835) by Oleks V · 9 days ago
  74. c0dc571 fix: regex simplification of anchored patterns produces wrong results (#22727) by linfeng · 10 days ago
  75. 8995ce6 [main] Update version and changelog to 54.0.0 (#22855) by Andrew Lamb · 10 days ago
  76. a0e05f6 Add smj SQL benchmark (#22803) by Bruce Ritchie · 10 days ago
  77. ea5d448 Add clickbench extended SQL benchmark (#22804) by Bruce Ritchie · 10 days ago
  78. 7333b96 Add nlj SQL benchmark (#22805) by Bruce Ritchie · 10 days ago
  79. 8ae94dd add clickbench sorted SQL benchmark (#22807) by Bruce Ritchie · 10 days ago
  80. 0163ac0 chore(deps): bump the all-other-cargo-deps group with 3 updates (#22844) by dependabot[bot] · 10 days ago
  81. 8b6917c Add hj SQL benchmark (#22802) by Bruce Ritchie · 10 days ago
  82. 228a996 chore(deps): bump taiki-e/install-action from 2.81.3 to 2.81.8 (#22841) by dependabot[bot] · 10 days ago
  83. 33eba61 Add tpcds SQL benchmark (#22801) by Bruce Ritchie · 10 days ago
  84. a63fd48 chore(deps): bump github/codeql-action from 4.36.1 to 4.36.2 (#22842) by dependabot[bot] · 10 days ago
  85. 986a712 refactor: Port CaseExpr proto serialization hooks (#22838) by chakkk309 · 10 days ago
  86. 84dcc0b fix: approx_distinct over-counts for utf8view (#22815) by Huaijin · 10 days ago
  87. 9b81ff8 bench: make wide_schema honor DATA_DIR like the other sql_benchmarks (#22836) by Adrian Garcia Badaracco · 10 days ago
  88. bdfdd09 Add example for PartitionedFile schema (#22809) by Filip Petkovski · 10 days ago
  89. ec126e6 minor: More comments to `AggregateMode::PartialReduce` (#22800) by Yongting You · 10 days ago
  90. 0f8a121 bench: add predicate_eval SQL micro-benchmark suite for conjunctive filter evaluation (#22704) by Adrian Garcia Badaracco · 10 days ago
  91. a630994 coerce Union vs scalar in comparisons (#22825) by Matthew Kim · 11 days ago
  92. 1d740ed feat: add array_sum scalar function (#22542) by crm26 · 11 days ago
  93. 883c38e docs: add Boston DataFusion meetup (#22722) by Andrew Lamb · 11 days ago
  94. e4ae236 fix: Coerce aggregate FILTER predicates to boolean (#22774) by pchintar · 11 days ago
  95. c83a981 feat: add DataFrame fill_nan (#22702) by Nagato Yuzuru · 11 days ago
  96. 710e929 fix: Optimize projections in recursive CTEs (#22476) by Nuno Faria · 11 days ago
  97. 04ef3c7 Spark quote function implementation (#22642) by Kazantsev Maksim · 11 days ago
  98. 6fdef65 refactor: Split hash aggregation logic into separated streams (#22729) by Yongting You · 11 days ago
  99. b23e6b6 perf: improve approx_distinct performance 100x when there are fewer distinct values with many groups (#22768) by Huaijin · 11 days ago
  100. d0d993d fix: NestedLoopJoinExec emits spurious unmatched-left rows with multiple probe partitions (#22791) by Nathan · 11 days ago