1. d140d2a add copilot ignore (#204) by Dmitrii Blaginin · 13 days ago main
  2. 13884fe Merge queue: make dev checks required + add .asf.yaml validation (#197) by Dmitrii Blaginin · 13 days ago
  3. d022cee Merge remote-tracking branch 'sandbox/main' into sandbox-main by blaginin · 13 days ago
  4. 18e6d8e Merge branch 'main' into sandbox-main by blaginin · 13 days ago
  5. 1a0af76 No cargo test for `sort_mem_validation` (#21222) by Dmitrii Blaginin · 13 days ago
  6. 2c03881 Add metric category filtering for EXPLAIN ANALYZE (#21160) by Adrian Garcia Badaracco · 14 days ago
  7. b09205a fix: propagate errors for unsupported table function arguments instead of silently dropping them (#21135) by Burak Şen · 2 weeks ago
  8. a910b03 fix(unparser): Fix BigQuery timestamp literal format in SQL unparsing (#21103) by Sergei Grebnov · 2 weeks ago
  9. 9f893a4 perf: Optimize `split_part`, support `Utf8View` (#21119) by Neil Conway · 2 weeks ago
  10. e913557 Fix: MemTable LIMIT ignored with reordered projections (#21177) by Ramakrishna Chilaka · 2 weeks ago
  11. 37c1b75 test: scale remaining sort-merge join (SMJ) benchmark queries (#21200) by Matt Butrovich · 2 weeks ago
  12. 580b0ab Use leaf level `ProjectionMask` for parquet projections (#20925) by Matthew Kim · 2 weeks ago
  13. 37cd3de [main] add 52.4.0 changelog (#21053) by Andrew Lamb · 2 weeks ago
  14. 0359a3c fix[physical-plan/aggregates]: fix grouping by Ree<Dict> (#21195) by Alfonso Subiotto Marqués · 2 weeks ago
  15. 1e3b956 Minor: compute qualify window expressions only when QUALIFY clause is present (#21173) by Burak Şen · 2 weeks ago
  16. 20434b0 Reduce parquet struct projection benchmark data volume (#21187) by Matthew Kim · 2 weeks ago
  17. 07ddfd7 Fix sort merge interleave overflow (#20922) by xudong.w · 2 weeks ago
  18. 627faba Migrate Avro reader to arrow-avro and remove internal conversion code (#17861) by Namgung Chan · 2 weeks ago
  19. 37978e3 [main] Update version to 53.0.0 and bring changelog (#21189) by Andrew Lamb · 2 weeks ago
  20. 38cc8e6 test: add SMJ benchmarks from #21184 (#21188) by Matt Butrovich · 2 weeks ago
  21. 1624d63 perf: Add support for `GroupsAccumulator` to `string_agg` (#21154) by Neil Conway · 2 weeks ago
  22. fb12029 chore: re-export projection in datafusion::datasource (#21185) by Raz Luvaton · 2 weeks ago
  23. 1416ed4 Add benchmarks for Parquet struct leaf-level projection pruning (#21180) by Matthew Kim · 2 weeks ago
  24. ba399a8 docs: add KalamDB to known users (#21181) by Jamal Saad · 2 weeks ago
  25. 7cbc6b4 Remove as_any from scalar UDF trait definition (#20812) by Tim Saucer · 2 weeks ago
  26. 4f13319 feat: implement PhysicalOptimizerRule in FFI crate (#20451) by Tim Saucer · 2 weeks ago
  27. 6980fcf docs: Add explicit fmt and clippy commands to AGENTS.md (#21171) by Qi Zhu · 2 weeks ago
  28. 4e2e4e8 fix: Df int timestamp cast fix failing CI (#21163) by Bhargava Vadlamani · 2 weeks ago
  29. 509ad09 Improvement: keep order-preserving repartitions for streaming aggregates (#21107) by xudong.w · 2 weeks ago
  30. 757ce78 chore: Optimize schema rewriter usages (#21158) by Oleks V · 2 weeks ago
  31. 69cd666 fix: skip empty metadata in intersect_metadata_for_union to prevent s… (#21127) by Rafael Herrero · 2 weeks ago
  32. c4562dc [Minor]: support window functions in order by expressions (#20963) by Burak Şen · 2 weeks ago
  33. 139b0b4 feat : support spark compatible int to timestamp cast (#20555) by Bhargava Vadlamani · 2 weeks ago
  34. 78820c5 chore(deps): bump requests from 2.32.5 to 2.33.0 (#21153) by dependabot[bot] · 2 weeks ago
  35. 51f13d7 perf: specialized SemiAntiSortMergeJoinStream (#20806) by Matt Butrovich · 2 weeks ago
  36. 9b726bc Enable debug assertions in CI. (#20832) by Stu Hood · 2 weeks ago
  37. 6e0dde0 fix(stats): widen sum_value integer arithmetic to SUM-compatible types (#20865) by Kumar Ujjawal · 2 weeks ago
  38. e62533b Substrait join consumer should not merge nullability of join keys (#21121) by Haresh Khanna · 2 weeks ago
  39. 4084a18 Simplify sqllogictest timing summary to boolean flag and remove top-N modes (#20598) by kosiew · 2 weeks ago
  40. a074902 Update to arrow/parquet `58.1.0` (#21044) by Andrew Lamb · 2 weeks ago
  41. a3dc8fa chore(deps): bump github/codeql-action from 4.33.0 to 4.34.1 (#21132) by dependabot[bot] · 2 weeks ago
  42. 0294c6c chore(deps): bump taiki-e/install-action from 2.68.34 to 2.69.7 (#21133) by dependabot[bot] · 2 weeks ago
  43. d530276 Fix `elapsed_compute` metric for Parquet DataSourceExec (#20767) by Ernest Provo · 3 weeks ago
  44. f30b85c chore(deps): bump env_logger from 0.11.9 to 0.11.10 in the all-other-cargo-deps group across 1 directory (#21136) by dependabot[bot] · 3 weeks ago
  45. 8d47fc6 Spark soundex function implementation (#20725) by Kazantsev Maksim · 3 weeks ago
  46. aa9d819 perf: optimize map validation for common key types (#20805) by lyne · 3 weeks ago
  47. 4d5aea4 perf: Optimize `array_min`, `array_max` for arrays of primitive types (#21101) by Neil Conway · 3 weeks ago
  48. 10fae81 Fix push_down_filter for children with non-empty fetch fields (#21057) by Shiv Bhatia · 3 weeks ago
  49. dc9098e chore: update testcontainers and astral-tokio-tar for cargo audit (#21114) by Namgung Chan · 3 weeks ago
  50. 76075e2 Preserve SPM when parent maintains input order (#21097) by Rohan Krishnaswamy · 3 weeks ago
  51. 98defe6 chore: Fix all sqllogictest dangling configs (#21108) by Yongting You · 3 weeks ago
  52. 7f29cb0 Add `arrow_try_cast` UDF (#21130) by Adrian Garcia Badaracco · 3 weeks ago
  53. 56e097a perf: optimize scatter with type-specific specialization (#20498) by Yu-Chuan Hung · 3 weeks ago
  54. cdaecf0 fix: string_to_array('', delim) returns empty array for PostgreSQL compatibility (#21104) by dd-david-levin · 3 weeks ago
  55. 2b986c8 Fix index panic in unparser with mismatched stacked projections (#21094) by Matthew Kim · 3 weeks ago
  56. bec6714 docs: Document the TableProvider evaluation order for filter, limit and projection (#21091) by Andrew Lamb · 3 weeks ago
  57. dfc8bb7 fix(sql): handle GROUP BY ALL with aliased aggregates (#20943) by Kumar Ujjawal · 3 weeks ago
  58. 293a880 Avoid creating new RecordBatches to simplify expressions (#20534) by Andrew Lamb · 3 weeks ago
  59. 4a7330f Add a builder to `SimplifyContext` to avoid allocating default values (#21092) by Adam Gutglick · 3 weeks ago
  60. ec7c9ab chore: Add `substr()` benchmarks, refactor (#20803) by Neil Conway · 3 weeks ago
  61. 2b7d4f9 feat: create a datafusion-example for in-memory file format (#20394) by Kumar Ujjawal · 3 weeks ago
  62. abf8f61 fix[physical-expr-adapter]: support casting structs nested inside complex types (#20907) by Alfonso Subiotto Marqués · 3 weeks ago
  63. 6796053 test: update sqllogictest expectation for negation type coercion (#21102) by yaommen · 3 weeks ago
  64. 646d183 docs: clarify NULL handling for array_remove functions (#21014) (#21018) by Rizky Mirzaviandy Priambodo · 3 weeks ago
  65. 93724b0 fix: validate wrapped negation during type coercion (#20965) by yaommen · 3 weeks ago
  66. 85a34e9 Fix CTE reference resolution slt tests (#21049) by Jonah Gao · 3 weeks ago
  67. 59e8a79 fix: use datafusion_expr instead of datafusion crate in spark (#21043) by David López · 3 weeks ago
  68. 878b879 docs: Improve getting started and testing guides for humans and agents (#20970) by Andrew Lamb · 3 weeks ago
  69. 448a395 Super fast extended tests and improved planning speed linux (#21084) by Dmitrii Blaginin · 3 weeks ago
  70. e5c69a4 perf: Optimize `array_sort()` (#21083) by Neil Conway · 3 weeks ago
  71. 3bdcdf5 perf: add in-place fast path for ScalarValue::add (#20959) by Kumar Ujjawal · 3 weeks ago
  72. 415bd42 chore: Remove duplicate imports in test code (#21061) by Neil Conway · 3 weeks ago
  73. d6cffd7 optimizer: Add configuration to disable join reordering (#21072) by Yongting You · 3 weeks ago
  74. 895d852 chore(deps): bump rustls-webpki from 0.103.9 to 0.103.10 (#21089) by dependabot[bot] · 3 weeks ago
  75. 8fdac0f Restore Sort unparser guard for correct ORDER BY placement (#20658) by Viktor Yershov · 3 weeks ago
  76. 5e54b89 perf: Optimize `lpad()`, `rpad()` for scalar args (#20657) by Neil Conway · 3 weeks ago
  77. 4c195b4 perf: Optimize `strpos()` for scalar needle, plus optimize UTF-8 codepath (#20754) by Neil Conway · 3 weeks ago
  78. 1cb4de4 perf: Optimize `approx_distinct` for inline Utf8View (#21064) by Neil Conway · 3 weeks ago
  79. f734ec5 Fix Subtraction overflow in `max_distinct_count` when hash join has a pushed-down limit (#20799) by Kartik Gupta · 3 weeks ago
  80. 8f721a6 keep fetch when merge FilterExec in FilterPushdown (#21070) by Huaijin · 3 weeks ago
  81. 78d5ac6 Remove file prefetching from FileStream (#20916) by Daniël Heres · 3 weeks ago
  82. be273d6 Support '>', '<', '>=', '<=', '<>' in any operator (#20830) by Burak Şen · 3 weeks ago
  83. 9885f4b fix: `arrays_zip/list_zip` allow single array argument (#21047) by hsiang-c · 3 weeks ago
  84. 6ef4cef chore: Cleanup fully-qualified ScalarFunctionArgs (#20804) by Neil Conway · 3 weeks ago
  85. 897b5c1 feat: support repartitioning of FFI execution plans (#20449) by Tim Saucer · 3 weeks ago
  86. c792700 build: update Rust toolchain version to 1.94.0 (#21045) by dario curreri · 3 weeks ago
  87. 4010a55 Add support for nested lists in substrait consumer (#20953) by alexanderbianchi · 3 weeks ago
  88. 4ae19eb fix: update clickbench expected plan for NDV-aware optimization (#21050) by Alessandro Solimando · 3 weeks ago
  89. 7014a45 feat: Extract NDV (distinct_count) statistics from Parquet metadata (#19957) by Alessandro Solimando · 3 weeks ago
  90. d138c36 infra: automatically delete branch on pr merge (#21033) by Kevin Liu · 3 weeks ago
  91. 317052e perf: Optimize `approx_distinct()` for string, binary inputs (#21037) by Neil Conway · 3 weeks ago
  92. 7e4818d fix: improve GroupOrdering docs (#20994) by Andrew Lamb · 3 weeks ago
  93. b6b542e perf: Optimize `array_positions()` for scalar needle (#20770) by Neil Conway · 3 weeks ago
  94. a6a4df9 Fix memory reservation starvation in sort-merge (#20642) by xudong.w · 3 weeks ago
  95. b7a3f53 docs: in release email, be specific about changelog location (#20975) by Kevin Liu · 3 weeks ago
  96. cf0a182 Simplify logic for memory pressure partial emit from ordered group by (#20559) by Andrew Lamb · 3 weeks ago
  97. 6ab16cc bug: fix `array_remove_*` with NULLS (#21013) by Oleks V · 4 weeks ago
  98. e74e58f fix: move overflow guard before dense ratio in hash join to prevent overflows (#20998) by Burak Şen · 4 weeks ago
  99. 8142308 chore(deps): Revert "chore(deps): bump runs-on/action from 2.0.3 to 2.1.0 (#20980)" (#21002) by Matt Butrovich · 4 weeks ago
  100. fd145c4 [Minor] Update Cargo.lock, Fix Tokio minor breaking change (#20978) by Daniël Heres · 4 weeks ago