Sign in
apache
/
datafusion
/
HEAD
933297b
feat(unparser): support binary literals (#23001)
by Louis Vialar
· 6 hours ago
main
6c2221d
bugfix: changed return type of spark's width_bucket to i64 (#22811)
by Eduardo Aguilar
· 6 hours ago
0838a4d
Add StatisticsContext parameter to partition_statistics (#21815)
by Alessandro Solimando
· 14 hours ago
54b7dd9
Unify LRU memory-limiting caches into one generic cache (#22613)
by Michael Kleen
· 14 hours ago
ad8e7b7
refactor: Simplify `approx_distinct` (-200 LoC) (#22921)
by Yongting You
· 20 hours ago
0fc55d0
refactor: make scalar distance u64 and overflow aware (#22892)
by Florian Müller
· 27 hours ago
1f45d83
fix: parquet limit pruning for row group selections (#22942)
by Huaijin
· 27 hours ago
3f4bcf1
fix: Omit NULL values from build side of hash joins (#22893)
by Neil Conway
· 28 hours ago
2da8887
chore: attach Diagnostic to unary operator type errors (#21288)
by Harrison Crosse
· 28 hours ago
2836480
fix: ProjectionPushdown internal error on NestedLoopJoin mark joins (#22902)
by linfeng
· 30 hours ago
ddc157d
Refactor outer join null-rejection analysis to track join sides directly (#22870)
by kosiew
· 2 days ago
7bb6e15
Remove redundant `collect_stat` and `target_partitions` on `ListingOptions` (#22969)
by Gabriel
· 2 days ago
8172873
chore(deps): bump the all-other-cargo-deps group across 1 directory with 6 updates (#22975)
by dependabot[bot]
· 2 days ago
96a6096
feat: support reading from stdin in datafusion-cli (#22839)
by Xin Huang
· 3 days ago
a0e6d49
Make LogicalPlan::Unnest expression/rebuild contracts consistent (#22783)
by Nathan
· 3 days ago
c7e9284
refactor: use raw view access in do_append_val_inner and consolidate duplicated logic (#22907)
by EeshanBembi
· 3 days ago
408dad3
Add MERGE INTO types to datafusion-expr (#20763)
by Xuanyi Li
· 3 days ago
d5f03d9
Fix leaf expression reconciliation (#22971)
by Peter L
· 3 days ago
6176a6d
Add `.gitignore` for `proto-models` (#22977)
by Jeffrey Vo
· 3 days ago
fa271ce
refactor: Update SortMergeJoin to use async spill abstractions (#22230)
by pantShrey
· 3 days ago
ae5f3f5
chore(deps): bump prost-build from 0.14.3 to 0.14.4 (#22843)
by dependabot[bot]
· 3 days ago
fbd64b4
chore(deps): update pydata-sphinx-theme requirement from <1,>=0.18.0 to >=0.19.0,<1 in /docs (#22972)
by dependabot[bot]
· 3 days ago
2282d23
chore(deps): bump taiki-e/install-action from 2.81.8 to 2.81.11 (#22973)
by dependabot[bot]
· 3 days ago
46d241d
chore(deps): update maturin requirement from <2,>=1.13.3 to >=1.14.0,<2 in /docs (#22974)
by dependabot[bot]
· 3 days ago
0fb650a
chore(deps): bump insta-cmd from 0.6.0 to 0.7.0 (#22976)
by dependabot[bot]
· 3 days ago
baa497d
fix: Disable join dynamic filters for null-equal joins (#22965)
by Neil Conway
· 3 days ago
8cda78b
Remove orphaned `snowflake_flatten_validation.sql` script (#22938)
by Adam Gutglick
· 3 days ago
3c6734e
refactor: Simplify heap size estimation for arrays (#22954)
by Michael Kleen
· 3 days ago
15bc933
chore(deps): bump cryptography from 46.0.7 to 48.0.1 (#22968)
by dependabot[bot]
· 3 days ago
9849513
chore(deps-dev): bump launch-editor from 2.10.0 to 2.14.1 in /datafusion/wasmtest/datafusion-wasm-app (#22970)
by dependabot[bot]
· 3 days ago
152d8c4
Add `file_row_index` UDF to query file-level row indexes from Parquet files (#22604)
by Adam Gutglick
· 3 days ago
a1e88e2
feat: decimal support for gcd and lcm (#22655)
by theirix
· 3 days ago
a66c898
ci: Setup valid `Cargo.lock` for `depcheck` to unblock CI (#22933)
by Adam Gutglick
· 3 days ago
49b99bb
chore(deps): bump pyjwt from 2.12.0 to 2.13.0 (#22966)
by dependabot[bot]
· 3 days ago
127731b
Include `null_aware` status in the relevant Join node display implementations (#22913)
by Adam Gutglick
· 3 days ago
c14379b
refactor: remove `opt_filter` in `GroupsAccumulator::merge_batch` (#22816)
by Huaijin
· 4 days ago
dede33c
refactor(hash-aggr): Migrate existing tests on `GroupsHashAggregateStream` (#22953)
by Yongting You
· 4 days ago
e20763c
refactor(hash-aggr): Migrate the partial aggregation skip optimization to the new hash aggregation impl (#22899)
by Yongting You
· 4 days ago
99895e6
refactor: Simplify heap size estimation for types that own no heap allocations (#22918)
by Michael Kleen
· 4 days ago
6520315
fix(topk): call attempt_early_completion when filter rejects entire batch (#22852)
by ajegou
· 4 days ago
d428760
fix: count shared buffers once in hash join build-side memory accounting (#22862)
by Jordan Epstein
· 5 days ago
cb2542c
fix: TRY_CAST returns NULL for timestamp/date overflow (#22897)
by fys
· 5 days ago
78033fa
refactor: introduce ProbeEnd state in NestedLoopJoinExec (#22865)
by Nathan
· 6 days ago
a7280b8
FFI: plumb `placement` for `FFI_ScalarUDF` (#22608)
by Amogh Ramesh
· 6 days ago
58e37a0
Clearly gate sliding SUM(DISTINCT) type support (#22866)
by Kumar Ujjawal
· 6 days ago
3bece3d
Upgrade minimal tokio-postgres version to address security advisory (#22937)
by Adam Gutglick
· 6 days ago
574a1e6
fix: preserve Spark next_day whitespace validation (#22720)
by Ahmed EL.
· 6 days ago
e5f7af1
feat(spark): add `concat_ws` with array support (#20928)
by David López
· 6 days ago
f931728
fix: handle `date_bin` negative subsecond and overflow cases (#22610)
by Kumar Ujjawal
· 6 days ago
37768b8
fix: Enable sliding window execution for covar_pop, covar_samp, and corr (#22764)
by pchintar
· 6 days ago
b8998c7
perf: Convert inner joins to semi joins when equivalent (#22652)
by Neil Conway
· 8 days ago
3b321a2
docs: link to 2026 Q3-Q4 roadmap discussion (#22884)
by Andrew Lamb
· 8 days ago
db2d21e
fix: preserve async UDF return field metadata (#22663)
by Kristin Cowalcijk
· 8 days ago
dae03ee
chore: Update to arrow/parquet 59.0.0 (#22744)
by Andrew Lamb
· 9 days ago
8bbc460
fix: Avoid incorrectly rounding large integers in `nanvl` (#22575)
by Neil Conway
· 9 days ago
656dc47
Add sort tpch SQL benchmark (#22814)
by Bruce Ritchie
· 9 days ago
8f06354
minor: handle NULL array input in array_remove and array_replace (#22790)
by linfeng
· 9 days ago
68cbe60
fix: map() fails when keys are literals and values are column expressions (#22784)
by Nathan
· 9 days ago
5615f25
fix: Correct output-count stats for partitioned partial aggs (#22780)
by Neil Conway
· 9 days ago
6d4cb31
feat: implement Spark-compatible weekday function (#22740)
by 7. Sun
· 9 days ago
92820c8
docs: Update/improve `SELECT` reference (#22672)
by Neil Conway
· 9 days ago
5e3bc99
perf: fast-path inline strings in ByteViewGroupValueBuilder::vectorized_append (#21794)
by EeshanBembi
· 9 days ago
3f52deb
fix: add backtrace for `assert_*_or_internal_err` helpers (#18910)
by Raz Luvaton
· 9 days ago
666f862
Revert custom allocator auditing of MemoryPool tracking in SLTs (#22860)
by Brent Gardner
· 9 days ago
a851dfd
refactor: centralize SQL dialect metadata (#22840)
by Kumar Ujjawal
· 9 days ago
fb3beba
refactor(physical-plan): extract make_group_column factory + eager init at try_new + tighten Time variants (#22751)
by Qi Zhu
· 9 days ago
d23321d
Add logical range partitioning representation (#22777)
by Gene Bordegaray
· 9 days ago
d77a02d
feat: Add From<Option<T>> trait for Precision enum (#22792)
by WeblWabl
· 9 days ago
40a6454
refactor(hash-aggr): Forward port the soft limit optimization to the new hash aggregation impl (#22824)
by Yongting You
· 9 days ago
8f6876d
chore(deps-dev): bump shell-quote from 1.8.3 to 1.8.4 in /datafusion/wasmtest/datafusion-wasm-app (#22856)
by dependabot[bot]
· 9 days ago
d0ee6b5
chore: Define backport criteria (#22766)
by Oleks V
· 9 days ago
bf71ea6
docs: link release tracking issue to release management page (#22822)
by Andrew Lamb
· 9 days ago
7dd1c6a
feat: Support IEEE 754 negative zero semantics (#22835)
by Oleks V
· 9 days ago
c0dc571
fix: regex simplification of anchored patterns produces wrong results (#22727)
by linfeng
· 10 days ago
8995ce6
[main] Update version and changelog to 54.0.0 (#22855)
by Andrew Lamb
· 10 days ago
a0e05f6
Add smj SQL benchmark (#22803)
by Bruce Ritchie
· 10 days ago
ea5d448
Add clickbench extended SQL benchmark (#22804)
by Bruce Ritchie
· 10 days ago
7333b96
Add nlj SQL benchmark (#22805)
by Bruce Ritchie
· 10 days ago
8ae94dd
add clickbench sorted SQL benchmark (#22807)
by Bruce Ritchie
· 10 days ago
0163ac0
chore(deps): bump the all-other-cargo-deps group with 3 updates (#22844)
by dependabot[bot]
· 10 days ago
8b6917c
Add hj SQL benchmark (#22802)
by Bruce Ritchie
· 10 days ago
228a996
chore(deps): bump taiki-e/install-action from 2.81.3 to 2.81.8 (#22841)
by dependabot[bot]
· 10 days ago
33eba61
Add tpcds SQL benchmark (#22801)
by Bruce Ritchie
· 10 days ago
a63fd48
chore(deps): bump github/codeql-action from 4.36.1 to 4.36.2 (#22842)
by dependabot[bot]
· 10 days ago
986a712
refactor: Port CaseExpr proto serialization hooks (#22838)
by chakkk309
· 10 days ago
84dcc0b
fix: approx_distinct over-counts for utf8view (#22815)
by Huaijin
· 10 days ago
9b81ff8
bench: make wide_schema honor DATA_DIR like the other sql_benchmarks (#22836)
by Adrian Garcia Badaracco
· 10 days ago
bdfdd09
Add example for PartitionedFile schema (#22809)
by Filip Petkovski
· 10 days ago
ec126e6
minor: More comments to `AggregateMode::PartialReduce` (#22800)
by Yongting You
· 10 days ago
0f8a121
bench: add predicate_eval SQL micro-benchmark suite for conjunctive filter evaluation (#22704)
by Adrian Garcia Badaracco
· 10 days ago
a630994
coerce Union vs scalar in comparisons (#22825)
by Matthew Kim
· 11 days ago
1d740ed
feat: add array_sum scalar function (#22542)
by crm26
· 11 days ago
883c38e
docs: add Boston DataFusion meetup (#22722)
by Andrew Lamb
· 11 days ago
e4ae236
fix: Coerce aggregate FILTER predicates to boolean (#22774)
by pchintar
· 11 days ago
c83a981
feat: add DataFrame fill_nan (#22702)
by Nagato Yuzuru
· 11 days ago
710e929
fix: Optimize projections in recursive CTEs (#22476)
by Nuno Faria
· 11 days ago
04ef3c7
Spark quote function implementation (#22642)
by Kazantsev Maksim
· 11 days ago
6fdef65
refactor: Split hash aggregation logic into separated streams (#22729)
by Yongting You
· 11 days ago
b23e6b6
perf: improve approx_distinct performance 100x when there are fewer distinct values with many groups (#22768)
by Huaijin
· 11 days ago
d0d993d
fix: NestedLoopJoinExec emits spurious unmatched-left rows with multiple probe partitions (#22791)
by Nathan
· 11 days ago
Next »