apache / datafusion / HEAD
e74e58f
fix: move overflow guard before dense ratio in hash join to prevent overflows (#20998)
by Burak Şen
· 5 hours ago
main
8142308
chore(deps): Revert "chore(deps): bump runs-on/action from 2.0.3 to 2.1.0 (#20980)" (#21002)
by Matt Butrovich
· 7 hours ago
fd145c4
[Minor] Update Cargo.lock, Fix Tokio minor breaking change (#20978)
by Daniël Heres
· 8 hours ago
50b6bf8
fix: Run release verification with `--profile=ci` (#20987)
by Andrew Lamb
· 8 hours ago
84a79e1
fix: InList Dictionary filter pushdown type mismatch (#20962)
by Adam Curtis
· 9 hours ago
11b9693
chore(deps): bump runs-on/action from 2.0.3 to 2.1.0 (#20980)
by dependabot[bot]
· 15 hours ago
8a95c4c
chore(deps): bump astral-sh/setup-uv from 7.3.1 to 7.6.0 (#20981)
by dependabot[bot]
· 15 hours ago
13a39d7
chore(deps): bump github/codeql-action from 4.32.6 to 4.33.0 (#20982)
by dependabot[bot]
· 15 hours ago
1e99ed5
chore(deps): bump taiki-e/install-action from 2.68.25 to 2.68.34 (#20983)
by dependabot[bot]
· 15 hours ago
9756146
chore(deps): bump Swatinem/rust-cache from 2.8.2 to 2.9.1 (#20979)
by dependabot[bot]
· 15 hours ago
4c96125
Fix decimal log precision for non-power values (#20433)
by Kumar Ujjawal
· 18 hours ago
0dfcd97
Replace ahash with foldhash for faster hashing in datafusion-common (#20958)
by Daniël Heres
· 18 hours ago
fa6706a
chore(deps): bump lz4_flex from 0.12.0 to 0.12.1 (#20973)
by dependabot[bot]
· 19 hours ago
972b890
thin-ci (#20972)
by Dmitrii Blaginin
· 27 hours ago
a7e0941
fix(spark): return input string for PATH/FILE on schemeless URLs in `parse_url` (#20506)
by David López
· 27 hours ago
26251bb
Use exact distinct_count from statistics if exists for `COUNT(DISTINCT column))` calculations (#20845)
by Burak Şen
· 27 hours ago
bd071be
feat: add `custom_string_literal_override` to unparser Dialect trait (#20590)
by Jax Liu
· 27 hours ago
4166a6d
perf: Optimize comparison on nested types (#20716)
by Neil Conway
· 27 hours ago
5d37bab
chore: Remove usage of `paste` crate (#20946)
by Bhargava Vadlamani
· 27 hours ago
c6f7145
Extract shared `ParquetReadPlan` for leaf column resolution (#20913)
by Matthew Kim
· 31 hours ago
3ece9ec
test: add reproducer for Dictionary InList pushdown type mismatch (#2… (#20960)
by Adam Curtis
· 33 hours ago
b61aee7
Minor: make signatures of `SessionContext::register_*` methods consistent (#20873)
by Alexandre Crayssac
· 34 hours ago
8609288
Add --simulate-latency / SIMULATE_LATENCY option to dfbench / ./bench.sh (#20954)
by Daniël Heres
· 2 days ago
ab28234
Support `columns_sorted` in row_filters (#20497)
by Konstantin Tarasov
· 2 days ago
1f59d32
fix: dfbench respects DATAFUSION_RUNTIME_MEMORY_LIMIT env var (#20631)
by Adrian Garcia Badaracco
· 2 days ago
538a201
perf: Optimize array set ops on sliced arrays (#20693)
by Neil Conway
· 3 days ago
9b7d092
Add AGENTS.md / CLAUDE.md (#20939)
by Daniël Heres
· 3 days ago
6d3a846
Rewrite `SUM(expr + scalar)` --> `SUM(expr) + scalar*COUNT(expr)` (#20749)
by Andrew Lamb
· 3 days ago
5db04b8
chore(deps): bump pyjwt from 2.11.0 to 2.12.0 (#20938)
by dependabot[bot]
· 4 days ago
c74976f
minor: Move PreparedAccessPlan to same module as ParquetAccessPlan (#20929)
by Andrew Lamb
· 4 days ago
9c3c01a
refactor: Improve `SessionContext::parse_duration` API (#20816)
by Eren Avsarogullari
· 4 days ago
2c871b2
Project only accessed struct leaves in Parquet row filter pushdown (#20854)
by Matthew Kim
· 4 days ago
d09ff92
feat: Reduce allocations for aggregating `Statistics` (#20768)
by Jonathan Chen
· 4 days ago
10d8bcb
Add support for ListView in unnest (#20760)
by Frederic Branczyk
· 4 days ago
d2278a9
Check sqllogictests for any dangling config settings (#17914) (#20838)
by Sergey Zhukov
· 5 days ago
b7e4213
Include .proto files in datafusion-proto-common distribution (#20921)
by Huaijin
· 5 days ago
3c56e5d
perf: Use batched row conversion for `array_has_any`, `array_has_all` (#20588)
by Neil Conway
· 5 days ago
422b545
fix: Track metrics in hash joins with empty build sides (#20810)
by Nuno Faria
· 5 days ago
fcb1c93
Fix duplicate group keys after hash aggregation spill (#20724) (#20858)
by Guillaume Boucher
· 5 days ago
57b275a
feat: correct struct column names for `arrays_zip` return type (#20886)
by Oleks V
· 5 days ago
8b412de
fix: Optimize `!~ '.*'` case to `col IS NULL AND Boolean(NULL)` instead of `Eq ""` (#20702)
by Peter Nguyen
· 5 days ago
385d9db
try to remove redundant alias in expression rewriter and select (#20867)
by Burak Şen
· 5 days ago
6b71523
[main] Bump to 52.3.0 and changelog (#20790) (#20849)
by Andrew Lamb
· 5 days ago
4b022c0
fix: do not recompute hash join exec properties if not required (#20900)
by Albert Skalt
· 5 days ago
8d9b080
[Minor] propagate distinct_count as inexact through unions (#20846)
by Burak Şen
· 6 days ago
129c58f
fix: Remove `!=0` check from `supports_collect_by_thresholds` (#20730)
by Jonathan Chen
· 6 days ago
981b5c3
chore(deps): bump substrait from 0.62 to 0.63.0 (#20876)
by Ben Bellick
· 6 days ago
f8fb5bd
fix: Avoid unnecessary type casts in `concat_ws` (#20436)
by Neil Conway
· 6 days ago
d68b800
Minor: Deprecate unused `PartitionedFileStream` (#20869)
by Andrew Lamb
· 6 days ago
21cf60a
Add Null Type Coercions for Placeholders (#20543)
by Peter L
· 6 days ago
1efcbf5
Add benchmark for struct field filter pushdown in Parquet (#20829)
by Matthew Kim
· 6 days ago
ed793f0
chore: clean up dependencies (#20861)
by Oleks V
· 6 days ago
95a3dfd
chore: Ignore RUSTSEC-2024-0014 (#20862)
by Oleks V
· 6 days ago
da05287
Fix FileStream scanning_total to include sync next-file open time (#20627)
by Ratul Dawar
· 7 days ago
4bac1cf
impl ser/de for preserve_order in RepartitionExec (#20798)
by Huaijin
· 7 days ago
2589fa8
doc: Add documentation for pushing limit into plan (#20271)
by Yongting You
· 7 days ago
86cb815
[Minor] Remove redundant ProjectionExec nodes in sort-based plans (#20780)
by Daniël Heres
· 7 days ago
48199b9
Use `ParquetPushDecoder` in `ParquetOpener` (#20839)
by Daniël Heres
· 7 days ago
9b7cdda
chore(deps): bump quinn-proto from 0.11.13 to 0.11.14 (#20859)
by dependabot[bot]
· 7 days ago
5af7361
fix: SanityCheckPlan error with window functions and NVL filter (#20231)
by EeshanBembi
· 7 days ago
8e02b8e
chore: Ignore RUSTSEC-2024-0421 (#20850)
by Oleks V
· 7 days ago
64b5228
chore(deps): bump github/codeql-action from 4.32.5 to 4.32.6 (#20843)
by dependabot[bot]
· 7 days ago
31a4037
chore(deps): bump taiki-e/install-action from 2.68.16 to 2.68.25 (#20842)
by dependabot[bot]
· 7 days ago
fc514c2
perf: Optimize set operations to avoid RowConverter deserialization overhead (#20623)
by Neil Conway
· 7 days ago
daa8f52
fix: interval analysis error when have two filterexec that inner filter proves zero selectivity (#20743)
by Huaijin
· 7 days ago
1f87930
fix: sqllogictest cannot convert <subquery> to Substrait (#19739)
by Kumar Ujjawal
· 7 days ago
6f86c8d
Used constant with mapping instead of write! to display scalar value bytes (#20719)
by Burak Şen
· 7 days ago
23b88fb
Allow filters on struct fields to be pushed down into Parquet scan (#20822)
by Matthew Kim
· 7 days ago
af79d14
Make translate emit Utf8View for Utf8View input (#20624)
by Shivaang
· 7 days ago
39226c3
[datafusion-cli] Replace mutex with AtomicU64 for stream duration tracking in instrumentedObjectStore (#20802)
by Burak Şen
· 8 days ago
75c7da5
Pass ConfigOptions to scalar UDFs via FFI (#20454)
by Tim Saucer
· 8 days ago
fd97799
Make Physical CastExpr Field-aware and unify cast semantics across physical expressions (#20814)
by kosiew
· 8 days ago
84a22ea
Wrap Arc to Statistics for `partition_statistics` API (#20570)
by xudong.w
· 8 days ago
44dfa7b
Clean up date_part preimage implementation (#20350)
by Konstantin Tarasov
· 8 days ago
aca8c14
Fix FilterExec converting Absent column stats to Exact(NULL) (#20391)
by Filip Wojciechowski
· 8 days ago
097f04c
fix(spark): handle divide-by-zero in Spark `mod`/`pmod` with ANSI mode support (#20461)
by David López
· 8 days ago
9b3d6a4
Make lower and upper emit Utf8View for Utf8View input (#20616)
by Kumar Ujjawal
· 8 days ago
b51edff
Update reverse UDF to emit utf8view when input is utf8view (#20604)
by Bruce Ritchie
· 8 days ago
33b9afa
Allow SQL `TypePlanner` to plan SQL types as extension types (#20676)
by Dewey Dunnington
· 8 days ago
bb421db
Add tests for simplifying multiple aggregate expressions (#20723)
by Andrew Lamb
· 8 days ago
15bc6bd
feat: make DefaultLogicalExtensionCodec support serialisation of buil… (#20638)
by Acfboy
· 9 days ago
37b9a46
feat: `partition_statistics()` for HashJoinExec (#20711)
by Jonathan Chen
· 9 days ago
1eb5206
feat: Integrate CastColumnExpr into PhysicalExprAdapter (#20269)
by Kumar Ujjawal
· 9 days ago
4dbb449
ser/de fetch in FilterExec (#20738)
by Huaijin
· 9 days ago
92078d9
Copy limits before repartitions (#20736)
by Brent Gardner
· 10 days ago
8fe926d
test: Add `datafusion-cli` `fair` and `unbounded` memory-pool test coverage (#20565)
by Eren Avsarogullari
· 10 days ago
bfa0ea8
Hash join buffering on probe side (#19761)
by Gabriel
· 11 days ago
5211a8b
Fix repartition from dropping data when spilling (#20672)
by Xander
· 11 days ago
0ac434d
Add case-heavy LEFT JOIN benchmark and debug timing/logging for PushDownFilter hot paths (#20664)
by kosiew
· 11 days ago
02ce571
Push even local limits past windows (#20752)
by Brent Gardner
· 11 days ago
d72b0b8
fix: preserve None projection semantics across FFI boundary in ForeignTableProvider::scan (#20393)
by Kristin Cowalcijk
· 11 days ago
33c922f
use linker optimization for extended sqllogictests (#20740)
by Dmitrii Blaginin
· 11 days ago
678d1ad
Minor: Add comment explaining rationale to avoid dependencies on functions (#20667)
by Andrew Lamb
· 12 days ago
b3976d6
Fix DELETE/UPDATE filter extraction when predicates are pushed down into TableScan (#19884)
by kosiew
· 12 days ago
dd988f6
Fix test that's broken on Windows due to naive path handling (#20692)
by Alexander Rafferty
· 12 days ago
a95da70
doc: Add more context to `Precision` (#20713)
by Jonathan Chen
· 12 days ago
03d17e8
Improve documentation for `AggregateUdfImpl::simplify` and `WindowUDFImpl::simplify` (#20712)
by Andrew Lamb
· 12 days ago
631c918
perf: sort replace free()->try_grow() pattern with try_resize() to reduce memory pool interactions (#20729)
by Matt Butrovich
· 12 days ago
13cebf8
FFI_TableOptions are using default values only (#20721)
by Tim Saucer
· 12 days ago
00e36e8
fix: Return `probe_side.len()` for RightMark/Anti count(*) queries (#20710)
by Jonathan Chen
· 12 days ago