apache / datafusion-sandbox / HEAD
6b83141
Merge branch 'main' into sanbox-main
by blaginin
· 4 days ago
main
545c37f
perf: optimise right for byte access and StringView (#20069)
by theirix
· 5 days ago
1a0c2e0
Use return_field_from_args in information schema and date_trunc (#20079)
by Andrea Bozzo
· 5 days ago
c849374
Refactor `iszero()` and `isnan()` to accept all numeric types (#20093)
by Kumar Ujjawal
· 5 days ago
51c0475
Implement preimage for floor function to enable predicate pushdown (#20059)
by Devanshu
· 6 days ago
3797995
perf: improve performance of `array_repeat` function (#20049)
by lyne
· 6 days ago
f779e96
perf: Optimize scalar path for chr function (#20073)
by Kumar Ujjawal
· 6 days ago
9de192a
docs: update data_types.md to reflect current Arrow type mappings (#20072)
by karuppuchamysuresh
· 6 days ago
bc4c245
Automatically generate examples documentation adv (#19294) (#19750)
by Sergey Zhukov
· 6 days ago
a02e683
Reduce number of traversals per node in `PhysicalExprSimplifier` (#20082)
by Adam Gutglick
· 7 days ago
4a63659
Minor: update tests in limit_pushdown.rs to insta (#20066)
by Andrew Lamb
· 7 days ago
b50e1f7
chore: Add microbenchmark (compared to ExprOrExpr) (#20076)
by Yu-Chuan Hung
· 7 days ago
2860ada
fix: The limit_pushdown physical optimization rule removes limits in some cases leading to incorrect results (#20048)
by Mason
· 7 days ago
cad3865
fix: correct weight handling in approx_percentile_cont_with_weight (#19941)
by Sergio Esteves
· 7 days ago
f0de02f
Use BooleanBufferBuilder rather than Vec<bool> in ArrowBytesViewMap (#20064)
by Eesh Sagar Singh
· 7 days ago
7c3ea05
feat: add AggregateMode::PartialReduce for tree-reduce aggregation (#20019)
by Nathaniel J. Smith
· 7 days ago
f997169
Improve documentation for ScalarUDFImpl::preimage (#20008)
by Andrew Lamb
· 8 days ago
ed0a060
replace private is_volatile_expression_tree with equivalent public is_volatile (#20056)
by Adrian Garcia Badaracco
· 8 days ago
41bfb79
feat: support Spark-compatible abs math function part 2 - ANSI mode (#18828)
by hsiang-c
· 9 days ago
8152b44
feat: optimise copying in `left` for Utf8 and LargeUtf8 (#19980)
by theirix
· 9 days ago
92f60ad
docs: Automatically update DataFusion version in docs (#20001)
by Nuno Faria
· 9 days ago
36c0cda
fix: respect DataFrameWriteOptions::with_single_file_output for paths without extensions (#19931)
by Kumar Ujjawal
· 9 days ago
20d5f06
perf: Push down join key filters for LEFT/RIGHT/ANTI joins (#19918)
by Nuno Faria
· 9 days ago
a77e5a5
Further refactoring of type coercion function code (#19603)
by Jeffrey Vo
· 9 days ago
568f19f
Simplify Spark `sha2` implementation (#19475)
by Jeffrey Vo
· 9 days ago
4b31aaa
Fix constant value from stats (#20042)
by Gabriel
· 9 days ago
5f9bacd
Enforce `clippy::allow_attributes` globally across workspace (#19576)
by Jeffrey Vo
· 9 days ago
1636898
minor: add tests for coercible signature considering nulls/dicts/ree (#19459)
by Jeffrey Vo
· 10 days ago
47bf178
fix: return correct length array for scalar null input to `calculate_binary_math` (#19861)
by Jeffrey Vo
· 10 days ago
81e9723
Fix/parquet opener page index policy (#19890)
by Aviral Garg
· 10 days ago
0a68b81
[main] Update version to `52.1.0` (#19878) (#20028)
by Andrew Lamb
· 10 days ago
bef1368
Simplify wait_complete function (#19937)
by Lía Adriana
· 10 days ago
f819061
Tweak `adapter serialization` example (#20035)
by Adrian Garcia Badaracco
· 10 days ago
41d48b3
feat: implement protobuf converter trait to allow control over serialization and deserialization processes (#19437)
by Tim Saucer
· 10 days ago
aae3e0f
refactor: make PhysicalExprAdatperFactory::create fallible (#20017)
by niebayes
· 10 days ago
adb8c8a
minor: Move metric `page_index_rows_pruned` to verbose level in `EXPLAIN ANALYZE` (#20026)
by Yongting You
· 10 days ago
1a48d58
add more projection pushdown slt tests (#20015)
by Adrian Garcia Badaracco
· 10 days ago
aca7169
feat(spark): add unix date and timestamp functions (#19892)
by cht42
· 10 days ago
af77197
chore(deps): bump taiki-e/install-action from 2.67.9 to 2.67.13 (#20020)
by dependabot[bot]
· 10 days ago
82f27e9
refactor: extract pushdown test utilities to shared module (#20010)
by Adrian Garcia Badaracco
· 10 days ago
52deee5
chore(deps): bump setuptools from 80.10.1 to 80.10.2 in /docs (#20022)
by dependabot[bot]
· 10 days ago
f5709e7
feat(spark): implement `StringView` for `SparkConcat` (#19984)
by Aryan Anand
· 10 days ago
c560bee
perf: Optimize repeat function for scalar and array fast (#19976)
by Kumar Ujjawal
· 10 days ago
58fb6e1
feat(spark): implement `from/to_utc_timestamp` functions (#19880)
by cht42
· 10 days ago
8653851
feat(spark): Add `SessionStateBuilderSpark` to datafusion-spark (#19865)
by cht42
· 10 days ago
073174b
feat: Show the number of matched Parquet pages in `DataSourceExec` (#19977)
by Nuno Faria
· 10 days ago
adddd4c
fix: Make `generate_series` return an empty set with invalid ranges (#19999)
by Nuno Faria
· 11 days ago
5c47d6d
Fix ClickBench EventDate handling by casting UInt16 days-since-epoch to DATE via `hits` view (#19881)
by kosiew
· 11 days ago
cda6425
perf: Optimize ArrowBytesViewMap with direct view access (#19975)
by Tushar Das
· 11 days ago
6524d91
minor: pull `uuid` into workspace dependencies (#19997)
by Jeffrey Vo
· 11 days ago
27abe5a
perf: improve performance of `array_remove`, `array_remove_n` and `array_remove_all` functions (#19996)
by lyne
· 11 days ago
8efc2b6
feat(spark): add `base64` and `unbase64` functions (#19968)
by cht42
· 11 days ago
50a3e13
Speedup statistics_from_parquet_metadata (#20004)
by Daniël Heres
· 11 days ago
f8a22a5
docs: Fix some broken / missing links in the DataFusion documentation (#19958)
by Andrew Lamb
· 11 days ago
b6f7521
Do not require mut in memory reservation methods (#19759)
by Gabriel
· 11 days ago
4d63f8c
minor: Add favicon (#20000)
by Nuno Faria
· 12 days ago
8023947
fix: maintain inner list nullability for `array_sort` (#19948)
by Jeffrey Vo
· 12 days ago
d1eea07
Fix broken links in the documentation (#19964)
by Andrew Lamb
· 12 days ago
e062ae5
minor: Fix doc about `write_batch_size` (#19979)
by Nuno Faria
· 12 days ago
03c2751
chore(deps): bump uuid from 1.19.0 to 1.20.0 (#19993)
by dependabot[bot]
· 12 days ago
d590824
chore(deps): bump sysinfo from 0.37.2 to 0.38.0 (#19990)
by dependabot[bot]
· 12 days ago
355cae8
chore(deps): bump nix from 0.30.1 to 0.31.1 (#19991)
by dependabot[bot]
· 12 days ago
853a617
chore(deps): bump quote from 1.0.43 to 1.0.44 (#19992)
by dependabot[bot]
· 12 days ago
6647583
chore(deps): bump taiki-e/install-action from 2.66.7 to 2.67.9 (#19987)
by dependabot[bot]
· 12 days ago
0b60c58
chore(deps): bump setuptools from 80.9.0 to 80.10.1 in /docs (#19988)
by dependabot[bot]
· 12 days ago
4127688
Change GitHub actions dependabot schedule to weekly (#19981)
by Jeffrey Vo
· 12 days ago
b9b756e
more checks
by blaginin
· 13 days ago
ce24ade
use dorny
by blaginin
· 13 days ago
db/dorny
b3c3f6b
Merge remote-tracking branch 'sandbox/main' into sandbox-main
by blaginin
· 13 days ago
62f8237
Merge branch 'upstream-main' into sandbox-main
by blaginin
· 13 days ago
e5e7636
export `SessionState::register_catalog_list(...)` (#19925)
by Albert Skalt
· 13 days ago
c135236
Improve error messages with nicer formatting of Date and Time types (#19954)
by Emil Ernerfeldt
· 13 days ago
05802e2
perf: Optimize factorial scalar path (#19949)
by Kumar Ujjawal
· 13 days ago
1897924
fix: change token consumption to pick to test on EOF in parser (#19927)
by Albert Skalt
· 13 days ago
c5e99e5
perf: Optimize scalar path for ascii function (#19951)
by Kumar Ujjawal
· 13 days ago
17cbff0
Misc hash / hash aggregation performance improvements (#19910)
by Daniël Heres
· 13 days ago
23f5003
Add struct pushdown query benchmark and projection pushdown tests (#19962)
by Adrian Garcia Badaracco
· 14 days ago
35e99b9
preserve FilterExec batch size during ser/de (#19960)
by Albert Skalt
· 14 days ago
b463a9f
chore(deps): bump actions/checkout from 6.0.1 to 6.0.2 (#19953)
by dependabot[bot]
· 14 days ago
45c8ae4
Support LargeUtf8 as partition column (#19942)
by Dewey Dunnington
· 14 days ago
0c82ade
Fix struct casts to align fields by name (prevent positional mis-casts) (#19674)
by kosiew
· 2 weeks ago
c2f3d65
Support API for "pre-image" for pruning predicate evaluation (#19722)
by Kosta Tarasov
· 2 weeks ago
736fa7c
feat(datafusion-spark): implement spark compatible `unhex` function (#19909)
by lyne
· 2 weeks ago
7cf63f1
Refactor ListArray hashing to consider only sliced values (#19500)
by Jeffrey Vo
· 2 weeks ago
cf76352
Improve error message when string functions receive Binary types (#19819)
by Miao
· 2 weeks ago
7a09e27
fix: union should retrun error instead of panic when input schema's len different (#19922)
by Huaijin
· 2 weeks ago
f0e38df
chore(deps): bump chrono from 0.4.42 to 0.4.43 (#19897)
by dependabot[bot]
· 2 weeks ago
efccfb1
Fix trailing whitespace in CROSS JOIN logical plan formatting (#19936)
by Michael Kleen
· 2 weeks ago
9f27e93
Coerce expressions to udtf (#19915)
by Xiangpeng Hao
· 2 weeks ago
bfe7d18
perf: Optimize scalar fast path for iszero (#19919)
by Kumar Ujjawal
· 2 weeks ago
6b7ddd8
chore(deps): bump actions/setup-python from 6.1.0 to 6.2.0 (#19935)
by dependabot[bot]
· 2 weeks ago
3b44bbf
feat: support pushdown alias on dynamic filter with `ProjectionExec` (#19404)
by discord9
· 2 weeks ago
b2c29ac
Include license and notice files in more crates (#19913)
by Andrew Kane
· 2 weeks ago
db64be9
chore(deps): bump thiserror from 2.0.17 to 2.0.18 (#19900)
by dependabot[bot]
· 2 weeks ago
10a1d4e
Remove UDAF manual Debug impls and simplify signatures (#19727)
by Jeffrey Vo
· 2 weeks ago
e82dc21
Feat : added truncate table support (#19633)
by Rosai
· 2 weeks ago
4d8d48c
perf: Optimize scalar performance for cot (#19888)
by Kumar Ujjawal
· 2 weeks ago
406e5aa
test(wasmtest): enable compression feature for wasm build (#19860)
by ChanTsune
· 2 weeks ago
774a7b5
chore(deps): bump the proto group across 1 directory with 5 updates (#19745)
by dependabot[bot]
· 2 weeks ago
9626776
fix: null in array_agg with DISTINCT and IGNORE (#19736)
by David López
· 2 weeks ago