Sign in
apache
/
datafusion-python
/
HEAD
1df6db2
fix: Inconsistent schemas when converting to pyarrow (#1315)
by Nuno Faria
· 4 days ago
main
3a4ae6d
Do not convert pyarrow scalar values to plain python types when passing as `lit` (#1319)
by Tim Saucer
· 4 days ago
474e9e6
fix: use coalesce instead of drop_duplicate_keys for join (#1318)
by Daniel Mesejo
· 4 days ago
fcd7056
Update build workflow link (#1330)
by Tim Saucer
· 10 days ago
6864d80
Minor build errors (#1325)
by Tim Saucer
· 2 weeks ago
db3c6a0
Upgrade to Datafusion 51 (#1311)
by Nuno Faria
· 2 weeks ago
c141dd3
Feat/parameterized sql queries (#964)
by Tim Saucer
· 5 weeks ago
276dc6a
chore: apply cargo fmt with import organization (#1303)
by Tim Saucer
· 7 weeks ago
f1b3029
Add function collect_column to dataframe (#1302)
by Tim Saucer
· 8 weeks ago
89d8930
feat: reduce duplicate fields on join (#1184)
by Tim Saucer
· 9 weeks ago
c4e7486
Add PyCapsule Type Support and Type Hint Enhancements for AggregateUDF in DataFusion Python Bindings (#1277)
by kosiew
· 9 weeks ago
e97ed57
Add Arrow C streaming, DataFrame iteration, and OOM-safe streaming execution (#1222)
by kosiew
· 9 weeks ago
d7e137e
Enable remaining pylints (#1298)
by Tim Saucer
· 10 weeks ago
51dc78a
chore: update datafusion minor version (#1297)
by Tim Saucer
· 2 months ago
a605b61
Update python minimum version to 3.10 (#1296)
by Tim Saucer
· 2 months ago
694a5d8
feat: Add SQL expression support for `with_columns` (#1286)
by Marko Milenković
· 2 months ago
d9c90d2
feat: Add SQL expression for `repartition_by_hash` (#1285)
by Marko Milenković
· 3 months ago
a93d614
with_column supports SQL expression (#1284)
by Marko Milenković
· 3 months ago
29bcb0f
Release 50.1 (#1281)
by Tim Saucer
· 3 months ago
6012349
feat: support session token parameter for AmaoznS3 (#1275)
by GCHQDeveloper028
· 3 months ago
030873b
Add temporary view option for into_view (#1267)
by Tim Saucer
· 3 months ago
fe0cf8c
feat: allow DataFrame.filter to accept SQL strings\ (#1276)
by 𝕂
· 3 months ago
6b16285
feat: expose `select_exprs` method on DataFrame (#1271)
by Marko Milenković
· 3 months ago
08901d5
feat: expose` DataFrame.parse_sql_expr` (#1274)
by Marko Milenković
· 3 months ago
16d4c03
chore: update dependencies (#1269)
by Tim Saucer
· 3 months ago
f5fdf59
feat: expose DataFrame.write_table (#1264)
by Tim Saucer
· 3 months ago
6f3b1ca
Unify Table representations (#1256)
by Tim Saucer
· 3 months ago
af1c07b
Freeze PyO3 wrappers & introduce interior mutability to avoid PyO3 borrow errors (#1253)
by kosiew
· 3 months ago
e75addf
macos-13 is deprecated (#1259)
by Kevin Liu
· 3 months ago
709c918
REVERT CHANGES to commit
by Siew Kam Onn
· 3 months ago
0154c03
CHANGES to review
by Siew Kam Onn
· 3 months ago
f08d5b0
Make Session Context `pyclass` frozen so interior mutability is only managed by rust (#1248)
by Nick
· 4 months ago
0ec0102
Fix drop() method to handle quoted column names consistently (#1242)
by H0TB0X420
· 4 months ago
5f8d500
Update version number, add changelog (#1249)
by Tim Saucer
· 4 months ago
bf22c1d
Prepare for DF50 (#1231)
by Tim Saucer
· 4 months ago
b7d3519
docs: add apache iceberg as datafusion data source (#1240)
by Kevin Liu
· 4 months ago
d54dc4a
Support string column identifiers for sort/aggregate/window and stricter Expr validation (#1221)
by kosiew
· 4 months ago
9e97636
chore: remove redundant error transformation (#1232)
by Daniel Mesejo
· 4 months ago
c609dfa
feat: allow passing a slice to and expression with the [] indexing (#1215)
by Tim Saucer
· 4 months ago
b325a38
fix: update link to user example for custom table provider (#1224)
by Tyler White
· 4 months ago
332c69b
docs: fix CaseBuilder documentation example (#1225)
by Tyler White
· 4 months ago
f0bbad7
Set fail on warning for documentation generation (#1218)
by Tim Saucer
· 4 months ago
200a249
Fixing a few Typos (#1220)
by Nick
· 4 months ago
61f981b
Add benchmark script and documentation for maximizing CPU usage in DataFusion Python (#1216)
by kosiew
· 4 months ago
b76cd8f
Update development guide in README.md (#1213)
by Koustubh Rao
· 4 months ago
c41c7f0
49.0.0 release (#1211)
by Tim Saucer
· 4 months ago
d6d6ea6
build(deps): bump actions/checkout from 4 to 5 (#1204)
by dependabot[bot]
· 5 months ago
cb29176
build(deps): bump url from 2.5.4 to 2.5.7 (#1210)
by dependabot[bot]
· 5 months ago
b553643
build(deps): bump actions/download-artifact from 4 to 5 (#1201)
by dependabot[bot]
· 5 months ago
0806ce5
fix(build): Include build.rs in published crates (#1199)
by Colin Marc
· 5 months ago
1ba8807
Improve `show()` output for empty DataFrames (#1208)
by kosiew
· 5 months ago
e28256c
suggestions (#1183)
by Dewey Dunnington
· 5 months ago
5892585
Allow users to pass a single expression instead of a list of expressions for partition_by and order_by (#1187)
by Tim Saucer
· 5 months ago
f947941
build(deps): bump slab from 0.4.10 to 0.4.11 (#1205)
by dependabot[bot]
· 5 months ago
1a6260f
build(deps): bump async-trait from 0.1.88 to 0.1.89 (#1203)
by dependabot[bot]
· 5 months ago
4472c6c
build(deps): bump uuid from 1.17.0 to 1.18.0 (#1202)
by dependabot[bot]
· 5 months ago
f0af095
build(deps): bump object_store from 0.12.2 to 0.12.3 (#1189)
by dependabot[bot]
· 5 months ago
c8ddb87
Update how to dev instructions (#1179)
by Nick
· 5 months ago
fd84556
chore: Upgrade datafusion to version 49 (#1200)
by Nuno Faria
· 5 months ago
be88837
Fix Python UDAF Accumulator Interface example to Properly Handle State and Updates with List[Array] Types (#1192)
by kosiew
· 6 months ago
94687cd
Change merge rule to require branches up to date but do not require a committer approval. This matches the arrow-rs repo. (#1188)
by Tim Saucer
· 6 months ago
cbe845b
Prepare for 48 release (#1175)
by Tim Saucer
· 6 months ago
2e1b713
refactor: style loading logic in DataFrameHtmlFormatter (#1177)
by kosiew
· 6 months ago
98f4773
feat: add FFI support for user defined functions (#1145)
by Tim Saucer
· 6 months ago
9362f53
feat: python based catalog and schema provider (#1156)
by Tim Saucer
· 6 months ago
9545634
feat: collect once during display() in jupyter notebooks (#1167)
by Tim Saucer
· 7 months ago
03921d9
feat: dataframe string formatter (#1170)
by Tim Saucer
· 7 months ago
ef62fa8
Add compression_level support to ParquetWriterOptions and enhance write_parquet to accept full options object (#1169)
by kosiew
· 7 months ago
0d3c37f
Consolidate DataFrame Docs: Merge HTML Rendering Section as Subpage (#1161)
by kosiew
· 7 months ago
1812a0d
Fix signature of `__arrow_c_stream__` (#1168)
by Kyle Barron
· 7 months ago
98dc06b
feat: Support Parquet writer options (#1123)
by Nuno Faria
· 7 months ago
9b6acec
Support types other than String and Int for partition columns (#1154)
by Michele Gregori
· 7 months ago
1391078
feat: upgrade df48 dependency (#1143)
by Tim Saucer
· 7 months ago
dc0d35a
Add Interruptible Query Execution in Jupyter via KeyboardInterrupt Support (#1141)
by kosiew
· 7 months ago
d6ef9bc
Add DataFrame API Documentation for DataFusion Python (#1132)
by kosiew
· 7 months ago
24f0b1a
Add a documentation build step in CI (#1139)
by Crystal Zhou
· 7 months ago
0cc9b0a
Release DataFusion 47.0.0 (#1130)
by Tim Saucer
· 8 months ago
278a33e
feat: add user defined table function support (#1113)
by Tim Saucer
· 8 months ago
1e7494b
add unit tests for expression functions (#1121)
by Tim Saucer
· 8 months ago
e8aa671
Move the udf module to user_defined (#1112)
by Tim Saucer
· 8 months ago
5a62c4b
With the update to rustc 1.87 we now fail clippy due to the size of the error type. Work around this by boxing the datafusion error. (#1126)
by Tim Saucer
· 8 months ago
f3c98ec
Add fill_null method to DataFrame API for handling missing values (#1019)
by kosiew
· 8 months ago
7d8bcd8
Partial fix for 1078: Enhance DataFrame Formatter Configuration with Memory and Display Controls (#1119)
by kosiew
· 8 months ago
15b96c4
feat: add missing PyLogicalPlan to_variant (#1085)
by Chen Chongchen
· 8 months ago
6fbecef
Copy over protected branch rule from datafusion repo (#1122)
by Tim Saucer
· 8 months ago
10600fb
fix: recursive import (#1117)
by Chen Chongchen
· 9 months ago
5a7f638
Add expression chaining of single parameter scalar functions
by deanm0000
· 9 months ago
00dea11
Improve col class access using __getattr__
by deanm0000
· 9 months ago
91b6635
Add DataFrame usage guide with HTML rendering customization options (#1108)
by kosiew
· 9 months ago
c9f1554
feat: alias with metadata (#1111)
by Chen Chongchen
· 9 months ago
d0d14f6
feat: update datafusion dependency 47 (#1107)
by Tim Saucer
· 9 months ago
818975b
Add Configurable HTML Table Formatter for DataFusion DataFrames in Python (#1100)
by kosiew
· 9 months ago
09b929a
Documentation updates: mention correct dataset on basics page (#1081)
by Florian Schäfer
· 10 months ago
ffafb59
feat: support unparser (#1088)
by Chen Chongchen
· 10 months ago
583e1e9
Update changelog and version number (#1089)
by Tim Saucer
· 10 months ago
d0315ff
feat: Update DataFusion dependency to 46 (#1079)
by Tim Saucer
· 10 months ago
42982da
Improve collection during repr and repr_html (#1036)
by Tim Saucer
· 10 months ago
b8dd97b
Add additional ruff suggestions (#1062)
by Spaarsh
· 10 months ago
7c1c08f
feat: expose regex_count function (#1066)
by Nirnay Roy
· 10 months ago
2f52688
Add decorator for udwf (#1061)
by kosiew
· 10 months ago
Next »