Google Git
Sign in
apache / datafusion-python / refs/heads/branch-0.8 / . / examples
tree: e7242127a4ee1f05a459f4e9fb23ea9854947b08 [path history] [tgz]
  1. chart.png
  2. dataframe-parquet.py
  3. python-udaf.py
  4. python-udf.py
  5. query-pyarrow-data.py
  6. README.md
  7. sql-on-cudf.py
  8. sql-on-pandas.py
  9. sql-on-polars.py
  10. sql-parquet.py
  11. sql-to-pandas.py
  12. substrait.py
examples/README.md

DataFusion Python Examples

Some examples rely on data which can be downloaded from the following site:

  • https://www.nyc.gov/site/tlc/about/tlc-trip-record-data.page

Here is a direct link to the file used in the examples:

  • https://d37ci6vzurychx.cloudfront.net/trip-data/yellow_tripdata_2021-01.parquet

Executing Queries with DataFusion

  • Query a Parquet file using SQL
  • Query a Parquet file using the DataFrame API
  • Run a SQL query and store the results in a Pandas DataFrame
  • Query PyArrow Data

Running User-Defined Python Code

  • Register a Python UDF with DataFusion
  • Register a Python UDAF with DataFusion

Substrait Support

  • Serialize query plans using Substrait

Executing SQL against DataFrame Libraries (Experimental)

  • Executing SQL on Polars
  • Executing SQL on Pandas
Powered by Gitiles| Privacy| Termstxt json