Installation

DataFusion is easy to install, just like any other Python library.

Using uv

If you do not yet have a virtual environment, create one:

uv venv

You can add datafusion to your virtual environment with the usual:

uv pip install datafusion

Or, to add to a project:

uv add datafusion

Using pip

pip install datafusion

uv & JupyterLab setup

This section explains how to install DataFusion in a uv environment with other libraries that allow for a nice Jupyter workflow. This setup is completely optional. These steps are only needed if you'd like to run DataFusion in a Jupyter notebook and have an interface like this:

DataFusion in Jupyter

Create a virtual environment with DataFusion, Jupyter, and other useful dependencies and start the desktop application.

uv venv
uv pip install datafusion jupyterlab jupyterlab_code_formatter
uv run jupyter lab

Examples

See the DataFusion Python Examples for a variety of Python scripts that show DataFusion in action!