DataFusion is easy to install, just like any other Python library.
pip install datafusion
This section explains how to install DataFusion in a conda environment with other libraries that allow for a nice Jupyter workflow. This setup is completely optional. These steps are only needed if you'd like to run DataFusion in a Jupyter notebook and have an interface like this:

Create a conda environment with DataFusion, Jupyter, and other useful dependencies in the datafusion-env.yml file:
name: datafusion-env
channels:
- conda-forge
- defaults
dependencies:
- python=3.9
- ipykernel
- nb_conda
- jupyterlab
- jupyterlab_code_formatter
- isort
- black
- pip
- pip:
- datafusion
Create the environment with conda env create -f datafusion-env.yml.
Activate the environment with conda activate datafusion-env.
Run jupyter lab or open the JupyterLab Desktop application to start running DataFusion in a Jupyter notebook.
See the DataFusion Python Examples for a variety of Python scripts that show DataFusion in action!