tree: 74940f60527eda35dd64dc75f3b52f9d3edbdfaf
  1. Dockerfile
  2. provision.py
  3. README.md
  4. spark-defaults.conf
dev/spark/README.md

Spark Provisioning for Integration Tests

This directory contains the Spark + Paimon setup that provisions test tables into /tmp/paimon-warehouse.

Prerequisites

  • Docker (via colima or Docker Desktop)
  • docker compose

Provision Test Data

# Build and run (from repo root):
make docker-up

# Or manually:
docker compose -f dev/docker-compose.yaml build --build-arg PAIMON_VERSION=1.3.1
docker compose -f dev/docker-compose.yaml run --rm spark-paimon

provision.py automatically clears the warehouse directory before creating tables, so re-running is always safe.

colima Users

With colima, Docker volumes mount inside the colima VM, not on the macOS host filesystem. After provisioning, copy the data to the host:

colima ssh -- sudo tar cf - -C /tmp paimon-warehouse | tar xf - -C /tmp

Run Integration Tests

cargo test -p paimon-integration-tests

Files

  • Dockerfile — Spark 3.5 + Paimon connector image
  • spark-defaults.conf — Spark config with Paimon catalog
  • provision.py — Creates all test tables and inserts data