SpatialBench inherits its speed and efficiency from the tpchgen-rs project, which is one of the fastest open-source data generators available.
Key performance benefits:
SpatialBench is a Rust-based fork of the tpchgen-rs project. It preserves the original’s high-performance, multi-threaded, streaming architecture, while extending it with a spatial star schema and geometry generation logic.
You can build the SpatialBench data generator using Cargo:
cargo build --release
Alternatively, install it directly using:
cargo install --path ./spatialbench-cli
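After installing, one quick sanity check is to confirm that the binary resolves on your PATH. The sketch below assumes the binary is named spatialbench-cli, as in the commands in this README, and relies on `cargo install` placing binaries in Cargo's bin directory (`~/.cargo/bin` by default). The helper name `check_installed` is hypothetical and not part of SpatialBench.

```shell
# check_installed <binary>: report whether a binary is reachable on PATH.
# Hypothetical helper for illustration; not part of SpatialBench.
check_installed() {
  if command -v "$1" >/dev/null 2>&1; then
    echo "$1 found at $(command -v "$1")"
  else
    echo "$1 not found; is ~/.cargo/bin on your PATH?" >&2
    return 1
  fi
}
```

Calling `check_installed spatialbench-cli` prints the resolved path, or returns a non-zero status if the binary cannot be found.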
For contribution or debugging, refer to the ARCHITECTURE.md guide.
spatialbench-cli -s 1 --format=parquet
spatialbench-cli -s 1 --format=parquet --tables trip,building --output-dir sf1-parquet
for PART in $(seq 1 4); do
  mkdir part-$PART
  spatialbench-cli -s 10 --tables trip,building --output-dir part-$PART --parts 4 --part $PART
done
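The partitioned run can also be wrapped in a small reusable function, sketched here under a few assumptions. It uses only the flags shown in this README (-s, --tables, --output-dir, --parts, --part). The function name `generate_parts` and the `part-N` directory layout are illustrative choices, not something SpatialBench provides. Unlike a plain loop, it stops at the first invocation that fails.

```shell
# generate_parts <scale-factor> <num-parts> <tables>
# Hypothetical wrapper (not part of SpatialBench): writes part N of
# <num-parts> into ./part-N and stops at the first failing invocation.
generate_parts() {
  sf="$1"; parts="$2"; tables="$3"
  for part in $(seq 1 "$parts"); do
    # -p keeps reruns from failing when the directory already exists
    mkdir -p "part-$part" || return 1
    spatialbench-cli -s "$sf" --tables "$tables" \
      --output-dir "part-$part" --parts "$parts" --part "$part" || return 1
  done
}
```

For example, `generate_parts 10 4 trip,building` would issue the same four invocations as a hand-written loop over parts 1 to 4.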
You can override these defaults at runtime by passing a YAML file via the --config flag:
spatialbench-cli -s 1 --format=parquet --tables trip,building --config spatialbench-config.yml
If --config is not provided, SpatialBench checks for ./spatialbench-config.yml. If absent, it falls back to built-in defaults.
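The lookup order described above can be illustrated with a small shell function. This is only a sketch of the documented precedence, not SpatialBench's actual implementation. The name `resolve_config` is hypothetical, and the function simply prints which source would apply.

```shell
# resolve_config [explicit-path]
# Hypothetical illustration of the documented precedence:
#   1. a path passed via --config
#   2. ./spatialbench-config.yml in the current directory
#   3. built-in defaults
resolve_config() {
  if [ -n "${1:-}" ]; then
    echo "$1"
  elif [ -f ./spatialbench-config.yml ]; then
    echo "./spatialbench-config.yml"
  else
    echo "built-in defaults"
  fi
}
```

Running `resolve_config` in an empty directory prints `built-in defaults`. Once a `spatialbench-config.yml` exists there, the same call prints its path, and an explicit argument always takes priority.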
For reference, see the provided spatialbench-config.yml.
See CONFIGURATION.md for details on spatial data generation, the full YAML schema, and examples.