commit	301e1331fe395d0b40d452c0c51bbdca040f2e49
author	zhangli20 <zhangli20@kuaishou.com>	Tue May 17 18:09:34 2022 +0800
committer	zhangli20 <zhangli20@kuaishou.com>	Tue May 17 19:14:58 2022 +0800
tree	ded20c00c7221e435d139890108c4f316d34dc57
parent	dcc429cc8b8c0da000833d247b734bace6cf9fc0

README.md

Blaze

The Blaze project aims to provide Spark SQL with a high-performance, low-cost native execution layer.

We seek to solve a series of performance bottlenecks in the current JVM-based Task execution of Spark SQL, such as high fluctuations in performance due to GC, high memory overhead, and inability to accelerate computation directly with SIMD instructions.

This repo is under active development and is not ready for production (or even development) use, but stay tuned for updates! ☺️

Overview

How fast we are, compared to Vanilla Spark

How to run it

1. Build and Run

Blaze can be built with Gradle, choosing either the debug or the release mode:

./gradlew -Pmode=[debug|release] build
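The `[debug|release]` notation above means that exactly one of the two values is passed. As a minimal sketch, the helper below prints the concrete build command for a given mode and rejects anything else. The function name `blaze_build_cmd` and the default of `release` are assumptions made for illustration; they are not part of the Blaze repository.

```shell
#!/bin/sh
# Hypothetical helper (not part of Blaze): print the Gradle command
# for the requested build mode. Defaulting to "release" is an assumption.
blaze_build_cmd() {
  mode="${1:-release}"
  case "$mode" in
    debug|release)
      # Only the two modes named in the README are accepted.
      printf './gradlew -Pmode=%s build\n' "$mode"
      ;;
    *)
      printf 'unknown mode: %s (expected debug or release)\n' "$mode" >&2
      return 1
      ;;
  esac
}
```

To run the build rather than print it, the output could be passed to the shell from the repository root, for example `eval "$(blaze_build_cmd debug)"`.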

Once Blaze has been built successfully, submit it with the bin/spark-submit or bin/spark-sql script:

./bin/spark-submit \
  --jars target/blaze-engine-${VERSION}.jar \
  ....

./bin/spark-sql \
  --jars target/blaze-engine-${VERSION}.jar \
  ....

2. Run using Docker

TBD

For developers

Architectural Overview

Are we TPC-DS yet?

[ ] Q95