| commit | 3e726dbd95d96db5c2cdd4ce16cd214500116766 | [log] [tgz] |
|---|---|---|
| author | Lei Wang <wanglei860213@gmail.com> | Tue Aug 26 14:44:16 2025 +0800 |
| committer | Lei Wang <wanglei860213@gmail.com> | Tue Aug 26 14:44:16 2025 +0800 |
| tree | f8bf960c29a4fefe376a47bbceae02246d41a642 | |
| parent | 300592d1b0bfdabb14012f33231ab929a6d11bfb [diff] |
update READEME
The Auron accelerator for big data engine (e.g., Spark, Flink) leverages native vectorized execution to accelerate query processing. It combines the power of the Apache DataFusion library and the scale of the distributed computing framework.
Auron takes a fully optimized physical plan from distrbuted computing framework, mapping it into DataFusion's execution plan, and performs native plan computation.
The key Key capabilities of Auron include:
Based on the inherent well-defined extensibility of DataFusion, Auron can be easily extended to support:
We encourage you to extend DataFusion capability directly and add the supports in Auron with simple modifications in plan-serde and extension translation.
To build Auron, please follow the steps below:
The native execution lib is written in Rust. So you're required to install Rust (nightly) first for compilation. We recommend you to use rustup.
Auron has been well tested on jdk8/11/17.
Check out the source code.
Build the project.
use ./auron-build.sh for building the project. execute ./auron-build.sh --help for help.
After the build is finished, a fat Jar package that contains all the dependencies will be generated in the target directory.
You can use the following command to build a centos-7 compatible release:
SHIM=spark-3.3 MODE=release JAVA_VERSION=8 SCALA_VERSION=2.12 ./release-docker.sh
This section describes how to submit and configure a Spark Job with Auron support.
move auron jar package to spark client classpath (normally spark-xx.xx.xx/jars/).
add the follow confs to spark configuration in spark-xx.xx.xx/conf/spark-default.conf:
spark.auron.enable true spark.sql.extensions org.apache.spark.sql.auron.AuronSparkSessionExtension spark.shuffle.manager org.apache.spark.sql.execution.auron.shuffle.AuronShuffleManager spark.memory.offHeap.enabled false # suggested executor memory configuration spark.executor.memory 4g spark.executor.memoryOverhead 4096
spark-sql -f tpcds/q01.sql
TPC-DS 1TB Benchmark (for details, see https://auron-project.github.io/documents/benchmarks.html):
We also encourage you to benchmark Auron and share the results with us. 🤗
Mail List is the most recognized form of communication in the Apache community. Contact us through the following mailing list.
| Name | Scope | ||
|---|---|---|---|
| dev@auron.apache.org | Development-related discussions | Subscribe | Unsubscribe |
If you meet any questions, connect us and fix it by submitting a 🔗Pull Request.
Auron is licensed under the Apache 2.0 License. A copy of the license can be found here.