[Fix-18239] [Docs] Fix outdated Spark documentation links (#18241)
Co-authored-by: xiangzihao <460888207@qq.com>
2 files changed
tree: 653ab8c3e39b8e4c8a28e5a7aca97bc64582b7f4
- .github/
- .idea/
- .mvn/
- config/
- deploy/
- docs/
- dolphinscheduler-alert/
- dolphinscheduler-api/
- dolphinscheduler-api-test/
- dolphinscheduler-authentication/
- dolphinscheduler-bom/
- dolphinscheduler-common/
- dolphinscheduler-dao/
- dolphinscheduler-dao-plugin/
- dolphinscheduler-datasource-plugin/
- dolphinscheduler-dist/
- dolphinscheduler-e2e/
- dolphinscheduler-eventbus/
- dolphinscheduler-extract/
- dolphinscheduler-master/
- dolphinscheduler-meter/
- dolphinscheduler-microbench/
- dolphinscheduler-registry/
- dolphinscheduler-scheduler-plugin/
- dolphinscheduler-service/
- dolphinscheduler-spi/
- dolphinscheduler-standalone-server/
- dolphinscheduler-storage-plugin/
- dolphinscheduler-task-executor/
- dolphinscheduler-task-plugin/
- dolphinscheduler-tools/
- dolphinscheduler-ui/
- dolphinscheduler-worker/
- dolphinscheduler-yarn-aop/
- images/
- licenses/
- script/
- style/
- tools/
- .asf.yaml
- .gitignore
- .gitmodules
- .licenserc.yaml
- .pre-commit-config.yaml
- CLAUDE.md
- CONTRIBUTING.md
- doap.rdf
- LICENSE
- lombok.config
- mvnw
- mvnw.cmd
- NOTICE
- pom.xml
- README.md
- README_zh_CN.md
README.md
Apache Dolphinscheduler

About
Apache DolphinScheduler is a modern data orchestration platform that empowers agile, low-code development of high-performance workflows. It is dedicated to handling complex task dependencies in data pipelines and provides a wide range of built-in job types out of the box.
Key features for DolphinScheduler are as follows:
- Easy to deploy, providing four deployment modes including Standalone, Cluster, Docker, and Kubernetes.
- Easy to use, workflows can be created and managed via Web UI, Python SDK or Open API
- Highly reliable and high availability, with a decentralized, multi-master and multi-worker architecture and native support for horizontal scaling.
- High performance, its performance is several times faster than other orchestration platforms, and it is capable of handling tens of millions of tasks per day
- Cloud Native, DolphinScheduler supports orchestrating workflows across multiple clouds and data centers, and allows custom task types
- Workflow Versioning, provides version control for both workflows and individual workflow instances, including tasks.
- Flexible state control of workflows and tasks, supports pausing, stopping, and recovering them at any time.
- Multi-tenancy support
- Additional features, backfill support(Web UI native), permission control including project and data source etc.
QuickStart
- For quick experience
- For Kubernetes
- For Terraform
User Interface Screenshots
Homepage: Project and workflow overview, including the latest workflow instance and task instance status statistics. 
Workflow Definition: Create and manage workflows by drag and drop, easy to build and maintain complex workflows, support a wide range of tasks out of box. 
Workflow Tree View: Abstract tree structure could provide a clearer understanding of task relationships 
Data source: Supports multiple external data sources, provides unified data access capabilities for MySQL, PostgreSQL, Hive, Trino, etc. 
Monitor: View the status of the master, worker and database in real time, including server resource usage and load, do a quick health check without logging in to the server. 
Suggestions & Bug Reports
Follow this guide to report your suggestions or bugs.
Contributing
The community welcomes contributions from everyone. Please refer to this page to find out more details: How to contribute. Check out good first issues here if you are new to DolphinScheduler.
Community
Welcome to join the Apache DolphinScheduler community by:
Landscapes