Apache DolphinScheduler

About

Apache DolphinScheduler is a modern data orchestration platform that empowers agile, low-code development of high-performance workflows. It is dedicated to handling complex task dependencies in data pipelines and provides a wide range of built-in job types out of the box.
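
To make "complex task dependencies" concrete, here is a minimal sketch of how a scheduler can turn a dependency graph into an execution order. It uses only Python's standard library and is not DolphinScheduler's API or implementation; the task names and the `execution_batches` helper are hypothetical and exist only for illustration.

```python
from graphlib import TopologicalSorter

# Hypothetical pipeline: each task maps to the set of tasks it depends on.
dependencies = {
    "extract": set(),
    "clean": {"extract"},
    "aggregate": {"clean"},
    "report": {"aggregate", "clean"},
}


def execution_batches(deps):
    """Group tasks into batches whose dependencies are all already complete."""
    sorter = TopologicalSorter(deps)
    sorter.prepare()  # raises graphlib.CycleError if the graph has a cycle
    batches = []
    while sorter.is_active():
        ready = sorted(sorter.get_ready())  # tasks with no unmet dependencies
        batches.append(ready)
        sorter.done(*ready)  # mark the batch finished to unlock downstream tasks
    return batches


batches = execution_batches(dependencies)
print(batches)
```

Tasks that land in the same batch do not depend on each other, so an orchestrator could, in principle, dispatch them to different workers in parallel.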

The key features of DolphinScheduler are as follows:

  • Easy to deploy: four deployment modes are available, namely Standalone, Cluster, Docker, and Kubernetes.
  • Easy to use: workflows can be created and managed via the Web UI, the Python SDK, or the Open API.
  • Highly reliable and highly available: a decentralized, multi-master and multi-worker architecture with native support for horizontal scaling.
  • High performance: several times faster than other orchestration platforms, and capable of handling tens of millions of tasks per day.
  • Cloud native: DolphinScheduler supports orchestrating workflows across multiple clouds and data centers, and allows custom task types.
  • Workflow versioning: version control is provided for both workflows and individual workflow instances, including tasks.
  • Flexible state control: workflows and tasks can be paused, stopped, and recovered at any time.
  • Multi-tenancy support.
  • Additional features: backfill support (native in the Web UI), permission control for projects and data sources, and more.
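
As an illustration of what backfill means, the sketch below lists the schedule dates a daily workflow would have to be re-run for over a past date range. It shows the general concept only and is not how DolphinScheduler implements backfill; the `backfill_dates` helper and the sample dates are hypothetical.

```python
from datetime import date, timedelta


def backfill_dates(start, end):
    """Return every daily schedule date from start to end, inclusive."""
    if end < start:
        raise ValueError("end must not be earlier than start")
    days = (end - start).days
    return [start + timedelta(days=offset) for offset in range(days + 1)]


# Hypothetical range that crosses a month boundary.
runs = backfill_dates(date(2024, 1, 30), date(2024, 2, 2))
for run_date in runs:
    print(run_date.isoformat())
```

Each date in the list would correspond to one workflow instance created with that schedule date rather than the current date.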

QuickStart

User Interface Screenshots

  • Homepage: Project and workflow overview, including the latest status statistics for workflow instances and task instances.

  • Workflow Definition: Create and manage workflows by drag and drop. Complex workflows are easy to build and maintain, and a wide range of task types is supported out of the box.

  • Workflow Tree View: An abstract tree structure gives a clearer view of the relationships between tasks.

  • Data Source: Supports multiple external data sources and provides unified data access for MySQL, PostgreSQL, Hive, Trino, etc.

  • Monitor: View the status of the master, worker, and database in real time, including server resource usage and load, and run a quick health check without logging in to the server.

Suggestions & Bug Reports

Follow this guide to report your suggestions or bugs.

Contributing

The community welcomes contributions from everyone. Please refer to this page to find out more details: How to contribute. Check out good first issues here if you are new to DolphinScheduler.

Community

Welcome to join the Apache DolphinScheduler community by:

Landscapes