Apache DolphinScheduler is a modern data orchestration platform for building high-performance workflows quickly with low-code tools.

Apache DolphinScheduler

About

Apache DolphinScheduler is a modern data orchestration platform that empowers agile, low-code development of high-performance workflows. It is dedicated to handling complex task dependencies in data pipelines and provides a wide range of built-in job types out of the box.
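
To make "complex task dependencies" concrete, here is a minimal sketch of dependency-aware ordering. It is only an illustration of the general idea and uses Python's standard `graphlib` module; it is not DolphinScheduler's API or its scheduling algorithm. The task names and the `execution_batches` helper are hypothetical.

```python
from graphlib import TopologicalSorter

# Hypothetical pipeline: each task maps to the set of tasks it depends on.
dependencies = {
    "extract_orders": set(),
    "extract_users": set(),
    "join": {"extract_orders", "extract_users"},
    "report": {"join"},
}


def execution_batches(deps):
    """Group tasks into batches whose members could run in parallel.

    A task is placed in a batch only after all of its dependencies
    were placed in earlier batches.
    """
    sorter = TopologicalSorter(deps)
    sorter.prepare()  # raises graphlib.CycleError if dependencies are circular
    batches = []
    while sorter.is_active():
        ready = sorted(sorter.get_ready())  # sorted for a stable order
        batches.append(ready)
        sorter.done(*ready)  # mark the batch finished to unlock dependents
    return batches


batches = execution_batches(dependencies)
print(batches)
```

In this sketch the two extract tasks land in the first batch because neither depends on anything, while `join` and `report` follow in their own batches. An orchestration platform layers scheduling, retries, and state tracking on top of this kind of ordering.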

Key features of DolphinScheduler:

  • Easy to deploy: four deployment modes are available (Standalone, Cluster, Docker, and Kubernetes).
  • Easy to use: workflows can be created and managed via the Web UI, the Python SDK, or the Open API.
  • Highly reliable and highly available: a decentralized multi-master, multi-worker architecture with native support for horizontal scaling.
  • High performance: several times faster than other orchestration platforms, and capable of handling tens of millions of tasks per day.
  • Cloud native: supports orchestrating workflows across multiple clouds and data centers, and allows custom task types.
  • Workflow versioning: provides version control for workflows and for individual workflow instances, including their tasks.
  • Flexible state control: workflows and tasks can be paused, stopped, and recovered at any time.
  • Multi-tenancy support.
  • Additional features: backfill support (native in the Web UI) and permission control covering projects, data sources, and more.

QuickStart

User Interface Screenshots

  • Homepage: Project and workflow overview, including status statistics for the latest workflow instances and task instances.

  • Workflow Definition: Create and manage workflows by drag and drop, making complex workflows easy to build and maintain, with a wide range of task types supported out of the box.

  • Workflow Tree View: An abstract tree structure gives a clearer view of task relationships.

  • Data Source: Supports multiple external data sources and provides unified data access for MySQL, PostgreSQL, Hive, Trino, and others.

  • Monitor: View the status of the master, worker, and database servers in real time, including resource usage and load, for a quick health check without logging in to the server.

Suggestions & Bug Reports

Follow this guide to report your suggestions or bugs.

Contributing

The community welcomes contributions from everyone. Please refer to this page to find out more details: How to contribute. Check out good first issues here if you are new to DolphinScheduler.

Community

Welcome to join the Apache DolphinScheduler community by:

Landscapes