A cluster is a logical unit composed of a group of physical or virtual hosts, used to host the distributed runtime environment for big data components. Each cluster has independent configuration space and resource isolation boundaries.
A predefined standardized service collection that includes installation scripts, configuration templates, and dependency relationship descriptions.
Stack | Description |
---|---|
Infra | Services shared by all clusters, such as the monitoring system Prometheus |
Bigtop | Services provided by Apache Bigtop, such as Hadoop/Hive/Spark, etc. |
Extra | Community-provided or custom services, such as SeaTunnel |
A service unit running on a cluster, representing specific big data services (such as Hadoop/Hive/Spark, etc.).
A runtime instance within a service, corresponding to specific processes or functional modules. Component-level operations (start/stop, etc.) are executed by the Agent.
graph TB Hadoop-->NameNode Hadoop-->DataNode Hadoop-->ResourceManager Kafka-->KB[Kafka Broker] Solr-->SI[Solr Instance] SeaTunnel-->SM[SeaTunnel Master] SeaTunnel-->SW[SeaTunnel Worker] SeaTunnel-->SL[SeaTunnel Client]
The smallest schedulable unit initiated by users, representing a complete operation and maintenance target. For example:Start Hadoop service
, Update Spark configuration and restart
, etc.
A logical execution unit decomposed from a Job (Job), corresponding to independent operation steps of service components. For example:
The Start Hadoop
Job → Start NameNode
Stage, Start DataNode
Stage, etc.
An execution instance of a Stage (Stage) on a specific host, representing the smallest granularity of operation instructions. For example:
The Start NameNode
Stage → Start NameNode on host-01
Task, Start NameNode on host-02
Task.
After users submit operation requests via the REST API:
PENDING
).The scheduler executes Stages in DAG order:
Processing flow after the Agent receives a Task:
Execution Guarantee Mechanisms:
State Type | Trigger Condition | Handling Strategy |
---|---|---|
PENDING | Task created but not scheduled | Wait for invocation |
RUNNING | Task in execution | Monitor timeout threshold |
SUCCESSFUL/FAILED | Task execution result | Update component status |
CANCELED | Task canceled (only exists for Stage/Task) | Cancel subsequent tasks |