The guide in this section will show you how to manage data flows through the Inlong dashboard and introduce some management configurations. Through this guide, you will learn to perform operations such as data access, data consumption, data synchronization, and cluster management.
To start using the InLong dashboard, please deploy it first. Here are several ways to do it:
:::tip We recommend using docker for deployment. :::
Requires the user to enter the system account name and password. The default account name is admin and the password is inlong. It is recommended that you change the password in time after logging in.
The data access module is responsible for aggregating data from various data sources into a unified storage service, enabling further data queries and analysis.
Click on [Create] button, this step requires you to fill in the basic information of the data flow group.
Currently, supports three message middleware of MQ. we use TubeMQ as an example. you can also check Kafka data access example and Pulsar data access example.
:::info MQ Type
Click on [Next] button, to enter the data stream information filling step.
Data stream has a specific data source, data format and data sink. Click [Create] button to create a new data stream:
CSV, Key-Value, Avro, JSONYes: skip the invalid data and continue processing.No: stop processing when an error is encountered.UTF-8, GBK.Click on [Ok] button, to save the data stream information.
The left side is the data source, and the right side is the data target. You can find more examples of create data source and data target in the [Data Ingestion] section under [Quick Start].
The types supported by the data source are as:
The types supported by the data target are as:
After create the data source and data target, click the [Submit] button to enter the approval management.
Data synchronization is the process of establishing consistency between source and target data stores, and the continuous harmonization of the data over time. Click the [Create] button to create a data synchronization group.
InLong has two data synchronization modes, one is real-time data synchronization and the other is offline data synchronization.
Suitable for low-latency scenarios such as real-time monitoring and transaction flow synchronization
Select the synchronization type of [RealTime]
Click the [Next] button to create data stream.
Here are a few examples of creating real-time data synchronization:
Applicable to scenarios such as data warehouse construction and historical migration.
Select the synchronization type of [Offline], In addition to filling in the basic data stream group information, you also need to fill in offline synchronization related information, such as scheduling rules and dependency configuration.
Scheduling Rules:
quartz or airflow or dolphinScheduler engine.Crontab type.Here are a few examples of creating offline data synchronization:
Data Subscription provides subscribers bulk data feeds of the data they are entitled to access.
Click [New subscribe] button to create a new data subscription.
The data node module displays the list of data nodes within the current user permissions. You can view, edit, update and delete the details of these nodes.
Click [Create] button to pop up a dialog box for create a new node. You can choose different node types, such as: Redis, Kafka, MySQL, etc.
The cluster management module is divided into two modules: cluster label management and cluster management. Cluster label management is used to manage cluster labels, and cluster management is used to manage clusters.
Cluster label management can add, delete, modify cluster labels, view the cluster list, bind clusters, etc.
Click [Bind cluster] button to open the Bind cluster information box and select the cluster you want to bind.
Cluster management can add, delete, modify clusters, view the cluster list, view cluster details, view cluster nodes, etc.
Click [Create] button to open dialog box for creating a new cluster will pop up.
Agent, Kafka, etc.The approval management function module currently includes my application and my approval, and all tasks of data access and consumption application approval in the management system.
Click [Detail] to view the current basic information and approval process of the task, display the current task list submitted by the applicant for data access and consumption in the system.
Generally, the MQ or Flink cluster configuration. You can view the error information on the page, or enter the Manager container to view detailed logs.