sidebar_position: 3

Source 常用选项

Source connector 的常用参数

:::caution 警告

旧的配置名称 result_table_name 已经过时，请尽快迁移到新名称 plugin_output。

:::

名称	类型	必填	默认值	描述
plugin_output	String	否	-	当未指定 `plugin_output` 时，此插件处理的数据将不会被注册为可由其他插件直接访问的数据集 `(dataStream/dataset)`，或称为临时表 `(table)`。当指定了 `plugin_output` 时，此插件处理的数据将被注册为可由其他插件直接访问的数据集 `(dataStream/dataset)`，或称为临时表 `(table)`。此处注册的数据集 `(dataStream/dataset)` 可通过指定 `plugin_input` 直接被其他插件访问。
parallelism	Int	否	-	当未指定 `parallelism` 时，默认使用环境中的 `parallelism`。当指定了 `parallelism` 时，将覆盖环境中的 `parallelism` 设置。

重要提示

在作业配置中使用 plugin_output 时，必须设置 plugin_input 参数。

任务示例

简单示例

注册一个流或批处理数据源，并在注册时返回表名 fake_table

source {
    FakeSourceStream {
        plugin_output = "fake_table"
    }
}

复杂示例

这是将Fake数据源转换并写入到两个不同的目标中

env {
  job.mode = "BATCH"
}

source {
  FakeSource {
    plugin_output = "fake"
    row.num = 100
    schema = {
      fields {
        id = "int"
        name = "string"
        age = "int"
        c_timestamp = "timestamp"
        c_date = "date"
        c_map = "map<string, string>"
        c_array = "array<int>"
        c_decimal = "decimal(30, 8)"
        c_row = {
          c_row = {
            c_int = int
          }
        }
      }
    }
  }
}

transform {
  Sql {
    plugin_input = "fake"
    plugin_output = "fake1"
    # 查询表名必须与字段 'plugin_input' 相同
    query = "select id, regexp_replace(name, '.+', 'b') as name, age+1 as age, pi() as pi, c_timestamp, c_date, c_map, c_array, c_decimal, c_row from dual"
  }
  # SQL 转换支持基本函数和条件操作
  # 但不支持复杂的 SQL 操作，包括：多源表/行 JOIN 和聚合操作等
}

sink {
  Console {
    plugin_input = "fake1"
  }
   Console {
    plugin_input = "fake"
  }
}