import ChangeLog from ‘../changelog/connector-maxcompute.md’;

Maxcompute

Maxcompute 源连接器

描述

用于从 Maxcompute 读取数据.

关键特性

选项

名称类型必需默认值
accessIdstring-
accesskeystring-
endpointstring-
projectstring-
table_namestring-
partition_specstring-
split_rowint10000
read_columnsArray-
table_listArray-
common-optionsstring
schemaconfig

accessId [string]

accessId 您的 Maxcompute 密钥 Id, 可以从阿里云访问哪个云.

accesskey [string]

accesskey Your Maxcompute 密钥, 可以从阿里云访问哪个云.

endpoint [string]

endpoint 您的 Maxcompute 端点以 http 开头.

project [string]

project 您在阿里云中创建的Maxcompute项目.

table_name [string]

table_name 目标Maxcompute表名,例如:fake.

partition_spec [string]

partition_spec Maxcompute分区表的此规范,例如:ds=‘20220101’.

split_row [int]

split_row 每次拆分的行数,默认值: 10000.

read_columns [Array]

read_columns 要读取的列,如果未设置,则将读取所有列。例如. [“col1”, “col2”]

table_list [Array]

要读取的表列表,您可以使用此配置代替 table_name.

common options

源插件常用参数, 详见 源通用选项 .

示例

表读取

source {
  Maxcompute {
    accessId="<your access id>"
    accesskey="<your access Key>"
    endpoint="<http://service.odps.aliyun.com/api>"
    project="<your project>"
    table_name="<your table name>"
    #partition_spec="<your partition spec>"
    #split_row = 10000
    #read_columns = ["col1", "col2"]
  }
}

使用表列表读取

source {
  Maxcompute {
    accessId="<your access id>"
    accesskey="<your access Key>"
    endpoint="<http://service.odps.aliyun.com/api>"
    project="<your project>" # default project
    table_list = [
      {
        table_name = "test_table"
        #partition_spec="<your partition spec>"
        #split_row = 10000
        #read_columns = ["col1", "col2"]
      },
      {
        project = "test_project"
        table_name = "test_table2"
        #partition_spec="<your partition spec>"
        #split_row = 10000
        #read_columns = ["col1", "col2"]
      }
    ]
  }
}

变更日志