import ChangeLog from '../changelog/connector-jdbc.md';
# Oracle
> JDBC Oracle Sink Connector
## Supported Engines
> Spark<br/>
> Flink<br/>
> SeaTunnel Zeta<br/>
## Description
Writes data through JDBC. Supports batch mode and streaming mode, concurrent writing, and exactly-once
semantics (guaranteed by XA transactions).
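The execution mode is selected in the job's `env` block, using the same options shown in the examples later on this page. A minimal sketch, assuming the standard SeaTunnel `env` settings; the parallelism value is only a placeholder:

```
# Minimal sketch: pick the execution mode for the whole job.
env {
  # "BATCH" for a bounded one-off load, "STREAMING" for continuous synchronization.
  job.mode = "STREAMING"
  # Placeholder parallelism; tune it to your cluster.
  parallelism = 2
}
```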
## Using Dependency
### For Spark/Flink Engine
> 1. You need to ensure that the [jdbc driver jar package](https://mvnrepository.com/artifact/com.oracle.database.jdbc/ojdbc8) has been placed in directory `${SEATUNNEL_HOME}/plugins/`.
### For SeaTunnel Zeta Engine
> 1. You need to ensure that the [jdbc driver jar package](https://mvnrepository.com/artifact/com.oracle.database.jdbc/ojdbc8) has been placed in directory `${SEATUNNEL_HOME}/lib/`.
## Key Features
- [x] [exactly-once](../../concept/connector-v2-features.md)
- [x] [cdc](../../concept/connector-v2-features.md)
> XA transactions are used to guarantee `exactly-once` semantics, so `exactly-once` is only supported for databases that
> support XA transactions. You can set `is_exactly_once=true` to enable it.
## Supported DataSource Info
| Datasource | Supported Versions | Driver | Url | Maven |
|------------|----------------------------------------------------------|--------------------------|----------------------------------------|--------------------------------------------------------------------|
| Oracle     | Different dependency versions have different driver classes. | oracle.jdbc.OracleDriver | jdbc:oracle:thin:@datasource01:1523:xe | https://mvnrepository.com/artifact/com.oracle.database.jdbc/ojdbc8 |
## Database Dependency
> Please download the driver jar listed in the 'Maven' column of the support list above and copy it to the '$SEATUNNEL_HOME/plugins/jdbc/lib/' working directory<br/>
> For example, for the Oracle datasource: cp ojdbc8-xxxxxx.jar $SEATUNNEL_HOME/lib/<br/>
> To support the i18n character set, copy orai18n.jar to the $SEATUNNEL_HOME/lib/ directory.
## Data Type Mapping
| Oracle Data Type | SeaTunnel Data Type |
|--------------------------------------------------------------------------------------|---------------------|
| INTEGER | INT |
| FLOAT | DECIMAL(38, 18) |
| NUMBER(precision <= 9, scale == 0) | INT |
| NUMBER(9 < precision <= 18, scale == 0) | BIGINT |
| NUMBER(18 < precision, scale == 0) | DECIMAL(38, 0) |
| NUMBER(scale != 0) | DECIMAL(38, 18) |
| BINARY_DOUBLE | DOUBLE |
| BINARY_FLOAT<br/>REAL | FLOAT |
| CHAR<br/>NCHAR<br/>NVARCHAR2<br/>VARCHAR2<br/>LONG<br/>ROWID<br/>NCLOB<br/>CLOB                       | STRING              |
| DATE | DATE |
| TIMESTAMP<br/>TIMESTAMP WITH LOCAL TIME ZONE | TIMESTAMP |
| BLOB<br/>RAW<br/>LONG RAW<br/>BFILE | BYTES |
## Options
| Name | Type | Required | Default | Description |
|-------------------------------------------|---------|----------|------------------------------|------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|
| url | String | Yes | - | The URL of the JDBC connection. Refer to a case: jdbc:oracle:thin:@datasource01:1523:xe |
| driver | String | Yes | - | The jdbc class name used to connect to the remote data source,<br/> if you use Oracle the value is `oracle.jdbc.OracleDriver`. |
| username | String | No | - | Connection instance user name |
| password | String | No | - | Connection instance password |
| query                                      | String  | No       | -                            | Use this SQL to write upstream input data to the database, e.g. `INSERT ...`. `query` has a higher priority. |
| database                                   | String  | No       | -                            | Use this `database` and `table-name` to auto-generate SQL and write the received upstream input data to the database.<br/>This option is mutually exclusive with `query` and has a higher priority. |
| table                                      | String  | No       | -                            | Use `database` and this table name to auto-generate SQL and write the received upstream input data to the database.<br/>This option is mutually exclusive with `query` and has a higher priority. |
| primary_keys                               | Array   | No       | -                            | This option is used to support operations such as `insert`, `delete`, and `update` when SQL is automatically generated. |
| connection_check_timeout_sec | Int | No | 30 | The time in seconds to wait for the database operation used to validate the connection to complete. |
| max_retries                                | Int     | No       | 0                            | The number of retries for a failed submission (executeBatch) |
| batch_size                                 | Int     | No       | 1000                         | For batch writing, when the number of buffered records reaches `batch_size` or the time reaches `batch_interval_ms`,<br/>the data will be flushed into the database |
| batch_interval_ms                          | Int     | No       | 1000                         | For batch writing, when the number of buffered records reaches `batch_size` or the time reaches `batch_interval_ms`, the data will be flushed into the database |
| is_exactly_once | Boolean | No | false | Whether to enable exactly-once semantics, which will use Xa transactions. If on, you need to<br/>set `xa_data_source_class_name`. |
| generate_sink_sql | Boolean | No | false | Generate sql statements based on the database table you want to write to. |
| xa_data_source_class_name                  | String  | No       | -                            | The XA data source class name of the database driver; for Oracle it is `oracle.jdbc.xa.client.OracleXADataSource`.<br/>Please refer to the appendix for other data sources. |
| max_commit_attempts | Int | No | 3 | The number of retries for transaction commit failures |
| transaction_timeout_sec                    | Int     | No       | -1                           | The timeout after the transaction is opened; the default is -1 (never time out). Note that setting a timeout may affect<br/>exactly-once semantics |
| auto_commit | Boolean | No | true | Automatic transaction commit is enabled by default |
| properties                                 | Map     | No       | -                            | Additional connection configuration parameters. When `properties` and the URL contain the same parameter, the priority is determined by the<br/>specific implementation of the driver. For example, in MySQL, `properties` take precedence over the URL. |
| common-options | | No | - | Sink plugin common parameters, please refer to [Sink Common Options](../sink-common-options.md) for details |
| schema_save_mode                           | Enum    | No       | CREATE_SCHEMA_WHEN_NOT_EXIST | Before the synchronization task starts, this selects how to handle an existing table structure on the target side. |
| data_save_mode                             | Enum    | No       | APPEND_DATA                  | Before the synchronization task starts, this selects how to handle existing data on the target side. |
| custom_sql                                 | String  | No       | -                            | When `data_save_mode` is set to `CUSTOM_PROCESSING`, you should fill in the `custom_sql` parameter with an executable SQL statement. The SQL will be executed before the synchronization task (see the sketch after this table). |
| enable_upsert                              | Boolean | No       | true                         | Enable upsert by `primary_keys`. If the task data has no duplicate keys, setting this parameter to `false` can speed up data import |
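As a sketch of how the save-mode options above fit together, the configuration below combines `schema_save_mode`, `data_save_mode = "CUSTOM_PROCESSING"` with a `custom_sql` statement, and a `properties` map. The host, credentials, table name, SQL statement, and connection property are placeholders, not recommendations:

```
sink {
  Jdbc {
    url = "jdbc:oracle:thin:@datasource01:1523:xe"
    driver = "oracle.jdbc.OracleDriver"
    username = root
    password = 123456
    generate_sink_sql = true
    database = XE
    table = "TEST.TEST_TABLE"
    # Create the table if it does not exist yet.
    schema_save_mode = "CREATE_SCHEMA_WHEN_NOT_EXIST"
    # Run custom_sql on the target before the synchronization starts (placeholder SQL).
    data_save_mode = "CUSTOM_PROCESSING"
    custom_sql = "DELETE FROM TEST.TEST_TABLE WHERE AGE < 0"
    # Extra driver-level parameters; the driver decides priority versus the URL.
    properties {
      "oracle.net.CONNECT_TIMEOUT" = "10000"
    }
  }
}
```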
### Tips
> If `partition_column` is not set, the job will run with single concurrency; if `partition_column` is set, it will be executed in parallel according to the task concurrency.
## Task Example
### Simple
> This example defines a SeaTunnel synchronization task that automatically generates data through FakeSource and sends it to the JDBC sink. FakeSource generates a total of 16 rows of data (row.num=16), with each row having two fields, name (string type) and age (int type). The final target table test_table will also contain 16 rows of data. Before running this job, you need to create the database test and the table test_table in your Oracle instance. If you have not yet installed and deployed SeaTunnel, you need to follow the instructions in [Install SeaTunnel](../../start-v2/locally/deployment.md) to install and deploy SeaTunnel, and then follow the instructions in [Quick Start With SeaTunnel Engine](../../start-v2/locally/quick-start-seatunnel-engine.md) to run this job.
```
# Defining the runtime environment
env {
  parallelism = 1
  job.mode = "BATCH"
}

source {
  FakeSource {
    parallelism = 1
    plugin_output = "fake"
    row.num = 16
    schema = {
      fields {
        name = "string"
        age = "int"
      }
    }
  }
  # If you would like to get more information about how to configure seatunnel and see full list of source plugins,
  # please go to https://seatunnel.apache.org/docs/connector-v2/source
}

transform {
  # If you would like to get more information about how to configure seatunnel and see full list of transform plugins,
  # please go to https://seatunnel.apache.org/docs/transform-v2
}

sink {
  jdbc {
    url = "jdbc:oracle:thin:@datasource01:1523:xe"
    driver = "oracle.jdbc.OracleDriver"
    username = root
    password = 123456
    query = "INSERT INTO TEST.TEST_TABLE(NAME,AGE) VALUES(?,?)"
  }
  # If you would like to get more information about how to configure seatunnel and see full list of sink plugins,
  # please go to https://seatunnel.apache.org/docs/connector-v2/sink
}
```
### Generate Sink SQL
> In this example you do not need to write complex SQL statements. Just configure the database name and table name, and the insert statements will be generated for you automatically.
```
sink {
  Jdbc {
    url = "jdbc:oracle:thin:@datasource01:1523:xe"
    driver = "oracle.jdbc.OracleDriver"
    username = root
    password = 123456
    generate_sink_sql = true
    database = XE
    table = "TEST.TEST_TABLE"
  }
}
```
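If the upstream data is known to contain no duplicate keys, `enable_upsert` can be set to `false` to speed up the import, as described in the options table. A minimal sketch, reusing the same placeholder connection details as above:

```
sink {
  Jdbc {
    url = "jdbc:oracle:thin:@datasource01:1523:xe"
    driver = "oracle.jdbc.OracleDriver"
    username = root
    password = 123456
    generate_sink_sql = true
    database = XE
    table = "TEST.TEST_TABLE"
    primary_keys = ["ID"]
    # No duplicate keys are expected upstream, so skip the upsert path.
    enable_upsert = false
  }
}
```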
### Exactly-once
> For scenarios that require accurate writes, we guarantee exactly-once semantics.
```
sink {
  jdbc {
    url = "jdbc:oracle:thin:@datasource01:1523:xe"
    driver = "oracle.jdbc.OracleDriver"
    max_retries = 0
    username = root
    password = 123456
    query = "INSERT INTO TEST.TEST_TABLE(NAME,AGE) VALUES(?,?)"
    is_exactly_once = "true"
    xa_data_source_class_name = "oracle.jdbc.xa.client.OracleXADataSource"
  }
}
```
### CDC(Change Data Capture) Event
> CDC change data is also supported. In this case, you need to configure `database`, `table`, and `primary_keys`.
```
sink {
  jdbc {
    url = "jdbc:oracle:thin:@datasource01:1523:xe"
    driver = "oracle.jdbc.OracleDriver"
    username = root
    password = 123456
    generate_sink_sql = true
    # You need to configure both database and table
    database = XE
    table = "TEST.TEST_TABLE"
    primary_keys = ["ID"]
    schema_save_mode = "CREATE_SCHEMA_WHEN_NOT_EXIST"
    data_save_mode = "APPEND_DATA"
  }
}
```
## Changelog
<ChangeLog />