This document walks through the steps to register a Onetable synced Delta table in Unity Catalog on Databricks.
utilities-0.1.0-SNAPSHOT-bundled.jar by following the steps on the Installation pageCreate my_config.yaml in the cloned Onetable directory.
sourceFormat: HUDI|ICEBERG # choose only one targetFormats: - DELTA datasets: - tableBasePath: s3://path/to/source/data tableName: table_name partitionSpec: partitionpath:VALUE
:::tip Note: Replace s3://path/to/source/data to gs://path/to/source/data if you have your source table in GCS. And replace with appropriate values for sourceFormat, and tableName fields. :::
From your terminal under the cloned Onetable directory, run the sync process using the below command.
java -jar utilities/target/utilities-0.1.0-SNAPSHOT-bundled.jar -datasetConfig my_config.yaml
:::tip Note: At this point, if you check your bucket path, you will be able to see _delta_log directory with 00000000000000000000.json which contains the logs that helps query engines to interpret the source table as a Delta table. :::
In your Databricks workspace, under SQL editor, run the following queries.
CREATE CATALOG onetable; CREATE SCHEMA onetable.synced_delta_schema; CREATE TABLE onetable.synced_delta_schema.<table_name> USING DELTA LOCATION 's3://path/to/source/data';
:::tip Note: Replace s3://path/to/source/data to gs://path/to/source/data if you have your source table in GCS. :::
You can now see the created delta table in Unity Catalog under Catalog as <table_name> under synced_delta_schema and also query the table in the SQL editor:
SELECT * FROM onetable.synced_delta_schema.<table_name>;
In this guide we saw how to,