Data Integration Overview

The Data Integration feature in TiDB Cloud Lake provides a visual, no-code interface for bringing data from external systems into TiDB Cloud Lake through import, synchronization, or message consumption. The feature is built on two key concepts: data sources and integration tasks.

Key Concepts

Concept	Description
Data Sources	Reusable connection settings or credentials used to access external systems or send notifications, such as AWS Access Key / Secret Key, MySQL hostname / username / password, SQS (S3) queue URL, Kafka broker addresses, or a FeiShu bot webhook.
Integration Tasks	Executable tasks that define where data comes from, where the task writes data or how it saves results, which runtime parameters it uses, and how you start and monitor the task.

Data sources do not move data by themselves. They only store the information required to access external systems. Integration tasks are the units that actually perform imports, snapshots, continuous synchronization, or message consumption.

Not every data source corresponds to an ingestion task. For example, FeiShuBot is used for notifications rather than loading source data into TiDB Cloud Lake.
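The split between data sources and integration tasks can be pictured as a small data model. The sketch below is purely illustrative: Data Integration is configured through the visual interface, and every class, field, and kind name here (`DataSource`, `IntegrationTask`, `INGESTION_KINDS`, and so on) is a hypothetical stand-in, not part of any TiDB Cloud Lake API.

```python
from dataclasses import dataclass, field

# Hypothetical kind labels, loosely based on the examples in the table above.
INGESTION_KINDS = {"aws", "mysql", "sqs_s3", "kafka"}
NOTIFICATION_KINDS = {"feishu_bot"}


@dataclass
class DataSource:
    """Reusable connection settings. Holds access details, moves no data."""
    name: str
    kind: str
    settings: dict = field(default_factory=dict)

    def can_feed_ingestion(self) -> bool:
        # Notification-only sources (for example a FeiShu bot) return False.
        return self.kind in INGESTION_KINDS


@dataclass
class IntegrationTask:
    """The unit that actually performs an import, sync, or consumption."""
    name: str
    source: DataSource
    destination: str
    parameters: dict = field(default_factory=dict)

    def __post_init__(self) -> None:
        if not self.source.can_feed_ingestion():
            raise ValueError(
                f"data source {self.source.name!r} ({self.source.kind}) "
                "cannot be used as an ingestion source"
            )


# One data source can be reused by several tasks.
mysql_source = DataSource("orders-db", "mysql", {"host": "db.example.internal"})
snapshot_task = IntegrationTask("orders-snapshot", mysql_source, "lake.orders")
cdc_task = IntegrationTask("orders-cdc", mysql_source, "lake.orders")
```

In this model, building an `IntegrationTask` on a `DataSource` of kind `feishu_bot` raises `ValueError`, which mirrors the point that a notification data source has no corresponding ingestion task.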

Supported Integration Task Types

Task Type	Description
Amazon S3	Imports CSV, Parquet, or NDJSON files from Amazon S3 with support for one-time or continuous ingestion.
Amazon SQS (S3) (Beta)	Consumes S3 object creation events from an SQS queue and writes the corresponding object data into TiDB Cloud Lake.
MySQL	Synchronizes table data from MySQL using `Snapshot`, `CDC Only`, or `Snapshot + CDC` modes.
PostgreSQL	Synchronizes table data from PostgreSQL using `Snapshot`, `CDC Only`, or `Snapshot + CDC` modes.
Kafka Consumer Integration Task (Beta)	Continuously consumes messages from Kafka topics and saves the message content to internal object storage.
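The three modes listed for MySQL and PostgreSQL differ in which phases a task runs. The sketch below assumes the usual meaning of these terms (a snapshot copies the rows that already exist, and CDC streams the changes made afterward). The `SyncMode` enum and `phases` function are hypothetical names used for illustration only, and the exact behavior of each mode should be confirmed in the task-specific guide.

```python
from enum import Enum


class SyncMode(Enum):
    # Values match the mode labels shown in the table above.
    SNAPSHOT = "Snapshot"
    CDC_ONLY = "CDC Only"
    SNAPSHOT_CDC = "Snapshot + CDC"


def phases(mode: SyncMode) -> list[str]:
    """Return the phases a task would run, in order, under the assumed semantics."""
    steps = []
    if mode in (SyncMode.SNAPSHOT, SyncMode.SNAPSHOT_CDC):
        steps.append("copy existing rows")
    if mode in (SyncMode.CDC_ONLY, SyncMode.SNAPSHOT_CDC):
        steps.append("stream ongoing changes")
    return steps


for mode in SyncMode:
    print(f"{mode.value}: {', '.join(phases(mode))}")
```

Under these assumptions, `Snapshot` suits a one-time copy, `CDC Only` suits a destination that already holds the historical data, and `Snapshot + CDC` covers an initial load followed by continuous synchronization.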

Recommended Flow

1. Create and test reusable connection settings on the Data Sources page.
2. Review supported task types and their use cases on the Integration Tasks page.
3. Read the task-specific guide to configure the source, preview the data, and configure the result location or result viewing method.
4. Use the Task Management page to start tasks, check status, and troubleshoot execution issues.
