
Loading Semi-structured Data



Semi-structured data uses tags or markers to separate semantic elements but does not conform to a rigid database schema. TiDB Cloud Lake loads these formats with the COPY INTO command, which can also transform the data on the fly as it is loaded.
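As a rough illustration of the two loading modes mentioned above (a plain load and a load with on-the-fly transformation), the following Python sketch only assembles statement text; it does not connect to any service. The statement shape (`FROM @stage`, `FILE_FORMAT = (TYPE = ...)`, and a `SELECT` subquery for transformation) is an assumption not stated on this page, and the helper `build_copy_into`, the table name `events`, and the stage name `my_stage` are hypothetical. Check the per-format guide for the exact syntax before running any generated statement.

```python
# Format names taken from the "Supported File Formats" table on this page.
SUPPORTED_FORMATS = {"PARQUET", "CSV", "TSV", "NDJSON", "ORC", "AVRO"}


def build_copy_into(table, stage_path, file_format, select_list=None):
    """Return COPY INTO statement text.

    Hypothetical helper: the statement shape is an assumed syntax,
    not something this page confirms.
    """
    fmt = file_format.upper()
    if fmt not in SUPPORTED_FORMATS:
        raise ValueError(f"unsupported file format: {file_format}")
    if select_list:
        # Transformation on load: select or reshape columns from the staged files.
        source = f"(SELECT {select_list} FROM @{stage_path})"
    else:
        # Plain load: copy the staged files as they are.
        source = f"@{stage_path}"
    return f"COPY INTO {table} FROM {source} FILE_FORMAT = (TYPE = {fmt})"


# Hypothetical table and stage names, for illustration only.
plain = build_copy_into("events", "my_stage/events/", "parquet")
transformed = build_copy_into(
    "events", "my_stage/events.csv", "csv", select_list="$1, $2"
)
print(plain)
print(transformed)
```

Validating the format name against a fixed set keeps a typo from producing a statement that fails only once it reaches the server.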

Supported File Formats

| File Format | Description | Guide |
| --- | --- | --- |
| Parquet | Efficient columnar storage format | Loading Parquet |
| CSV | Comma-separated values | Loading CSV |
| TSV | Tab-separated values | Loading TSV |
| NDJSON | Newline-delimited JSON | Loading NDJSON |
| ORC | Optimized Row Columnar format | Loading ORC |
| Avro | Row-based format with schema definition | Loading Avro |
