
Auto Embedding Overview

The Auto Embedding feature lets you perform vector searches directly with plain text, without providing your own vectors. With this feature, you can insert text data directly and perform semantic searches using text queries, while TiDB automatically converts the text into vectors behind the scenes.

To use Auto Embedding, the basic workflow is as follows:

  1. Define a table with a text column and a generated vector column using EMBED_TEXT().
  2. Insert text data. TiDB generates and stores the vector embeddings automatically.
  3. Query with text. Use VEC_EMBED_COSINE_DISTANCE() or VEC_EMBED_L2_DISTANCE() to find semantically similar content.

Availability

Currently, Auto Embedding is only available on TiDB Cloud Starter clusters in the following AWS regions:

  • Frankfurt (eu-central-1)
  • Oregon (us-west-2)
  • N. Virginia (us-east-1)

Quick start example

The following example shows how to use Auto Embedding with cosine distance to perform a semantic search. No API key is required in this example because it uses an embedding model hosted by TiDB Cloud.

-- Create a table with auto-embedding
-- The dimension of the vector column must match the dimension of the embedding model,
-- otherwise TiDB returns an error when inserting data.
CREATE TABLE documents (
    id INT PRIMARY KEY AUTO_INCREMENT,
    content TEXT,
    content_vector VECTOR(1024) GENERATED ALWAYS AS (
        EMBED_TEXT("tidbcloud_free/amazon/titan-embed-text-v2", content)
    ) STORED
);

-- Insert text data (vectors are generated automatically)
INSERT INTO documents (content) VALUES
    ("Electric vehicles reduce air pollution in cities."),
    ("Solar panels convert sunlight into renewable energy."),
    ("Plant-based diets lower carbon footprints significantly."),
    ("Deep learning algorithms improve medical diagnosis accuracy."),
    ("Blockchain technology enhances data security systems.");

-- Search for semantically similar content using text query
SELECT id, content
FROM documents
ORDER BY VEC_EMBED_COSINE_DISTANCE(
    content_vector,
    "Renewable energy solutions for environmental protection"
)
LIMIT 3;

The output is as follows:

+----+--------------------------------------------------------------+
| id | content                                                      |
+----+--------------------------------------------------------------+
|  2 | Solar panels convert sunlight into renewable energy.         |
|  1 | Electric vehicles reduce air pollution in cities.            |
|  4 | Deep learning algorithms improve medical diagnosis accuracy. |
+----+--------------------------------------------------------------+

The preceding example uses the Amazon Titan model. For other models, see Available text embedding models.

Auto Embedding + Vector index

Auto Embedding is compatible with Vector index, which improves query performance. You can define a vector index on the generated vector column, and TiDB uses it automatically:

-- Create a table with auto-embedding and a vector index
CREATE TABLE documents (
    id INT PRIMARY KEY AUTO_INCREMENT,
    content TEXT,
    content_vector VECTOR(1024) GENERATED ALWAYS AS (
        EMBED_TEXT("tidbcloud_free/amazon/titan-embed-text-v2", content)
    ) STORED,
    VECTOR INDEX ((VEC_COSINE_DISTANCE(content_vector)))
);

-- Insert text data (vectors are generated automatically)
INSERT INTO documents (content) VALUES
    ("Electric vehicles reduce air pollution in cities."),
    ("Solar panels convert sunlight into renewable energy."),
    ("Plant-based diets lower carbon footprints significantly."),
    ("Deep learning algorithms improve medical diagnosis accuracy."),
    ("Blockchain technology enhances data security systems.");

-- Search for semantically similar content with a text query on the vector index
-- using the same VEC_EMBED_COSINE_DISTANCE() function
SELECT id, content
FROM documents
ORDER BY VEC_EMBED_COSINE_DISTANCE(
    content_vector,
    "Renewable energy solutions for environmental protection"
)
LIMIT 3;
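
To check whether a query actually uses the vector index, you can inspect its execution plan with EXPLAIN. The following is a quick sketch based on the preceding example; the exact plan output depends on your cluster and data volume:

-- Check the execution plan of the text-query search.
-- When the vector index is used, the plan contains an index-based (ANN) scan
-- instead of a full scan with an exact distance calculation.
EXPLAIN
SELECT id, content
FROM documents
ORDER BY VEC_EMBED_COSINE_DISTANCE(
    content_vector,
    "Renewable energy solutions for environmental protection"
)
LIMIT 3;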

Available text embedding models

TiDB Cloud supports various embedding models. Choose the one that best fits your needs:

| Embedding model | Documentation | Hosted by TiDB Cloud 1 | BYOK 2 |
|---|---|---|---|
| Amazon Titan | Amazon Titan Embeddings | ✅ | |
| Cohere | Cohere Embeddings | ✅ | ✅ |
| Jina AI | Jina AI Embeddings | | ✅ |
| OpenAI | OpenAI Embeddings | | ✅ |
| Gemini | Gemini Embeddings | | ✅ |

You can also use open-source embedding models through the following inference services that TiDB Cloud supports:

| Embedding model | Documentation | Hosted by TiDB Cloud 1 | BYOK 2 | Example supported models |
|---|---|---|---|---|
| HuggingFace Inference | HuggingFace Embeddings | | ✅ | bge-m3, multilingual-e5-large |
| NVIDIA NIM | NVIDIA NIM Embeddings | | ✅ | bge-m3, nv-embed-v1 |

1 Hosted models are hosted by TiDB Cloud and do not require any API keys. Currently, these hosted models are free to use, but certain usage limits might be applied to keep them available to everyone.

2 BYOK (Bring Your Own Key) models require you to provide your own API keys from the corresponding embedding provider. TiDB Cloud does not charge for the usage of BYOK models. You are responsible for managing and monitoring the costs associated with using these models.

How Auto Embedding works

Auto Embedding uses the EMBED_TEXT() function to convert text into vector embeddings with your chosen embedding model. The generated vectors are stored in VECTOR columns and can be queried with plain text using VEC_EMBED_COSINE_DISTANCE() or VEC_EMBED_L2_DISTANCE().

Internally, VEC_EMBED_COSINE_DISTANCE() and VEC_EMBED_L2_DISTANCE() are executed as VEC_COSINE_DISTANCE() and VEC_L2_DISTANCE(), with the text query automatically converted into a vector embedding.
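
For example, a text query written with VEC_EMBED_COSINE_DISTANCE() behaves roughly like the rewritten query below. This is only a conceptual sketch to illustrate the conversion, assuming the documents table from the quick start; it is not necessarily a query form you need to write yourself:

-- Query written with the text-based function:
SELECT id, content
FROM documents
ORDER BY VEC_EMBED_COSINE_DISTANCE(
    content_vector,
    "Renewable energy solutions for environmental protection"
)
LIMIT 3;

-- Conceptual equivalent: the query text is first embedded with the same model
-- that generates content_vector, then compared using VEC_COSINE_DISTANCE().
SELECT id, content
FROM documents
ORDER BY VEC_COSINE_DISTANCE(
    content_vector,
    EMBED_TEXT("tidbcloud_free/amazon/titan-embed-text-v2",
               "Renewable energy solutions for environmental protection")
)
LIMIT 3;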

Key functions

EMBED_TEXT()

Converts text to vector embeddings:

EMBED_TEXT("model_name", text_content[, additional_json_options])

Use this function in GENERATED ALWAYS AS clauses to automatically generate embeddings when inserting or updating text data.

VEC_EMBED_COSINE_DISTANCE()

Calculates the cosine distance between a vector stored in the vector column and a text query:

VEC_EMBED_COSINE_DISTANCE(vector_column, "query_text")

Use this function in ORDER BY clauses to rank results by cosine distance. It uses the same calculation as VEC_COSINE_DISTANCE(), but automatically generates the embedding for the query text.

VEC_EMBED_L2_DISTANCE()

Calculates L2 (Euclidean) distance between a stored vector and a text query:

VEC_EMBED_L2_DISTANCE(vector_column, "query_text")

Use this function in ORDER BY clauses to rank results by L2 distance. It uses the same calculation as VEC_L2_DISTANCE(), but automatically generates the embedding for the query text.
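
For example, using the documents table from the quick start, the following query ranks results by Euclidean distance instead of cosine distance:

-- Rank results by L2 (Euclidean) distance between the stored embeddings
-- and the embedding generated for the query text.
SELECT id, content
FROM documents
ORDER BY VEC_EMBED_L2_DISTANCE(
    content_vector,
    "Renewable energy solutions for environmental protection"
)
LIMIT 3;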

Use Auto Embedding in Python

See PyTiDB Documentation.

See also
