Determine Your TiDB Size

This document describes how to determine the size of a TiDB Cloud Dedicated cluster.

Note

You cannot change the size of a TiDB Cloud Starter or TiDB Cloud Essential cluster.

Size TiDB

TiDB is for computing only and does not store data. It is horizontally scalable.

You can configure node count, vCPU, and RAM for TiDB.

To learn about the performance test results of different cluster scales, see TiDB Cloud Performance Reference.

TiDB vCPU and RAM

The supported vCPU and RAM sizes include the following:

Standard size	High memory size
4 vCPU, 16 GiB	N/A
8 vCPU, 16 GiB	8 vCPU, 32 GiB
16 vCPU, 32 GiB	16 vCPU, 64 GiB
32 vCPU, 64 GiB	32 vCPU, 128 GiB

Note

To use the 32 vCPU, 128 GiB size of TiDB, contact TiDB Cloud Support.

If the vCPU and RAM size of TiDB is set to 4 vCPU, 16 GiB, note the following restrictions:

The node count of TiDB can only be set to 1 or 2, and the node count of TiKV is fixed to 3.
4 vCPU TiDB can only be used with 4 vCPU TiKV.
TiFlash is unavailable.

The 4 vCPU, 16 GiB size of TiDB is designed for learning, testing, and trial purposes. It is suitable for pre-production environments or small, non-critical workloads. However, it is NOT recommended for full-scale production due to performance limitations. If you need lower costs and an SLA guarantee for production, consider using the TiDB Cloud Essential cluster plan.

TiDB node count

For high availability, it is recommended that you configure at least two TiDB nodes for each TiDB Cloud cluster.

In general, TiDB performance increases linearly with the number of TiDB nodes. However, when the number of TiDB nodes exceeds 8, the performance increment becomes slightly less than linearly proportional. For each additional 8 nodes, the performance deviation coefficient is about 5%.

For example:

When there are 9 TiDB nodes, the performance deviation coefficient is about 5%, so the TiDB performance is about 9 * (1 - 5%) = 8.55 times the performance of a single TiDB node.
When there are 16 TiDB nodes, the performance deviation coefficient is about 10%, so the TiDB performance is 16 * (1 - 10%) = 14.4 times the performance of a single TiDB node.
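
The two examples above can be reproduced with a short Python sketch. The step function used for the coefficient (5% for every full group of 8 nodes) is an assumption chosen to match the examples in this document, not an official formula, and the function names are illustrative only.

```python
def deviation_coefficient(node_count: int) -> float:
    """Assumed step function: 5% for every full group of 8 nodes."""
    return 0.05 * (node_count // 8)


def relative_performance(node_count: int) -> float:
    """Cluster performance expressed as a multiple of single-node performance."""
    return node_count * (1 - deviation_coefficient(node_count))


# Print the multiplier for a few node counts, including the 9-node and 16-node examples.
for n in (4, 9, 16):
    print(n, round(relative_performance(n), 2))
```

Under this assumption, node counts below 8 scale linearly, which agrees with the guidance later in this section.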

At a given latency, the performance of a TiDB node varies depending on the read-write ratio of the workload.

The performance of an 8 vCPU, 16 GiB TiDB node in different workloads is as follows:

Workload	QPS (P95 ≈ 100ms)	QPS (P99 ≈ 300ms)	QPS (P99 ≈ 100ms)
Read	18,900	9,450	6,300
Mixed	15,500	7,750	5,200
Write	18,000	9,000	6,000

If the number of TiDB nodes is less than 8, the performance deviation coefficient is nearly 0%, so the TiDB performance of 16 vCPU, 32 GiB TiDB nodes is roughly twice that of 8 vCPU, 16 GiB TiDB nodes. If the number of TiDB nodes exceeds 8, it is recommended to choose 16 vCPU, 32 GiB TiDB nodes, because this requires fewer nodes and therefore results in a smaller performance deviation coefficient.

When planning your cluster size, you can estimate the number of TiDB nodes according to your workload type, your overall expected performance (QPS), and the performance of a single TiDB node corresponding to the workload type using the following formula:

node count = ceil(overall expected performance ÷ (performance per node * (1 - performance deviation coefficient)))

In the formula, you need to calculate node count = ceil(overall expected performance ÷ performance per node) first to get a rough node count, and then use the corresponding performance deviation coefficient to get the final result of the node count.

For example, suppose that your overall expected performance is 110,000 QPS under a mixed workload, your P95 latency is about 100 ms, and you want to use 8 vCPU, 16 GiB TiDB nodes. Then, you can get the estimated TiDB performance of an 8 vCPU, 16 GiB TiDB node from the preceding table (which is 15,500), and calculate a rough number of TiDB nodes as follows:

node count = ceil(110,000 ÷ 15,500) = 8

As the performance deviation coefficient of 8 nodes is about 5%, the estimated TiDB performance is 8 * 15,500 * (1 - 5%) = 117,800, which can meet your expected performance of 110,000 QPS.

Therefore, 8 TiDB nodes (8 vCPU, 16 GiB) are recommended for you.
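
The two-step estimate above (rough node count first, then a check against the derated capacity) can be sketched in Python as follows. The coefficient function is an assumption (5% for every full group of 8 nodes) that matches the examples in this document, and `estimate_tidb_nodes` is a hypothetical helper name, not part of any TiDB Cloud tool.

```python
import math


def deviation_coefficient(node_count: int) -> float:
    # Assumption: 5% for every full group of 8 nodes, matching this document's examples.
    return 0.05 * (node_count // 8)


def estimate_tidb_nodes(expected_qps: float, qps_per_node: float):
    """Return the rough node count and the estimated capacity after derating."""
    rough = math.ceil(expected_qps / qps_per_node)
    capacity = rough * qps_per_node * (1 - deviation_coefficient(rough))
    return rough, capacity


# Mixed workload, P95 of about 100 ms, 8 vCPU, 16 GiB TiDB nodes (15,500 QPS per node).
nodes, capacity = estimate_tidb_nodes(110_000, 15_500)
print(nodes, round(capacity), capacity >= 110_000)
```

If the estimated capacity falls short of the expected QPS, add nodes and repeat the check with the coefficient for the new node count.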

Size TiKV

TiKV is responsible for storing data. It is horizontally scalable.

You can configure node count, vCPU and RAM, and storage for TiKV.

To learn about the performance test results of different cluster scales, see TiDB Cloud Performance Reference.

TiKV vCPU and RAM

The supported vCPU and RAM sizes include the following:

Standard size	High memory size
4 vCPU, 16 GiB	N/A
8 vCPU, 32 GiB	8 vCPU, 64 GiB
16 vCPU, 64 GiB	Coming soon
32 vCPU, 128 GiB	N/A

Note

If the vCPU and RAM size of TiKV is set to 4 vCPU, 16 GiB, note the following restrictions:

The node count of TiDB can only be set to 1 or 2, and the node count of TiKV is fixed to 3.
4 vCPU TiKV can only be used with 4 vCPU TiDB.
TiFlash is unavailable.

The 4 vCPU, 16 GiB size of TiKV is designed for learning, testing, and trial purposes. It is suitable for pre-production environments or small, non-critical workloads. However, it is NOT recommended for full-scale production due to performance limitations. If you need lower costs and an SLA guarantee for production, consider using the TiDB Cloud Essential cluster plan.

TiKV node count

The number of TiKV nodes should be at least 1 set (3 nodes in 3 different availability zones).

TiDB Cloud deploys TiKV nodes evenly to all availability zones (at least 3) in the region you select to achieve durability and high availability. In a typical 3-replica setup, your data is distributed evenly among the TiKV nodes across all availability zones and is persisted to the disk of each TiKV node.

Note

When you scale your TiDB cluster, nodes in the 3 availability zones are increased or decreased at the same time. For how to scale in or scale out a TiDB cluster based on your needs, see Scale Your TiDB Cluster.

Although TiKV is mainly used for data storage, the performance of the TiKV node also varies depending on different workloads. Therefore, when planning the number of TiKV nodes, you need to estimate it according to both your data volume and expected performance, and then take the larger of the two estimates as the recommended node count.

Estimate TiKV node count according to data volume

You can calculate a recommended number of TiKV nodes according to your data volume as follows:

node count = ceil(size of your data * TiKV compression ratio * the number of replicas ÷ TiKV storage usage ratio ÷ one TiKV capacity ÷ 3) * 3

Generally, it is recommended to keep the usage ratio of TiKV storage below 80%. The number of replicas in TiDB Cloud is 3 by default. The maximum storage capacity of an 8 vCPU, 64 GiB TiKV node is 4096 GiB.

Based on historical data, the average TiKV compression ratio is around 40%.

Suppose that the size of your MySQL dump files is 20 TB and the TiKV compression ratio is 40%. Then, you can calculate a recommended number of TiKV nodes according to your data volume as follows:

node count = ceil(20 TB * 40% * 3 ÷ 0.8 ÷ 4096 GiB ÷ 3) * 3 = 9
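
The same calculation can be written as a small Python sketch. The function and parameter names are illustrative, the defaults are the values quoted in this section, and 1 TB is treated as 1024 GiB for simplicity (an assumption).

```python
import math


def tikv_nodes_for_data(
    data_gib: float,
    compression_ratio: float = 0.4,   # average ratio quoted in this section
    replicas: int = 3,                # default replica count in TiDB Cloud
    usage_ratio: float = 0.8,         # keep storage usage below 80%
    node_capacity_gib: float = 4096,  # maximum storage of one TiKV node
) -> int:
    """Estimate the TiKV node count from data volume, rounded up to a multiple of 3."""
    required_gib = data_gib * compression_ratio * replicas / usage_ratio
    return math.ceil(required_gib / node_capacity_gib / 3) * 3


# 20 TB of MySQL dump files, treated as 20 * 1024 GiB.
print(tikv_nodes_for_data(20 * 1024))
```

Dividing by 3 before rounding up and multiplying by 3 afterwards keeps the result a whole number of 3-node sets, one node per availability zone.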

Estimate TiKV node count according to expected performance

Similar to TiDB performance, TiKV performance increases linearly with the number of TiKV nodes. However, when the number of TiKV nodes exceeds 8, the performance increment becomes slightly less than linearly proportional. For each additional 8 nodes, the performance deviation coefficient is about 5%.

For example:

When there are 9 TiKV nodes, the performance deviation coefficient is about 5%, so the TiKV performance is about 9 * (1 - 5%) = 8.55 times the performance of a single TiKV node.
When there are 18 TiKV nodes, the performance deviation coefficient is about 10%, so the TiKV performance is 18 * (1 - 10%) = 16.2 times the performance of a single TiKV node.

At a given latency, the performance of a TiKV node varies depending on the read-write ratio of the workload.

The performance of an 8 vCPU, 32 GiB TiKV node in different workloads is as follows:

Workload	QPS (P95 ≈ 100ms)	QPS (P99 ≈ 300ms)	QPS (P99 ≈ 100ms)
Read	28,000	14,000	7,000
Mixed	17,800	8,900	4,450
Write	14,500	7,250	3,625

If the number of TiKV nodes is less than 8, the performance deviation coefficient is nearly 0%, so the performance of 16 vCPU, 64 GiB TiKV nodes is roughly twice that of 8 vCPU, 32 GiB TiKV nodes. If the number of TiKV nodes exceeds 8, it is recommended to choose 16 vCPU, 64 GiB TiKV nodes, because this requires fewer nodes and therefore results in a smaller performance deviation coefficient.

When planning your cluster size, you can estimate the number of TiKV nodes according to your workload type, your overall expected performance (QPS), and the performance of a single TiKV node corresponding to the workload type using the following formula:

node count = ceil(overall expected performance ÷ (performance per node * (1 - performance deviation coefficient)))

In the formula, you need to calculate node count = ceil(overall expected performance ÷ performance per node) first to get a rough node count, and then use the corresponding performance deviation coefficient to get the final result of the node count.

For example, suppose that your overall expected performance is 110,000 QPS under a mixed workload, your P95 latency is about 100 ms, and you want to use 8 vCPU, 32 GiB TiKV nodes. Then, you can get the estimated TiKV performance of an 8 vCPU, 32 GiB TiKV node from the preceding table (which is 17,800), and calculate a rough number of TiKV nodes as follows:

node count = ceil(110,000 ÷ 17,800) = 7

As 7 is less than 8, the performance deviation coefficient of 7 nodes is 0. The estimated TiKV performance is 7 * 17,800 * (1 - 0) = 124,600, which can meet your expected performance of 110,000 QPS.

Therefore, 7 TiKV nodes (8 vCPU, 32 GiB) are recommended for you according to your expected performance.

Next, you can compare the TiKV node count calculated according to data volume with the number calculated according to your expected performance, and take the larger one as a recommended number of your TiKV nodes.

TiKV node storage size

The supported node storage sizes of different TiKV vCPUs are as follows:

TiKV vCPU	Min node storage	Max node storage	Default node storage
4 vCPU	200 GiB	2048 GiB	500 GiB
8 vCPU	200 GiB	4096 GiB	500 GiB
16 vCPU	200 GiB	4096 GiB	500 GiB
32 vCPU	200 GiB	4096 GiB	500 GiB
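
As a rough illustration, the limits in the table above can be encoded as a lookup and used to check a planned storage size before you create a cluster. The dictionary and the `check_tikv_storage` helper are hypothetical and are not part of any TiDB Cloud API.

```python
# (min, max, default) node storage in GiB per TiKV vCPU size, copied from the table above.
TIKV_STORAGE_GIB = {
    4: (200, 2048, 500),
    8: (200, 4096, 500),
    16: (200, 4096, 500),
    32: (200, 4096, 500),
}


def check_tikv_storage(vcpu: int, storage_gib: int) -> bool:
    """Return True if the requested storage is within the supported range."""
    low, high, _default = TIKV_STORAGE_GIB[vcpu]
    return low <= storage_gib <= high


print(check_tikv_storage(4, 2048), check_tikv_storage(4, 4096))
```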

Note

You cannot decrease the TiKV node storage size after the cluster is created.

TiKV node storage types

TiDB Cloud provides the following TiKV storage types for TiDB Cloud Dedicated clusters hosted on AWS:

Basic storage
Standard storage
Performance and Plus storage

Basic storage

The Basic storage is a general-purpose storage type that provides lower performance than the Standard storage.

The Basic storage type is applied automatically to the following clusters hosted on AWS:

Existing clusters that are created before April 1, 2025.
New clusters that are created with TiDB versions earlier than v7.5.5, v8.1.2, or v8.5.0.

Standard storage

The Standard storage is ideal for most workloads, providing a balance between performance and cost efficiency. Compared with the Basic storage, it offers better performance by reserving ample disk resources for Raft logs. This reduces the impact of Raft I/O on data disk I/O, improving read and write performance for TiKV.

The Standard storage type is applied automatically to new clusters hosted on AWS and created with TiDB versions v7.5.5, v8.1.2, v8.5.0, or later.

Performance and Plus storage

The Performance and Plus storage provide higher performance and stability, with pricing that reflects these enhanced capabilities. Currently, these two storage types are only available upon request for clusters deployed on AWS. To request the Performance or Plus storage, click ? in the lower-right corner of the TiDB Cloud console, and then click Support Tickets to go to the Help Center. Create a ticket, fill in "Apply for TiKV storage type" in the Description field, and then click Submit.

Size TiFlash

TiFlash synchronizes data from TiKV in real time and supports real-time analytics workloads right out of the box. It is horizontally scalable.

You can configure node count, vCPU and RAM, and storage for TiFlash.

TiFlash vCPU and RAM

The supported vCPU and RAM sizes include the following:

8 vCPU, 64 GiB
16 vCPU, 128 GiB
32 vCPU, 128 GiB
32 vCPU, 256 GiB

Note that TiFlash is unavailable when the vCPU and RAM size of TiDB or TiKV is set to 4 vCPU, 16 GiB.

TiFlash node count

TiDB Cloud deploys TiFlash nodes evenly to different availability zones in a region. It is recommended that you configure at least two TiFlash nodes in each TiDB Cloud cluster and create at least two replicas of the data for high availability in your production environment.

The minimum number of TiFlash nodes depends on the TiFlash replica counts for specific tables:

Minimum number of TiFlash nodes: min((compressed size of table A * replicas for table A + compressed size of table B * replicas for table B) / size of each TiFlash capacity, max(replicas for table A, replicas for table B))

For example, if you configure the node storage of each TiFlash node on AWS as 1024 GiB, and set 2 replicas for table A (the compressed size is 800 GiB) and 1 replica for table B (the compressed size is 100 GiB), then the required number of TiFlash nodes is as follows:

Minimum number of TiFlash nodes: min((800 GiB * 2 + 100 GiB * 1) / 1024 GiB, max(2, 1)) ≈ 2
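
The example above can be checked with a minimal Python sketch that implements the formula exactly as written in this section. Rounding the result up to a whole node is an assumption based on the "≈ 2" in the example, and `min_tiflash_nodes` is a hypothetical helper name.

```python
import math


def min_tiflash_nodes(tables, node_capacity_gib: float) -> int:
    """tables: list of (compressed_size_gib, replica_count) pairs."""
    by_storage = sum(size * replicas for size, replicas in tables) / node_capacity_gib
    by_replicas = max(replicas for _, replicas in tables)
    # min() follows the formula in this section; rounding up is an assumption.
    return math.ceil(min(by_storage, by_replicas))


# Table A: 800 GiB with 2 replicas; table B: 100 GiB with 1 replica; 1024 GiB per node.
print(min_tiflash_nodes([(800, 2), (100, 1)], 1024))
```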

TiFlash node storage

The supported node storage of different TiFlash vCPUs is as follows:

TiFlash vCPU	Min node storage	Max node storage	Default node storage
8 vCPU	200 GiB	4096 GiB	500 GiB
16 vCPU	200 GiB	4096 GiB	500 GiB
32 vCPU	200 GiB	8192 GiB	500 GiB

Note

You cannot decrease the TiFlash node storage after the cluster is created.

TiFlash node storage types

TiDB Cloud provides the following TiFlash storage types for TiDB Cloud Dedicated clusters hosted on AWS:

Basic storage
Plus storage

Basic storage

The Basic storage is ideal for most workloads, providing a balance between performance and cost efficiency.

Plus storage

The Plus storage provides higher performance and stability, with pricing that reflects these enhanced capabilities. Currently, this storage type is only available upon request for clusters deployed on AWS. To request it, click ? in the lower-right corner of the TiDB Cloud console, and then click Support Tickets to go to the Help Center. Create a ticket, fill in "Apply for TiFlash storage type" in the Description field, and then click Submit.
