Configure TiDB for Optimal Performance

This guide describes how to optimize the performance of TiDB, including:

Best practices for common workloads.
Strategies for handling challenging performance scenarios.

Note

The optimization techniques in this guide can help achieve optimal performance in TiDB. However, performance tuning often involves balancing multiple factors, and no single solution addresses all performance needs. Some techniques in this guide use experimental features, which are marked accordingly. Although these optimizations can significantly improve performance, they might not be suitable for production environments and require careful evaluation before implementation.

Overview

Optimizing TiDB for peak performance requires careful tuning of various settings. In many cases, achieving optimal performance involves adjusting configurations beyond their default values.

The default settings prioritize stability over performance. To maximize performance, you might need to use more aggressive configurations and, in some cases, experimental features. These recommendations are based on production deployment experience and performance optimization research.

This guide describes the non-default settings, including their benefits and potential trade-offs. Use this information to optimize TiDB settings for your workload requirements.

Key settings for common workloads

The following settings are commonly used to optimize TiDB performance:

Enhance execution plan cache, such as SQL Prepared Execution Plan Cache and Non-prepared plan cache.
Optimize the behavior of the TiDB optimizer by using Optimizer Fix Controls.
Use the Titan storage engine more aggressively.
Fine-tune TiKV's compaction and flow control configurations, to ensure optimal and stable performance under write-intensive workloads.

These settings can significantly improve performance for many workloads. However, as with any optimization, thoroughly test them in your environment before deploying to production.

System variables

Execute the following SQL commands to apply the recommended settings:

SET GLOBAL tidb_session_plan_cache_size=200; 
SET GLOBAL tidb_enable_non_prepared_plan_cache=on;
SET GLOBAL tidb_ignore_prepared_cache_close_stmt=on;
SET GLOBAL tidb_stats_load_sync_wait=2000;
SET GLOBAL tidb_enable_inl_join_inner_multi_pattern=on;
SET GLOBAL tidb_opt_limit_push_down_threshold=10000;
SET GLOBAL tidb_opt_derive_topn=on;
SET GLOBAL tidb_runtime_filter_mode=LOCAL;
SET GLOBAL tidb_opt_enable_mpp_shared_cte_execution=on;
SET GLOBAL tidb_rc_read_check_ts=on;
SET GLOBAL tidb_guarantee_linearizability=off;
SET GLOBAL pd_enable_follower_handle_region=on;
SET GLOBAL tidb_opt_fix_control = '44262:ON,44389:ON,44823:10000,44830:ON,44855:ON,52869:ON';

The following table outlines the impact of specific system variable configurations:

System variable	Description	Note
`tidb_session_plan_cache_size`	Increase the maximum number of plans that can be cached from the default `100` to `200`. This improves performance for workloads with many prepared statement patterns.	Increasing this value can lead to higher memory usage for the session plan cache.
`tidb_enable_non_prepared_plan_cache`	Enable the Non-prepared plan cache feature to reduce compile costs for applications that do not use prepared statements.	N/A
`tidb_ignore_prepared_cache_close_stmt`	Cache plans for applications that use prepared statements but close the plan after each execution.	N/A
`tidb_stats_load_sync_wait`	Increase the timeout for synchronously loading statistics from the default 100 milliseconds to 2 seconds. This ensures TiDB loads the necessary statistics before query compilation.	Increasing this value leads to a longer synchronization wait time before query compilation.
`tidb_enable_inl_join_inner_multi_pattern`	Enable Index Join support when the inner table has `Selection` or `Projection` operators on it.	N/A
`tidb_opt_limit_push_down_threshold`	Increase the threshold that determines whether to push the `Limit` or `TopN` operator down to TiKV.	When multiple index options exist, increasing this variable makes the optimizer favor indexes that can optimize the `ORDER BY` and `Limit` operators.
`tidb_opt_derive_topn`	Enable the optimization rule of Deriving TopN or Limit from window functions.	This is limited to the `ROW_NUMBER()` window function.
`tidb_runtime_filter_mode`	Enable Runtime Filter in the local mode to improve hash join efficiency.	The variable is introduced in v7.2.0 and is disabled by default for safety.
`tidb_opt_enable_mpp_shared_cte_execution`	Enable non-recursive Common Table Expressions (CTE) pushdown to TiFlash.	This is an experimental feature.
`tidb_rc_read_check_ts`	For the read-committed isolation level, enabling this variable avoids the latency and cost of getting the global timestamp and optimizes transaction-level read latency.	This feature is incompatible with the Repeatable Read isolation level.
`tidb_guarantee_linearizability`	Improve performance by skipping the commit timestamp fetch from the PD server.	This sacrifices linearizability in favor of performance. Only causal consistency is guaranteed. It is not suitable for scenarios requiring strict linearizability.
`pd_enable_follower_handle_region`	Activate the PD Follower feature, allowing PD followers to process Region requests. This helps distribute load evenly across all PD servers and reduces CPU pressure on the PD leader.	This is an experimental feature. Test in non-production environments.
`tidb_opt_fix_control`	Enable advanced query optimization strategies to improve performance through additional optimization rules and heuristics.	Test thoroughly in your environment, as performance improvements vary by workload.

The following describes the optimizer control configurations that enable additional optimizations:

44262:ON: Use Dynamic pruning mode to access the partitioned table when the GlobalStats are missing.
44389:ON: For filters such as c = 10 and (a = 'xx' or (a = 'kk' and b = 1)), build more comprehensive scan ranges for IndexRangeScan.
44823:10000: To save memory, plan cache does not cache queries with parameters exceeding the specified number of this variable. Increase plan cache parameter limit from 200 to 10000 to make plan cache available for query with long in-lists.
44830:ON: Plan cache is allowed to cache execution plans with the PointGet operator generated during physical optimization.
44855:ON: The optimizer selects IndexJoin when the Probe side of an IndexJoin operator contains a Selection operator.
52869:ON: The optimizer chooses index merge automatically if the optimizer can choose the single index scan method (other than full table scan) for a query plan.

TiKV configurations

Add the following configuration items to the TiKV configuration file:

[server]
concurrent-send-snap-limit = 64
concurrent-recv-snap-limit = 64
snap-io-max-bytes-per-sec = "400MiB"

[rocksdb]
max-manifest-file-size = "256MiB"
[rocksdb.titan]
enabled = true
[rocksdb.defaultcf.titan]
min-blob-size = "1KB"
blob-file-compression = "zstd"

[storage]
scheduler-pending-write-threshold = "512MiB"
[storage.flow-control]
l0-files-threshold = 50
soft-pending-compaction-bytes-limit = "512GiB"

[rocksdb.writecf]
level0-slowdown-writes-trigger = 20
soft-pending-compaction-bytes-limit = "192GiB"
[rocksdb.defaultcf]
level0-slowdown-writes-trigger = 20
soft-pending-compaction-bytes-limit = "192GiB"
[rocksdb.lockcf]
level0-slowdown-writes-trigger = 20
soft-pending-compaction-bytes-limit = "192GiB"

Configuration item	Description	Note
`concurrent-send-snap-limit`, `concurrent-recv-snap-limit`, and `snap-io-max-bytes-per-sec`	Set limits for concurrent snapshot transfer and I/O bandwidth during TiKV scaling operations. Higher limits reduce scaling time by allowing faster data migration.	Adjusting these limits affects the trade-off between scaling speed and online transaction performance.
`rocksdb.max-manifest-file-size`	Set the maximum size of the RocksDB Manifest file, which logs the metadata about SST files and database state changes. Increasing this size reduces the frequency of Manifest file rewrites, thereby minimizing their impact on foreground write performance.	The default value is `128MiB`. In environments with a large number of SST files (for example, hundreds of thousands), frequent Manifest rewrites can degrade write performance. Adjusting this parameter to a higher value, such as `256MiB` or larger, can help maintain optimal performance.
`rocksdb.titan`, `rocksdb.defaultcf.titan`, `min-blob-size`, and `blob-file-compression`	Enable the Titan storage engine to reduce write amplification and alleviate disk I/O bottlenecks. Particularly useful when RocksDB compaction cannot keep up with write workloads, resulting in accumulated pending compaction bytes.	Enable it when write amplification is the primary bottleneck. Trade-offs include: Potential performance impact on primary key range scans. Increased space amplification (up to 2x in the worst case). Additional memory usage for blob cache.
`storage.scheduler-pending-write-threshold`	Set the maximum size of the write queue in the TiKV scheduler. When the total size of pending write tasks exceeds this threshold, TiKV returns a `Server Is Busy` error for new write requests.	The default value is `100MiB`. In scenarios with high write concurrency or temporary write spikes, increasing this threshold (for example, to `512MiB`) can help accommodate the load. However, if the write queue continues to accumulate and exceeds this threshold persistently, it might indicate underlying performance issues that require further investigation.
`storage.flow-control.l0-files-threshold`	Control when write flow control is triggered based on the number of kvDB L0 files. Increasing the threshold reduces write stalls during high write workloads.	Higher thresholds might lead to more aggressive compactions when many L0 files exist.
`storage.flow-control.soft-pending-compaction-bytes-limit`	Control the threshold for pending compaction bytes to manage write flow control. The soft limit triggers partial write rejections.	The default soft limit is `192GiB`. In write-intensive scenarios, if compaction processes cannot keep up, pending compaction bytes accumulate, potentially triggering flow control. Adjusting the limit can provide more buffer space, but persistent accumulation indicates underlying issues that require further investigation.
`rocksdb.(defaultcf\\|writecf\\|lockcf).level0-slowdown-writes-trigger` `rocksdb.(defaultcf\\|writecf\\|lockcf).soft-pending-compaction-bytes-limit`	You need to manually set `level0-slowdown-writes-trigger` and `soft-pending-compaction-bytes-limit` back to their default values. This way they will not be affected by flow control parameters. In addition, set the Rocksdb parameters to maintain the same compaction efficiency as the default parameters.	For more information, see Issue 18708.

Note that the compaction and flow control configuration adjustments outlined in the preceding table are tailored for TiKV deployments on instances with the following specifications:

CPU: 32 cores
Memory: 128 GiB
Storage: 5 TiB EBS
Disk Throughput: 1 GiB/s

Recommended configuration adjustments for write-intensive workloads

To optimize TiKV performance and stability under write-intensive workloads, it is recommended that you adjust certain compaction and flow control parameters based on the hardware specifications of the instance. For example:

rocksdb.rate-bytes-per-sec: usually use the default value. If you notice that compaction I/O is consuming a significant share of the disk bandwidth, consider capping the rate to about 60% of your disk’s maximum throughput. This helps balance compaction work and ensures the disk is not saturated. For example, on a disk rated at 1 GiB/s, set this to roughly 600MiB.
storage.flow-control.soft-pending-compaction-bytes-limit and storage.flow-control.hard-pending-compaction-bytes-limit: increase these limits proportionally to the available disk space (for example, 1 TiB and 2 TiB, respectively) to provide more buffer for compaction processes.

These settings help ensure efficient resource utilization and minimize potential bottlenecks during peak write loads.

Note

TiKV implements flow control at the scheduler layer to ensure system stability. When critical thresholds are breached, including those for pending compaction bytes or write queue sizes, TiKV begins rejecting write requests and returns a ServerIsBusy error. This error indicates that the background compaction processes cannot keep pace with the current rate of foreground write operations. Flow control activation typically results in latency spikes and reduced query throughput (QPS drops). To prevent these performance degradations, comprehensive capacity planning is essential, along with proper configuration of compaction parameters and storage settings.

TiFlash-learner configurations

Add the following configuration items to the TiFlash-learner configuration file:

[server]
snap-io-max-bytes-per-sec = "300MiB"

Configuration item	Description	Note
`snap-io-max-bytes-per-sec`	Control the maximum allowable disk bandwidth for data replication from TiKV to TiFlash. Higher limits accelerate initial data loading and catch-up replication.	Higher bandwidth consumption might impact online transaction performance. Balance between replication speed and system stability.

Benchmark

This section compares performance between default settings (baseline) and optimized settings based on the preceding key settings for common loads.

YCSB workloads on large record value

Test environment

The test environment is as follows:

3 TiDB servers (16 cores, 64 GiB)
3 TiKV servers (16 cores, 64 GiB)
TiDB version: v8.1.0
Workload: go-ycsb workloada

Performance comparison

The following table compares throughput (operations per second) between the baseline and optimized settings.

Item	Baseline (OPS)	Optimized (OPS)	Improvement
Load data	2858.5	5074.3	+77.59%
Workloada	2243.0	12804.3	+470.86%

Performance analysis

Titan is enabled by default starting from v7.6.0 and the default min-blob-size of Titan in TiDB v8.1.0 is 32KiB. The baseline configuration uses a record size of 31KiB to ensure data is stored in RocksDB. In contrast, for the key settings configuration, set min-blob-size to 1KiB, causing data to be stored in Titan.

The performance improvement observed in the key settings is primarily attributed to Titan's ability to reduce RocksDB compactions. As shown in the following figures:

Baseline: The total throughput of RocksDB compaction exceeds 1 GiB/s, with peaks over 3 GiB/s.
Key settings: The peak throughput of RocksDB compaction remains below 100 MiB/s.

This significant reduction in compaction overhead contributes to the overall throughput improvement seen in the key settings configuration.

Test workload

The following go-ycsb load command loads data:

go-ycsb load mysql -P /ycsb/workloads/workloada -p {host} -p mysql.port={port} -p threadcount=100 -p recordcount=5000000 -p operationcount=5000000 -p workload=core -p requestdistribution=uniform -pfieldcount=31 -p fieldlength=1024

The following go-ycsb run command runs workload:

go-ycsb run mysql -P /ycsb/workloads/workloada -p {host} -p mysql.port={port} -p mysql.db=test -p threadcount=100 -p recordcount=5000000 -p operationcount=5000000 -p workload=core -prequestdistribution=uniform -p fieldcount=31 -p fieldlength=1024

Edge cases and optimizations

This section shows you how to optimize TiDB for specific scenarios that need targeted adjustments beyond basic optimizations. You will learn how to tune TiDB for your particular use cases.

Identify edge cases

To identify edge cases, perform the following steps:

Analyze query patterns and workload characteristics.
Monitor system metrics to identify performance bottlenecks.
Gather feedback from application teams about specific issues.

Common edge cases

The following lists some common edge cases:

High TSO wait for high-frequency small queries
Choose the proper max chunk size for different workloads
Tune coprocessor cache for read-heavy workloads
Optimize chunk size for workload characteristics
Optimize transaction mode and DML type for different workloads
Optimize GROUP BY and DISTINCT operations with TiKV pushdown
Optimize statistics collection during batch operations
Optimize thread pool settings for different instance types

The following sections explain how to handle each of these cases. You need to adjust different parameters or use specific TiDB features for each scenario.

Note

Apply these optimizations carefully and test thoroughly, as their effectiveness might vary based on your use case and data patterns.

High TSO wait for high-frequency small queries

Troubleshooting

If your workload involves frequent small transactions or queries that frequently request timestamps, TSO (Timestamp Oracle) can become a performance bottleneck. To check if TSO wait time is impacting your system, check the Performance Overview > SQL Execute Time Overview panel. If TSO wait time constitutes a large portion of your SQL execution time, consider the following optimizations:

Use low-precision TSO (enable tidb_low_resolution_tso) for read operations that do not need strict consistency. For more information, see Solution 1: use low-precision TSO.
Enable tidb_enable_batch_dml to reduce TSO requests for batch operations.

Solution 1: low-precision TSO

You can reduce TSO wait time by enabling the low-precision TSO feature (tidb_low_resolution_tso). After this feature is enabled, TiDB uses the cached timestamp to read data, reducing TSO wait time at the expense of potentially stale reads.

This optimization is particularly effective in the following scenarios:

Read-heavy workloads where slight staleness is acceptable.
Scenarios where reducing query latency is more important than absolute consistency.
Applications that can tolerate reads that are a few seconds behind the latest committed state.

Benefits and trade-offs:

Reduce query latency by enabling stale reads with a cached TSO, eliminating the need to request new timestamps.
Balance performance against data consistency: this feature is only suitable for scenarios where stale reads are acceptable. It is not recommended to use it when strict data consistency is required.

To enable this optimization:

SET GLOBAL tidb_low_resolution_tso=ON;

Tune coprocessor cache for read-heavy workloads

You can improve query performance for read-heavy workloads by optimizing the coprocessor cache. This cache stores the results of coprocessor requests, reducing repeated computations of frequently accessed data. To optimize cache performance, perform the following steps:

Monitor the cache hit ratio using the metrics described in Coprocessor Cache.
Increase the cache size to improve hit rates for larger working sets.
Adjust the admission threshold based on query patterns.

The following lists some recommended settings for a read-heavy workload:

[tikv-client.copr-cache]
capacity-mb = 4096
admission-max-ranges = 5000
admission-max-result-mb = 10
admission-min-process-ms = 0

Optimize chunk size for workload characteristics

The tidb_max_chunk_size system variable sets the maximum number of rows in a chunk during the execution process. Adjusting this value based on your workload can improve performance.

For OLTP workloads with large concurrency and small transactions:
- Set the value between 128 and 256 rows (the default value is 1024).
- This reduces memory usage and makes limit queries faster.
- Use case: point queries, small range scans.
```
SET GLOBAL tidb_max_chunk_size = 128;
```
For OLAP or analytical workloads with complex queries and large result sets:
- Set the value between 1024 and 4096 rows.
- This increases throughput when scanning large amounts of data.
- Use case: aggregations, large table scans.
```
SET GLOBAL tidb_max_chunk_size = 4096;
```

Optimize transaction mode and DML type for different workloads

TiDB provides different transaction modes and DML execution types to optimize performance for various workload patterns.

Transaction modes

You can set the transaction mode using the tidb_txn_mode system variable.

Pessimistic transaction mode (default):
- Suitable for general workloads with potential write conflicts.
- Provides stronger consistency guarantees.
```
SET SESSION tidb_txn_mode = "pessimistic";
```
Optimistic transaction mode:
- Suitable for workloads with minimal write conflicts.
- Better performance for multi-statement transactions.
- Example: BEGIN; INSERT...; INSERT...; COMMIT;.
```
SET SESSION tidb_txn_mode = "optimistic";
```

DML types

You can control the execution mode of DML statements using the tidb_dml_type system variable, which is introduced in v8.0.0.

To use the bulk DML execution mode, set tidb_dml_type to "bulk". This mode optimizes bulk data loading without conflicts and reduces memory usage during large write operations. Before using this mode, ensure that:

autocommit is enabled.
The pessimistic-auto-commit configuration item is set to false.

SET SESSION tidb_dml_type = "bulk";

Optimize `GROUP BY` and `DISTINCT` operations with TiKV pushdown

TiDB pushes down aggregation operations to TiKV to reduce data transfer and processing overhead. The performance improvement varies based on your data characteristics.

Usage scenarios

Ideal scenarios (high performance gain):
- Columns containing few distinct values (low NDV).
- Data containing frequent duplicate values.
- Example: status columns, category codes, date parts.
Non-ideal scenarios (potential performance loss):
- Columns containing mostly unique values (high NDV).
- Unique identifiers or timestamps.
- Example: User IDs, transaction IDs.

Configuration

Enable pushdown optimizations at the session or global level:

-- Enable regular aggregation pushdown
SET GLOBAL tidb_opt_agg_push_down = ON;

-- Enable distinct aggregation pushdown
SET GLOBAL tidb_opt_distinct_agg_push_down = ON;

Optimize statistics collection during batch operations

You can optimize performance during batch operations while maintaining query optimization by managing statistics collection. This section describes how to manage this process effectively.

When to disable auto analyze

You can disable auto analyze by setting the tidb_enable_auto_analyze system variable to OFF in the following scenarios:

During large data imports.
During bulk update operations.
For time-sensitive batch processing.
When you need full control over the timing of statistics collection.

Best practices

Before the batch operation:

-- Disable auto analyze
SET GLOBAL tidb_enable_auto_analyze = OFF;

After the batch operation:

-- Manually collect statistics
ANALYZE TABLE your_table;

-- Re-enable auto analyze
SET GLOBAL tidb_enable_auto_analyze = ON;

Optimize thread pool settings for different instance types

To improve TiKV performance, configure the thread pools based on your instance's CPU resources. The following guidelines help you optimize these settings:

For instances with 8 to 16 cores, the default settings are typically sufficient.

For instances with 32 or more cores, increase the pool sizes for better resource utilization. Adjust the settings as follows:

[server]
# Increase gRPC thread pool 
grpc-concurrency = 10

[raftstore]
# Optimize for write-intensive workloads
apply-pool-size = 4
store-pool-size = 4
store-io-pool-size = 2

Configure TiDB for Optimal Performance

Overview

Key settings for common workloads

System variables

TiKV configurations

Recommended configuration adjustments for write-intensive workloads

TiFlash-learner configurations

Benchmark

YCSB workloads on large record value

Test environment

Performance comparison

Performance analysis

Test workload

Edge cases and optimizations

Identify edge cases

Common edge cases

High TSO wait for high-frequency small queries

Troubleshooting

Solution 1: low-precision TSO

Tune coprocessor cache for read-heavy workloads

Optimize chunk size for workload characteristics

Optimize transaction mode and DML type for different workloads

Transaction modes

DML types

Optimize GROUP BY and DISTINCT operations with TiKV pushdown

Usage scenarios

Configuration

Optimize statistics collection during batch operations

When to disable auto analyze

Best practices

Optimize thread pool settings for different instance types

Was this page helpful?

Optimize `GROUP BY` and `DISTINCT` operations with TiKV pushdown