TiDB 8.5.5 Release Notes




Release date: January 15, 2026

TiDB version: 8.5.5

Quick access: Quick start | Production deployment

Features

Performance

  • Introduce significant performance improvements for certain lossy DDL operations (such as BIGINT → INT and CHAR(120) → VARCHAR(60)): when no data truncation occurs, the execution time of these operations can be reduced from hours to minutes, seconds, or even milliseconds, delivering performance gains ranging from tens to hundreds of thousands of times #63366 @wjhuang2016, @tangenta, @fzzf678

    The optimization strategies are as follows:

    • In strict SQL mode, TiDB pre-checks for potential data truncation risks during type conversion.
    • If no data truncation risk is detected, TiDB updates only the metadata and avoids index rebuilding whenever possible.
    • If index rebuilding is required, TiDB uses a more efficient ingest process to significantly improve index rebuild performance.

    The following table shows example performance improvements based on benchmark tests on a table with 114 GiB of data and 600 million rows. The test cluster consists of 3 TiDB nodes, 6 TiKV nodes, and 1 PD node. All nodes are configured with 16 CPU cores and 32 GiB of memory.

    | Scenario | Operation type | Before optimization | After optimization | Performance improvement |
    | --- | --- | --- | --- | --- |
    | Non-indexed column | BIGINT → INT | 2 hours 34 minutes | 1 minute 5 seconds | 142× faster |
    | Indexed column | BIGINT → INT | 6 hours 25 minutes | 0.05 seconds | 460,000× faster |
    | Indexed column | CHAR(120) → VARCHAR(60) | 7 hours 16 minutes | 12 minutes 56 seconds | 34× faster |

    Note that the preceding test results are based on the condition that no data truncation occurs during the DDL execution. The optimizations do not apply to conversions between signed and unsigned integer types, conversions between character sets, or tables with TiFlash replicas.

    For more information, see documentation.
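
    The following is a minimal sketch of such a lossy type change; the table and column names are hypothetical, and it assumes strict SQL mode (the default) and that no existing value exceeds the target type's range:

    ```sql
    -- Hypothetical table used only for illustration.
    CREATE TABLE t (
        id BIGINT PRIMARY KEY,
        c  BIGINT,
        KEY idx_c (c)
    );

    -- In strict SQL mode, TiDB pre-checks for truncation risks. If no value in
    -- column c exceeds the INT range, this lossy conversion can complete as a
    -- fast metadata-only change instead of a full data rewrite.
    ALTER TABLE t MODIFY COLUMN c INT;
    ```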

  • Improve DDL performance in scenarios with a large number of foreign keys, with up to a 25x increase in logical DDL performance #61126 @GMHDBJD

    Before v8.5.5, in scenarios with an ultra-large number of tables (for example, a cluster with 10 million tables in total, including hundreds of thousands of tables with foreign keys), the performance of logical DDL operations (such as creating tables or adding columns) could drop to approximately 4 QPS, leading to low operational efficiency in multi-tenant SaaS environments.

    TiDB v8.5.5 optimizes these scenarios. Test results show that in an extreme environment with 10 million tables (including 200,000 tables with foreign keys), the logical DDL processing performance consistently maintains 100 QPS. Compared to previous versions, the performance is improved by 25 times, significantly enhancing the operational responsiveness of ultra-large-scale clusters.

  • Support pushing index lookups down to TiKV to improve query performance #62575 @lcwangchao

    Starting from v8.5.5, TiDB supports using optimizer hints to push the IndexLookUp operator down to TiKV nodes. This reduces the number of remote procedure calls (RPCs) and can improve query performance. The actual performance improvement varies depending on the specific workload and requires testing for verification.

    To explicitly instruct the optimizer to push index lookups down to TiKV for a specific table, you can use the INDEX_LOOKUP_PUSHDOWN(t1_name, idx1_name [, idx2_name ...]) hint. It is recommended to combine this hint with the table's AFFINITY attribute. For example, set AFFINITY="table" for regular tables and AFFINITY="partition" for partitioned tables.

    To disable index lookup pushdown to TiKV for a specific table, use the NO_INDEX_LOOKUP_PUSHDOWN(t1_name) hint.

    For more information, see documentation.
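
    A minimal sketch of using these hints follows; t1 and idx_a are hypothetical table and index names:

    ```sql
    -- Explicitly push the index lookup on t1 via index idx_a down to TiKV.
    SELECT /*+ INDEX_LOOKUP_PUSHDOWN(t1, idx_a) */ *
    FROM t1
    WHERE a BETWEEN 100 AND 200;

    -- Explicitly disable index lookup pushdown for t1.
    SELECT /*+ NO_INDEX_LOOKUP_PUSHDOWN(t1) */ *
    FROM t1
    WHERE a BETWEEN 100 AND 200;
    ```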

  • Support table-level data affinity to improve query performance (experimental) #9764 @lhy1024

    Starting from v8.5.5, you can configure the AFFINITY table option as table or partition when creating or altering a table. When this option is enabled, PD groups Regions that belong to the same table or the same partition into a single affinity group. During scheduling, PD prioritizes placing the Leaders and Voter replicas of these Regions on the same subset of a few TiKV nodes. In this scenario, by using the INDEX_LOOKUP_PUSHDOWN hint in queries, you can explicitly instruct the optimizer to push index lookups down to TiKV, reducing the latency caused by cross-node scattered queries and improving query performance.

    Note that this feature is currently experimental and is disabled by default. To enable it, set the PD configuration item schedule.affinity-schedule-limit to a value greater than 0. This configuration item controls the maximum number of affinity scheduling tasks that PD can perform concurrently.

    For more information, see documentation.
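
    The following sketch assumes the AFFINITY option is specified as a table option in CREATE TABLE, as described above; the table definition is hypothetical, and the exact syntax is covered in the linked documentation:

    ```sql
    -- Group the Regions of this table into one affinity group.
    CREATE TABLE t1 (
        id BIGINT PRIMARY KEY,
        a  INT,
        KEY idx_a (a)
    ) AFFINITY="table";

    -- View the affinity information of tables (a TiDB extension introduced
    -- in this release; see the SHOW AFFINITY statement).
    SHOW AFFINITY;
    ```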

  • Point-in-time recovery (PITR) supports recovery from compacted log backups for faster restores #56522 @YuJuncen

    Starting from v8.5.5, the log backup compaction feature provides offline compaction capabilities, converting unstructured log backup data into structured SST files. This results in the following improvements:

    • Improved recovery performance: SST files can be more quickly imported into the cluster.
    • Reduced storage space consumption: redundant data is removed during compaction.
    • Reduced impact on applications: the Recovery Point Objective (RPO) can be maintained with less frequent full snapshot-based backups.

    For more information, see documentation.

  • Accelerate recovery of system tables from backups #58757 @Leavrth

    Starting from v8.5.5, BR introduces a new --fast-load-sys-tables parameter that uses physical restore instead of logical restore when restoring system tables from a backup. With this parameter enabled, BR fully replaces or overwrites the existing system tables rather than restoring data into them, significantly improving restore performance in large-scale deployments.

    For more information, see documentation.

Reliability

  • Improve scheduling stability in TiKV during network jitter #9359 @okJiang

    Starting from v8.5.5, TiKV introduces a network slow-node detection and feedback mechanism. When this mechanism is enabled, TiKV probes network latency between nodes, calculates a network slow score, and reports the score to PD. Based on this score, PD evaluates the network status of TiKV nodes and adjusts scheduling accordingly: when a TiKV node is detected to be experiencing network jitter, PD restricts the scheduling of new Leaders to that node; if the network jitter persists, PD proactively evicts existing Leaders from the affected node to other TiKV nodes, thereby reducing the impact of network issues on the cluster.

    For more information, see documentation.

Availability

  • Introduce the client circuit breaker pattern for PD #8678 @Tema

    To protect the PD leader from overloading during retry storms or similar feedback loops, TiDB now implements a circuit breaker pattern. When the error rate reaches a predefined threshold, the circuit breaker limits incoming traffic to allow the system to recover and stabilize. You can use the tidb_cb_pd_metadata_error_rate_threshold_ratio system variable to control the circuit breaker.

    For more information, see documentation.
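
    As a sketch, assuming the variable is settable at the GLOBAL scope, enabling the circuit breaker looks like this (0.1 is an arbitrary illustrative threshold):

    ```sql
    -- Trigger the circuit breaker when the error rate of specific PD requests
    -- reaches 10%.
    SET GLOBAL tidb_cb_pd_metadata_error_rate_threshold_ratio = 0.1;

    -- Restore the default (0), which disables the circuit breaker.
    SET GLOBAL tidb_cb_pd_metadata_error_rate_threshold_ratio = 0;
    ```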

SQL

  • Support dynamically modifying the concurrency and throughput of distributed ADD INDEX jobs #64947 @joechenrh

    In TiDB versions earlier than v8.5.5, when the Distributed eXecution Framework (DXF) is enabled (tidb_enable_dist_task = ON), modifying the THREAD, BATCH_SIZE, or MAX_WRITE_SPEED parameters of a running ADD INDEX job is not supported. To change these parameters, you have to cancel the running ADD INDEX job, reconfigure the parameters, and then resubmit the job, which is inefficient.

    Starting from v8.5.5, you can use the ADMIN ALTER DDL JOBS statement to dynamically adjust these parameters of a running distributed ADD INDEX job based on the current workload and performance requirements, without interrupting the job.

    For more information, see documentation.
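
    For example (the job ID 123 and the parameter values below are hypothetical; obtain the actual job ID from ADMIN SHOW DDL JOBS):

    ```sql
    -- Find the ID of the running distributed ADD INDEX job.
    ADMIN SHOW DDL JOBS;

    -- Adjust its concurrency and throughput without interrupting the job.
    ADMIN ALTER DDL JOBS 123 THREAD = 8, BATCH_SIZE = 256, MAX_WRITE_SPEED = '200MiB';
    ```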

DB operations

  • Support gracefully shutting down TiKV #17221 @hujiatao0

    When shutting down a TiKV server, TiKV attempts to transfer the Leader replicas on the node to other TiKV nodes within a configurable timeout duration before the shutdown. The default timeout duration is 20 seconds, and you can adjust it using the server.graceful-shutdown-timeout configuration item. If the timeout is reached and some Leaders have not been successfully transferred, TiKV skips the remaining Leader transfers and proceeds with the shutdown.

    For more information, see documentation.

  • Improve the compatibility between ongoing log backup and snapshot restore #58685 @BornChanger

    Starting from v8.5.5, when a log backup task is running, you can still perform a snapshot restore as long as the prerequisite conditions are met. This means you no longer need to stop ongoing log backup tasks during the restore process, and the restored data is properly recorded by the ongoing log backup.

    For more information, see documentation.

  • Support table-level restores from log backups #57613 @Tristan1900

    Starting from v8.5.5, you can perform point-in-time recovery (PITR) for individual tables from log backups by using filters. Restoring specific tables, rather than the entire cluster, to a target point in time provides more flexible and less disruptive recovery options.

    For more information, see documentation.

Observability

  • Add storage engine identifiers to statement summary tables and slow query logs #61736 @henrybw

    When both TiKV and TiFlash are deployed in a cluster, users often need to filter SQL statements by storage engine during database diagnostics and performance optimization. For example, if TiFlash is under high load, users might need to identify SQL statements running on TiFlash to locate potential causes. To meet this need, starting from v8.5.5, TiDB adds storage engine identifier fields to statement summary tables and slow query logs.

    New fields in statement summary tables:

    • STORAGE_KV: 1 indicates that the SQL statement accesses TiKV.

    • STORAGE_MPP: 1 indicates that the SQL statement accesses TiFlash.

    New fields in slow query logs:

    • Storage_from_kv: true indicates that the SQL statement accesses TiKV.

    • Storage_from_mpp: true indicates that the SQL statement accesses TiFlash.

    This feature simplifies workflows in certain diagnostics and performance optimization scenarios and improves issue identification efficiency.

    For more information, see Statement Summary Tables and Identify Slow Queries.
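
    For example, to find statements that ran on TiFlash, you can query the statement summary table as in this sketch (the DIGEST_TEXT column is an existing field, included for readability):

    ```sql
    SELECT DIGEST_TEXT, STORAGE_KV, STORAGE_MPP
    FROM INFORMATION_SCHEMA.STATEMENTS_SUMMARY
    WHERE STORAGE_MPP = 1;
    ```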

Security

  • Support Azure Managed Identity (MI) authentication for Backup & Restore (BR) to Azure Blob Storage #19006 @RidRisR

    Starting from v8.5.5, BR supports Azure Managed Identity (MI) for authenticating to Azure Blob Storage, eliminating the need for static SAS tokens. This enables secure, keyless, and ephemeral authentication that follows Azure security best practices.

    With this feature, BR and the embedded BR worker in TiKV can acquire access tokens directly from Azure Instance Metadata Service (IMDS), reducing credential leakage risk and simplifying credential rotation management for both self-managed and cloud deployments on Azure.

    This feature applies to TiDB clusters running on Azure Kubernetes Service (AKS) or other Azure environments, particularly in enterprise environments that require strict security controls for backup and restore operations.

    For more information, see documentation.

Compatibility changes

For TiDB clusters newly deployed in v8.5.4 (that is, not upgraded from versions earlier than v8.5.3), you can smoothly upgrade to v8.5.5. Most changes in v8.5.5 are safe for routine upgrades, but this release also includes several behavior changes, MySQL compatibility adjustments, system variable updates, configuration parameter updates, and system table changes. Before upgrading, make sure to carefully review this section.

Behavior changes

  • Starting from v8.5.5, TiDB automatically sets target tables to restore mode during data restore. Tables in restore mode prohibit any user read or write operations. Once the restore completes, TiDB automatically switches the mode back to normal for these tables, allowing users to read and write the tables normally. This behavior ensures task stability and data consistency during the restore process.
  • Starting from v8.5.5, when the --load-stats parameter is set to false, BR no longer writes statistics for restored tables into the mysql.stats_meta table. To update the relevant statistics, you can manually execute ANALYZE TABLE after the restore.
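
    For the --load-stats change above, you can refresh statistics manually after the restore, for example (db1.t1 is a placeholder table name):

    ```sql
    ANALYZE TABLE db1.t1;
    ```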

MySQL compatibility

  • Starting from v8.5.5, TiDB introduces a new AFFINITY property to tables to control table or partition data affinity. You can configure this property using the CREATE TABLE or ALTER TABLE statement. For more information, see documentation.
  • Starting from v8.5.5, TiDB introduces a new SHOW AFFINITY statement to view the affinity information of tables. This statement is a TiDB extension of MySQL syntax. For more information, see documentation.

System variables

| Variable name | Change type | Description |
| --- | --- | --- |
| tidb_analyze_column_options | Modified | Changes the default value from PREDICATE to ALL to improve the completeness of statistics in OLAP and HTAP scenarios. |
| tidb_advancer_check_point_lag_limit | Newly added | Controls the maximum allowed checkpoint lag for a log backup task. The default value is 48h0m0s. If a task's checkpoint lag exceeds this limit, TiDB Advancer pauses the task. |
| tidb_cb_pd_metadata_error_rate_threshold_ratio | Newly added | Controls when TiDB triggers the circuit breaker. The default value is 0, which means the circuit breaker is disabled. Setting a value between 0.01 and 1 enables it, causing the circuit breaker to trigger when the error rate of specific requests sent to PD reaches or exceeds the threshold. |
| tidb_index_lookup_pushdown_policy | Newly added | Controls whether and when TiDB pushes the IndexLookUp operator down to TiKV. The default value is hint-only, which means TiDB pushes the IndexLookUp operator down to TiKV only when the INDEX_LOOKUP_PUSHDOWN hint is explicitly specified in the SQL statement. |
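
A quick way to check the new defaults and settings on an upgraded cluster (variable names taken from the table above):

```sql
SHOW VARIABLES LIKE 'tidb_analyze_column_options';
SHOW VARIABLES LIKE 'tidb_index_lookup_pushdown_policy';
SHOW VARIABLES LIKE 'tidb_advancer_check_point_lag_limit';
SHOW VARIABLES LIKE 'tidb_cb_pd_metadata_error_rate_threshold_ratio';
```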

Configuration parameters

| Configuration file or component | Configuration parameter | Change type | Description |
| --- | --- | --- | --- |
| TiDB | performance.enable-async-batch-get | Newly added | Controls whether TiDB uses asynchronous mode to execute the Batch Get operator. The default value is false. |
| TiKV | rocksdb.(defaultcf\|writecf\|lockcf\|raftcf).level0-slowdown-writes-trigger | Modified | Starting from v8.5.5, when the flow control mechanism is enabled (storage.flow-control.enable is set to true), this configuration item is overridden by storage.flow-control.l0-files-threshold only if its value is greater than storage.flow-control.l0-files-threshold. This behavior prevents weakening RocksDB's compaction acceleration mechanism when you increase the flow control threshold. In v8.5.4 and earlier versions, when the flow control mechanism is enabled, this configuration item is directly overridden by storage.flow-control.l0-files-threshold. |
| TiKV | rocksdb.(defaultcf\|writecf\|lockcf\|raftcf).soft-pending-compaction-bytes-limit | Modified | Starting from v8.5.5, when the flow control mechanism is enabled (storage.flow-control.enable is set to true), this configuration item is overridden by storage.flow-control.soft-pending-compaction-bytes-limit only if its value is greater than storage.flow-control.soft-pending-compaction-bytes-limit. This behavior prevents weakening RocksDB's compaction acceleration mechanism when you increase the flow control threshold. In v8.5.4 and earlier versions, when the flow control mechanism is enabled, this configuration item is directly overridden by storage.flow-control.soft-pending-compaction-bytes-limit. |
| TiKV | readpool.cpu-threshold | Newly added | Specifies the CPU utilization threshold for the unified read pool. The default value is 0.0, which means that there is no limit on the CPU usage of the unified read pool. The size of the thread pool is determined solely by the busy thread scaling algorithm, which adjusts the size dynamically based on the number of threads handling current tasks. |
| TiKV | server.graceful-shutdown-timeout | Newly added | Controls the timeout duration for graceful shutdown of TiKV. The default value is 20s. |
| TiKV | server.inspect-network-interval | Newly added | Controls the interval at which the TiKV HealthChecker actively performs network detection to PD and other TiKV nodes. The default value is 100ms. |
| PD | schedule.max-affinity-merge-region-size | Newly added | Controls the threshold for automatically merging adjacent small Regions within the same affinity group. The default value is 256, in MiB. |
| PD | schedule.affinity-schedule-limit | Newly added | Controls the number of affinity scheduling tasks that can be performed concurrently. The default value is 0, which means that affinity scheduling is disabled by default. |
| BR | --checkpoint-storage | Newly added | Specifies an external storage for checkpoint data. |
| BR | --fast-load-sys-tables | Newly added | Supports physical restore of system tables on a new cluster. This parameter is enabled by default. |
| BR | --filter | Newly added | Specifies patterns to include or exclude specific databases or tables for restore. |

System tables

Other changes

  • When you use BR v8.5.5 to perform PITR on earlier TiDB versions (such as v8.5.4 or v8.1.2), the log recovery stage might fail and return errors.

    Full data backup and restore are not affected by this issue.

    It is recommended that you use a BR version that matches your target TiDB cluster version. For example, when performing PITR on a TiDB v8.5.4 cluster, use BR v8.5.4.

Improvements

  • TiDB

    • Improve error messages for IMPORT INTO when encoding errors occur to help users identify issues more accurately #63763 @D3Hunter
    • Enhance the parsing mechanism for Parquet files to improve the import performance of Parquet-formatted data #62906 @joechenrh
    • Change the default value of tidb_analyze_column_options to ALL to collect statistics for all columns by default #64992 @0xPoe
    • Optimize the execution logic of the IndexHashJoin operator by using incremental processing in specific JOIN scenarios to avoid loading large amounts of data at once, significantly reducing memory usage and improving performance #63303 @ChangRui-Ryan
    • Optimize the CPU usage of internal SQL statements in the Distributed eXecution Framework (DXF) #59344 @D3Hunter
    • Improve the performance of the expression.Contains function #61373 @hawkingrei
  • TiKV

    • Introduce CPU-aware scaling for the unified read pool to avoid CPU starvation under hot read workloads #18464 @mittalrishabh
    • Add network latency awareness to slow score to avoid scheduling leaders to TiKV nodes with unstable network conditions #18797 @okJiang
    • Optimize hibernate Region behavior by allowing leaders to enter the hibernation state immediately after receiving a majority of votes, without waiting for offline non-voter peers #19070 @jiadebin
    • Throttle BR log restore requests when TiKV memory usage is high to prevent TiKV OOM #18124 @3pointer
  • PD

    • Optimize metrics with high cardinality to reduce PD memory usage and relieve pressure on the monitoring system #9357 @rleungx
    • Optimize the logic for timestamp advancement and leader election #9981 @bufferflies
    • Support batch configuration of TiKV store limits by storage engine (TiKV or TiFlash) #9970 @bufferflies
    • Add the store label to the pd_cluster_status metric #9855 @SerjKol80
  • Tools

    • TiCDC

      • Enhance the configuration validation logic for changefeeds: when creating or updating a changefeed, if a column referenced in the dispatcher configuration does not exist, TiCDC returns an error and rejects the operation to prevent execution failures #12253 @wk989898

Bug fixes

  • TiDB

    • Fix the issue that TiDB fails to read the latest value of the tidb_mem_quota_binding_cache variable to perform initialization binding during startup #65381 @qw4990
    • Fix the issue that candidate items are incorrectly skipped in extractBestCNFItemRanges, leading to inaccurate query range calculation #62547 @hawkingrei
    • Fix the issue that plan replayer cannot load the binding #64811 @hawkingrei
    • Fix the issue that PointGet fails to reuse chunks even when memory is sufficient, leading to unnecessary memory allocations #63920 @hawkingrei
    • Fix the issue that LogicalProjection.DeriveStats allocates too much memory #63810 @hawkingrei
    • Fix the issue that plan replayer fails to dump when a query panics #64835 @hawkingrei
    • Fix the issue that the attribute order in the SHOW CREATE TABLE output for TTL tables is displayed incorrectly in specific scenarios #64876 @YangKeao
    • Fix the issue that the execution summary information for a TTL job is empty when the job times out #61509 @YangKeao
    • Fix the issue that correlated subqueries might trigger unexpected full table scans when the Plan Cache is enabled #64645 @winoros
    • Fix the issue that system tables cause incorrect table health monitoring results #57176, #64080 @0xPoe
    • Fix the issue that the mysql.tidb_ddl_notifier table cannot be cleaned up after disabling automatic statistics updates (tidb_enable_auto_analyze = OFF) #64038 @0xPoe
    • Fix the issue that columns are repeatedly allocated in newLocalColumnPool #63809 @hawkingrei
    • Fix the issue that invalid warning logs regarding syncload failure are generated #63880 @0xPoe
    • Fix the issue that TiDB might panic and exit abnormally when manually terminating a connection that is executing a transaction #63956 @wshwsh12
    • Fix the issue that goroutine and memory leaks might occur when a cached table reads from a TiFlash replica #63329 @xzhangxian1008
    • Fix the issue that the foreign key is not updated after executing ALTER TABLE child CHANGE COLUMN to modify a column #59705 @fzzf678
    • Fix the issue that the RENAME TABLE job arguments could be decoded incorrectly from an earlier TiDB version #64413 @joechenrh
    • Fix the issue that the auto-increment ID fails to be rebased when BR restore fails #60804 @joechenrh
    • Fix the issue that the TiDB node might get stuck during upgrade #64539 @joechenrh
    • Fix the issue that admin check does not report errors when index records are missing #63698 @wjhuang2016
    • Fix the issue that modifying the collation via MODIFY COLUMN results in data index inconsistency #61668 @tangenta
    • Fix the issue that the embedded ANALYZE feature in DDL might not be triggered when performing multiple schema changes #65040 @joechenrh
    • Fix the issue that the Distributed eXecution Framework (DXF) task is not cancelled after canceling the ADD INDEX job #64129 @tangenta
    • Fix the issue that the validation logic is incorrect when determining whether to load the table information for tables that contain foreign keys #60044 @JQWong7
    • Fix the issue that initialization for foreign key related fields is incorrect when copying table information #60044 @JQWong7
    • Fix the issue that the auto ID is set incorrectly after renaming a table across different databases #64561 @joechenrh
    • Fix the issue that incorrect handling of meta keys leads to high CPU usage #64323 @wjhuang2016
    • Fix the issue that TiDB Lightning fails to report an error when a schema file lacks a trailing semicolon #63414 @GMHDBJD
    • Fix the issue that executing IMPORT INTO with Global Sort enabled leads to an infinite loop while reading files #61177 @CbcWestwolf
    • Fix the issue that a panic occurs when handling generated columns during IMPORT INTO #64657 @D3Hunter
    • Fix the issue that an error might be incorrectly reported when a single SQL statement contains multiple AS OF TIMESTAMP expressions #65090 @you06
    • Fix the potential OOM issue when querying information_schema.tables by improving memory usage monitoring when querying system tables #58985 @tangenta
  • TiKV

    • Fix the issue that the KV Cursor Operations metric for analyze requests is always 0 #19206 @glorv
    • Fix the issue that Region heartbeats might report incorrect Region size or key statistics to PD after a leader change #19180 @glorv
    • Fix the issue that unsafe recovery gets stuck by removing tombstone TiFlash learners from the unsafe recovery demotion list #18458 @v01dstar
    • Fix the issue that snapshots might be repeatedly canceled during continuous writes, which blocks replica recovery #18872 @exit-code-1
    • Fix the issue that compaction slows down due to increased flow-control thresholds #18708 @hhwyt
    • Fix the issue that Raft peers might enter hibernation prematurely in a corner case, causing them to remain busy and block leader transfers after a TiKV restart #19203 @LykxSassinator
  • PD

    • Fix the issue that a node might not be removable during the process of bringing it online #8997 @lhy1024
    • Fix the issue that a large number of Leader transfers might cause sudden changes in Region size #10014 @lhy1024
    • Fix the issue that might cause PD panic during scheduling #9951 @bufferflies
    • Fix the issue that data might become imbalanced during the import process #9088 @GMHDBJD
    • Fix the issue that, after enabling the Active PD Follower feature, requests that fail on a Follower node cannot correctly fall back to the Leader node for retry #64933 @okJiang
    • Fix the issue that some requests are not correctly forwarded in PD microservices mode #9825 @lhy1024
    • Fix the issue that connections might fail due to incorrect TLS configuration loading in the tso and scheduling microservices #9367 @rleungx
  • TiFlash

    • Fix the issue that TiFlash might panic when BR is restoring data #10606 @CalvinNeo
    • Fix the issue that TiFlash cannot fully utilize more than 16 CPU cores when BR is restoring data #10605 @JaySon-Huang
    • Fix the issue that TiFlash might exit unexpectedly when GROUP_CONCAT triggers a disk spill #10553 @ChangRui-Ryan
  • Tools

    • Backup & Restore (BR)

      • Fix the issue that enabling log backup causes excessive memory usage when the cluster contains many Regions #18719 @YuJuncen
      • Fix the issue that the Azure SDK cannot find a suitable key from the environment #18206 @YuJuncen
      • Fix the issue that foreign keys cannot be properly restored during restore point #61642 @Leavrth
      • Fix the issue that restore fails if system table collations are incompatible between the backup and target cluster by adding the --sys-check-collation parameter to support restoring privilege tables from v6.5 to v7.5 #64667 @Leavrth
      • Fix the issue that restore log cannot be performed after a failed restore point, even when the operation is safe #64908 @RidRisR
      • Fix the issue that restore point from a checkpoint might panic when log backup data is mixed with a full backup #58685 @YuJuncen
    • TiCDC

      • Fix the issue that data might be lost during replication to object storage because Writer close errors are not correctly captured #12436 @wk989898
      • Fix the issue that replicating a TRUNCATE operation on a partitioned table might cause changefeed failures #12430 @wk989898
      • Fix the issue that downstream execution order might be incorrect when replicating multi-table RENAME DDL statements #12449 @wlwilliamx
      • Fix the connection errors that might occur when using Glue Schema Registry by upgrading the aws-sdk-go-v2 dependency version #12424 @wk989898
      • Fix the issue that changefeed tasks might get stuck because the TiKV CDC component fails to release memory quotas correctly after a restart #18169 @asddongmen
      • Fix the issue that gRPC connections might be unexpectedly closed due to being misjudged as idle when incremental scan tasks accumulate in TiKV CDC #18915 @asddongmen
