DM 5.3.0 Benchmark Report

This benchmark report describes the test purpose, environment, scenario, and results for DM 5.3.0.

Test purpose

The purpose of this test is to evaluate the performance of DM full import and incremental replication, and to derive recommended configurations for DM migration tasks from the test results.

Test environment

Machine information

System information:

Machine IP	Operating System	Kernel version	File system type
172.16.6.1	CentOS Linux release 7.8.2003	3.10.0-957.el7.x86_64	ext4
172.16.6.2	CentOS Linux release 7.8.2003	3.10.0-957.el7.x86_64	ext4
172.16.6.3	CentOS Linux release 7.8.2003	3.10.0-957.el7.x86_64	ext4

Hardware information:

Type	Specification
CPU	Intel(R) Xeon(R) Silver 4214R @ 2.40GHz, 48 Cores
Memory	192 GB, 12 * 16 GB DIMM DDR4 2133 MHz
Disk	Intel SSDPE2KX040T8 4TB
Network card	10 Gigabit Ethernet

Others:

Network RTT between servers: rtt min/avg/max/mdev = 0.045/0.064/0.144/0.024 ms

Cluster topology

Machine IP	Deployed instance
172.16.6.1	PD1, TiDB1, TiKV1, MySQL1, DM-master1
172.16.6.2	PD2, TiDB2, TiKV2, DM-worker1
172.16.6.3	PD3, TiDB3, TiKV3

Version information

MySQL version: 5.7.36-log
TiDB version: v5.2.1
DM version: v5.3.0
Sysbench version: 1.1.0

Test scenario

This test uses a simple data migration flow: MySQL1 (172.16.6.1) -> DM-worker (172.16.6.2) -> TiDB (load balance) (172.16.6.4). For a detailed description of the test scenario, see performance test.

Full import benchmark case

For details of the full import test method, see Full Import Benchmark Case.

Full import benchmark results

To enable multi-thread concurrent data export via Dumpling, you can configure the threads parameter in the mydumpers configuration item. This speeds up data export.

Item	Data size (GB)	Threads	Rows	Statement-size	Time (s)	Dump speed (MB/s)
dump data	38.1	32	320000	1000000	45	846

Item	Data size (GB)	Pool size	Statement per TXN	Max latency of TXN execution (s)	Time (s)	Import speed (MB/s)
load data	38.1	32	4878	76	2740	13.9
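
The speed columns above follow from the data size and the elapsed time. The following is a minimal sketch of that arithmetic. It assumes decimal units (1 GB = 1000 MB), which is an assumption that reproduces the reported figures to within rounding; the function name is illustrative and not part of DM.

```python
def throughput_mb_per_s(data_size_gb: float, seconds: float) -> float:
    """Average throughput in MB/s, assuming 1 GB = 1000 MB (an assumption)."""
    if seconds <= 0:
        raise ValueError("seconds must be positive")
    return data_size_gb * 1000 / seconds


# Figures taken from the two tables above.
dump_speed = throughput_mb_per_s(38.1, 45)    # dump unit: 38.1 GB in 45 s
load_speed = throughput_mb_per_s(38.1, 2740)  # load unit: 38.1 GB in 2740 s

print(f"dump: {dump_speed:.1f} MB/s, load: {load_speed:.1f} MB/s")
```

With these inputs, the dump result lands within 1 MB/s of the reported 846 MB/s, and the load result rounds to the reported 13.9 MB/s.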

Benchmark results with different pool sizes in load unit

In this test, the total amount of data imported using sysbench is 3.78 GB. The detailed results are as follows:

Load unit pool size	Max latency of TXN execution (s)	Import time (s)	Import speed (MB/s)	TiDB 99 duration (s)
2	0.71	397	9.5	0.61
4	1.21	363	10.4	1.03
8	3.30	279	13.5	2.11
16	5.56	200	18.9	3.04
32	6.92	218	17.3	6.56
64	8.59	231	16.3	8.62
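
One way to read this table is to find the pool size with the highest import speed and then list the settings that come close to it. The sketch below does this with the values from the table. The 10% tolerance is an arbitrary assumption chosen for illustration, not a threshold used in the test.

```python
from typing import NamedTuple


class LoadResult(NamedTuple):
    pool_size: int
    max_txn_latency_s: float
    import_time_s: int
    import_speed_mb_s: float
    tidb_p99_s: float


# Rows copied from the table above.
RESULTS = [
    LoadResult(2, 0.71, 397, 9.5, 0.61),
    LoadResult(4, 1.21, 363, 10.4, 1.03),
    LoadResult(8, 3.30, 279, 13.5, 2.11),
    LoadResult(16, 5.56, 200, 18.9, 3.04),
    LoadResult(32, 6.92, 218, 17.3, 6.56),
    LoadResult(64, 8.59, 231, 16.3, 8.62),
]

# Pool size with the highest import speed.
best = max(RESULTS, key=lambda r: r.import_speed_mb_s)

# Pool sizes whose speed is within 10% of the best (tolerance is an assumption).
TOLERANCE = 0.10
near_best = [
    r.pool_size
    for r in RESULTS
    if r.import_speed_mb_s >= best.import_speed_mb_s * (1 - TOLERANCE)
]

print("fastest pool size:", best.pool_size)
print("within 10% of fastest:", near_best)
```

Note that the latency columns keep rising as the pool size grows, so the fastest setting is not automatically the best choice for a latency-sensitive downstream cluster.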

Benchmark results with different row count per statement

In this test, the total amount of imported data is 3.78 GB and the pool-size of the load unit is set to 32. The row count per statement is controlled by the statement-size, rows, or extra-args parameters in the mydumpers configuration item.

Row count per statement	mydumpers extra-args	Max latency of TXN execution (s)	Import time (s)	Import speed (MB/s)	TiDB 99 duration (s)
7506	-s 1500000 -r 320000	8.34	229	16.5	10.64
5006	-s 1000000 -r 320000	6.12	218	17.3	7.23
2506	-s 500000 -r 320000	4.27	232	16.2	3.24
1256	-s 250000 -r 320000	2.25	235	16.0	1.92
629	-s 125000 -r 320000	1.03	246	15.3	0.91
315	-s 62500 -r 320000	0.63	249	15.1	0.44
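
The table also shows how the row count per statement tracks the statement size. The sketch below derives the implied average row size from the table and uses it to estimate the row count for a given statement size. It assumes that the value passed to -s is a size in bytes; the function names are illustrative, and the average row size applies only to this sysbench data set.

```python
# (statement size passed via -s, observed row count per statement), from the table above.
OBSERVED = [
    (1_500_000, 7506),
    (1_000_000, 5006),
    (500_000, 2506),
    (250_000, 1256),
    (125_000, 629),
    (62_500, 315),
]


def bytes_per_row(statement_size: int, rows: int) -> float:
    """Implied average size of one row inside a statement."""
    return statement_size / rows


def estimate_rows_per_statement(statement_size: int, avg_row_bytes: float) -> int:
    """Estimate how many rows fit in one statement of the given size."""
    return round(statement_size / avg_row_bytes)


# Average implied row size across all six test runs.
avg_row_bytes = sum(bytes_per_row(s, r) for s, r in OBSERVED) / len(OBSERVED)
estimated_rows = estimate_rows_per_statement(1_000_000, avg_row_bytes)

print(f"average row size: {avg_row_bytes:.1f} bytes")
print(f"estimated rows for a 1,000,000-byte statement: {estimated_rows}")
```

For a workload with a different row size, the same division gives a rough starting point for choosing a statement size that yields the desired row count.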

Incremental replication benchmark case

For detailed incremental replication test method, see Incremental Replication Benchmark Case.

Incremental replication benchmark results

In this test, the worker-count of sync unit is set to 32 and batch is set to 100.

Items	QPS	TPS	95% latency
MySQL	40.65k	40.65k	1.10ms
DM binlog replication unit	29.1k (The number of binlog events received per unit of time, not including skipped events)	-	92ms (txn execution time)
TiDB	32.0k (Begin/Commit 1.5 Insert 29.72k)	3.52k	95%: 6.2ms 99%: 8.3ms

Benchmark results with different sync unit concurrency

sync unit worker-count	DM QPS	Max DM execution latency (ms)	TiDB QPS	TiDB 99 duration (ms)
4	10.2k	40	10.5k	4
8	17.6k	64	18.9k	5
16	29.5k	80	30.5k	7
32	29.1k	92	32.0k	9
64	27.4k	88	37.7k	14
1024	22.9k	85	57.5k	25

Benchmark results with different SQL distribution

Sysbench type	DM QPS	Max DM execution latency (ms)	TiDB QPS	TiDB 99 duration (ms)
insert_only	29.1k	64	32.0k	8
write_only	23.5k	296	24.2k	18

Recommended parameter configuration

dump unit

We recommend that you set the statement size to 200 KB~1 MB and keep the row count in each statement at approximately 1000~5000. Choose the exact values based on the actual row size in your scenario.

load unit

We recommend that you set pool-size to 16~32.

sync unit

We recommend that you set batch to 100 and worker-count to 16~32.
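
The recommendations above can be checked mechanically against a task configuration. The following is a minimal sketch that represents a configuration fragment as a Python dict and reports values outside the recommended ranges. The section names (mydumpers, loaders, syncers) and the profile name global are assumptions modeled on the DM task configuration layout, so verify them against your own task file; the helper function is hypothetical and not part of DM.

```python
# Recommended ranges taken from this report: key -> (low, high), inclusive.
RECOMMENDED = {
    "pool-size": (16, 32),
    "worker-count": (16, 32),
    "batch": (100, 100),
}

# Hypothetical configuration fragment; section and profile names are assumptions.
task_fragment = {
    "mydumpers": {"global": {"threads": 32, "extra-args": "-s 1000000 -r 320000"}},
    "loaders": {"global": {"pool-size": 16}},
    "syncers": {"global": {"worker-count": 16, "batch": 100}},
}


def out_of_range(fragment: dict) -> list:
    """Return (section, profile, key, value) for each value outside its range.

    Keys without a recommended range (for example threads) are skipped.
    """
    problems = []
    for section, profiles in fragment.items():
        for profile, params in profiles.items():
            for key, value in params.items():
                if key in RECOMMENDED:
                    low, high = RECOMMENDED[key]
                    if not low <= value <= high:
                        problems.append((section, profile, key, value))
    return problems


print(out_of_range(task_fragment))
```

A fragment that sets worker-count to 1024, as in the concurrency test, would be reported by this check, while the fragment shown here passes.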