AUTO_RANDOM New in v3.1.0
User scenario
When you write data intensively into TiDB and TiDB has a table with the primary key of the auto-increment integer type, hotspot issue might occur. To solve the hotspot issue, you can use the AUTO_RANDOM attribute.
For more information, see Highly Concurrent Write Best Practices.
Take the following created table as an example:
CREATE TABLE t (a bigint PRIMARY KEY AUTO_INCREMENT, b varchar(255))
On this t table, you execute a large number of INSERT statements that do not specify the values of the primary key as below:
INSERT INTO t(b) VALUES ('a'), ('b'), ('c')
In the above statement, values of the primary key (column a) are not specified, so TiDB uses the continuous auto-increment row values as the row IDs, which might cause write hotspot in a single TiKV node and affect the performance. To avoid such write hotspot, you can specify the AUTO_RANDOM attribute rather than the AUTO_INCREMENT attribute for the column a when you create the table. See the follow examples:
CREATE TABLE t (a bigint PRIMARY KEY AUTO_RANDOM, b varchar(255))
or
CREATE TABLE t (a bigint AUTO_RANDOM, b varchar(255), PRIMARY KEY (a))
Then execute the INSERT statement such as INSERT INTO t(b) VALUES.... Now the results will be as follows:
- Implicitly allocating values: If the
INSERTstatement does not specify the values of the integer primary key column (columna) or specify the value asNULL, TiDB automatically allocates values to this column. These values are not necessarily auto-increment or continuous but are unique, which avoids the hotspot problem caused by continuous row IDs. - Explicitly inserting values: If the
INSERTstatement explicitly specifies the values of the integer primary key column, TiDB saves these values, which works similarly to theAUTO_INCREMENTattribute. Note that if you do not setNO_AUTO_VALUE_ON_ZEROin the@@sql_modesystem variable, TiDB will automatically allocate values to this column even if you explicitly specify the value of the integer primary key column as0.
TiDB automatically allocates values in the following way:
The highest five digits (ignoring the sign bit) of the row value in binary (namely, shard bits) are determined by the starting time of the current transaction. The remaining digits are allocated values in an auto-increment order.
To use different number of shard bits, append a pair of parentheses to AUTO_RANDOM and specify the desired number of shard bits in the parentheses. See the following example:
CREATE TABLE t (a bigint PRIMARY KEY AUTO_RANDOM(3), b varchar(255))
In the above CREATE TABLE statement, 3 shard bits are specified. The range of the number of shard bits is [1, 16).
After creating the table, use the SHOW WARNINGS statement to see the maximum number of implicit allocations supported by the current table:
SHOW WARNINGS
+-------+------+----------------------------------------------------------+
| Level | Code | Message |
+-------+------+----------------------------------------------------------+
| Note | 1105 | Available implicit allocation times: 1152921504606846976 |
+-------+------+----------------------------------------------------------+
In addition, to view the shard bit number of the table with the AUTO_RANDOM attribute, you can see the value of the PK_AUTO_RANDOM_BITS=x mode in the TIDB_ROW_ID_SHARDING_INFO column in the information_schema.tables system table. x is the number of shard bits.
Values allocated to the AUTO_RANDOM column affect last_insert_id(). You can use SELECT last_insert_id () to get the ID that TiDB last implicitly allocates. For example:
INSERT INTO t (b) VALUES ("b")
SELECT * FROM t;
SELECT last_insert_id()
You might see the following result:
+------------+---+
| a | b |
+------------+---+
| 1073741825 | b |
+------------+---+
+------------------+
| last_insert_id() |
+------------------+
| 1073741825 |
+------------------+
Compatibility
TiDB supports parsing the version comment syntax. See the following example:
CREATE TABLE t (a bigint PRIMARY KEY /*T![auto_rand] auto_random */)
CREATE TABLE t (a bigint PRIMARY KEY AUTO_RANDOM)
The above two statements have the same meaning.
In the result of SHOW CREATE TABLE, the AUTO_RANDOM attribute is commented out. This comment includes an attribute identifier (for example, /*T![auto_rand] auto_random */). Here auto_rand represents the AUTO_RANDOM attribute. Only the version of TiDB that implements the feature corresponding to this identifier can parse the SQL statement fragment properly.
This attribute supports forward compatibility, namely, downgrade compatibility. TiDB of earlier versions that do not implement this feature ignore the AUTO_RANDOM attribute of a table (with the above comment) and can also use the table with the attribute.
Restrictions
Pay attention to the following restrictions when you use AUTO_RANDOM:
- Specify this attribute for the primary key column ONLY of integer type. Otherwise, an error might occur. In addition, when the attribute of the primary key is
NONCLUSTERED,AUTO_RANDOMis not supported even on the integer primary key. For more details about the primary key of theCLUSTEREDtype, refer to clustered index. - You cannot use
ALTER TABLEto modify theAUTO_RANDOMattribute, including adding or removing this attribute. - You cannot change the column type of the primary key column that is specified with
AUTO_RANDOMattribute. - You cannot specify
AUTO_RANDOMandAUTO_INCREMENTfor the same column at the same time. - You cannot specify
AUTO_RANDOMandDEFAULT(the default value of a column) for the same column at the same time. - It is not recommended that you explicitly specify a value for the column with the
AUTO_RANDOMattribute when you insert data. Otherwise, the numeral values that can be automatically allocated for this table might be used up in advance.