TiUP Common Operations

This document describes the common operations when you operate and maintain a TiDB cluster using TiUP.

View the cluster list

You can manage multiple TiDB clusters using the TiUP cluster component. When a TiDB cluster is deployed, the cluster appears in the TiUP cluster list.

To view the list, run the following command:

tiup cluster list

Start the cluster

The components in the TiDB cluster are started in the following order:

PD > TiKV > TiDB > TiFlash > TiCDC > Prometheus > Grafana > Alertmanager

To start the cluster, run the following command:

tiup cluster start ${cluster-name}

You can start only some of the components by adding the -R or -N parameters in the command. For example:

  • This command starts only the PD component:

    tiup cluster start ${cluster-name} -R pd
  • This command starts only the PD components on the 1.2.3.4 and 1.2.3.5 hosts:

    tiup cluster start ${cluster-name} -N 1.2.3.4:2379,1.2.3.5:2379

View the cluster status

After starting the cluster, check the status of each component to ensure that they work normally. TiUP provides the display command, so you do not have to log in to every machine to view the component status.

tiup cluster display ${cluster-name}

Modify the configuration

When the cluster is in operation, if you need to modify the parameters of a component, run the edit-config command. The detailed steps are as follows:

  1. Open the configuration file of the cluster in the editing mode:

    tiup cluster edit-config ${cluster-name}
  2. Configure the parameters:

    • If the configuration is globally effective for a component, edit server_configs:

      server_configs: tidb: log.slow-threshold: 300
    • If the configuration takes effect on a specific node, edit the configuration in config of the node:

      tidb_servers: - host: 10.0.1.11 port: 4000 config: log.slow-threshold: 300

    For the parameter format, see the TiUP parameter template.

    Use . to represent the hierarchy of the configuration items.

    For more information on the configuration parameters of components, refer to TiDB config.toml.example, TiKV config.toml.example, and PD config.toml.example.

  3. Rolling update the configuration and restart the corresponding components by running the reload command:

    tiup cluster reload ${cluster-name} [-N <nodes>] [-R <roles>]

Example

If you want to set the transaction size limit parameter (txn-total-size-limit in the performance module) to 1G in tidb-server, edit the configuration as follows:

server_configs: tidb: performance.txn-total-size-limit: 1073741824

Then, run the tiup cluster reload ${cluster-name} -R tidb command to rolling restart the TiDB component.

Replace with a hotfix package

For normal upgrade, see Upgrade TiDB Using TiUP. But in some scenarios, such as debugging, you might need to replace the currently running component with a temporary package. To achieve this, use the patch command:

tiup cluster patch --help
Replace the remote package with a specified package and restart the service Usage: cluster patch <cluster-name> <package-path> [flags] Flags: -h, --help help for patch -N, --node strings Specify the nodes --overwrite Use this package in the future scale-out operations -R, --role strings Specify the role --transfer-timeout int Timeout in seconds when transferring PD and TiKV store leaders (default 600) Global Flags: --native-ssh Use the system's native SSH client --wait-timeout int Timeout of waiting the operation --ssh-timeout int Timeout in seconds to connect host via SSH, ignored for operations that don't need an SSH connection. (default 5) -y, --yes Skip all confirmations and assumes 'yes'

If a TiDB hotfix package is in /tmp/tidb-hotfix.tar.gz and you want to replace all the TiDB packages in the cluster, run the following command:

tiup cluster patch test-cluster /tmp/tidb-hotfix.tar.gz -R tidb

You can also replace only one TiDB package in the cluster:

tiup cluster patch test-cluster /tmp/tidb-hotfix.tar.gz -N 172.16.4.5:4000

Rename the cluster

After deploying and starting the cluster, you can rename the cluster using the tiup cluster rename command:

tiup cluster rename ${cluster-name} ${new-name}

Stop the cluster

The components in the TiDB cluster are stopped in the following order (The monitoring component is also stopped):

Alertmanager > Grafana > Prometheus > TiCDC > TiFlash > TiDB > TiKV > PD

To stop the cluster, run the following command:

tiup cluster stop ${cluster-name}

Similar to the start command, the stop command supports stopping some of the components by adding the -R or -N parameters. For example:

  • This command stops only the TiDB component:

    tiup cluster stop ${cluster-name} -R tidb
  • This command stops only the TiDB components on the 1.2.3.4 and 1.2.3.5 hosts:

    tiup cluster stop ${cluster-name} -N 1.2.3.4:4000,1.2.3.5:4000

Clean up cluster data

The operation of cleaning up cluster data stops all the services and cleans up the data directory or/and log directory. The operation cannot be reverted, so proceed with caution.

  • Clean up the data of all services in the cluster, but keep the logs:

    tiup cluster clean ${cluster-name} --data
  • Clean up the logs of all services in the cluster, but keep the data:

    tiup cluster clean ${cluster-name} --log
  • Clean up the data and logs of all services in the cluster:

    tiup cluster clean ${cluster-name} --all
  • Clean up the logs and data of all services except Prometheus:

    tiup cluster clean ${cluster-name} --all --ignore-role prometheus
  • Clean up the logs and data of all services except the 172.16.13.11:9000 instance:

    tiup cluster clean ${cluster-name} --all --ignore-node 172.16.13.11:9000
  • Clean up the logs and data of all services except the 172.16.13.12 node:

    tiup cluster clean ${cluster-name} --all --ignore-node 172.16.13.12

Destroy the cluster

The destroy operation stops the services and clears the data directory and deployment directory. The operation cannot be reverted, so proceed with caution.

tiup cluster destroy ${cluster-name}

Switch from Prometheus to VictoriaMetrics

In large-scale clusters, Prometheus might encounter performance bottlenecks when handling a large number of instances. Starting from TiUP v1.16.3, TiUP supports switching the monitoring component from Prometheus to VictoriaMetrics (VM) to provide better scalability, higher performance, and lower resource consumption.

Set up VictoriaMetrics for a new deployment

By default, TiUP uses Prometheus as the metrics monitoring component. To use VictoriaMetrics instead of Prometheus in a new deployment, configure the topology file as follows:

# Monitoring server configuration monitoring_servers: # IP address of the monitoring server - host: ip_address ... prom_remote_write_to_vm: true enable_prom_agent_mode: true # Grafana server configuration grafana_servers: # IP address of the Grafana server - host: ip_address ... use_vm_as_datasource: true

Migrate an existing deployment to VictoriaMetrics

You can perform the migration without affecting running instances. Existing metrics will remain in Prometheus, while TiUP will write new metrics to VictoriaMetrics.

Enable VictoriaMetrics remote write

  1. Edit the cluster configuration:

    tiup cluster edit-config ${cluster-name}
  2. Under monitoring_servers, set prom_remote_write_to_vm to true:

    monitoring_servers: - host: ip_address ... prom_remote_write_to_vm: true
  3. Reload the configuration to apply the changes:

    tiup cluster reload ${cluster-name} -R prometheus

Switch the default data source to VictoriaMetrics

  1. Edit the cluster configuration:

    tiup cluster edit-config ${cluster-name}
  2. Under grafana_servers, set use_vm_as_datasource to true:

    grafana_servers: - host: ip_address ... use_vm_as_datasource: true
  3. Reload the configuration to apply the changes:

    tiup cluster reload ${cluster-name} -R grafana

View historical metrics generated before the switch (optional)

If you need to view historical metrics generated before the switch, switch the data source of Grafana as follows:

  1. Edit the cluster configuration:

    tiup cluster edit-config ${cluster-name}
  2. Under grafana_servers, comment out use_vm_as_datasource:

    grafana_servers: - host: ip_address ... # use_vm_as_datasource: true
  3. Reload the configuration to apply the changes:

    tiup cluster reload ${cluster-name} -R grafana
  4. To switch back to VictoriaMetrics, repeat the steps in Switch the default data source to VictoriaMetrics.

Clean up old metrics and services

After confirming that the old metrics have expired, you can perform the following steps to remove redundant services and files. This does not affect the running cluster.

Set Prometheus to agent mode

  1. Edit the cluster configuration:

    tiup cluster edit-config ${cluster-name}
  2. Under monitoring_servers, set enable_prom_agent_mode to true, and ensure you also set prom_remote_write_to_vm and use_vm_as_datasource correctly:

    monitoring_servers: - host: ip_address ... prom_remote_write_to_vm: true enable_prom_agent_mode: true grafana_servers: - host: ip_address ... use_vm_as_datasource: true
  3. Reload the configuration to apply the changes:

    tiup cluster reload ${cluster-name} -R prometheus

Remove expired data directories

  1. In the configuration file, locate the data_dir path of the monitoring server:

    monitoring_servers: - host: ip_address ... data_dir: "/tidb-data/prometheus-8249"
  2. Remove the data directory:

    rm -rf /tidb-data/prometheus-8249

Was this page helpful?