Understanding MariaDB RESET REPLICA/SLAVE for Clean Replication Starts
Purpose
- Useful in scenarios like:
- Replica server has fallen significantly behind the master due to errors or downtime.
- You want to synchronize the replica with the master from a specific point in time.
- Resets the replica's connection to the master server, causing it to forget its previous replication position in the master's binary log.
- Initiates a clean slate for replication on a replica (formerly called slave) server.
Mechanics
- Stopping Replication
Before executingRESET REPLICA
, it's essential to stop the replication process on the replica usingSTOP SLAVE
. This prevents any further changes from being applied while the reset is underway. - Executing RESET REPLICA
Once replication is stopped, connect to the replica's MariaDB server and issue theRESET REPLICA
statement. This statement discards the replica's current replication information, effectively resetting its connection to the master. - Setting Up Replication
After the reset, you'll need to re-establish the replication connection between the replica and the master. This typically involves:- Configuring the replica's connection details, including the master's hostname, port, username, and password.
- Specifying the replication starting point. You can use either the binary log file (file name) and position (offset within the file) or the master's GTID (Global Transaction Identifier) set.
Example
# On the replica server (after stopping replication):
RESET REPLICA;
# Configure replication connection details:
CHANGE MASTER TO
MASTER_HOST='master_server_hostname',
MASTER_PORT=3306, # Replace with the actual port number
MASTER_USER='replication_user',
MASTER_PASSWORD='replication_password';
# Option 1: Using binary log file and position
START SLAVE FOR CHANNEL 'my_channel' SOURCE='master-bin.000001' POSITION=1024;
# Option 2: Using GTIDs (if enabled on the master)
START SLAVE FOR CHANNEL 'my_channel' SOURCE GTID_SET=('gtid_set_on_master');
- Consider scheduling replica resets during off-peak hours to minimize potential performance impacts on the replica server and any applications it serves.
- During the resynchronization process after the reset, the replica server will experience increased load as it catches up to the master's state.
RESET REPLICA
requires appropriate privileges on the replica server (usually thereplication_slave
privilege).
Resetting and Starting Replication with Binary Log Coordinates (Detailed)
# 1. Stop replication on the replica (assuming it's running):
STOP SLAVE;
# 2. Check the master's current binary log file and position:
SHOW MASTER STATUS \G # Use `\G` for multi-line output
# Example output:
# File: master-bin.000002
# Position: 4096
# 3. Reset replication on the replica:
RESET REPLICA;
# 4. Configure the replica's connection details:
CHANGE MASTER TO
MASTER_HOST='master_server_hostname',
MASTER_PORT=3306, # Replace with the actual port number
MASTER_USER='replication_user',
MASTER_PASSWORD='replication_password';
# 5. Start replication using the retrieved binary log coordinates:
START SLAVE FOR CHANNEL 'my_channel' SOURCE='master-bin.000002' POSITION=4096;
- Lines 4-6: Establish the replication connection with the master, providing the current binary log file (
master-bin.000002
) and position (4096
) obtained from the master. - Line 3: Performs the
RESET REPLICA
operation. - Lines 1-2: Ensure replication is stopped to avoid applying changes while resetting.
Resetting and Starting Replication with GTIDs (if enabled on the master)
# 1. Stop replication on the replica (assuming it's running):
STOP SLAVE;
# 2. Reset replication on the replica:
RESET REPLICA;
# 3. Configure the replica's connection details:
CHANGE MASTER TO
MASTER_HOST='master_server_hostname',
MASTER_PORT=3306, # Replace with the actual port number
MASTER_USER='replication_user',
MASTER_PASSWORD='replication_password';
# 4. Obtain the current GTID set from the master:
SHOW MASTER STATUS \G
# Example output (look for the `Gtid_Set` value):
# Gtid_Set: '3e4c3de8-affe-11e4-86d9-bec5fb040a00:1-3e4c3de8-affe-11e4-86d9-bec5fb040a00:3'
# 5. Start replication using the GTID set:
START SLAVE FOR CHANNEL 'my_channel' SOURCE GTID_SET=('3e4c3de8-affe-11e4-86d9-bec5fb040a00:1-3e4c3de8-affe-11e4-86d9-bec5fb040a00:3');
- Lines 3-5: Establish the replication connection, specifying the
GTID_SET
obtained from the master'sSHOW MASTER STATUS
output. - Lines 1-2: Stop and reset replication as before.
# Example handling potential errors during `RESET REPLICA`
BEGIN;
# Try to reset replication
RESET REPLICA;
# If successful, commit the transaction
COMMIT;
# If an error occurs, rollback the transaction (optional)
ROLLBACK;
# Proceed with further actions (e.g., configuring replication again)
...
- Line 6: Continue with subsequent actions like reconfiguring replication.
- Lines 4-5: If successful, the transaction is committed. If an error occurs (e.g., insufficient privileges), the transaction can be rolled back (optional) to prevent unintended changes.
- Line 3: Attempts to reset replication.
- Lines 1-2: Encapsulate the
RESET REPLICA
statement within a transaction.
Point-in-Time Recovery with Backup and Restore
- Drawbacks
- Requires a recent and valid backup of the master server.
- Can be time-consuming, especially for large databases.
- Benefits
- Provides a more controlled approach to synchronization, allowing you to restore to a specific point in time.
- Process
- Take a backup of the master server at a desired point in time (e.g., using
mysqldump
or a dedicated backup tool). - Restore the backup to the replica server.
- Configure replication on the replica with the master, starting from the same binary log file and position captured during the backup.
- Take a backup of the master server at a desired point in time (e.g., using
- Scenario
If your replica server has diverged significantly from the master due to errors or downtime, and you have a recent backup of the master, consider point-in-time recovery.
Incremental Replication with CHANGE MASTER TO
- Drawbacks
- Requires accurate knowledge of the master's current replication position.
- May not be suitable for large discrepancies between master and replica.
- Benefits
- Faster than resetting and resynchronizing from scratch.
- Useful for catching up minor delays in replication.
- Process
- Stop replication on the replica using
STOP SLAVE
. - Use
SHOW MASTER STATUS
on the master to obtain the current binary log file and position. - Issue
CHANGE MASTER TO
on the replica, specifying the retrieved coordinates as the new starting point. - Start replication again using
START SLAVE
.
- Stop replication on the replica using
- Scenario
If the replica is slightly behind the master, you can useCHANGE MASTER TO
to update its replication position without a complete reset.
Semi-Sync Replication (if enabled)
- Drawbacks
- Requires additional configuration and may not be suitable for all environments.
- Benefits
- Automated resynchronization of lagging replicas.
- Reduces disruption to operations on the master.
- Process
- Enable semi-sync replication on both master and replica.
- The master will wait for acknowledgment from the replica before committing transactions.
- If the replica falls behind, the master will automatically attempt to resynchronize it.
- Scenario
If your MariaDB setup supports semi-sync replication, it can automatically attempt to catch up a lagging replica without a full reset.
Choosing the Right Alternative
The best alternative depends on your specific situation:
- If your environment supports semi-sync replication and you want automated resynchronization, enable it.
- If the replica is only slightly behind, consider using
CHANGE MASTER TO
. - If you have a recent backup and need precise control over the recovery point, use point-in-time recovery.