IBM Cloud Docs
Disaster Recovery - Failover from an inaccessible primary volume

Disaster Recovery - Failover from an inaccessible primary volume

If a catastrophic failure or disaster causes an outage on the primary site, customers can perform the following actions to quickly access their data on the secondary site. When the primary volume is inaccessible, you can force a failover to the remote replica. Before you start the failover, make sure that all host authorization is in place.

Authorized hosts and volumes must be in the same data center. For example, you can't have a replica volume in London and the host in Amsterdam. Both must be in London or both must be in Amsterdam.

You can create the authorization in the UI, from the CLI, with the API, or with Terraform.

This action breaks the replication relationship and restoring the connection between the primary and the replica location can be time-consuming.

Failing over to the replica volume in the UI

  1. Go to your list of File Storage for Classic. From the Classic Infrastructure Classic icon menu, click Storage > File Storage for Classic.
  2. Locate and click the volume name.
  3. Click Actions Actions icon > Failover.
  4. When the primary location is disabled, the option of Disaster Recovery Failover becomes active.
  5. Click Yes to proceed.

Failing over to the replica volume from the CLI

Before you begin, decide on the CLI client that you want to use.

Initiating a failover from the IBMCLOUDCLI

You can use the ibmcloud sl file replica-failover command to fail over operations from the source file share to the replica file share. The following example initiates a failover from the source share 560156918 to the replica share 560382016.

$ ibmcloud sl file file disaster-recovery-failover 560156918 560382016
OK
Failover of volume 560156918 to replica 560382016 is now in progress.

Initiating a failover from the SLCLI

Use the following command to fail a file volume over to a specific replicant volume.

$ slcli file disaster-recovery-failover --help
sage: slcli file disaster-recovery-failover [OPTIONS] VOLUME_ID

Options:
--replicant-id TEXT  ID of the replicant volume
-h, --help           Show this message and exit.

Failing over to the replica volume with the API

REST API

  • URL - https://USERNAME:APIKEY@api.softlayer.com/rest/v3/SoftLayer_Network_Storage/primaryvolumeId/disasterRecoveryFailoverToReplicant
  • Request body
 {
   "parameters": [replicavolumeid]
 }

SOAP API

  • URL - https://api.softlayer.com/soap/v3/SoftLayer_Network_Storage
  • Request body
    <?xml version="1.0" encoding="UTF-8"?>
    <SOAP-ENV:Envelope xmlns:SOAP-ENV="http://schemas.xmlsoap.org/soap/envelope/" xmlns:ns1="http://api.service.softlayer.com/soap/v3.1/">
    <SOAP-ENV:Header>
     <ns1:authenticate>
     <username>USERNAME</username>
     <apiKey>APIKEY</apiKey>
    </ns1:authenticate>
    <ns2:SoftLayer_Network_StorageInitParameters>
     <id>primary Volume Id</id>
    </ns2:SoftLayer_Network_StorageInitParameters>
    </SOAP-ENV:Header>
    <SOAP-ENV:Body>
     <ns1:disasterRecoveryFailoverToReplicant>
      <replicantId xsi:type="int">replica Volume ID</replicantId>
     </ns1:disasterRecoveryFailoverToReplicant>
    </SOAP-ENV:Body>
    </SOAP-ENV:Envelope>
    

During the Disaster Recovery Failover, the system is forced to fail over to the replica site and the replication relationship is severed. To be able to fail back to the original site after the site is restored to normal operations, the system must reestablish the replication bond. This action can take a considerable amount of time. During the Failback process, configuration-related actions are read-only. You can't edit any snapshot schedule or change snapshot space. The event is logged in the replication history.

If you need further assistance, create a support case.

Failing back to the original primary site in the UI

After a disaster event, IBM Cloud® begins remediation work to return the impacted locations to normal operations. When the site is restored, you can initiate a Failback to the original site by clicking Storage, File Storage for Classic in the IBM Cloud® console.

  1. Click your active volume ("target").

  2. Next, click Replica and click Actions Actions icon.

  3. Select Failback. When the primary location is marked disabled, the option of Disaster Recovery Failback becomes active.

    During the Disaster Recovery Failover, the system is forced to fail over to the replica site and the replication relationship is severed. To be able to fail back to the original site after the site is restored to normal operations, the system must reestablish the replication bond. This action can take a considerable amount of time. Expect a message that shows that the failover is in progress. Additionally, an icon appears next to your volume on the File Storage for Classic that indicates that an active transaction is occurring. Hovering over the icon produces a window that shows the transaction. The icon disappears when the transaction is complete. During the Failback process, configuration-related actions are read-only. You can't edit any snapshot schedule or change snapshot space. The event is logged in replication history.

  4. Next, click View All File Storage for Classic.

  5. Click your replica volume ("source"). This volume now has an Inactive status.

  6. Mount and attach your storage volume to the host. For more information, see connecting your storage.

If you need further assistance, create a support case.

Failing back from the CLI

Initiating a failback from the IBMCLOUDCLI

You can use the ibmcloud sl file replica-failback command to fail back operations from the replica file share to the original source file share. The following example initiates a failback to the original source share 560156918.

$ ibmcloud sl file replica-failback 560156918
OK
Failback of volume 560156918 is now in progress.

For more information about all of the parameters that are available for this command, see ibmcloud sl file replica-failover.

Initiating a failback from the SLCLI

To fail back a file volume from a specific replicant volume, use the following command.

$ slcli file replica-failback --help
Usage: slcli file replica-failback [OPTIONS] VOLUME_ID

Options:
 --replicant-id TEXT  ID of the replicant volume
 -h, --help           Show this message and exit.

During the Disaster Recovery Failover, the system is forced to fail over to the replica site and the replication relationship is severed. To be able to fail back to the original site after the site is restored to normal operations, the system must reestablish the replication bond. This action can take a considerable amount of time. During the Failback process, configuration-related actions are read-only. You can't edit any snapshot schedule or change snapshot space. The event is logged in the replication history.

When the original volume is active, you can mount and attach it to the host. For more information, see connecting your storage.

If you need further assistance, create a support case.