Disaster Recovery

The following clusters are supported by Spektra for Disaster Recovery as shown:

Source

Target

UA

UE-GCP

UE-AWS

UE-AWS

UE-AWS

UE-GCP

UE-GCP

UE-AWS

UA

UE-GCP

UA

If there’s an application running on one cluster (source) and you set up Disaster Recovery for it while specifying the target cluster on which you want to store the recovered application, the sys­tem moves data from the source cluster to the target cluster based on the replication interval you specify. Any data that changes in the duration on the source is moved to the target.

Replication is the process of copying data and meta-data from a primary site to a secondary site. Replication provides an extra measure of redundancy that can be invaluable if the primary site fails. Access (based on the duration you specify) to the replicated data minimizes downtime and its associated costs. This can be invaluable if the appliance disk cache or the entire appliance fails.

Replication streamlines disaster recovery processes by generating duplicate copies of all changes regularly. It can also simplify recovery from all types of disasters, including fire, flood, hurricane, virus, or worm.

Note

The Disaster Recovery feature is not supported for cloud clusters (GKE and EKS).

Pre-requisite

Create a project using at least two clusters.

_images/create_project.png

Enabling DR between two Clusters

To enable DR between two clusters:

  1. The application must be installed on the UA or UE cluster and should be in the running state.

  2. On the Catalog page, select the work template to enable the DR.

    _images/catalog_page_new.png

    You are directed to the installation tab of the workload.

    _images/Installation_tab_new.png
  3. Select the DR Settings tab and then select Update DR Settings.

    _images/dr_page_new.png
    1. Enable the DR toggle button.

    2. Select a Source Network. This is the network on the cluster used for replication.

    3. Select a Target Cluster from the drop-down. It can be either UA or UE. It should belong to the same project.

    4. Select a Target Network for the Disaster Recovery process.

    5. Select a Replication Period from the drop-down.

    6. Select the toggle to retain the volume on deletion

      _images/enable_dr.png

    Note

    When DR UE (AWS) to UA or DR UE (GCP) to UA is performed, the replication agent pod on the source UE (AWS/GCP) cluster cannot communicate with the replication agent pod on the target UA cluster. Since UA clusters usually have private IP addresses, pods created on UA clusters get private IP addresses. DR will work, if the system administrator maps a public IP address or external IP address to a private endpoint of the agent pod.

    _images/DR_exteral.png

    Note

    Ensure you run the diamanti-object-migration.sh –all script from node to view the available network on all applicable clusters. Other cases will still allow you to enable or disable Disaster Recovery, but the available networks will not appear.

  4. Select Update.

DR has been enabled, and you can see the details on the DR settings tab.

_images/dr_enabled_page_new.png

Updating DR Settings

After you enable DR, update DR settings, where you can change the replication period and suspend replication:

  1. Select the workload from the Catalog page.

  2. Select the DR Settings tab.

  3. Select Update DR Settings to display the DR Settings page.

  4. Change the Replication Period or Suspend Replication if necessary.

    _images/update_dr.png
  5. Select Update.

Enabling Fire Drill

After you enable DR, you can start the fire-drill:

  1. Select the workload from the catalog page.

  2. Select the DR Settings tab.

  3. Select the ellipsis next to the update DR settings.

    _images/options.png
  4. Select Fire drill to display the Fire drill setting page.

    _images/fire-drill.png
  5. Select Start Fire-drill to enable the application to Fire-drill from the source cluster to the target cluster.

Enabling Failover

After you enable DR, you can start the failover.

Important

Failover can only be enabled after replication has completed its initial sync.

  1. Select the workload from the catalog page.

  2. Select the DR Settings tab.

  3. Select the ellipsis next to the update DR settings.

  4. Select Fail Over to display the Fail Over setting page.

    _images/fail-over.png
  5. Select Fail Over to enable the application to failover from the source cluster to the target cluster.

Note

Wait for a while for the message Disaster Recovery PendingFailoverPhase to disappear after we initiate the failover.

Disabling DR

Once the DR is enabled, you can disable

  1. Select the workload from the catalog page.

  2. Select the DR Settings tab.

  3. Select the update DR settings.

  4. Disable the toggle.

    _images/disbale_dr.png
  5. Select Update to disable the DR for that installation.