How to Recover EMC SRDF Data after a Primary Room's
Complete Failure

This procedure performs data recovery when a campus cluster's primary
room fails completely, the primary room fails over to a secondary room, and
then the primary room comes back online. The campus cluster's primary room
is the primary node and storage site. The complete failure of a room includes
the failure of both the host and the storage in that room. If the primary
room fails, Sun Cluster automatically fails over to the secondary room, makes
the secondary room's storage device readable and writable, and enables the
failover of the corresponding device groups and resource groups.

When the primary room returns online, you can manually recover the data
from the SRDF device group that was written to the secondary room and resynchronize
the data. This procedure recovers the SRDF device group by synchronizing the
data from the original secondary room (this procedure uses phys-campus-2 for the secondary room) to the original primary room (phys-campus-1). The procedure also changes the SRDF device group
type to RDF1 on phys-campus-2 and to RDF2 on phys-campus-1.

These instructions demonstrate one method you can use to manually
recover SRDF data after the primary room fails over completely and then comes
back online. Check the EMC documentation for additional methods.

Log into the campus cluster's primary room to perform these steps. In
the procedure below, dg1 is the SRDF device group
name. At the time of the failure, the primary room in this procedure is phys-campus-1 and the secondary room is phys-campus-2.

Log into the campus cluster's primary room and become superuser
or assume a role that provides solaris.cluster.modify RBAC
authorization.

From the primary room, use the symrdf command
to query the replication status of the RDF devices and view information about
those devices.

phys-campus-1# symrdf -gdg1query

Tip –

A device group that is in the split state
is not synchronized.

If the RDF pair state is split and the device group type is RDF1,
then force a failover of the SRDF device group.

phys-campus-1# symrdf -gdg1-force failover

View the status of the RDF devices.

phys-campus-1# symrdf -gdg1query

After the failover, you can swap the data on the RDF devices that
failed over.

phys-campus-1# symrdf -gdg1swap

Verify the status and other information about the RDF devices.

phys-campus-1# symrdf -gdg1query

Establish the SRDF device group in the primary room.

phys-campus-1# symrdf -gdg1establish

Confirm that the device group is in a synchronized state and that
the device group type is RDF2.

This example provides the Sun Cluster-specific steps necessary to manually
recover EMC SRDF data after a campus cluster's primary room fails over, a
secondary room takes over and records data, and then the primary room comes
back online. In the example, the SRDF device group is called dg1 and
the standard logical device is DEV001. The primary room is phys-campus-1 at the time of the failure, and the secondary room is phys-campus-2. Perform the steps from the campus cluster's primary
room, phys-campus-1.