Heartbeat HDD Replacement Instructions

1. Replace the drive by following the failed HDD replacement instructions. Be sure to note the WWN of both the failed HDD and the replacement HDD.

2. Open the cluster configuration file in your preferred editor (this example uses vim):

# vim /opt/HAC/RSF-1/etc/config

3. Locate the “Machines section”. This is where the edit will be made.

 DISCLAIMER: Follow these instructions exactly as stated. An incorrectly edited config file can cause downtime.

 

Example “Machines section” shown below:

# Machines section

MACHINE zstax01

NET zstax02

DISC zstax02 /dev/rdsk/c0t5000C500409BD39Bd0s0:518:512 TAG pool1

DISC zstax02 /dev/rdsk/c0t5000C5003C9AE94Fd0s0:518:512 TAG pool0

DISC zstax02 /dev/rdsk/c0t5000C5003CA49C3Fd0s0:518:512 TAG pool0

DISC zstax02 /dev/rdsk/c0t5000C500409B084Fd0s0:518:512 TAG pool1

MACHINE zstax02

NET zstax01

DISC zstax01 /dev/rdsk/c0t5000C500409BD39Bd0s0:512:518 TAG pool1

DISC zstax01 /dev/rdsk/c0t5000C5003C9AE94Fd0s0:512:518 TAG pool0

DISC zstax01 /dev/rdsk/c0t5000C5003CA49C3Fd0s0:512:518 TAG pool0

DISC zstax01 /dev/rdsk/c0t5000C500409B084Fd0s0:512:518 TAG pool1

 

In this example, we need to replace c0t5000C500409BD39Bd0. It appears in the first DISC line under each MACHINE entry.
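Before editing, it can help to list every line that references the failed WWN. The following is a minimal sketch, assuming the config follows the MACHINE/DISC layout shown in the example above; `list_wwn_refs` is a hypothetical helper name and is not part of RSF-1.

```shell
#!/bin/sh
# list_wwn_refs is a hypothetical helper, not part of RSF-1.
# Usage: list_wwn_refs <config-file> <WWN>
# Prints each DISC line whose device path contains the WWN,
# prefixed with the MACHINE entry it belongs to and its line number.
list_wwn_refs() {
    awk -v wwn="$2" '
        $1 == "MACHINE" { machine = $2 }      # remember the current MACHINE entry
        $1 == "DISC" && index($3, wwn) {      # device path is the third field
            printf "%s: line %d: %s\n", machine, NR, $0
        }
    ' "$1"
}
```

For the example config, `list_wwn_refs /opt/HAC/RSF-1/etc/config 5000C500409BD39B` should report two lines, one under each MACHINE entry. Any other count suggests the file differs from the layout assumed here and should be reviewed by hand.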

4. Replace the old WWN with the replacement WWN. You need to do this for both instances of the WWN, once under each MACHINE entry.

5. Save and exit the file.

6. On the other cluster node, make the exact same changes to the configuration file.

7. Distribute the updated config file.

# /opt/HAC/RSF-1/bin/config_dist --hot /opt/HAC/RSF-1/etc/config <node1> <node2>

<node1> and <node2> are the machine names, e.g. zstax01 and zstax02.

8. The updated config file should have been served out to both nodes, and the new heartbeat device should be up and running. You can check this via NMV, or with rsfcli:

# /opt/HAC/RSF-1/bin/rsfcli status

9. Look for the updated WWN and make sure its status is “Up”.
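The substitution in steps 4 and 5 can also be scripted instead of done by hand in an editor. The following is a minimal sketch, assuming the WWN consists only of hexadecimal digits and appears nowhere else in the file; `replace_wwn` is a hypothetical helper name and is not part of RSF-1.

```shell
#!/bin/sh
# replace_wwn is a hypothetical helper, not part of RSF-1.
# Usage: replace_wwn <config-file> <old-WWN> <new-WWN>
# Assumes the WWNs contain only hex digits, so they are safe in a sed pattern.
replace_wwn() {
    config=$1
    old=$2
    new=$3

    [ -f "$config" ] || { echo "no such file: $config" >&2; return 1; }

    # Count the lines that mention the failed WWN before touching anything.
    count=$(grep -c -- "$old" "$config" || true)
    if [ "$count" -eq 0 ]; then
        echo "no reference to $old in $config" >&2
        return 1
    fi

    # Keep a copy of the original so the edit can be reverted by hand.
    cp -p "$config" "$config.bak" || return 1

    # Rewrite from the backup into the live file, replacing every occurrence.
    sed "s/$old/$new/g" "$config.bak" > "$config" || return 1

    echo "updated $count line(s); original saved as $config.bak"
}
```

A call might look like `replace_wwn /opt/HAC/RSF-1/etc/config 5000C500409BD39B <new WWN>`, where `<new WWN>` is the WWN noted in step 1. For the example config the function should report two updated lines. Review the result before distributing it as described in step 7.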

 
