Replacing Both System Disk Modules
This section explains how to replace a system disk module for the P36/C36.
Prerequisites
- The replacement system disk module is ready.
- The faulty system disk module has been located.
Precaution
- To prevent damaging system disk modules or connectors, remove or install system disk modules with even force.
- When you remove a system disk module, first remove it from the port. Wait at least 30 seconds and remove the system disk module completely from the main control board enclosure.
- To prevent a system disk module from damage, wait at least 1 minute after removal and before reinserting a system disk module, or after insertion and before removing a system disk module.
- To prevent system failure, do not change a used hard drive into the node.
- If the system version is earlier than V300R006C00SPC200, check whether the /opt/inspect directory exists after replacing the two system disks. If the directory does not exist, copy the inspect folder from another node to the path. Ensure that the user groups of the inspect folder and their permissions are the same as those of other nodes.
- If the hot patch has been installed, install it on the faulty node again after replacing the two system disks.
Procedure
- Follow 1 to 9 of Replacing a System Disk Module to replace two system disk modules.
- Use an operating system image file matching the original version of the node to be restored. You can refer to OceanStor 9000 V5 Software Installation Guide to set a RAID group for system disks and install an operating system image.
- Log in to the operating system as user root by using the KVM. Configure a preset IP address for an original back-end network port of the node, enabling the node to communicate with other nodes in the system. For example, if the back-end network
port is A0-0, its preset IP address is 192.168.0.10, and subnet mask is 16 bits, run the ifconfig A0-0 192.168.0.10/16 command to configure the IP address.
The back-end network port name and IP address of a node can be found in the configuration file for software installation. Or you can log in to the DeviceManager, click the device icon to go to Device View. Then click the node name and choose Ports.
- If the cluster contains multiple
nodes, on the node to be repaired, run the /usr/bin/BackUp_KeyData_Recovery.sh local_ip remote_ip command to restart the node.
In this command, local_ip indicates the back-end network IP address of the node to be repaired, and remote_ip indicates the back-end IP address of an normal node whose version is the same as that of the node to be repaired. To obtain the node version information, log in to DeviceManager and click the device icon. On the Device View page that is displayed, select Basic Node Information.
- Log in to the DeviceManager, click the device icon to go to Device View. Then click the node name and choose Basic Node Information. View whether the node is operating correctly.
If a node is normal, its Health Status is Normal and Running Status is Online.
- If yes, go to 6.
- If no, contact technical support engineers.
- Inspect the system using SmartKit. Choose OceanStor 9000 V5 Routine Maintenance.
to check the system status. For details, see the
- If all inspection items are successful: The inspection is successful.
- If some inspection items fail the inspection: The inspection is failed. Rectify the faults by taking recommended actions in the inspection reports. You need to reperform inspection after rectifying. If the inspection is still failed, contact Huawei technical support.