Reconstruction of a RAID Group Fails Because a Hard Disk Is Faulty or Removed

Publication Date:  2012-07-17 Views:  196 Downloads:  0
Issue Description
Related information about the product and version: CSS V100R001C01 Database Volume.
During the RAID group initialization, the 0xB02160010 RAID group reconstruction fails alarm is reported to the ISM.
Alarm Information
None
Handling Process
 Step 1     Log in on the maintenance terminal to the alarm device as the root user.                               
Step 2    
Run the MegaCli64 -adpautorbld -enbl -aall command to set the LSI RAID card to automatically reconstruct.

Step 3     Re-insert the hard disk or replace it with a new one according to the alarm description.

Step 4     Set the replacement or newly inserted hard disk as the hot spare disk.

Step 5     Run the MegaCli64 -pdrbld -showprog -physdrv [E:S] -aall command to check the reconstruction progress of the hard disk, as shown in the red circle in Figure 1-20. physdrv indicates the hard disk whose reconstruction progress is to be checked, E and S in [E:S] indicate Enclosure Device ID and Slot Number respectively.

Run the MegaCli64 -pdlist -aall | grep Enclosure -m1 command to query the Enclosure ID.
Figure 1-1  Checking the reconstruction progress of the hard disk

 
Step 6     After the reconstruction, run the MegaCli64 -ldinfo -lx-aall command to check the status of the RAID group, as shown in the red circle in Figure 1-21. x specifies the virtual drive number for the command.
Figure 1-2  Checking the reconstruction progress of the hard disk

 
l   If the RAID group is in the Optimal state, the reconstruction succeeds.
l   Otherwise, contact technical support engineers.
----End
Root Cause
On the ISM interface, the alarm listed in Table 1-13 is reported.
Table 1-1 Alarm on a RAID group reconstruction failure
Alarm ID Alarm Name Alarm Cause Alarm Description
0xB02160010 RAID group reconstruction fails A hard disk is faulty or removed. The hard disk ([slot-id]) of the device ([dev-name]) is faulty, which results in the reconstruction failure of the RAID group ([raid-name]); location: cloud storage domain ([domain-name]), rack ID ([rack]), frame ID ([frame]), device ([dev-name]).

Check the disk online indicator according to the alarm information. The indicator is red. Therefore, the RAID group fails because a hard disk is damaged.
 
Suggestions
None

END