No relevant resource is found in the selected language.

This site uses cookies. By continuing to browse the site you are agreeing to our use of cookies. Read our privacy policy>Search

Reminder

To have a better experience, please upgrade your IE browser.

upgrade

FusionServer Pro Rack Server iBMC Alarm Handling 31

This document describes iBMC alarms in terms of the meaning, impact on the system, possible causes, and handling suggestions.

Rate and give feedback:
Huawei uses machine translation combined with human proofreading to translate this document to different languages in order to help you better understand the content of this document. Note: Even the most advanced machine translation cannot match the quality of professional translators. Huawei shall not bear any responsibility for translation accuracy and it is recommended that you refer to the English document (a link for which has been provided).
ALM-0x0200002F Rapid Increase of PHY Bit Errors on the Link Between the RAID Controller Card and Hard Disk (Disk, Major Alarm)

ALM-0x0200002F Rapid Increase of PHY Bit Errors on the Link Between the RAID Controller Card and Hard Disk (Disk, Major Alarm)

Description

Alarm message:

The PHY bit errors of the link between arg1 and arg2 increased too fast (SN: arg3, BN: arg4).

From iBMC V316, the CPU and disk alarms will also include the SN and BOM code and the mainboard and memory alarms will also include the BOM code.

This alarm is generated when the system detects sharply increased PHY bit errors on the links between the hard disks and the RAID controller card.

Expander controllers of different servers are located on different components, such as drive backplanes, mainboards, and GPU boards. For details, see the user manual of each server.

Alarm object: disk

Attribute

Alarm ID

Alarm Severity

Auto Clear

0x0200002F

Major

Yes

Parameters

Name

Meaning

arg1

RAID controller card, for example, RAID Card1.

arg2

Hard disk, for example, Disk0.

arg3

Disk serial number.

arg4

BOM code.

Impact on the System

The devices (such as the RAID controller card, expander controller, and hard drives) connected to the links are affected. The system may stop responding and data may be lost.

Possible Causes

  • The cables are not connected correctly or firmly.
  • The cables are faulty.
  • The drive backplane is faulty.
  • The hard disk is faulty.
  • The RAID controller card is faulty.

Procedure

  1. Connect the cables between the RAID controller card and the drive backplane. Then, check whether the alarm is cleared.

    • If yes, no further action is required.
    • If no, go to 2.

  2. Replace the cables. Then, check whether the alarm is cleared.

    • If yes, no further action is required.
    • If no, go to 3.

  3. Replace the hard disks. Then, check whether the alarm is cleared.

    • If yes, no further action is required.
    • If no, go to 4.

  4. Replace the drive backplane. Then, check whether the alarm is cleared.

    • If yes, no further action is required.
    • If no, go to 5.

  5. Replace the RAID controller card. Then, check whether the alarm is cleared.

    • If yes, no further action is required.
    • If no, go to 6.

  6. Contact Huawei technical support.

Download
Updated: 2019-11-19

Document ID: EDOC1000054724

Views: 396714

Downloads: 3202

Average rating:
This Document Applies to these Products

Related Version

Related Documents

Share
Previous Next