Huawei Rack Server iBMC Alarm Handling 28

This document describes iBMC alarms in terms of the meaning, impact on the system, possible causes, and handling suggestions.
ALM-0x070BFFFF Uncorrectable CPU Error (CPUN Status)

Description

Alarm message:

Uncorrectable CPU error

This alarm is generated when one of the following errors occurs:

  • The SMI2 link fails in non-memory mirroring mode.
  • The CPU encounters an error while executing a program.
  • A parity error occurs on the voltage mode single ended (VMSE) link.
  • The memory controller receives data marked with the poison tag.

Sensor triggering the alarm: CPUN Status

Attribute

Alarm ID      Alarm Severity    Auto Clear
0x070BFFFF    Critical          Yes

Parameters

Name    Meaning
N       Indicates a CPU number.
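
When alarms are processed by a script, the CPU number N can be read from the sensor name. The following is a minimal sketch, assuming the sensor name appears literally as "CPU" followed by digits and " Status" (for example, "CPU1 Status"). The function name and the exact string format are assumptions for illustration and are not defined by the iBMC.

```python
import re
from typing import Optional

# Assumed sensor name format: "CPU<N> Status", where <N> is the CPU number.
SENSOR_PATTERN = re.compile(r"^CPU(\d+) Status$")


def cpu_number_from_sensor(sensor_name: str) -> Optional[int]:
    """Return the CPU number N from a "CPUN Status" sensor name, or None."""
    match = SENSOR_PATTERN.match(sensor_name.strip())
    return int(match.group(1)) if match else None


if __name__ == "__main__":
    print(cpu_number_from_sensor("CPU2 Status"))
```

Names that do not match the assumed pattern return None, so a caller can skip sensors that belong to other alarms.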

Impact on the System

Services are interrupted, or the system restarts.

Possible Causes

  • The CPU is faulty.

  • The mainboard is faulty.

Procedure

  1. Power off the server, remove and reconnect the power cables, and power on the server. Check whether the alarm is cleared.

    • If yes, no further operation is required.

    • If no, go to 2.

  2. Remove and reinstall the CPU. Then, check whether the alarm is cleared.

    • If yes, no further operation is required.

    • If no, go to 3.

  3. Swap the CPU with a functioning CPU in the same chassis, and check whether the alarm is still generated for the original CPU in its new socket.

    • If yes, go to 4.

    • If no, go to 5.

  4. Replace the faulty CPU. Then, check whether the alarm is cleared.

    For details about how to replace the CPU, see "Replacing Parts" in the server user guide.

    • If yes, no further operation is required.

    • If no, go to 6.

  5. Replace the mainboard. Check whether the alarm is cleared.

    For details about how to replace the mainboard, see "Replacing Parts" in the server user guide.

    • If yes, no further operation is required.

    • If no, go to 6.

  6. Contact Huawei technical support.
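
The branching in the procedure above can be summarized as a small lookup table. The following is an illustrative sketch only: the names TRANSITIONS, next_step, and walk are hypothetical, and the code models the yes/no decisions in the steps rather than any iBMC interface.

```python
DONE = "done"

# For each step: (next step if the answer is yes, next step if the answer is no).
# Step 6 (contact Huawei technical support) is terminal and has no entry.
TRANSITIONS = {
    1: (DONE, 2),  # power cycle: alarm cleared?
    2: (DONE, 3),  # reseat the CPU: alarm cleared?
    3: (4, 5),     # swap CPUs: alarm still generated for this CPU?
    4: (DONE, 6),  # replace the CPU: alarm cleared?
    5: (DONE, 6),  # replace the mainboard: alarm cleared?
}


def next_step(step, answer_yes):
    """Return the step that follows `step` for a yes (True) or no (False) answer."""
    on_yes, on_no = TRANSITIONS[step]
    return on_yes if answer_yes else on_no


def walk(answers):
    """Follow the procedure from step 1, given a {step: bool} map of answers."""
    step = 1
    path = [step]
    while step in TRANSITIONS:
        step = next_step(step, answers[step])
        path.append(step)
    return path


if __name__ == "__main__":
    # The alarm persists through steps 1 and 2, stays with the swapped CPU,
    # and clears after the CPU is replaced.
    print(walk({1: False, 2: False, 3: True, 4: True}))
```

A path that ends in 6 means the procedure is exhausted and Huawei technical support should be contacted.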
Updated: 2019-06-04

Document ID: EDOC1000054724
