FusionServer Pro E9000 Server iBMC (Earlier than V250) Alarm Handling 02

ALM-070CFFFF Correctable Machine Check Error (CPUN Status)

Description

Alarm message:

Correctable Machine Check Error

This alarm is generated when the sensor detects a correctable machine check error in a CPU.

This alarm is generated by the following sensor:

  • CPUN Status (N indicates a CPU number.)

Attribute

  • Alarm ID: 070CFFFF
  • Alarm Severity: Minor
  • Auto Clear: Yes

Parameters

  • Time: Time when an alarm is generated.
  • Sensor: Name of the sensor that generates an alarm.
  • Event: Details about an alarm.
  • Severity: Severity of an alarm.
  • Event Code: Event code that corresponds to an alarm.
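
As an illustration of how the attributes and parameters above fit together, the following minimal Python sketch models one alarm record and checks whether it matches this alarm. The class and function names (AlarmRecord, is_correctable_mce, cpu_number) are hypothetical and are not part of any iBMC interface. The sketch also assumes that the Event Code field carries the alarm ID, optionally with a "0x" prefix; the format reported by a real iBMC may differ.

```python
from dataclasses import dataclass
from typing import Optional

# Attribute values taken from the Attribute list in this section.
ALARM_ID = "070CFFFF"
ALARM_SEVERITY = "Minor"
AUTO_CLEAR = True


@dataclass
class AlarmRecord:
    """One alarm entry; the fields mirror the Parameters list (hypothetical type)."""
    time: str
    sensor: str
    event: str
    severity: str
    event_code: str


def is_correctable_mce(record: AlarmRecord) -> bool:
    """Return True if the record's event code matches ALM-070CFFFF.

    Assumption: the event code equals the alarm ID, in any letter case,
    with an optional "0x" prefix.
    """
    code = record.event_code.strip().upper()
    if code.startswith("0X"):
        code = code[2:]
    return code == ALARM_ID


def cpu_number(record: AlarmRecord) -> Optional[int]:
    """Extract N from a sensor name of the form "CPUN Status", or return None."""
    parts = record.sensor.split()
    if len(parts) == 2 and parts[1] == "Status" and parts[0].startswith("CPU"):
        digits = parts[0][3:]
        return int(digits) if digits.isdigit() else None
    return None
```

A script that post-processes exported alarm logs could use cpu_number to find which CPU's memory risers the alarm refers to. The record values used with this sketch are illustrative only.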

Impact on the System

The DIMMs corresponding to the CPU cannot be used. As a result, server performance may deteriorate.

Possible Causes

  • The SMI2 link has failed in memory mirroring mode.
  • An internal error has occurred in the Jordan Creek memory buffer.
  • The number of errors that occur during data transmission between the Jordan Creek memory buffer and the memory controller has reached the alarm threshold.

Procedure

  1. Replace the memory risers corresponding to the CPU.

    For details about how to replace a memory riser, see the server user guide.

    The server can still operate properly while this alarm is active, so replace the memory riser at an off-peak time.

    After the replacement, check whether the alarm is cleared.

    • If yes, no further action is required.
    • If no, go to Step 2.

  2. Contact Huawei technical support for help.
Updated: 2019-11-19

Document ID: EDOC1100035007