No relevant resource is found in the selected language.

This site uses cookies. By continuing to browse the site you are agreeing to our use of cookies. Read our privacy policy>Search

Reminder

To have a better experience, please upgrade your IE browser.

upgrade

E9000 Server V100R001 HMM Alarm Handling 19

This document describes E9000 server alarms in terms of the meaning, impact on the system, possible causes, and solutions.
Rate and give feedback :
Huawei uses machine translation combined with human proofreading to translate this document to different languages in order to help you better understand the content of this document. Note: Even the most advanced machine translation cannot match the quality of professional translators. Huawei shall not bear any responsibility for translation accuracy and it is recommended that you refer to the English document (a link for which has been provided).
Slot: Memory MCE Error (Critical, DIMMN)

Slot: Memory MCE Error (Critical, DIMMN)

Description

Alarm message:

Uncorrectable error, dimm is arg1 

or

arg1 triggered an uncorrectable error, arg2

This alarm is generated when an uncorrectable error occurred on a DIMM.

This alarm is generated by the following sensors:

DIMMN

Attribute

Alarm ID

Alarm Severity

Auto Clear

0x0C01FFFF

Critical

Yes

Parameters

Name

Meaning

arg1, N

  • DIMM silkscreen, for example, DIMM020 (A) or DIMM010 (B)
  • CPU socket number and channel number.

arg2

Error code of the alarm.

Impact on the System

The DIMM cannot be used, which affects server performance.

Possible Causes

  • The DIMM is not installed in the correct slot.
  • The DIMM is faulty.
  • The mainboard is faulty.

Procedure

  1. Power off the server and check whether the DIMM installation positions are correct.

    For details about DIMM installation rules, see the server user guide.

    • If yes, go to 3.
    • If no, go to 2.

  2. Install the DIMMs in correct positions by referring to DIMM installation rules. Then check whether the alarm is cleared.

    • If yes, no further action is required.
    • If no, go to 3.

  3. Check whether alarms are generated for multiple DIMMs of the same channel.

    • If yes, go to 4.
    • If no, go to 5.

  4. Switch the DIMM with a functioning DIMM in the same chassis. Then, check whether the alarm is still generated for this DIMM.

    • If yes, go to 5.
    • If no, go to 6.

  5. Replace the DIMM. Then check whether the alarm is cleared.

    • If yes, no further action is required.
    • If no, go to 7.

  6. Replace the mainboard. Then check whether the alarm is cleared.

    • If yes, no further action is required.
    • If no, go to 7.

  7. On the HMM WebUI, choose System Management > Information Collection, and collect logs.
  8. Contact Huawei technical support.
Translation
Download
Updated: 2018-08-16

Document ID: EDOC1000015902

Views: 193408

Downloads: 1567

Average rating:
This Document Applies to these Products
Related Documents
Related Version
Share
Previous Next