No relevant resource is found in the selected language.

This site uses cookies. By continuing to browse the site you are agreeing to our use of cookies. Read our privacy policy>Search

Reminder

To have a better experience, please upgrade your IE browser.

upgrade

KunLun Mission Critical Server V100R001 CMC Alarm Handling 09

This document describes KunLun 9016 and 9032 alarms in the CMC, in terms of their meanings, impact on the system, possible causes, and handling suggestions.
Rate and give feedback :
Huawei uses machine translation combined with human proofreading to translate this document to different languages in order to help you better understand the content of this document. Note: Even the most advanced machine translation cannot match the quality of professional translators. Huawei shall not bear any responsibility for translation accuracy and it is recommended that you refer to the English document (a link for which has been provided).
ALM-28000005 CPU QPI Link Failed

ALM-28000005 CPU QPI Link Failed

Description

Alarm message:

CPU # QPI # link failed.

Attribute

Alarm ID

Alarm Severity

Auto Clear

28000005

Major

Yes

Parameters

Name

Meaning

Alarm Severity

Indicates the alarm severity.

Alarm Source

Indicates the alarm source.

Subject

Indicates the event body for which an alarm is generated.

Time

Indicates the time when an alarm is generated.

Description

Provides an alarm description.

Event Code

Indicates the event code of an alarm.

Impact on the System

Server performance may be affected.

Possible Causes

  • The CPU socket is damaged or in poor contact.
  • The CPU is faulty.
  • The mainboard is faulty.

Procedure

  1. Gracefully power off the server.
  2. Remove the CPU, and check whether the CPU socket has bent pins.

    For details about how to remove a CPU, see the KunLun 90xx V100R001 User Guide.

  3. Check whether the CPU is faulty.

    The following alarm message is used as an example to describe the check method:

    Cable / Interconnect (CPU1 QPI Link)
    1. Exchange the positions of CPU 1 and a functional CPU.
    2. Power on the server. If an alarm is still generated for CPU 1, CPU 1 is faulty. Otherwise, the QPI link is faulty.

  4. Gracefully power off the server, replace the CPU, and power it on again. Then check whether the alarm is cleared.

    • If yes, no further action is required.
    • If no, go to Step 6.

  5. Gracefully power off the server, replace the mainboard, and power it on again. Then check whether the alarm is cleared.

    • If yes, no further action is required.
    • If no, go to Step 6.

  6. Contact Huawei technical support.
Translation
Download
Updated: 2018-12-29

Document ID: EDOC1000111849

Views: 60254

Downloads: 77

Average rating:
This Document Applies to these Products
Related Documents
Related Version
Share
Previous Next