No relevant resource is found in the selected language.

This site uses cookies. By continuing to browse the site you are agreeing to our use of cookies. Read our privacy policy>Search

Reminder

To have a better experience, please upgrade your IE browser.

upgrade

KunLun 9008 V5 Alarm Handling 05

Rate and give feedback:
Huawei uses machine translation combined with human proofreading to translate this document to different languages in order to help you better understand the content of this document. Note: Even the most advanced machine translation cannot match the quality of professional translators. Huawei shall not bear any responsibility for translation accuracy and it is recommended that you refer to the English document (a link for which has been provided).
ALM-0x0000000F System Shutdown Due to CPU Overtemperature (CPU, Critical Alarm)

ALM-0x0000000F System Shutdown Due to CPU Overtemperature (CPU, Critical Alarm)

Description

Alarm message (iBMC versions earlier than 2.96):

CPU arg1 temperature is too high and the server will be powered off.

Alarm message (iBMC 2.96 and later versions):

The OS was shut down due to CPU arg1 overheating.

This alarm is generated when the OS was shut down due to CPU overheating.

Alarm object: CPU

Attribute

Alarm ID

Alarm Severity

Auto Clear

0x0000000F

Critical

Yes

Parameters

Name

Meaning

arg1

Slot number of the CPU.

Impact on the System

Services are interrupted.

Possible Causes

  • A fan module is faulty.
  • The equipment room temperature exceeds the normal range.
  • The air inlet or outlet is blocked.
  • The air duct is not installed.
  • The heat sink is not properly connected to the CPU.
  • A CPU is faulty.

Procedure

  1. Check whether both air inlet and outlet high temperature alarms are generated.

  2. Rectify the fault according to troubleshooting suggestions. Then, check whether the alarm is cleared.

    • If yes, no further action is required.
    • If no, go to Step 3.

  3. Power off the server, and check whether the air duct is properly installed in the server.

  4. Install the air duct properly, and power on the server. Then, check whether the alarm is cleared.

    • If yes, no further action is required.
    • If no, go to Step 5.

  5. Power off the server, and check whether the CPU heat sink is properly installed.

  6. Install the CPU heat sink properly, and power on the server. Then, check whether the alarm is cleared.

    • If yes, no further action is required.
    • If no, go to Step 7.

  7. Power off the server, and apply thermal compound evenly to the top of the CPU. Power on the server, and check whether the alarm is cleared.

    • If yes, no further action is required.
    • If no, go to Step 8.

  8. Replace the faulty CPU. Then, check whether the alarm is cleared.

    • If yes, no further action is required.
    • If no, go to Step 9.

  9. Getting Help.
Translation
Download
Updated: 2019-05-25

Document ID: EDOC1100023838

Views: 102377

Downloads: 17

Average rating:
This Document Applies to these Products
Related Documents
Related Version
Share
Previous Next