Huawei Rack Server iBMC Alarm Handling 28

This document describes iBMC alarms in terms of the meaning, impact on the system, possible causes, and handling suggestions.
ALM-0x0000006D CPU Core Overtemperature (CPU, Minor Alarm)

Description

Alarm message:

The CPU arg1 core temperature (arg2 degrees C) exceeds the temperature upper threshold (arg3 degrees C) (SN: arg4, PN: arg5).
NOTE:

From iBMC V316, CPU and disk alarms also include the SN and part number, and mainboard and memory alarms also include the part number.
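For log processing, the message template above can be matched with a regular expression. The following is a minimal sketch, not part of iBMC: the names `CpuOvertempAlarm` and `parse_alarm` are hypothetical, and it assumes the message text follows the template exactly, with the SN/PN suffix treated as optional because versions earlier than V316 do not include it.

```python
import re
from typing import NamedTuple, Optional


class CpuOvertempAlarm(NamedTuple):
    """Fields of the alarm message (hypothetical container, not an iBMC type)."""
    socket: str                    # arg1: socket No. of the CPU
    reading_c: float               # arg2: current reading of the sensor
    threshold_c: float             # arg3: alarm threshold
    serial_number: Optional[str]   # arg4: present from iBMC V316
    part_number: Optional[str]     # arg5: present from iBMC V316


# Assumption: readings are plain decimal numbers; the SN/PN group is optional.
_PATTERN = re.compile(
    r"The CPU (?P<socket>\S+) core temperature "
    r"\((?P<reading>-?\d+(?:\.\d+)?) degrees C\) "
    r"exceeds the temperature upper threshold "
    r"\((?P<threshold>-?\d+(?:\.\d+)?) degrees C\)"
    r"(?: \(SN: (?P<sn>[^,]*), PN: (?P<pn>[^)]*)\))?"
)


def parse_alarm(message: str) -> Optional[CpuOvertempAlarm]:
    """Return the parsed fields, or None if the text does not match the template."""
    m = _PATTERN.search(message)
    if m is None:
        return None
    return CpuOvertempAlarm(
        socket=m.group("socket"),
        reading_c=float(m.group("reading")),
        threshold_c=float(m.group("threshold")),
        serial_number=m.group("sn"),
        part_number=m.group("pn"),
    )
```

A message without the SN/PN suffix still parses; `serial_number` and `part_number` are then `None`.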

This alarm is generated when the CPU core temperature exceeds the minor threshold.

Alarm object: CPU

Attribute

Alarm ID      Alarm Severity    Auto Clear
0x0000006D    Minor             Yes

Parameters

Name    Meaning
arg1    Socket No. of the CPU.
arg2    Current reading of the sensor.
arg3    Alarm threshold.
arg4    CPU serial number.
arg5    Part number.

Impact on the System

Overheating affects CPU performance and server operation.

Possible Causes

  • A fan module is faulty.

  • The equipment room temperature exceeds the normal range.

  • The air inlet or outlet is blocked.

  • A CPU is faulty.

Procedure

  1. Check whether there are fan module alarms.

    • If yes, go to 2.

    • If no, go to 3.

  2. Replace the faulty fan or fan module. After 5 minutes, check whether the alarm is cleared.

    • If yes, no further action is required.

    • If no, go to 3.

  3. Check whether the equipment room temperature exceeds the normal range.

    • If yes, go to 4.

    • If no, go to 5.

  4. Reduce the equipment room temperature to the normal range. After 5 minutes, check whether the alarm is cleared.

    • If yes, no further action is required.

    • If no, go to 5.

  5. Check whether the air inlet or outlet of the server is blocked.

    • If yes, go to 6.

    • If no, go to 7.

  6. Clear the blockage. After 5 minutes, check whether the alarm is cleared.

    • If yes, no further action is required.

    • If no, go to 7.

  7. Replace the mainboard. Then, check whether the alarm is cleared.

    • If yes, no further action is required.

    • If no, go to 8.

  8. Contact Huawei technical support.
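The branching in the procedure above can be summarized as an ordered action list. The sketch below is illustrative only: `Findings` and `planned_actions` are hypothetical names, the three checks are supplied by the operator as booleans, and the operator stops at the first action after which the alarm is cleared.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Findings:
    """Results of the three checks in steps 1, 3, and 5 (hypothetical type)."""
    fan_alarm: bool                # step 1: fan module alarms are present
    room_temp_out_of_range: bool   # step 3: equipment room temperature exceeds the normal range
    airflow_blocked: bool          # step 5: air inlet or outlet is blocked


def planned_actions(f: Findings) -> List[str]:
    """Return the actions in procedure order; stop once the alarm is cleared."""
    actions: List[str] = []
    if f.fan_alarm:
        # Step 2
        actions.append("Replace the faulty fan or fan module; recheck after 5 minutes")
    if f.room_temp_out_of_range:
        # Step 4
        actions.append("Reduce the equipment room temperature to the normal range; recheck after 5 minutes")
    if f.airflow_blocked:
        # Step 6
        actions.append("Clear the blockage; recheck after 5 minutes")
    # Steps 7 and 8 apply whenever the earlier actions did not clear the alarm.
    actions.append("Replace the mainboard; recheck the alarm")
    actions.append("Contact Huawei technical support")
    return actions
```

For example, `planned_actions(Findings(False, False, False))` skips steps 2, 4, and 6 and returns only the mainboard replacement followed by the support escalation.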
Updated: 2019-06-04

Document ID: EDOC1000054724
