No relevant resource is found in the selected language.

This site uses cookies. By continuing to browse the site you are agreeing to our use of cookies. Read our privacy policy>Search

Reminder

To have a better experience, please upgrade your IE browser.

upgrade

FusionServer Pro E9000 Server iBMC (Earlier than V250) Alarm Handling 02

Rate and give feedback:
Huawei uses machine translation combined with human proofreading to translate this document to different languages in order to help you better understand the content of this document. Note: Even the most advanced machine translation cannot match the quality of professional translators. Huawei shall not bear any responsibility for translation accuracy and it is recommended that you refer to the English document (a link for which has been provided).
ALM-0701FFFF Critical Alarm for CPU Temperature (Thermal Trip) (CPUN Status)

ALM-0701FFFF Critical Alarm for CPU Temperature (Thermal Trip) (CPUN Status)

Description

Alarm message:

Critical alarm for CPU temperature (thermal trip)

This alarm is generated when the iBMC detects a signal sent from the CPU that the CPU core temperature is excessively high. This alarm is cleared when the system detects that the temperature is restored to the acceptable range.

This alarm is generated by the following sensor:

  • CPUN Status (N indicates a CPU number.)

Attribute

Alarm ID

Alarm Severity

Auto Clear

0701FFFF

Critical

Yes

Parameters

Name

Meaning

Time

Time when an alarm is generated.

Sensor

Name of the sensor that generates an alarm.

Event

Details about an alarm.

Severity

Severity of an alarm.

Event Code

Event code that corresponds to an alarm.

Impact on the System

When the CPU core temperature is excessively high, the system powers off the mainboard forcibly for self-protection purpose. As a result, the services on the mainboard are interrupted, and data is lost.

Alarms are generated by the CPUN Prochot and CPUN DTS sensors before this alarm. Cool down the server before the increased temperature triggers this alarm.

Possible Causes

  • A fan module is faulty.
  • The service volume is massive.
  • The temperature in the equipment room is excessively high.
  • The air intake vent is blocked.
  • The air exhaust vent is blocked.
  • The heat sink is not properly connected to the mainboard.
  • The mainboard is faulty.

Procedure

  1. Log in to the MM910 CLI or WebUI, and check whether a lower fan speed alarm is generated for the fan module.

  2. Pull off and insert the fan module. 5 minutes later, check whether the fan alarm is cleared.

  3. Replace the fan module. 5 minutes later, check whether the fan alarm is cleared.

    For details, see the server user guide.

  4. Check whether the services running on the server are in massive volume.

  5. Stop non-critical services to lower the service load on the device. Then check whether the alarm is cleared.

    • If yes, no further action is required.
    • If no, go to Step 6.

  6. Check whether the ambient temperature is excessively high.

  7. Lower the ambient temperature to a range of 10°C to 30°C (50°F to 86°F). 5 minutes later, check whether the alarm is cleared.

    • If yes, no further action is required.
    • If no, go to Step 8.

  8. Check whether the air intake vent or air exhaust vent is blocked.

  9. Remove barriers. Then check whether the alarm is cleared.

    • If yes, go further action is required.
    • If no, go to Step 10.

  10. Power off the server, and open the chassis. Then check whether the heat sink is properly connected to the CPU.

  11. Remove and then install the heat sink. 5 minutes later, check whether the alarm is cleared.

    • If yes, no further action is required.
    • If no, go to Step 12.

  12. Replace the mainboard, then check whether the alarm is cleared.

    • If yes, no further action is required.
    • If no, go to Step 13.

  13. Contact Huawei technical support for help.
Translation
Download
Updated: 2019-11-19

Document ID: EDOC1100035007

Views: 28930

Downloads: 11

Average rating:
This Document Applies to these Products

Related Version

Related Documents

Share
Previous Next