No relevant resource is found in the selected language.

This site uses cookies. By continuing to browse the site you are agreeing to our use of cookies. Read our privacy policy>Search

Reminder

To have a better experience, please upgrade your IE browser.

upgrade
FusionServer Pro E9000 Server iBMC (Earlier than V250) Alarm Handling 02
Rate and give feedback:
Huawei uses machine translation combined with human proofreading to translate this document to different languages in order to help you better understand the content of this document. Note: Even the most advanced machine translation cannot match the quality of professional translators. Huawei shall not bear any responsibility for translation accuracy and it is recommended that you refer to the English document (a link for which has been provided).
ALM-0149FFFF/0149FF01 Above Upper Major Threshold (PCIE Temp)

ALM-0149FFFF/0149FF01 Above Upper Major Threshold (PCIE Temp)

Description

Alarm message:

Above upper major threshold

This alarm is generated when the sensor detects that the temperature of a PCIe card is higher than the upper threshold for major alarms. This alarm is cleared when the sensor detects that the temperature of the PCIe card is lower than the upper threshold.

This alarm is generated by the following sensor:

  • PCIE Temp

Attribute

Alarm ID

Alarm Severity

Auto Clear

iBMC: 0149FFFF

MM: 0149FF01

Major

Yes

Parameters

Name

Meaning

Time

Indicates the time when an alarm is generated.

Sensor

Name of the sensor that generated the alarm.

Event

Provides detailed information about the alarm.

Severity

Indicates the severity of the alarm.

Event Code

Indicates the event code of an alarm.

Impact on the System

The PCIe card overtemperature may cause the PCIe card or system to break down, interrupting services.

Possible Causes

  • A fan module is faulty.
  • The ambient temperature is higher than 30°C (86°F).
  • The air intake vent is blocked.
  • The air exhaust vent is blocked.
  • The heat sink is in poor contact with the CPU.
  • The PCIe card is faulty.

Procedure

  1. Check whether a low fan speed alarm is generated for the fan module.

    Log in to the MM910 command-line interface (CLI) or WebUI, and view fan module alarms.

  2. Remove and then reinstall the fan module. 5 minutes later, check whether the fan module alarm is cleared.

    • If yes, no further action is required.
    • If no, go to Step 3.

  3. Replace the fan module. 5 minutes later, check whether the fan module alarm is cleared.

    • If yes, no further action is required.
    • If no, go to Step 4.

  4. Check whether the ambient temperature is higher than 30°C (86°F).

  5. Lower the ambient temperature to a range of 10°C to 30°C (50°F to 86°F). Five minutes later, check whether the alarm is cleared.

    • If yes, no further action is required.
    • If no, go to Step 6.

  6. Check whether the air intake or exhaust vent is blocked.

  7. Remove barriers from the air intake or exhaust vent. Then check whether the alarm is cleared.

    • If yes, no further action is required.
    • If no, go to Step 8.

  8. Power off the server, remove the PCIe card, and check whether the heat sink on the PCIe card is in poor contact.

  9. Remove and reinstall the PCIe card heat sink and reinstall the PCIe card. Power on the server. 5 minutes later, check whether the alarm is cleared.

    • If yes, no further action is required.
    • If no, go to Step 10.

  10. Replace the PCIe card, then check whether the alarm is cleared.

    • If yes, no further action is required.
    • If no, go to Step 11.

  11. Contact Huawei technical support.
Translation
Download
Updated: 2019-11-19

Document ID: EDOC1100035007

Views: 32419

Downloads: 12

Average rating:
This Document Applies to these Products

Related Version

Related Documents

Share
Previous Next