E9000 Server V100R001 HMM Alarm Handling 19

This document describes E9000 server alarms in terms of the meaning, impact on the system, possible causes, and solutions.
Slot: Overtemperature (Minor, SSDN Temp)

Description

Alarm message:

Temperature above minor threshold

or

The [arg1] PCIe card arg2 (arg3) temperature (arg4 degrees C) exceeds the overtemperature threshold (arg5 degrees C). 

This alarm is generated when the SSD card temperature exceeds the minor overtemperature threshold. It is cleared automatically when the temperature returns to the normal range.

This alarm is generated by the following sensors:

SSDN Temp
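
The raise and clear behavior described above can be sketched as a small state holder. This is only an illustration, not the HMM firmware logic: the class name, the 70.0 threshold value, and the absence of hysteresis are all assumptions made for the example.

```python
from dataclasses import dataclass


@dataclass
class MinorOvertempAlarm:
    """Tracks the minor overtemperature alarm state for one SSD card sensor."""

    threshold_c: float  # minor overtemperature threshold (arg5 in the alarm message)
    active: bool = False

    def update(self, reading_c: float) -> bool:
        """Feed one sensor reading (arg4) and return whether the alarm is active."""
        # The alarm is raised only when the reading exceeds the threshold.
        # Assumption: "within the normal range" means at or below the threshold;
        # real firmware may apply hysteresis before auto-clearing.
        self.active = reading_c > self.threshold_c
        return self.active


alarm = MinorOvertempAlarm(threshold_c=70.0)  # 70.0 is an illustrative value only
states = [alarm.update(t) for t in (65.0, 70.0, 71.5, 69.0)]
print(states)
```

A reading equal to the threshold does not raise the alarm in this sketch, because the source says the temperature must exceed the threshold.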

Attribute

Alarm ID      Alarm Severity   Auto Clear
0x0147FF01    Minor            Yes

Parameters

Name       Meaning
arg1       Location of the PCIe card. For example, front or rear.
arg2, N    Slot number of the PCIe card, for example, 1 or 2.
arg3       Type of the PCIe card, for example, ES3000 V2 SSD Card.
arg4       Current reading of the sensor.
arg5       Alarm threshold.
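
When alarm text is collected from logs, the parameters above can be extracted with a regular expression built from the message template. The sketch below is hedged: the sample message is made up for illustration, the function and field names are hypothetical, and the exact punctuation that the HMM emits around each argument is an assumption.

```python
import re

# Pattern derived from the documented template; field names are illustrative.
ALARM_RE = re.compile(
    r"The (?P<location>\S+) PCIe card (?P<slot>\d+) \((?P<card_type>[^)]*)\) "
    r"temperature \((?P<reading>-?\d+(?:\.\d+)?) degrees C\) exceeds the "
    r"overtemperature threshold \((?P<threshold>-?\d+(?:\.\d+)?) degrees C\)"
)


def parse_overtemp_alarm(message):
    """Return the alarm parameters as a dict, or None if the text does not match."""
    match = ALARM_RE.search(message)
    if match is None:
        return None
    return {
        "location": match.group("location"),            # arg1
        "slot": int(match.group("slot")),               # arg2
        "card_type": match.group("card_type"),          # arg3
        "reading_c": float(match.group("reading")),     # arg4
        "threshold_c": float(match.group("threshold")),  # arg5
    }


# Hypothetical message that follows the documented template.
SAMPLE = (
    "The front PCIe card 1 (ES3000 V2 SSD Card) temperature (75 degrees C) "
    "exceeds the overtemperature threshold (70 degrees C)."
)
parsed = parse_overtemp_alarm(SAMPLE)
print(parsed)
```

The short form of the message, "Temperature above minor threshold", carries no parameters, so the function returns None for it.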

Impact on the System

Overheating affects PCIe card performance. If this alarm is not cleared in time, services related to the PCIe card will be interrupted.

Possible Causes

  • A fan module is faulty or not installed.
  • The service load is excessively heavy.
  • The equipment room temperature exceeds the normal range.
  • The air inlet or outlet is blocked.
  • Filler panels are not installed in vacant slots or spaces.
  • The SSD card is faulty.

Procedure

  1. Check whether a fan fault alarm has been generated.

    • If yes, go to 2.
    • If no, go to 3.

  2. Clear the fan fault alarm. Then check whether the alarm is cleared.

    • If yes, no further action is required.
    • If no, go to 3.

  3. Check whether all fans are properly installed.

    • If yes, go to 5.
    • If no, go to 4.

  4. Install fans properly in any empty fan slots. Then check whether the alarm is cleared.

    For details about how to install the fans, see the E9000 Server User Guide.
    • If yes, no further action is required.
    • If no, go to 5.

  5. Check whether the service load on the server is excessively heavy.

    • If yes, go to 6.
    • If no, go to 7.

  6. Stop non-critical services to reduce the service load on the server. Then check whether the alarm is cleared.

    • If yes, no further action is required.
    • If no, go to 7.

  7. Check whether the ambient temperature is higher than 40°C (104°F).

    • If yes, go to 8.
    • If no, go to 9.

  8. Lower the ambient temperature in the equipment room by using air conditioners and fans, for example, by lowering the air conditioner temperature setting and increasing the fan speed. If the air conditioners are not working properly, open doors and windows for ventilation. Then check whether the alarm is cleared.

    • If yes, no further action is required.
    • If no, go to 9.

  9. Check whether the air intake vent or air exhaust vent is blocked.

    • If yes, go to 10.
    • If no, go to 11.

  10. Remove the obstructions. Then check whether the alarm is cleared.

    • If yes, no further action is required.
    • If no, go to 11.

  11. Check whether filler panels are installed in all vacant slots or spaces of the chassis.

    • If yes, go to 13.
    • If no, go to 12.

  12. Install a filler panel in each vacant slot. Then check whether the alarm is cleared.

    • If yes, no further action is required.
    • If no, go to 13.

  13. Replace the SSD card. Then check whether the alarm is cleared.

    • If yes, no further action is required.
    • If no, go to 14.

  14. On the HMM WebUI, choose System Management > Information Collection, and collect logs.
  15. Contact Huawei technical support.
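
The branching in the procedure above can be written down as a small transition table, which makes it easier to see which steps a given situation passes through. This is a sketch for illustration only: the table and function names are hypothetical, the step texts are shortened, and nothing here talks to the HMM.

```python
# Check steps: step -> (question, next step if yes, next step if no)
CHECKS = {
    1: ("Fan fault alarm generated?", 2, 3),
    3: ("All fans properly installed?", 5, 4),
    5: ("Service load excessively heavy?", 6, 7),
    7: ("Ambient temperature above 40 C (104 F)?", 8, 9),
    9: ("Air intake or exhaust vent blocked?", 10, 11),
    11: ("Filler panels in all vacant slots?", 13, 12),
}
# Action steps: step -> (action, next step if the alarm persists)
ACTIONS = {
    2: ("Clear the fan fault alarm", 3),
    4: ("Install the missing fans", 5),
    6: ("Stop non-critical services", 7),
    8: ("Lower the ambient temperature", 9),
    10: ("Remove the obstructions", 11),
    12: ("Install filler panels", 13),
    13: ("Replace the SSD card", 14),
}
ESCALATION = [14, 15]  # collect logs, then contact technical support


def walk(answers, clears_alarm):
    """Return the list of step numbers visited.

    answers: dict mapping each check step reached -> bool (True means "yes").
    clears_alarm: set of action steps after which the alarm is cleared.
    """
    path, step = [], 1
    while True:
        path.append(step)
        if step in CHECKS:
            _, if_yes, if_no = CHECKS[step]
            step = if_yes if answers[step] else if_no
        elif step in ACTIONS:
            if step in clears_alarm:
                return path  # alarm cleared, no further action is required
            step = ACTIONS[step][1]
        else:
            # Step 14 reached: finish the escalation sequence with step 15.
            path.extend(ESCALATION[1:])
            return path


answers = {1: False, 3: True, 5: True, 7: False, 9: False, 11: True}
print(walk(answers, clears_alarm={6}))    # load reduction clears the alarm
print(walk(answers, clears_alarm=set()))  # nothing helps, so escalate
```

With these example answers, the first call stops after step 6, and the second call continues through SSD card replacement to log collection and technical support.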
Updated: 2018-08-16

Document ID: EDOC1000015902
