No relevant resource is found in the selected language.

This site uses cookies. By continuing to browse the site you are agreeing to our use of cookies. Read our privacy policy>Search

Reminder

To have a better experience, please upgrade your IE browser.

upgrade

Huawei Rack Server iBMC Alarm Handling 28

This document describes iBMC alarms in terms of the meaning, impact on the system, possible causes, and handling suggestions.
Rate and give feedback :
Huawei uses machine translation combined with human proofreading to translate this document to different languages in order to help you better understand the content of this document. Note: Even the most advanced machine translation cannot match the quality of professional translators. Huawei shall not bear any responsibility for translation accuracy and it is recommended that you refer to the English document (a link for which has been provided).
ALM-0x0147FFFF Above Upper Minor Threshold (SSD DiskN Temp)

ALM-0x0147FFFF Above Upper Minor Threshold (SSD DiskN Temp)

Description

Alarm message:

Above upper minor threshold

This alarm is generated when the sensor detects that the temperature of the solid-state disk (SSD) is higher than the minor threshold. This alarm is cleared when the sensor detects that the SSD temperature falls below the threshold.

Sensor triggering the alarm: SSD DiskN Temp

Attribute

Alarm ID Alarm Severity Auto Clear

0x0147FFFF

Minor

Yes

Parameters

Name Meaning
N Indicates the serial number of the bay in which the SSD is installed.

Impact on the System

The components cannot operate stably, which shortens the service life of the server and increases power consumption. If the alarm persists, the server powers off or restarts, which interrupts services and causes data loss.

Possible Causes

  • The fan module is faulty.

  • The ambient temperature exceeds the normal range.

  • The air inlet or outlet is blocked.

  • Idle disk bays are not installed with hard disk fillers.

  • Air ducts are not installed properly.

  • The hard disk backplane of the SSD is faulty.
  • The SSD is faulty.

The location of the hard disk backplane varies with the server model. For details, see the user guide of the server you use.

The RH2288H V3 server is used as an example to describe how to clear the alarm. In an RH2288H V3 server, the front hard disk backplane is the hard disk backplane of the SSDs. For details, see the RH2288H V3 Server User Guide.

Procedure

  1. Check whether an alarm indicating low fan speed is generated for a fan module.

    You can obtain alarm information in either of the following ways:
    • View alarm information on the Current Alarms page of the iBMC WebUI.
    • Run the ipmcget -d healthevents command on the iBMC CLI.
    • If yes, go to 2.

    • If no, go to 5.

  2. Remove and reinstall the fan module. Five minutes later, check whether the fan module alarm is cleared.

    • If yes, go to 4.

    • If no, go to 3.

  3. Replace the fan module. After 5 minutes, checkwhether the fan module alarm is cleared.

    For details about how to replace the fan module, see the server user guide.

    • If yes, go to 4.

    • If no, go to 15.

  4. Check whether the SSD overheating alarm is cleared.

    • If yes, no further action is required.

    • If no, go to 5.

  5. Check whether the ambient temperature exceeds the normal range.

    • If yes, go to 6.

    • If no, go to 7.

  6. Lower the ambient temperature to the normal range. After 5 minutes, check whether the alarm is cleared.

    • If yes, no further action is required.

    • If no, go to 7.

  7. Check whether the air inlet or outlet is blocked.

    • If yes, go to 8.

    • If no, go to 9.

  8. Remove the blockage from the air inlet or outlet. Then, check whether the alarm is cleared.

    • If yes, no further action is required.

    • If no, go to 9.

  9. Check whether there are empty disk bays.

    • If yes, go to 10.

    • If no, go to 11.

  10. Install hard disk fillers in all empty bays. After 5 minutes, check whether the alarm is cleared.

    • If yes, no further action is required.

    • If no, go to 11.

  11. Check whether air ducts are installed properly.

    • If yes, go to 13.

    • If no, go to 12.

  12. Install air ducts properly. After 5 minutes, ,check whether the alarm is cleared.

    For details about how to install air ducts, see the server user guide.

    • If yes, no further action is required.

    • If no, go to 13

  13. Replace the hard disk backplane of the SSD. Then, check whether the alarm is cleared.

    For details about how to replace the hard disk backplane, see "Replacing Parts" in the user guide.

    In this example, replace the front hard disk backplane, and then check whether the alarm is cleared.

    • If yes, no further action is required.

    • If no, go to 14.

  14. Replace the SSD. Then, check whether the alarm is cleared.

    For details, see "Replacing Parts" in the server user guide.

    • If yes, no further action is required.

    • If no, go to 15.

  15. Contact Huawei technical support.
Download
Updated: 2019-06-04

Document ID: EDOC1000054724

Views: 244206

Downloads: 2950

Average rating:
This Document Applies to these Products
Related Documents
Related Version
Share
Previous Next