In X6800 Chassis, PSUs Number 2, 3 and 4 cannot power ON

Publication Date: 2018-01-19
Issue Description

In an X6800 chassis, the PSUs in slots 2, 3, and 4 cannot power on.

Alarm Information

The LED indicators on the PSUs do not light up, which indicates that the chassis may have a power supply problem or an HMM problem.

 

Handling Process

1.    Verified the power consumption requirements (http://support.huawei.com/onlinetoolsweb/ftpa/indexEn?serise=3).

2.    Collected the chassis hardware logs and the iBMC logs for the nodes in the chassis.

a.     The iBMC logs were collected with one-click information collection.

b.    The chassis hardware logs were collected with the following CLI command:

ipmcget -d diaginfo

3.    The log review revealed the following:

·         Only the PSU in slot 1 works normally, delivering 343 W. The PSUs in the other slots were operating at only 21 W.

·         The fan speed readings were also abnormal.

dump_info.tar.gz\dump_info\AppDump\sensor_alarm
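The comparison made during the log review can be sketched in Python. This is only an illustration: the function name `find_underpowered_psus` and the 25% threshold are assumptions, not part of any Huawei tool, and the wattage values are typed in by hand from the readings quoted above rather than parsed from the log.

```python
from typing import Dict, List


def find_underpowered_psus(watts_by_slot: Dict[int, float], ratio: float = 0.25) -> List[int]:
    """Return the slots whose output is below `ratio` times the highest reading.

    The 0.25 default is an assumed threshold chosen for illustration only.
    """
    if not watts_by_slot:
        return []
    peak = max(watts_by_slot.values())
    if peak <= 0:
        # No PSU is delivering power, so there is no healthy reference to compare against.
        return sorted(watts_by_slot)
    return sorted(slot for slot, watts in watts_by_slot.items() if watts < ratio * peak)


# Readings reported in this case: slot 1 at 343 W, slots 2 to 4 at 21 W.
readings = {1: 343.0, 2: 21.0, 3: 21.0, 4: 21.0}
print(find_underpowered_psus(readings))  # prints [2, 3, 4]
```

A low reading alone does not prove a fault, so a result like this is best treated as a prompt to check the fan readings and the HMM as well.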



Root Cause

The abnormal PSU and fan behavior was caused by a faulty HMM, the module responsible for server management. The HMM monitors power supply and fan status, collects asset information and real-time data, and performs fan speed adjustment and network convergence. A faulty HMM can therefore cause the fans and PSUs to misbehave.

Solution

Replace the HMM through RMA. After the HMM is replaced, the PSUs and fans work correctly.

The faulty HMM was replaced with the following part:

Part number: 03022RHS

Quantity: 1

Description: Manufactured Board,X6800,BC21SMMA,X6800 Shelf Management Module,Board ID 0X65

After the management module was replaced, all PSU LED indicators lit correctly and the fan readings in the logs returned to normal.

 

Suggestions

Check the dump_info log files for abnormal readings from chassis components. Such readings are a hint that the HMM may be faulty and should be investigated further.
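As one way to reach that log quickly, the Python sketch below reads the sensor_alarm file straight out of the collected archive using the standard `tarfile` module. It assumes the archive is a gzip-compressed tar whose member path ends in `AppDump/sensor_alarm`, as the path in this case suggests. The function name `read_sensor_alarm` is hypothetical, and the file's encoding and layout are not documented here, so the text is returned as-is for manual review.

```python
import tarfile
from typing import Optional


def read_sensor_alarm(archive_path: str, suffix: str = "AppDump/sensor_alarm") -> Optional[str]:
    """Return the text of the first archive member whose path ends with `suffix`.

    Returns None when no such member exists in the archive.
    """
    with tarfile.open(archive_path, "r:gz") as archive:
        for member in archive.getmembers():
            # Normalise separators in case the archive was built with backslashes.
            name = member.name.replace("\\", "/")
            if member.isfile() and name.endswith(suffix):
                handle = archive.extractfile(member)
                if handle is None:
                    continue
                # The encoding is an assumption; undecodable bytes are replaced.
                return handle.read().decode("utf-8", errors="replace")
    return None
```

Calling `read_sensor_alarm("dump_info.tar.gz")` returns the file contents as a string, or `None` if the archive has no matching member.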
