The following error message is found in the SEL log of an RH5885H V3 server, while the RH5885H V3 server can start and its OS can be accessed normally.
718, Major,2018-07-10 18:37:39,2018-07-10 18:37:39,System Firmware Progress (SysFWProgress),POST error.Unrecoverable hard-disk/ATAPI/IDE device failure,0F0003FF,Asserted
The troubleshooting procedure is as follows:
1. View the SEL log. According to the SEL log, this alarm is generated about four minutes after the server starts. The error message itself also indicates that the error occurs in the POST phase.
2. View the serial port log (no such a log is available for RH5885H V3 servers) and FDM diagnosis information. No hardware error is reported. Replace the main board, RAID disks, hard disk backplane, BMC daughter board, cables and hard disks in sequence. The problem persists. This error is also reported when the server is mounted to the Toolkit.
3. Analyze the possible error sources BMC and BIOS. It is confirmed that the error is reported by the BIOS. The AMI analysis indicates that the fault is caused by the device on the PCH.
The reply from AMI is as follows:
IDE bus driver is used to to support ATA/ATAPI device Detection and Enumeration,it is included in CRB code,you can check the module of IdeBus.
According to the error information of "Unrecoverable hard-disk/ATAPI/IDE device failure",it belong to error code DXE_IDE_DEVICE_FAILURE,may be you need to check your HDD device.
Bios will collect progress and error codes and convert them into SEL format, then use Add SEL Entry Command to log to SEL.
4. Check the devices on the PCH based on the AMI's reply. The problem is resolved after the CD-ROM drive is removed.
The hard-disk, ATAPI, or IDE device on the PCH is abnormal. As a result, the POST error is reported.