OS Faults
OS Installation Faults
Diagnose and rectify OS installation faults based on the symptoms.
For more fault symptoms and solutions, see the Computing Product Case Library, which is available only to Huawei partners and Huawei engineers.
Possible Cause | Diagnosis Procedure |
---|---|
Incompatible OS | Use Computing Product Compatibility Checker to check whether the OS is compatible with the server. |
Incorrect installation method | Use Computing Product Compatibility Checker to check whether the OS is compatible with the server and view the corresponding OS installation guide. To obtain the OS installation guide, perform the following steps: |
Installation process issue | |
Drive identification issue | |
OS Faults
If you have confirmed that faults are not caused by other factors, diagnose them as follows:
Fault Symptom |
Diagnosis Method |
Conclusion |
---|---|---|
The server hangs or restarts unexpectedly. |
Check whether the Kdump information contains the names of crashed processes or card vendors. For example, FC_XX indicates a fault in a Fibre Channel (FC) device. |
The built-in OS drivers are incompatible. |
Check whether it is a PCIe card compatibility issue:
|
The PCIe card is incompatible. |
|
Use iBMC to locate the faulty component, for example, the DIMM, drive, or mainboard component for which an alarm is reported. |
Circuit hardware is faulty. |
|
If the OS logs contain read-only file system records, use Smart Provisioning to assess the health of the drive, and decide whether to replace the drive based on the result. |
A drive fault occurred. |
|
Check whether a Machine Check Exception (MCE) has occurred. To locate the fault, check /var/log/mce.log and the error codes in the Kdump information output over the serial port. |
|
|
Collect the following information:
After collecting the preceding information, determine whether the fault is specific to a single server or is a hardware issue. Then run Smart Provisioning to locate the fault. |
Locate the fault based on the report. |
|
The system breaks down under specific circumstances after an upgrade of the customer's service software, database, middleware, kernel, BIOS, iBMC, or storage devices. |
|
|
Check whether the Kdump information in the breakdown screenshot periodically displays update_cpu_power, divide_error, or timer_xx. |
The OS has bugs or kernel defects. |
|
Check whether the Kdump information in the breakdown screenshot non-periodically displays gethostbyname. |