Diagnose and rectify memory faults depending on the symptoms.
- If a fault can be located using logs or tools, see "Handling Procedure". If a fault needs to be rectified quickly onsite, see "Quick Recovery Method".
- For more fault symptoms and solutions, see the Computing Product Case Library. The Computing Product Case Library is available only to Huawei partners and Huawei engineers.
Fault Symptom
|
Handling Procedure
|
Quick Recovery Method
|
The memory capacity detected by the system is less than the configured memory capacity.
|
- Check whether the DIMMs are compatible with the server by using Computing Product Compatibility Checker.
- Check whether the current memory capacity is supported by the OS. For details about the memory capacity supported by each OS, see the related OS documents.
- Check whether the DIMM installation positions meet configuration rules.
- If yes, go to 4.
- If no, reinstall the DIMMs in correct slots according to the configuration rules.
- Check whether a "DIMMxxx configuration error" alarm is generated by iBMC.
- If yes, replace the faulty DIMM. For details, see Handling Alarms.
- If no, go to 5.
- Check whether any DIMM slots are abnormal. If a DIMM slot is abnormal, replace the mainboard.
|
- If the iBMC generates the "DIMMxxx Configuration Error" alarm, replace the related DIMM.
- If the DIMM status displayed in iBMC or the OS is abnormal (unidentified or faulty), replace the faulty DIMMs.
- If DIMMs do not comply with the DIMM installation rules, use Computing Product Memory Configuration Assistant to reinstall the DIMMs.
- If DIMM installation slots are faulty, replace the mainboard.
|
An uncorrectable DIMM error is generated.
|
Install the faulty DIMM on a different channel and use Smart Provisioning to test the DIMM. - If the fault is caused by the DIMM, replace the DIMM.
- If the fault occurs on the same DIMM slot, check the DIMM slot. If the DIMM slot is damaged, replace the mainboard.
|
- Exchange a DIMM you suspect to be faulty with a DIMM that is functioning correctly. Then, determine whether the fault is caused by the DIMM or DIMM slot.
- If the fault is caused by the DIMM you suspect to be faulty, replace the DIMM.
- If the fault is caused by the DIMM slot, replace the mainboard.
- If the preceding steps do not reproduce the fault, use Smart Provisioning to perform memory pressure tests. If the fault is reproduced, perform 1. Otherwise, contact Huawei technical support.
|