EEM_1110000120 The System Resource Usage Is High
Description
The maintenance tool continuously samples the usage of server system resources, including CPU, memory, and disk space. This alarm is generated when every sample taken within a sampling period is greater than or equal to the threshold (90%).
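The alarm condition described above can be sketched as follows. This is a minimal illustration, not the maintenance tool's implementation: the function name, the constant name, and the choice to treat an empty sampling period as "no alarm" are assumptions made for the example.

```python
from typing import Sequence

# Threshold stated in the description (percent).
THRESHOLD_PERCENT = 90.0


def alarm_should_fire(samples: Sequence[float],
                      threshold: float = THRESHOLD_PERCENT) -> bool:
    """Return True only if every sample in the period is >= threshold.

    Assumption: a period with no samples does not raise the alarm.
    """
    return len(samples) > 0 and all(s >= threshold for s in samples)
```

Under this reading, a single sample below 90% within the sampling period is enough to keep the alarm from being generated for that period.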
Impact on the System
- The EEM responds slowly, and some operations are delayed.
- Real-time services such as collection tasks, parameter interrogation results, performance, and alarms are delayed, so users cannot obtain information immediately.
- Service processing is slow, so messages accumulate.
- Alarm information fails to be dumped due to insufficient space.
Possible Causes
- The EEM server is temporarily busy.
- The threshold for the resource usage of the EEM server is incorrect.
- Time-consuming operations or operations that consume system resources are performed on the EEM server.
- The hardware performance of the EEM server is too low to run the EEM.
- The disk space of the EEM server is insufficient.
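To confirm or rule out the disk-space cause, the disk usage of the server can be compared with the 90% threshold. The sketch below uses Python's standard `shutil.disk_usage`; the function names and the default path are assumptions, and the maintenance tool may compute the percentage differently (for example, by excluding reserved blocks).

```python
import shutil


def disk_usage_percent(path: str = "/") -> float:
    """Return used space on the file system holding `path`, in percent."""
    usage = shutil.disk_usage(path)  # named tuple: total, used, free (bytes)
    if usage.total == 0:
        return 0.0  # avoid division by zero on pseudo file systems
    return 100.0 * usage.used / usage.total


def is_over_threshold(percent: float, threshold: float = 90.0) -> bool:
    """Apply the same 'greater than or equal to' rule as the alarm."""
    return percent >= threshold
```

Calling `is_over_threshold(disk_usage_percent("/"))` for each file system of interest gives a quick indication of whether disk space is the likely trigger.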
Procedure
- Check whether the EEM server is temporarily busy.
- Choose .
- Check First Occurrence Time on the Current Alarm page. Check whether the alarm meets the following conditions:
- The alarm is generated occasionally and is automatically cleared within 10 minutes.
- The alarm is generated only once within several days.
- The alarm affects the system (real-time services such as collection tasks, parameter interrogation results, performance, and alarms) only briefly (the alarm is automatically cleared within 30 minutes) or only slightly (no data is lost; the performance delay is within one period; the alarm delay is within 30s; the GUI response time is within 10s).
- Many other alarms or events are reported during the time range in which the high CPU usage alarm is generated.
- Wait for 5 minutes and check whether the alarm is cleared.
- If so, the process is ended.
- If not, go to 2.
- Check whether the threshold for the CPU usage of the EEM server is correct.
- Check whether the CPU usage of the EEM application exceeds 90%.
- In the task manager of the EEM server, check whether the CPU usage of the EEM application exceeds 90%.
- Wait for 5 minutes and check whether the alarm is cleared.
- If so, the process is ended.
- If not, go to 4.
- Check whether the CPU usage of non-EEM applications exceeds 90%.
- Contact the administrator and check whether the CPU usage of the non-EEM application exceeds 90%.
- If so, wait for the administrator to take measures. After the administrator takes measures, go to 4.b. If you cannot wait for service processing, go to 6.
- If not, go to 5.
- In the SUSE Linux OS, run the top command and check the CPU usage of each process in the CPU column of the command output.
- In the Windows OS, open the task manager and check the CPU usage of each process in the CPU column on the process page.
- Wait for 5 minutes and check whether the alarm is cleared.
- If so, the process is ended.
- If not, go to 5.
- Check whether the hardware performance of the EEM server is low and cannot support EEM running.
- Contact the administrator and check whether the server hardware performance is low. The server hardware performance is considered low if either of the following applies:
- The capability of the server hardware does not meet the hardware requirements of the EEM.
- The server continuously or frequently receives the alarm.
- Wait for 5 minutes and check whether the alarm is cleared.
- If so, the process is ended.
- If not, go to 6.
- Collect alarm information and contact Huawei technical support personnel.
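The per-process CPU check on SUSE Linux described in the procedure can be sketched as a small script that reads one batch-mode snapshot from `top` and lists the processes above 90%. This is an illustration under stated assumptions: it assumes the procps version of `top` (which supports `-b -n 1` and prints a header row containing `PID`, `%CPU`, and `COMMAND`), and all function names are hypothetical. Per-process values in `top` can exceed 100% on multi-core servers, so the result is only a starting point for the discussion with the administrator.

```python
import subprocess
from typing import List, Tuple

Row = Tuple[int, float, str]  # (pid, cpu_percent, command)


def parse_top_output(text: str) -> List[Row]:
    """Extract (pid, cpu_percent, command) rows from `top -b -n 1` output."""
    rows: List[Row] = []
    header = None
    for line in text.splitlines():
        fields = line.split()
        if header is None:
            # Skip the summary area until the column header row appears.
            if "PID" in fields and "%CPU" in fields and "COMMAND" in fields:
                header = fields
            continue
        # COMMAND is the last column and may contain spaces.
        parts = line.split(None, len(header) - 1)
        if len(parts) < len(header):
            continue
        try:
            pid = int(parts[header.index("PID")])
            # Some locales print a decimal comma.
            cpu = float(parts[header.index("%CPU")].replace(",", "."))
        except ValueError:
            continue
        rows.append((pid, cpu, parts[-1].strip()))
    return rows


def busy_processes(text: str, threshold: float = 90.0) -> List[Row]:
    """Return the processes whose CPU usage exceeds the threshold."""
    return [row for row in parse_top_output(text) if row[1] > threshold]


def read_top_snapshot() -> str:
    """Run top once in batch mode (assumes procps top is installed)."""
    result = subprocess.run(["top", "-b", "-n", "1"],
                            capture_output=True, text=True, check=True)
    return result.stdout
```

On the server, `busy_processes(read_top_snapshot())` returns the candidate processes; whether each one belongs to the EEM or to a non-EEM application still has to be determined manually.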