ALM-1200035 ZooKeeper Process Does Not Respond
Description
Every two minutes, the system checks whether the ZooKeeper process on the node responds to requests. This alarm is generated when the ZooKeeper process does not respond.
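The product's own check mechanism is not described here. As a rough manual equivalent, the sketch below probes a ZooKeeper server with the standard ruok four-letter command, to which a serving instance replies imok. The helper names zk_reply_ok and zk_probe, the client port 2181, and the availability of nc on the node are assumptions. On ZooKeeper 3.5.3 and later, ruok must also be permitted through the 4lw.commands.whitelist setting.

```shell
#!/bin/sh
# Sketch only: zk_reply_ok and zk_probe are hypothetical helper names,
# and 2181 is assumed to be the ZooKeeper client port.

# Succeeds only when the reply is exactly "imok".
zk_reply_ok() {
    [ "$1" = "imok" ]
}

# Sends "ruok" to host:port and reports whether ZooKeeper answered.
zk_probe() {
    host=$1
    port=${2:-2181}
    # -w 5 limits connect and idle time to 5 seconds;
    # an empty reply is treated as no response.
    reply=$(printf 'ruok' | nc -w 5 "$host" "$port" 2>/dev/null)
    if zk_reply_ok "$reply"; then
        echo "ZooKeeper on $host:$port is responding"
    else
        echo "ZooKeeper on $host:$port is not responding"
        return 1
    fi
}
```

For example, zk_probe 192.0.2.10 probes the node at that placeholder address. In practice, the address would be the Host IP address shown in the alarm details.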
Attribute
Alarm ID | Alarm Severity | Auto Clear |
---|---|---|
1200035 | Major | Yes |
Parameters
Parameter | Description |
---|---|
Alarm Name | Indicates the alarm name. |
Source Device | Indicates the device for which the alarm is generated. |
Monitoring System | Indicates the type of the interconnected system. |
Source System | Indicates the name of the interconnected system. |
IP Address | Indicates the IP address of the ECS instance for which the alarm is generated. |
Last Occurred At | Indicates the time when the alarm was generated for the last time. |
Location Information | Indicates the information required to locate the alarm. |
Additional Information | Provides supplementary information about the alarm. |
Impact on the System
If the ZooKeeper process on a node does not respond, the node cannot provide services. Handle the alarm immediately.
Possible Causes
System errors
Procedure
- Log in to the ManageOne OM plane using a browser.
- URL: https://<address of the ManageOne OM plane homepage>:31943, for example, https://oc.type.com:31943.
- Default username: admin, default password: Huawei12#$.
- Click Login.
- On the menu bar in the upper part of the page, choose Alarms.
- In the alarm list, locate the target alarm and click its name in the Name column. The Alarm Details and Handling Recommendations dialog box is displayed.
- In the Basic Information list of the Alarm Details and Handling Recommendations dialog box, locate Location Info. The value of Host IP address in Location Info is the IP address of the node for which the alarm is generated.
- Use PuTTY to log in to the node for which the alarm is generated.
The default username is vpc. The default password is Huawei12#$.
- Run the following command to prevent the session from being logged out automatically upon timeout:
TMOUT=0
- Run the following command to obtain the PID of the ZooKeeper process:
ps -ef | grep zookeeper
In the command output, the PID is the value in the second column of the ZooKeeper process entry.
- Run the following command to stop the ZooKeeper process:
kill -9 pid
pid is the process ID obtained from the output of the preceding ps command.
The process protection mechanism will restart the process.
- After 5 minutes, run the following command to check whether the ZooKeeper process exists:
ps -ef | grep zookeeper
If the output lists a ZooKeeper process other than the grep command itself, the ZooKeeper process exists.
- If the ZooKeeper process exists, go to the final step (checking whether the alarm is cleared).
- If the ZooKeeper process does not exist, contact technical support for assistance.
- After 10 minutes, check whether the alarm is cleared.
- If the alarm is cleared, no further action is required.
- If the alarm persists, contact technical support for assistance.
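The process restart steps above can be sketched as a pair of shell functions. This is an illustration under stated assumptions, not a supported tool: the function names list_zk_pids and restart_and_check are hypothetical, and the 300-second default simply mirrors the 5-minute wait in the procedure.

```shell
#!/bin/sh
# Sketch only: list_zk_pids and restart_and_check are hypothetical names.

# Reads ps -ef output on stdin and prints the PID (second column) of every
# line that mentions zookeeper, skipping the grep command itself.
list_zk_pids() {
    awk '/zookeeper/ && !/grep/ { print $2 }'
}

# Kills the given PID, waits for the process protection mechanism to
# restart ZooKeeper, then checks whether a ZooKeeper process exists again.
restart_and_check() {
    pid=$1
    wait_seconds=${2:-300}   # the procedure waits 5 minutes
    kill -9 "$pid" || return 1
    sleep "$wait_seconds"
    if [ -n "$(ps -ef | list_zk_pids)" ]; then
        echo "ZooKeeper process exists; check the alarm after 10 minutes"
    else
        echo "ZooKeeper process not found; contact technical support"
        return 1
    fi
}
```

A possible use is to run ps -ef | list_zk_pids, review the listed PIDs, and then pass the chosen one to restart_and_check. Reviewing first matters because any command line containing the string zookeeper is listed, not only the server process.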
Alarm Clearance
This alarm will be automatically cleared after the fault is rectified.
Reference
None