The Slave MPU of the MA5200G Repeatedly Restarts Because the Clock Board Becomes Faulty

Publication Date: 2012-07-27
Issue Description
 The slave MPU in slot 9 on the MA5200G repeatedly restarts and the master MPU in slot 10 
Alarm Information
 Alarm messages collected through the console interface on the master MPU in slot 10 are as follows:
System is busy with warm backup, please wait for a moment...
System is busy with warm backup, please wait for a moment...
Sep 10 2009 00:26:57 YCDT-521-B-MA5200G-01 %%01VFS/3/IPCUNREGDEV_ERR(l): Failed to unregister file system on device 9 through IPC, ipc return value 2.
Sep 10 2009 00:26:58 YCDT-521-B-MA5200G-01 %%01VFS/3/IPCREGDEV_ERR(l): Failed to register device 9 to main file system through IPC, ipc return value is 2.
Sep 10 2009 00:27:11 YCDT-521-B-MA5200G-01 %%01MEM/4/WARNING(l):
Just to trace lpu heartbeat
Alarm messages collected through the console interface on the slave MPU in slot 9 are as follows:
Because clock board has occurred exception ,and reset clock board 16.
Sep 10 2009 00:16:44 Quidway %%01SRM/1/LOCKCHANGE(l): Lock mode change to free-run. (CLK=9)
#Sep 10 00:16:45 2009 Quidway SRM/4/CLK_OK:OID 1.3.6.1.4.1.2011.2.17.0.110 CLK 9 hardware fail clear!
#Sep 10 00:16:46 2009 Quidway SRM/0/CLK_FAIL:OID 1.3.6.1.4.1.2011.2.17.0.109 CLK 9 hardware failed!
Sep 10 2009 00:16:48 Quidway %%01SRM/1/LOCKCHANGE(l): Lock mode change to hold. (CLK=9)
#Sep 10 00:17:05 2009 Quidway SRM/0/CLK_FAIL:OID 1.3.6.1.4.1.2011.2.17.0.109 CLK 9 hardware failed!
Sep 10 2009 00:17:06 Quidway %%01SRM/4/RESETCLOCK(l): 
 
Handling Process
 The problem was solved by replacing the slave MPU in slot 9.
Root Cause
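 The alarm messages collected on the slave MPU show that its clock board is faulty: the board repeatedly reports CLK_FAIL for CLK 9, the clock lock mode changes to free-run and then to hold, and the clock board is reset. This is a hardware failure, and it is resolved by replacing the slave MPU in slot 9.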
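 The reasoning above can be sketched as a small log check. The Python code below is an illustrative sketch only, not a Huawei tool: the function names, the regular expressions, and the threshold of two CLK_FAIL alarms are assumptions chosen to match the console output in this case.

```python
import re
from collections import Counter

# Hypothetical patterns, written only against the alarm formats shown in this case.
CLK_EVENT = re.compile(r"SRM/\d/(CLK_FAIL|CLK_OK|LOCKCHANGE|RESETCLOCK)\b")
CLK_ID = re.compile(r"CLK[ =](\d+)")


def summarize_clock_alarms(lines):
    """Count SRM clock events per (event name, clock board ID).

    The ID is None when the log line does not carry one.
    """
    counts = Counter()
    for line in lines:
        event = CLK_EVENT.search(line)
        if not event:
            continue  # not an SRM clock alarm
        clk = CLK_ID.search(line)
        counts[(event.group(1), clk.group(1) if clk else None)] += 1
    return counts


def clock_board_suspect(counts, clk_id, min_failures=2):
    """Flag a clock board once CLK_FAIL has been logged min_failures times.

    The threshold is an assumption for illustration, not a vendor rule.
    """
    return counts[("CLK_FAIL", clk_id)] >= min_failures


if __name__ == "__main__":
    # Two of the alarm lines captured on the slave MPU in slot 9.
    sample = [
        "#Sep 10 00:16:46 2009 Quidway SRM/0/CLK_FAIL:OID 1.3.6.1.4.1.2011.2.17.0.109 CLK 9 hardware failed!",
        "#Sep 10 00:17:05 2009 Quidway SRM/0/CLK_FAIL:OID 1.3.6.1.4.1.2011.2.17.0.109 CLK 9 hardware failed!",
    ]
    print(clock_board_suspect(summarize_clock_alarms(sample), "9"))
```

 A check like this only narrows down which board to inspect. The fault in this case was confirmed by replacing the MPU.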
Suggestions
 When the clock board is faulty, the slave MPU can still register but cannot synchronize its clock with that of the master MPU, so the slave MPU restarts. If a slave MPU restarts repeatedly and reports clock hardware failure alarms such as those above, replace the slave MPU.
