S5500 Consistency Group Remote Replication Failure

Published: 2015-09-15
Problem Description

An S5500 is connected to an S6800T for remote replication.

Inspection alarm result

Alarm Information

The remote replication status is abnormal, and the consistency group is abnormal.

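Handling Process

The LUN status, disk status, and link status were checked with commands. All were normal.

Logs were then collected and analyzed.

The logs show the cause:

A bad sector on disk:22 of the S5500 caused the remote replication write I/O to the secondary LUN to fail.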

[2015-08-31 14:07:34][][201f9000c][ERR][Received Critical Alarm: LUN(lun-name:LUN_DLDB_500000M) belong to RAID(raid-name:RAID003) medium error occour.][ALARM][ALM_PrintRece.dAlarm,232][alarm]
[2015-08-31 14:07:34][23959816648][50000013001f][ERR][A Private IBS REQ result is not OK.Print details of the REQ:pReq=00000101007c0d40, result=-1, opCode=0x35, LunId=80.][BS][IBS_INISendPv.eqDone,857]
[2015-08-31 14:07:34][23959816649][50000011004b][ERR][BS REQ result is not OK.Print details of the REQ:pReq=000001010090d040, result=-1, opCode=0x332180c0.][BS][BS_TGTExecDon.tPrint,875]
[2015-08-31 14:07:34][23959816649][50000011004c][ERR][Continue previous log:LunId=80, Lba=133309568, Len=128, pSgl=0000000000000000, pRemoteSgl=0000000000000000, File=/home/r5c02src_tmp/alps/src/sic/bs/de4_ibs/bs_][BS][BS_TGTExecDon.tPrint,881]

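Handling of an I/O failure in a remote replication consistency group: the consistency group is set to the abnormally disconnected state.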

[2015-08-31 14:07:48][23959830847][500000570580][INFO][Receive task from remote controller,task code:411,serial number:1209962,intend node:21000022a109a2d8,primary node:21000022a109a2d8][RM][RM_MngReceive.eliver,1494]
RM_PUBLIC_STG_CG_IO_ERROR,           /* 411, CG mirror I/O failure */
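
When scanning a large log bundle, it can help to pull the task code out of the "Receive task from remote controller" entries automatically. The following is a minimal sketch, assuming the log line format shown above. The function name `decode_task` and the lookup table are hypothetical and are not part of any array tool. Only code 411 is known from this case, so every other code is reported as unknown.

```python
import re

# Only code 411 is documented in this case; other codes are not known here.
KNOWN_TASK_CODES = {411: "RM_PUBLIC_STG_CG_IO_ERROR"}

# Matches the "task code:<n>" field of the RM log entry shown above.
TASK_RE = re.compile(r"task code:(\d+)")


def decode_task(line):
    """Return (code, name) for an RM task log line, or None if no task code is present."""
    match = TASK_RE.search(line)
    if match is None:
        return None
    code = int(match.group(1))
    return code, KNOWN_TASK_CODES.get(code, "unknown")


sample = (
    "[2015-08-31 14:07:48][23959830847][500000570580][INFO]"
    "[Receive task from remote controller,task code:411,serial number:1209962,"
    "intend node:21000022a109a2d8,primary node:21000022a109a2d8]"
    "[RM][RM_MngReceive.eliver,1494]"
)
print(decode_task(sample))
```

A line that reports task code 411 indicates that the consistency group was disconnected because of a mirror I/O failure, which is the situation analyzed in this case.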

Log output showing that the bad sector was repaired successfully:

[2015-08-31 14:07:41][23959823527][500000840015][INFO][Inform that LUN 80 of RAID 2 has been deleted all BST][BST][RP_BSTTableDe.SetCfg,879]
[2015-08-31 14:07:41][][201f9000c][INFO][Received Critical Resume alarm: LUN(lun-name:LUN_DLDB_500000M) belong to RAID(raid-name:RAID003) medium error occour.][ALARM][ALM_PrintRece.dAlarm,239][alarm]
[2015-08-31 14:07:48][23959830638][50000078000f][INFO][Repair bad sector (disk:22, LBA:184253825, len:1, time:23959830235) succeed][DF][DF_SetLocalBa.Result,640]
[2015-08-31 14:07:48][][1201f90014][INFO][Received Infor Event: The bad sectors on disk (frame-id:1, slot-id:22) were restored successfully.][ALARM][ALM_PrintRece.dAlarm,239][alarm]

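Because the bad sector was repaired successfully, it is enough to run the synchronization again. If the log had reported that the repair failed, the disk would have to be replaced.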
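
The check for repair results can be sketched as a small log filter. This is an illustration only, assuming the "Repair bad sector (...) succeed" format shown in the log above. The wording of a failed repair entry does not appear in this case, so the sketch does not guess it: any repair entry whose result word is not "succeed" is set aside for manual review. The name `summarize_repairs` is hypothetical.

```python
import re

# Format taken from the DF log entry above; the trailing word is the result.
REPAIR_RE = re.compile(
    r"Repair bad sector \(disk:(\d+), LBA:(\d+), len:(\d+), time:\d+\) (\w+)"
)


def summarize_repairs(lines):
    """Split bad sector repair entries into repaired ones and ones needing review."""
    repaired, review = [], []
    for line in lines:
        match = REPAIR_RE.search(line)
        if match is None:
            continue  # not a bad sector repair entry
        record = {
            "disk": int(match.group(1)),
            "lba": int(match.group(2)),
            "len": int(match.group(3)),
        }
        if match.group(4) == "succeed":
            repaired.append(record)
        else:
            # Assumption: any other result word means the repair did not succeed.
            review.append(record)
    return repaired, review


log_lines = [
    "[2015-08-31 14:07:48][23959830638][50000078000f][INFO]"
    "[Repair bad sector (disk:22, LBA:184253825, len:1, time:23959830235) succeed]"
    "[DF][DF_SetLocalBa.Result,640]",
]
repaired, review = summarize_repairs(log_lines)
print("repaired:", repaired)
print("needs review:", review)
```

Disks that appear in the review list, or that keep reappearing in the repaired list with new LBAs, are candidates for replacement.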

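Root Cause

A bad sector appeared on a disk during remote replication, which caused the remote replication consistency group to be abnormally disconnected.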

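Solution

A remote replication that was abnormally disconnected because of an I/O error must be resynchronized manually. On the S6800T, manually run the synchronization operation on the abnormally disconnected remote replication consistency group.

After the bad sector has been repaired, the secondary LUN can be read and written normally, and the synchronization will not fail unless other bad sectors appear.

If new bad sectors appear on the disk, or if a repair fails, replacing the disk is recommended.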

END