Datacom Product CE6851 (V100R005C10SPC200) Switch Stack Failure Analysis

Published: 2017-04-18
Problem Description

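After two CE6851 switches were deployed as a stack, one device left the stack for an unknown reason and both devices became master. To rebuild the stack, the existing configuration was cleared manually and the domain ID, member (slot) ID, priority, and related settings were configured again. After the configuration was checked and found correct, it was committed and saved, and the devices were rebooted in master-then-standby order. The stack still could not be established.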

Alarm Information

Failed to join the stack because local configuration conflicted with the master device's configuration. Run the "display stack troubleshooting" command to view detailed information.

Troubleshooting Procedure

Step 1:

Run the display stack troubleshooting command to view the stack fault event records.

<E5_E6/CE6851_computing>dis stac troubleshooting

The records are only about tracks of active and historical stack events.

Total :1

--------------------------------------------------------------------------------

Seq  Time                     Event Description                                 

--------------------------------------------------------------------------------

1    2017-02-27 15:11:01.162  Configuration conflict: The port is configured as

                              member port of Stack-Port on the local device, but

                               it is configured as a stack isolated port or a fa

                              bric member port on the master device. (Port(s) =

                              40GE2/0/3, 40GE2/0/4, 40GE2/0/5, 40GE2/0/6,  Maste

                              rMAC = 883f-d330-8140)
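
The device wraps the Event Description column at a fixed width, so words such as "fabric" and "MasterMAC" are split across rows in the record above. The following Python sketch rejoins the wrapped column and extracts the conflicting ports and the master MAC address. The column offset (30) and width (50) are assumptions inferred from this single sample, and the function names are illustrative only.

```python
import re

DESC_COL = 30    # assumed start column of "Event Description" (inferred from the sample)
DESC_WIDTH = 50  # assumed column width on an 80-column terminal

def join_description(lines):
    """Rejoin a description that the CLI wrapped at a fixed column width."""
    chunks = []
    for line in lines:
        if not line.strip():
            continue  # skip blank lines between wrapped rows
        # Pad each chunk so a space trimmed from the end of a row is restored.
        chunks.append(line[DESC_COL:DESC_COL + DESC_WIDTH].ljust(DESC_WIDTH))
    return "".join(chunks).rstrip()

def parse_conflict(description):
    """Return (ports, master_mac) from a configuration-conflict event."""
    ports = re.search(r"Port\(s\) = (.*?),\s*MasterMAC", description)
    mac = re.search(r"MasterMAC = ([0-9a-fA-F]{4}(?:-[0-9a-fA-F]{4}){2})", description)
    port_list = [p.strip() for p in ports.group(1).split(",")] if ports else []
    return port_list, (mac.group(1) if mac else None)

PAD = " " * DESC_COL
sample = [
    "1    2017-02-27 15:11:01.162  Configuration conflict: The port is configured as",
    PAD + "member port of Stack-Port on the local device, but",
    PAD + " it is configured as a stack isolated port or a fa",
    PAD + "bric member port on the master device. (Port(s) =",
    PAD + "40GE2/0/3, 40GE2/0/4, 40GE2/0/5, 40GE2/0/6,  Maste",
    PAD + "rMAC = 883f-d330-8140)",
]

text = join_description(sample)
ports, master_mac = parse_conflict(text)
print(ports)
print(master_mac)
```

Slicing by column rather than stripping each row keeps the leading space on the third row, which is what separates "but" from "it" once the rows are joined.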

Step 2:

Run the dis stack link-state last-down-reason command to check why the stack port went down.

<E5_E6/CE6851_computing>dis stack link-state last-down-reason

Stack-Port :

--------------------------------------------------------------------------------

Stack-Port                : Stack-Port2/1

Stack-Port Protocol State : down

Member Port(s) State :

  Down Reason    : The Stack-Port does not have member interfaces.
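
When several Stack-Ports are listed, it can help to reduce this output to a table of ports and their last down reason. A minimal Python sketch follows, assuming every entry uses the "Key : value" layout shown above; the function name is hypothetical.

```python
def parse_down_reasons(output):
    """Map each Stack-Port name to its protocol state and last down reason."""
    result = {}
    current = None
    for raw in output.splitlines():
        line = raw.strip()
        if ":" not in line:
            continue  # separators and blank lines
        key, _, value = line.partition(":")
        key, value = key.strip(), value.strip()
        if key == "Stack-Port" and value:
            # "Stack-Port :" with no value is only the section header.
            current = result.setdefault(value, {})
        elif current is not None and key == "Stack-Port Protocol State":
            current["state"] = value
        elif current is not None and key == "Down Reason":
            current["reason"] = value
    return result

sample = """\
Stack-Port :
--------------------------------------------------------------------------------
Stack-Port                : Stack-Port2/1
Stack-Port Protocol State : down
Member Port(s) State :
  Down Reason    : The Stack-Port does not have member interfaces.
"""

reasons = parse_down_reasons(sample)
print(reasons)
```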

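Root Cause

Based on the information above, the stack cannot form because of a stack port configuration conflict. However, even after the original stack configuration was cleared and the devices were reconfigured, the stack still could not be created.

Repeated further checks showed that, because the devices had previously formed a stack, the related stack configuration was still stored in the system, where it normally cannot be deleted or changed.

Solution

1. Split the stack.

In the system view, run the clear inactive-configuration all command to clear the inactive configuration of offline devices.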

[~E5_E6/CE6851_computing]clear inactive-configuration all

Warning: All inactive configurations will be deleted and can't be restored.

Are you sure to continue? [Y/N]:y

Info: Operating, please wait for a moment..................done.

[~E5_E6/CE6851_computing]

2. View the existing stack configuration

#display stack configuration --------------------------------------------------------View the stack configuration

Oper          : Operation

Conf          : Configuration

*             : Offline configuration

Isolated Port : The port is in stack mode, but does not belong to any Stack-Port

 

Attribute Configuration:

-----------------------------------------

 MemberID      Domain         Priority     

Oper(Conf)   Oper(Conf)      Oper(Conf)  

-----------------------------------------

1(1)         10(10)          200(200)    

2(2)         10(10)          150(150)    

-----------------------------------------

 

Stack-Port Configuration:

--------------------------------------------------------------------------------

Stack-Port      Member Ports                                                    

--------------------------------------------------------------------------------

Stack-Port1/1   40GE1/0/5           40GE1/0/6-----------------------------Stack ports of slot 1

Stack-Port2/2   40GE2/0/5           40GE2/0/6-----------------------------Stack ports of slot 2

--------------------------------------------------------------------------------

 

Isolated Port Configuration:
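
Each cell in the Attribute Configuration table above has the form Oper(Conf), so a mismatch between the running and configured values is easy to miss by eye. The Python sketch below parses those rows and reports mismatches, differing domains, and duplicate member IDs. It is a rough aid under the assumption that the rows look exactly like the sample; rows carrying the "*" offline marker are not handled, and all names are hypothetical.

```python
import re

# One data row of the table, e.g. "1(1)  10(10)  200(200)".
# Assumption: rows with the "*" offline marker are not matched by this pattern.
ROW = re.compile(r"^\s*(\d+)\((\d+)\)\s+(\d+)\((\d+)\)\s+(\d+)\((\d+)\)\s*$")
FIELDS = ("member_id", "domain", "priority")

def parse_attributes(output):
    """Return one dict per member; each field maps to an (oper, conf) tuple."""
    members = []
    for line in output.splitlines():
        match = ROW.match(line)
        if match:
            nums = [int(n) for n in match.groups()]
            members.append({f: (nums[2 * i], nums[2 * i + 1])
                            for i, f in enumerate(FIELDS)})
    return members

def find_problems(members):
    """List inconsistencies that are worth checking before a reboot."""
    problems = []
    if len({m["domain"][1] for m in members}) > 1:
        problems.append("configured stack domains differ between members")
    if len({m["member_id"][1] for m in members}) < len(members):
        problems.append("duplicate configured member IDs")
    for m in members:
        for field in FIELDS:
            oper, conf = m[field]
            if oper != conf:
                problems.append(
                    f"member {m['member_id'][0]}: {field} is {oper} "
                    f"but configured as {conf}")
    return problems

sample = """\
 MemberID      Domain         Priority
Oper(Conf)   Oper(Conf)      Oper(Conf)
-----------------------------------------
1(1)         10(10)          200(200)
2(2)         10(10)          150(150)
-----------------------------------------
"""

members = parse_attributes(sample)
problems = find_problems(members)
print(members)
print(problems)
```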

           

3. Delete the stack configuration

Step 1:

#stack---------------------------------------------------------------Enter the stack management view

#display this--------------------------------------------------------View the running configuration of the current view

#

stack

 #

 stack member 1 domain 10-------------------------------------Stack domain ID

 stack member 1 priority 200------------------------------------Stack priority

 #

 stack member 2 domain 10

 stack member 2 priority 150

#

return

 

#undo stack member 1 domain 10----------------------------Delete the configuration

Info: The operation will take effect after reboot.

#undo stack member 1 priority 200----------------------------Delete the configuration

Info: The operation will take effect after reboot.

#undo stack member 2 domain 10------------------------------Delete the configuration

Info: The operation will take effect after reboot.

#undo stack member 2 priority 150--------------------------Delete the configuration

Info: The operation will take effect after reboot.

Step 2:

#int Stack-Port 1/1---------------------------------------------------------Enter the stack port view

#undo port member-group interface 40GE 1/0/5 to 1/0/6------------Remove the member ports from the stack port

#commit---------------------------------------------------------------------Commit the configuration

#qui--------------------------------------------------------------------------Quit

#save-------------------------------------------------------------------------Save the configuration

Warning: The current configuration will be written to the device. Continue? [Y/N]:y

Now saving the current configuration to the slot 1 .

Info: Save the configuration successfully.

 

Step 3:

#stack

 #stack member 1 domain 10-------------------------------------------Set the domain ID of stack member 1 to 10

 #stack member 1 priority 200------------------------------------------Set the priority of stack member 1 to 200

#int Stack-Port 1/1---------------------------------------------------------Enter the stack port view

#port member-group interface 40GE 1/0/5 to 1/0/6------------------Add the member ports to the stack port

#commit---------------------------------------------------------------------Commit the configuration

#qui--------------------------------------------------------------------------Quit

#save-------------------------------------------------------------------------Save the configuration

Warning: The current configuration will be written to the device. Continue? [Y/N]:y

Now saving the current configuration to the slot 1 .

Info: Save the configuration successfully.
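
The delete-and-reconfigure sequence in Steps 1 to 3 is identical for each member apart from the member ID, domain, priority, and port range. The Python sketch below only builds the command strings used in this case so they can be reviewed before being typed on the console; it does not connect to a device. The `StackMember` type and the function names are hypothetical, and the command syntax is copied from the transcript above (with `int` and `qui` expanded to `interface` and `quit`) rather than verified against other software versions.

```python
from dataclasses import dataclass

@dataclass
class StackMember:
    member_id: int
    domain: int
    priority: int
    stack_port: str   # e.g. "1/1"
    first_port: str   # e.g. "1/0/5"
    last_port: str    # e.g. "1/0/6"

def removal_commands(m):
    """Commands that remove the old stack settings (Steps 1 and 2)."""
    return [
        "stack",
        f"undo stack member {m.member_id} domain {m.domain}",
        f"undo stack member {m.member_id} priority {m.priority}",
        f"interface Stack-Port {m.stack_port}",
        # Port type 40GE is taken from this case; other models may differ.
        f"undo port member-group interface 40GE {m.first_port} to {m.last_port}",
        "commit",
        "quit",
        "save",
    ]

def rebuild_commands(m):
    """Commands that configure the stack settings again (Step 3)."""
    return [
        "stack",
        f"stack member {m.member_id} domain {m.domain}",
        f"stack member {m.member_id} priority {m.priority}",
        f"interface Stack-Port {m.stack_port}",
        f"port member-group interface 40GE {m.first_port} to {m.last_port}",
        "commit",
        "quit",
        "save",
    ]

master = StackMember(1, 10, 200, "1/1", "1/0/5", "1/0/6")
for cmd in removal_commands(master) + rebuild_commands(master):
    print(cmd)
```

Generating both lists from one record keeps the values removed in Steps 1 and 2 identical to the values restored in Step 3.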

 

4. Configure the standby switch in the same way

   ... (procedure omitted) ...

 

5. Reboot the devices

#reboot----------------------------------------------------------------------------------Reboot the device

slot 1:

Next startup system software: flash:/CE6851HI-V100R005C10SPC200.cc

Next startup saved-configuration file: flash:/vrpcfg.zip

Next startup paf file: default

Next startup patch package: flash:/CE6851HI-V100R005SPH009.PAT

The configuration information of any other board is the same as that of slot 1.

Warning: The current configuration will be saved to the next startup saved-configuration file. Continue? [Y/N]:y

Warning: The system will reboot. Continue? [Y/N]:y

#

 

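Note:

The following compares the abnormal and normal stack states during system startup. The standby switch shows SLAVE as its stack role, whereas the master shows MASTER.

Abnormal state on the standby switch: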

Initializing stack ........................................... DONE

Stack member ID .............................................. 2

Stack domain ID .............................................. 10

Stack priority ............................................... 150

Default MAC .................................................. 88-3f-d3-30-7f-70

Competing with other devices in the stack .................... DONE

Stack role ................................................... SLAVE

STACK configuration conflict check ........................... CONFLICT

Failed to join the stack because local configuration conflicted with the master device's configuration. Run the "display stack troubleshooting" command to view detailed information.

 

Normal state on the standby switch:

Now starting VRP ........

 

Initializing stack ........................................... DONE

Stack member ID .............................................. 2

Stack domain ID .............................................. 10

Stack priority ............................................... 150

Default MAC .................................................. 88-3f-d3-30-7f-70

Competing with other devices in the stack .................... DONE

Stack role ................................................... SLAVE

 

User interface con0 is available

 

Please Press ENTER.

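Suggestions and Summary

Pay attention to device priorities during configuration. It is generally recommended that the device with the higher priority act as master and the device with the lower priority act as standby (the default priority is 100). To avoid rework, proceed carefully during implementation.

To get better at solving similar problems, engineers should learn the commands for viewing the relevant log information. Many problems can be located quickly in the system with display, debug, and similar commands, which helps engineers organize their thinking and pinpoint the fault accurately.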
