Questo sito utilizza cookie di profilazione (propri e di terze parti) per ottimizzare la tua esperienza online e per inviarti pubblicità in linea con le tue preferenze. Continuando a utilizzare questo sito senza modificare le tue preferenze acconsenti all’uso dei cookie. Se vuoi saperne di più o negare il consenso a tutti o ad alcuni cookie clicca qui>
The website that you are visiting also provides Arabian language. Do you wish to switch language version?
يوفر موقع الويب الذي تزوره المحتوى باللغة العربية أيضًا. هل ترغب في تبديل إصدار اللغة؟
The website that you are visiting also provides Russia language Do you wish to switch language version?
Данный сайт есть в английской версии. Желаете ли Вы перейти на английскую версию?
In the process of using OceanStor 6800 V3, the storage reported 3 controllers are isolated which can’t be monitored.
Alarm information as follows：
To analysis the reason of three controllers isolated, that three controllers (B, C, D) have occurred self healing reset in the adjacent time points which causes this issue. The log is as follows:
The latest NO.1 reset: localorcmostime=1453740225, ji=244711951, reason=failure recovery reset
The latest NO.1 reset: localorcmostime=1453740137, ji=245402264, reason=failure recovery reset
The latest NO.1 reset: localorcmostime=1453740374, ji=244748102, reason=failure recovery reset
The process of the whole self healing is as follows:
l Host multipath software periodically sends INQ command to the controller to query storage port’s location information. The command will arrive at TGT module of the storage. TGT
directly supply one kernel lock to query by the internal interface, at the same time storage internal monitor thread also periodically supply the same kernel lock to query device status.
l Once storage’s management board appears abnormal, internal monitor thread will take more time for get device status. The situation will cause timeout of release the kernel lock. So
TGT would be waiting for internal monitor thread to release the kernel lock. It causes that system detects TGT occupy the CPU for long time, then system considers there is
abnormal which triggers system self-healing to reset the controller.
Storage V3 series’ management board have saved the storage’s system image, the controller boots the system by management board when power on. After the controller restart to face the failure of management board, the controller won’t be able to load the system again, which results in the controller isolated.
Storage V3 series’ management board have saved the storage’s system image, the controller boots the system by management board when power on. After the controller restart to face the failure of management board, the controller won’t be able to load the system again, which results in the controller isolated. At this time only can replace management board to resume the controller isolated.