Checking the Server
Check the server in the sequence shown in Figure 5-41. You can check it through either the iBMC WebUI or the iBMC CLI; choose the method that suits your environment.
For details about the commands involved in the operations, see the Atlas 800 Inference Server iBMC User Guide (Model 3000).
Procedure
- Check indicator status.
Ensure that hardware devices are working correctly.
For details, see Indicators and Buttons on the Front Panel and Indicators on the Rear Panel.
- Check the server.
- Check the server by using the iBMC WebUI.
- Log in to the iBMC over the WebUI. For details, see Logging In to the iBMC WebUI.
You are advised to change the initial password when logging in to the iBMC for the first time. For details, see Changing Initial Passwords.
- Choose iBMC Settings on the menu bar, choose Firmware Upgrade in the navigation tree, and view server version information, as shown in Figure 5-42.
Check that the server version meets requirements.
- Check the health status of the server.
- In the upper right corner of the iBMC home page, query the server health status, as shown in Figure 5-43.
| Meaning | Description |
| --- | --- |
| Alarm statistics | A critical alarm may power off the server and even interrupt system services. Corrective actions must be taken immediately. A major alarm may affect the normal operation of the system or interrupt services. A minor alarm has minor impact on the system but requires corrective action as soon as possible. Otherwise, a more severe alarm will be generated. |
| Power status | Displays the server power status. You can click the icon on the right of the indicator to power the server on or off. |
| UID status | Pinpoints the location of the server in a chassis. You can click the icon on the right of the indicator to control the state of the UID indicator. |
- Clear any alarms if present. For details, see the Atlas 800 Inference Server iBMC Alarm Handling (Model 3000).
- Check the server using the iBMC CLI.
- Set an IP address for the PC, and ensure that the IP address is on the same network segment as the iBMC management network port.
- Connect the network port on a PC to the iBMC management network port of the server using a network cable.
- Start an SSH client (for example, PuTTY) on the PC and log in using the IP address of the iBMC management network port and the iBMC user name and password.
By default, SSH is used to log in to the iBMC. If the SSH service is disabled, enable it by choosing Services > Port Services on the iBMC WebUI.
- Run the ipmcget -d version command to view the server version information.
Check that the server version meets requirements.
iBMC:/->ipmcget -d version
------------------- iBMC INFO -------------------
IPMC CPU: Hi1711
IPMI Version: 2.0
CPLD Version: (U151)0.15
Active iBMC Version: (U68)3.01.01.00
Active iBMC Build: 005
Active iBMC Built: 18:43:56 Mar 6 2020
Backup iBMC Version: 3.01.01.00
Available iBMC Version: 3.01.01.00
Available iBMC Build: 005
SDK Version: 5.0.80.0
SDK Built: 21:11:10 Feb 29 2020
Active Uboot Version: 5.0.80.0 (21:21:56 Feb 29 2020)
Backup Uboot Version: 5.0.80.0 (21:21:56 Feb 29 2020)
Active Secure Bootloader Version: 5.0.80.0 (21:21:55 Feb 29 2020)
Backup Secure Bootloader Version: 5.0.80.0 (21:21:55 Feb 29 2020)
Active Secure Firmware Version: 5.0.80.0 (21:21:55 Feb 29 2020)
Backup Secure Firmware Version: 5.0.80.0 (21:21:55 Feb 29 2020)
----------------- Product INFO -----------------
Product ID: 0x0007
Product Name: XXXX
BIOS Version: (U75)1.13
-------------- Mother Board INFO ---------------
Mainboard BoardID: 0x0005
Mainboard PCB: .A
------------------- NIC INFO -------------------
NIC 1 (TM280) BoardID: 0x0067
NIC 1 (TM280) PCB: .A
--------------- Riser Card INFO ----------------
Riser1 BoardName: BC82PRUN
Riser1 BoardID: 0x0093
Riser1 PCB: .A
Riser2 BoardName: BC82PRUN
Riser2 BoardID: 0x0093
Riser2 PCB: .A
-------------- HDD Backplane INFO --------------
Disk BP0 BoardName: BC82THBB
Disk BP0 BoardID: 0x004a
Disk BP0 PCB: .A
Disk BP0 CPLD Version: (U31)0.05
---------------- IO Board INFO ----------------
IOBoard0 ProductName: BC82IOEA
IOBoard0 BoardID: 0x0063
IOBoard0 PCB: .A
-------------------- PSU INFO -------------------
PS1 Version: DC:113 PFC:113
PS2 Version: DC:111 PFC:111
------------- Security Module INFO -------------
Specification Type: TPM/TCM
Specification Version: N/A
Manufacturer Name: N/A
Manufacturer Version: N/A
- CPLD Version: complex programmable logical device (CPLD) version of the server
- BIOS Version: BIOS version of the server
- Active iBMC Version: active iBMC version of the server
- Backup iBMC Version: backup iBMC version of the server
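If the version check has to be repeated across many servers, the four fields listed above can be pulled out of saved command output with a short script. The following is only a sketch: it assumes the CLI prints one "name: value" field per line, the function names (parse_versions, meets_minimum) are hypothetical, and the minimum versions used for comparison are placeholders rather than real requirements.

```python
import re

# Excerpt of the sample `ipmcget -d version` output from this section,
# assuming one field per line.
SAMPLE_OUTPUT = """\
CPLD Version: (U151)0.15
Active iBMC Version: (U68)3.01.01.00
Active iBMC Built: 18:43:56 Mar 6 2020
Backup iBMC Version: 3.01.01.00
BIOS Version: (U75)1.13
Disk BP0 CPLD Version: (U31)0.05
"""

# The four fields called out in the list above.
FIELDS = (
    "CPLD Version",
    "BIOS Version",
    "Active iBMC Version",
    "Backup iBMC Version",
)


def parse_versions(text):
    """Return {field name: version string} for the fields in FIELDS.

    A parenthesised prefix such as "(U151)" is removed from the value.
    """
    versions = {}
    for line in text.splitlines():
        # Split at the first colon only, so time stamps stay intact.
        key, sep, value = line.partition(":")
        key = key.strip()
        if sep and key in FIELDS:
            versions[key] = re.sub(r"^\(U\d+\)", "", value.strip())
    return versions


def version_tuple(version):
    """Convert a dotted version such as '3.01.01.00' to (3, 1, 1, 0)."""
    return tuple(int(part) for part in version.split("."))


def meets_minimum(version, minimum):
    """Compare two dotted versions numerically, component by component."""
    return version_tuple(version) >= version_tuple(minimum)


if __name__ == "__main__":
    found = parse_versions(SAMPLE_OUTPUT)
    for name in FIELDS:
        print(name, "=", found.get(name, "not found"))
    # "3.01.00.00" is a placeholder minimum, not a documented requirement.
    print(meets_minimum(found["Active iBMC Version"], "3.01.00.00"))
```

Matching on the exact field name keeps "Disk BP0 CPLD Version" from being mistaken for the server's "CPLD Version".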
- Check the health status of the server.
iBMC:/->ipmcget -d health
System in health state.
- If System in health state is displayed, no further action is required.
- If alarms are generated, clear the alarms by performing the following steps.
- Check the alarm status of the server.
iBMC:/->ipmcget -d healthevents
Event Num | Event Time | Alarm Level | Event Code | Event Description
1 | 2019-02-10 00:52:23 | Minor | 0x12000021 | get description failed.
2 | 2019-02-10 01:37:42 | Minor | 0x12000013 | Failed to obtain data of the air inlet temperature.
3 | 2019-02-10 00:52:23 | Minor | 0x12000019 | Right mounting ear is not present.
4 | 2019-02-10 00:52:19 | Major | 0x28000001 | The SAS or PCIe cable to front disk backplane is incorrectly connected.
- Clear alarms. For details, see the Atlas 800 Inference Server iBMC Alarm Handling (Model 3000).
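When several alarms are reported, it can help to sort them so that the most severe are handled first. The sketch below parses saved output of ipmcget -d healthevents, assuming one pipe-separated event per line as in the sample above. The function names (parse_health_events, by_severity) are hypothetical, and the severity ranking simply follows the critical, major, minor levels described for the WebUI.

```python
# Sample `ipmcget -d healthevents` output from this section,
# assuming one event per line.
SAMPLE_EVENTS = """\
Event Num | Event Time | Alarm Level | Event Code | Event Description
1 | 2019-02-10 00:52:23 | Minor | 0x12000021 | get description failed.
2 | 2019-02-10 01:37:42 | Minor | 0x12000013 | Failed to obtain data of the air inlet temperature.
3 | 2019-02-10 00:52:23 | Minor | 0x12000019 | Right mounting ear is not present.
4 | 2019-02-10 00:52:19 | Major | 0x28000001 | The SAS or PCIe cable to front disk backplane is incorrectly connected.
"""

# Lower rank means more urgent.
SEVERITY_RANK = {"Critical": 0, "Major": 1, "Minor": 2}


def parse_health_events(text):
    """Parse pipe-separated event rows into a list of dictionaries."""
    events = []
    for line in text.splitlines():
        # Split into at most five fields so a "|" inside the
        # description would not break the row.
        parts = [part.strip() for part in line.split("|", 4)]
        if len(parts) != 5 or not parts[0].isdigit():
            continue  # skip the header row and any non-event lines
        number, time, level, code, description = parts
        events.append({
            "num": int(number),
            "time": time,
            "level": level,
            "code": code,
            "description": description,
        })
    return events


def by_severity(events):
    """Return events ordered from most to least severe.

    Unknown levels sort last; events of equal severity keep their
    original order because sorted() is stable.
    """
    unknown = len(SEVERITY_RANK)
    return sorted(events, key=lambda e: SEVERITY_RANK.get(e["level"], unknown))


if __name__ == "__main__":
    for event in by_severity(parse_health_events(SAMPLE_EVENTS)):
        print(event["level"], event["code"], event["description"])
```

The event codes can then be looked up in the alarm handling guide in that order.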