Sensor List
Sensor |
Monitored Object |
Component |
---|---|---|
Inlet Temp |
Chassis air inlet temperature (ambient temperature) |
Mainboard |
Outlet1 Temp |
Air outlet temperature |
Mainboard |
Outlet2 Temp |
Air outlet temperature |
Mainboard |
Outlet3 Temp |
Air outlet temperature |
Mainboard |
Temperature at 10 mm below the BMC, that is, the temperature near the air outlet |
||
PCH Temp |
PCH bridge temperature |
PCH chip |
PCH VPVNN |
PCH VPVNN voltage |
Mainboard |
PCH PRIM 1V05 |
PCH PRIM voltage |
Mainboard |
CPUN Core Rem |
CPU core temperature |
CPU. N indicates the CPU ID. The value is 1 or 2. |
CPUN DTS |
CPU DTS value |
|
CPUN MEM Temp |
Temperature of DIMMs mapping to a CPU |
|
CPUN QPI Link |
CPU QPI link health status fault diagnosis |
|
CPUN Prochot |
CPU Prochot |
|
CPUN VDDQ Temp |
CPU VDDQ temperature |
Mainboard. N indicates the CPU ID. The value is 1 or 2. |
CPUN VRD Temp |
CPU VRD temperature |
Mainboard. N indicates the CPU ID. The value is 1 or 2. |
CPUN VCore |
1.8 V CPU voltage |
|
CPUN DDR VDDQ |
1.2 V memory voltage |
|
CPUN DDR VDDQ2 |
||
CPU1 Memory |
Temperature of DIMMs 1 to 6. |
Memory |
CPU2 Memory |
Temperature of DIMMs 7 to 12. |
|
CPUN DDR VPP1 |
CPU memory VPP voltage |
Mainboard |
CPUN DDR VPP2 |
||
CPUN VSA |
CPU memory VSA voltage |
Mainboard. N indicates the CPU ID. The value is 1 or 2. |
CPUN Status |
CPU status detection |
CPUN. N indicates the CPU ID. The value is 1 or 2. |
CPUN Margin |
CPUN Margin temperature |
|
SYS 3.3V |
Mainboard 3.3 V voltage |
Mainboard |
SYS 5V |
Mainboard 5.0 V voltage |
|
SYS 1.8V |
System 1.8 V voltage |
|
SYS 12V_N |
Mainboard 12.0 V voltage |
Mainboard. N indicates the component ID. The value ranges from 1 to 4. |
MezzN Temp |
Mezzanine card temperature |
Mezzanine card. N indicates the mezzanine card number. The value is 1 or 2. |
MezzN Status |
Mezzanine card health status fault diagnosis |
|
M2 Zone Temp |
M.2 slot, temperature near the air inlet. |
Mainboard |
RAID Temp |
RAID controller card temperature |
RAID controller card |
RAID Presence |
RAID controller card presence status |
CPLD |
FANN F Speed |
Fan speed |
Fan module. N indicates the fan module ID. The value ranges from 1 to 6. |
FANN R Speed |
||
FANN F Status |
Fan fault status |
|
FANN R Status |
||
FANN Presence |
Fan module presence status |
MM510. N indicates the management module ID. The value ranges from 1 to 6. |
PSN Presence |
PSU presence status |
MM510. N indicates the management module ID. The value ranges from 1 to 4. |
Power Button |
Power button status |
Mainboard |
PowerN |
PSU input power |
PSU. N indicates the PSU ID. The value ranges from 1 to 4. |
PSN Status |
PSU fault status |
|
PSN Fan Status |
PSU fan status |
|
Total Power |
Total PSU input power |
PSU |
HddBP2 Temp |
Temperature of the 3.5-inch drive backplane on the left |
Drive backplane on the left |
HddBP3 Temp |
Temperature of the 3.5-inch drive backplane on the right |
Drive backplane on the right |
Hdd Disk2 Temp |
HDD temperature |
HDD |
HddDisk0 Temp |
||
GPUN Temp |
GPU card temperature |
GPU card. N indicates the GPU card ID. The value ranges from 1 to 8. |
GPUN Power |
GPU card power |
|
GPUN HBM Temp |
GPU card HBM temperature |
GPU card, supported only by the V100 GPU card N indicates the GPU card ID. The value ranges from 1 to 8. |
FPGAN Temp |
FPGA card temperature |
FPGA card. N indicates the FPGA card ID. The value ranges from 1 to 8. |
FPGAN EnvTemp |
FPGA card operating temperature |
|
FPGAN DDR Temp |
FPGA card memory temperature |
|
FPGAN Power |
FPGA card power |
|
B# PM8053 Temp |
PM8053 temperature |
GS608 IT21SCUA board. B# indicates Board#. |
B# Outlet Temp |
Air outlet temperature (GPU board temperature) |
|
B# Inlet Temp |
Air inlet temperature (GPU board temperature) |
|
B# PCIeSWN Temp |
GPEA PCIe BridgeN temperature |
SW chip. N indicates the SW ID. The value ranges from 1 to 4. B# indicates Board#. |
IB# Temp |
Huawei 100G IB card temperature |
IB card. IB# indicates IB Board#. |
RTCBattery |
RTC battery status. The alarm threshold is 1 V. |
Mainboard |
DISKN |
Drive status |
Drive. N indicates the physical drive slot number.
|
Disks Temp |
Maximum drive smart temperature |
RAID controller card |
SSD DiskN Temp |
SSD temperature. |
SSD. N indicates the SSD ID. The value is 1 or 2. |
12V Start-Up |
24711 PwrGd is abnormal for three consecutive times. |
CPLD |
RAID Status |
RAID controller card health status |
RAID controller card |
RAID PCIe ERR |
RAID controller card health status fault diagnosis |
|
RAID Card BBU |
Avago SAS3508 daughter card and BBU |
|
DIMMN |
DIMM status |
DIMM. N indicates the DIMM slot number. |
UID Button |
UID button status |
Front panel of the general-purpose compute module |
Power |
Board power |
N/A. N indicates the component ID. |
ACPI State |
ACPI status |
|
CPUN Status |
CPU status detection |
|
CPUN Core |
CPU core faults |
|
PCH Status |
PCH chip health status fault diagnosis |
|
System Notice |
Hot restart reminder and fault diagnosis program information collection |
|
System Error |
System suspension or restart. Check the background logs. |
|
ACPI State |
ACPI status |
|
SysFWProgress |
Software processes and system startup errors |
|
SysRestart |
System restart causes |
|
Boot Error |
Boot error |
|
Watchdog2 |
Watchdog |
|
Mngmnt Health |
Health status of the management subsystem |
|
PwrOk Sig. Drop |
Voltage drop status |
|
PwrOn TimeOut |
Power-on timeout |
|
PwrCapStatus |
Power capping status |
|
FRU Hot Swap |
Hot swap event |
|
Power Status |
Power supply status |
|
PCIe Status |
PCIe status error |
|
CPU Usage |
CPU usage threshold |
|
Memory Usage |
Memory usage |
|
PS Redundancy |
Redundancy failure due to PSU removal |
|
PCIeCard GetInfo |
PCIe card information obtaining status |
|
Riser1 Pcie Info |
Riser card information obtaining status |
|
EX OutletN Temp |
Expansion air outlet temperature |
|
EX Inlet1 Temp |
Expansion air inlet temperature |
|
Eth Heartbeat |
Heartbeat status |