No relevant resource is found in the selected language.

This site uses cookies. By continuing to browse the site you are agreeing to our use of cookies. Read our privacy policy>

Reminder

To have a better experience, please upgrade your IE browser.

upgrade

Event:CAT error detected in the x86 OS, Severity:Assertion Critical, Event Code:0x0700ffff

Publication Date:  2015-10-30 Views:  167 Downloads:  0
Issue Description

 

One server reported  the following warning : Sensor:CPU1 Status, Event:CAT error detected in the x86 OS, Severity:Assertion Critical, Event Code:0x0700ffff

 

Solution

When this kind of error is reported we have to check information about  memory modules or cpu or mainboard, one or two from them could be faulty. 

In most of cases only one or two memory modules are faulty, after the repalcement the CAT error will not be present, as I said in most of cases not in all the cases.

In  my case As you can see in bellow table was nothnig wrong with memory modules according with logs files received from that server:


DIMM000          | 0x0        | discrete   | 0x8040| na        | na        | na        | na      | na         | na         | na    | na   
DIMM001          | 0x0        | discrete   | 0x8000| na        | na        | na        | na      | na         | na         | na    | na   
DIMM002          | 0x0        | discrete   | 0x8000| na        | na        | na        | na      | na         | na         | na    | na   
DIMM010          | 0x0        | discrete   | 0x8040| na        | na        | na        | na      | na         | na         | na    | na   
DIMM011          | 0x0        | discrete   | 0x8000| na        | na        | na        | na      | na         | na         | na    | na   
DIMM012          | 0x0        | discrete   | 0x8000| na        | na        | na        | na      | na         | na         | na    | na   
DIMM020          | 0x0        | discrete   | 0x8040| na        | na        | na        | na      | na         | na         | na    | na   
DIMM021          | 0x0        | discrete   | 0x8000| na        | na        | na        | na      | na         | na         | na    | na   
DIMM022          | 0x0        | discrete   | 0x8000| na        | na        | na        | na      | na         | na         | na    | na   
DIMM030          | 0x0        | discrete   | 0x8040| na        | na        | na        | na      | na         | na         | na    | na   
DIMM031          | 0x0        | discrete   | 0x8000| na        | na        | na        | na      | na         | na         | na    | na   
DIMM032          | 0x0        | discrete   | 0x8000| na        | na        | na        | na      | na         | na         | na    | na   
DIMM100          | 0x0        | discrete   | 0x8040| na        | na        | na        | na      | na         | na         | na    | na   
DIMM101          | 0x0        | discrete   | 0x8000| na        | na        | na        | na      | na         | na         | na    | na   
DIMM102          | 0x0        | discrete   | 0x8000| na        | na        | na        | na      | na         | na         | na    | na   
DIMM110          | 0x0        | discrete   | 0x8040| na        | na        | na        | na      | na         | na         | na    | na   
DIMM111          | 0x0        | discrete   | 0x8000| na        | na        | na        | na      | na         | na         | na    | na   
DIMM112          | 0x0        | discrete   | 0x8000| na        | na        | na        | na      | na         | na         | na    | na   
DIMM120          | 0x0        | discrete   | 0x8040| na        | na        | na        | na      | na         | na         | na    | na   
DIMM121          | 0x0        | discrete   | 0x8000| na        | na        | na        | na      | na         | na         | na    | na   
DIMM122          | 0x0        | discrete   | 0x8000| na        | na        | na        | na      | na         | na         | na    | na   
DIMM130          | 0x0        | discrete   | 0x8040| na        | na        | na        | na      | na         | na         | na    | na   
DIMM131          | 0x0        | discrete   | 0x8000| na        | na        | na        | na      | na         | na         | na    | na   
DIMM132          | 0x0        | discrete   | 0x8000| na        | na        | na        | na      | na         | na         | na    | na 



DIMM status:

0x8000: indicates that no DIMM is installed.

0x8040: indicates that a DIMM is detected and operating properly.

0x8080: indicates that a DIMM is not detected and is faulty.

0x80C0: indicates that a DIMM is detected but is faulty.


After i checked memory modules and i saw that everythnig is ok related to DIMMs i knew it that is no need to replace  memory modules fro mthat server

Next Step :

 I checked  details about CPUs and i found somethnig abnormal in fdm.log :

[Harware Error Log]:NO.3 collect:bios(boot) time: 2015-10-06 10:12:42 flag:0x01
CPU:0 (socket:CPU1) core:Uncore LogType:MCA BANK17 (CBo/LLC 0) MCA mode:Legacy IA-32 MCA
Error type:Uncorrected Errors-Catastrophic1/Fatal MCACODE:0x110a (Generic cache Level-2 Generic error) MSCODE:0x0019(REQ_RTID_TABLE_MISS_NON_DATA)

------dump reg:------

msr_mcg_contain: 0x0000000000000000
ia32_mcg_status: 0x0000000000000000
ia32_mcg_cap: 0x0000000001000c1d
error_control: 0x0000000000000000
ia32_mci_ctl: 0x00000000000ffbff

According with these information from above  the CPU1 should be repalced asap.

After the  CPU1 was replaced the CAT error did't occured on that server.

For this case the faulty part was CPU1 and not mainboard or one are two  memory modules. The CPU replacement sorted out this issue.

 

 

END