Huawei Server Maintenance Manual 09

Locate the Slot of a Faulty DIMM by Using the MISC Register Value in the FDM Logs

Problem Description

Table 5-257 Basic information

Item                     Information
Source of the Problem    KunLun 9016
Intended Product         KunLun 9008/9016/9032
Release Date             2018-03-01
Keyword                  DIMM slot

Symptom

On a KunLun 9000 series server, the FDM logs report an error during memory inspection but do not identify the slot of the faulty DIMM.

Key Process and Cause Analysis

When this problem occurs, use the CPU socket number, the DIMM channel number, and the MISC register value to locate the slot of the faulty DIMM.

The FDM error record in Figure 1 shows that the faulty DIMM belongs to CPU socket CPU3 (the third physical CPU) and that the channel is IMC 0 CH3 (the last channel of the first memory board on the CPU board). Each CPU has two IMCs: IMC 0 corresponds to the first memory board, and IMC 1 corresponds to the second memory board.

The memory channels on an IMC are described as follows:

J01 to J03: Channel 0

J04 to J06: Channel 1

J07 to J09: Channel 2

J10 to J12: Channel 3
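The channel list above follows a regular pattern: each channel covers three consecutive J slots. As a minimal sketch of that pattern, the following Python helper returns the slot labels for a given channel. The function name and constant are hypothetical and chosen only for illustration; they are not part of any Huawei tool.

```python
DIMMS_PER_CHANNEL = 3  # each channel in the list above spans three J slots


def slots_for_channel(channel: int) -> list[str]:
    """Return the slot labels (for example 'J10') that belong to one IMC channel."""
    if not 0 <= channel <= 3:
        raise ValueError("channel must be in the range 0 to 3")
    first = channel * DIMMS_PER_CHANNEL + 1
    return [f"J{n:02d}" for n in range(first, first + DIMMS_PER_CHANNEL)]


print(slots_for_channel(3))  # ['J10', 'J11', 'J12']
```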

  1. DIMM channel:

    Use bits 50 to 46 of the MISC register to locate the faulty rank on the channel.

  2. MISC register:

    The FDM log displayed in Figure 1 is used as an example. The decimal value of bits 50 to 46 in the MISC register is 7, indicating that the faulty rank is rank 7.

  3. Register parsing:

    The mapping between the ranks and DIMM positions is as follows:

    3DPC:

    Ranks 0 to 3: DIMM1

    Ranks 4 to 5: DIMM2

    Ranks 6 to 7: DIMM3

    2DPC:

    Ranks 0 to 3: DIMM1

    Ranks 4 to 7: DIMM2

    In this case, all 12 DIMMs are inserted (three DIMMs on each of the four channels). Therefore, the DIMM configuration is 3DPC, and the DIMM corresponding to rank 7 is DIMM3. Use the \one_touch_info_all\summaryinfo.txt file of the KunLun server to locate the DIMM slot.

  4. Faulty DIMM position:

    Channel 3 covers slots J10 to J12, and DIMM3 is the third DIMM on that channel. Based on the CPU socket number, memory board slot number, and DIMM channel number, the faulty DIMM is therefore in the J12 slot on the first memory board of the third CPU.
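The register-parsing steps above can be sketched in Python as follows. This is an illustration under stated assumptions, not a Huawei utility: the function and constant names are hypothetical, and the example register value is made up so that bits 50 to 46 hold 7. It is not the value from the FDM log in Figure 1.

```python
RANK_SHIFT = 46   # lowest bit of the rank field (bits 50 to 46)
RANK_MASK = 0x1F  # the field is five bits wide

# Rank ranges per DIMM position, copied from the mapping in step 3.
RANK_TO_DIMM = {
    3: [(range(0, 4), 1), (range(4, 6), 2), (range(6, 8), 3)],  # 3DPC
    2: [(range(0, 4), 1), (range(4, 8), 2)],                    # 2DPC
}


def rank_from_misc(misc: int) -> int:
    """Extract bits 50 to 46 of a MISC register value as a decimal rank."""
    return (misc >> RANK_SHIFT) & RANK_MASK


def dimm_for_rank(rank: int, dpc: int) -> int:
    """Map a rank to its DIMM position (1-based) for a 2DPC or 3DPC layout."""
    if dpc not in RANK_TO_DIMM:
        raise ValueError("dpc must be 2 or 3")
    for ranks, dimm in RANK_TO_DIMM[dpc]:
        if rank in ranks:
            return dimm
    raise ValueError(f"rank {rank} is not covered by the {dpc}DPC mapping")


# Made-up register value: rank field set to 7, with unrelated low bits set
# to show that the mask isolates only bits 50 to 46.
example_misc = (7 << 46) | 0x86
rank = rank_from_misc(example_misc)
print(rank, dimm_for_rank(rank, dpc=3))  # 7 3
```

With rank 7 in a 3DPC configuration the result is DIMM3, which on channel 3 (J10 to J12) is slot J12, matching the case above. In a 2DPC configuration the same rank would map to DIMM2 instead.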

Conclusion and Solution

None

Note

None

Updated: 2019-02-25

Document ID: EDOC1000041338
