FusionStorage 8.0.0 Block Storage Parts Replacement 04

Replacing a PCIe Card on a Riser Card of a 2288H V5 Node or 5288 V5 Node

Storage nodes support 2-port 10GE or 25GE PCIe cards, 4-port 25GE RoCE PCIe cards, 2-port 56Gb IB PCIe cards, and 2-port 100Gb IB PCIe cards.

Impact on the System

Before replacing a PCIe card on a storage node, you must power off the node. This will adversely affect the services running on the node.

Prerequisites

  • A spare PCIe card is ready.
  • The faulty PCIe card has been located.

Precautions

None

Tools and Materials

  • Flat-head screwdrivers
  • Phillips screwdrivers
  • ESD gloves
  • ESD wrist straps
  • ESD bags
  • Labels

Procedure

  1. Log in to the primary management node as user dsware, and run the sh /opt/dsware/client/bin/dswareTool.sh --op setServerStorageMode -ip <management IP address of the faulty node> -mode 1 command to switch the node to maintenance mode. When prompted, enter the user name and password of the CLI super administrator account admin.
  2. Remove the faulty node from its cabinet, and remove its chassis cover. For details, see Basic Operations.
  3. Remove the riser card.

    Loosen the screw on the riser card and lift the riser card, as shown in the following figures.

    Figure 23-1 Removing a riser card of a 2288H V5 node
    Figure 23-2 Removing a riser card of a 5288 V5 node

  4. Remove the PCIe card from the riser card.

    1. Horizontally push the latch of the PCIe card outward to unlock it, as shown in step 1 in Figure 23-3.
      Figure 23-3 Removing the PCIe card from a riser card
    2. Rotate the PCIe card latch to open it, as shown in step 2 in Figure 23-3.
    3. Pull out the PCIe card, as shown in step 3 in Figure 23-3.

  5. Place the removed PCIe card in an ESD bag.
  6. Take the spare PCIe card out of its ESD bag.
  7. Install the PCIe card on the riser card.

    1. Insert the PCIe card into the PCIe slot, as shown in step 1 in Figure 23-4.
      Figure 23-4 Installing a PCIe card on the riser card
    2. Close the latch on the PCIe slot, as shown in step 2 in Figure 23-4.
    3. Some GPUs need to connect to dedicated power cables. If the PCIe card is a GPU, select a power cable connection method based on the GPU model.

      The power cable connection methods are as follows:

      • Using no power cable
      • Using only a 6-pin power cable

        Connect the 6-pin connector on the GPU to an 8-pin connector on either side of the riser card, as shown in step 1 in Figure 23-5.

        Figure 23-5 Connecting power cables to a GPU
      • Using an 8-pin power cable

        Connect the 8-pin connector on the GPU to an 8-pin connector on either side of the riser card, as shown in step 2 in Figure 23-5.

      • Using an 8-pin (one-to-two) power cable

        Connect the 8-pin connector on the GPU to the 8-pin connectors on both sides of the riser card, as shown in step 3 in Figure 23-5.

  8. Install the riser card.

    Install the riser card and tighten the screw, as shown in the following figures.
    Figure 23-6 Installing a riser card of a 2288H V5 node
    Figure 23-7 Installing a riser card of a 5288 V5 node

  9. Install the chassis cover, and put the node back into the cabinet. For details, see Basic Operations.
  10. Log in to the primary management node as user dsware, and run the sh /opt/dsware/client/bin/dswareTool.sh --op setServerStorageMode -ip <management IP address of the faulty node> -mode 0 command to switch the node back to normal mode. When prompted, enter the user name and password of the CLI super administrator account admin.
  11. Connect the storage node to the power sockets and peripheral devices according to the cable connection labels. Ensure that all cables are correctly connected, and then power on the storage node. If the storage node does not power on automatically, press the power button on the front panel.
  12. Check whether the PCIe card is successfully installed.

    Log in to the iBMC page on the storage node. Choose Information > System Info > Other Devices. Query the information about the corresponding part and check whether the installation is successful.
    • If information about the PCIe card is displayed on the iBMC page, the PCIe card is successfully installed.
    • If information about the PCIe card is not displayed on the iBMC page, the PCIe card is not installed correctly. Reinstall it. If information about the PCIe card still cannot be obtained, contact Huawei technical support.

  13. If the replaced card is a 10GE or 25GE RoCE NIC (CX4_Lx NIC), query its GUID information. If the GUID is empty, add it.

    1. Run the mst start command to load the mst firmware management tool.

    2. Run the mst status command to query the CX4 device information.

      In the command output, check for entries of the form /dev/mst/mt4117_pciconfx, where x is the number of a CX4_Lx NIC. If no /dev/mst/mt4117_pciconfx entry is returned, there are no CX4_Lx NICs in the server.

    3. For each MST device found in step 13.2, run the flint -d x.x.x.x q command to view detailed information about the CX4_Lx NIC. In the command, x.x.x.x is the MST device obtained in step 13.2 (for example, /dev/mst/mt4117_pciconf0).

      Check whether the value of Base GUID is N/A, and repeat this check for every CX4_Lx NIC. If the Base GUID of any NIC is N/A, go to step 13.4. Otherwise, proceed to step 14.

    4. If the value of Base GUID is N/A, you need to manually add GUID information.
      1. Obtain the GUID. The following is an example:

        Suppose that the value of Base MAC obtained in step 13.3 is 28:41:c6:fb:a0:39. The GUID is formed by inserting the last two bytes of the Base MAC value (a0 and 39) after its first three bytes, which gives 28:41:c6:a0:39:fb:a0:39. Therefore, the GUID to be manually added is 2841c6a039fba039.

      2. Run the flint -d /dev/mst/mt4117_pciconf0 -ocr --guid 0x<GUID> sg command to load the GUID, where <GUID> is the GUID to be manually added (2841c6a039fba039 in the preceding example).

      For SLOT NICs, repeat steps 13.3 and 13.4.
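The GUID derivation described in this step can be sketched as a small POSIX shell function. This is only an illustration of the rule given in the example (insert the last two bytes of the Base MAC after its first three bytes); the function name guid_from_base_mac is hypothetical and is not part of the flint or mst tools.

```shell
#!/bin/sh
# Hypothetical helper (not part of flint or mst): derive the GUID from a
# Base MAC by inserting the last two bytes of the MAC after its first
# three bytes, as in the example 28:41:c6:fb:a0:39 -> 2841c6a039fba039.
guid_from_base_mac() {
    # Strip the colon separators and normalize to lowercase.
    mac=$(printf '%s' "$1" | tr -d ':' | tr 'A-F' 'a-f')

    # Reject anything that is not exactly 12 hexadecimal digits.
    case "$mac" in
        *[!0-9a-f]*) echo "invalid MAC: $1" >&2; return 1 ;;
    esac
    [ "${#mac}" -eq 12 ] || { echo "invalid MAC: $1" >&2; return 1; }

    first3=$(printf '%s' "$mac" | cut -c1-6)    # first three bytes
    last3=$(printf '%s' "$mac" | cut -c7-12)    # last three bytes
    last2=$(printf '%s' "$mac" | cut -c9-12)    # last two bytes

    printf '%s%s%s\n' "$first3" "$last2" "$last3"
}

guid_from_base_mac "28:41:c6:fb:a0:39"   # prints 2841c6a039fba039
```

The printed value is the hexadecimal string that follows 0x in the flint command above. Always compare it against the Base MAC reported by flint before writing it to the NIC.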

    5. After the GUID is added, power off and then power on the server. Do not run the reboot command to restart the system. Instead, forcibly power off and then power on the server on iBMC, or remove and then reinsert the power plug. After the system restarts, repeat steps 13.1, 13.2, and 13.3 to check whether the GUID has taken effect.

      If the GUID is loaded successfully, the value of Base GUID in the flint query output is the GUID that you added instead of N/A.

  14. Check the system status.

    On SmartKit, choose Home > Storage > Routine Maintenance > More > Inspection and check the system status.
    • If all inspection items pass the inspection, the inspection is successful.
    • If some inspection items fail, the inspection fails. Rectify the faults by taking recommended actions in the inspection reports. Perform inspection again after fault rectification. If the inspection still fails, contact Huawei technical support.

    For details, see the FusionStorage Block Storage Administrator Guide.
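For the CX4_Lx check in step 13, the device lookup can be sketched as a shell filter over the mst status output. This is a minimal sketch that assumes only what the step states, namely that CX4_Lx NICs appear as /dev/mst/mt4117_pciconfx entries; the function name list_cx4lx_devices is hypothetical.

```shell
#!/bin/sh
# Hypothetical helper: read `mst status` output on stdin and print each
# CX4_Lx MST device path (/dev/mst/mt4117_pciconfN) once, one per line.
# Prints nothing if the output contains no such entry.
list_cx4lx_devices() {
    grep -o '/dev/mst/mt4117_pciconf[0-9][0-9]*' | sort -u
}

# Example use on a node where the mst tools are installed:
#   mst start
#   mst status | list_cx4lx_devices
```

Each printed path can then be passed to flint -d as described in step 13. An empty result corresponds to the case in which the server has no CX4_Lx NICs.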

Follow-up Procedure

Label the replaced PCIe card to facilitate subsequent operations.

Updated: 2019-09-19

Document ID: EDOC1100081420
