No relevant resource is found in the selected language.

This site uses cookies. By continuing to browse the site you are agreeing to our use of cookies. Read our privacy policy>Search

Reminder

To have a better experience, please upgrade your IE browser.

upgrade

HUAWEI CLOUD Stack 6.5.0 Alarm and Event Reference 04

Rate and give feedback:
Huawei uses machine translation combined with human proofreading to translate this document to different languages in order to help you better understand the content of this document. Note: Even the most advanced machine translation cannot match the quality of professional translators. Huawei shall not bear any responsibility for translation accuracy and it is recommended that you refer to the English document (a link for which has been provided).
ALM-1316001 Hosts That Have Skipped Upgrade Are Left Unhandled

ALM-1316001 Hosts That Have Skipped Upgrade Are Left Unhandled

Description

upg-server periodically (once a day by default) checks whether there are faulty nodes for which upgrade operations are skipped. This alarm is generated when the current upgrade project has been submitted and a faulty node for which upgrade operations are skipped exists.

Attribute

Alarm ID

Alarm Severity

Auto Clear

1316001

Major

Yes

Parameters

Name

Meaning

Fault Location Info

host_id: specifies the ID of the host for which the alarm is generated.

Additional Info

  • hostname: specifies the name of the host for which the alarm is generated.
  • region_name: specifies the name of the region for which the alarm is generated.
  • Service: specifies the name of the service for which the alarm is generated.
  • ErrorInfo: provides error information related to the alarm.

Impact on the System

The system running is not adversely affected. However, the next upgrade is affected. Before the next upgrade, you must reinstall the ignored faulty node to prevent the failure of the next upgrade caused by residual data and service version mismatch.

Possible Causes

During the upgrade or rollback, if the upgrade or rollback fails due to a host hardware fault which cannot be rectified, the upgrade or rollback operations are skipped on the faulty node.

Procedure

  1. On the FusionSphere OpenStack web client, choose O&M > Capacity Expansion and set PXE Boot Hosts to ON.
  2. Manually start the faulty host, set its boot device to network during the startup process, and reinstall the host OS.

    To set the boot device to network, you can use the remote control function of the server BMC system or use a keyboard/video/mouse (KVM) to connect to the server.

    The installation takes about 10 to 15 minutes.

  3. View the installation progress on the Summary page on the FusionSphere OpenStack web client.

    The installation is complete if the progress reaches 100%.

  4. Select the new host and click Reboot.
  5. On the Service OM web client, choose Computing > ECS > Hosts and Hypervisors (on the host tab) to check whether nodes on which forcible operations are performed are in the resource isolation state after all such nodes are reinstalled and services are running properly.

    If yes, click More and choose De-isolate to cancel the resource isolation operation. If no, De-isolate is grey and cannot be clicked.

  6. On the FusionSphere OpenStack web client, choose O&M > Capacity Expansion and set PXE Boot Hosts to OFF.
  7. Use PuTTY to log in to a host as user fsp.

    The default password of user fsp is Huawei@CLOUD8.

    Run the su - root command to switch to user root. Enter the password of user root. The default password is Huawei@CLOUD8!.

    Press Enter and import OpenStack environment variables.

  8. Run the following command to cancel the ignored host:

    upgrade ext ignore-cancel --service-pkg-list "package1,package_version1" "package2,package_version2" --host-list host_id1,host_id2 --verify false

    NOTE:
    • Replace package1 and package_version1 with the service and version number involved in the upgrade. Obtain the fields following "ErrorInfo=ignored packages:" in Additional Info.
    • Replace host_id1 and host_id2 with the IDs of the hosts that are ignored.
    564DD0E0-7BE0-4479-FF87-CCDBAFBC25C5:/opt/fusionplatform/data/upgrade/upg-server # upgrade ext ignore-cancel --service-pkg-list "ceilometer,FUSIONSPHERE CEILOMETER 6.5.RC1.B040" "fusionplatform,FUSIONSPHERE FUSIONPLATFORM 6.5.RC1.B040" --host-list 564DCC89-DA70-5B6C-2A66-353A93556C70,564DFC2D-2195-A2F2-2EBF-2676F0AA459B --verify false
    +--------------------------------------+-----------------+----------------------+-------+
    | host-id                              | service-package | ignore-cancel-status | code  |
    +--------------------------------------+-----------------+----------------------+-------+
    | 564DCC89-DA70-5B6C-2A66-353A93556C70 | fusionplatform  | ignore_canceled      | 11213 |
    | 564DCC89-DA70-5B6C-2A66-353A93556C70 | ceilometer      | ignore_canceled      | 11213 |
    | 564DFC2D-2195-A2F2-2EBF-2676F0AA459B | fusionplatform  | ignore_canceled      | 11213 |
    | 564DFC2D-2195-A2F2-2EBF-2676F0AA459B | ceilometer      | ignore_canceled      | 11213 |
    +--------------------------------------+-----------------+----------------------+-------+

  9. Run the following command to check whether no information is displayed under host-list:

    upgrade ext ignore-query

    564DD0E0-7BE0-4479-FF87-CCDBAFBC25C5:/opt/fusionplatform/data/upgrade/upg-server # upgrade ext ignore-query
    +--------------------------------------+
    | host-list                            |
    +--------------------------------------+
    | 564DCC89-DA70-5B6C-2A66-353A93556C70 |
    | 564DFC2D-2195-A2F2-2EBF-2676F0AA459B |
    +--------------------------------------+
    • If yes, no further action is required.
    • If no, the ignored host exists. In this case, go to 8.

Translation
Download
Updated: 2019-08-30

Document ID: EDOC1100062365

Views: 35458

Downloads: 31

Average rating:
This Document Applies to these Products
Related Version
Related Documents
Share
Previous Next