No relevant resource is found in the selected language.

This site uses cookies. By continuing to browse the site you are agreeing to our use of cookies. Read our privacy policy>Search

Reminder

To have a better experience, please upgrade your IE browser.

upgrade

HUAWEI CLOUD Stack 6.5.0 Alarm and Event Reference 04

Rate and give feedback:
Huawei uses machine translation combined with human proofreading to translate this document to different languages in order to help you better understand the content of this document. Note: Even the most advanced machine translation cannot match the quality of professional translators. Huawei shall not bear any responsibility for translation accuracy and it is recommended that you refer to the English document (a link for which has been provided).
ALM-37019 Connection Between MPPDBServer Data Instances and GTM Is Abnormal

ALM-37019 Connection Between MPPDBServer Data Instances and GTM Is Abnormal

Description

This alarm is generated if:

  • The GTM instance is faulty.
  • The network of the machine housing the active GTM instance is faulty.
  • In synchronization mode, networks of machines housing both active and standby GTM instances are faulty.

Attribute

Alarm ID

Alarm Severity

Auto Clear

37019

Major

Yes

Parameters

Name

Meaning

ServiceName

Identifies the service for which the alarm is generated.

RoleName

Identifies the role for which the alarm is generated.

HostName

Identifies the host for which the alarm is generated.

Instance

Identifies the instance for which the alarm is generated.

Impact on the System

Before restoration of the active GTM instance, the system is unavailable for 120 seconds.

System Processing

  • If the active GTM instance is faulty for more than 120 seconds, the standby server will serve as the active one and the system recovers.
  • In synchronization mode, if networks of active and standby GTM instances are faulty, the cluster sets the active GTM instance to the highest availability mode 120 seconds later and the system recovers.

Possible Causes

  • The GTM instance is faulty.
  • The network of the machine housing the active GTM instance is faulty.
  • In synchronization mode, networks of machines housing both active and standby GTM instances are faulty.

Procedure

Locate the alarm cause.

  1. Log in to the FusionInsight Manager.

    1. Log in to the ManageOne OM plane using a browser, then choose Alarms.
      • Login address: https://URL for the homepage of the ManageOne OM plane:31943. Example: https://oc.type.com:31943.
      • Default username: admin, default password: Huawei12#$.
    2. In the alarm list, locate and click the target alarm name in the Name column. The Alarm Details and Handling Recommendations dialog box is displayed.
    3. Locate the value in the IP Address/URL/Domain Name column, which is the float IP address of the FusionInsight Manager.
    4. Log in to the FusionInsight Manager using a browser.
      • Login address: https://float IP address of the FusionInsight Manager:28443/web. Example: https://10.10.192.100:28443/web.
      • Default username: admin, default password: obtain it from the system administrator.

  2. On FusionInsight Manager, choose Services > MPPDB > Instances, and obtain the nodes where the MPPDB instance residies.
  3. Log in to any MPPDB instance node as user omm and run the source command to configure the environment variables and the gs_om -t status --detail command to check the cluster status (provided that the cluster installation directory is /opt/huawei/Bigdata).

    Default user: omm, default password: Bigdata123@.

    source /opt/huawei/Bigdata/mppdb/.mppdbgs_profile

    gs_om -t status --detail

    As shown in the following information, if the instance status in GTM State is P in the query result of the cluster status, the instance is the active GTM instance.

    [     GTM State     ]
    
    node              node_ip       instance                               state                    sync_state
    ------------------------------------------------------------------------------------------------------------
    2  SZX1000071374  10.90.57.222  1001 /opt/huawei/Bigdata/mppdb/gtm     P Primary Connection ok  Sync
    1  SZX1000071373  10.90.57.221  1002 /opt/huawei/Bigdata/mppdb/gtm     S Standby Connection ok  Sync
    

  1. Check whether the active GTM instance is faulty. Log in to the node where the active GTM instance resides as user omm and run the following commands to check whether the instance is in the normal status.

    Default user: omm, default password: Bigdata123@.

    source /opt/huawei/Bigdata/mppdb/.mppdbgs_profile

    gtm_ctl query -D /opt/huawei/Bigdata/mppdb/gtm

    In this example, the data directory of the active GTM instance is /opt/huawei/Bigdata/mppdb/gtm.

    HA state:
            server_mode                   : Primary
            connection_state              : Connection ok
            global_transaction_id         : 16471
            sync_mode                     : Sync on
     Sync state:
            message_send_count            : 0
            message_receive_count         : 0
    • If the query result is inconsistent with the preceding information, the active GTM instance is faulty. Locate causes by analyzing logs of the instance and other monitoring instances. Then go to 5.
    • If the query result shows that the network connection between active and standby machines fails, check whether networks of active and standby machines are normal and rectify network faults in time. Then, no further action is required.

Collect fault information.

  1. On FusionInsight Manager, choose System > Log Download.
  2. Select MPPDB from the Services drop-down list box and click OK.
  3. Set Start Time for log collection to 1 hour ahead of the alarm generation time and End Time to 1 hour after the alarm generation time, and click Download.
  4. Contact Technical Support and send the collected logs.

Alarm Clearing

After the fault is rectified, the system automatically clears this alarm.

Related Information

None

Translation
Download
Updated: 2019-08-30

Document ID: EDOC1100062365

Views: 49125

Downloads: 33

Average rating:
This Document Applies to these Products
Related Version
Related Documents
Share
Previous Next