HUAWEI CLOUD Stack 6.5.0 Alarm and Event Reference 04

ALM-37025 Feature Vector Code Training Platform Is Unavailable

Description

This alarm is generated when the feature vector code training process deployed on the CMS node is abnormal.

Attribute

Alarm ID    Alarm Severity    Auto Clear
-----------------------------------------
37025       Major             Yes

Parameters

Name           Description
-----------------------------------------------------------------------------------
ServiceName    Specifies the name of the service for which the alarm is generated.
RoleName       Specifies the role for which the alarm is generated.

Impact on the System

If this alarm is generated, the code training platform in the cluster, which trains long feature codes into short feature codes, is unavailable. If this conversion fails, the retrieval service for short feature codes cannot be used, which makes database searches for codes with matching features less efficient.

Possible Causes

The code training platform process has not been started or has been stopped by another program, or the CMS node is faulty.

Procedure

Locate the alarm cause.

  1. Log in to the FusionInsight Manager.

    1. Log in to the ManageOne OM plane using a browser, then choose Alarms.
      • Login address: https://URL for the homepage of the ManageOne OM plane:31943. Example: https://oc.type.com:31943.
      • Default username: admin, default password: Huawei12#$.
    2. In the alarm list, locate and click the target alarm name in the Name column. The Alarm Details and Handling Recommendations dialog box is displayed.
    3. Locate the value in the IP Address/URL/Domain Name column, which is the floating IP address of the FusionInsight Manager.
    4. Log in to the FusionInsight Manager using a browser.
      • Login address: https://floating IP address of the FusionInsight Manager:28443/web. Example: https://10.10.192.100:28443/web.
      • Default username: admin, default password: obtain it from the system administrator.

  2. On FusionInsight Manager, choose Services > MPPDB > Service Configuration. Check the value of mppdb.cms.active.ip, the IP address of the active CMS node.
  3. Log in to the active CMS node as user omm, and run the following commands to configure environment variables and check the cluster status (provided that the cluster installation directory is /opt/huawei/Bigdata). Check whether the cluster status is Normal.

    Default user: omm, default password: Bigdata123@.

    source /opt/huawei/Bigdata/mppdb/.mppdbgs_profile

    gs_om -t status --detail

    [  CMServer State   ] 
      
     node              node_ip       instance                                          state 
     ------------------------------------------------------------------------------------------- 
     1  SZX1000071373  10.90.57.221  1    /opt/huawei/Bigdata/mppdb/cm/cm_server       Primary 
     2  SZX1000071374  10.90.57.222  2    /opt/huawei/Bigdata/mppdb/cm/cm_server       Standby 
      
     [   Cluster State   ] 
      
     cluster_state   : Normal
     redistributing  : No 
     balanced        : No  
    • If no, rectify the cluster fault by referring to "Fault Management" in the Product Documentation and go to 2.
    • If yes, go to 4.

  4. Run the following command to go to the simSearch/TrainServer/bin directory under the MPPDB installation directory.

    cd /opt/huawei/Bigdata/FusionInsight_MPPDB_V100R002C80SPC300/install/FusionInsight-MPPDB-2.8.0/simSearch/TrainServer/bin

  5. In the simSearch/TrainServer/bin directory, run the sh monitor_trainServer.sh status command.

    • If [monitor_trainServer.sh] process status normal is displayed, no further action is required.
    • If [monitor_trainServer.sh] process status abnormal is displayed, go to 6.

  6. Run the start_trainServer.sh script, then repeat 5 to check the process status.
  7. Wait for 3 minutes and check whether the alarm persists.

    • If yes, go to 8.
    • If no, no further action is required.
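The checks in this part of the procedure can be sketched as a small shell helper. This is only an illustration: the function names cluster_state, train_server_normal, and next_action are hypothetical and not part of any Huawei tool, and the sketch assumes the command output has the same layout as the examples shown in this section.

```shell
#!/usr/bin/env bash
# Sketch only: all function names here are hypothetical helpers.

# Print the value of the "cluster_state" field (for example "Normal")
# from `gs_om -t status --detail` output supplied on stdin.
cluster_state() {
    awk -F':' '/^[ \t]*cluster_state[ \t]*:/ && !seen {
        gsub(/[ \t\r]/, "", $2)   # strip padding around the value
        print $2
        seen = 1                  # report only the first match
    }'
}

# Succeed (exit status 0) only if the text on stdin contains
# "process status normal", as printed by monitor_trainServer.sh.
# "process status abnormal" does not match this pattern.
train_server_normal() {
    grep -q 'process status normal'
}

# Map the two observations to the next action in the procedure.
#   $1: cluster state string, $2: output of the training process status check
next_action() {
    local state="$1" train_status="$2"
    if [ "$state" != "Normal" ]; then
        echo "rectify-cluster"        # fix the cluster fault first
    elif printf '%s\n' "$train_status" | train_server_normal; then
        echo "none"                   # training process is healthy
    else
        echo "start-train-server"     # run start_trainServer.sh, then recheck
    fi
}

# Possible wiring on the active CMS node (commented out so the sketch
# has no side effects when sourced):
# state=$(gs_om -t status --detail | cluster_state)
# train=$(sh monitor_trainServer.sh status)
# next_action "$state" "$train"
```

The helpers read text from stdin or arguments rather than calling the cluster tools directly, so the decision logic can be tried against saved command output before it is used on a live node.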

Collect fault information.

  8. On FusionInsight Manager, choose System > Log Download.
  9. Select MPPDB from the Services drop-down list box and click OK.
  10. Set Start Time for log collection to 1 hour before the alarm generation time and End Time to 1 hour after the alarm generation time, and click Download.
  11. Contact technical support and send the collected logs.
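The log collection window (1 hour on either side of the alarm generation time) can be worked out with a short shell sketch. The function name log_window is hypothetical, and the sketch assumes GNU date (for the -d option) and a timestamp in a format that GNU date can parse.

```shell
#!/usr/bin/env bash
# Sketch only: log_window is a hypothetical helper, not a product command.

# Print two lines: the Start Time (alarm time minus 1 hour) and the
# End Time (alarm time plus 1 hour) for log collection.
#   $1: alarm generation time, for example "2019-08-30 12:00:00"
log_window() {
    local alarm_time="$1" epoch
    # Convert the alarm time to seconds since the epoch (GNU date).
    epoch=$(date -d "$alarm_time" +%s) || return 1
    date -d "@$((epoch - 3600))" '+%Y-%m-%d %H:%M:%S'   # Start Time
    date -d "@$((epoch + 3600))" '+%Y-%m-%d %H:%M:%S'   # End Time
}
```

For example, log_window "2019-08-30 12:00:00" prints 2019-08-30 11:00:00 followed by 2019-08-30 13:00:00, interpreted in the time zone of the shell that runs it.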

Alarm Clearing

After the fault is rectified, the system automatically clears this alarm.

Related Information

None

Updated: 2019-08-30

Document ID: EDOC1100062365
