FusionStorage V100R006C30 Block Storage Service Troubleshooting Guide 05

Standby FSM GaussDB Database Failed

Symptom

FSM nodes are deployed in active/standby mode. In the alarm list on FusionStorage Block Self-Maintenance Platform, the alarm FSM Resources Are Abnormal is generated for the resource named gaussDB and is not automatically cleared for a long period of time.

Possible Causes

Because the amount of data is large, the GaussDB database on the standby FSM node fails to be automatically restored. As a result, the GaussDB process on the standby FSM node remains abnormal.

Procedure

    1. Use PuTTY to log in to the standby FSM node.

      Ensure that the management IP address and username dsware are used to establish the connection.

      If the public and private keys are used to authenticate the login, perform the operations based on Using PuTTY to Log In to a Node in Key Pair Authentication Mode in the FusionStorage Block Storage Service Administrator Guide.

      NOTE:
      The default passwords of users dsware and root are IaaS@OS-CLOUD9! and IaaS@OS-CLOUD8!, respectively.

    2. Run the following command and enter the password of user root to switch to user root:

      su - root

    3. Run the following command to check the running status of the GaussDB database:

      sh /opt/dsware/manager/setup/forCommonServer/checkFSMStatus.sh | grep gaussDB

      Information similar to the following is displayed:
      DSM01           gaussDB           Active_normal          Normal              Active_standby   
      DSM02           gaussDB           Repairing              Exception           Active_standby  
      
      In the command output, Active_normal indicates that DSM01 is the active FSM node, so DSM02 is the standby FSM node. Repairing and Exception indicate the running status of the GaussDB database on the standby FSM node.

    4. Run the following command to check the failure cause of the GaussDB database:

      cat /var/log/omm/oms/db/gs_ctl-current.log | grep WalSegmentRemoved

      Check whether information similar to the following is displayed:
      ...
      WalSegmentRemoved

    5. Use PuTTY to log in to the active FSM node.

      Ensure that the management IP address and username dsware are used to establish the connection.

      If the public and private keys are used to authenticate the login, perform the operations based on Using PuTTY to Log In to a Node in Key Pair Authentication Mode in the FusionStorage Block Storage Service Administrator Guide.

    6. Run the following command and enter the password of user root to switch to user root:

      su - root

    7. Run the following command to check the default value of wal_keep_segments:

      cat /opt/omm/oms/gaussdb/data/postgresql.conf | grep wal_keep_segments

      If information similar to the following is displayed, the default value of wal_keep_segments is 16.
      wal_keep_segments = 16

    8. Run the following command to check the available disk space of the GaussDB data partition:

      df -h /opt

      Information similar to the following is displayed:
      Filesystem            Size   Used   Avail   Use%  Mounted on
      /dev/vda12            350G   200G   150G    57%    /opt
      
      In the command output, the Avail value specifies the available disk space of the GaussDB data partition, for example, 150 GB.

    9. Run the following command to check the data directory size of the GaussDB database on the active FSM node:

      du -sh /opt/omm/oms/gaussdb/data

      If information similar to the following is displayed, the data directory size of the GaussDB database is 189 GB.
      189G	/opt/omm/oms/gaussdb/data

    10. Estimate the wal_keep_segments value based on the following formula:

      Data directory size (GB) x 1.5 (1/GB)

      For example, if the data directory size is 189 GB, 189 x 1.5 = 283.5, which is rounded up to 300 in the examples in this procedure.
      • If Estimated wal_keep_segments x 16 + 100 (MB) is less than the available disk space of the GaussDB data partition (MB), use the estimated value for wal_keep_segments.
      • Otherwise, calculate the wal_keep_segments value based on the following formula:

        (Available disk space of GaussDB data partition (MB) - 100)/16

    11. Run the following command to change the wal_keep_segments value:

      su - ommdba -c "gs_guc reload -c wal_keep_segments=Value of wal_keep_segments"

      Use the wal_keep_segments value calculated in 10.

      Example: su - ommdba -c "gs_guc reload -c wal_keep_segments=300"

      If information similar to the following is displayed, the value is successfully changed.
      gs_guc reload: wal_keep_segments=300
      server signaled
      

    12. On FusionStorage Block Self-Maintenance Platform, choose Monitoring > Alarms to check whether the alarm is automatically cleared.

      • If yes, go to 13.
      • If no, wait until the alarm is automatically cleared and go to 13.

    13. Run the following command to restore the default value of wal_keep_segments:

      su - ommdba -c "gs_guc reload -c wal_keep_segments=Default value of wal_keep_segments"

      The default value of wal_keep_segments is obtained in 7.

      Example: su - ommdba -c "gs_guc reload -c wal_keep_segments=16"

      If information similar to the following is displayed, the value is successfully changed.
      gs_guc reload: wal_keep_segments=16
      server signaled
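
The estimate in 10 can be sketched as a small shell function. This is an illustrative sketch, not part of the FusionStorage tooling: the function name estimate_wal_keep_segments and its two arguments are assumptions made for this example. It takes the data directory size in GB (from 9) and the available space of the GaussDB data partition in MB (from 8), both as whole numbers. The sketch rounds the estimate up to the next whole number, whereas the worked example in this procedure uses 300 for a 189 GB data directory.

```shell
#!/bin/sh
# Hypothetical helper, not shipped with FusionStorage.
# $1: GaussDB data directory size in GB (whole number, see 9)
# $2: available space of the GaussDB data partition in MB (whole number, see 8)
estimate_wal_keep_segments() {
    data_gb=$1
    avail_mb=$2

    # Data directory size (GB) x 1.5, rounded up to a whole number.
    # Integer arithmetic: ceil(1.5 * n) = (3n + 1) / 2.
    estimate=$(( (data_gb * 3 + 1) / 2 ))

    # Space check from 10: estimate x 16 + 100 (MB) must be less than
    # the available space of the data partition (MB).
    if [ $(( estimate * 16 + 100 )) -lt "$avail_mb" ]; then
        echo "$estimate"
    else
        # Fallback formula from 10: (available space in MB - 100) / 16,
        # rounded down by integer division.
        echo $(( (avail_mb - 100) / 16 ))
    fi
}
```

With the sample values from 8 and 9 (189 GB data directory, 150 GB = 153600 MB available), estimate_wal_keep_segments 189 153600 prints 284. If only 2000 MB were available, the space check would fail and the fallback formula would give 118. The sketch assumes at least 100 MB of available space; smaller values are not handled.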
      

Related Information

None

Updated: 2019-09-09

Document ID: EDOC1100027999
