Maintenance and Monitoring (Management Plane)
- About iMaster NCE-Campus
- Starting iMaster NCE-Campus
- Stopping iMaster NCE-Campus
- System Monitoring
- Backup and Restore
- Configuring Products
- Network Configuration
- Alarm Configuration
- Software Management
- Election Service Management
- Task Management
- O&M Management
- Security Management
- Log Management
- Replacing Certificates
- Suggestions on Certificate Making
- Southbound Certificates
- Northbound Certificates
- Configuring CA Server Parameters
- Configuring CRL Server Parameters
- Updating CA Certificates
- Updating IR Certificates
- Updating ER Certificates
- Updating the Trust Certificate of the Syslog Server (the Management Plane)
- Updating the Certificates of User Management (the Management Plane)
- Updating Certificate Revocation Lists
- Updating DR System Certificates
- Configuring Trust Certificates of Browsers
- Replacing the SP Certificate
- Replacing the IdP Certificate
- Replacing the Token Signature Certificate
- Replacing the FusionInsight CA Certificate
- Port Certificates Dependent on IR Certificates
- Internal Certificates
- FAQ
- Key Management
- Disk Management
- FAQs
- Hardware Devices
- FusionCompute
- Operating System
- iMaster NCE-Campus System
- Querying the Version Number of the Management Plane
- Checking the Certificate Validity Period (the Management Plane)
- Querying the MAC Address of a Node
- How Do I Check the Active/Standby Status of a Node?
- How Do I Query the IP Address of the Node Where a Service Resides?
- How Do I Query the IP Address of the Node Where a Database Instance Resides?
- How Do I Query the IP Address of a Node?
- How Do I Query the Floating IP Address of a Node?
- How Do I Query the Node Name Corresponding to the IP Address of the Management Node?
- How Do I Change the IP Addresses and Domain Names After iMaster NCE-Campus Is Installed?
- How Do I Determine the Deployment Mode of Nodes?
- How Do I Check Whether Management Nodes and Product Nodes Use the Same Database Software?
- How Do I Query SFTP Fingerprint Authentication Information?
- How Do I Create a Backup Path for a Backup Server?
- How Do I Manually Authenticate the SFTP Fingerprint?
- How Do I Manually Cancel or Configure Port 22 to Listen to IP Addresses?
- How Do I Set the IP Address and Port to Be Listened to When iMaster NCE-Campus Functions as the SNMP Client to Access Devices?
- Updating ER Certificates of the Management Plane in CLI Mode
- Updating ER Certificates of the Service Plane in CLI Mode
- CA Certificate
- Restoring the ER or IR Certificates That Failed to Be Updated
- Updating the User Management Certificate in CLI Mode (the Management Plane)
- Notifications
- How Do I Obtain the Public Key Fingerprint of the Server?
- How Do I Obtain the Public Key Fingerprint of the SFTP Server?
- How Do I Obtain the Public Key of the Third-Party SFTP Server?
- How Do I Enable or Disable the Data Import Function for Fast Fault Demarcation?
- How Do I Change the Database Instance Password? (Product Node)
- How Do I Configure SSL Encryption Algorithms for GaussDB 100 V1?
- Common OMMHA Operations
- Modifying the VM Startup Mode and Restarting the VM
- Modifying the Startup Mode of the Physical Machine and Restarting the Physical Machine
- How Do I Log In to the Remote Console?
- How Do I Check the Deployment Status of a Product?
- How Do I Install the RPM Component Package That Is Not Installed by Default on the OS?
- Encrypting the Private Key of the Signature Certificate (the Management Plane)
- Managing Passwords in the OS Weak Password Dictionary
- Forcibly Stopping Tasks for Backing Up Product Data
- Modifying Configuration Parameters After a Floating IP Address Is Added or Deleted
- How Do I Enable or Disable the SSH RekeyLimit Function?
- How Do I Obtain the IP Addresses of OMP and Service Nodes?
- How Do I Resolve the Device Data Receiving Problem in the Remote Recovery Scenario?
- Common Operations
- Logging In to a Server Using PuTTY
- How Do I Prevent PuTTY from Being Disconnected upon Timeout?
- Transferring Files Using FileZilla
- Converting the Certificate Format
- Encrypting the Private Key of the Signature Certificate (the Service Plane)
- Obtaining Documents of Common Third-Party Software and Hardware
- Querying the Product Name
About iMaster NCE-Campus
After iMaster NCE-Campus is installed, you can maintain it through the management plane(iMaster NCE-Campus-OMP) and implement service functions through the service plane.
Concepts
GUI Type |
Intended Audience |
Function Description |
Address |
---|---|---|---|
Management plane |
|
Centrally manages iMaster NCE-Campus software resources, applications, and databases to implement functions such as installation and deployment, system monitoring, system maintenance (including user and password management, data backup and restore, certificate and key management), and system troubleshooting (including system health check and disaster recovery). |
https://management plane IP address:18102 |
service plane |
|
Provides system settings (including license management and interconnection with southbound systems), security management (including user and permission management, security policies, and log management), alarm management, premium private line provisioning, key service assurance (KSA), network survivability analysis, OVPN, network performance analysis for the following O&M scenarios on the network:
|
https://service plane IP address:31943 |
Table 3-2 describes the common concepts related to iMaster NCE-Campus management and maintenance.
Concept |
Description |
---|---|
Management node |
Node where the management plane service is deployed. |
Product node or service nodes |
A collection of nodes where services or databases of a product are deployed. |
Database node |
Node where a database is deployed. |
Product |
A collection of services, excluding the management plane service, deployed on a server. |
Service |
A collection of features that provide related functions. A service may have one or multiple instances. |
Instance |
A collection of a running program in the system and the resources allocated to the program. An instance may have one or multiple processes. |
Process |
An execution action performed by a program with certain functions on a collection of data. It is the basic unit for resource allocation and scheduling. |
Single-server |
Services are deployed on a node. All the management plane and product functions are provided by the node. |
Cluster |
Services are deployed on multiple nodes. All the management plane or product functions are provided by the nodes in a collaborative manner. |
iMaster NCE-Campus Supports distributed, centralized and compact scenarios. For details about the main differences between Distributed, centralized and compact scenarios, see Table 3-3. This document describes Distributed, centralized and compact scenarios. Select the corresponding content based on the actual scenario.
- Distributed: iMaster NCE-Campus consists of at least two of Manager, Controller, and Analyzer.
- Centralized: iMaster NCE-Campus consists of only Manager.
- Compact: iMaster NCE-Campus consists of at least two of Manager, Controller, and Analyzer. In addition, to reduce resource usage, service nodes are not deployed in redundancy mode, that is, no local node protection is configured.
Category |
Distributed scenario |
Centralized scenario |
Compact scenario |
Difference |
---|---|---|---|---|
Management node |
OMP_01 OMP_02 OMP_03 |
OMP |
OMP |
In a Distributed scenario, there are three management nodes: OMP_01, OMP_02, OMP_03. OMP_03 and DB_02 are deployed on the same node IP. |
Service node |
For example, Common_Service and DB |
NMS |
For example, Common_Service and DB |
The database users of the Centralized service nodes reside on the NMS node. |
Backup and restore function differences |
iMaster NCE-Campus uses the OMP_02 node as the backup server by default. |
iMaster NCE-Campus uses the OMP as the backup server by default. |
iMaster NCE-Campus uses the OMP as the backup server by default. |
When you configure backup parameters in the Centralized scenario, the IP address of the backup server is the IP address of the OMP node. Other information (username, password, and path) is the same as that in the Distributed scenario. |
O&M management |
iMaster NCE-Campus supports the O&M dashboard, unified monitoring, fault alarm, fault diagnosis, health check, and data collection functions. NOTE:
In the NCE Lite management and control scenario, only the fault alarm, health check, data collection and historical problem matching functions are supported. |
TaiShan: Supports the O&M dashboard, unified monitoring, fault alarm, fault diagnosis, health check, and data collection functions. 2288H V5: Supports the fault alarm, health check, data collection and historical problem matching functions. |
iMaster NCE-Campus supports the O&M dashboard, unified monitoring, fault alarm, fault diagnosis, health check, and data collection functions. |
The 2288H V5 centralized scenario supports only the fault alarm, health check, data collection and historical problem matching functions. |
Use Precautions
- Unless otherwise specified, there are no requirements for the operations described in this document on the DR system.
- Different network planes are planned for each node. If you need to remotely log in to the operating system of a virtual node using SSH, see How Do I Log In to the OS of a Node?
Starting iMaster NCE-Campus
If iMaster NCE-Campus is manually stopped or unexpectedly terminated, restart iMaster NCE-Campus after the affecting factors are eliminated.
Powering On Devices (2288H V5)
Power on devices and prepare for installing or using iMaster NCE-Campus.
Context
The power-on sequence is as follows: Turn on the power switch of the cabinet PDB, and then power on the devices from bottom to top based on the installation positions of the devices in the cabinet.
Procedure
- Check the label on the power cable, find the power switch on the cabinet PDB that matches the power cable, and turn on the switch.
- Plug in the 2288H V5 server.
- Check the indicator status of the PSU to ensure that the server is powered on properly.
Figure 3-3 shows the status indicator of the AC PSU.
Table 3-4 describes the status indicator of the AC PSU.
Table 3-4 Status descriptionName
Color
Status and Meaning
PSU running status indicator
Green
- On: The PSU is running properly.
- Blinking (0.5 Hz): The PSU is in the hibernation state and makes no output.
Red
On: The PSU is running inappropriately.
NOTE:The PSU malfunctions possibly because:
- The PSU encounters an input overvoltage or undervoltage.
- The PSU encounters an output overvoltage or undervoltage.
- The PSU makes no output.
- A fan alarm or fault occurs.
Off
Off: There is no power supply.
Powering On Devices (TaiShan)
Power on devices and prepare for installing iMaster NCE-Campus.
Context
The power-on sequence is as follows: Turn on the power switch of the cabinet PDB, and then power on the devices from bottom to top based on the installation positions of the devices in the cabinet.
Procedure
Server
- Check the label on the power cable, find the power switch on the cabinet PDB that matches the power cable, and turn on the switch.
- Check the indicator status of the PSU to ensure that the server is powered on properly.
For example: Figure 3-4 shows the status indicator of the AC PSU.
Table 3-5 describes the status indicator of the AC PSU.
Table 3-5 Status descriptionName
Color
Status and Meaning
Indicator of PSU running
Green
- Steady green: The power input and output are normal.
- Blinking green at 1 Hz:
The input is normal, and the server is standby.
The input is overvoltage or undervoltage.
- Blinking green at 4 Hz: The PSU firmware is being upgraded online.
Orange
Steady orange: The input is normal, but no power output is supplied due to overheat protection, overcurrent protection, short circuit protection, output overvoltage protection, or some component failures.
Off
No AC power is supplied.
Switch
- Verify that the power cable of the switch is properly connected.
- Check the label on the power cable, find the power switch on the cabinet PDB that matches the power cable, and turn on the switch.
- Turn on the PSUs of the switch.
- Check the switch after powering it on. Check the following items after powering on the switch:
Whether the sound of fans can be heard and air is exhausted from the air vent.
Whether the indicators on the switch, PSUs, and fan modules are in normal state. Normally, the SYS indicator on the switch and the STATUS indicators on the fan modules and PSUs are steady green.
Starting the VM
Select the appropriate VM startup method based on the actual scenario. When the VM is started, the OS of the VM starts, and the services and database on the VM node are also started.
Starting the VM (FusionCompute)
This section describes how to start a virtual machine in a FusionCompute scenario.
Prerequisites
You have logged in to FusionCompute. For details, see How Do I Log In to FusionCompute?
Procedure
- Choose Resource Pools > ManagementCluster > CNA_XX from the navigation pane.
- In the navigation pane, right-click the VM to be started (for example: NMS-Server_192.168.33.180) and choose Power > Start from the shortcut menu.
If the VRM cannot be started, rectify the fault by referring to "How to Rectify VRM VM Startup Failure" in FusionSphere Virtualization Suite Product Documentation.
The method of obtaining the document is as follows:
- For carrier users, log in to https://support.huawei.com/carrier/docTypeNewOffering?col=product&path=PBI1-21430725/PBI1-23710112/PBI1-21431666/PBI1-21462737/PBI1-8576912 and search for FusionSphere Virtualization Suite Product Documentation of the required version.
- For enterprise users, log in to https://support.huawei.com/enterprise/en/cloud-computing/fusioncompute-pid-8576912 and search for FusionSphere Virtualization Suite Product Documentation of the required version.
Starting the Management Plane
The databases, services, and processes of the management plane can only be started in CLI mode.
- To start only the management-plane databases, see Starting the Management Plane Service.
- To start only the management-plane services, see Logging In to the Management Plane.
- To start the management-plane services and databases, see Starting the Management Plane Service and Databases.
Starting the Management Plane Databases
This section describes how to start the databases of the management plane by running commands.
Prerequisites
- You have obtained the passwords for the sopuser and ossadm users of the management node.
- You have obtained the management IP address of the management node.
Procedure
- Use PuTTY to log in to the management node as the sopuser user in SSH mode. For details, see Logging In to a Server Using PuTTY.
If the management plane is deployed in cluster mode, open a new PuTTY window to start the databases on OMP_02 within three minutes after you run the startup command on OMP_01. After you have started databases on two nodes, check the startup result of the nodes. If the startup fails on a node, contact Huawei technical support.
- Run the following command to switch to the ossadm user:
> su - ossadm
Password: password for the ossadm user
- Run the following commands to start the management plane databases:
> source /opt/oss/manager/bin/engr_profile.sh
> ipmc_adm -cmd startdc -tenant manager
If information similar to the following is displayed and success is displayed for all processes, the databases of the management plane are started successfully. Otherwise, contact Huawei technical support.
============================ Starting data container processes... Starting redis process woadapterrdb-1-14 ... success ... Starting redis process serviceinspectionrdb-1-3 ... success Starting redis process privilegerdb-1-28 ... success ============================ Starting data container processes is complete.
Starting the Management Plane Service
This section describes how to start the management plane service by running commands.
Prerequisites
- The database instances of the management plane are running. For details, see Starting the Management Plane Databases.
- You have obtained the passwords for the sopuser and ossadm users of the management node.
- You have obtained the management IP address of the management node.
- If the management plane is deployed in cluster mode, that is, there are multiple management nodes, you have obtained the IP addresses of the management nodes where the active and standby OMMHA services reside. For details, see How Do I Check the Active/Standby Status of a Node?
Procedure
If the management plane is deployed in cluster mode:
- Use PuTTY to log in to the management node where the active OMMHA service resides, as the sopuser user in SSH mode. For details, see Logging In to a Server Using PuTTY.
- Run the following command to switch to the ossadm user:
> su - ossadm
Password: password for the ossadm user
- Run the following commands to start OMMHA:
> source /opt/oss/manager/bin/engr_profile.sh
> ipmc_adm -cmd startapp -tenant manager -app OMMHAService
If information similar to the following is displayed and success is displayed for all processes, OMMHA is started successfully. Otherwise, contact Huawei technical support.
Starting process ommha-0-0 ... success
- Run the following command to start services other than OMMHA on the node:
> ipmc_adm -cmd startapp -tenant manager
If information similar to the following is displayed and success is displayed for all processes, the services on the node are started successfully. Otherwise, contact Huawei technical support.
... Starting process user-0-0 ... success Starting process cron-0-0 ... success ...
- Use PuTTY to log in to the management node where the standby OMMHA service resides, as the sopuser user in SSH mode. For details, see Logging In to a Server Using PuTTY.
- Run the following command to switch to the ossadm user:
> su - ossadm
Password: password for the ossadm user
- Run the following commands to start OMMHA:
> source /opt/oss/manager/bin/engr_profile.sh
> ipmc_adm -cmd startapp -tenant manager -app OMMHAService
If information similar to the following is displayed and success is displayed for all processes, OMMHA is started successfully. Otherwise, contact Huawei technical support.
Starting process ommha-0-0 ... success
- Run the following command to start services other than OMMHA on the node:
> ipmc_adm -cmd startapp -tenant manager
If information similar to the following is displayed and success is displayed for all processes, the services on the node are started successfully. Otherwise, contact Huawei technical support.
... Starting process user-0-0 ... success Starting process cron-0-0 ... success ...
- Use PuTTY to log in to other management nodes as the sopuser user in SSH mode. For details, see Logging In to a Server Using PuTTY.
- Run the following command to switch to the ossadm user:
> su - ossadm
Password: password for the ossadm user
- Run the following commands to start all services:
> source /opt/oss/manager/bin/engr_profile.sh
> ipmc_adm -cmd startapp -tenant manager
If information similar to the following is displayed and success is displayed for all processes, all services on the node are started successfully. Otherwise, contact Huawei technical support.
... Starting process user-0-0 ... success Starting process cron-0-0 ... success ...
Starting the Management Plane Service and Databases
This section describes how to start the service and databases of the management plane by running commands.
Prerequisites
- You have obtained the passwords for the sopuser and ossadm users of the management node.
- You have obtained the management IP address of the management node.
- If the management plane is deployed in cluster mode, that is, there are multiple management nodes, you have obtained the IP addresses of the management nodes where the active and standby OMMHA services reside. For details, see How Do I Check the Active/Standby Status of a Node?
Procedure
If the management plane is deployed in cluster mode:
- Use PuTTY to log in to the management node where the active OMMHA service resides, as the sopuser user in SSH mode. For details, see Logging In to a Server Using PuTTY.
- Run the following command to switch to the ossadm user:
> su - ossadm
Password: password for the ossadm user
- Run the following commands to start OMMHA:
> source /opt/oss/manager/bin/engr_profile.sh
> ipmc_adm -cmd startapp -tenant manager -app OMMHAService
If information similar to the following is displayed and success is displayed for the process, OMMHA is started successfully. Otherwise, contact Huawei technical support.
Starting process ommha-0-0 ... success
- Run the following commands to start the service and databases of the management plane:
> ipmc_adm -cmd startmgr
If information similar to the following is displayed, the service and databases of the management plane on the node are started successfully. Otherwise, contact Huawei technical support.
... ============================ Starting management dc is complete ... ============================ Starting management processes is complete.
- Use PuTTY to log in to the management node where the standby OMMHA service resides, as the sopuser user in SSH mode. For details, see Logging In to a Server Using PuTTY.
- Run the following command to switch to the ossadm user:
> su - ossadm
Password: password for the ossadm user
- Run the following commands to start OMMHA:
> source /opt/oss/manager/bin/engr_profile.sh
> ipmc_adm -cmd startapp -tenant manager -app OMMHAService
If information similar to the following is displayed and success is displayed for the process, OMMHA is started successfully. Otherwise, contact Huawei technical support.
Starting process ommha-0-0 ... success
- Run the following commands to start the service and databases of the management plane:
> ipmc_adm -cmd startmgr
If information similar to the following is displayed, the service and databases of the management plane on the node are started successfully. Otherwise, contact Huawei technical support.
... ============================ Starting management dc is complete ... ============================ Starting management processes is complete.
- Use PuTTY to log in to other management nodes as the sopuser user in SSH mode. For details, see Logging In to a Server Using PuTTY.
- Run the following command to switch to the ossadm user:
> su - ossadm
Password: password for the ossadm user
- Run the following commands to start the service and databases of the management plane:
> source /opt/oss/manager/bin/engr_profile.sh
> ipmc_adm -cmd startmgr
If information similar to the following is displayed, the service and databases of the management plane on the node are started successfully. Otherwise, contact Huawei technical support.
... ============================ Starting management dc is complete ... ============================ Starting management processes is complete.
Logging In to the Management Plane
You can log in to the management plane using a browser.
Prerequisites
- The current PC can communicate properly with the server of the management plane and the services of the management plane are running properly.
- You have obtained the password for the current user.
Context
- The management plane provides only the admin user. This user has all permissions for all objects managed by the management plane. After you log in to the management plane for the first time, change the password as prompted to enhance security.
- Your PC's OS and browser must meet the following requirements:
Table 3-6 Configuration requirements on the clients
Configuration Item
Minimum Configuration Requirements
Memory
4 GB or above
Hard disk
Free hard disk: 8 GB or above
OS
- Windows 10 (32/64-bit)
Browser
- Recommended:
- Google Chrome 70 or later (32-bit or 64-bit)
- Firefox ESR 61.0.1 or later (32-bit or 64-bit)
- Compatible:
- Google Chrome 57 or later (32-bit or 64-bit)
- Firefox ESR 52 or later (32-bit or 64-bit)
Resolution
1366 x 768 or higher; recommended resolution: 1920 x 1080
Procedure
- Access the management plane at https://management plane IP address:18102, and click Enter.
After iMaster NCE-Campus is enabled, you can only log in to the management plane using https://management plane IP address:18102.
If iMaster NCE-Campus services are disabled, you can only use https://ER floating IP:31945 to log in to the management plane. If you want to enable iMaster NCE-Campus services, log in to the operating system as user sopuser. Switch ossadm user, and run the following commands to start the LVS service:
. /opt/oss/manager/agent/bin/engr_profile.sh ipmc_adm -cmd restartapp -app LVSService
- Enter the user name and password, and click Log In.
- When you log in to the system in local mode, enter the user name admin and its password. To ensure system security, change your password upon first login and keep the new password safe.
- When you log in to the management plane for the first time, the system prompts you to change the default password for the admin user. Perform operations as prompted. If the password for the admin user is lost, you can only restore the default password by reinstalling the management plane.
- For security purposes, do not save your password in the browser.
- If you enter the password for the admin user incorrectly five times consecutively within 10 minutes, the login IP address will be locked for 10 minutes.
Starting the FusionInsight
Prerequisites
Power on the server.
Procedure
- Use PuTTY to log in to the management node as user omm.
- Run the following command to start the OMS:
sh ${BIGDATA_HOME}/om-server/om/sbin/start-oms.sh
If the following information is displayed, the OMS is successfully started:
Warning: HA monitor has been running already. start HA successfully.
- Use PuTTY to log in to the other management node as user omm.
- Run the following command to start the OMS:
sh ${BIGDATA_HOME}/om-server/om/sbin/start-oms.sh
If the following information is displayed, the OMS is successfully started:
Warning: HA monitor has been running already. start HA successfully.
- In the address box of your browser, enter the FusionInsight Manager network address. The page can be displayed properly after the OMS automatically starts.
- On the FusionInsight Manager home page, click
next to the name of the cluster to be operated and select Start.
In the displayed dialog box, click OK to start the cluster. Operation succeeded is displayed. Click Finish. The service starts successfully.
Starting the Service Plane
On the management plane, you can start the service plane in GUI mode.
- To start only the service-plane databases, see Starting the Service Plane Services.
- To start the service-plane services, see Logging In to service plane.
- To start the services and databases on the service plane at the same time, see, see Starting Product Services and Databases.
Starting the Service Plane Databases
After iMaster NCE-Campus is installed, the databases of the service plane are running by default. If the databases are manually stopped for a special operation or become abnormal due to an exception, you need to start the databases after the operation is complete or the exception is rectified so that iMaster NCE-Campus can properly read and write data.
Prerequisites
You have logged in to the management plane. For details, see Logging In to the Management Plane.
Procedure
- On the management plane, choose Product > System Monitoring from the main menu.
- In the upper left corner of the System Monitoring page, click
to switch to the product. Perform operations according to Table 3-7.
Table 3-7 Starting databasesTask
Operation
Starting all databases of the product.
On the upper left corner of the page, click Start, choose Start DB from the drop-down menu, and perform related operations as prompted.
Starting the databases of a single node.
On the Nodes tab page, click
in the Operation column of the row that contains the node and perform operations as prompted.
- On the current page, check the database status and check whether the operation is successful.
- In a DR scenario, unfreeze the product. For details, see Freezing Products to Disable Automatic Service Startup.
Starting the Service Plane Services
After iMaster NCE-Campus is installed, the services of the service plane are running by default. If the services are manually stopped for a special operation or become abnormal due to an exception, you need to start the services after the operation is complete or the exception is rectified so that iMaster NCE-Campus can run properly.
Prerequisites
- You have logged in to the iMaster NCE-Campus management plane. For details, see Logging In to the Management Plane.
- The databases of service nodes are running. For details, see System Monitoring.
Procedure
- On the management plane, choose Product > System Monitoring from the main menu.
- In the upper left corner of the System Monitoring page, click
to switch to the product. Perform operations according to Table 3-8.
Table 3-8 Starting servicesTask
Operation Instructions
Starting all services of the product.
On the upper left corner of the page, click Start, choose Start Service from the drop-down menu, and perform related operations as prompted.
Starting services on a single node.
On the Nodes tab page, click
in the Operation column of the row that contains the node and perform operations as prompted.
Starting a single service or multiple services.
On the Services tab page, select the service to be started, click Start on the right of the page, and perform related operations as prompted.
- On the current page, check the service status and check whether the operation is successful.
- In a DR scenario, unfreeze the product. For details, see Freezing Products to Disable Automatic Service Startup.
Starting Product Services and Databases
This section describes how to start product services and databases.
Prerequisites
You have logged in to the management plane. For details, see Logging In to the Management Plane.
Procedure
- On the management plane, choose Product > System Monitoring from the main menu.
- In the upper left corner of the System Monitoring page, move the pointer to
and select the product. Perform required operations based on Table 3-9.
Table 3-9 Starting product services and databasesTask
Operation
Start all services and databases of the product.
In the upper left corner of the page, click Start, choose Start All from the drop-down menu, and perform operations as prompted.
Start services and databases on nodes.
On the Nodes tab page, select the nodes with services and databases to be started, click Start on the right of the page, and perform operations as prompted.
- After the task is successfully executed, check that Service Status and DB Status of the nodes are both Running on the Nodes tab page.
- In a DR scenario, unfreeze the product. For details, see Freezing Products to Disable Automatic Service Startup.
Logging In to service plane
This section describes how to log in to service plane using a browser.
Prerequisites
- The current PC can communicate properly with service plane and the services of service plane are running properly.
- You have obtained the user name and password for login.
Context
- The OS and browser of your PC must meet the following requirements.
Table 3-10 Configuration requirements
Software Type
Requirements
OS
Windows Server 2008, Windows 10 or later
Browser
- Google Chrome 57 or later
Resolution
Recommended resolution: 1366 x 768 (px) or higher. Optimal resolution: 1920 x 1080 (px)
- The service plane provides the default admin user as the system administrator that has permissions on all resources. To ensure security of the service plane, change the password periodically and keep the new password secure. If the admin user enters incorrect passwords five times consecutively within 10 minutes, the login IP address will be locked for 10 minutes.
- If three accounts using the same IP address are locked within a 10-minute period, this IP address will be locked for 30 minutes.
- If the login of an IP address meets the IP address lockout conditions specified on the Account Policy page, this IP address will be locked.
- If the login of a local account meets the account lockout conditions specified on the Account Policy page, this account will be locked for 30 minutes by default.
- All users can log in to the system again after their accounts are automatically unlocked. Local users can also contact security administrators to unlock their accounts to log in again.
Procedure
- Log in to the service plane at https://the service plane IP address:18008.
- On the login page, enter the username and password.
- If you have changed the initial password after login, keep the new password secure. If you forget the password of the admin user, you can reset the password only by reinstalling the service plane.
- For security purpose, do not set the browser to remember the password.
- Click Log In.
Stopping iMaster NCE-Campus
Stopping iMaster NCE-Campus is potentially service-affecting. Ensure that the current service can be stopped and then follow the procedure described in this document to perform operations in sequence.
- In the DR scenarios, please disable the Automatic Switch function first, and then stop iMaster NCE-Campus. After the DR system is started properly and no DR-related alarms are generated, then enable the Automatic Switch function.
On the management plane, choose HA > Remote High Availability System > Manage DR System from the main menu, click Modify in the upper right corner.
- Select the DR Extended Configuration tab and disable the Automatic Switch function.
- In the DR scenario, stop the secondary site first. After the secondary site is completely stopped, stop the primary site.
Stopping the Service Plane
On the management plane, you can stop the service plane in GUI mode.
Stopping the Service Plane Services
After iMaster NCE-Campus is installed, the services of the service plane are running by default. Exercise caution when manually stopping the services.
Prerequisites
- You have logged in to the management plane. For details, see Logging In to the Management Plane.
- In a DR scenario, the product has been frozen. For details, see Freezing Products to Disable Automatic Service Startup.
Procedure
- On the management plane, choose Product > System Monitoring from the main menu.
- In the upper left corner of the System Monitoring page, click
to switch to the product. Perform operations according to Table 3-11.
Table 3-11 Stopping servicesTask
Operation Instructions
Stopping all services of the product.
On the upper left corner of the page, click Stop, choose Stop Service from the drop-down menu, and perform related operations as prompted.
Stopping the services of a single node.
On the Nodes tab page, click
in the Operation column of the row that contains the node and perform operations as prompted.
Stopping a single service or multiple services.
On the Services tab page, select the service to be stopped, click Stop on the right of the page, and perform related operations as prompted.
- On the current page, check the service status and check whether the operation is successful.
Stopping the Service Plane Databases
After iMaster NCE-Campus is installed, the databases of the service plane are running by default. Exercise caution when manually stopping the databases.
Prerequisites
- You have logged in to the management plane. For details, see Logging In to the Management Plane.
- To ensure that services are running properly, you are advised to stop the services of the corresponding product or node before stopping the database. For details, see Stopping the Service Plane Services.
- In a DR scenario, the product has been frozen. For details, see Freezing Products to Disable Automatic Service Startup.
Procedure
- On the management plane, choose Product > System Monitoring from the main menu.
- In the upper left corner of the System Monitoring page, click
to switch to the product. Perform operations according to Table 3-12.
Table 3-12 Stopping servicesTask
Operation Instructions
Stopping all databases of the product.
On the upper left corner of the page, click Stop, choose Stop DB from the drop-down menu, and perform related operations as prompted.
Stopping the databases of a single node.
On the Nodes tab page, click
in the Operation column of the row that contains the node and perform operations as prompted.
- On the current page, check the service status and check whether the operation is successful.
Stopping Product Services and Databases
Stop product services and databases as required during system maintenance.
Prerequisites
You have logged in to the management plane. For details, see Logging In to the Management Plane.
In a DR scenario, the product has been frozen. For details, see Freezing Products to Disable Automatic Service Startup.
Procedure
- On the management plane, choose Product > System Monitoring from the main menu.
- In the upper left corner of the System Monitoring page, move the pointer to
and select the product. Perform required operations based on Table 3-13.
Table 3-13 Stopping product services and databasesTask
Operation
Stop all services and databases of the product.
In the upper left corner of the page, click Stop, choose Stop All from the drop-down menu, and perform operations as prompted.
Stopping services and databases on a node.
On the Nodes tab page, select the nodes to be stopped, click Stop on the right of the page, and perform operations as prompted.
- After the task is successfully executed, check that Service Status and DB Status of the nodes are both Not Running on the Nodes tab page.
Stopping the Management Plane
The Management Plane can only be stopped in CLI mode.
Stopping the Management Plane Service and Databases
This section describes how to stop the service and databases of the management plane by running commands.
Prerequisites
- You have obtained the passwords for the sopuser and ossadm users of the management node.
- You have obtained the management IP address of the management node.
Procedure
If the management plane is deployed in cluster mode:
- Use PuTTY to log in to the management node as the sopuser user in SSH mode. For details, see Logging In to a Server Using PuTTY.
Perform the following operations on all the management nodes.
- Run the following command to switch to the ossadm user:
> su - ossadm
Password: password for the ossadm user
- Run the following commands to stop the service and databases of the management plane:
> source /opt/oss/manager/bin/engr_profile.sh
> ipmc_adm -cmd stopnode
If the management node and product node are the same node, the preceding command also stops product services on the node. If you do not need to stop the product services, run the ipmc_adm -cmd stopmgr command instead. For details about how to check that the management node and product node are the same node, see How Do I Determine the Deployment Mode of Nodes?
If success is displayed for all services, all services and databases on the node is stopped successfully. Otherwise, contact Huawei technical support.
Stopping the Management Plane Service
This section describes how to stop the management plane service by running commands.
Prerequisites
- You have obtained the passwords for the sopuser and ossadm users of the management node.
- You have obtained the management IP address of the management node.
- If the management plane is deployed in cluster mode, that is, there are multiple management nodes, you have obtained the IP addresses of the management nodes where the active and standby OMMHA services reside. For details, see How Do I Check the Active/Standby Status of a Node?
Procedure
If the management plane is deployed in cluster mode:
- Use PuTTY to log in to the management node where the standby OMMHA service resides, as the sopuser user in SSH mode. For details, see Logging In to a Server Using PuTTY.
- Run the following command to switch to the ossadm user:
> su - ossadm
Password: password for the ossadm user
- Run the following commands to stop OMMHA:
> source /opt/oss/manager/bin/engr_profile.sh
> ipmc_adm -cmd stopapp -tenant manager -app OMMHAService
If information similar to the following is displayed and success is displayed for the process, OMMHA is stopped successfully. Otherwise, contact Huawei technical support.
Stopping process ommha-0-0 ... success
- Run the following command to stop services other than OMMHA on the node:
> ipmc_adm -cmd stopapp -tenant manager
If information similar to the following is displayed and success is displayed for all processes, the services on the node are stopped successfully. Otherwise, contact Huawei technical support.
... Stopping process user-0-0 ... success Stopping process cron-0-0 ... success ...
- Use PuTTY to log in to the management node where the active OMMHA service resides as the sopuser user in SSH mode. For details, see Logging In to a Server Using PuTTY.
- Run the following command to switch to the ossadm user:
> su - ossadm
Password: password for the ossadm user
- Run the following commands to stop OMMHA:
> source /opt/oss/manager/bin/engr_profile.sh
> ipmc_adm -cmd stopapp -tenant manager -app OMMHAService
If information similar to the following is displayed and success is displayed for the process, OMMHA is stopped successfully. Otherwise, contact Huawei technical support.
Stopping process ommha-0-0 ... success
- Run the following command to stop services other than OMMHA on the node:
> ipmc_adm -cmd stopapp -tenant manager
If information similar to the following is displayed and success is displayed for all processes, the services on the node are stopped successfully. Otherwise, contact Huawei technical support.
... Stopping process user-0-0 ... success Stopping process cron-0-0 ... success ...
- Use PuTTY to log in to other management nodes as the sopuser user in SSH mode. For details, see Logging In to a Server Using PuTTY.
- Run the following command to switch to the ossadm user:
> su - ossadm
Password: password for the ossadm user
- Run the following commands to stop all services:
> source /opt/oss/manager/bin/engr_profile.sh
> ipmc_adm -cmd stopapp -tenant manager
If information similar to the following is displayed and success is displayed for all processes, all services on the node are stopped successfully. Otherwise, contact Huawei technical support.
... Stopping process user-0-0 ... success Stopping process cron-0-0 ... success ...
Stopping the Management Plane Databases
This section describes how to stop the databases of the management plane by running commands.
Prerequisites
- The management plane service is in the Not Running state. For details, see Stopping the Management Plane Service.
- You have obtained the passwords for the sopuser and ossadm users of the management node.
- You have obtained the management IP address of the management node.
Procedure
- Use PuTTY to log in to the management node as the sopuser user in SSH mode. For details, see Logging In to a Server Using PuTTY.
If the management plane is deployed in cluster mode, log in to OMP_01 and OMP_02, and perform the following operations.
- Run the following command to switch to the ossadm user:
> su - ossadm
Password: password for the ossadm user
- Run the following commands to stop the databases of the management plane:
> source /opt/oss/manager/bin/engr_profile.sh
> ipmc_adm -cmd stopdc -tenant manager
If information similar to the following is displayed and success is displayed for all processes, the databases of the management plane are stopped successfully. Otherwise, contact Huawei technical support.
============================ Stopping data container processes... Stopping redis process woadapterrdb-1-14 ... success ... Stopping redis process privilegerdb-1-28 ... success Stopping redis process rnrdb-1-21 ... success ============================ Stopping data container processes is complete.
Stopping the FusionInsight
High Risk Operation
Contact Huawei technical support if you need to perform the following operations:
- Restart FusionInsight service components. This operation will interrupt controller services and cause data loss.
- Restart the FusionInsight OMS component. If this operation is performed, the FusionInsight cluster cannot be managed or maintained within a certain period of time.
- Stop FusionInsight service components. This operation is forbidden. Otherwise, controller services will be interrupted, causing data loss and feature failures.
- Stop Spark statistics collection tasks. This operation will interrupt controller services and cause data loss.
- Change the NTP, time, and time zone configurations of the FusionInsight cluster. If this operation is performed, the FusionInsight cluster may fail and service data may be lost.
Procedure
- Log in to FusionInsight Manager.
- On the FusionInsight Manager home page, click
next to the name of the cluster to be operated and select Stop.
- In the displayed window, enter the password of the current login administrator user and click OK.
- In the displayed dialog box, click OK.
After the system displays "Operation succeeded", click Finish. The cluster is successfully stopped.
- Use PuTTY to log in to the active management node as user omm.
- Run the following command to stop the active OMS:
sh ${BIGDATA_HOME}/om-server/om/sbin/stop-oms.sh
If the following information is displayed, the OMS is successfully stopped:
stop HA successfully.
The active/standby management node failover takes about three minutes.
- Use PuTTY to log in to the standby management node as user omm.
- Run the following command to stop the standby OMS:
sh ${BIGDATA_HOME}/Bigdata/om-server/om/sbin/stop-oms.sh
If the following information is displayed, the OMS is successfully stopped:
stop HA successfully.
Shutting Down the VM
The method of shutting a VM varies depending on the actual deployment scenario. Perform the corresponding operations based on the actual scenario. When the VM is shut down, the OS of the VM is also shut down.
Shutting Down the VM (FusionCompute)
This section describes how to shut down a virtual machine in a FusionCompute scenario.
Prerequisites
You have logged in to FusionCompute. For details, see How Do I Log In to FusionCompute?
Procedure
- Choose Resource Pools > ManagementCluster > CNA_XX from the navigation pane.
- In the navigation pane, right-click the VM to be shut down (for example: NMS-Server_192.168.33.180) and choose Power > Stop from the shortcut menu.
Powering Off Devices (2288H V5)
After devices are powered off, iMaster NCE-Campus stops running.
Prerequisites
The services and databases have been stopped. For details, see the related section in Stopping iMaster NCE-Campus.
Precautions
- If you run the shutdown command to shut down the OS, the server is automatically powered off. You do not need to manually power off the server.
- The power-off sequence is as follows: power off the devices from top to bottom based on the installation positions of the devices in the cabinet, and then turn off the power switch of the cabinet PDB.
Procedure
- Log in to the iBMC web UI. For details, see How Do I Use an iBMC IP Address to Remotely Log In to the Server?
- Choose .
- On the Power Control page, click Power off. When a power-off message is displayed, click Yes.
Forcible power-off may damage user programs or data that is not saved. Exercise caution when performing this operation.
Powering Off Devices (TaiShan/2288X V5)
After devices are powered off, iMaster NCE-Campus stops running.
Prerequisites
The services and databases have been stopped. For details, see the related section in Stopping iMaster NCE-Campus.
Precautions
- If you run the shutdown command to shut down the OS, the server is automatically powered off. You do not need to manually power off the server.
- The power-off sequence is as follows: power off the devices from top to bottom based on the installation positions of the devices in the cabinet, and then turn off the power switch of the cabinet PDB.
Context
- Log in to the iBMC web UI. For details, see How Do I Use an iBMC IP Address to Remotely Log In to the Server?
- View the iBMC version information.
- If the iBMC version is 3.01.0.0 or later, on the iBMC homepage, view iBMC Firmware Version in the Device Info area.
- If the iBMC version is 5.00 or later, on the iBMC homepage, view iBMC Version in the Device Info area.
- If the iBMC version is earlier than 5.00, on the iBMC homepage, view iBMC Firmware Version in the Basic Info area.
Procedure
- If the iBMC version of the delivered device is 3.01.0.0 or later, or 5.00 or later, perform the following steps.
- Log in to the iBMC web UI. For details, see How Do I Use an iBMC IP Address to Remotely Log In to the Server?
- Choose .
- Click Power Control, on the Power Control page, click Power off. When a power-off message is displayed, click OK.
Forcible power-off may damage user programs or data that is not saved. Exercise caution when performing this operation.
- If the iBMC version of the delivered device is earlier than 5.00, perform the following steps.
- Log in to the iBMC web UI. For details, see How Do I Use an iBMC IP Address to Remotely Log In to the Server?
- Choose .
- On the Power Control page, click Power off. When a power-off message is displayed, click Yes.
Forcible power-off may damage user programs or data that is not saved. Exercise caution when performing this operation.
System Monitoring
After a product is installed on the management plane, you can use this function to monitor resources such as nodes, services, databases, and processes on the management plane and the service plane. This helps you detect and resolve exceptions in a timely manner, ensuring efficient server running and normal product running.
Monitoring Products
You can monitor the indicators of each node, service, and database in a centralized manner. By predicting and analyzing the indicators of each resource, you can identify and rectify faults in a timely manner. In a DR scenario, you are advised to check the resource statuses at the primary and secondary sites in sequence every day to ensure that the system is running properly.
Prerequisites
You have logged in to the management plane. For details, see Logging In to the Management Plane.
Context
The management plane refreshes the status of nodes, services, and databases displayed on the page every 30 seconds.
Procedure
- On the management plane, choose Product > System Monitoring from the main menu.
- In the upper left corner of the System Monitoring page, move the pointer to
and select the product or NCE-OMP.
- In the upper right corner of the page, check whether any resource in Nodes, Services, Relational Databases, and Redis Databases is abnormal.
The number in red indicates the number of abnormal resources.
Figure 3-7 Monitoring products- If the number of abnormal resources is 0, all resources of the product are normal.
- If the number of abnormal resources is not 0, there are abnormal resources in the product.
Click a number of abnormal resources to view details on the tab page for the resource type.
Monitoring Nodes
You can monitor all nodes of the management plane and the service plane to identify and rectify faults in a timely manner, which ensures that the node servers are running efficiently.
Prerequisites
You have logged in to the management plane. For details, see Logging In to the Management Plane.
Precautions
For a service deployed in active/standby mode, the service runs only on the active node.
Context
The management plane periodically monitors resources of the system. Table 3-14 lists the monitoring intervals. The status of monitored objects on the page is refreshed every 30 seconds.
Procedure
- On the management plane, choose Product > System Monitoring from the main menu.
- In the upper left corner of the System Monitoring page, move the pointer to
and select the product or NCE-OMP.
- On the Nodes tab page, view the usage of the CPU, virtual memory, physical memory, and disk partition. Refer to Table 3-15 to check the resource statuses. Alternatively, on the Nodes tab page, click a node name. On the page for node details, view the disk partition usage and the resource statuses of processes on the node.Figure 3-8 Monitoring nodesTable 3-15 Node resource status
Object
State
Description
Measures
Connection Status
Normal
The connection between the node and the management plane is normal.
N/A
Disconnected
The connection between the node and the management plane is abnormal.
- Click the abnormal node and view the space usage and process status of each disk in the node on the details page.
- An alarm is reported if the node status is abnormal. Rectify the fault based on the alarm information.
Database Status
--
The node does not have a database.
N/A
Partially Running
Some database instances on the node are not running.
An alarm is reported if the database status is abnormal. Rectify the fault based on the alarm information.
Not Running
All database instances on the node are stopped.
Some maintenance operations require that the database is not running. Determine whether the database is abnormal based on site requirements.
Running
The database instances on the node are running properly.
N/A
Unknown
The system cannot detect the database instance status on the node.
Collect related information and contact Huawei technical support.
Starting
All database instances on the node are being started.
If the service is in this state for a long time, contact Huawei technical support.
Stopping
All database instances on the node are being stopped.
Service Status
Uninstalled
The node service is not installed.
Collect related information and contact Huawei technical support.
--
The node does not provide services.
N/A
Partially Running
Some processes on the node are not running.
On the Nodes tab page, click the node name. On the node details page, view the processes that are not running and determine whether the processes are abnormal based on site requirements.
Not Running
All services on the node are stopped.
Some maintenance operations require that the service is not running. Determine whether the service is abnormal based on site requirements.
Running
All services on the node are running.
N/A
Unknown
The system cannot detect the service status on the node.
An alarm is reported if the service status is abnormal. Rectify the fault based on the alarm information.
Faulty
Some services on the node are faulty.
Starting
All services on the node are being started.
The startup or stopping duration of a service does not exceed 1 minute. If the service is in this state for a long time, contact Huawei technical support.
Stopping
All services on the node are being stopped.
Monitoring Services
You can monitor all services of the management plane and the service plane to identify and rectify faults in a timely manner, which ensures that the services are running properly.
Prerequisites
You have logged in to the management plane. For details, see Logging In to the Management Plane.
Context
The management plane monitors services of the system every 30 seconds. The status displayed on the page is refreshed every 30 seconds.
Procedure
- On the management plane, choose Product > System Monitoring from the main menu.
- In the upper left corner of the System Monitoring page, move the pointer to
and select the product or NCE-OMP.
- Click the Services tab page and refer to Table 3-16 to check the service status. Alternatively, click the desired service instance name. On the page for service details, view the status of processes and other resources of the service.Figure 3-9 Monitoring servicesTable 3-16 Service status
State
Description
Measures
Partially Running
Some processes of the service are not running.
Click the name of the corresponding service instance. On the page for service details, view the processes that are not running, and determine whether the processes are abnormal based on site requirements.
Not Running
All processes of the service are stopped.
Some maintenance operations require that the service is not running. Therefore, you need to determine whether the service is abnormal based on site requirements.
Running
All processes of the service are running.
N/A
Unknown
The system cannot detect the status of processes of the service.
An alarm is reported if the service status is abnormal. Rectify the fault based on the alarm information.
Faulty
Some processes of the service are faulty.
An alarm is reported if the service status is abnormal. Rectify the fault based on the alarm information.
Starting
All processes of the service are being started.
The startup or stopping duration of a service does not exceed 1 minute. If the service is in this state for a long time, contact Huawei technical support.
Stopping
All processes of the service are being stopped.
Monitoring Processes
You can monitor the processes of the management plane and the service plane to identify and rectify faults in a timely manner, which ensures that the processes are running properly.
Prerequisites
You have logged in to the management plane. For details, see Logging In to the Management Plane.
Context
The management plane monitors processes of the system every 30 seconds. The status displayed on the page is refreshed every 30 seconds.
Procedure
- On the management plane, choose Product > System Monitoring from the main menu.
- In the upper left corner of the System Monitoring page, move the pointer to
and select the product or NCE-OMP.
- On the Processes tab page, check the process status based on Table 3-17.Figure 3-10 Monitoring processesTable 3-17 Process status
State
Description
Measures
Not Running
The process is not running.
Some maintenance operations require that the processes are not running. Determine whether a process is abnormal based on site requirements.
NOTE:For a process deployed in active/standby mode, the process runs only on the active node. If Service Status of the node whose process is not running is Normal, the process is normal.
Running
The process is running.
N/A
Unknown
The system cannot detect the process status.
Collect related information and contact Huawei technical support.
Faulty
The process is faulty.
An alarm is reported if the process is faulty. Rectify the fault based on the alarm information.
Starting
The process is being started.
If the service is in this state for a long time, contact Huawei technical support.
Stopping
The process is being stopped.
Monitoring Databases
You can monitor the relational databases and Redis databases of the management plane and the service plane to identify and rectify faults in a timely manner, which ensures that the databases are running properly.
Prerequisites
You have logged in to the management plane. For details, see Logging In to the Management Plane.
Context
- The management plane monitors databases of the system every 60 seconds. The status displayed on the page is refreshed every 30 seconds.
- A relational database is created on the basis of a relational model for storing persistent data, such as the GaussDB 100 database. The management plane monitors the usage of the relational databases. This helps you to identify and rectify the insufficiency of the database space in a timely manner.
- A Redis database is a high-performance key-value database that stores cached status-related data. The management plane monitors the memory usage of the Redis databases so that you can identify and rectify insufficient memory space in a timely manner.
Procedure
- On the management plane, choose Product > System Monitoring from the main menu.
- In the upper left corner of the System Monitoring page, move the pointer to
and select the product or NCE-OMP.
- Click the Relational Databases or Redis Databases tab page. Refer to Table 3-18 and Table 3-19 to check the database status.Figure 3-11 Monitoring relational databasesFigure 3-12 Monitoring Redis databasesTable 3-18 Database status table
State
Description
Measures
Not Running
The database is not running.
An alarm is reported if the database status is abnormal. Rectify the fault based on the alarm information.
Unknown
The system cannot detect the database status.
Running
The database on the node is running properly.
N/A
Table 3-19 Database instance replication statusState
Description
Measures
Normal
The replication between the master and slave database instances is normal.
N/A
Abnormal
The replication between the master and slave database instances is abnormal.
An alarm is reported if the database instance replication status is abnormal. Rectify the fault based on the alarm information.
Full Synchronizing
The slave database instance is rebuilt, and all data of the master database instance is being forcibly synchronized to the slave database instance.
N/A
--
The database instance is a single instance and does not have a master/slave relationship.
N/A
Configuring Process Startup Types
If a function is not required, disable the startup of corresponding product processes. This saves system resources. When you need to use the disabled processes, you can configure the processes to be automatically or manually started.
Prerequisites
You have logged in to the management plane. For details, see Logging In to the Management Plane.
Precautions
All processes of a product are displayed on the Processes tab page. Disabling a process may cause corresponding functions to become abnormal. Select processes to be disabled based on site requirements.
In a DR scenario, configure the process startup type only at the primary site, and the configuration will be synchronized to the secondary site.
Procedure
- On the management plane, choose Product > System Monitoring from the main menu.
- In the upper left corner of the System Monitoring page, move the pointer to
and select the product.
- On the Processes tab page, select the processes and click a startup type based on Table 3-20. Perform operations as prompted.Table 3-20 Operation description
Startup Type
Description
Manual
The process does not start with the OS and can only be started manually.
Automatic
The process automatically starts with the OS.
Disabled
The process cannot be started.
If the process startup type is changed to Manual or Automatic, the process status does not change. If the startup type is changed to Disabled, the process is stopped.
- (Optional) To change the running status of processes, select the processes to be started or stopped, and click Start or Stop as required above the process list. Perform operations as prompted.
Modifying Monitoring Thresholds
The management plane can report alarms if a resource is abnormal. The system has default monitoring thresholds for nodes, relational databases, and Redis databases. When the usage of a monitored object reaches the thresholds, the management plane reports corresponding alarms, and the alarm notifications are received on the service plane. If the default monitoring threshold does not match the actual resource usage and the alarm is frequently reported, you can modify the monitoring threshold based on site requirements.
Prerequisites
You have logged in to the management plane. For details, see Logging In to the Management Plane.
Context
Table 3-21 lists the default thresholds and resource objects that can be configured.
X indicates the value of Alarm Generation Threshold, Y indicates the value of Alarm Clearance Threshold, and N indicates the value of Number of Times Overloaded. Y must be less than X.
Item |
Object |
Conditions for Reporting Alarms |
Condition for Clearing Alarms |
---|---|---|---|
Node |
CPU |
The CPU usage sampled every 15 seconds in N consecutive times is greater than or equal to X%. Default value: N = 40, X = 85 |
Any sampled CPU usage is less than X%. Default value: X = 85 |
Physical memory |
In a detection period (15 seconds), the physical memory usage is greater than or equal to X%. Default value: X = 95 |
The physical memory usage is less than or equal to Y%. Default value: Y = 90 |
|
Virtual memory |
In a detection period (15 seconds), the virtual memory usage is greater than or equal to X%. Default value: X = 85 |
The virtual memory usage is less than or equal to Y%. Default value: Y = 80 |
|
Disk partition |
In a detection period (15 seconds), the disk partition usage is greater than or equal to X%. Default value: X = 80 |
The disk partition usage is less than or equal to Y%. Default value: Y = 75 |
|
Relational database |
Database tablespace usage |
In a detection period (180 seconds), the database tablespace usage is greater than or equal to X%. Default value: X = 95 |
The database tablespace usage is less than or equal to Y%. Default value: Y = 85 |
Redis database |
Memory |
In a detection period (180 seconds), the memory usage is greater than or equal to X%. Default value: X = 80 |
The memory usage is less than or equal to Y%. Default value: Y = 70 |
Precautions
In a DR scenario, the node and database monitoring thresholds of nodes of a product or NCE-OMP need to be modified at both the primary and secondary sites, and the monitoring thresholds must be consistent between the two sites. If they are inconsistent, alarms may be generated only at one site.
Procedure
- On the management plane, choose Product > System Monitoring from the main menu.
- In the upper left corner of the System Monitoring page, move the pointer to
and select the product or NCE-OMP.
- Click the tab where the resource object resides.
- Click
on the right of the page and set parameters.
Backup and Restore
Backup and restore can improve system reliability by reducing the data loss of the management plane or the service plane caused by misoperations or faults.
Overview
To improve the reliability of the management plane and the service plane, application, and product data. If the management plane or the service plane is faulty, you can restore the corresponding plane to the state before it was backed up by using the backup data.
Concepts
Table 3-22 lists the concepts related to backup and restore to give you a better understanding of these functions.
Concept |
Description |
---|---|
Application |
Data that does not change in real time while the system is running, including product applications (such as files in the /opt/oss directory) and database applications (such as files in the /opt/redis directory and the /opt/zenith directory). |
Product data |
Data and configuration files that change in real time while the system is running, such as files in the /opt/oss/envs directory. |
Scheduled backup |
By default, iMaster NCE-Campus is preset with two scheduled backup tasks:
If the preset scheduled backup tasks cannot meet your requirements, you can create a one-time scheduled backup task or a periodic scheduled backup task as needed. |
Manual backup |
Manually back up data of a time point to the backup server. |
Principles
As shown in Figure 3-13, after the backup server is configured, you can back up the data of products and applications. The backup data is transferred to the backup server through a transfer protocol; for example, Secure File Transfer Protocol (SFTP). Different types of data can be backed up independently of one another.
If the data of the management plane or the service plane is missing or damaged, you can restore the corresponding data. Before restoring data of a certain type, ensure that its lower-layer data is normal. For example, before restoring the product data, ensure that the application data are normal. Figure 3-14 shows the dependency.
Advantages
- Flexibility and ease of use
Function
Description
Multi-dimensional backup
Data backup of instances, nodes, and products is supported.
Multi-dimensional restoration
Data restoration of instances, nodes, and products is supported.
Scheduled backup
Periodically scheduled data backup of products and the management plane is supported. Manual operations are not required.
Easy operations
On the web client of the management plane, the backup and restore operations are easy, with clear guidance provided.
- Security and reliability
Function
Description
Secure transfer of backup data
Data is transferred over SFTP, and security measures are provided to ensure data security; for example, digital signatures are added to the backup data.
Real-time task status display
The details of backup and restore tasks is displayed in real time in the task list.
- High performance
Function
Description
Concurrent backup
Backup tasks of different products can be executed at the same time.
Time-specific restoration
Data can be restored to a specific time point.
Function Description
As shown in Figure 3-14, application, and data of a product and the management plane can be backed up and restored. Before restoring upper-layer data, ensure that its lower-layer data is normal. For example, before restoring the product data, ensure that the application data is normal. When you restore the database application or product application, the two types of restoration are independent of each other.
Configuration Requirements
Item |
Description |
---|---|
Database type |
The GaussDB 100 and Redis databases are supported. |
Disk space |
Calculate the backup space based on site requirements of each product and the management plane for different network scales. Total backup space = Sum of backup space for all products + Backup space for the management plane Backup space for a product = Backup space for the product data + Backup space for the product application + Backup space for the database applications |
Data transfer protocol |
The data can be transferred over Secure File Transfer Protocol (SFTP). |
Backup and Restoration Scenarios and Policies
Scenario |
Operation |
Automatic Backup |
Manual Backup |
---|---|---|---|
Multi-node fault |
Reinstall the system and restore data. |
Periodically back up product data. You are advised to configure a periodic backup task to back up product data daily.
|
- |
Upgrade |
Upgrade both the service plane and management plane. |
For details about data backup and restoration requirements and operations, see the upgrade guide of the desired version. |
- |
Capacity expansion |
Expand the node capacity of the cluster. |
For details about data backup and restoration requirements and operations, see iMaster NCE-Campus Capacity Expansion Guide. |
- |
Maintenance |
Changing passwords for database users (the OMP node) |
Application and data of the management plane |
None |
Changing passwords for database users (product node) |
|
Manually create backup tasks by following instructions provided in: |
|
Updating CA certificates online |
Application and data of the management plane |
Manually create backup tasks by following instructions provided in: |
|
Uploading and updating CA certificates (non-DR system) |
Application and data of the management plane |
Manually create backup tasks by following instructions provided in: |
|
Updating the root key and working keys (non-DR system) |
|
Manually create backup tasks by following instructions provided in: |
Backup Server Requirements
After the installation is complete, iMaster NCE-Campus uses the local node as the backup server by default. You can also configure a third-party backup server.
Local Default Backup Server
Distributed:
- The OMP_02 node meets the requirements of the management plane for the backup server by default. For details about the backup paths, see Table 3-26 and Table 3-27. To configure a third-party backup server, you need to make preparations by referring to Table 3-25. When the OMP_02 node or a third-party server is used as the backup server, you are advised to clean up the disk space periodically.
- When the OMP_02 node is damaged, the backup data stored on the OMP_02 node may also be damaged. Regularly move the backup data to a third-party storage medium to ensure data availability when the hard disk is damaged.
Third-party Server
If a third-party server is used as the backup server:
- You are advised to periodically clear the product data, and application data. Retain at least one latest backup file for each data type.
- NAT cannot be configured between the third-party backup server and the management plane.
Backup server requirements
Item |
Requirement |
---|---|
SFTP |
|
User Name and Password |
|
Connectivity |
Data can be transferred between the backup server and all nodes through SFTP. |
Back up server IP address. |
IP address instead of the floating IP address for transferring data between the backup server and all nodes through SFTP. |
Disk space |
500 GB or above NOTE:
You are advised to periodically clear the disk space. Otherwise, the backup and restoration may take a long time. |
Bandwidth |
Recommended: 1.5 Gbit/s Minimum: 100 Mbit/s |
Configuring Backup Parameters
Before performing backup, you must set backup parameters, such as adding a backup server for storing backup files and setting a storage policy for backup files.
Prerequisites
- You have logged in to the management plane. For details, see Logging In to the Management Plane.
- You have planned and configured the backup server by following Backup Server Requirements. In addition, you have obtained the IP address of the backup server, the user name and password for transferring backup files using SFTP, and the storage path for backup files.
In the distributed scenario, If no third-party backup server meets the requirements (for example, the third-party backup server cannot communicate with all iMaster NCE-Campus nodes, and data cannot be automatically backed up to the third-party backup server), you are advised to manually copy data from the local backup node to a third-party storage medium periodically to prevent data loss due to hard disk damage.
Context
- For details about the backup paths, see Table 3-26 and Table 3-27. Do not delete the backup files in the paths. Otherwise, no backup files are available for restoration.
Table 3-26 Backup path of the OMP node
Data
Path
Example Path on the Local Default Backup Server
Applications and Data of the Management Plane
/root directory of the backup server user/path specified in the backup parameters/management/management/timestamp/node name
- Distributed: /opt/backup/ftpboot/backup/management/management/20191119070005090/Management1
- Compact: /opt/backup/ftpboot/backup/management/management/20191129070010333/Management
Table 3-27 Backup file path of the service nodesData
Path
Example Path on the Local Default Backup Server
Database application
/root directory of the backup server user/path specified in the backup parameters/product name/static/timestamp/node name/DB
- Distributed: /opt/backup/ftpboot/backup/NCECAMPUS/static/20191118230000546/DB_01/DB
- Compact: /opt/backup/ftpboot/backup/NCECAMPUS/static/20191202230001138/Common_Service/DB
Product application
/root directory of the backup server user/path specified in the backup parameters/product name/static/timestamp/node name/APP
- Distributed: /opt/backup/ftpboot/backup/NCECAMPUS/static/20191118232333974/NMS_01/APP
- Compact: /opt/backup/ftpboot/backup/NCECAMPUS/static/20191130230005926/Common_Service/APP
Product data
/root directory of the backup server user/path specified in the backup parameters/product name/dynamic
- Distributed: /opt/backup/ftpboot/backup/NCECAMPUS/dynamic/20191120220006667
- Compact: /opt/backup/ftpboot/backup/NCECAMPUS/dynamic/20191203112551567
- A maximum of 10 backup servers can be added to the management plane, and the same backup files are saved on each backup server. When a backup server is abnormal, other backup servers can still provide backup files for restoration.
- When a backup server is added, the management plane authenticates the SFTP fingerprint between the backup server and the node to be backed up. If the authentication is not performed, iMaster NCE-Campus automatically configures fingerprint authentication for the backup server.
- Modifying or deleting a backup server does not erase data on it.
Precautions
- When the backup server transfers data over SFTP, the system performs SFTP fingerprint authentication when you verify the connectivity to the server. You can verify the SFTP fingerprint authentication information to improve the data security of the management plane. For details about how to obtain SFTP fingerprint authentication information, see How Do I Query SFTP Fingerprint Authentication Information?
- If the parameters of the backup server are changed, update the parameters in the Backup Server area and back up the application and data of the management plane again. Otherwise, the management plane cannot be backed up or restored. For details, see Manually Backing Up the Application and Data of the Management Plane.
Procedure
- On the management plane, choose Backup and Restore > Configuration > Configure Backup Parameters from the main menu.
- Set backup parameters.
- Configure the backup server parameters.
- In the Backup Server area, set parameters as follows Table 3-28.
Table 3-28 Backup server parameters
Parameter
Value
Transmission Mode
SFTP.
Backup Server IP Address
- OMP_02 node: The Inter-node Communication IP Address.
- Third-party backup server: IP address instead of the floating IP address for transferring data between the backup server and all nodes through SFTP.
Port
- OMP_02 node: Port number of the backup server. The default value is 22, and the value range is 1 to 65535.
- Third-party backup server: Port number of the server, and the value range is 1 to 65535.
Username
- OMP_02 node: ftpuser.
- Third-party backup server: user with the SFTP permission.
Password
- OMP_02 node: password for the ftpuser user.
- Third-party backup server: password for the user with the SFTP permission.
Backup Path
- OMP_02 node: /opt/backup/ftpboot/backup.
After the initial installation and commissioning, the data that is manually backed up for the first time is stored in /opt/backup/ftpboot/initdatabackup.
- Third-party backup server: SFTP shared directory for storing backup files.
NOTE:The backup path must be a relative path under the default SFTP user directory. For example, if the OMP_02 node is used as the backup server by default, you only need to enter backup. The backup files are stored in the /opt/backup/ftpboot/backup directory on the OMP_02 node. Before configuring backup parameters, create a backup path on the backup server. For details, see How Do I Create a Backup Path for a Backup Server?
- Click
.
- If the management plane detects that there are nodes where SFTP fingerprint authentication has not been performed, perform the operations as required:
- (Recommended) If you click OK in the Warning dialog box, the management plane automatically performs the SFTP fingerprint authentication on the node.
- If you click Cancel in the Warning dialog box, log in to each node where the SFTP fingerprint authentication has not been performed and manually perform authentication. For details, see How Do I Manually Authenticate the SFTP Fingerprint?
- In the Backup Server area, set parameters as follows Table 3-28.
Perform the following operations to change the default storage thresholds for the data backup files of products as required.
On the Storage Strategy for the Product Data Backup Files area, set thresholds based on Table 3-29.
Table 3-29 Threshold settings for backup filesTask
Procedure
Modify the storage thresholds for products.
Modify the thresholds one by one or in batches.
- Select the products for which you want to configure the backup file storage threshold.
- Click Modify Product Threshold.
- In the dialog box that is displayed, configure the number of backup files and click Save.
When the default storage threshold for the management plane backup files cannot meet actual requirements, perform the following operations to modify the storage strategy of the NCE-OMP backup files:
- In the Storage Strategy for the NCE-OMP Backup Files area, configure the number of back file copies.
- Click Save.
- Configure the backup server parameters.
Follow-up Procedure
- If parameters on the backup server have changed, set the backup parameters again.
- After the password for the backup server is changed, back up the management plane again. For details, see Manually Backing Up the Application and Data of the Management Plane.
Backing Up Products
After backup parameters are configured, periodically back up product data to ensure the reliability. If the product is abnormal due to misoperations or other situations, you can use the backup data to restore it.
Backing Up Product Data
Before a configuration file that affects product functions is modified, and before and after the product is upgraded, you can use the management plane to manually back up the product data. In this way, the product can be restored by using the backup data if the product is abnormal due to misoperations or other reasons.
Prerequisites
- You have logged in to the management plane. For details, see Logging In to the Management Plane.
- The backup parameters have been configured. For details, see Configuring Backup Parameters.
- The database instances on all nodes are in the Running state. For details, see Monitoring Databases.
- The deployment status of the product is Installed. For details, see How Do I Check the Deployment Status of a Product?
Context
The backup data may contain personal information (such as names, phone numbers, and email addresses), and all usernames and passwords. You must follow the applicable laws in your country or the privacy policies of your company and take effective measures to fully protect customer privacy.
Precautions
- To ensure product data accuracy during backup, do not manually back up data during service provisioning.
- If the status of the scheduled task for backing up the product data is partially successful, the data that has been successfully backed up by the task can be restored on the management plane. The backup file is incomplete, so the product may fail to start or run properly after the restoration. Exercise caution.
- If you need to execute a task with higher priority, you can forcibly stop the tasks for backing up product data. For details, see Forcibly Stopping Tasks for Backing Up Product Data.
- In a DR scenario, tasks for backing up product data can be executed only at the active site. If a task related to DR operations other than freezing products is in progress, perform operations in this section after the task is complete. Otherwise, the DR task or the backup task may fail.
Procedure
- On the management plane, choose from the main menu.
- On the Back Up Product Data page, refer to Table 3-30 to perform the operations.
Backing Up Product on a Scheduled Basis
When a product is running properly, you can create scheduled tasks for backing up its, application, and data, so that the backup data is periodically saved to the backup server. If a product becomes abnormal due to misoperations or other situations, you can restore the product to the state at a certain time point.
Prerequisites
- You have logged in to the management plane. For details, see Logging In to the Management Plane.
- The backup parameters have been configured. For details on how to configure the parameters, see Configuring Backup Parameters.
- When backing up the product data, ensure that the database instances on all nodes are in the Running state. For details, see Monitoring Databases.
- When backing up the product data, ensure that the product deployment status of the product is Installed. For details, see How Do I Check the Deployment Status of a Product?
Context
- When the product information is imported on the management plane for the first time, the management plane creates a default scheduled task for backing up the product data. After the product installation is complete and the backup parameters are configured, the task automatically backs up the product data. When the number of products, nodes or service instances is changed due to upgrade or capacity expansion, the modified product data is automatically backed up by the scheduled backup task after the upgrade or capacity expansion is complete.
The backup data may contain personal information (such as names, mobile numbers, and email addresses), and all usernames and passwords. You must follow the applicable laws in your country or the privacy policies of your company and take effective measures to fully protect customer privacy.
- The backup files of the application, and product data are stored on the backup server. For details about the backup file path, see Table 3-31. Do not delete the backup files in this directory. Otherwise, the product cannot be restored.
Table 3-31 Backup file path
Data
Path
Database application
/root directory of the backup server user/path specified in the backup parameters/product name/static/timestamp/node name/DB
Product application
/root directory of the backup server user/path specified in the backup parameters/product name/static/timestamp/node name/APP
Product data
/root directory of the backup server user/path specified in the backup parameters/product name/dynamic
Precautions
- You are advised to stagger the start time of each scheduled backup task, for example, at an interval of two hours. For details about the execution sequence of backup tasks, see Backup and Restoration Scenarios and Policies.
- To prevent high resource usage caused by conflicts between scheduled backup tasks and NE data synchronization, stagger the execution period of the scheduled backup tasks and that of NE data synchronization or service provisioning.
- For periodic scheduled backup, you are advised to set the backup interval to 24 hours. A long interval is not recommended, because data backup at long intervals may result in data loss during data restoration. Specify a backup interval as required.
- In a DR scenario, scheduled tasks for backing up product data are executed only at the active site, and scheduled tasks for backing up product applications can be executed at both the active and standby sites. Do not perform DR operations, excluding freezing products, when a scheduled backup task is in progress. Otherwise, the scheduled backup task or the DR operation may fail.
Procedure
- On the management plane, choose from the main menu.
- On the Configure Scheduled Backup Task page, Click Create.
- Perform operations as required to manually create a scheduled backup task.
- Select Product Data and create a scheduled task for backing up the product data.
- Select Product Application and create a scheduled task for backing up the product application.
- Select Database Application and create a scheduled task for backing up the database application.
- In the Backup Object area, select the backup object. Refer to Table 3-32 to perform the operations.Table 3-32 Backing up product on a scheduled basis
Task
Operation
Perform one-time backup at a specified time point.
Select One-Time and perform operations as prompted.
Perform periodic backup within a specified period.
Select Periodic and perform operations as prompted.
After a scheduled backup task is created successfully, you can enable or disable the task in the Enabled State column.
Backing Up Product Applications
After the initial installation of the product application or before and after the upgrade of the product application, you can back up the product application in real time on the management plane. If a product application runs abnormally or fails to be upgraded and needs to be rolled back to the previous state, you can use the backup file to restore the product application to the state before the backup.
Prerequisites
- You have logged in to the management plane. For details, see Logging In to the Management Plane.
- The backup parameters have been configured. For details on how to configure the parameters, see Configuring Backup Parameters.
Context
The backup files of the product applications are stored in the /root directory of the backup server user/path specified in the backup parameters/product name/static/timestamp/node name/APP directory on the backup server. Do not delete the backup files in this directory. Otherwise, no backup file can be used for restoring product applications, causing restoration failures.
Precautions
In a DR scenario, tasks for backing up product applications can be executed at both the active and standby sites.
Procedure
- On the management plane, choose from the main menu.
- On the Back Up Product Application page, perform operations as prompted.
Backing Up Database Applications
After the initial installation of the database application or before and after the upgrade of the database application, you can back up the database application, that is, the database data that does not change in real time when the system runs, on the management plane. If the database is not running properly because a file in the database application is damaged or lost, but the OS of the node is running properly, you can use the backup file to restore the database application to the state before the backup.
Prerequisites
- You have logged in to the management plane. For details, see Logging In to the Management Plane.
- The backup parameters have been configured. For details on how to configure the parameters, see Logging In to the Management Plane.
Context
The backup files of the database applications are stored in the /root directory of the backup server/path specified in the backup parameters/product name/static/timestamp/node name/DB directory on the backup server. Do not delete the backup files in this directory. Otherwise, no backup file can be used for restoring database applications, causing restoration failures.
Precautions
If the management node and the product node are the same node and use the same database software, the database applications cannot be backed up using this function. In this case, back up the database applications by backing up the management plane. For details, see Manually Backing Up the Application and Data of the Management Plane. Refer to How Do I Check Whether Management Nodes and Product Nodes Use the Same Database Software? to determine whether you can back up database applications by backing up the management plane.
In a DR scenario, tasks for backing up database applications can be executed at both the active and standby sites.
Procedure
- On the management plane, choose from the main menu.
- On the Back Up Database Application page, perform operations as prompted.
Backing Up the Management Plane
Periodically back up the application and data of the management plane on the management plane. If the management plane is abnormal due to exceptions or manual operations, you can use the backup files to restore the OS, application, or data of the management plane.
Manually Backing Up the Application and Data of the Management Plane
After initial installation of the management plane, before and after an upgrade or patch installation, or before major service adjustment, you are advised to manually back up the management plane. If the service is abnormal or the database is abnormal, you can use the backup package to restore the management plane to the state before the backup.
Prerequisites
- You have logged in to the management plane. For details, see Logging In to the Management Plane.
- The backup parameters have been configured. For details on how to configure the parameters, see Configuring Backup Parameters.
- The database and service instances on the management node are in the Running state. For details, see Monitoring Databases.
Context
The backup files of the management plane are stored in the /root directory of the backup server/path specified in the backup parameters/management/management/timestamp/node name directory on the backup server. Do not delete the backup files in this directory. Otherwise, no backup file can be used for restoring the management plane, causing restoration failures.
Precautions
Ensure that no database-related operations are being performed, such as modifying IP addresses and routes. Otherwise, the backup data will be incomplete.
In a DR scenario, tasks for backing up the management plane can be executed at both the active and standby sites. If a task related to DR operations other than freezing products is in progress, perform operations in this section after the task is complete. Otherwise, the backup task or the DR task may fail.
Procedure
- On the management plane, choose from the main menu.
- On this page, perform operations as prompted.
Backing Up the Application and Data of the Management Plane on a Scheduled Basis
In routine maintenance, you can create scheduled backup tasks for backing up the application and data of the management plane on a scheduled basis. If the management plane is unreachable due to service or database exceptions, you can use the backup package to restore it. In this manner, no manual operation is required, reducing maintenance costs.
Prerequisites
- You have logged in to the management plane. For details, see Logging In to the Management Plane.
- The backup parameters have been configured. For details on how to configure the parameters, see Configuring Backup Parameters.
- The database and service instances on the management node are in the Running state. For details, see Monitoring Databases.
Context
The backup files of the management plane are stored in the /root directory of the backup server/path specified in the backup parameters/management/management/timestamp/node name directory on the backup server. Do not delete the backup files in this directory. Otherwise, no backup file can be used for restoring the management plane, causing restoration failures.
Precautions
- To prevent high resource usage caused by conflicts between scheduled backup tasks and NE data synchronization, stagger the execution period of the scheduled backup tasks and that of NE data synchronization or service provisioning.
- You are advised to stagger the start time of each scheduled backup task, for example, at an interval of two hours. For details about the execution sequence of backup tasks, see Backup and Restoration Scenarios and Policies.
- For periodic scheduled backup, you are advised to set the backup interval to 24 hours. A long interval is not recommended, because data backup at long intervals may result in data loss during data restoration. Specify a backup interval as required.
- In a DR scenario, scheduled tasks for backing up the management plane can be executed at both the active and standby sites. Do not perform DR operations, excluding freezing products, when a scheduled backup task is in progress. Otherwise, the scheduled backup task or the DR operation may fail.
Procedure
- On the management plane, choose from the main menu.
- On the Configure Scheduled Backup Task page, Click Create.
- Select the option corresponding to the management plane and perform the operations according to Table 3-33.Table 3-33 Scheduled management plane backup
Task
Operation
Perform one-time backup at a specified time point.
Select One-Time and perform operations as prompted.
Perform periodic backup within a specified period.
Select Periodic and perform operations as prompted.
After a scheduled backup task is created successfully, you can enable or disable the task in the Enabled State column.
Restoring Products
If a product is abnormal due to misoperations or other reasons, you can use the backup data to restore database application, product application, and product data.
- When data is restored in the new environment, the southbound and northbound IP addresses must be the same in the original and new environments. Otherwise, service exceptions may occur.
- If the networks in the original and new environments are isolated from each other, configure the southbound and northbound IP addresses to be the same in the two environments.
- If the networks in the original and new environments are not isolated from each other, change the southbound and northbound IP addresses of the new environment to be the same as those of the original environment after data is restored in the new environment.
- Data restoration is not supported between sets of iMaster NCE-Campus running different versions.
- If the backup data is used for restoration, the service data generated after the last backup will be lost.
Restoring Database Applications
If the database is abnormal due to damages but the OS of the product node is still running properly, restore the database application.
Prerequisites
- You have logged in to the management plane. For details, see Logging In to the Management Plane.
- There are backup files of the database application to be restored.
- If the /opt/backup/backuptmp directory does not exist on the service nodes to be restored, perform the following operations to create the backuptmp directory and set an owner group for it.
- Use PuTTY to log in to the service nodes to be restored in SSH mode as the sopuser user.
- Run the following command to switch to the root user:
> su - root
Password: password for the root user
- Run the following commands to create the directory for the update and configure the certificate group:
# mkdir -p /opt/backup/backuptmp 2>/dev/null
# chown root:ossgroup /opt/backup
# chown ossadm:ossgroup /opt/backup/backuptmp
# chmod 750 /opt/backup/backuptmp
- In a DR scenario, the DR relationship between the primary and secondary sites has been deleted. For details, see Deleting the DR System.
Precautions
- Before the database application is restored, the management plane automatically stops the databases and iMaster NCE-Campus services using the database.
- If the backup files on the backup server are manually deleted, or the files are not displayed in the Backup File column, click Synchronize to synchronize the information about the backup files to the management plane. Ensure that the parameters, including Backup Server IP Address, Username, Password, and Backup Path, of the backup server on the Configure Backup Parameters page on the management plane are consistent with those of the backup server where the desired backup files reside.
- The system automatically verifies the integrity of backup files. Only successfully verified files can be used for restoration.
- In a DR scenario, if a task related to DR operations other than freezing products is in progress, perform operations in this section after the task is complete. Otherwise, the DR task or the restoration task may fail.
Procedure
- On the management plane, choose from the main menu.
- On the Restore Database Application page, select the backup server where the files to be restored reside. If multiple backup servers are available, select the IP address of the desired backup server from the Backup Server drop-down list. Otherwise, skip this step.
- Select an object to be restored and select the target file in the Backup File column. Restore the data based on Table 3-34.Table 3-34 Restoring database applications
Task
Operation
Restore database applications for a product.
Select the product and perform operations as prompted.
Restore database applications for product nodes.
Select the product and click
to view nodes under a product. Select nodes to be restored, and perform operations as prompted.
- On the management plane, choose System > Task List from the main menu and view the execution status of the task for restoring the database applications.
- If Task Status is Execution Succeeded, the database applications are restored successfully.
- If Task Status is Execution Failed, the product data fails to be restored. Contact Huawei technical support.
- Restore the product data. For details, see Restoring Product Data.
- Start the restored services. For details, see Starting the Service Plane Services.
- In a DR scenario, re-establish the DR relationship between the primary and secondary sites. For details, see Configuring the DR System.
Restoring Product Applications
If the product applications are abnormal due to damaged files of the product applications or configuration files, but the OS of the product node is still running properly, the product applications need to be restored.
Prerequisites
- You have logged in to the management plane. For details, see Logging In to the Management Plane.
- There are backup files of the product application to be restored.
- If the /opt/backup/backuptmp directory does not exist on the service nodes to be restored, perform the following operations to create the backuptmp directory and set an owner group for it.
- Use PuTTY to log in to the service nodes to be restored in SSH mode as the sopuser user.
- Run the following command to switch to the root user:
> su - root
Password: password for the root user
- Run the following commands to create the directory for the update and configure the certificate group:
# mkdir -p /opt/backup/backuptmp 2>/dev/null
# chown root:ossgroup /opt/backup
# chown ossadm:ossgroup /opt/backup/backuptmp
# chmod 750 /opt/backup/backuptmp
- In a DR scenario, the DR relationship between the primary and secondary sites has been deleted. For details, see Deleting the DR System.
Precautions
- The management plane will automatically stop all iMaster NCE-Campus services before restoring the product application.
- If the backup files on the backup server are manually deleted, or the files are not displayed in the Backup File column, click Synchronize to synchronize the information about the backup files to the management plane. Ensure that the parameters, including Backup Server IP Address, Username, Password, and Backup Path, of the backup server on the Configure Backup Parameters page on the management plane are consistent with those of the backup server where the desired backup files reside.
- The system automatically verifies the integrity of backup files. Only successfully verified files can be used for restoration.
- In a DR scenario, if a task related to DR operations other than freezing products is in progress, perform operations in this section after the task is complete. Otherwise, the DR task or the restoration task may fail.
Procedure
- On the management plane, choose .
- On the Restore Product Application page, select the backup server where the backup files used for the restoration reside. If multiple backup servers are available, select the IP address of the backup server from the Backup Server drop-down list. Otherwise, skip this step.
- Select an object to be restored and select the target file in the Backup File column. Restore the data based on Table 3-35.Table 3-35 Restoring product applications
Task
Operation
Restore the application of a product.
Select the product and perform operations as prompted.
Restore the product application for product nodes.
Select the product and click
to view nodes under a product. Select nodes to be restored, and perform operations as prompted.
- Start the restored services. For details, see Starting the Service Plane Services.
- In a DR scenario, re-establish the DR relationship between the primary and secondary sites. For details, see Deleting the DR System.
Restoring Product Data
If the product cannot be used when database instances are running properly but product data is abnormal, you can restore the product data based on the restoration scenario.
Prerequisites
- You have logged in to the management plane. For details, see Logging In to the Management Plane.
- There are backup files for the product data to be restored.
- All databases of the product are in the Running state. To confirm the database status, see Monitoring Databases.
- The IP address in the environment must be the same as those in the backup data.
- In a DR scenario, the DR relationship between the primary and secondary sites has been deleted. For details, see Deleting the DR System.
Context
If multiple backup servers are configured, the same backup data is stored on all the backup servers. The backup servers work in redundancy mode. During data restoration, all the backup servers can provide backup data. By default, the system randomly selects data from one of the backup servers. You can also specify a backup server to provide backup data.
Precautions
- If the backup files on the backup server are manually deleted, or the files are not displayed in the Backup File column, click Synchronize to synchronize the information about the backup files to the management plane. Ensure that the parameters, including Backup Server IP Address, Username, Password, and Backup Path, of the backup server on the Configure Backup Parameters page on the management plane are consistent with those of the backup server where the desired backup files reside.
- The system automatically verifies the integrity of backup files. Only successfully verified files can be used for restoration.
Procedure
- On the management plane, choose .
- On the Restore Product Data page, select the backup server where the files to be restored reside. If multiple backup servers are available, select the IP address of the backup server from the Backup Server drop-down list. Otherwise, skip this step.
- Select the product to be restored and select the target file in the Backup File column. Perform operations as prompted.
- If the selected backup file is generated by a backup task that is partially successful, only the data that has been successfully backed up will be restored. The backup file is incomplete, so the product may fail to start or run properly after the restoration. Exercise caution.
- In the dialog box displayed when you create a product data restoration task, set the product services to be automatically or manually started after the restoration is complete. For details about how to start product services, see Stopping the Service Plane Services.
- On the management plane, choose System > Task List from the main menu and view the task execution status.
- If the Task Status is Execution Succeeded, the product data is restored successfully.
- If the Task Status is Partially Succeeded, some service instances fail to be restored. Contact Huawei technical support.
- If Task Status is Execution Failed, the product data fails to be restored. Contact Huawei technical support.
- In a DR scenario, re-establish the DR relationship between the primary and secondary sites. For details, see Configuring the DR System.
- In a DR scenario, check the Data Synchronization Status of the product.
- On the management plane at the primary site, choose HA > Remote High Availability System > Manage DR System from the main menu.
- On the Manage DR System page, check whether Data Synchronization Status of the products is Abnormal.
- If yes, forcibly synchronize the product data between the primary and secondary sites. For details, see Synchronizing Product Data Between Primary and Secondary Sites.
- If no, no further action is required.
Restoring Data from a Remote Site
The management plane backup data of site A can be restored at site B through the management plane. In this manner, site B becomes the backup site of site A. When a new environment needs to be set up by importing data due to the migration of an equipment room or the fault recovery of a single site, you can use this function to restore data. The following takes sites A and B as an example to describe how to remotely restore the management plane.
Prerequisites
- The same deployment scheme is used, and the same software packages are installed at site A and site B, that is, the quantity and specifications of the nodes at the two sites, the management plane and product languages, versions, services and their versions, deployed products, node names, and keys must be the consistent.
- The time zone and time of the site A must be the same as that of the site B.
- You have obtained the IP address of the backup server, backup path, and backup server user with read and write permissions at site A and site B. This section uses the OMP_02 node for the backup server as an example.
Context
- Table 3-36 lists the scenarios supported by remote restoration.
Table 3-36 Application scenarios of remote restoration
Application Scenario
Architecture of Site A
Architecture of Site B
Scenario 1
x86
x86
Scenario 2
ARM
ARM
The OS at site A must be the same as that at site B, for example, both are EulerOS.
- When the product data is restored remotely, the data in Table 3-37 is not restored.
Table 3-37 Unrestored data
Domain
Feature
Unrestored Data of Site A
Impact
NCE (Common)
O&M management
Historical monitoring data, call chain data, alarm/event data, and health check report
The system at site B cannot query the historical data of the unified monitoring, call chain, alarm/event, and health check report in the system at site A.
NBI
Temporary alarm files of the OMC NBI
The alarms that have been reported to site A cannot be queried at site B, and the alarms reported to site B are numbered from 1.
Alarm and performance inventory files exported by the OMC NBI
Site B cannot obtain the alarm and performance inventory files exported by the OMC NBI at site A.
Inventory files exported by the REST NBI
Site B cannot obtain the inventory files exported by the REST NBI at site A.
Performance files exported by the performance text NBI
Site B cannot obtain the performance files exported by the performance text NBI at site A.
NE Software Management
NE backup files and NE software library files
The secondary site cannot obtain NE backup files and NE software library files of the primary site.
NCE (Super)
Service management and resource management
Redis cache data
Does not affect functions. When site B queries data from the database for the first time, the delay increases slightly. After the query is complete, the data is loaded from the database to the Redis.
NCE (Transport Domain)
NE communication
SSL certificate file used for communication between NCE and NEs
If the certificate used for communication between site A and NEs is user-defined, site B cannot communicate with NEs properly.
Service O&M
ASON trail optimization and rerouting records
The ASON trail optimization and rerouting records generated in the system of site A cannot be queried in the system of site B.
Data export through scheduled tasks
Files exported through scheduled tasks for transport NEs. For example, files exported by implementing functions such as DCC View Export, SDH Resource Statistics, SFP Information Collection, WDM Resource Statistics, Board Manufacturer Information, Transport DCN Report Export, Transport port NE Performance Collection and Export, Inventory Reports Export, Export Native Ethernet Services, Export WDM Trail, RTN Link Report Export, GNE Route and ECC Count Export, Network Health Check, Script Export, and Collect NE MML Configuration Data.
Files exported through scheduled tasks at site A cannot be obtained in the system of site B.
Precautions
In a DR scenario, if a task related to DR operations other than freezing products is in progress, perform operations in this section after the task is complete. Otherwise, the DR task or the restoration task may fail.
Procedure
- Download the backup files for the product data from the backup server of site A to your PC.
- Use FileZilla to log in to the backup server of site A as the backup server user. For details, see Transferring Files Using FileZilla.
- Download the backup files for the product data to your PC from the /root directory of the backup server user/path specified in the backup parameters/product name/dynamic/ directory.
for example, /opt/backup/ftpboot/backup/NCECAMPUS/dynamic on the OMP_02 node.
- If site A has the DR relationship, after the backup files are downloaded to your PC, check whether the dr_site name.tag and dr_site name.tag.sign identification files exist in the backup package named the timestamp.
- Use PuTTY to log in to the backup server of site B as the sopuser user in SSH mode. For details, see Logging In to a Server Using PuTTY.
- Run the following commands to check whether the NCECAMPUS directory exists:> su - root
Password:password for the root user
# cd /opt/backup/ftpboot/backup/
# ll
- If information similar to the following is displayed, the NCECAMPUS directory exists. Go to 4.
drwx------. 3 ftpuser ossgroup 4096 Nov 7 22:09 NCECAMPUS
- If information similar to the following is displayed, the NCECAMPUS directory does not exist. Run the following commands to create the NCECAMPUS/dynamic directory and go to 5:
total 0
# cd /opt/backup/ftpboot/backup/
# mkdir -p NCECAMPUS/dynamic
# chown ftpuser:ossgroup /opt/backup/ftpboot/backup/NCECAMPUS/
# chown ftpuser:ossgroup /opt/backup/ftpboot/backup/NCECAMPUS/dynamic
# chmod 700 /opt/backup/ftpboot/backup/NCECAMPUS/
# chmod 700 /opt/backup/ftpboot/backup/NCECAMPUS/dynamic
# cd /opt/backup/ftpboot/backup/NCECAMPUS/
# ll
If information similar to the following is displayed, the backup directory is created successfully:
drwx------. 3 ftpuser ossgroup 4096 Nov 7 22:09 dynamic
- If information similar to the following is displayed, the NCECAMPUS directory exists. Go to 4.
- Run the following commands to check whether the dynamic directory exists:> su - root
Password: password for the root user
# cd /opt/backup/ftpboot/backup/NCECAMPUS/
# ll
- If information similar to the following is displayed, the dynamic directory exists. Go to 5.
drwx------. 3 ftpuser ossgroup 4096 Nov 7 22:09 dynamic
- If information similar to the following is displayed, the dynamic directory does not exist. Run the following commands to create the dynamic directory and go to 5:
total 0
# cd /opt/backup/ftpboot/backup/NCECAMPUS/
# mkdir dynamic
# chown ftpuser:ossgroup /opt/backup/ftpboot/backup/NCECAMPUS/dynamic
# chmod 700 /opt/backup/ftpboot/backup/NCECAMPUS/dynamic
# ll
If information similar to the following is displayed, the backup directory is created successfully:
drwx------. 3 ftpuser ossgroup 4096 Nov 7 22:09 dynamic
- If information similar to the following is displayed, the dynamic directory exists. Go to 5.
- Set the owner group and permission of the /opt/backup/sopuserboot directory on the backup server of site B.
Use PuTTY to log in to the backup server of site B as the sopuser user in SSH mode.
- Run the following commands to set the permissions for the backup files of the product data:> su - root
Password: password for the root user
Setting the owner group and permission of the /opt/backup/sopuserboot folder.
# chown sopuser:ossgroup /opt/backup/sopuserboot
# chmod 750 /opt/backup/sopuserboot
- Use FileZilla to log in to the backup server of site B as the sopuser. For details, see Transferring Files Using FileZilla.
- Upload the backup data package of the product data obtained at site A to the /opt/backup/sopuserboot directory of site B.
- Move the product data backup file to the /opt/backup/ftpboot/backup/NCECAMPUS/dynamic/ directory and set the permissions of the product data backup file.
- Use PuTTY to log in to the backup server of site B as the sopuser user in SSH mode.
- Run the following commands to set the permissions for the backup files of the product data:> su - root
Password:
# cd /opt/backup/sopuserboot
# mv timestamp directory /opt/backup/ftpboot/backup/NCECAMPUS/dynamic/
# chmod 700 /opt/backup/ftpboot/backup/NCECAMPUS/dynamic/timestamp directory
# chown -R ftpuser:ossgroup /opt/backup/ftpboot/backup/NCECAMPUS/dynamic/timestamp directory
# find /opt/backup/ftpboot/backup/NCECAMPUS/dynamic/timestamp directory -type f| xargs chmod 600
- Perform the following operations to synchronize the backup file to site B automatically:
- Log in to the management plane. For details, see Logging In to the Management Plane.
- Choose Backup and Restore > Data Restore > Restore Product Data from the main menu.
- On the Restore Product Data page, click Synchronize.
- In the Backup File column, check whether the backup file at site B is the same as that at site A.
- If the file is the same, perform 11.
- If the file is not the same, contact Huawei technical support engineers.
- Perform the restoration task and check whether the restoration task is executed successfully. For details, see 6.6.1 Restoring Product Data.
- If the task is successful, remote restoration is successful.
- If the task fails, contact Huawei technical support engineers.
Configuring Products
After products are installed, product information is automatically added to the management plane. You do not need to manually add product information. The current iMaster NCE-Campus version does not support the function of manually adding product information on the product information configuration page but supports only the deletion function.
Prerequisites
- You have logged in to the management plane. For details, see Logging In to the Management Plane.
- In a DR system, the product DR relationship between the primary and secondary sites has been deleted. For details, see Separating the Primary and Secondary Site Products.
- You have uninstalled the product software. For details, see Uninstalling Product Software.
Precautions
- When you delete the product information, the following information will be deleted and the product will be removed from the management plane. Exercise caution when performing this operation.
- Services and data installed on all nodes of the product
- Historical backup data of the product
- If a node is faulty, you need to delete the node and replace it with a new node. If the capacity expansion fails and the rollback fails, you need to delete the node and add a node. The requirements for deleting a node are as follows:
- If ZookeeperService or Etcd is deployed on a node, the node cannot be deleted.
- If the management and product nodes are the same node, the node cannot be deleted.
- If a database node has master and slave nodes, delete both of them.
- Exercise caution when you delete nodes of a product, because the services and data on the nodes will be deleted and the nodes will be removed from the management plane accordingly.
- When you delete nodes of a product, the system first automatically stops the services and database on the nodes to be deleted.
Procedure
- On the management plane, choose Product > Product Planning > Import Product Information from the main menu.
- On the Import Product Information page, select the product to be deleted and click Delete.
- After configuration parameters are modified, the historical backup files of the management plane will become invalid. Scheduled tasks for backing up the application and data of the management plane and the OS of the product node will be automatically created. On the management plane, choose System > Task List from the main menu to check whether the tasks have been automatically created. If not, manually back up the corresponding data. For details, see Backing Up Products.
Backup operations occupy system resources. On the management plane, choose Backup and Restore > Configuration > Configure Scheduled Backup Task from the main menu to modify the start time of the scheduled backup task, if the execution time of the task is in busy hours.
- In a DR scenario, connect the primary and secondary site products after the nodes are deleted. For details, see Connecting the Primary and Secondary Site Products.
Network Configuration
On the management plane, you can configure network information for each node to ensure that nodes successfully communicate with each other.
Network Configuration Scenarios and Policies
Before network configuration, you are advised to learn the network configuration scenarios and policies.
Typical Network Configuration Scenarios and Policies
Table 3-38 lists only the network configuration policies of each node (such as the hostname, network interface, IP address, and route). Perform other operations based on site requirements.
Scenario |
Configuration Object |
Configuration Method |
---|---|---|
The network to be configured is in the same network segment as the current network, for example, the management IP address is changed from 10.0.0.1 to 10.0.0.X (X ranges from 2 to 254). |
One or all nodes |
Log in to the management plane to configure the network. |
The network to be configured is in a different network segment from the current network, for example, the equipment room is relocated. |
All nodes |
|
Configuring Network Interfaces
If the network interface configuration changes, (for example, an equipment room is relocated or a subnet is changed), the network interface must change accordingly. After a network interface on a VM is added to the management plane, you can manage and maintain the network interface on the management plane.
Context
Set the NIC usage according to the service scenario. In addition to the usages listed in the following tables, the following usages can be allocated to each IP address:
- Southbond2–Southbond10: If multiple southbound IP addresses of a single node need to be isolated on the live network, allocate different NIC usages for each southbound IP address, for example, Southbound and Southbound2.
- Northbond2–Northbond3: If multiple northbound IP addresses of a single node need to be isolated on the live network, allocate different NIC usages for each northbound IP address, for example, Northbound and Northbound2.
- Multiserver-Multiserver3: If the current floating IP address is abnormal, replace the floating IP address with the Multiserver IP address and log in to the web client.
Table 3-39 Usage of network interfaces in the Distributed scenario
Network Plane
NIC
NIC Usage
Involved Node
NOTE
Management network
eth4
HardwareMgr
OMP
The NIC is used to monitor hardware alarms on iMaster NCE-Campus The eth4 NIC of the OMP node is used by default.
Inter-node communication network
eth0
MANAGEIP
All nodes
The NIC is used by the OMP node to manage other nodes and for communication between service nodes.
Maintenance
Service nodes
DR network
eth0:1
ACCESSIP
-
The NIC is used for communication between primary and secondary sites.
Service network
(The northbound and southbound networks are not isolated.)
eth2
Service,
Northbound and access-external
NMS
The NIC is used for communication between the service node and OSS or used to log in to the OS.
floatingbase
Common_Service
The NIC is used to configure the fixed IP address of the floating IP address on the Common_Service node.
Northbound,
external-access
Common_Service
The NIC is used to log in to the web client of the service plane with the floating IP address of the Common_Service node.
WEBMGMTIP
OMP
The NIC is used to log in to the web client of the management plane through the OMP node.
Southbound
TController_Driver01
TController_Driver02
The NIC is used for communication between the service node and southbound devices.
Service network
(The southbound and northbound networks are isolated, and the Manager and Controller are not isolated.)
eth3
Service,
Northbound and access-external
For all nodes
The NIC is used for communication between the service node and OSS or used to log in to the OS.
floatingbase
Common_Service01
Common_Service02
The NIC is used to configure the fixed IP address of the floating IP address on the Common_Service node.
Northbound,
external-access
Common_Service01
Common_Service02
The NIC is used to log in to the web client of the service plane with the floating IP address of the Common_Service node.
WEBMGMTIP
OMP
The NIC is used to log in to the web client of the management plane through the OMP node.
eth2
Southbound
TController_Driver01–TController_Driver02
The NIC is used for communication between the service node and southbound devices of the Controller.
eth5
Southbound
NMS
The NIC is used for communication between the service node and southbound devices of the Manager.
Service network
(The southbound and northbound networks are isolated, and the Manager and Controller are isolated.)
eth3
Service,
Northbound and access-external
All nodes
The NIC is used for communication between the service node and OSS or used to log in to the OS.
floatingbase
Common_Service01
Common_Service02
The NIC is used to configure the fixed IP address of the floating IP address on the Common_Service node.
Northbound,
external-access
Common_Service01
Common_Service02
The NIC is used to log in to the web client of the service plane with the floating IP address of the Common_Service node.
WEBMGMTIP
OMP
The NIC is used to log in to the web client of the management plane through the OMP node.
eth2
Southbound
TController_Driver01–TController_Driver02
The NIC is used for communication between the service node and southbound devices of the Controller.
eth5
Southbound
NMS
The NIC is used for communication between the service node and southbound devices of the Manager.
Service network
(IP+Optical)
eth3
Service,
Northbound and access-external
All nodes
The NIC is used for communication between the service node and OSS or used to log in to the OS.
floatingbase
Common_Service01
Common_Service02
The NIC is used to configure the fixed IP address of the floating IP address on the Common_Service node.
Northbound,
external-access
Common_Service01
Common_Service02
The NIC is used to log in to the web client of the service plane with the floating IP address of the Common_Service node.
WEBMGMTIP
OMP
The NIC is used to log in to the web client of the management plane through the OMP node.
eth2
Southbound
TController_Driver01
TController_Driver02
For the service nodes that communicate with the IP domain Controller southbound devices.
eth5
Southbound
NMS
For the service nodes that communicate with IP domain Manager southbound devices.
eth2
Southbound
TController_Driver01
TController_Driver02
For the service nodes that communicate with the transport domain Controller southbound devices.
eth5
Southbound
NMS
For the service nodes that communicate with the transport Manager southbound devices.
Table 3-40 IP Usage DescriptionIP Usage
Description
Usage Description
HDmgmtbound
IP address of the hardware management network.
Floating IP address of the hardware management network, which is used to receive hardware alarms.
HDmgmtFloatingBase
Basic fixed IP address of the floating IP address of the hardware management network.
Fixed IP address of the hardware management network.
HAdisasterbound
Heartbeat IP address of the DR system.
Heartbeat IP address of the remote HA system, which is used to check whether the peer is running properly.
PCEPdbs
Floating IP address of the DBS service cluster.
Floating IP address of the FusionInsight HD database.
PCEPweb
Floating IP address of the WEB service cluster.
Floating IP address for accessing the FusionInsight HD WebUI.
PCEPoms
Floating IP address of the OMS service cluster.
Floating IP address of the FusionInsight HD O&M management plane.
JHS
Floating IP address of the JHS service cluster.
Floating IP address of the JHS service cluster.
lvs-virtual-ip
Virtual IP address of the LVS service cluster.
Floating IP address of the interconnected LVS used by ODAE. (It is available in input.json.)
PCEPlvs
Virtual IP address of the LVS service cluster.
Floating IP address of the interconnected LVS used by ODAE. (It is available on the ROC website and assigned to lvs-virtual-ip.)
VimBound
Southbound IP address of the Vim service.
IP address for the Vim service to interconnect with the cloud service (such as FusionSphere and FusionStage) at the IaaS layer.
VimBound on LCM indicates a floating IP address.
Service
IP address of the platform service application.
External IP address of the service plane.
INNER_ACCESS
Internal access.
Internal communication IP address of the service plane.
Analyzer_SERVER_IP
Analyzer service IP address.
Floating IP address (eth0) for Analyzer node disaster recovery (active/standby).
NMS_SERVER_IP
NMS service IP address.
Local IP address used by the Manager microservice.
Southbound
IP address for southbound network communication.
IP address 1 for southbound access.
Southbound2BaseIP
Basic fixed IP address of the floating IP address of southbound network 2.
Basic fixed IP address 2 for southbound access.
BGPSouthBound
Floating IP address of the controller service southbound network.
Southbound floating IP address of IP Controller, which is used for communicating with devices.
Southbound
IP address for southbound network communication.
IP address 1 for southbound access.
LVS-VIP2
Floating IP address of LVS southbound network 2.
Southbound floating IP address of the LVS (load balancing) for connecting to NEs. This IP address is not used currently.
LVS-VIP
Floating IP address of the LVS southbound network.
Southbound floating IP address of the LVS (load balancing) for connecting to NEs. (eth5)
SouthboundBaseIP
Basic fixed IP address of the floating IP address of the southbound network.
Basic fixed IP address 1 for southbound access.
Northbound
IP address for northbound network communication.
IP address 1 for northbound access.
NorthboundBaseIP
Basic fixed IP address of the floating IP address of the northbound network.
Basic fixed IP address 1 for northbound access.
AnalyzerCollector_SERVER_IP
IP address of the Analyzer collection service.
Floating IP address (eth0) for collector node disaster recovery (active/standby).
ommha_heartbeat
Heartbeat IP address of the ommha service.
The local primary and secondary sites need to use OMM HA. This IP address is the heartbeat IP address of the OMM HA service between the primary and secondary sites.
Data_Replication_IP
Data replication IP address
IP address used for data replication between the primary and secondary sites in a DR scenario.
DRHEARTBEAT
Data replication/heartbeat IP address
Used for data replication and heartbeat detection between the primary and secondary sites in the DR scenario.
globalBackendLBFloatIP
Floating IP address for inter-system communication
Used for internal services communication and services communication between the primary and secondary sites.
Prerequisites
- You have logged in to the management plane. For details, see Logging In to the Management Plane.
- The network interface to be added to the management plane exists on the VM.
- In a DR scenario, the DR relationship must be deleted as follows before you configure the network interfaces:
- For a product node, the primary and secondary site products must be separated. For details, see Separating the Primary and Secondary Site Products.
- If the product node and the management node are the same node, the DR relationship between the primary and secondary sites must be deleted. For details, see Deleting the DR System. For details about how to check that the management node and product node are the same node, see How Do I Determine the Deployment Mode of Nodes?
- For the management node, the DR relationship between the primary and secondary sites must be deleted. For details, see Deleting the DR System.
Precautions
- In a Distributed scenario, after iMaster NCE-Campus is installed, you need to add the eth4 network ports on the OMP_01 and OMP_02 nodes on the management network so that the management plane can manage the IP address of the hardware management network of the OMP nodes.
- When a network interface is deleted, its IP address will be deleted, and the network interface will be removed from the management plane. However, the network interface on the VM will not be deleted.
- If MCZKService is deployed on the OMP node, MCZKService will be automatically restarted during the network interface configuration.
- During the restart of MCZKService, the services monitored by MCZKService are also restarted and become temporarily unavailable. After the restart, the services become available again.
- During the restart, the service plane may report the Failed to connect ZooKeeper alarm because MCZKService is stopped, which is normal. After the restart, the service plane automatically clears this alarm.
- If you configure the network interface of the management node, the system automatically restarts OMMHAService on all nodes. If you configure the network interface of a product node, the system automatically restarts OMMHAService on all nodes of the product. During the restart of OMMHAService, the services monitored by OMMHAService are also restarted and become unavailable temporarily. After the restart, the services become available again. If OMMHAService is not deployed on the management node or the product node, restart is not involved.
- If the product services are not in the Running state, after a network interface is added or deleted, the product services will be automatically started.
If a network interface without IP address is added or deleted, the system does not need to start product services.
- In a DR scenario, if you configure a network interface of the secondary site, do not select Automatically start the product services after the configuration in the Warning dialog box.
- In a DR scenario, configure network interfaces at the primary site and then perform the same configurations at the secondary site to ensure that the network interface configurations at the primary and secondary sites are consistent.
- If any OMP node is faulty, restore the node first. Otherwise, network interfaces cannot be configured. For details, see "Management Node Faults" in Troubleshooting Guide.
- After a network interface is configured, historical backup files of the application and data of the management plane, OS of the OMP node, OS of the product nodes, database applications, product applications, and product data will become invalid. A scheduled task for backing up the application and data of the management plane will be automatically created.
Backup operations occupy system resources. On the management plane choose Backup and Restore > Configuration > Configure Scheduled Backup Task from the main menu to modify the start time of the scheduled backup task, if the execution time of the task is in busy hours.
Procedure
- On the management plane, choose Maintenance > Network Configuration > Configure NIC from the main menu.
- On the Configure NIC page, perform operations as prompted.
- If you want to perform other configuration operations that need to restart product services after configuring the network interfaces, do not select Automatically start the product services after the configuration in the Warning dialog box. In this case, after the configuration, the product services will not be automatically started, preventing the product services from being restarted for several times.
- In a DR scenario, if you are configuring the network interfaces at the secondary site, do not select Automatically start the product services after the configuration in the Warning dialog box, preventing the product services of the secondary site from being restarted and causing the product to become dual-active.
- In a DR scenario, re-establish the DR relationship as follows after configuring network interfaces:
- After configuring the network interface of a product node, connect the primary and secondary site products. For details, see Connecting the Primary and Secondary Site Products.
- If the product node and the management node are the same node, after configuring the network interface of the node, re-establish the DR relationship between the primary and secondary sites. For details, see Configuring the DR System.
- After configuring the network interface of the management node, re-establish the DR relationship between the primary and secondary sites. For details, see Configuring the DR System.
Configuring Routes
If the network configuration changes (for example, an equipment room is relocated or a subnet is changed), or if communication with networks in different network segments is required, the route configuration must be modified accordingly to ensure proper system running. If no route is configured between the system and the destination IP address, the system communicates with the destination IP address using the default route.
Prerequisites
- You have obtained the destination network, subnet mask, and gateway of the route.
- You have logged in to the management plane. For details, see Logging In to the Management Plane.
Precautions
- Do not delete the route used for connecting to the management when deleting a route of the management node. Otherwise, you cannot log in to the management plane.
- If there are two routes with the same destination address, you need to manually delete the unnecessary route on the Configure Route page. Otherwise, the network connection to the destination address is abnormal.
Procedure
- On the management plane, choose Maintenance> Network Configuration > Configure Route from the main menu.
- In the Select node and query route area, select the nodes with routes to be configured.
- Click Query.
- In the Configure route area, modify, add, or delete routes for the selected nodes. For details, see Table 3-41.Table 3-41 Configuring routes
Task
Operation
Modifying routes
If NIC Name is -, the node has the default route. If the default route exists, you can modify it but cannot add other default routes for the node.
IP addresses of the same protocol (IPv4 or IPv6) have only one default route.- Select and modify a route.
- To modify the default route, select the corresponding records whose NIC Name is -, and configure Gateway/Next Hop.
- To modify a non-default route, select the desired network interface from the NIC Name drop-down list based on the planning information, and configure Destination Network, Subnet Mask/Prefix Length, and Gateway/Next Hop.
- Click Apply.NOTE:
Clicking Reset will clear all unsaved configurations on the page and will restore the configurations to the state before you select nodes and query routes.
- In the Confirm dialog box, click Yes.
The related task is created successfully. Click Task List to view the task execution status. If the task execution fails, rectify the fault based on the task details.
Adding routes
If no default route is available or new routes need to be added, perform the following operations:
- Click Add Route.
- Select nodes with routes to be added.
- To add a default route, select Default Route and configure Gateway/Next Hop.
- To add a non-default route, select the desired network interface from the NIC Name drop-down list, and configure Destination Network, Subnet Mask/Prefix Length, and Gateway/Next Hop.
- Click OK.
- In the Confirm dialog box, click Yes.
The related task is created successfully. Click Task List to view the task execution status. If the task execution fails, rectify the fault based on the task details.
Deleting routes
If routes are no longer necessary, delete them to save resources.
- Select the desired routes and delete them.
- To delete default routes, select the desired routes, and click Delete Route.
- To delete non-default routes, select one or more desired routes and click Delete Route.
- In the Warning dialog box, click OK.
The related task is created successfully. Click Task List to view the task execution status. If the task execution fails, rectify the fault based on the task details.
NOTE:Do not delete the route used for connecting to the management plane when deleting a route of the management node. Otherwise, you cannot log in to the management plane.
- Select and modify a route.
Configuring Floating IP Addresses
You can configure floating IP addresses for multiple nodes of the same type to prevent the system from failing to provide services due to the fault a single node. During iMaster NCE-Campus installation with EasySuite, the floating IP address has been configured. If an IP address conflict occurs or the overall network plan changes (such as equipment room relocation or subnet change), change the floating IP address.
Prerequisites
- You have logged in to the management plane. For details, see Logging In to the Management Plane.
- Databases of the product with floating IP address to be changed are running properly. For details, see Monitoring Databases.
- A floating IP address can be configured only for nodes of the same type. For details about how to check the node type, see Monitoring Databases.
- In a DR scenario, the DR relationship must be deleted as follows before you configure the floating IP address:
- For a product node, the primary and secondary site products must be separated. For details, see Separating the Primary and Secondary Site Products.
- If the product node and the management node are the same node, the DR relationship between the primary and secondary sites must be deleted. For details, see Deleting the DR System. For details about how to check that the management node and product node are the same node, see How Do I Determine the Deployment Mode of Nodes?
- For the management node, the primary and secondary site products must be deleted. For details, see Deleting the DR System.
Precautions
- The floating IP address on which services depend has been configured to be undeletable to prevent service exceptions caused by misoperations. If you select the floating IP address, the Delete button is dimmed, indicating that the floating IP address cannot be deleted.
- After you change the floating IP address, the product services will be automatically restarted by default.
- If MCZKService is deployed on the OMP node, MCZKService will be automatically restarted during the floating IP address configuration. During the restart, the service plane may report the Failed to connect ZooKeeper alarm because MCZKService is stopped, which is normal. After the restart, the service plane automatically clears this alarm.
- If the new and old floating IP addresses belong to different network segments, the fixed IP address need to be changed accordingly to ensure that the floating IP address and the fixed IP address are in the same network segment. There is no requirement on the change sequence.
- If any OMP node is faulty, restore the node first. Otherwise, floating IP addresses cannot be configured. For details, see "Management Node Faults" in Troubleshooting Guide.
- After a floating IP address is configured, historical backup files of the application and data of the management plane, OS of the OMP node, OS of the product nodes, database applications, product applications, and product data will become invalid. A scheduled task for backing up the application and data of the management plane will be automatically created.
Backup operations occupy system resources. On the management plane, choose Backup and Restore > Configuration > Configure Scheduled Backup Task from the main menu to modify the start time of the scheduled backup task, if the execution time of the task is in busy hours.
Procedure
- On the management plane, choose Maintenance > Network Configure > Configure Floating IP Address from the main menu.
- On the Configure Floating IP Address page, perform operations as prompted.
- After configuring the floating IP addresses, if you want to perform other configuration operations that need to restart product services , do not select Automatically start the product services after configuring the floating IP address in the Warning dialog box. In this case, after the configuration, the product services will not be automatically started, preventing the product services from being repeatedly restarted.
- In a DR scenario, if you are configuring the floating IP addresses at the secondary site, do not select Automatically start the product services after configuring the floating IP address in the Warning dialog box, preventing the product services of the secondary site from being restarted and causing the product to become dual-active.
- If OMMHA is deployed and a floating IP address has been configured for a node of the service plane, you need to add or delete a floating IP address on the node for the new floating IP address, that is, the IP address of the service plane, to take effect. For details, see Modifying Configuration Parameters After a Floating IP Address Is Added or Deleted.
- In a DR scenario, re-establish the DR relationship as follows after configuring the floating IP address:
- After configuring the floating IP address for a product node, connect the primary and secondary site products. For details, see Connecting the Primary and Secondary Site Products.
- If the product node and the management node are the same node, after configuring the floating IP address for the node, re-establish the DR relationship between the primary and secondary sites. For details, see Configuring the DR System (Manual Switchover).
- After configuring the floating IP address for the management node, re-establish the DR relationship between the primary and secondary sites. For details, see Configuring the DR System (Manual Switchover).
Follow-up Procedure
Manually back up the OS of the management node, OS of product nodes, database applications, product applications, and product data. For details, see Backing Up the Management Plane and Backing Up Products.
Alarm Configuration
The management plane receives alarms from hardware (for example, hardware servers) and reports the alarms to the service plane. You can monitor the alarms in a unified manner on the service plane.
Modifying Server Alarm Forwarding Parameters (Rack Server)
During iMaster NCE-Campus installation with EasySuite, the rack server has been configured to send alarms. If the alarm forwarding parameters are manually modified on the rack server, modify the receiving parameters recorded on the management plane accordingly.
Prerequisites
To configure server alarm reporting, ensure that iBMC can communicate with the service distribution plane.
You have logged in to the server controller card. For details, see How Do I Use an iBMC IP Address to Remotely Log In to the Server?
Context
Hardware alarms are monitored by the management module of the server controller card. Through the default communication IP address (configured by EasySuite) of the hardware monitoring module, the OMP node communicates with the iBMC, which helps the management plane receive hardware alarms.
Procedure
- If the iBMC version of the delivered device is 5.00 or later, perform the following steps.
- On the iBMC GUI, choose Maintenance > Alarm Settings. Click Trap Notification, and complete related operations by referring to Table 3-42 and click Save.
Table 3-42 Trap function settings
Task
Procedure
Modifying Trap Function
Set Trap Function to Enable.
Modifying Trap Version
Select SNMPv3.
Modifying SNMPv3 User
Set this parameter to the username used by the controller card to report alarms to the alarm server.
Modifying Trap Mode
2288H V5: Set this parameter to OID.
2288X V5/TaiShan: Set this parameter to Precise Alarm (recommended).
Modify Include Alarm Severities
The recommended options include Minor.
- In the Trap Server and Message Format area, click Edit of a row. Complete related operations by referring to Table 3-43 and click Save.
Table 3-43 Configure Trap Server and Packet Format
Task
Procedure
Modify Current Status
Set this parameter to Enable.
Modify Trap Server Address
Set this parameter to the internal floating IP address 1 of the service distribution plane configured during installation. You can also run the ifconfig eth1:1 or ifconfig bond1:1 command on the server to obtain this IP address.
Modify Port
Set this parameter to 30085.
Modify Message Delimiter
Select ;.
Select Message Content
Select Time, Sensor Name, Severity, Event Code, and Event Description.
Modify Display Keyword in Message
Select Yes in the Display Keywords in Message area.
- On the iBMC GUI, choose Maintenance > Alarm Settings. Click Trap Notification, and complete related operations by referring to Table 3-42 and click Save.
- If the iBMC version of the delivered device is earlier than 5.00, perform the following steps.
- On the iBMC GUI, choose Alarm & SEL > Alarm Setting. Complete related operations by referring to Table 3-44 and click Save.
Table 3-44 Alarm trap notification settings
Task
Procedure
Modifying Trap Notification Settings
Set Trap Function to ON.
Modifying Trap Version
Select SNMPv3.
Modifying SNMPv3 User
Set this parameter to the username used by the controller card to report alarms to the alarm server.
Modifying Trap Mode
2288H V5: Set this parameter to OID.
2288X V5/TaiShan: Set this parameter to Precise Alarm (recommended).
Modify Include Alarm Severities
The recommended options include Minor.
- In Configure Trap Server and Packet Format, click
of a row. Complete related operations by referring to Table 3-45 and click Save.
Table 3-45 Configure Trap Server and Packet FormatTask
Procedure
Modify Current Status
Set this parameter to ON.
Modify Trap Server Address
Set this parameter to the internal floating IP address 1 of the service distribution plane configured during installation. You can also run the ifconfig eth1:1 or ifconfig bond1:1 command on the server to obtain this IP address.
Modify Trap Port
Set this parameter to 30085.
Modify Message Delimiter
Select ;.
Modify Select Message Content
Select Time, Sensor Name, Severity, Event Code, and Event Description.
Modify Display Keyword in Message
Select Display Keyword in Message.
If the iBMC versions of the delivered devices are different, some operations on the iBMC WebUI are correspondingly different. Perform the following operations to view the iBMC version information.- Log in to the iBMC web UI. For details, see How Do I Use an iBMC IP Address to Remotely Log In to the Server?
- View the iBMC version information.
- If the iBMC version is 3.01.0.0 or later, on the iBMC homepage, view iBMC Firmware Version in the Device Info area.
- If the iBMC version is 5.00 or later, on the iBMC homepage, view iBMC Version in the Device Info area.
- If the iBMC version is earlier than 5.00, on the iBMC homepage, view iBMC Firmware Version in the Basic Info area.
- On the iBMC GUI, choose Alarm & SEL > Alarm Setting. Complete related operations by referring to Table 3-44 and click Save.
Follow-up Procedure
- If the iBMC version of the delivered device is 5.00 or later, perform the following steps.
- Log in to the controller card. For details, see How Do I Use an iBMC IP Address to Remotely Log In to the Server?
- Choose Maintenance > Alarm Settings, and click Trap Notification.
- In the Trap Server and Message Format area, click Test in the row with Current Status set to Enabled.
If the message Operation Successful is displayed, the hardware alarms for Huawei servers are successfully configured.
- If the iBMC version of the delivered device is earlier than 5.00, perform the following steps.
- Log in to the controller card. For details, see How Do I Use an iBMC IP Address to Remotely Log In to the Server?
- Choose Alarm & SEL > Alarm Setting to access the Alarm Setting window.
- In Configure Trap Server and Packet Format, click Test in the row with Current Status set to ON.
If the message Success is displayed, the hardware alarms for Huawei servers are successfully configured.
If the iBMC versions of the delivered devices are different, some operations on the iBMC WebUI are correspondingly different. Perform the following operations to view the iBMC version information.- Log in to the iBMC web UI. For details, see How Do I Use an iBMC IP Address to Remotely Log In to the Server?
- View the iBMC version information.
- If the iBMC version is 3.01.0.0 or later, on the iBMC homepage, view iBMC Firmware Version in the Device Info area.
- If the iBMC version is 5.00 or later, on the iBMC homepage, view iBMC Version in the Device Info area.
- If the iBMC version is earlier than 5.00, on the iBMC homepage, view iBMC Firmware Version in the Basic Info area.
Modifying FusionCompute Alarm Forwarding Parameters
During iMaster NCE-Campus installation with EasySuite, the FusionCompute has been configured to send alarms. If the alarm forwarding parameters are manually modified on the server, modify the receiving parameters recorded on the management plane accordingly.
Prerequisites
To configure server alarm reporting, ensure that FusionCompute can communicate with the service distribution plane.
You have logged in to FusionCompute. For details, see How Do I Log In to FusionCompute?
Procedure
- In the navigation pane of FusionCompute, click
. The System page is displayed.
- Choose System > Connect To > SNMP Mgt. Station. The SNMP Mgt. Station page is displayed.
- Set FusionCompute alarm forwarding parameters by referring to Table 3-46 and click Save.Table 3-46 FusionCompute alarm forwarding parameters
Task
Procedure
Name
Set this parameter to FC_alarm.
SNMP Version
Select SNMPv3.
Maintenance Port
Set this parameter to 30085.
IP Address
Set this parameter to the internal floating IP address 1 of the service distribution plane configured during installation. You can also run the ifconfig eth1:1 or ifconfig bond1:1 command on the server to obtain this IP address.
Security Username
Set this parameter to FC_admin.
Authentication Protocol
Select HMAC-SHA.
Authentication Password
Set this parameter to the authentication password for the FC_admin user.
Confirm Authentication Password
Privacy Password
Set this parameter to the encryption password for the FC_admin user.
Confirm Privacy Password
Timeout Duration
Retain the default value.
Privacy Protocol
Retain the default value.
Set the authentication password and encryption password for the FC_admin user to different values.
Setting Server Alarm Receiving Parameters (Rack Server)
During iMaster NCE-Campus installation with EasySuite, the 2288H V5/TaiShan server has been configured to send alarms. You only need to set the alarm receiving parameters for the server on the management plane. The management plane then can receive server alarms and report the alarms to the service plane.
Prerequisites
You have logged in to the management plane. For details, see Logging In to the Management Plane.
Precautions
In a DR scenario, you need to configure the server alarm receiving parameters at the primary site based on the server of the primary site, and configure the parameters at the secondary site based on the server of the secondary site.
Procedure
- On the management plane, choose from the main menu.
- On the Configure Alarm Reporting Parameters page, complete related operations by referring to Table 2-1.Table 3-47 Server alarm reporting parameters
Parameter
Description
IP Address
iMana/iBMC IP address.
Server Type
Set this parameter to RH2288H V2/V3/V5/TaiShan 200 (Model 2280).
SNMP Username
Set this parameter to the alarm reporting user.
Authentication Password
SNMP authentication password. Enter the login password for the alarm reporting user.
Encryption Password
SNMP encryption password. Enter the SNMPv3 encryption password of the alarm reporting user.
- By default, the Administrator user of the 2288H V5/TaiShan alarm node functions as the SNMP user, and the login password of the Administrator user functions as the authentication password and the encryption password at the same time. If an alarm message is displayed indicating that the two passwords are the same, confirm the risk and continue.
- You are advised to use a dedicated SNMP user on the 2288H V5/TaiShan for alarm reporting. When an alarm is reported, the management plane connects to the server using the SNMP user. If the login user is reused, the login user may be locked after the user login password is changed. If you need to add an alarm reporting user, add the user by referring to How Do I Add an Alarm Reporting User (Rack Server)? and set different authentication and encryption passwords for the user.
Follow-up Procedure
If the parameters on the hardware server have changed, reset the alarm receiving parameters.
Setting FusionCompute Receiving Parameters
During iMaster NCE-Campus installation with EasySuite, the FusionCompute has been configured to send alarms. You only need to set the alarm receiving parameters for the FusionCompute on the management plane. The management plane then can receive the FusionCompute alarms and report the alarms to the service plane
Prerequisites
You have logged in to the management plane. For details, see Logging In to the Management Plane.
Precautions
In a DR scenario, you need to configure the FusionCompute alarm receiving parameters at the primary site based on the server of the primary site, and configure the parameters at the secondary site based on the FusionCompute of the secondary site.
Procedure
- On the management plane, choose from the main menu.
- On the Set Server Alarm Receiving Parameters page, complete related operations by referring to Table 3-48.Table 3-48 FusionCompute alarm receiving parameters
Parameter
Description
IP Address
Set this parameter to the VRM floating IP address.
Server Type
Select FusionCompute.
SNMP Username
Set this parameter to FC_admin. The value of this parameter must be the same as the value of Security Username set in Modifying FusionCompute Alarm Forwarding Parameters.
Authentication Password
SNMP authentication password. Enter the authentication password for the FC_admin user. For details about the password, see Virtualization-Layer User List.
Encryption Password
SNMP encryption password. Enter the encryption password for the FC_admin user. For details about the password, see Virtualization-Layer User List.
- Manually clear all alarms on FusionCompute.
- Log in to FusionCompute. For details, see How Do I Log In to FusionCompute?
- In the navigation pane of FusionCompute, click
. On the Monitoring page, choose .
- On the Real-Time Alarms page, check whether alarms exist. If alarms exist, click Clear to manually clear them.
Follow-up Procedure
If the parameters on the hardware server have changed, reset the alarm receiving parameters.
Software Management
This chapter describes the operations related to product software, including software package management, third-party patch management, and product software installation, upgrade, capacity expansion, and uninstallation.
Managing Software Packages
You can upload software packages required for installation and upgrade, or management of third-party patches, and delete unnecessary software packages if the disk space is insufficient.
Prerequisites
- The communication between your PC and the management node is normal.
- To upload a software package:
- You have obtained the software package and corresponding signature file to be uploaded.
- You have obtained the passwords for the sopuser and ossadm users of the management node.
- To delete a software package:
The software package to be deleted is not in the running state.
Precautions
- The software package to be uploaded must meet the following requirements. Otherwise, the upload fails.
- The software package name can contain a maximum of 128 characters, including letters, digits, underscores (_), hyphens (-), and dots (.).
- If the software package is in .zip, .7z, or .gz format, upload the signature file at the same time. The signature file can be in .asc, .cms, or .crl format.
- If the software package is in .tar format, you do not need to upload the signature file at the same time because the software package contains the corresponding signature file.
- If the software package size is less than 6 GB, the signature file size is less than 2 MB, and the network bandwidth between your PC and the management node is greater than 100 Mbit/s, you can upload software packages using the software management function on the management plane. If the preceding conditions are not met, upload the software packages in resumable transfer mode.
- During the scanning for software packages, do not restart any management node. Otherwise, the scanning may fail.
- In a DR scenario, the same software package and signature file must be uploaded to both the primary and secondary sites.
Procedure
Task |
Operation |
---|---|
(Recommended) Upload the software package to the management in resumable transfer mode. |
|
Use the software management function of the management plane to upload the software package. |
|
Delete a software package. |
|
Installing Product Software
Product software must be installed before you use EasySuite to install iMaster NCE-Campus. If you need to install product software manually, refer to the contents of this section.
Prerequisites
- You have logged in to management plane. For details, see Logging In to the Management Plane.
- The software package has been uploaded and scanned. For details, see Managing Software Packages.
Precautions
You are advised to retain the software installation package. The complete software installation package is required for patch installation.
Procedure
- On the management plane, choose from the main menu.
- On the Deploy Product Software page, click the product to be installed. The product details page is displayed.
- In the upper right corner of the page for product details, click Install. Perform operations as prompted.
Upgrading Product Software
After the product software upgrade package is uploaded to the management plane, you can upgrade the product. For details about the iMaster NCE-Campus upgrade scenarios and E2E operation guide, see Patch Installation Guide of the desired version. The upgrade function is not supported on the Product > Software Management > Deploy Product Software page of the iMaster NCE-Campus management plane.
Uninstalling Product Software
You can uninstall the product software that has been installed.
Prerequisites
You have logged in to the management plane. For details, see Logging In to the Management Plane.
In a DR scenario, the primary and secondary site products have been separated. For details, see Separating the Primary and Secondary Site Products.
Procedure
- On the management plane, choose from the main menu.
- On the Deploy Product Software page, click the product to be uninstalled. The product details page is displayed.
- In the upper right corner, click More, and choose Uninstall from the drop-down menu. Perform operations as prompted.
Follow-up Procedure
If you do not need to install the product again, delete the product information. For details, see Configuring Products.
Modifying Configuration Parameters of Product Software
You can use this function to modify the product configurations.
Prerequisites
You have logged in to the management plane. For details, see Logging In to the Management Plane.
In a DR scenario, the product DR relationship between the primary and secondary sites has been deleted. For details, see Separating the Primary and Secondary Site Products.
Precautions
Parameters on this page are installation and deployment procedure parameters and have been configured during installation. Do not change the parameters unless detailed modification scenarios and procedures are described in the documentation.
Procedure
- On the management plane, choose from the main menu.
- On the Deploy Product Software page, click the product with configurations to be modified. The page for product details is displayed.
- In the upper right corner, click More and choose Modify Configurations from the drop-down menu, and perform operations as prompted.
- In a DR scenario, re-establish the product DR relationship between the primary and secondary sites. For details, see Connecting the Primary and Secondary Site Products.
Managing Third-Party Patches
After a third-party patch package has been uploaded to the management plane, you can install the patch or roll back patch installation on nodes.
Context
The following types of patches are supported:
EulerOS
GaussDB 100 database
Prerequisites
- The third-party patch package has been uploaded. For details, see Managing Software Packages.
- You have logged in to the management plane. For details, see Logging In to the Management Plane.
- For the patch installation of the GaussDB 100 database, you have obtained the password for the dbuser user of the OS.
If there are multiple database nodes, the password must be the same on all the nodes. Otherwise, the patch installation fails.
- In a DR scenario, the primary and secondary site products have been separated. For details, see Separating the Primary and Secondary Site Products.
Precautions
- OS patch management
- Before the upgrade, you are advised to back up the OS of the node. If the upgrade fails, you can use the backup file to restore the OS to the normal state.
- Do not upgrade the OS of the management node and product nodes at the same time. Otherwise, the upgrade fails.
- During the upgrade, the OS is restarted and cannot be logged in and services are interrupted. You are advised to perform this operation in off-peak hours.
- Database patch management
- Before the upgrade, you are advised to back up the database. If the upgrade fails, you can use the backup file to restore the database to the normal state.
- During the upgrade, databases are restarted and services are interrupted. You are advised to perform operations in this section in off-peak hours.
- If the management node and the database node are the same node and use the same database software, database patch installation is not supported.
For details about how to check that the management node and database node are the same node and use the same database software, see How Do I Determine the Deployment Mode of Nodes? and How Do I Check Whether Management Nodes and Product Nodes Use the Same Database Software?
Procedure
- On the management plane, choose Product > Software Management > Manage Third-Party Software Patches from the main menu.
- On the Manage Third-Party Software Patches page, click the card corresponding to the product to be upgraded and perform operations as prompted.
- In a DR scenario, connect the primary and secondary site products. For details, see Connecting the Primary and Secondary Site Products.
Election Service Management
The election service offers the election function to provide service groups with one-active one-standby or one-active multi-standby capabilities. In this way, the active/standby switchover of services is implemented to improve reliability. For the service groups managed by the election service, you can view the active/standby status of the service groups on the election service page of the management plane, and switch the active and standby services manually.
Prerequisites
You have logged in to the management plane. For details, see Logging In to the Management Plane.
Context
- In the same service group, the service IDs are the same, and the node IDs and active/standby statuses are different.
- When a service is abnormal, the system automatically performs an active/standby switchover to ensure business continuity. In the following scenarios, you need to manually switch the active/standby status of services:
- The resource usage of the node where the active service resides is too high, and the system runs slowly.
- Multiple services in the same service group are active.
Procedure
- On the management plane, choose from the main menu.
- On the Election Service page, switch the active and standby services as prompted. If the active service becomes standby and the standby service becomes active, the active/standby switchover of services is successful.
Task Management
The task management function allows you to view the task details to check the task execution status or locate the cause of task execution failure.
Prerequisites
You have logged in to the management plane. For details, see Logging In to the Management Plane.
Context
Table 3-49 lists the types and description of tasks that you can view on the management plane.