What Should I Do if a datarvg Exception Occurs and Cannot Be Brought Online During Force Active
Symptom
When force active was performed for an HA system in dual-active state, the datarvg resource on the active server became faulty, and the AppService resource group automatically went offline. After the datarvg fault is rectified, the datarvg resource cannot go online.
Possible Causes
During force active, the datarvg resource is automatically created for the active and standby servers. The creation will fail when an exception occurs.
Procedure
- Log in to the faulty node as the root user.
- Run the following command to adjust the monitoring parameters of datarvg:
haconf -makerw
hatype -modify RVG MonitorInterval 60
hatype -modify RVG ToleranceLimit 10
haconf -dump -makero
- Run the following command to rectify the datarvg fault:
hares -clear datarvg
- Run the following command to check the datarvg status:
vxprint -g datadg -Vl datarvg
The command output is similar to the following information:Rvg: datarvg info: rid=0.1140 version=0 rvg_version=45 last_tag=2 state: state=ACTIVE kernel=ENABLED assoc: datavols=lvdbdata,lvfiledata srl=srl rlinks=(none) exports=(none) vsets=(none) att: rlinks=(none) flags: closed primary enabled attached logging device: minor=9003 bdev=199/9003 cdev=199/9003 path=/dev/vx/dsk/datadg/datarvg perms: user=root group=root mode=0600
According to the information, the datavols and srl configurations of datarvg are normal. You can bring the datarvg resource and AppService resource group online and continue force active.
If datarvg is not found or datavols=(none) or srl=(none) is displayed, perform the following steps to restore the datarvg resource:
- Create the datarvg resource again.
- Run the following commands in sequence to clear the datarvg resource:
vxrlink -g datadg -f det datarlk
vxrlink -g datadg -f dis datarlk
vxvol -g datadg -f dis lvdbdata
vxvol -g datadg -f dis lvfiledata
vxvol -g datadg -f dis srl
vxedit -g datadg -rf rm datarlk
vxedit -g datadg -rf rm datarvg
- Run the following commands in sequence to create the datarvg resource:
vxmake -g datadg rvg datarvg primary=<true|false>
vxvol -g datadg aslog datarvg srl
vxrvg -g datadg start datarvg
vxvol -g datadg assoc datarvg lvdbdata
vxvol -g datadg assoc datarvg lvfiledata
To restore datarvg on the active server, use primary=true; to restore datarvg on the standby server, use primary=false.
If an error occurs during the creation of the datarvg resource, contact Huawei technical support.
- Run the following commands in sequence to clear the datarvg resource:
- Check the datarvg status. For details, see 4.
- Bring the datarvg and AppService resource groups online. For details, see "Bringing a Resource Online" in Maintenance Guide.
- For details about force active, see "Forcibly Setting the Local Server as the Active Server" in Maintenance Guide.
- After forcibly setting a server as the active server, run the following command to restore the monitoring parameters of datarvg:
haconf -makerw
hatype -modify RVG MonitorInterval 60
hatype -modify RVG ToleranceLimit 0
haconf -dump -makero