No relevant resource is found in the selected language.

This site uses cookies. By continuing to browse the site you are agreeing to our use of cookies. Read our privacy policy>Search

Reminder

To have a better experience, please upgrade your IE browser.

upgrade

How Do I Determine the Slow Disk Slot for the Server Big Data Service System?

Publication Date:  2019-04-23 Views:  20 Downloads:  0
Issue Description

The V3 servers are equipped with the LSI SAS2208, LSI SAS2308, LSI SAS3008, or LSI SAS3108 RAID controller card. You can use the drive letter to locate the hard disk slot on Linux in a big data service scenario.


Solution

2308/3008+linux

 

Background: Locate the slot of a slow hard disk in a big data service scenario.

(1) In the system:

             Run the df command to query the drive letter corresponding to the abnormal file system. (Upper-layer big data and confirmation of specific drive letters)



1. Querying the Serial Number of Hard Disks

1.1 Querying Device Serial Number Based on Smart Information

(1) In the system:

   Run smartctl –a/dev/sdb. (The smartctl file is required in the system. Generally, the file is installed by default during the Linux OS installation.)


The serial number of the hard disk corresponding to the sdb drive letter is 9XG50X1F.

 

Note: You can use the drive letter to query the hard disk serial number only in single-disk RAID 0 and disk pass-through scenarios. When multiple hard disks exist under one VD, multiple hard disks share one drive letter. In such a case, do not use the drive letter to query the device serial number.

1.2 Querying Slot Number Based on the Serial Number

(1) On the system:

a. Go to the \InfoCollect_Linux\modules\raid\RAIDtool\3008 directory where the tool is located, and run chmod +x sas* to grand the execute permission.

b. Run the ./ sas3ircu 0 display command.

c. In the command output, query the slot number by using the serial number obtained in step 1.



Note: You can also query the slot number by searching the raid folder in the collected log package. However, for servers (such as the X6800) that are configured with SoftRAID and a RAID controller card, the information about the RAID controller card may not exist in the log files. Therefore, using the preceding commands is more accurate.

2. Description of Special Conditions

Failure to Obtain the SMART Information

(1) In the system:

a. Check the messages log. The messages log shows that the sdm disk in slot 11 has task abort records.



b. On the OS, run lsscsi to view the hard disk information. In the displayed information, the first column shows the [H:C:D:L] numbers of the hard disks. Use the [H:C:D:L] number to query the disk slot number.



Check method:

For example, if the [H:C:D:L] number is [0:0:11:0], the meanings of the numbers are as follows:

H: indicates the HBA number. For RAID controller cards, 0 indicates an onboard RAID controller card. If the system has only one RAID controller card, the value of H is 0. C: indicates the channel number. The default value is 0. You can ignore this number.

D: indicates the device number. If a RAID controller card is used, the value indicates the VD number. For [0:0:11:0], view the VD 11 information of the RAID controller card. The VD 11 information shows that the slot number is 11.


The sda disk is a RAID 1 array using slot 0 and slot 1, and the [H:C:D:L] number is [0:2:0:0]. The sda disk is the boot partition. The [H:C:D:L] number of the sdb disk is [0:2:2:0]. The sdb disk is in slot 2.

In single-disk RAID 0 scenarios, one slot is used by one VD. Therefore, you can use this method to query the slot number.

In the hard disk pass-through scenario where an LSI SAS3008 RAID controller card is used, the value of the device number is the slot number. For example, if the [H:C:D:L] value is [0:0:11:0], the hard disk is located in slot 11.

L: indicates the LUN number, which is the number of the channel between the local host and the SCSI device. LUN is not used in the local storage, and the default value is 0.

 

2208/3108+linux


3. Querying the Serial Number of Hard Disks


3.1 Querying Device Serial Number Based on Smart Information

(1) On the system:

   Run smartctl –a/dev/sdb. (The smartctl file is required in the system. Generally, the file is installed by default during the Linux OS installation.)


The serial number of the hard disk corresponding to the sdb drive letter is 9XG50X1F.

 

You can use the drive letter to query the hard disk serial number only in single-disk RAID 0 and JBOD scenarios. When multiple hard disks exist under one VD, multiple hard disks correspond to one drive letter. In such scenario, do not use the drive letter to query the device serial number.

3.2 Querying Slot Number Based on the Serial Number

a. Use the StorCLI tool. If the tool is unavailable, run chmod +x storcli to grand the execute permission.

b. Run storcli64 -PDList -aALL.

c. In the command output, query the slot number by using the serial number obtained in 2.1.



Note: You can also query the slot number by searching the raid folder in the collected log package. However, for servers (such as the X6800) that are configured with SoftRAID and a RAID controller card, the information about the RAID controller card may not exist in the log files. Therefore, using the preceding commands is more accurate.

 







END