我有一台 Scientific Linux 6.5 服务器,该服务器带有一个带 SSD 磁盘的 RAID。我在 dmesg 文件中看到几个错误,但 RAID 控制器(LSI 控制器)的实用程序没有提供任何警报或错误。
服务器是 Supermico D20-4x-M4,使用 Infiniband 直接连接到带有 SSD RAID 的存储服务器。控制器是 LSI MegaRaid SAS 9286CV-8e。磁盘是 SAMSUNG MZ7WD120。
dmesg文件的错误如下:
# dmesg
program smartctl is using a deprecated SCSI ioctl, please convert it to SG_IO
program smartctl is using a deprecated SCSI ioctl, please convert it to SG_IO
program smartctl is using a deprecated SCSI ioctl, please convert it to SG_IO
program smartctl is using a deprecated SCSI ioctl, please convert it to SG_IO
__ratelimit: 19 callbacks suppressed
sd 1:0:0:0: [sdb] Unhandled error code
sd 1:0:0:0: [sdb] Result: hostbyte=DID_BAD_TARGET driverbyte=DRIVER_OK
sd 1:0:0:0: [sdb] CDB: Read(10): 28 00 00 00 00 00 00 00 08 00
__ratelimit: 19 callbacks suppressed
__ratelimit: 19 callbacks suppressed
Buffer I/O error on device sdb, logical block 0
sd 1:0:0:0: [sdb] Unhandled error code
sd 1:0:0:0: [sdb] Result: hostbyte=DID_BAD_TARGET driverbyte=DRIVER_OK
sd 1:0:0:0: [sdb] CDB: Read(10): 28 00 00 00 00 00 00 00 08 00
Buffer I/O error on device sdb, logical block 0
sd 1:0:0:0: [sdb] Unhandled error code
sd 1:0:0:0: [sdb] Result: hostbyte=DID_BAD_TARGET driverbyte=DRIVER_OK
sd 1:0:0:0: [sdb] CDB: Read(10): 28 00 00 00 00 00 00 00 08 00
Buffer I/O error on device sdb, logical block 0
sd 1:0:0:0: [sdb] Unhandled error code
sd 1:0:0:0: [sdb] Result: hostbyte=DID_BAD_TARGET driverbyte=DRIVER_OK
sd 1:0:0:0: [sdb] CDB: Read(10): 28 00 00 00 00 00 00 00 08 00
Buffer I/O error on device sdb, logical block 0
sd 1:0:0:0: [sdb] Unhandled error code
sd 1:0:0:0: [sdb] Result: hostbyte=DID_BAD_TARGET driverbyte=DRIVER_OK
sd 1:0:0:0: [sdb] CDB: Read(10): 28 00 00 00 00 00 00 00 08 00
Buffer I/O error on device sdb, logical block 0
sd 1:0:0:0: [sdb] Unhandled error code
sd 1:0:0:0: [sdb] Result: hostbyte=DID_BAD_TARGET driverbyte=DRIVER_OK
sd 1:0:0:0: [sdb] CDB: Read(10): 28 00 00 00 00 00 00 00 08 00
Buffer I/O error on device sdb, logical block 0
sd 1:0:0:0: [sdb] Unhandled error code
sd 1:0:0:0: [sdb] Result: hostbyte=DID_BAD_TARGET driverbyte=DRIVER_OK
sd 1:0:0:0: [sdb] CDB: Read(10): 28 00 00 00 00 00 00 00 08 00
Buffer I/O error on device sdb, logical block 0
sd 1:0:0:0: [sdb] Unhandled error code
sd 1:0:0:0: [sdb] Result: hostbyte=DID_BAD_TARGET driverbyte=DRIVER_OK
sd 1:0:0:0: [sdb] CDB: Read(10): 28 00 00 00 00 00 00 00 08 00
Buffer I/O error on device sdb, logical block 0
sd 1:0:0:0: [sdb] Unhandled error code
sd 1:0:0:0: [sdb] Result: hostbyte=DID_BAD_TARGET driverbyte=DRIVER_OK
sd 1:0:0:0: [sdb] CDB: Read(10): 28 00 00 00 00 00 00 00 08 00
Buffer I/O error on device sdb, logical block 0
sd 1:0:0:0: [sdb] Unhandled error code
sd 1:0:0:0: [sdb] Result: hostbyte=DID_BAD_TARGET driverbyte=DRIVER_OK
sd 1:0:0:0: [sdb] CDB: Read(10): 28 00 00 00 00 08 00 00 08 00
Buffer I/O error on device sdb, logical block 1
end_request: I/O error, dev sdb, sector 8
end_request: I/O error, dev sdb, sector 8
end_request: I/O error, dev sdb, sector 8
end_request: I/O error, dev sdb, sector 8
end_request: I/O error, dev sdb, sector 8
end_request: I/O error, dev sdb, sector 8
end_request: I/O error, dev sdb, sector 8
end_request: I/O error, dev sdb, sector 0
end_request: I/O error, dev sdb, sector 0
end_request: I/O error, dev sdb, sector 0
end_request: I/O error, dev sdb, sector 234441640
end_request: I/O error, dev sdb, sector 0
end_request: I/O error, dev sdb, sector 0
end_request: I/O error, dev sdb, sector 0
end_request: I/O error, dev sdb, sector 0
end_request: I/O error, dev sdb, sector 0
end_request: I/O error, dev sdb, sector 0
end_request: I/O error, dev sdb, sector 234441640
end_request: I/O error, dev sdb, sector 0
我也尝试过:
smartctl /dev/sdb -a -T permissive
结果如下:
smartctl 5.43 2012-06-30 r3573 [x86_64-linux-2.6.32-431.20.3.el6.x86_64] (local build)
Copyright (C) 2002-12 by Bruce Allen, http://smartmontools.sourceforge.net
Vendor: /1:0:0:0
Product:
User Capacity: 600,332,565,813,390,450 bytes [600 PB]
Logical block size: 774843950 bytes
>> Terminate command early due to bad response to IEC mode page
Error Counter logging not supported
Device does not support Self Test logging
-
# lsscsi
[0:0:0:0] disk ATA SAMSUNG MZ7WD120 DXM8 /dev/sda
[1:0:0:0] disk ATA SAMSUNG MZ7WD120 DXM8 /dev/sdb
[6:0:32:0] enclosu LSI SAS2X36 0e0b -
[6:0:33:0] enclosu LSI SAS2X36 0e0b -
[6:2:0:0] disk LSI MR9286CV-8e 3.40 /dev/sdc
提前致谢。