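I recently bought five used SAS hard drives, each with about 35 days of use. Compared with buying new drives it was a great deal, but I initially assumed they were SATA, which caused some confusion. After some research, I bought a RAID controller to use as an HBA, a PERC H710. However, after installing everything, I noticed that these SAS drives make a strange clicking noise when idle. I figure it is unlikely that five well-packaged disks would all have the same fault, especially given the seller's good reputation. I recorded the noise for reference:
Near the server: https://www.youtube.com/shorts/DFqMGDFCObU
In another room (on the other side of the wall): https://www.youtube.com/shorts/4jHKJnIhp2Q
P.S. It may be hard to hear clearly in the videos, especially while some SATA drives are active. The server is in production, so the other SATA drives are continuously reading and writing.
Here is the information for one of the drives: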
=== START OF INFORMATION SECTION ===
Vendor: SEAGATE
Product: ST33000650SS
Revision: RS16
Compliance: SPC-4
User Capacity: 3,000,592,982,016 bytes [3.00 TB]
Logical block size: 512 bytes
Rotation Rate: 7200 rpm
Form Factor: 3.5 inches
Logical Unit id: 0x5000c50055ad05e3
Serial number: Z295A8NC
Device type: disk
Transport protocol: SAS (SPL-3)
Local Time is: Wed Feb 21 17:40:10 2024 EST
SMART support is: Available - device has SMART capability.
SMART support is: Enabled
Temperature Warning: Disabled or Not Supported
=== START OF READ SMART DATA SECTION ===
SMART Health Status: OK
Current Drive Temperature: 48 C
Drive Trip Temperature: 68 C
Accumulated power on time, hours:minutes 63899:38
Manufactured in week 43 of year 2012
Specified cycle count over device lifetime: 10000
Accumulated start-stop cycles: 70
Specified load-unload count over device lifetime: 300000
Accumulated load-unload cycles: 12281
Elements in grown defect list: 0
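Has anyone run into a similar issue with SAS HDDs before? Is this normal for SAS, or should I run some tests? Any advice would be greatly appreciated!
Answer 1
After running the following command and waiting for it to finish, we got "total corrected read errors = 1809673244", and the drives stopped making these noises.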
smartctl -tlong /dev/sdX
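If you want to script this step, a minimal sketch might look like the following. It assumes the self-test log printed by `smartctl -l selftest` marks an unfinished test with the words "in progress" (check this against your own drive's output), and the function names `selftest_running` and `run_long_test` are made up for illustration. The output further down rates the long test on this drive at 27600 seconds (460 minutes), so polling every 10 minutes is frequent enough.

```shell
# selftest_running: reads `smartctl -l selftest` output on stdin and
# succeeds (exit 0) if any line still reports a test in progress.
# Assumption: unfinished tests are labelled with "in progress".
selftest_running() {
    grep -qi 'in progress'
}

# run_long_test DEVICE: starts a long self-test, waits until the log no
# longer shows a test in progress, then prints the full report.
run_long_test() {
    dev="$1"
    smartctl -t long "$dev" || return 1
    # Poll every 10 minutes while a test is still running.
    while smartctl -l selftest "$dev" | selftest_running; do
        sleep 600
    done
    smartctl -a "$dev"
}
```

Run it as root with `run_long_test /dev/sdX`, substituting the real device node.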
The output after completion is as follows:
root@abc:~# smartctl -a /dev/sda
smartctl 7.2 2020-12-30 r5155 [x86_64-linux-5.10.0-27-amd64] (local build)
Copyright (C) 2002-20, Bruce Allen, Christian Franke, www.smartmontools.org
=== START OF INFORMATION SECTION ===
Vendor: SEAGATE
Product: ST33000650SS
Revision: RS16
Compliance: SPC-4
User Capacity: 3,000,592,982,016 bytes [3.00 TB]
Logical block size: 512 bytes
Rotation Rate: 7200 rpm
Form Factor: 3.5 inches
Logical Unit id: 0x5000c50055ad05e3
Serial number: Z295A8NC
Device type: disk
Transport protocol: SAS (SPL-3)
Local Time is: Thu Feb 22 09:40:48 2024 EST
SMART support is: Available - device has SMART capability.
SMART support is: Enabled
Temperature Warning: Disabled or Not Supported
=== START OF READ SMART DATA SECTION ===
SMART Health Status: OK
Current Drive Temperature: 43 C
Drive Trip Temperature: 68 C
Accumulated power on time, hours:minutes 63915:46
Manufactured in week 43 of year 2012
Specified cycle count over device lifetime: 10000
Accumulated start-stop cycles: 70
Specified load-unload count over device lifetime: 300000
Accumulated load-unload cycles: 12283
Elements in grown defect list: 0
Vendor (Seagate Cache) information
Blocks sent to initiator = 3623441469
Blocks received from initiator = 3686219937
Blocks read from cache and sent to initiator = 272821687
Number of read and write commands whose size <= segment size = 36308955
Number of read and write commands whose size > segment size = 15
Vendor (Seagate/Hitachi) factory information
number of hours powered up = 63915.77
number of minutes until next internal SMART test = 55
Error counter log:
           Errors Corrected by           Total   Correction     Gigabytes    Total
               ECC          rereads/    errors   algorithm      processed    uncorrected
           fast | delayed   rewrites  corrected  invocations   [10^9 bytes]  errors
read:   1809673244        0         0  1809673244          0      30248.740           0
write:           0        0         0           0          0       8554.361           0
verify: 2794696942        0         0  2794696942          0    1042801.370           0
Non-medium error count: 59813
SMART Self-test log
Num  Test              Status                 segment  LifeTime  LBA_first_err [SK ASC ASQ]
     Description                              number   (hours)
# 1  Background long   Completed                  32     63907                 - [-   -    -]
# 2  Background short  Completed                  32     63890                 - [-   -    -]
# 3  Background short  Completed                  32        23                 - [-   -    -]
# 4  Background long   Completed                  32        22                 - [-   -    -]
Long (extended) Self-test duration: 27600 seconds [460.0 minutes]
root@abc:~#
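To pull the figure quoted above out of the report without reading the whole table, something like this rough sketch could be used. It assumes the error counter rows have the eight whitespace-separated fields shown in the output above (label, fast, delayed, rereads/rewrites, total corrected, algorithm invocations, gigabytes, total uncorrected); `error_summary` is a hypothetical helper name, not part of smartmontools.

```shell
# error_summary: reads `smartctl -a` output on stdin and prints, for each
# row of the error counter log, the operation name, the total corrected
# errors (5th field) and the total uncorrected errors (8th field).
error_summary() {
    awk '$1 ~ /^(read|write|verify):$/ && NF == 8 {
        sub(":", "", $1)   # drop the trailing colon from the row label
        printf "%s corrected=%s uncorrected=%s\n", $1, $5, $8
    }'
}
```

For example, `smartctl -a /dev/sda | error_summary` on the report above would print `read corrected=1809673244 uncorrected=0`, `write corrected=0 uncorrected=0` and `verify corrected=2794696942 uncorrected=0`, one per line.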
I hope this helps anyone who runs into the same problem!