启动时出现 I/O 错误

启动时出现 I/O 错误

我有时在启动过程中会遇到错误:

I/O Error, dev sda, sector xxxxxxx 

有时fsck可以帮助我启动我的Ubuntu 18 system。但是,几天后问题又出现了。

我尝试将其清零,然后scan and remap使用 mhdd 磁盘。扫描表明没有错误扇区。但是,问题仍然存在。

目前,GSmartControl显示:

ID# ATTRIBUTE_NAME          FLAG     VALUE WORST THRESH TYPE      UPDATED  WHEN_FAILED RAW_VALUE
  1 Raw_Read_Error_Rate     0x002f   100   100   051    Pre-fail  Always       -       667
  2 Throughput_Performance  0x0026   252   252   000    Old_age   Always       -       0
  3 Spin_Up_Time            0x0023   089   089   025    Pre-fail  Always       -       3463
  4 Start_Stop_Count        0x0032   096   096   000    Old_age   Always       -       4426
  5 Reallocated_Sector_Ct   0x0033   252   252   010    Pre-fail  Always       -       0
  7 Seek_Error_Rate         0x002e   252   252   051    Old_age   Always       -       0
  8 Seek_Time_Performance   0x0024   252   252   015    Old_age   Offline      -       0
  9 Power_On_Hours          0x0032   100   100   000    Old_age   Always       -       20258
 10 Spin_Retry_Count        0x0032   252   252   051    Old_age   Always       -       0
 12 Power_Cycle_Count       0x0032   096   096   000    Old_age   Always       -       4246
191 G-Sense_Error_Rate      0x0022   100   100   000    Old_age   Always       -       457
192 Power-Off_Retract_Count 0x0022   252   252   000    Old_age   Always       -       0
194 Temperature_Celsius     0x0002   059   051   000    Old_age   Always       -       41 (Min/Max 17/49)
195 Hardware_ECC_Recovered  0x003a   100   100   000    Old_age   Always       -       0
196 Reallocated_Event_Count 0x0032   252   252   000    Old_age   Always       -       0
197 Current_Pending_Sector  0x0032   100   100   000    Old_age   Always       -       1
198 Offline_Uncorrectable   0x0030   252   252   000    Old_age   Offline      -       0
199 UDMA_CRC_Error_Count    0x0036   100   100   000    Old_age   Always       -       1
200 Multi_Zone_Error_Rate   0x002a   001   001   000    Old_age   Always       -       60844
223 Load_Retry_Count        0x0032   100   100   000    Old_age   Always       -       311
225 Load_Cycle_Count        0x0032   081   081   000    Old_age   Always       -       197746



Complete error log:

SMART Extended Comprehensive Error Log Version: 1 (2 sectors)
Device Error Count: 227 (device log contains only the most recent 8 errors)
    CR     = Command Register
    FEATR  = Features Register
    COUNT  = Count (was: Sector Count) Register
    LBA_48 = Upper bytes of LBA High/Mid/Low Registers ]  ATA-8
    LH     = LBA High (was: Cylinder High) Register    ]   LBA
    LM     = LBA Mid (was: Cylinder Low) Register      ] Register
    LL     = LBA Low (was: Sector Number) Register     ]
    DV     = Device (was: Device/Head) Register
    DC     = Device Control Register
    ER     = Error register
    ST     = Status register
Powered_Up_Time is measured from power on, and printed as
DDd+hh:mm:SS.sss where DD=days, hh=hours, mm=minutes,
SS=sec, and sss=millisec. It "wraps" after 49.710 days.

Error 227 [2] occurred at disk power-on lifetime: 20256 hours (844 days + 0 hours)
  When the command that caused the error occurred, the device was active or idle.

  After command completion occurred, registers were:
  ER -- ST COUNT  LBA_48  LH LM LL DV DC
  -- -- -- == -- == == == -- -- -- -- --
  40 -- 51 00 08 00 00 3a d7 98 68 e0 00  Error: UNC 8 sectors at LBA = 0x3ad79868 = 987207784

  Commands leading to the command that caused the error were:
  CR FEATR COUNT  LBA_48  LH LM LL DV DC  Powered_Up_Time  Command/Feature_Name
  -- == -- == -- == == == -- -- -- -- --  ---------------  --------------------
  25 00 00 00 10 00 00 3a d7 98 60 e0 08     00:00:00.060  READ DMA EXT
  25 00 00 00 10 00 00 3a d7 98 40 e0 08     00:00:00.060  READ DMA EXT
  25 00 00 00 20 00 00 3a d7 97 e0 e0 08     00:00:00.060  READ DMA EXT
  25 00 00 09 08 00 00 3b 57 89 00 e0 08     00:00:00.060  READ DMA EXT
  25 00 00 00 08 00 00 3b 17 9d c0 e0 08     00:00:00.060  READ DMA EXT

Error 226 [1] occurred at disk power-on lifetime: 20256 hours (844 days + 0 hours)
  When the command that caused the error occurred, the device was active or idle.

  After command completion occurred, registers were:
  ER -- ST COUNT  LBA_48  LH LM LL DV DC
  -- -- -- == -- == == == -- -- -- -- --
  40 -- 41 00 08 00 00 3b 16 88 80 40 00  Error: UNC at LBA = 0x3b168880 = 991332480

  Commands leading to the command that caused the error were:
  CR FEATR COUNT  LBA_48  LH LM LL DV DC  Powered_Up_Time  Command/Feature_Name
  -- == -- == -- == == == -- -- -- -- --  ---------------  --------------------
  60 00 10 00 08 00 00 3b 16 88 80 40 08     00:00:00.708  READ FPDMA QUEUED
  60 00 10 00 08 00 00 3b 16 88 80 40 08     00:00:00.763  READ FPDMA QUEUED
  ef 00 10 00 02 00 00 00 00 00 00 a0 08     00:00:00.763  SET FEATURES [Enable SATA feature]
  27 00 00 00 00 00 00 00 00 00 00 e0 08     00:00:00.763  READ NATIVE MAX ADDRESS EXT [OBS-ACS-3]
  ec 00 00 00 00 00 00 00 00 00 00 a0 08     00:00:00.763  IDENTIFY DEVICE

Error 225 [0] occurred at disk power-on lifetime: 20256 hours (844 days + 0 hours)
  When the command that caused the error occurred, the device was active or idle.

  After command completion occurred, registers were:
  ER -- ST COUNT  LBA_48  LH LM LL DV DC
  -- -- -- == -- == == == -- -- -- -- --
  40 -- 41 00 08 00 00 3b 16 88 80 40 00  Error: UNC at LBA = 0x3b168880 = 991332480

  Commands leading to the command that caused the error were:
  CR FEATR COUNT  LBA_48  LH LM LL DV DC  Powered_Up_Time  Command/Feature_Name
  -- == -- == -- == == == -- -- -- -- --  ---------------  --------------------
  60 00 10 00 08 00 00 3b 16 88 80 40 08     00:00:00.708  READ FPDMA QUEUED
  60 00 10 00 08 00 00 3b 16 88 80 40 08     00:00:00.760  READ FPDMA QUEUED
  61 00 10 00 08 00 00 1a 17 ab 00 40 08     00:00:00.760  WRITE FPDMA QUEUED
  61 00 10 00 08 00 00 1a 16 aa f8 40 08     00:00:00.760  WRITE FPDMA QUEUED
  61 00 10 00 08 00 00 52 16 8a d8 40 08     00:00:00.760  WRITE FPDMA QUEUED

Error 224 [7] occurred at disk power-on lifetime: 20256 hours (844 days + 0 hours)
  When the command that caused the error occurred, the device was active or idle.

  After command completion occurred, registers were:
  ER -- ST COUNT  LBA_48  LH LM LL DV DC
  -- -- -- == -- == == == -- -- -- -- --
  40 -- 41 00 08 00 00 3a d6 88 80 40 00  Error: UNC at LBA = 0x3ad68880 = 987138176

  Commands leading to the command that caused the error were:
  CR FEATR COUNT  LBA_48  LH LM LL DV DC  Powered_Up_Time  Command/Feature_Name
  -- == -- == -- == == == -- -- -- -- --  ---------------  --------------------
  60 00 10 00 08 00 00 3a d6 88 80 40 08     00:00:00.708  READ FPDMA QUEUED
  60 00 10 00 08 00 00 3a d6 88 80 40 08     00:00:00.758  READ FPDMA QUEUED
  ef 00 10 00 02 00 00 00 00 00 00 a0 08     00:00:00.758  SET FEATURES [Enable SATA feature]
  27 00 00 00 00 00 00 00 00 00 00 e0 08     00:00:00.758  READ NATIVE MAX ADDRESS EXT [OBS-ACS-3]
  ec 00 00 00 00 00 00 00 00 00 00 a0 08     00:00:00.758  IDENTIFY DEVICE

Error 223 [6] occurred at disk power-on lifetime: 20256 hours (844 days + 0 hours)
  When the command that caused the error occurred, the device was active or idle.

  After command completion occurred, registers were:
  ER -- ST COUNT  LBA_48  LH LM LL DV DC
  -- -- -- == -- == == == -- -- -- -- --
  40 -- 41 00 08 00 00 3a d6 88 80 40 00  Error: UNC at LBA = 0x3ad68880 = 987138176

  Commands leading to the command that caused the error were:
  CR FEATR COUNT  LBA_48  LH LM LL DV DC  Powered_Up_Time  Command/Feature_Name
  -- == -- == -- == == == -- -- -- -- --  ---------------  --------------------
  60 00 10 00 08 00 00 3a d6 88 80 40 08     00:00:00.708  READ FPDMA QUEUED
  60 00 10 00 08 00 00 3a d6 88 80 40 08     00:00:00.756  READ FPDMA QUEUED
  61 00 10 00 08 00 00 3b 16 8a c8 40 08     00:00:00.737  WRITE FPDMA QUEUED
  61 00 10 00 08 00 00 3b 16 8a b8 40 08     00:00:00.737  WRITE FPDMA QUEUED
  61 00 10 00 10 00 00 3b 16 8a 70 40 08     00:00:00.737  WRITE FPDMA QUEUED

Error 222 [5] occurred at disk power-on lifetime: 20256 hours (844 days + 0 hours)
  When the command that caused the error occurred, the device was active or idle.

  After command completion occurred, registers were:
  ER -- ST COUNT  LBA_48  LH LM LL DV DC
  -- -- -- == -- == == == -- -- -- -- --
  40 -- 41 00 08 00 00 3b 16 88 80 40 00  Error: UNC at LBA = 0x3b168880 = 991332480

  Commands leading to the command that caused the error were:
  CR FEATR COUNT  LBA_48  LH LM LL DV DC  Powered_Up_Time  Command/Feature_Name
  -- == -- == -- == == == -- -- -- -- --  ---------------  --------------------
  60 00 10 00 08 00 00 3b 16 88 80 40 08     00:00:00.671  READ FPDMA QUEUED
  61 00 10 00 08 00 00 51 d7 8b f8 40 08     00:00:00.706  WRITE FPDMA QUEUED
  61 00 10 00 08 00 00 51 17 89 88 40 08     00:00:00.706  WRITE FPDMA QUEUED
  61 00 10 00 08 00 00 51 17 89 38 40 08     00:00:00.706  WRITE FPDMA QUEUED
  61 00 10 00 18 00 00 51 17 89 18 40 08     00:00:00.706  WRITE FPDMA QUEUED

Error 221 [4] occurred at disk power-on lifetime: 20256 hours (844 days + 0 hours)
  When the command that caused the error occurred, the device was active or idle.

  After command completion occurred, registers were:
  ER -- ST COUNT  LBA_48  LH LM LL DV DC
  -- -- -- == -- == == == -- -- -- -- --
  40 -- 41 00 08 00 00 3a d6 88 80 40 00  Error: UNC at LBA = 0x3ad68880 = 987138176

  Commands leading to the command that caused the error were:
  CR FEATR COUNT  LBA_48  LH LM LL DV DC  Powered_Up_Time  Command/Feature_Name
  -- == -- == -- == == == -- -- -- -- --  ---------------  --------------------
  60 00 10 00 08 00 00 3a d6 88 80 40 08     00:00:00.671  READ FPDMA QUEUED
  61 00 10 00 08 00 00 39 16 94 c0 40 00     00:00:00.704  WRITE FPDMA QUEUED
  61 00 10 00 08 00 00 37 97 a1 78 40 00     00:00:00.704  WRITE FPDMA QUEUED
  61 00 10 00 08 00 00 37 96 8d b0 40 00     00:00:00.704  WRITE FPDMA QUEUED
  60 00 10 00 10 00 00 3d d6 88 00 40 00     00:00:00.704  READ FPDMA QUEUED

Error 220 [3] occurred at disk power-on lifetime: 20256 hours (844 days + 0 hours)
  When the command that caused the error occurred, the device was active or idle.

  After command completion occurred, registers were:
  ER -- ST COUNT  LBA_48  LH LM LL DV DC
  -- -- -- == -- == == == -- -- -- -- --
  40 -- 41 00 08 00 00 3a d6 88 80 40 00  Error: UNC at LBA = 0x3ad68880 = 987138176

  Commands leading to the command that caused the error were:
  CR FEATR COUNT  LBA_48  LH LM LL DV DC  Powered_Up_Time  Command/Feature_Name
  -- == -- == -- == == == -- -- -- -- --  ---------------  --------------------
  60 00 10 00 08 00 00 3d 96 88 80 40 08     00:00:00.673  READ FPDMA QUEUED
  60 00 10 00 08 00 00 3a d6 88 80 40 08     00:00:00.670  READ FPDMA QUEUED
  60 00 10 00 10 00 00 3d 96 88 00 40 08     00:00:00.671  READ FPDMA QUEUED
  60 00 10 00 08 00 00 3d 56 88 80 40 08     00:00:00.671  READ FPDMA QUEUED
  60 00 10 00 10 00 00 3d 56 88 00 40 08     00:00:00.671  READ FPDMA QUEUED
  1. 重复错误的原因是什么?
  2. 我估计磁盘有问题。我可以修复它吗?如何确定我应该更换磁盘(我的意思是 - 这不是由 SATA 插槽、SATA 电缆、主板引起的)。

提前致谢 :)

答案1

这确实表明您的驱动器有问题,但并不一定表明驱动器出现故障并需要更换。这实际上是说驱动器已发现问题并将尝试纠正它。

如果成功……那就开心了。如果没有,那么你就需要更换驱动器了。你应该担心的是离线不可纠正值。如果该值不是 0,那么是时候更换驱动器了。

此外,如果你正在使用 mdraid,请检查它是否降低了阵列的性能,因为这可能会导致你出现启动问题

tl;dr 别担心。如果它升级为离线不可纠正,那就担心吧。如果你有突袭,那么你可以高枕无忧了。

相关内容