I/O errors on an ext4 disk under Xen Server

I have a Xen Server 6.0 host with a 2 TB disk that is reporting I/O errors. My virtual machine runs CentOS 6.2 with an ext4 file system.

end_request: I/O error, dev xvdc, sector 896084224
end_request: I/O error, dev xvdc, sector 896084312
end_request: I/O error, dev xvdc, sector 896084400
end_request: I/O error, dev xvdc, sector 896084488
end_request: I/O error, dev xvdc, sector 896084576
end_request: I/O error, dev xvdc, sector 896084664
end_request: I/O error, dev xvdc, sector 896084752
end_request: I/O error, dev xvdc, sector 896084840
end_request: I/O error, dev xvdc, sector 896084928
end_request: I/O error, dev xvdc, sector 896085016
end_request: I/O error, dev xvdc, sector 896085104
end_request: I/O error, dev xvdc, sector 896085192
end_request: I/O error, dev xvdc, sector 896085280
end_request: I/O error, dev xvdc, sector 896085368
end_request: I/O error, dev xvdc, sector 896085456
end_request: I/O error, dev xvdc, sector 896085544
end_request: I/O error, dev xvdc, sector 896085632
end_request: I/O error, dev xvdc, sector 896085720
end_request: I/O error, dev xvdc, sector 896085808
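
To see whether the failing sectors form one contiguous region, the log lines above can be summarized with a short script. This is only a minimal sketch: the function names `parse_io_errors` and `summarize` are made up for illustration, and the 512-byte sector size is an assumption, not something the log states.

```python
import re

# Matches kernel lines such as:
#   end_request: I/O error, dev xvdc, sector 896084224
IO_ERROR_RE = re.compile(r"end_request: I/O error, dev (\w+), sector (\d+)")

SECTOR_SIZE = 512  # assumption: sector numbers are in 512-byte units


def parse_io_errors(log_text):
    """Return a list of (device, sector) tuples found in the log text."""
    return [(dev, int(sector)) for dev, sector in IO_ERROR_RE.findall(log_text)]


def summarize(errors):
    """Summarize the failing range: count, first/last sector, strides, offset."""
    sectors = sorted(sector for _, sector in errors)
    # Distinct gaps between consecutive failing sectors.
    strides = {b - a for a, b in zip(sectors, sectors[1:])}
    return {
        "count": len(sectors),
        "first_sector": sectors[0],
        "last_sector": sectors[-1],
        "strides": sorted(strides),
        "first_byte_offset": sectors[0] * SECTOR_SIZE,
    }


if __name__ == "__main__":
    sample = """\
end_request: I/O error, dev xvdc, sector 896084224
end_request: I/O error, dev xvdc, sector 896084312
end_request: I/O error, dev xvdc, sector 896084400
"""
    print(summarize(parse_io_errors(sample)))
```

In the log above, every error is exactly 88 sectors after the previous one, so the failures look like one sequential run of requests over a single region of xvdc rather than scattered bad spots.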

SMART check:

   smartctl version 5.38 [i686-redhat-linux-gnu] Copyright (C) 2002-8 Bruce Allen
Home page is http://smartmontools.sourceforge.net/

=== START OF READ SMART DATA SECTION ===
SMART Attributes Data Structure revision number: 10
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME          FLAG     VALUE WORST THRESH TYPE      UPDATED  WHEN_FAILED RAW_VALUE
  1 Raw_Read_Error_Rate     0x000f   114   099   006    Pre-fail  Always       -       65412104
  3 Spin_Up_Time            0x0003   093   093   000    Pre-fail  Always       -       0
  4 Start_Stop_Count        0x0032   100   100   020    Old_age   Always       -       23
  5 Reallocated_Sector_Ct   0x0033   100   100   036    Pre-fail  Always       -       0
  7 Seek_Error_Rate         0x000f   072   060   030    Pre-fail  Always       -       18633333
  9 Power_On_Hours          0x0032   094   094   000    Old_age   Always       -       5873
 10 Spin_Retry_Count        0x0013   100   100   097    Pre-fail  Always       -       0
 12 Power_Cycle_Count       0x0032   100   100   020    Old_age   Always       -       26
183 Unknown_Attribute       0x0032   100   100   000    Old_age   Always       -       0
184 Unknown_Attribute       0x0032   100   100   099    Old_age   Always       -       0
187 Reported_Uncorrect      0x0032   100   100   000    Old_age   Always       -       0
188 Unknown_Attribute       0x0032   100   097   000    Old_age   Always       -       3
189 High_Fly_Writes         0x003a   099   099   000    Old_age   Always       -       1
190 Airflow_Temperature_Cel 0x0022   066   063   045    Old_age   Always       -       34 (Lifetime Min/Max 30/35)
191 G-Sense_Error_Rate      0x0032   100   100   000    Old_age   Always       -       0
192 Power-Off_Retract_Count 0x0032   100   100   000    Old_age   Always       -       16
193 Load_Cycle_Count        0x0032   100   100   000    Old_age   Always       -       26
194 Temperature_Celsius     0x0022   034   040   000    Old_age   Always       -       34 (0 22 0 0)
195 Hardware_ECC_Recovered  0x001a   026   003   000    Old_age   Always       -       65412104
197 Current_Pending_Sector  0x0012   100   100   000    Old_age   Always       -       0
198 Offline_Uncorrectable   0x0010   100   100   000    Old_age   Offline      -       0
199 UDMA_CRC_Error_Count    0x003e   200   200   000    Old_age   Always       -       0
240 Head_Flying_Hours       0x0000   100   253   000    Old_age   Offline      -       64587718203122
241 Unknown_Attribute       0x0000   100   253   000    Old_age   Offline      -       3045564792
242 Unknown_Attribute       0x0000   100   253   000    Old_age   Offline      -       78354915

SMART Self-test log structure revision number 1
Num  Test_Description    Status                  Remaining  LifeTime(hours)  LBA_of_first_error
# 1  Extended offline    Self-test routine in progress 10%      5873    
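
A table like the one above can be screened for the attributes that most often accompany a failing disk. The following is a rough sketch, not a diagnosis tool: `parse_smart_attributes` and `flag_attributes` are hypothetical helper names, and the choice of attributes 5, 197 and 198 as "watched" raw counters is a common rule of thumb assumed here, not something stated in the question.

```python
# Assumption: a non-zero raw count on these IDs is worth a closer look
# (5 = reallocated, 197 = pending, 198 = offline uncorrectable sectors).
WATCHED_RAW_IDS = {5, 197, 198}


def parse_smart_attributes(text):
    """Parse attribute rows of a smartctl attribute table into dictionaries."""
    rows = []
    for line in text.splitlines():
        parts = line.split(None, 9)
        # Attribute rows start with a numeric ID and have 10 columns.
        if len(parts) < 10 or not parts[0].isdigit():
            continue
        try:
            rows.append({
                "id": int(parts[0]),
                "name": parts[1],
                "value": int(parts[3]),
                "worst": int(parts[4]),
                "thresh": int(parts[5]),
                "type": parts[6],
                "when_failed": parts[8],
                "raw": parts[9].strip(),
            })
        except ValueError:
            continue  # not an attribute row after all
    return rows


def flag_attributes(rows):
    """Return (name, reason) pairs for attributes that look suspicious."""
    flagged = []
    for row in rows:
        if row["thresh"] > 0 and row["value"] <= row["thresh"]:
            flagged.append((row["name"], "normalized value at or below threshold"))
        elif row["when_failed"] != "-":
            flagged.append((row["name"], "WHEN_FAILED is " + row["when_failed"]))
        elif row["id"] in WATCHED_RAW_IDS:
            try:
                raw_count = int(row["raw"].split()[0])
            except (ValueError, IndexError):
                continue
            if raw_count > 0:
                flagged.append((row["name"], "non-zero raw count"))
    return flagged
```

Passing the saved smartctl text to `parse_smart_attributes` and then to `flag_attributes` gives a list of attributes to look at first. An empty list does not prove the drive is healthy; it only means none of these particular heuristics fired.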

Is this a Xen bug?

Answer 1

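Make sure your backups are good. This kind of error usually means the drive is dying.

It could be a cable or controller problem, but in my experience it usually means the drive is about to leave you with a system that will not boot at all, preceded by sudden system freezes. That is especially likely if the system had been running fine for a good while before this.

Best case, you reseat the cable. Worst case, your backups are about to be tested.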