我有一台 Xen Server 6.0 和一个磁盘(2TB)报告 I/O 错误,我的虚拟机是 Centos 6.2(文件系统 ext4)
end_request: I/O error, dev xvdc, sector 896084224
end_request: I/O error, dev xvdc, sector 896084312
end_request: I/O error, dev xvdc, sector 896084400
end_request: I/O error, dev xvdc, sector 896084488
end_request: I/O error, dev xvdc, sector 896084576
end_request: I/O error, dev xvdc, sector 896084664
end_request: I/O error, dev xvdc, sector 896084752
end_request: I/O error, dev xvdc, sector 896084840
end_request: I/O error, dev xvdc, sector 896084928
end_request: I/O error, dev xvdc, sector 896085016
end_request: I/O error, dev xvdc, sector 896085104
end_request: I/O error, dev xvdc, sector 896085192
end_request: I/O error, dev xvdc, sector 896085280
end_request: I/O error, dev xvdc, sector 896085368
end_request: I/O error, dev xvdc, sector 896085456
end_request: I/O error, dev xvdc, sector 896085544
end_request: I/O error, dev xvdc, sector 896085632
end_request: I/O error, dev xvdc, sector 896085720
end_request: I/O error, dev xvdc, sector 896085808
智能检查:
smartctl version 5.38 [i686-redhat-linux-gnu] Copyright (C) 2002-8 Bruce Allen
Home page is http://smartmontools.sourceforge.net/
=== START OF READ SMART DATA SECTION ===
SMART Attributes Data Structure revision number: 10
Vendor Specific SMART Attributes with Thresholds:
ID# ATTRIBUTE_NAME FLAG VALUE WORST THRESH TYPE UPDATED WHEN_FAILED RAW_VALUE
1 Raw_Read_Error_Rate 0x000f 114 099 006 Pre-fail Always - 65412104
3 Spin_Up_Time 0x0003 093 093 000 Pre-fail Always - 0
4 Start_Stop_Count 0x0032 100 100 020 Old_age Always - 23
5 Reallocated_Sector_Ct 0x0033 100 100 036 Pre-fail Always - 0
7 Seek_Error_Rate 0x000f 072 060 030 Pre-fail Always - 18633333
9 Power_On_Hours 0x0032 094 094 000 Old_age Always - 5873
10 Spin_Retry_Count 0x0013 100 100 097 Pre-fail Always - 0
12 Power_Cycle_Count 0x0032 100 100 020 Old_age Always - 26
183 Unknown_Attribute 0x0032 100 100 000 Old_age Always - 0
184 Unknown_Attribute 0x0032 100 100 099 Old_age Always - 0
187 Reported_Uncorrect 0x0032 100 100 000 Old_age Always - 0
188 Unknown_Attribute 0x0032 100 097 000 Old_age Always - 3
189 High_Fly_Writes 0x003a 099 099 000 Old_age Always - 1
190 Airflow_Temperature_Cel 0x0022 066 063 045 Old_age Always - 34 (Lifetime Min/Max 30/35)
191 G-Sense_Error_Rate 0x0032 100 100 000 Old_age Always - 0
192 Power-Off_Retract_Count 0x0032 100 100 000 Old_age Always - 16
193 Load_Cycle_Count 0x0032 100 100 000 Old_age Always - 26
194 Temperature_Celsius 0x0022 034 040 000 Old_age Always - 34 (0 22 0 0)
195 Hardware_ECC_Recovered 0x001a 026 003 000 Old_age Always - 65412104
197 Current_Pending_Sector 0x0012 100 100 000 Old_age Always - 0
198 Offline_Uncorrectable 0x0010 100 100 000 Old_age Offline - 0
199 UDMA_CRC_Error_Count 0x003e 200 200 000 Old_age Always - 0
240 Head_Flying_Hours 0x0000 100 253 000 Old_age Offline - 64587718203122
241 Unknown_Attribute 0x0000 100 253 000 Old_age Offline - 3045564792
242 Unknown_Attribute 0x0000 100 253 000 Old_age Offline - 78354915
SMART Self-test log structure revision number 1
Num Test_Description Status Remaining LifeTime(hours) LBA_of_first_error
# 1 Extended offline Self-test routine in progress 10% 5873
这是 xen 的 bug 吗?
答案1
确保你的备份完好无损。这种错误通常意味着你的驱动器已经坏了。
有可能是电缆或控制器出了问题,但我发现通常这意味着驱动器正准备让您陷入完全无法启动的情况,之前会突然出现系统冻结。尤其是如果系统在此之前运行良好并持续了相当长一段时间。
最好的情况是重新安装电缆。最坏的情况是,您的备份即将接受测试。