HP 代码 341“物理驱动器状态:预测故障。预计该物理驱动器将很快出现故障。”

HP 代码 341“物理驱动器状态:预测故障。预计该物理驱动器将很快出现故障。”

这是我的第一篇帖子所以请耐心等待 :-)

背景: 我有一个 RAID5 设置,其中包含 4 个磁盘,多年来一直运行良好。一个驱动器发生故障后,我安装了新驱动器,并进行了重建,但它会将新驱动器标记为智能状态故障。

硬件: HP Proliant ML350 G6、6Gb RAM、Xeon E5620、BIOS D22 Windows 服务器 2008R2(已更新)智能阵列 P410i 4x 300Gb SAS 磁盘 10k

情况: 我买了 6 个新硬盘,在重建阵列时,智能状态失败。我试过 6 个硬盘中的 5 个,都出现相同的错误。

我做了什么: 已将 raid 控制器和几个硬盘更新至最新 FW。

我尝试使用其中两个新驱动器来设置带有 raid1 的另一个逻辑驱动器以对其进行测试,并且没有任何问题。

开机时所有驱动器均已插入和取出。

想法和问题: 真的 5 个都会被打破吗?

有没有什么办法可以清除智能失败状态?

以下是 ACU 的最新报告: https://dl.dropboxusercontent.com/u/15772069/report-4bef4a9c-00000cf8-00000000.zip其中最重要的部分是:

ACU Version                             8.70.9.0
Diagnostic Module Version               5.2.64.0
INFOMGR Version                         6.0.1.0
Time Generated                          Monday November 23, 2015 9:22:40AM

Device Summary:
   Smart Array P410i in Embedded Slot

Consolidated Error Report:
   Controller: Smart Array P410i in Embedded Slot
       Device: Physical Drive 1I:1:2
      Message: The physical drive has failed.
   Controller: Smart Array P410i in Embedded Slot
       Device: Physical Drive 2I:1:6
      Message: Physical Drive State: Predictive failure. This physical drive is predicted to fail soon.
   Controller: Smart Array P410i in Embedded Slot
       Device: Physical Drive 2I:1:7
      Message: Physical Drive State: Predictive failure. This physical drive is predicted to fail soon.

Report for Smart Array P410i in Embedded Slot
---------------------------------------------

Smart Array P410i in Embedded Slot : Device Error Report

Device                Severity Error                                                                                    
--------------------- -------- ---------------------------------------------------------------------------------------- 
Physical Drive 1I:1:2 Critical The physical drive has failed.
Physical Drive 2I:1:6 Warning  Physical Drive State: Predictive failure. This physical drive is predicted to fail soon.
Physical Drive 2I:1:7 Warning  Physical Drive State: Predictive failure. This physical drive is predicted to fail soon.

我用另一批驱动器替换了它们,现在看来智能状态没问题,但有时 Windows 会抱怨有故障的块等。:S

这 6 个新的智能状态失败的驱动器现在位于另一台服务器中。它是一台 ML350 G6,在 P410i 控制器上装有最新固件,还有驱动器(我认为)。我已将所有 6 个驱动器放入新的 RAID5 驱动器中,并刚刚初始化完成。它似乎运行良好,但智能状态仍然失败。

我按了 CTRL ALL SHOW CONFIG DETAIL,输出如下。有没有办法重置智能状态之类的?

Smart Storage Administrator CLI 2.60.18.0

检测控制器...完成。输入“help”查看支持的命令列表。输入“exit”关闭控制台。

` => ctrl all 显示配置详细信息

插槽 0 中的智能阵列 P410i(嵌入式) 总线接口:PCI 插槽:0 序列号:5001438005EDDF40 缓存序列号:PACCQ9SY70JU 控制器状态:正常 硬件修订版:C 固件版本:6.64-0 重建优先级:中 扩展优先级:中 表面扫描延迟:15 秒 表面扫描模式:空闲 支持并行表面扫描:否 队列深度:自动 监控和性能延迟:60 分钟 电梯排序:已启用 性能下降优化:已禁用 不一致修复策略:已禁用 等待缓存空间:已禁用 表面分析不一致通知:已禁用 发布提示超时:0 秒 缓存板存在:真 缓存状态:永久禁用 缓存状态详细信息:由于当前运行的固件不支持一个或多个连接的电池,因此缓存被禁用。缓存比率:25% 读取/75% 写入 驱动器写入缓存:已禁用 总缓存大小:256 MB 可用的总缓存内存:144 MB 无电池写入缓存:已禁用 缓存备用电源:电池 电池/电容器数量:1 电池/电容器状态:失败(更换电池) 支持 SATA NCQ:真 端口数:2 仅限内部 驱动程序名称:HpSAMD.sys 驱动程序版本:8.0.4.0 PCI 地址(域:总线:设备.功能):0000:04:00.0 主机序列号:CZJ941003H 支持的清理擦除:假 主启动卷:无 次要启动卷:无

端口名称:1I 端口 ID:0 端口连接号:0 SAS 地址:5001438005EDDF40 端口位置:内部

端口名称:2I 端口 ID:1 端口连接号:1 SAS 地址:5001438005EDDF44 端口位置:内部

位于端口 1I、盒子 1 的内部驱动器笼,OK

  Power Supply Status: Not Redundant
  Drive Bays: 4
  Port: 1I
  Box: 1
  Location: Internal

物理驱动器 physicaldrive 1I:1:1 (端口 1I:盒 1:托架 1,SAS HDD,300 GB,预测性故障) physicaldrive 1I:1:2 (端口 1I:盒 1:托架 2,SAS HDD,300 GB,预测性故障) physicaldrive 1I:1:3 (端口 1I:盒 1:托架 3,SAS HDD,300 GB,预测性故障)

位于端口 2I、盒子 1 的内部驱动器笼,OK

  Power Supply Status: Not Redundant
  Drive Bays: 4
  Port: 2I
  Box: 1
  Location: Internal

物理驱动器 physicaldrive 2I:1:5 (端口 2I:box 1:bay 5, SAS HDD, 300 GB, 预测性故障) physicaldrive 2I:1:6 (端口 2I:box 1:bay 6, SAS HDD, 300 GB, 预测性故障) physicaldrive 2I:1:7 (端口 2I:box 1:bay 7, SAS HDD, 300 GB, 预测性故障)

阵列:A 接口类型:SAS 未使用空间:0 MB (0.0%) 已用空间:1.6 TB (100.0%) 状态:OK 阵列类型:数据

  Logical Drive: 1
     Size: 1.4 TB
     Fault Tolerance: 5
     Heads: 255
     Sectors Per Track: 32
     Cylinders: 65535
     Strip Size: 64 KB
     Full Stripe Size: 320 KB
     Status: OK
     Caching:  Disabled
     Parity Initialization Status: Initialization Completed
     Unique Identifier: 600508B1001030354544444634300500
     Disk Name: \\.\PhysicalDrive0 (Disk 0) (Bus: 0,Target: 4,Lun: 0)
     Mount Points: Offline 500 MB Partition Number 1, C:\ 146.0 GB Partition Number 2, D:\ 1.2 TB Partition Number 3
     Logical Drive Label: A0017FDF5001438005EDDF40CEA0
     Drive Type: Data
     LD Acceleration Method: All disabled

  physicaldrive 1I:1:1
     Port: 1I
     Box: 1
     Bay: 1
     Status: Predictive Failure
     Drive Type: Data Drive
     Interface Type: SAS
     Size: 300 GB
     Drive exposed to OS: False
     Logical/Physical Block Size: 512/512
     Rotational Speed: 10000
     Firmware Revision: HPDG
     Serial Number: 6SE52A2P0000B213CFP2
     WWID: 5000C500437249DD
     Model: HP      EG0300FAWHV
     Current Temperature (C): 33
     Maximum Temperature (C): 62
     PHY Count: 2
     PHY Transfer Rate: 6.0Gbps, Unknown
     Sanitize Erase Supported: False
     Shingled Magnetic Recording Support: None

  physicaldrive 1I:1:2
     Port: 1I
     Box: 1
     Bay: 2
     Status: Predictive Failure
     Drive Type: Data Drive
     Interface Type: SAS
     Size: 300 GB
     Drive exposed to OS: False
     Logical/Physical Block Size: 512/512
     Rotational Speed: 10000
     Firmware Revision: HPDG
     Serial Number: 6SE51A8S0000B213BAGM
     WWID: 5000C5004371B7C5
     Model: HP      EG0300FAWHV
     Current Temperature (C): 33
     Maximum Temperature (C): 68
     PHY Count: 2
     PHY Transfer Rate: 6.0Gbps, Unknown
     Sanitize Erase Supported: False
     Shingled Magnetic Recording Support: None

  physicaldrive 1I:1:3
     Port: 1I
     Box: 1
     Bay: 3
     Status: Predictive Failure
     Drive Type: Data Drive
     Interface Type: SAS
     Size: 300 GB
     Drive exposed to OS: False
     Logical/Physical Block Size: 512/512
     Rotational Speed: 10000
     Firmware Revision: HPDG
     Serial Number: 6SE519840000B212DHLM
     WWID: 5000C500437278E1
     Model: HP      EG0300FAWHV
     Current Temperature (C): 31
     Maximum Temperature (C): 63
     PHY Count: 2
     PHY Transfer Rate: 6.0Gbps, Unknown
     Sanitize Erase Supported: False
     Shingled Magnetic Recording Support: None

  physicaldrive 2I:1:5
     Port: 2I
     Box: 1
     Bay: 5
     Status: Predictive Failure
     Drive Type: Data Drive
     Interface Type: SAS
     Size: 300 GB
     Drive exposed to OS: False
     Logical/Physical Block Size: 512/512
     Rotational Speed: 10000
     Firmware Revision: HPDG
     Serial Number: 6SE521XZ0000B213B62Z
     WWID: 5000C50043760E91
     Model: HP      EG0300FAWHV
     Current Temperature (C): 32
     Maximum Temperature (C): 63
     PHY Count: 2
     PHY Transfer Rate: 6.0Gbps, Unknown
     Sanitize Erase Supported: False
     Shingled Magnetic Recording Support: None

  physicaldrive 2I:1:6
     Port: 2I
     Box: 1
     Bay: 6
     Status: Predictive Failure
     Drive Type: Data Drive
     Interface Type: SAS
     Size: 300 GB
     Drive exposed to OS: False
     Logical/Physical Block Size: 512/512
     Rotational Speed: 10000
     Firmware Revision: HPDG
     Serial Number: 6SE519140000B213A6SG
     WWID: 5000C5004371FB05
     Model: HP      EG0300FAWHV
     Current Temperature (C): 33
     Maximum Temperature (C): 67
     PHY Count: 2
     PHY Transfer Rate: 6.0Gbps, Unknown
     Sanitize Erase Supported: False
     Shingled Magnetic Recording Support: None

  physicaldrive 2I:1:7
     Port: 2I
     Box: 1
     Bay: 7
     Status: Predictive Failure
     Drive Type: Data Drive
     Interface Type: SAS
     Size: 300 GB
     Drive exposed to OS: False
     Logical/Physical Block Size: 512/512
     Rotational Speed: 10000
     Firmware Revision: HPDG
     Serial Number: 6SE5194L0000B213BC7W
     WWID: 5000C50043720255
     Model: HP      EG0300FAWHV
     Current Temperature (C): 31
     Maximum Temperature (C): 60
     PHY Count: 2
     PHY Transfer Rate: 6.0Gbps, Unknown
     Sanitize Erase Supported: False
     Shingled Magnetic Recording Support: None

SEP(供应商 ID PMCSIERA,型号 SRC 8x6G)250 设备编号:250 固件版本:RevC WWID:5001438005EDDF4F 供应商 ID:PMCSIERA 型号:SRC 8x6G

=>`

初始化完成后,以下是最新的 ADUreport: https://dl.dropboxusercontent.com/u/15772069/ADUReport%20after%20init.zip

答案1

假设所有 5 个新硬盘都是正确类型的,那么它们全都出现故障的可能性极小。我们曾经遇到过类似的问题,HP ProLiant NAS 的新磁盘不断出现故障,我们通过更换磁盘控制器解决了这个问题。

相关内容