我目前配置的是 HP Proliant bl460gen6,带有控制器智能阵列 p711m Ubuntu 操作系统,在 raid 1+0 中配置了 35 个硬盘驱动器,其中 1 个磁盘处于备用状态,通常我使用默认命令监控 raid 的状态
hpacucli ctrl 全部显示配置
如果发现磁盘故障,我会更换它。偶然间我注意到二极管发出信号,表示存储系统上的两个硬盘坏了。同时,报告中的 hpacucli 说所有硬盘都正常。在谷歌搜索问题后,我得到了另一个版本的 hpacucli 语法,如
hpacucli ctrl slot=2 ld 1 显示
实施后,确认存在问题 HDD 更换一个 HDD 继续监视情况,RAID 的恢复在正常模式下进行,但是,列表中 HDD 的编号是错误的,驱动器编号加倍
更换的 HDD 位于插槽 2
hpacucli ctrl 全部显示配置
Smart Array P711m in Slot 2
array A (SATA, Unused Space: 0 MB)
logicaldrive 1 (61.9 TB, RAID 1+0, Recovering, 75% complete)
physicaldrive 1E:1:1 (port 1E:box 1:bay 1, SATA, 4000.7 GB, OK)
physicaldrive 1E:1:1 (port 1E:box 1:bay 1, SATA, 4000.7 GB, OK)
physicaldrive 1E:1:2 (port 1E:box 1:bay 2, SATA, 4000.7 GB, Rebuilding)
physicaldrive 1E:1:2 (port 1E:box 1:bay 2, SATA, 4000.7 GB, Rebuilding)
physicaldrive 1E:1:3 (port 1E:box 1:bay 3, SATA, 4000.7 GB, OK)
physicaldrive 1E:1:3 (port 1E:box 1:bay 3, SATA, 4000.7 GB, OK)
physicaldrive 1E:1:4 (port 1E:box 1:bay 4, SATA, 4000.7 GB, OK)
physicaldrive 1E:1:4 (port 1E:box 1:bay 4, SATA, 4000.7 GB, OK)
physicaldrive 1E:1:5 (port 1E:box 1:bay 5, SATA, 4000.7 GB, OK)
physicaldrive 1E:1:5 (port 1E:box 1:bay 5, SATA, 4000.7 GB, OK)
physicaldrive 1E:1:6 (port 1E:box 1:bay 6, SATA, 4000.7 GB, OK)
physicaldrive 1E:1:6 (port 1E:box 1:bay 6, SATA, 4000.7 GB, OK)
physicaldrive 1E:1:7 (port 1E:box 1:bay 7, SATA, 4000.7 GB, OK)
physicaldrive 1E:1:7 (port 1E:box 1:bay 7, SATA, 4000.7 GB, OK)
physicaldrive 1E:1:8 (port 1E:box 1:bay 8, SATA, 4000.7 GB, OK)
physicaldrive 1E:1:8 (port 1E:box 1:bay 8, SATA, 4000.7 GB, OK)
physicaldrive 1E:1:9 (port 1E:box 1:bay 9, SATA, 4000.7 GB, OK)
physicaldrive 1E:1:9 (port 1E:box 1:bay 9, SATA, 4000.7 GB, OK)
physicaldrive 1E:1:10 (port 1E:box 1:bay 10, SATA, 4000.7 GB, OK)
physicaldrive 1E:1:10 (port 1E:box 1:bay 10, SATA, 4000.7 GB, OK)
physicaldrive 1E:1:11 (port 1E:box 1:bay 11, SATA, 4000.7 GB, OK)
physicaldrive 1E:1:11 (port 1E:box 1:bay 11, SATA, 4000.7 GB, OK)
physicaldrive 1E:1:12 (port 1E:box 1:bay 12, SATA, 4000.7 GB, OK)
physicaldrive 1E:1:12 (port 1E:box 1:bay 12, SATA, 4000.7 GB, OK)
physicaldrive 1E:1:13 (port 1E:box 1:bay 13, SATA, 4000.7 GB, OK)
physicaldrive 1E:1:13 (port 1E:box 1:bay 13, SATA, 4000.7 GB, OK)
physicaldrive 1E:1:14 (port 1E:box 1:bay 14, SATA, 4000.7 GB, OK)
physicaldrive 1E:1:14 (port 1E:box 1:bay 14, SATA, 4000.7 GB, OK)
physicaldrive 1E:1:15 (port 1E:box 1:bay 15, SATA, 4000.7 GB, OK)
physicaldrive 1E:1:16 (port 1E:box 1:bay 16, SATA, 4000.7 GB, OK)
physicaldrive 1E:1:17 (port 1E:box 1:bay 17, SATA, 4000.7 GB, OK)
physicaldrive 1E:1:18 (port 1E:box 1:bay 18, SATA, 4000.7 GB, OK)
physicaldrive 1E:1:19 (port 1E:box 1:bay 19, SATA, 4000.7 GB, OK)
physicaldrive 1E:1:20 (port 1E:box 1:bay 20, SATA, 4000.7 GB, OK)
physicaldrive 1E:1:21 (port 1E:box 1:bay 21, SATA, 4000.7 GB, OK, active spare)
hpacucli ctrl slot=2 ld 1 显示
Smart Array P711m in Slot 2
array A
Logical Drive: 1
Size: 61.9 TB
Fault Tolerance: 1+0
Heads: 255
Sectors Per Track: 32
Cylinders: 65535
Strip Size: 256 KB
Full Stripe Size: 4352 KB
Status: Recovering, 78% complete
MultiDomain Status: OK
Caching: Enabled
Unique Identifier:
Disk Name: /dev/sda
Mount Points: None
Logical Drive Label:
Mirror Group 0:
physicaldrive 1E:1:1 (port 1E:box 1:bay 1, SATA, 4000.7 GB, OK)
physicaldrive 1E:1:2 (port 1E:box 1:bay 2, SATA, 4000.7 GB, Rebuilding)
physicaldrive 1E:1:3 (port 1E:box 1:bay 3, SATA, 4000.7 GB, OK)
physicaldrive 1E:1:4 (port 1E:box 1:bay 4, SATA, 4000.7 GB, OK)
physicaldrive 1E:1:5 (port 1E:box 1:bay 5, SATA, 4000.7 GB, OK)
physicaldrive 1E:1:6 (port 1E:box 1:bay 6, SATA, 4000.7 GB, OK)
physicaldrive 1E:1:7 (port 1E:box 1:bay 7, SATA, 4000.7 GB, OK)
physicaldrive 1E:1:8 (port 1E:box 1:bay 8, SATA, 4000.7 GB, OK)
physicaldrive 1E:1:9 (port 1E:box 1:bay 9, SATA, 4000.7 GB, OK)
physicaldrive 1E:1:10 (port 1E:box 1:bay 10, SATA, 4000.7 GB, OK)
physicaldrive 1E:1:11 (port 1E:box 1:bay 11, SATA, 4000.7 GB, OK)
physicaldrive 1E:1:12 (port 1E:box 1:bay 12, SATA, 4000.7 GB, OK)
physicaldrive 1E:1:13 (port 1E:box 1:bay 13, SATA, 4000.7 GB, OK)
physicaldrive 1E:1:14 (port 1E:box 1:bay 14, SATA, 4000.7 GB, OK)
physicaldrive 1E:1:1 (port 1E:box 1:bay 1, SATA, 4000.7 GB, OK)
physicaldrive 1E:1:2 (port 1E:box 1:bay 2, SATA, 4000.7 GB, Failed)
physicaldrive 1E:1:3 (port 1E:box 1:bay 3, SATA, 4000.7 GB, OK)
哪里出错了,如何修复,以及为什么不同的 hpacucli 命令返回不同的 HDD 状态
答案1
您可能已安装双域布线(多路径 SAS)。
P711 是刀片服务器 SAS RAID 控制器,用于连接刀片机箱扩展端口(SAS 交换机)并链接到更大的机箱(如 D6000 35 或 70 托架 SAS JBOD)。
这可能就是您所遇到的情况。等待磁盘重建。
另外,您不应该以现在的方式监控 RAID 状态。您只需安装 HP 管理代理,系统就会通过电子邮件或 SNMP 陷阱发送健康状态变化信息。
看:监控 HP ProLiant DL380 G7 而不会造成臃肿
使用hplog -v
也将显示所有系统警报。