HP P410 ADU (hpacucli) generates an empty report


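I am trying to monitor the SMART status of my hard drives. I tried both the smartctl tool and HP's own hpacucli, which generates an ADU report. Neither was useful: smartctl does not show values such as power-on hours or drive temperature, and the ADU report shows them as empty.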

What is the correct way to monitor hard drives behind an HP RAID controller?

smartctl: smartctl -a -d cciss,0 /dev/sg0

    smartctl 5.41 2011-06-09 r3365 [x86_64-linux-2.6.32-20-pve] (local build)
    Copyright (C) 2002-11 by Bruce Allen, http://smartmontools.sourceforge.net

    User Capacity:        3,000,592,982,016 bytes [3.00 TB]
    Logical block size:   512 bytes
    Logical Unit id:      0x5000c5003f11a168
    Serial number:        XXXXXXX
    Device type:          disk
    Local Time is:        Sun Jul 14 22:42:08 2013 HADT
    Device supports SMART and is Enabled
    Temperature Warning Disabled or Not Supported
    SMART Health Status: OK

    Current Drive Temperature:     <not available>
    Read defect list: asked for grown list but didn't get it

    Error Counter logging not supported
    Device does not support Self Test logging

hpacucli: hpacucli ctrl all diag file=/usr/monitor/report.zip ris=on xml=on zip=on

    Smart Array P410 in slot 1 : Internal Drive Cage at Port 1I : Box 1 : Drive Cage on Port 1I : Physical Drive (3 TB SATA) 1I:1:12 : Monitor and Performance Statistics (Since Reset)

    Serial Number                        XXXXXXXX
    Firmware Revision                    0003    
    Product Revision                     ATA     ST33000650NS    
    Reference Time                       0x00001715
    Sectors Read                         0x00000000f868ca8b
    Read Errors Hard                     0x00000000
    Read Errors Retry Recovered          0x00000000
    Read Errors ECC Corrected            0x0000000000000000
    Sectors Written                      0x0000000016dd925d
    Write Errors Hard                    0x00000000
    Write Errors Retry Recovered         0x00000000
    Seek Count                           0x0000000000000000
    Seek Errors                          0x0000000000000000
    Spin Cycles                          0x00000000
    Spin Up Time                         0x0000
    Performance Test 1                   0x0000
    Performance Test 2                   0x0000
    Performance Test 3                   0x0000
    Performance Test 4                   0x0000
    Reallocation Sectors                 0xffffffff
    Reallocated Sectors                  0x00000000
    DRQ Time Outs                        0x0000
    Other Time Outs                      0x0000
    Drive Rebuild Count                  0 (0x0000)
    Spin Retries                         65535 (0xffff)
    Recovers Failed Read                 0x0000
    Recovers Failed Write                0x0000
    Format Errors                        0x0000
    Self Test Failures                   0x0000
    Not Ready Failures                   0x00000000
    Remap Abort Failures                 0x00000000
    IRQ Deglitch Count                   0 (0x00000000)
    Bus Faults                           0x00000003
    Hot Plug Count                       0 (0x00000000)
    Track Rewrite Errors                 0xffff
    Write Errors After Remap             0x0000
    Background Firmware Revision         0x00 0x00 0x00 0x00 0x00 0x00 0x00 0x00
    Media Failures                       0x0000
    Hardware Errors                      0x0000
    Aborted Command Failures             0x0000
    Spin Up Failures                     0x0000
    Bad Target Count                     0 (0x0000)
    Predictive Failure Errors            0x00000000

Answer 1

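You should not use smartctl directly against these controllers. HP Smart Array controllers use several techniques to determine the health of the drives and the system. SMART is one of them, but it is not the final determinant. It makes sense to use the purpose-built tools that already exist.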

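So, for your situation, configure hp-snmp-agents to send email when a problem occurs. On Linux, the email goes to the root user by default and a message is also written to the system log, but the alert destination can be configured in /opt/hp/hp-snmp-agents/cma.conf.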

As for the hpacucli utility, running hpacucli ctrl all show config detail provides most of the relevant array health information.
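If you want to script a check around that command, here is a minimal Python sketch. It assumes the report contains indented lines of the form "<something> Status: <value>" under headings such as "Smart Array ...", "Array: A", "Logical Drive: 1" and "physicaldrive 1I:1:12"; that layout is an assumption about the output, so compare it with what your hpacucli version actually prints. The function names are hypothetical, and treating only "OK" as healthy is a simplification.

```python
import re
import subprocess

# Assumed layout: "<optional prefix> Status: <value>" on its own line.
STATUS_LINE = re.compile(r"^\s*(?P<key>[^:]*Status):\s*(?P<value>.+?)\s*$")

# Assumed heading prefixes that introduce a controller, array, or drive.
HEADINGS = ("Smart Array", "Array:", "Logical Drive:", "physicaldrive")


def find_bad_statuses(report, ok_values=("OK",)):
    """Return (heading, key, value) for each Status line that is not OK."""
    problems = []
    heading = ""
    for line in report.splitlines():
        stripped = line.strip()
        if not stripped:
            continue
        match = STATUS_LINE.match(line)
        if match:
            if match.group("value") not in ok_values:
                problems.append((heading, match.group("key"), match.group("value")))
        elif stripped.startswith(HEADINGS):
            # Remember the most recent heading so a problem can be located.
            heading = stripped
    return problems


def run_report():
    """Run the command from this answer and return its text output."""
    result = subprocess.run(
        ["hpacucli", "ctrl", "all", "show", "config", "detail"],
        capture_output=True, text=True, check=True,
    )
    return result.stdout
```

From a cron job you could call find_bad_statuses(run_report()) and send mail whenever the returned list is not empty. Some status values other than "OK" may be harmless on your hardware, so extend ok_values as needed.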

Answer 2

This works for me:

    smartctl -d cciss,0 -a /dev/cciss/c0d0

    Device supports SMART and is Enabled
    Temperature Warning Enabled
    SMART Health Status: OK

    Current Drive Temperature:     31 C
    Drive Trip Temperature:        68 C
    Elements in grown defect list: 0
    Vendor (Seagate) cache information
      Blocks sent to initiator = 3203138637
      Blocks received from initiator = 3715997197
      Blocks read from cache and sent to initiator = 484569203
      Number of read and write commands whose size <= segment size = 1111593814
      Number of read and write commands whose size > segment size = 0
    Vendor (Seagate/Hitachi) factory information
      number of hours powered up = 14706.28
      number of minutes until next internal SMART test = 33
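To pull the interesting values out of that output in a script, something like the following Python sketch may help. It only relies on the three lines visible in the output above (health status, current temperature, hours powered up). The function names are hypothetical, and a drive that reports "<not available>", as in the question, simply yields no temperature key.

```python
import re


def smartctl_command(device, index):
    """Build the smartctl call for the drive at position `index` behind the controller."""
    return ["smartctl", "-d", "cciss,%d" % index, "-a", device]


def parse_smartctl(text):
    """Extract health, temperature and power-on hours when they are present."""
    fields = {}
    health = re.search(r"^[ \t]*SMART Health Status:\s*(.+?)\s*$", text, re.MULTILINE)
    if health:
        fields["health"] = health.group(1)
    # Matches "31 C" but not "<not available>".
    temp = re.search(r"^[ \t]*Current Drive Temperature:\s*(\d+)\s*C\b", text, re.MULTILINE)
    if temp:
        fields["temperature_c"] = int(temp.group(1))
    hours = re.search(r"number of hours powered up\s*=\s*([\d.]+)", text)
    if hours:
        fields["power_on_hours"] = float(hours.group(1))
    return fields
```

You would feed parse_smartctl the standard output of the command built by smartctl_command, for example via subprocess.run, once per drive index. A missing key tells you the drive or controller did not report that value.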
