在 LSI MegaRaid 卡上启用温度支持

在 LSI MegaRaid 卡上启用温度支持

我在两台不同的机器上安装了两张 IBM ServeRAID M5014 SAS/SATA 控制器卡。一张报告驱动器温度,另一张不报告。我希望这是某种可以设置的 megacli 选项。但是,我查看了整个 megacli 文档,没有任何提示表明这是一个可配置选项。有没有人使用这些卡的经验,可以告诉我固件升级是否可以解决此差异?当前正在工作的卡(报告温度)位于其本机 IBM Systems X 配置中,而第二张卡(除了报告温度外,目前正在运行)并不位于本机 Systems X 机器中,而更像是……科学怪人,找不到更好的词了。

作为另一个数据点,是否有人在非 Systems X 机器中拥有这些卡之一并报告温度?

以下是相关信息的汇总,仅供存档之用,也供感兴趣的人参考。

首先,虽然这两款卡都是 M5014,但它们是不同的版本。报告温度的是:

                    Versions
                ================
Product Name    : ServeRAID M5014 SAS/SATA Controller
Serial No       : SV13815228
FW Package Build: 12.12.0-0065

                    Mfg. Data
                ================
Mfg. Date       : 09/17/11
Rework Date     : 00/00/00
Revision No     : 80B
Battery FRU     : N/A

                Image Versions in Flash:
                ================
FW Version         : 2.120.143-1325
BIOS Version       : 3.22.00_4.11.05.00_0x05060000
Preboot CLI Version: 04.04-017:#%00008
WebBIOS Version    : 6.0-40-e_35-Rel
NVDATA Version     : 2.09.03-0020
Boot Block Version : 2.02.00.00-0000
BOOT Version       : 09.250.01.219

                Pending Images in Flash
                ================
None

                PCI Info
                ================
Controller Id   : 0000
Vendor Id       : 1000
Device Id       : 0079
SubVendorId     : 1014
SubDeviceId     : 03c7

Host Interface  : PCIE

未报告温度的是:

                    Versions
                ================
Product Name    : ServeRAID M5014 SAS/SATA Controller
Serial No       : SV02102642
FW Package Build: 12.0.1-0097

                    Mfg. Data
                ================
Mfg. Date       : 05/17/10
Rework Date     : 00/00/00
Revision No     :
Battery FRU     : N/A

                Image Versions in Flash:
                ================
BIOS Version       : 3.09.00
FW Version         : 2.0.33-0901
Preboot CLI Version: 02.00-015:#%00008
WebBIOS Version    : 3.0-22-e_12-Rel
NVDATA Version     : 2.02.0043
Boot Block Version : 2.00.00.00-0018
BOOT Version       : 01.250.04.219

                Pending Images in Flash
                ================
None

                PCI Info
                ================
Controller Id   : 0000
Vendor Id       : 1000
Device Id       : 0079
SubVendorId     : 1014
SubDeviceId     : 03c7

Host Interface  : PCIE

探测驱动器温度的结果是:

sudo megacli -PDList -aALL -NoLog | grep -m1 "Drive Temperature"
Drive Temperature :28C (82.40 F)

sudo megacli -PDList -aALL -NoLog | grep -m1 "Drive Temperature"
Drive Temperature : N/A

进一步调查显示,第二张卡不支持温度。

sudo megacli -AdpAllInfo -aAll

                Supported PD Operations
                ================
Support Temperature                     : Yes

并且未在第二张卡上启用。

                Supported PD Operations
                ================
Support Temperature                     : No

以下是物理设备的详细信息:

卡片#1

Enclosure Device ID: 8
Slot Number: 8
Drive's postion: DiskGroup: 7, Span: 0, Arm: 0
Enclosure position: 1
Device Id: 19
WWN: 5000c5001130d3ca
Sequence Number: 2
Media Error Count: 0
Other Error Count: 0
Predictive Failure Count: 0
Last Predictive Failure Event Seq Number: 0
PD Type: SATA

Raw Size: 1.364 TB [0xaea87b30 Sectors]
Non Coerced Size: 1.363 TB [0xae987b30 Sectors]
Coerced Size: 1.363 TB [0xae81a800 Sectors]
Firmware state: Online, Spun Up
Device Firmware Level: CC1H
Shield Counter: 0
Successful diagnostics completion on :  N/A
SAS Address(0): 0x500507603947ae49
Connected Port Number: 0(path0)
Inquiry Data:             9VS1GZVVST31500341AS                            CC1H
FDE Capable: Not Capable
FDE Enable: Disable
Secured: Unsecured
Locked: Unlocked
Needs EKM Attention: No
Foreign State: None
Device Speed: 3.0Gb/s
Link Speed: 6.0Gb/s
Media Type: Hard Disk Device
Drive Temperature :35C (95.00 F)
PI Eligibility:  No
Drive is formatted for PI information:  No
PI: No PI
Port-0 :
Port status: Active
Port's Linkspeed: 6.0Gb/s
Drive has flagged a S.M.A.R.T alert : No

第二张牌:

Enclosure Device ID: 252
Slot Number: 6
Drive's postion: DiskGroup: 6, Span: 0, Arm: 0
Enclosure position: N/A
Device Id: 18
WWN:
Sequence Number: 2
Media Error Count: 0
Other Error Count: 0
Predictive Failure Count: 0
Last Predictive Failure Event Seq Number: 0
PD Type: SATA

Raw Size: 1.364 TB [0xaea87b30 Sectors]
Non Coerced Size: 1.363 TB [0xae987b30 Sectors]
Coerced Size: 1.363 TB [0xae81a800 Sectors]
Firmware state: Online, Spun Up
Device Firmware Level: CC1J
Shield Counter: 0
Successful diagnostics completion on :  N/A
SAS Address(0): 0x4433221106000000
Connected Port Number: 4(path0)
Inquiry Data:             9VS0FTMYST31500341AS                            CC1J
FDE Capable: Not Capable
FDE Enable: Disable
Secured: Unsecured
Locked: Unlocked
Needs EKM Attention: No
Foreign State: None
Device Speed: 3.0Gb/s
Link Speed: 3.0Gb/s
Media Type: Hard Disk Device
Drive Temperature : N/A
PI Eligibility:  No
Drive is formatted for PI information:  No
PI: No PI
Port-0 :
Port status: Active
Port's Linkspeed: 3.0Gb/s
Drive has flagged a S.M.A.R.T alert : No

这是“sudo megacli -AdpAllInfo -aAll”的差异(diff <(sort card1.txt) <(sort card2.txt)):

72d70
< Auto Enhanced Import             : No
76c74
< Battery Warning                  : Enabled
---
> Battery Warning                  : Disabled
83c81
< BIOS Version       : 3.22.00_4.11.05.00_0x05060000
---
> BIOS Version       : 3.09.00
85,87c83,84
< Boot Block Version : 2.02.00.00-0000
< BOOT Version       : 09.250.01.219
< BreakMirror RAID Support         : No
---
> Boot Block Version : 2.00.00.00-0018
> BOOT Version       : 01.250.04.219
94d90
< ChipRevision    : B4
105,106c101,102
< Current Size of FW Cache       : 145 MB
< Current Time                     : 16:17:43 6/25, 2013
---
> Current Size of FW Cache       : 0 MB
> Current Time                     : 16:23:41 6/25, 2013
110c106
< Default spin down time in minutes: 30
---
> Default spin down time in minutes: 0
112c108
< Delay Among Spinup Groups        : 2s
---
> Delay Among Spinup Groups        : 12s
131d126
< Disable Join Mirror              : No
136c131
< Disable Spin Down of hot spares         : Yes
---
> Disable Spin Down of hot spares         : No
139d133
< Don't Auto spin down Configured Drives
151,152c145
< Enable Shield State              : No
< Enable Spin Down of UnConfigured Drives : Yes
---
> Enable Spin Down of UnConfigured Drives : No
166,167c159,160
< FW Package Build: 12.12.0-0065
< FW Version         : 2.120.143-1325
---
> FW Package Build: 12.0.1-0097
> FW Version         : 2.0.33-0901
186,187c179,180
< Max Drives to Spinup at One Time : 4
< Maximum number of direct attached drives to spin up in 1 min : 120
---
> Max Drives to Spinup at One Time : 2
> Maximum number of direct attached drives to spin up in 1 min : 0
192d184
< Max power savings option is  not allowed for LDs. Only T10 power conditions are to be used.
202c194
< Mfg. Date       : 09/17/11
---
> Mfg. Date       : 05/17/10
208c200
< NVDATA Version     : 2.09.03-0020
---
> NVDATA Version     : 2.02.0043
218c210
< Physical Devices  : 10
---
> Physical Devices  : 9
222c214
< Power Saving option              : Don't spin down Hot spares
---
> Power Saving option              : All power saving options are enabled
226c218
< Preboot CLI Version: 04.04-017:#%00008
---
> Preboot CLI Version: 02.00-015:#%00008
241c233
< Revision No     : 80B
---
> Revision No     :
243c235
< SAS Address      : 500605b003b0c060
---
> SAS Address      : 500605b00234f210
250c242
< Serial No       : SV13815228
---
> Serial No       : SV02102642
256c248
< Spin Down time                          : 30
---
> Spin Down time                          : 0
261c253
< Support Boot Time PFK Change    : Yes
---
> Support Boot Time PFK Change    : No
271,272c263,264
< Support Temperature                     : Yes
< Support the OCE without adding drives : Yes
---
> Support Temperature                     : No
> Support the OCE without adding drives : No
279d270
< TTY Log In Flash                 : No
288c279
< WebBIOS Version    : 6.0-40-e_35-Rel
---
> WebBIOS Version    : 3.0-22-e_12-Rel

答案1

此问题已通过固件更新得到解决。由于 M5014 卡不再驻留在 IBM 系统中,因此我无法使用其标准固件包刷新该卡。为了完成固件更新,我使用 LSI 的 9260-8i 固件刷新了该卡。

请注意,如果您的固件版本低于 12.7.0-0020(IBM),则需要先刷入 LSI 的 12.12.0-0090 固件,然后才能刷入较新的版本。无需启动到 DOS 或创建复杂的 USB 拇指驱动器。Debian Wheezy 上的以下命令可以解决问题

sudo megacli -adpfwflash -f the_rom_file.rom -a0

祝您突袭愉快......

相关内容