我们有一台 Sun Sparc 5440 服务器,最初有 64 GB 的 RAM,我们正尝试将其升级到 128 GB。在此过程中,SCSI 根磁盘似乎已损坏,因此我们正尝试重建系统。
我们有一张 Solaris 10 的 DVD(这是必需的,因为该服务器镜像了生产服务器)。它是从 Oracle 下载的最新版本(sol-10-u11-ga-sparc-dvd.iso)。当我们将 DVD 放入驱动器并执行操作时,boot cdrom
它会运行一段时间,然后就冻结了。
我们已经验证该 DVD 可以在另一个 Sparc 系统上运行,因此我们排除了这种可能性。
我们还排除了 DVD 驱动器的问题(通过连接 USB-DVD 驱动器并从中启动)。
搜索之后,我们尝试使用 -v 标志进行启动,这是我们看到的输出:
{0} ok boot /pci@400/pci@0/pci@9/pci@0/usb@0,2/storage@1/disk -rsv
Boot device: /pci@400/pci@0/pci@9/pci@0/usb@0,2/storage@1/disk File and args: -rsv
hsfs-file-system
Loading: /platform/sun4v/boot_archive
ramdisk-root ufs-file-system
Loading: /platform/SUNW,T5440/kernel/sparcv9/unix
module /platform/SUNW,T5440/kernel/sparcv9/unix: text at [0x1000000, 0x10cbab5] data at 0x1800000
module /platform/SUNW,T5440/kernel/sparcv9/genunix: text at [0x10cbab8, 0x12b4e0f] data at 0x1a16800
module /platform/SUNW,T5440/kernel/misc/sparcv9/platmod: text at [0x12b4e10, 0x12b4e27] data at 0x1a6e4b0
module /platform/SUNW,T5440/kernel/cpu/sparcv9/SUNW,UltraSPARC-T2+: text at [0x12b4e30, 0x12bad2f] data at 0x1a6ec00
SunOS Release 5.10 Version Generic_147147-26 64-bit
Copyright (c) 1983, 2013, Oracle and/or its affiliates. All rights reserved.
os-io Ethernet address = 0:14:4f:ec:e9:b2
Using default device instance data
mem = 133857280K (0x1fea000000)
avail mem = 133886173184
root nexus = T5440
pseudo0 at root
pseudo0 is /pseudo
scsi_vhci0 at root
scsi_vhci0 is /scsi_vhci
ramdisk0 at root
ramdisk0 is /ramdisk-root
root on /ramdisk-root:a fstype ufs
virtual-device: cnex0
cnex0 is /virtual-devices@100/channel-devices@200
pseudo-device: dld0
dld0 is /pseudo/dld@0
px2 at root: 0x600 0x0
px2 is /pci@600
PCI Express-device: pci@0, pcieb10
pcieb10 is /pci@600/pci@0
PCIE-device: pci@c, pcieb12
pcieb12 is /pci@600/pci@0/pci@c
PCIE-device: SUNW,XVR-300@0, nfb0
nfb0 is /pci@600/pci@0/pci@c/SUNW,XVR-300@0
px0 at root: 0x400 0x0
px0 is /pci@400
PCI Express-device: pci@0, pcieb0
pcieb0 is /pci@400/pci@0
PCIE-device: pci@9, pcieb2
pcieb2 is /pci@400/pci@0/pci@9
PCIE-device: pci@0, pcieb4
pcieb4 is /pci@400/pci@0/pci@9/pci@0
PCIE-device: usb@0,2, ehci0
ehci0 is /pci@400/pci@0/pci@9/pci@0/usb@0,2
PCIE-device: usb@0, ohci0
ohci0 is /pci@400/pci@0/pci@9/pci@0/usb@0
PCIE-device: usb@0,1, ohci1
ohci1 is /pci@400/pci@0/pci@9/pci@0/usb@0,1
cpu0: UltraSPARC-T2+ (chipid 0, clock 1164 MHz)
cpu1: UltraSPARC-T2+ (chipid 0, clock 1164 MHz)
cpu2: UltraSPARC-T2+ (chipid 0, clock 1164 MHz)
...
cpu80: UltraSPARC-T2+ (chipid 1, clock 1164 MHz)
cpu81: UltraSPARC-T2+ (chipid 1, clock 1164 MHz)
cpu82: UltraSPARC-T2+ (chipid 1, clock 1164 MHz)
PCIE-device: pci@d, pcieb5
pcieb5 is /pci@400/pci@0/pci@d
cpu83: UltraSPARC-T2+ (chipid 1, clock 1164 MHz)
cpu84: UltraSPARC-T2+ (chipid 1, clock 1164 MHz)
...
cpu104: UltraSPARC-T2+ (chipid 1, clock 1164 MHz)
cpu105: UltraSPARC-T2+ (chipid 1, clock 1164 MHz)
cpu106: UltraSPARC-T2+ (chipid 1, clock 1164 MHz)
USB 2.0 device (usb644,1008) operating at hi speed (USB 2.x) on USB 2.0 root hub: storage@1, scsa2usb2 at bus address 2
TEAC USB DVD Reader DEF10BED1014
scsa2usb2 is /pci@400/pci@0/pci@9/pci@0/usb@0,2/storage@1
/pci@400/pci@0/pci@9/pci@0/usb@0,2/storage@1 (scsa2usb2) online
cpu107: UltraSPARC-T2+ (chipid 1, clock 1164 MHz)
cpu108: UltraSPARC-T2+ (chipid 1, clock 1164 MHz)
cpu109: UltraSPARC-T2+ (chipid 1, clock 1164 MHz)
...
cpu157: UltraSPARC-T2+ (chipid 2, clock 1164 MHz)
cpu158: UltraSPARC-T2+ (chipid 2, clock 1164 MHz)
cpu159: UltraSPARC-T2+ (chipid 2, clock 1164 MHz)
USB 2.0 device (usb4b4,6560) operating at hi speed (USB 2.x) on USB 2.0 root hub: hub@2, hubd0 at bus address 3
hubd0 is /pci@400/pci@0/pci@9/pci@0/usb@0,2/hub@2
/pci@400/pci@0/pci@9/pci@0/usb@0,2/hub@2 (hubd0) online
cpu160: UltraSPARC-T2+ (chipid 2, clock 1164 MHz)
cpu161: UltraSPARC-T2+ (chipid 2, clock 1164 MHz)
cpu162: UltraSPARC-T2+ (chipid 2, clock 1164 MHz)
...
cpu216: UltraSPARC-T2+ (chipid 3, clock 1164 MHz)
sd3 at scsa2usb2: target 0 lun 0
sd3 is /pci@400/pci@0/pci@9/pci@0/usb@0,2/storage@1/disk@0,0
cpu217: UltraSPARC-T2+ (chipid 3, clock 1164 MHz)
cpu218: UltraSPARC-T2+ (chipid 3, clock 1164 MHz)
cpu219: UltraSPARC-T2+ (chipid 3, clock 1164 MHz)
cpu220: UltraSPARC-T2+ (chipid 3, clock 1164 MHz)
cpu221: UltraSPARC-T2+ (chipid 3, clock 1164 MHz)
cpu222: UltraSPARC-T2+ (chipid 3, clock 1164 MHz)
USB 2.0 device (usb451,625f) operating at hi speed (USB 2.x) on USB 2.0 external hub: storage@3, scsa2usb3 at bus address 4
Texas Instruments Incorporated 6250 FW: 04.01.02.22.562AD423 ACD6CE6E2E1A
scsa2usb3 is /pci@400/pci@0/pci@9/pci@0/usb@0,2/hub@2/storage@3
/pci@400/pci@0/pci@9/pci@0/usb@0,2/hub@2/storage@3 (scsa2usb3) online
cpu223: UltraSPARC-T2+ (chipid 3, clock 1164 MHz)
cpu224: UltraSPARC-T2+ (chipid 3, clock 1164 MHz)
cpu225: UltraSPARC-T2+ (chipid 3, clock 1164 MHz)
cpu226: UltraSPARC-T2+ (chipid 3, clock 1164 MHz)
cpu227: UltraSPARC-T2+ (chipid 3, clock 1164 MHz)
cpu228: UltraSPARC-T2+ (chipid 3, clock 1164 MHz)
cpu229: UltraSPARC-T2+ (chipid 3, clock 1164 MHz)
cpu230: UltraSPARC-T2+ (chipid 3, clock 1164 MHz)
cpu231: UltraSPARC-T2+ (chipid 3, clock 1164 MHz)
sd4 at scsa2usb3: target 0 lun 0
sd4 is /pci@400/pci@0/pci@9/pci@0/usb@0,2/hub@2/storage@3/disk@0,0
cpu232: UltraSPARC-T2+ (chipid 3, clock 1164 MHz)
cpu233: UltraSPARC-T2+ (chipid 3, clock 1164 MHz)
cpu234: UltraSPARC-T2+ (chipid 3, clock 1164 MHz)
...
cpu254: UltraSPARC-T2+ (chipid 3, clock 1164 MHz)
cpu255: UltraSPARC-T2+ (chipid 3, clock 1164 MHz)
/pci@400/pci@0/pci@9/pci@0/usb@0,2/storage@1/disk@0,0 (sd3) online
/pci@400/pci@0/pci@9/pci@0/usb@0,2/hub@2/storage@3/disk@0,0 (sd4) online
PCIE-device: SUNW,qlc@0, qlc0
qlc0 is /pci@400/pci@0/pci@d/SUNW,qlc@0
PCIE-device: SUNW,qlc@0,1, qlc1
qlc1 is /pci@400/pci@0/pci@d/SUNW,qlc@0,1
px1 at root: 0x500 0x0
px1 is /pci@500
PCI Express-device: pci@0, pcieb6
pcieb6 is /pci@500/pci@0
PCIE-device: pci@9, pcieb7
pcieb7 is /pci@500/pci@0/pci@9
PCIE-device: SUNW,qlc@0, qlc2
qlc2 is /pci@500/pci@0/pci@9/SUNW,qlc@0
PCIE-device: SUNW,qlc@0,1, qlc3
qlc3 is /pci@500/pci@0/pci@9/SUNW,qlc@0,1
fp5 is /pci@400/pci@0/pci@d/SUNW,qlc@0,1/fp@0,0
fp4 is /pci@400/pci@0/pci@d/SUNW,qlc@0/fp@0,0
fp6 is /pci@500/pci@0/pci@9/SUNW,qlc@0/fp@0,0
fp7 is /pci@500/pci@0/pci@9/SUNW,qlc@0,1/fp@0,0
PCIE-device: pci@1, pcieb1
pcieb1 is /pci@400/pci@0/pci@1
PCIE-device: pci@c, pcieb3
pcieb3 is /pci@400/pci@0/pci@c
PCIE-device: pci@c, pcieb8
pcieb8 is /pci@500/pci@0/pci@c
PCIE-device: pci@d, pcieb9
pcieb9 is /pci@500/pci@0/pci@d
PCIE-device: pci@9, pcieb11
pcieb11 is /pci@600/pci@0/pci@9
PCIE-device: pci@d, pcieb13
pcieb13 is /pci@600/pci@0/pci@d
px3 at root: 0x700 0x0
px3 is /pci@700
PCI Express-device: pci@0, pcieb14
pcieb14 is /pci@700/pci@0
PCIE-device: pci@9, pcieb15
pcieb15 is /pci@700/pci@0/pci@9
PCIE-device: pci@c, pcieb16
pcieb16 is /pci@700/pci@0/pci@c
iscsi0 at root
iscsi0 is /iscsi
我们已经将它放置了一夜,但它仍没有任何进展。
知道可能出了什么问题吗?或者有什么建议可以进一步调试吗?
答案1
似乎您已用其他系统的硬件 RAID 替换了磁盘。您需要移除或激活此 RAID。
第一种方式。
更复杂的方法是通过操作系统将其移除,但您需要了解 OBP 的最低限度:如果 Solaris 在启动期间看到外来 RAID,它会挂起,这就是为什么您需要移除所有磁盘并从 DVD 或网络启动,然后在 solaris 启动时插入磁盘。然后通过 devfsadm 扫描磁盘并通过 raidctl 移除 RAID。我不喜欢这种方式,因为它需要在服务器附近。
第二种方式。
通过 OBP 在内部磁盘上激活外来 RAID。
ok setenv fcode-debug? true
ok setenv auto-boot? false
ok reset-all
运行 select 命令,这取决于平台。您可以查看 show-disks 命令以了解适合您平台的正确路径。
T5440 的路径选择说明:
ok select /pci@400/pci@0/pci@1/scsi@0/
我还为其他人提供其他平台的选择。
T1000 的路径选择说明
ok select /pci@7c0/pci@0/pci@8/scsi@2
T2000 的路径选择说明(零件号 501-6843)
ok select /pci@7c0/pci@0/pci@1/pci@0,2/LSILogic,sas@2
T2000 (pn 501-7501) 的路径选择说明:
ok select /pci@780/pci@0/pci@9/scsi@0
V215/V245 的路径选择说明:
ok select /pci@1e,600000/pci@0/pci@a/pci@0/pci@8/scsi@1
V445/Boston 的路径选择说明:
ok select /pci@1f,700000/pci@0/pci@2/pci@0/pci@8/LSILogic,sas@1
带 501-7244 SAS 控制器的 V445 路径选择说明
ok select /pci@1e,600000/pci@0/pci@2/scsi@0
T5120/T5220/Huron 的路径选择说明:
ok select /pci@0/pci@0/pci@2/scsi@0
T5140/T5240 的路径选择说明:
ok select /pci@400/pci@0/pci@8/scsi@0/
此后,您可以从 OBP 管理 RAID 卷:
ok show-volumes ( Look for the inactive volume)
ok X activate-volume (X is the volume number)
您也可以通过以下方式删除卷
ok X delete-volume (X is the volume number)
完成:
ok unselect-dev
ok probe-scsi-all (to check for disks)
处理多个卷时,请确保它们按降序激活:
ok show-volumes
ok 1 activate-volume
ok 0 activate-volume
ok unselect-dev
ok probe-scsi-all (to check for disks)
如果不这样做将导致以下错误,并且需要在激活第二个卷之前重置所有内容:
ok 0 activate-volume
Volume 0 is now activated
ok 1 activate-volume
RAID volume request failed, ioc-status = 0007
Volume 1 could not be activated
激活后,第一个卷将保持“降级”状态,直到两个卷完成同步。
不要忘记返回值:
ok setenv fcode-debug? true
ok setenv auto-boot? true
ok reset-all