Kernel log - regular "attempting task abort - Power-on or device reset" errors

About once every two weeks, errors like this show up in my kernel log:

[Wed Jul  6 16:11:14 2022] sd 0:0:4:0: attempting task abort! scmd(000000006f6a751f)
[Wed Jul  6 16:11:14 2022] sd 0:0:4:0: [sde] tag#3471 CDB: Synchronize Cache(10) 35 00 00 00 00 00 00 00 00 00
[Wed Jul  6 16:11:14 2022] scsi target0:0:4: handle(0x001d), sas_address(0x443322110b000000), phy(11)
[Wed Jul  6 16:11:14 2022] scsi target0:0:4: enclosure logical id(0x500062b206412140), slot(17) 
[Wed Jul  6 16:11:14 2022] scsi target0:0:4: enclosure level(0x0000), connector name(     )
[Wed Jul  6 16:11:14 2022] sd 0:0:4:0: task abort: SUCCESS scmd(000000006f6a751f)
[Wed Jul  6 16:11:14 2022] sd 0:0:4:0: attempting task abort! scmd(000000005203b095)
[Wed Jul  6 16:11:14 2022] sd 0:0:4:0: [sde] tag#3012 CDB: Read(16) 88 00 00 00 00 02 a5 27 a8 48 00 00 01 00 00 00
[Wed Jul  6 16:11:14 2022] scsi target0:0:4: handle(0x001d), sas_address(0x443322110b000000), phy(11)
[Wed Jul  6 16:11:14 2022] scsi target0:0:4: enclosure logical id(0x500062b206412140), slot(17) 
[Wed Jul  6 16:11:14 2022] scsi target0:0:4: enclosure level(0x0000), connector name(     )
[Wed Jul  6 16:11:14 2022] sd 0:0:4:0: task abort: SUCCESS scmd(000000005203b095)
[Wed Jul  6 16:11:14 2022] sd 0:0:4:0: [sde] tag#3012 FAILED Result: hostbyte=DID_TIME_OUT driverbyte=DRIVER_OK
[Wed Jul  6 16:11:14 2022] sd 0:0:4:0: [sde] tag#3012 CDB: Read(16) 88 00 00 00 00 02 a5 27 a8 48 00 00 01 00 00 00
[Wed Jul  6 16:11:14 2022] print_req_error: I/O error, dev sde, sector 11360774216
[Wed Jul  6 16:11:14 2022] sd 0:0:4:0: attempting task abort! scmd(00000000baf88a87)
[Wed Jul  6 16:11:14 2022] sd 0:0:4:0: [sde] tag#3011 CDB: Read(16) 88 00 00 00 00 02 a5 27 a3 48 00 00 01 00 00 00
[Wed Jul  6 16:11:14 2022] scsi target0:0:4: handle(0x001d), sas_address(0x443322110b000000), phy(11)
[Wed Jul  6 16:11:14 2022] scsi target0:0:4: enclosure logical id(0x500062b206412140), slot(17) 
[Wed Jul  6 16:11:14 2022] scsi target0:0:4: enclosure level(0x0000), connector name(     )
[Wed Jul  6 16:11:14 2022] sd 0:0:4:0: task abort: SUCCESS scmd(00000000baf88a87)
[Wed Jul  6 16:11:14 2022] sd 0:0:4:0: [sde] tag#3011 FAILED Result: hostbyte=DID_TIME_OUT driverbyte=DRIVER_OK
[Wed Jul  6 16:11:14 2022] sd 0:0:4:0: [sde] tag#3011 CDB: Read(16) 88 00 00 00 00 02 a5 27 a3 48 00 00 01 00 00 00
[Wed Jul  6 16:11:14 2022] print_req_error: I/O error, dev sde, sector 11360772936
[Wed Jul  6 16:11:14 2022] sd 0:0:4:0: Power-on or device reset occurred
[Wed Jul  6 16:11:15 2022] mpt3sas_cm0: log_info(0x31110e03): originator(PL), code(0x11), sub_code(0x0e03)
[Wed Jul  6 16:11:15 2022] sd 0:0:4:0: [sde] tag#2451 FAILED Result: hostbyte=DID_SOFT_ERROR driverbyte=DRIVER_OK
[Wed Jul  6 16:11:15 2022] sd 0:0:4:0: [sde] tag#3453 FAILED Result: hostbyte=DID_SOFT_ERROR driverbyte=DRIVER_OK
[Wed Jul  6 16:11:15 2022] sd 0:0:4:0: [sde] tag#3200 FAILED Result: hostbyte=DID_SOFT_ERROR driverbyte=DRIVER_OK
[Wed Jul  6 16:11:15 2022] sd 0:0:4:0: [sde] tag#3453 CDB: Read(16) 88 00 00 00 00 05 74 ff fd 20 00 00 00 08 00 00
[Wed Jul  6 16:11:15 2022] print_req_error: I/O error, dev sde, sector 23437770016
[Wed Jul  6 16:11:15 2022] sd 0:0:4:0: [sde] tag#2451 CDB: Read(16) 88 00 00 00 00 01 fd 8e 63 38 00 00 01 00 00 00
[Wed Jul  6 16:11:15 2022] print_req_error: I/O error, dev sde, sector 8548934456
[Wed Jul  6 16:11:15 2022] sd 0:0:4:0: [sde] tag#3200 CDB: Read(16) 88 00 00 00 00 01 fd 8e 64 38 00 00 01 00 00 00
[Wed Jul  6 16:11:15 2022] print_req_error: I/O error, dev sde, sector 8548934712
[Wed Jul  6 16:11:15 2022] mpt3sas_cm0: log_info(0x31110e03): originator(PL), code(0x11), sub_code(0x0e03)
[Wed Jul  6 16:11:15 2022] mpt3sas_cm0: log_info(0x31110e03): originator(PL), code(0x11), sub_code(0x0e03)
[Wed Jul  6 16:11:15 2022] mpt3sas_cm0: log_info(0x31110e03): originator(PL), code(0x11), sub_code(0x0e03)
[Wed Jul  6 16:11:15 2022] sd 0:0:4:0: Power-on or device reset occurred
[Wed Jul  6 16:11:15 2022] sd 0:0:4:0: [sde] tag#2050 FAILED Result: hostbyte=DID_SOFT_ERROR driverbyte=DRIVER_OK
[Wed Jul  6 16:11:15 2022] sd 0:0:4:0: [sde] tag#2504 FAILED Result: hostbyte=DID_SOFT_ERROR driverbyte=DRIVER_OK
[Wed Jul  6 16:11:15 2022] sd 0:0:4:0: [sde] tag#2050 CDB: Write(16) 8a 00 00 00 00 05 26 99 8f 68 00 00 00 08 00 00
[Wed Jul  6 16:11:15 2022] print_req_error: I/O error, dev sde, sector 22122434408
[Wed Jul  6 16:11:15 2022] sd 0:0:4:0: [sde] tag#2504 CDB: Read(16) 88 00 00 00 00 00 00 00 20 00 00 00 00 08 00 00
[Wed Jul  6 16:11:15 2022] sd 0:0:4:0: [sde] tag#3203 FAILED Result: hostbyte=DID_SOFT_ERROR driverbyte=DRIVER_OK
[Wed Jul  6 16:11:15 2022] sd 0:0:4:0: [sde] tag#3203 CDB: Read(16) 88 00 00 00 00 02 a5 27 ad 48 00 00 01 00 00 00
[Wed Jul  6 16:11:15 2022] print_req_error: I/O error, dev sde, sector 11360775496
[Wed Jul  6 16:11:15 2022] sd 0:0:4:0: [sde] tag#2505 FAILED Result: hostbyte=DID_SOFT_ERROR driverbyte=DRIVER_OK
[Wed Jul  6 16:11:15 2022] sd 0:0:4:0: [sde] tag#2505 CDB: Read(16) 88 00 00 00 00 02 a5 27 ac 48 00 00 01 00 00 00
[Wed Jul  6 16:11:15 2022] print_req_error: I/O error, dev sde, sector 11360775240
[Wed Jul  6 16:11:15 2022] mpt3sas_cm0: log_info(0x31110e03): originator(PL), code(0x11), sub_code(0x0e03)
[Wed Jul  6 16:11:15 2022] mpt3sas_cm0: log_info(0x31110e03): originator(PL), code(0x11), sub_code(0x0e03)
[Wed Jul  6 16:11:15 2022] mpt3sas_cm0: log_info(0x31110e03): originator(PL), code(0x11), sub_code(0x0e03)
[Wed Jul  6 16:11:15 2022] print_req_error: I/O error, dev sde, sector 8192
[Wed Jul  6 16:11:15 2022] mpt3sas_cm0: log_info(0x31110e03): originator(PL), code(0x11), sub_code(0x0e03)
[Wed Jul  6 16:11:16 2022] sd 0:0:4:0: Power-on or device reset occurred
[Wed Jul  6 16:11:16 2022] sd 0:0:4:0: [sde] tag#2615 FAILED Result: hostbyte=DID_SOFT_ERROR driverbyte=DRIVER_OK
[Wed Jul  6 16:11:16 2022] print_req_error: I/O error, dev sde, sector 22122434448
[Wed Jul  6 16:11:16 2022] mpt3sas_cm0: log_info(0x31110e03): originator(PL), code(0x11), sub_code(0x0e03)
[Wed Jul  6 16:11:16 2022] mpt3sas_cm0: log_info(0x31110e03): originator(PL), code(0x11), sub_code(0x0e03)
[Wed Jul  6 16:11:16 2022] sd 0:0:4:0: [sde] tag#2615 CDB: Write(16) 8a 00 00 00 00 05 26 99 8f a0 00 00 00 08 00 00
[Wed Jul  6 16:11:16 2022] mpt3sas_cm0: log_info(0x31110e03): originator(PL), code(0x11), sub_code(0x0e03)
[Wed Jul  6 16:11:16 2022] mpt3sas_cm0: log_info(0x31110e03): originator(PL), code(0x11), sub_code(0x0e03)
[Wed Jul  6 16:11:16 2022] mpt3sas_cm0: log_info(0x31110e03): originator(PL), code(0x11), sub_code(0x0e03)
[Wed Jul  6 16:11:16 2022] mpt3sas_cm0: log_info(0x31110e03): originator(PL), code(0x11), sub_code(0x0e03)
[Wed Jul  6 16:11:16 2022] mpt3sas_cm0: log_info(0x31110e03): originator(PL), code(0x11), sub_code(0x0e03)
[Wed Jul  6 16:11:16 2022] sd 0:0:4:0: Power-on or device reset occurred
[Wed Jul  6 16:11:17 2022] sd 0:0:4:0: Power-on or device reset occurred
[Wed Jul  6 17:31:04 2022] sd 0:0:8:0: attempting task abort! scmd(00000000685dac60)
[Wed Jul  6 17:31:04 2022] sd 0:0:8:0: [sdi] tag#371 CDB: Read(16) 88 00 00 00 00 05 23 d4 00 e0 00 00 01 00 00 00
[Wed Jul  6 17:31:04 2022] scsi target0:0:8: handle(0x0021), sas_address(0x4433221113000000), phy(19)
[Wed Jul  6 17:31:04 2022] scsi target0:0:8: enclosure logical id(0x500062b206412140), slot(9) 
[Wed Jul  6 17:31:04 2022] scsi target0:0:8: enclosure level(0x0000), connector name(     )
[Wed Jul  6 17:31:04 2022] sd 0:0:8:0: task abort: SUCCESS scmd(00000000685dac60)
[Wed Jul  6 17:31:04 2022] scsi_io_completion_action: 6 callbacks suppressed
[Wed Jul  6 17:31:04 2022] sd 0:0:8:0: [sdi] tag#371 FAILED Result: hostbyte=DID_TIME_OUT driverbyte=DRIVER_OK
[Wed Jul  6 17:31:04 2022] sd 0:0:8:0: [sdi] tag#371 CDB: Read(16) 88 00 00 00 00 05 23 d4 00 e0 00 00 01 00 00 00
[Wed Jul  6 17:31:04 2022] print_req_error: 6 callbacks suppressed
[Wed Jul  6 17:31:04 2022] print_req_error: I/O error, dev sdi, sector 22075932896
[Wed Jul  6 17:31:04 2022] sd 0:0:8:0: attempting task abort! scmd(00000000c7dc4ce2)
[Wed Jul  6 17:31:04 2022] sd 0:0:8:0: [sdi] tag#370 CDB: Read(16) 88 00 00 00 00 05 23 d3 ea e0 00 00 01 00 00 00
[Wed Jul  6 17:31:04 2022] scsi target0:0:8: handle(0x0021), sas_address(0x4433221113000000), phy(19)
[Wed Jul  6 17:31:04 2022] scsi target0:0:8: enclosure logical id(0x500062b206412140), slot(9) 
[Wed Jul  6 17:31:04 2022] scsi target0:0:8: enclosure level(0x0000), connector name(     )
[Wed Jul  6 17:31:04 2022] sd 0:0:8:0: task abort: SUCCESS scmd(00000000c7dc4ce2)
[Wed Jul  6 17:31:04 2022] sd 0:0:8:0: [sdi] tag#370 FAILED Result: hostbyte=DID_TIME_OUT driverbyte=DRIVER_OK
[Wed Jul  6 17:31:04 2022] sd 0:0:8:0: [sdi] tag#370 CDB: Read(16) 88 00 00 00 00 05 23 d3 ea e0 00 00 01 00 00 00
[Wed Jul  6 17:31:04 2022] print_req_error: I/O error, dev sdi, sector 22075927264
[Wed Jul  6 17:31:04 2022] sd 0:0:8:0: attempting task abort! scmd(00000000d5697c0a)
[Wed Jul  6 17:31:04 2022] sd 0:0:8:0: [sdi] tag#16 CDB: Synchronize Cache(10) 35 00 00 00 00 00 00 00 00 00
[Wed Jul  6 17:31:04 2022] scsi target0:0:8: handle(0x0021), sas_address(0x4433221113000000), phy(19)
[Wed Jul  6 17:31:04 2022] scsi target0:0:8: enclosure logical id(0x500062b206412140), slot(9) 
[Wed Jul  6 17:31:04 2022] scsi target0:0:8: enclosure level(0x0000), connector name(     )
[Wed Jul  6 17:31:04 2022] sd 0:0:8:0: task abort: SUCCESS scmd(00000000d5697c0a)
[Wed Jul  6 17:31:04 2022] sd 0:0:8:0: Power-on or device reset occurred
[Wed Jul  6 17:31:05 2022] mpt3sas_cm0: log_info(0x31110e03): originator(PL), code(0x11), sub_code(0x0e03)
[Wed Jul  6 17:31:05 2022] sd 0:0:8:0: [sdi] tag#4 FAILED Result: hostbyte=DID_SOFT_ERROR driverbyte=DRIVER_OK
[Wed Jul  6 17:31:05 2022] sd 0:0:8:0: [sdi] tag#4 CDB: Read(16) 88 00 00 00 00 00 00 00 00 08 00 00 00 08 00 00
[Wed Jul  6 17:31:05 2022] print_req_error: I/O error, dev sdi, sector 8
[Wed Jul  6 17:31:05 2022] sd 0:0:8:0: [sdi] tag#736 FAILED Result: hostbyte=DID_SOFT_ERROR driverbyte=DRIVER_OK
[Wed Jul  6 17:31:05 2022] sd 0:0:8:0: [sdi] tag#736 CDB: Read(16) 88 00 00 00 00 04 c8 4d fc 38 00 00 00 08 00 00
[Wed Jul  6 17:31:05 2022] print_req_error: I/O error, dev sdi, sector 20540423224
[Wed Jul  6 17:31:05 2022] mpt3sas_cm0: log_info(0x31110e03): originator(PL), code(0x11), sub_code(0x0e03)
[Wed Jul  6 17:31:05 2022] mpt3sas_cm0: log_info(0x31110e03): originator(PL), code(0x11), sub_code(0x0e03)
[Wed Jul  6 17:31:05 2022] sd 0:0:8:0: [sdi] tag#735 FAILED Result: hostbyte=DID_SOFT_ERROR driverbyte=DRIVER_OK
[Wed Jul  6 17:31:05 2022] sd 0:0:8:0: [sdi] tag#735 CDB: Read(16) 88 00 00 00 00 04 70 9a 87 30 00 00 01 00 00 00
[Wed Jul  6 17:31:05 2022] print_req_error: I/O error, dev sdi, sector 19069044528
[Wed Jul  6 17:31:05 2022] mpt3sas_cm0: log_info(0x31110e03): originator(PL), code(0x11), sub_code(0x0e03)
[Wed Jul  6 17:31:05 2022] sd 0:0:8:0: Power-on or device reset occurred
[Wed Jul  6 17:31:06 2022] mpt3sas_cm0: log_info(0x31110e03): originator(PL), code(0x11), sub_code(0x0e03)
[Wed Jul  6 17:31:06 2022] sd 0:0:8:0: [sdi] tag#5726 FAILED Result: hostbyte=DID_SOFT_ERROR driverbyte=DRIVER_OK
[Wed Jul  6 17:31:06 2022] sd 0:0:8:0: [sdi] tag#5723 FAILED Result: hostbyte=DID_SOFT_ERROR driverbyte=DRIVER_OK
[Wed Jul  6 17:31:06 2022] sd 0:0:8:0: [sdi] tag#5726 CDB: Read(16) 88 00 00 00 00 01 53 df 28 00 00 00 01 00 00 00
[Wed Jul  6 17:31:06 2022] print_req_error: I/O error, dev sdi, sector 5702100992
[Wed Jul  6 17:31:06 2022] sd 0:0:8:0: [sdi] tag#939 FAILED Result: hostbyte=DID_SOFT_ERROR driverbyte=DRIVER_OK
[Wed Jul  6 17:31:06 2022] sd 0:0:8:0: [sdi] tag#5723 CDB: Read(16) 88 00 00 00 00 05 74 ff fc 20 00 00 00 08 00 00
[Wed Jul  6 17:31:06 2022] print_req_error: I/O error, dev sdi, sector 23437769760
[Wed Jul  6 17:31:06 2022] sd 0:0:8:0: [sdi] tag#939 CDB: Read(16) 88 00 00 00 00 05 23 d3 fc e0 00 00 01 00 00 00
[Wed Jul  6 17:31:06 2022] print_req_error: I/O error, dev sdi, sector 22075931872
[Wed Jul  6 17:31:06 2022] mpt3sas_cm0: log_info(0x31110e03): originator(PL), code(0x11), sub_code(0x0e03)
[Wed Jul  6 17:31:06 2022] mpt3sas_cm0: log_info(0x31110e03): originator(PL), code(0x11), sub_code(0x0e03)
[Wed Jul  6 17:31:06 2022] mpt3sas_cm0: log_info(0x31110e03): originator(PL), code(0x11), sub_code(0x0e03)
[Wed Jul  6 17:31:06 2022] sd 0:0:8:0: Power-on or device reset occurred
[Wed Jul  6 17:31:06 2022] sd 0:0:8:0: [sdi] tag#5738 FAILED Result: hostbyte=DID_SOFT_ERROR driverbyte=DRIVER_OK
[Wed Jul  6 17:31:06 2022] sd 0:0:8:0: [sdi] tag#5693 FAILED Result: hostbyte=DID_SOFT_ERROR driverbyte=DRIVER_OK
[Wed Jul  6 17:31:06 2022] print_req_error: I/O error, dev sdi, sector 22238540184
[Wed Jul  6 17:31:06 2022] sd 0:0:8:0: [sdi] tag#5693 CDB: Write(16) 8a 00 00 00 00 00 b9 9c 77 18 00 00 01 00 00 00
[Wed Jul  6 17:31:06 2022] print_req_error: I/O error, dev sdi, sector 3114039064
[Wed Jul  6 17:31:06 2022] sd 0:0:8:0: [sdi] tag#5738 CDB: Read(16) 88 00 00 00 00 05 74 ff ff 88 00 00 00 38 00 00
[Wed Jul  6 17:31:06 2022] mpt3sas_cm0: log_info(0x31110e03): originator(PL), code(0x11), sub_code(0x0e03)
[Wed Jul  6 17:31:06 2022] mpt3sas_cm0: log_info(0x31110e03): originator(PL), code(0x11), sub_code(0x0e03)
[Wed Jul  6 17:31:06 2022] mpt3sas_cm0: log_info(0x31110e03): originator(PL), code(0x11), sub_code(0x0e03)
[Wed Jul  6 17:31:06 2022] mpt3sas_cm0: log_info(0x31110e03): originator(PL), code(0x11), sub_code(0x0e03)
[Wed Jul  6 17:31:07 2022] sd 0:0:8:0: Power-on or device reset occurred
[Wed Jul  6 17:31:07 2022] sd 0:0:8:0: Power-on or device reset occurred
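The repeated `mpt3sas_cm0: log_info(0x31110e03)` lines pack several fields into one 32-bit word. Here is a minimal sketch of how that word splits into the fields the driver prints. The bit positions are inferred from the log line itself (`originator(PL), code(0x11), sub_code(0x0e03)`); the name `decode_sas_log_info` and the treatment of the top nibble as a bus type are assumptions for illustration, not an official API.

```python
def decode_sas_log_info(value: int) -> dict:
    """Split a 32-bit mpt3sas log_info word into its component fields.

    Assumed layout (inferred from the log output above):
      bits 31-28: bus type, bits 27-24: originator,
      bits 23-16: code,     bits 15-0:  sub_code
    """
    # Originator names as they appear in the driver's log messages.
    originators = {0: "IOP", 1: "PL", 2: "IR"}
    originator = (value >> 24) & 0xF
    return {
        "bus_type": (value >> 28) & 0xF,
        "originator": originators.get(originator, "unknown"),
        "code": (value >> 16) & 0xFF,
        "sub_code": value & 0xFFFF,
    }


info = decode_sas_log_info(0x31110E03)
print(info["originator"], hex(info["code"]), hex(info["sub_code"]))
```

For the value in this log, the decoded fields match what the driver already printed, so the sketch is mainly useful for comparing against other `log_info` values that may appear.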

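I have about 20 SATA drives attached to the SATA/SAS controller on this server, and the error occurs on many (but not all) of them, with some drives triggering it more often than others. The problem seems to correlate with filesystem load (higher load => errors more likely). Until today it affected only one drive at a time, and since all of my drives are mirrored, I could resync the failed mirror whenever it happened. For two years I have searched Google and various support forums without success, and the problem has kept nagging at me. Today, however, both halves of a two-drive mirror hit the same failure within one hour, which makes solving this far more urgent. My guess is that this is a hardware/controller problem, but I don't know how to check whether that is the case or, if it is, how to fix it. Any help would be greatly appreciated. Thanks.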
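To put numbers on which drives are hit most often, the events can be tallied per device from a saved copy of the kernel log. The following is a rough sketch that assumes the log lines look exactly like the excerpt above; the function name `tally` and the sample lines are only illustrative.

```python
import re
from collections import Counter

# Matches lines such as:
#   [...] print_req_error: I/O error, dev sde, sector 11360774216
IO_ERROR = re.compile(r"I/O error, dev (\w+), sector (\d+)")
# Matches lines such as:
#   [...] sd 0:0:4:0: Power-on or device reset occurred
RESET = re.compile(r"sd (\d+:\d+:\d+:\d+): Power-on or device reset occurred")


def tally(lines):
    """Count I/O errors per block device and resets per SCSI address."""
    io_errors, resets = Counter(), Counter()
    for line in lines:
        match = IO_ERROR.search(line)
        if match:
            io_errors[match.group(1)] += 1
            continue
        match = RESET.search(line)
        if match:
            resets[match.group(1)] += 1
    return io_errors, resets


# A few lines copied from the log above, used as sample input.
sample = [
    "[Wed Jul  6 16:11:14 2022] print_req_error: I/O error, dev sde, sector 11360774216",
    "[Wed Jul  6 16:11:14 2022] sd 0:0:4:0: Power-on or device reset occurred",
    "[Wed Jul  6 17:31:04 2022] print_req_error: I/O error, dev sdi, sector 22075932896",
    "[Wed Jul  6 17:31:04 2022] sd 0:0:8:0: Power-on or device reset occurred",
]
io_errors, resets = tally(sample)
for dev, count in io_errors.most_common():
    print(dev, count)
```

On a real system the input could come from a file written by `dmesg -T > kernel.log` and passed in as `tally(open("kernel.log"))`. Comparing the counts against the enclosure slot or phy numbers in the log might show whether the affected drives share a cable or backplane port.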
