Linux 机器每天都会崩溃,这个内核堆栈跟踪意味着什么?

Linux 机器每天都会崩溃,这个内核堆栈跟踪意味着什么?

崩溃以完全挂起的形式出现。不再有控制并且屏幕冻结。

我使用以下方法在这台机器上获取了堆栈跟踪:

sudo journalctl -f

最后显示的消息是:

Nov 18 19:42:12  kernel: Bad mode in Error handler detected, code 0xbf000002 -- SError
Nov 18 19:42:12  kernel: Internal error: Oops - bad mode: 0 [#3] SMP
Nov 18 19:42:12  kernel: Modules linked in: algif_hash algif_skcipher af_alg btrfs xor raid6_pq 8188fu joydev bcmdhd uio_pdrv_genirq uio binfmt_misc sch_fq_codel bnep ip_tables x_tables
Nov 18 19:42:12  kernel: CPU: 3 PID: 3469 Comm: smbd Tainted: G      D W       4.4.179 #1
Nov 18 19:42:12  kernel: Hardware name: FriendlyElec NanoPi M4 (DT)
Nov 18 19:42:12  kernel: task: ffffffc0aedcd400 task.stack: ffffffc047ee8000
Nov 18 19:42:12  kernel: PC is at 0x7f78af9dfc
Nov 18 19:42:12  kernel: LR is at 0x7f78af9dd8
Nov 18 19:42:12  kernel: pc : [<0000007f78af9dfc>] lr : [<0000007f78af9dd8>] pstate: 80000000
Nov 18 19:42:12  kernel: sp : 0000007f6fbbe370
Nov 18 19:42:12  kernel: x29: 0000007f6fbbe370 x28: 00000055aa9fa870 
Nov 18 19:42:12  kernel: x27: 0000007f78afc640 x26: 0000000000000000 
Nov 18 19:42:12  kernel: x25: 00000055aa9fa898 x24: 0000007f6fbbe4e8 
Nov 18 19:42:12  kernel: x23: 0000000000000000 
Nov 18 19:42:12  kernel: Bad mode in Error handler detected, code 0xbf000002 -- SError
Nov 18 19:42:12  kernel: x22: 0000000000000000 
Nov 18 19:42:12  kernel: 
Nov 18 19:42:12  kernel: x21: 00000055aa9fa898 x20: 0000000000000000 
Nov 18 19:42:12  kernel: x19: 0000000000000189 x18: 0000000000000001 
Nov 18 19:42:12  kernel: x17: 0000000000000002 x16: 0000000000000002 
Nov 18 19:42:12  kernel: x15: 0000000000000000 x14: 002ffa52590473c3 
Nov 18 19:42:12  kernel: x13: 0000000063784283 x12: 0000000000000018 
Nov 18 19:42:12  kernel: x11: 000000003006b4dc x10: 0000000063784283 
Nov 18 19:42:12  kernel: x9 : 003b9aca00000000 x8 : 0000000000000062 
Nov 18 19:42:12  kernel: x7 : 0000007f6fbbe448 x6 : 0000000000000000 
Nov 18 19:42:12  kernel: x5 : 00000000ffffffff x4 : 0000000000000000 
Nov 18 19:42:12  kernel: x3 : 0000007f6fbbe4e8 x2 : 0000000000000000 
Nov 18 19:42:12  kernel: x1 : 0000000000000189 x0 : 0000000000000000 
Nov 18 19:42:12  kernel: 
Nov 18 19:42:12  kernel: Process smbd (pid: 3469, stack limit = 0xffffffc047ee8000)
Nov 18 19:42:12  kernel: ---[ end trace 5fba866947145e9b ]---
Nov 18 19:42:12  kernel: Bad mode in Error handler detected, code 0xbf000002 -- SError
Nov 18 19:42:12  kernel: Internal error: Oops - bad mode: 0 [#4] SMP
Nov 18 19:42:12  kernel: Modules linked in: algif_hash algif_skcipher af_alg btrfs xor raid6_pq 8188fu joydev bcmdhd uio_pdrv_genirq uio binfmt_misc sch_fq_codel bnep ip_tables x_tables
Nov 18 19:42:12  kernel: CPU: 5 PID: 3471 Comm: smbd Tainted: G      D W       4.4.179 #1
Nov 18 19:42:12  kernel: Hardware name: FriendlyElec NanoPi M4 (DT)
Nov 18 19:42:12  kernel: task: ffffffc0b72c8000 task.stack: ffffffc047d40000
Nov 18 19:42:12  kernel: PC is at 0x7f78af9dfc
Nov 18 19:42:12  kernel: LR is at 0x7f78af9dd8
Nov 18 19:42:12  kernel: pc : [<0000007f78af9dfc>] lr : [<0000007f78af9dd8>] pstate: 80000000
Nov 18 19:42:12  kernel: sp : 0000007f7240e370
Nov 18 19:42:12  kernel: x29: 0000007f7240e370 x28: 00000055aa9fa870 
Nov 18 19:42:12  kernel: x27: 0000007f78afc640 x26: 0000000000000000 
Nov 18 19:42:12  kernel: x25: 00000055aa9fa898 x24: 0000007f7240e4e8 
Nov 18 19:42:12  kernel: x23: 0000000000000000 x22: 0000000000000000 
Nov 18 19:42:12  kernel: x21: 00000055aa9fa898 x20: 0000000000000000 
Nov 18 19:42:12  kernel: x19: 0000000000000189 x18: 0000000000000000 
Nov 18 19:42:12  kernel: x17: 0000000000000004 x16: 0000000000000002 
Nov 18 19:42:12  kernel: x15: 0000000000000000 x14: 00302818e1b6bcc3 
Nov 18 19:42:12  kernel: x13: 0000000063784283 x12: 0000000000000018 
Nov 18 19:42:12  kernel: x11: 0000000030366a81 x10: 0000000063784283 
Nov 18 19:42:12  kernel: x9 : 003b9aca00000000 x8 : 0000000000000062 
Nov 18 19:42:12  kernel: x7 : 0000007f7240e448 x6 : 0000000000000000 
Nov 18 19:42:12  kernel: x5 : 00000000ffffffff x4 : 0000000000000000 
Nov 18 19:42:12  kernel: x3 : 0000007f7240e4e8 x2 : 0000000000000000 
Nov 18 19:42:12  kernel: x1 : 0000000000000189 x0 : 0000000000000000 
Nov 18 19:42:12  kernel: 
Nov 18 19:42:12  kernel: Process smbd (pid: 3471, stack limit = 0xffffffc047d40000)
Nov 18 19:42:12  kernel: ---[ end trace 5fba866947145e9c ]---
Nov 18 19:42:12  kernel: Internal error: Oops - bad mode: 0 [#5] SMP

当大量访问 NVME 驱动器时似乎会发生这种情况,但这可能只是与该问题相关。我使用 hdparm 并尝试关闭许多驱动器功能,但错误仍然存​​在。

我还尝试过更改 CPU 的时钟速度以及更改电源。这些效果不大

答案1

好的,在看到 Artem 的评论后,我尝试升级我的内核。然而,由于某种原因,我的 pi 不允许我启动到 GRUB,所以我运行了启动修复。我花了一段时间在这上面,但启动修复一直抱怨我磁盘空间不足。

我有 30MB 空闲空间,我的东西分区方式很糟糕,所以没有太多空间可以使用,但我能够释放另外 700MB 空间,从那时起崩溃就消失了。

事实证明,这毕竟是 NVME 驱动程序。使用 USB 代替解决了这个问题

相关内容