AMD Ryzen ThreadRipper 的家用服务器继续关闭

AMD Ryzen ThreadRipper 的家用服务器继续关闭

我的家庭服务器规格

  • AMD RYZEN Threadripper PRO 5955WX
  • 华硕 PRO WS WRX80E-SAGE SE WIFI
  • 三星 DDR4 64GB * 8
  • Corsair HX750 80PLUS 白金版

安装上述规范,使用 Ubuntu 220.4 LXC 安装并将配置好的字词安装到容器中。

启动LXC控制器时没有问题,启动时也没有问题。启动容器并操作Docker容器容器并操作Docker容器容器,然后出现以下错误。

Feb 25 02:17:01 v4 CRON[35659]: pam_unix(cron:session): session opened for user root(uid=0) by (uid=0)  
Feb 25 02:17:01 v4 CRON[35660]: (root) CMD ( cd / && run-parts --report /etc/cron.hourly)  
Feb 25 02:17:01 v4 CRON[35659]: pam_unix(cron:session): session closed for user root  
Feb 25 02:21:37 v4 kernel: mce: [Hardware Error]: Machine check events logged  
Feb 25 02:21:37 v4 kernel: [Hardware Error]: Corrected error, no action required.  
Feb 25 02:21:37 v4 kernel: [Hardware Error]: CPU:1 (19:8:2) MC1_STATUS[Over|CE|MiscV|-|-|-|SyndV|-|-|-]: 0xd8200000060a0859  
Feb 25 02:21:37 v4 kernel: [Hardware Error]: PPIN: 0x02b68f671f2d007b  
Feb 25 02:21:37 v4 kernel: [Hardware Error]: IPID: 0x000100b000000000, Syndrome: 0x000000005a000586  
Feb 25 02:21:37 v4 kernel: [Hardware Error]: Instruction Fetch Unit Ext. Error Code: 10, L1 BTB Multi-Match Error.  
Feb 25 02:21:37 v4 kernel: [Hardware Error]: cache level: L1, mem/io: IO, mem-tx: IRD, part-proc: SRC (no timeout)  
Feb 25 02:24:34 v4 pmxcfs[1191]: [dcdb] notice: data verification successful  
Feb 25 02:31:05 v4 pvedaemon[1316]: <root@pam> successful auth for user 'root@pam'  
Feb 25 02:46:30 v4 pvedaemon[1317]: <root@pam> successful auth for user 'root@pam'  
Feb 25 02:52:45 v4 kernel: mce: [Hardware Error]: Machine check events logged  
Feb 25 02:52:45 v4 kernel: [Hardware Error]: Corrected error, no action required.  
Feb 25 02:52:45 v4 kernel: [Hardware Error]: CPU:1 (19:8:2) MC1_STATUS[Over|CE|MiscV|-|-|-|SyndV|-|-|-]: 0xd8200000060a0859  
Feb 25 02:52:45 v4 kernel: [Hardware Error]: PPIN: 0x02b68f671f2d007b  
Feb 25 02:52:45 v4 kernel: [Hardware Error]: IPID: 0x000100b000000000, Syndrome: 0x000000005a000581  
Feb 25 02:52:45 v4 kernel: [Hardware Error]: Instruction Fetch Unit Ext. Error Code: 10, L1 BTB Multi-Match Error.  
Feb 25 02:52:45 v4 kernel: [Hardware Error]: cache level: L1, mem/io: IO, mem-tx: IRD, part-proc: SRC (no timeout)
Feb 25 03:03:38 v4 pvedaemon[1315]: <root@pam> successful auth for user 'root@pam'  
Feb 25 03:10:01 v4 CRON[49309]: pam_unix(cron:session): session opened for user root(uid=0) by (uid=0)  
Feb 25 03:10:01 v4 CRON[49310]: (root) CMD (test -e /run/systemd/system || SERVICE_MODE=1 /sbin/e2scrub_all -A -r)  
Feb 25 03:10:01 v4 CRON[49309]: pam_unix(cron:session): session closed for user root  
Feb 25 03:17:01 v4 CRON[51083]: pam_unix(cron:session): session opened for user root(uid=0) by (uid=0)  
Feb 25 03:17:01 v4 CRON[51084]: (root) CMD ( cd / && run-parts --report /etc/cron.hourly)  
Feb 25 03:17:01 v4 CRON[51083]: pam_unix(cron:session): session closed for user root  
Feb 25 03:24:34 v4 pmxcfs[1191]: [dcdb] notice: data verification successful  
Feb 25 03:29:04 v4 kernel: mce: [Hardware Error]: Machine check events logged  
Feb 25 03:29:04 v4 kernel: [Hardware Error]: Corrected error, no action required.  
Feb 25 03:29:04 v4 kernel: [Hardware Error]: CPU:1 (19:8:2) MC1_STATUS[Over|CE|MiscV|-|-|-|SyndV|-|-|-]: 0xd8200000060a0859  
Feb 25 03:29:04 v4 kernel: [Hardware Error]: PPIN: 0x02b68f671f2d007b  
Feb 25 03:29:04 v4 kernel: [Hardware Error]: IPID: 0x000100b000000000, Syndrome: 0x000000005a000a98  
Feb 25 03:29:04 v4 kernel: [Hardware Error]: Instruction Fetch Unit Ext. Error Code: 10, L1 BTB Multi-Match Error.  
Feb 25 03:29:04 v4 kernel: [Hardware Error]: cache level: L1, mem/io: IO, mem-tx: IRD, part-proc: SRC (no timeout)  
Feb 25 03:29:08 v4 pvedaemon[1317]: <root@pam> successful auth for user 'root@pam'  
-- Reboot --
Instruction Fetch Unit Ext. Error Code: 10, L1 BTB Multi-Match Error.

我对这句话感到怀疑,于是搜索了一下,发现主板上的 BIOS 更新解决了这个问题。

因此我更新了最新版本的BIOS。

即使更新BIOS之后,上述症状仍然如此。

相关内容