System freezes frequently on an AMD Ryzen 2700X + RTX 2080 + Ubuntu 18.04 stack

I use the setup above for deep learning computations with Python and MXNet.

When I use Unity/GNOME, journalctl reports something like this just before the crash (a GNOME-related process always crashes first):

kov. 28 14:04:51 emil-NNNgine gnome-control-c[19370]: g_object_unref: assertion 'G_IS_OBJECT (object)' failed
kov. 28 14:04:51 emil-NNNgine gnome-control-c[19370]: g_object_unref: assertion 'G_IS_OBJECT (object)' failed
kov. 28 14:04:51 emil-NNNgine gnome-control-c[19370]: g_object_unref: assertion 'G_IS_OBJECT (object)' failed
kov. 28 14:04:51 emil-NNNgine gnome-control-c[19370]: g_object_unref: assertion 'G_IS_OBJECT (object)' failed
kov. 28 14:04:51 emil-NNNgine gnome-control-c[19370]: g_object_unref: assertion 'G_IS_OBJECT (object)' failed
kov. 28 14:17:01 emil-NNNgine CRON[22454]: pam_unix(cron:session): session opened for user root by (uid=0)
kov. 28 14:17:01 emil-NNNgine CRON[22455]: (root) CMD (cd / && run-parts --report /etc/cron.hourly)
kov. 28 14:17:01 emil-NNNgine CRON[22454]: pam_unix(cron:session): session closed for user root
kov. 28 14:20:03 emil-NNNgine gnome-shell[1564]: Object .Gjs_AppIndicatorIconActor__1 (0x563d31d7d9d0), has been already finalized. Impossible
kov. 28 14:20:03 emil-NNNgine org.gnome.Shell.desktop[1564]: == Stack trace for context 0x563d2cba9330 ==
kov. 28 14:20:03 emil-NNNgine org.gnome.Shell.desktop[1564]: #0 0x7ffd571eb540 b resource:///org/gnome/gjs/modules/_legacy.js:83 (0x7f1288
kov. 28 14:20:03 emil-NNNgine org.gnome.Shell.desktop[1564]: #1 0x563d2d043948 i /usr/share/gnome-shell/extensions/ubuntu-appindicators@ub
kov. 28 14:20:03 emil-NNNgine org.gnome.Shell.desktop[1564]: #2 0x7ffd571ec8a0 b resource:///org/gnome/gjs/modules/_legacy.js:82 (0x7f1288
kov. 28 14:20:03 emil-NNNgine org.gnome.Shell.desktop[1564]: #3 0x7ffd571ec960 b self-hosted:916 (0x7f12886f12b8 @ 367)
kov. 28 14:20:03 emil-NNNgine org.gnome.Shell.desktop[1564]: #4 0x7ffd571eca50 b resource:///org/gnome/gjs/modules/signals.js:128 (0x7f128
kov. 28 14:20:03 emil-NNNgine org.gnome.Shell.desktop[1564]: #5 0x563d2d0438c0 i /usr/share/gnome-shell/extensions/ubuntu-appindicators@ub
kov. 28 14:20:03 emil-NNNgine org.gnome.Shell.desktop[1564]: #6 0x7ffd571eddb0 b resource:///org/gnome/gjs/modules/_legacy.js:82 (0x7f1288
kov. 28 14:20:03 emil-NNNgine org.gnome.Shell.desktop[1564]: #7 0x563d2d043818 i /usr/share/gnome-shell/extensions/ubuntu-appindicators@ub
kov. 28 14:20:03 emil-NNNgine org.gnome.Shell.desktop[1564]: #8 0x7ffd571ef110 b resource:///org/gnome/gjs/modules/_legacy.js:82 (0x7f1288
kov. 28 14:20:03 emil-NNNgine org.gnome.Shell.desktop[1564]: #9 0x563d2d0437a0 i /usr/share/gnome-shell/extensions/ubuntu-appindicators@ub
kov. 28 14:20:03 emil-NNNgine org.gnome.Shell.desktop[1564]: #10 0x563d2d0436e0 i resource:///org/gnome/shell/ui/extensionSystem.js:82 (0x
kov. 28 14:20:03 emil-NNNgine org.gnome.Shell.desktop[1564]: #11 0x563d2d043660 i resource:///org/gnome/shell/ui/extensionSystem.js:344 (0
kov. 28 14:20:03 emil-NNNgine org.gnome.Shell.desktop[1564]: #12 0x7ffd571efe80 b self-hosted:251 (0x7f12886c4ab0 @ 223)
kov. 28 14:20:03 emil-NNNgine org.gnome.Shell.desktop[1564]: #13 0x563d2d0435e0 i resource:///org/gnome/shell/ui/extensionSystem.js:343 (0
kov. 28 14:20:03 emil-NNNgine org.gnome.Shell.desktop[1564]: #14 0x563d2d043560 i resource:///org/gnome/shell/ui/extensionSystem.js:361 (0
kov. 28 14:20:03 emil-NNNgine org.gnome.Shell.desktop[1564]: #15 0x7ffd571f1380 b resource:///org/gnome/gjs/modules/signals.js:128 (0x7f12
kov. 28 14:20:03 emil-NNNgine org.gnome.Shell.desktop[1564]: #16 0x7ffd571f1b90 b resource:///org/gnome/shell/ui/sessionMode.js:205 (0x7f1
kov. 28 14:20:03 emil-NNNgine org.gnome.Shell.desktop[1564]: #17 0x7ffd571f2870 I resource:///org/gnome/gjs/modules/_legacy.js:82 (0x7f128
kov. 28 14:20:03 emil-NNNgine org.gnome.Shell.desktop[1564]: #18 0x563d2d043420 i resource:///org/gnome/shell/ui/sessionMode.js:167 (0x7f1
kov. 28 14:20:03 emil-NNNgine org.gnome.Shell.desktop[1564]: #19 0x7ffd571f3450 I resource:///org/gnome/gjs/modules/_legacy.js:82 (0x7f128
kov. 28 14:20:03 emil-NNNgine org.gnome.Shell.desktop[1564]: #20 0x563d2d043378 i resource:///org/gnome/shell/ui/screenShield.js:1282 (0x7
kov. 28 14:20:03 emil-NNNgine org.gnome.Shell.desktop[1564]: #21 0x7ffd571f4030 I resource:///org/gnome/gjs/modules/_legacy.js:82 (0x7f128
kov. 28 14:20:03 emil-NNNgine org.gnome.Shell.desktop[1564]: #22 0x563d2d0432f0 i resource:///org/gnome/shell/ui/screenShield.js:902 (0x7f
kov. 28 14:20:03 emil-NNNgine org.gnome.Shell.desktop[1564]: #23 0x7ffd571f4c10 I resource:///org/gnome/gjs/modules/_legacy.js:82 (0x7f128
kov. 28 14:20:03 emil-NNNgine org.gnome.Shell.desktop[1564]: #24 0x7ffd571f53b0 b self-hosted:916 (0x7f12886f12b8 @ 367)
kov. 28 14:20:03 emil-NNNgine org.gnome.Shell.desktop[1564]: #25 0x7ffd571f54a0 b resource:///org/gnome/gjs/modules/signals.js:128 (0x7f12
kov. 28 14:20:03 emil-NNNgine org.gnome.Shell.desktop[1564]: #26 0x563d2d043270 i resource:///org/gnome/shell/ui/lightbox.js:186 (0x7f1288
kov. 28 14:20:03 emil-NNNgine org.gnome.Shell.desktop[1564]: #27 0x7ffd571f6940 b resource:///org/gnome/gjs/modules/tweener/tweener.js:208
kov. 28 14:20:03 emil-NNNgine org.gnome.Shell.desktop[1564]: #28 0x7ffd571f7190 b resource:///org/gnome/gjs/modules/tweener/tweener.js:337
kov. 28 14:20:03 emil-NNNgine org.gnome.Shell.desktop[1564]: #29 0x7ffd571f7240 b resource:///org/gnome/gjs/modules/tweener/tweener.js:350
kov. 28 14:20:03 emil-NNNgine org.gnome.Shell.desktop[1564]: #30 0x7ffd571f72d0 b resource:///org/gnome/gjs/modules/tweener/tweener.js:365
kov. 28 14:20:03 emil-NNNgine org.gnome.Shell.desktop[1564]: #31 0x7ffd571f7350 I resource:///org/gnome/gjs/modules/signals.js:128 (0x7f12
kov. 28 14:20:03 emil-NNNgine org.gnome.Shell.desktop[1564]: #32 0x7ffd571f7400 b resource:///org/gnome/shell/ui/tweener.js:244 (0x7f12886
kov. 28 14:20:03 emil-NNNgine org.gnome.Shell.desktop[1564]: #33 0x7ffd571f7470 I resource:///org/gnome/gjs/modules/_legacy.js:82 (0x7f128
kov. 28 14:20:03 emil-NNNgine org.gnome.Shell.desktop[1564]: #34 0x7ffd571f7470 I resource:///org/gnome/shell/ui/tweener.js:219 (0x7f12886
kov. 28 14:20:03 emil-NNNgine gnome-software[1966]: no app for changed [email protected]
kov. 28 14:20:03 emil-NNNgine gnome-software[1966]: no app for changed [email protected]
-- Reboot --
kov. 28 14:26:03 emil-NNNgine kernel: Linux version 4.18.0-16-generic (buildd@lcy01-amd64-

The -- Reboot -- line marks the point where I noticed the system had frozen and pressed the reset button.

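I have now switched to the Xfce4 desktop environment. It ran for a long time, and I tested it overnight. When I came back to the PC, the screen was showing random rectangular colored patches. The last line shown by journalctl was: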

snapd[911]: stateengine.go:102: state ensure error: cannot refresh snap-declaration for "core": Get https://api.snapcraft.io/api/v1/snaps/assertions/snap-declaration/16/*******?max-format=3: dial tcp: lookup api.snapcraft.io: no such host

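I am not sure whether the problem is really that the Ubuntu desktop manager fails to communicate properly with the NVIDIA hardware, or whether it is an NVIDIA driver problem, or a Ryzen problem...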

Maybe someone has a clue?

Important update: in every case that could be traced, the crash happened while np.loadtxt(...) was operating on a very large file (actually the training dataset).
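
One way to test whether the single large np.loadtxt(...) call is the trigger would be to parse the file in smaller pieces and see if the freezes still occur. The following is only a minimal sketch under stated assumptions, not the code that was running when the crashes happened: the helper name load_in_chunks, the chunk_rows parameter, and the float32 dtype are hypothetical choices, and the file is assumed to be whitespace-delimited, purely numeric, and free of comment lines.

```python
import itertools

import numpy as np


def load_in_chunks(path, chunk_rows=100_000, dtype=np.float32):
    """Parse a numeric text file chunk_rows lines at a time (hypothetical helper)."""
    chunks = []
    with open(path) as fh:
        while True:
            # Pull at most chunk_rows raw lines from the file.
            raw = list(itertools.islice(fh, chunk_rows))
            if not raw:
                break  # end of file reached
            # Drop blank lines so that np.loadtxt never sees an empty chunk.
            lines = [ln for ln in raw if ln.strip()]
            if not lines:
                continue
            # np.loadtxt accepts a list of strings; ndmin=2 keeps every
            # chunk two-dimensional, even for a single row or column.
            chunks.append(np.loadtxt(lines, dtype=dtype, ndmin=2))
    if not chunks:
        return np.empty((0, 0), dtype=dtype)
    # The final concatenation briefly holds both the chunks and the result.
    return np.concatenate(chunks, axis=0)
```

If the freezes stop with a small chunk_rows, that would point toward memory pressure during parsing rather than toward the desktop environment, although it would not by itself rule out a hardware or driver fault.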