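My Ubuntu 18.04 desktop has been freezing a lot lately. I can still move the mouse, but clicks do nothing and no keyboard input is recognized. It does not respond to Ctrl-Alt-Backspace, but REISUB does bring it down. I can also still SSH into the machine.
When I looked at the syslog, I found that the following crash happened today: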
Jan 26 17:49:54 meeks kernel: [53943.653157] nouveau 0000:02:00.0: fifo: write fault at 0000244000 engine 00 [GR] client 0f [GPC0/PROP_0] reason 02 [PTE] on channel 13 [007f4e9000 systemd-logind[1746]]
Jan 26 17:49:54 meeks kernel: [53943.653163] nouveau 0000:02:00.0: fifo: channel 13: killed
Jan 26 17:49:54 meeks kernel: [53943.653165] nouveau 0000:02:00.0: fifo: runlist 0: scheduled for recovery
Jan 26 17:49:54 meeks kernel: [53943.653169] nouveau 0000:02:00.0: fifo: engine 0: scheduled for recovery
Jan 26 17:50:28 meeks kernel: [53977.661464] nouveau 0000:02:00.0: slack[6821]: failed to idle channel 20 [slack[6821]]
Jan 26 17:50:43 meeks kernel: [53992.661575] nouveau 0000:02:00.0: slack[6821]: failed to idle channel 20 [slack[6821]]
Jan 26 17:50:43 meeks kernel: [53992.661685] nouveau 0000:02:00.0: fifo: read fault at 0000013000 engine 07 [HOST0] client 07 [HOST_CPU] reason 02 [PTE] on channel 20 [007f1cc000 slack[6821]]
Jan 26 17:50:43 meeks kernel: [53992.661694] nouveau 0000:02:00.0: fifo: channel 20: killed
Jan 26 17:50:43 meeks kernel: [53992.661696] nouveau 0000:02:00.0: fifo: runlist 0: scheduled for recovery
Jan 26 17:50:43 meeks kernel: [53992.661711] nouveau 0000:02:00.0: fifo: engine 0: scheduled for recovery
Here is another example from yesterday:
Jan 25 06:23:46 meeks kernel: [14531.638260] nouveau 0000:02:00.0: fifo: write fault at 0000240000 engine 00 [GR] client 0f [GPC0/PROP_0] reason 02 [PTE] on channel 13 [007f4e9000 systemd-logind[1702]]
Jan 25 06:23:46 meeks kernel: [14531.638267] nouveau 0000:02:00.0: fifo: channel 13: killed
Jan 25 06:23:46 meeks kernel: [14531.638268] nouveau 0000:02:00.0: fifo: runlist 0: scheduled for recovery
Jan 25 06:23:46 meeks kernel: [14531.638273] nouveau 0000:02:00.0: fifo: engine 0: scheduled for recovery
Jan 25 06:23:59 meeks kernel: [14544.901549] CPU5: Core temperature above threshold, cpu clock throttled (total events = 30066)
Jan 25 06:23:59 meeks kernel: [14544.901549] CPU1: Core temperature above threshold, cpu clock throttled (total events = 30066)
Jan 25 06:23:59 meeks kernel: [14544.901568] CPU4: Package temperature above threshold, cpu clock throttled (total events = 124706)
Jan 25 06:23:59 meeks kernel: [14544.901568] CPU0: Package temperature above threshold, cpu clock throttled (total events = 124706)
Jan 25 06:23:59 meeks kernel: [14544.901569] CPU1: Package temperature above threshold, cpu clock throttled (total events = 124706)
Jan 25 06:23:59 meeks kernel: [14544.901570] CPU5: Package temperature above threshold, cpu clock throttled (total events = 124706)
Jan 25 06:23:59 meeks kernel: [14544.901573] CPU3: Package temperature above threshold, cpu clock throttled (total events = 124706)
Jan 25 06:23:59 meeks kernel: [14544.901574] CPU2: Package temperature above threshold, cpu clock throttled (total events = 124706)
Jan 25 06:23:59 meeks kernel: [14544.901575] CPU6: Package temperature above threshold, cpu clock throttled (total events = 124706)
Jan 25 06:23:59 meeks kernel: [14544.901576] CPU7: Package temperature above threshold, cpu clock throttled (total events = 124706)
Jan 25 06:23:59 meeks kernel: [14544.902621] CPU5: Core temperature/speed normal
Jan 25 06:23:59 meeks kernel: [14544.902621] CPU1: Core temperature/speed normal
Jan 25 06:23:59 meeks kernel: [14544.902623] CPU7: Package temperature/speed normal
Jan 25 06:23:59 meeks kernel: [14544.902624] CPU0: Package temperature/speed normal
Jan 25 06:23:59 meeks kernel: [14544.902624] CPU3: Package temperature/speed normal
Jan 25 06:23:59 meeks kernel: [14544.902625] CPU4: Package temperature/speed normal
Jan 25 06:23:59 meeks kernel: [14544.902625] CPU5: Package temperature/speed normal
Jan 25 06:23:59 meeks kernel: [14544.902626] CPU1: Package temperature/speed normal
Jan 25 06:23:59 meeks kernel: [14544.902648] CPU2: Package temperature/speed normal
Jan 25 06:23:59 meeks kernel: [14544.902648] CPU6: Package temperature/speed normal
Jan 25 06:24:29 meeks kernel: [14574.481259] nouveau 0000:02:00.0: slack[11445]: failed to idle channel 18 [slack[11445]]
Jan 25 06:24:44 meeks kernel: [14589.480973] nouveau 0000:02:00.0: slack[11445]: failed to idle channel 18 [slack[11445]]
Jan 25 06:24:44 meeks kernel: [14589.481019] nouveau 0000:02:00.0: fifo: read fault at 0000013000 engine 07 [HOST0] client 07 [HOST_CPU] reason 02 [PTE] on channel 18 [007f3e8000 slack[11445]]
Jan 25 06:24:44 meeks kernel: [14589.481026] nouveau 0000:02:00.0: fifo: channel 18: killed
Jan 25 06:24:44 meeks kernel: [14589.481028] nouveau 0000:02:00.0: fifo: runlist 0: scheduled for recovery
Jan 25 06:24:44 meeks kernel: [14589.481037] nouveau 0000:02:00.0: fifo: engine 0: scheduled for recovery
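To make it easier to see which processes and engines are involved across many freezes, here is a minimal Python sketch that tallies the nouveau fifo fault lines. The regular expression is only an assumption based on the line format in the excerpts above, and the names `FAULT_RE`, `summarize_faults`, and `SAMPLE` are mine, not part of any tool.

```python
import re
from collections import Counter

# Pattern inferred from the nouveau fault lines in the excerpts above;
# other driver or kernel versions may format these lines differently.
FAULT_RE = re.compile(
    r"nouveau \S+ fifo: (?P<kind>read|write) fault at (?P<addr>[0-9a-f]+) "
    r"engine \S+ \[(?P<engine>[^\]]+)\] client \S+ \[(?P<client>[^\]]+)\] "
    r"reason \S+ \[(?P<reason>[^\]]+)\] on channel (?P<channel>\d+) "
    r"\[\S+ (?P<process>[^\[\]]+)\[(?P<pid>\d+)\]\]"
)


def summarize_faults(lines):
    """Count fault lines grouped by (kind, engine, reason, process name)."""
    counts = Counter()
    for line in lines:
        m = FAULT_RE.search(line)
        if m:
            counts[(m["kind"], m["engine"], m["reason"], m["process"])] += 1
    return counts


# Three lines copied from the excerpts above; the last one is not a fault
# line and is ignored by the pattern.
SAMPLE = [
    "Jan 26 17:49:54 meeks kernel: [53943.653157] nouveau 0000:02:00.0: fifo: write fault at 0000244000 engine 00 [GR] client 0f [GPC0/PROP_0] reason 02 [PTE] on channel 13 [007f4e9000 systemd-logind[1746]]",
    "Jan 26 17:50:43 meeks kernel: [53992.661685] nouveau 0000:02:00.0: fifo: read fault at 0000013000 engine 07 [HOST0] client 07 [HOST_CPU] reason 02 [PTE] on channel 20 [007f1cc000 slack[6821]]",
    "Jan 26 17:49:54 meeks kernel: [53943.653163] nouveau 0000:02:00.0: fifo: channel 13: killed",
]

if __name__ == "__main__":
    for key, n in summarize_faults(SAMPLE).most_common():
        print(n, *key)
```

To run it over a real log, the lines could come from an open file instead of `SAMPLE`, for example `summarize_faults(open("/var/log/syslog"))`, assuming that is where the kernel messages are written on this machine.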
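Motherboard: B250M Pro4
Graphics card: ASUS GT 710 (installed recently; the freezes definitely happened before it was installed, but they may be happening more often since?)
RAM: 4x16GB DIMM DDR4 Synchronous (2 at 2667 MHz, 2 at 2133 MHz)
Hard drives: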
# df
Filesystem 1K-blocks Used Available Use% Mounted on
udev 32422652 0 32422652 0% /dev
tmpfs 6489276 2188 6487088 1% /run
/dev/sdc5 19091540 16451780 1646892 91% /
tmpfs 32446360 406376 32039984 2% /dev/shm
tmpfs 5120 4 5116 1% /run/lock
tmpfs 32446360 0 32446360 0% /sys/fs/cgroup
/dev/loop0 2560 2560 0 100% /snap/gnome-calculator/748
/dev/loop3 384 384 0 100% /snap/gnome-characters/570
/dev/loop1 2304 2304 0 100% /snap/gnome-system-monitor/145
/dev/loop2 144128 144128 0 100% /snap/gnome-3-26-1604/98
/dev/loop6 144128 144128 0 100% /snap/gnome-3-26-1604/100
/dev/loop7 223232 223232 0 100% /snap/gnome-3-34-1804/60
/dev/loop5 1024 1024 0 100% /snap/gnome-logs/93
/dev/loop4 66432 66432 0 100% /snap/gtk-common-themes/1514
/dev/loop8 224256 224256 0 100% /snap/gnome-3-34-1804/66
/dev/loop9 56832 56832 0 100% /snap/core18/1944
/dev/loop11 165376 165376 0 100% /snap/gnome-3-28-1804/128
/dev/loop13 100352 100352 0 100% /snap/core/10583
/dev/loop10 56704 56704 0 100% /snap/core18/1932
/dev/loop12 384 384 0 100% /snap/gnome-characters/550
/dev/loop14 2304 2304 0 100% /snap/gnome-system-monitor/148
/dev/loop15 2560 2560 0 100% /snap/gnome-calculator/826
/dev/loop16 63616 63616 0 100% /snap/gtk-common-themes/1506
/dev/loop17 1024 1024 0 100% /snap/gnome-logs/100
/dev/loop18 166784 166784 0 100% /snap/gnome-3-28-1804/145
/dev/loop19 100224 100224 0 100% /snap/core/10577
/dev/sdb1 960774412 318087732 642686680 34% /data
/dev/sdc6 210475628 120355016 79359348 61% /home
vmpool 696824704 607706496 89118208 88% /vms
tmpfs 6489272 16 6489256 1% /run/user/120
tmpfs 6489272 36 6489236 1% /run/user/1000
tmpfs 6489272 0 6489272 0% /run/user/0
I'm happy to look up any other information that would be helpful. I would love to figure out how to troubleshoot this kind of problem.