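Summary: My system keeps freezing completely: no mouse, no keyboard, nothing, and the screen is stuck. I thought it was related to the file system, but it is a problem with my Nvidia driver. I tried switching from Unity to xfce4, but the random freezes still happen. Still trying to figure out what is going on.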
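I recently put a fresh install of Ubuntu 16.04 on my old laptop and wanted to copy some files onto it from my external hard drive (Seagate 4TB SRD0NF1). So I plugged in the drive, navigated to the folder in nautilus, and right before I was able to copy it, everything froze. I couldn't move the mouse; it was a complete freeze. So I held down the power button to shut down and retried several times. It froze a few more times, but on the last attempt I managed to copy about 10% of the files (there are about 160,000 in total) to my desktop before it froze again.
So I figured Linux was hitting some weird bug with the external hard drive, but after rebooting with the drive disconnected, I decided to open the folder to see which files had been copied over. Ubuntu froze again.
So now I think it has something to do with the files themselves, but I don't know. They are pretty deeply nested, and some probably have very long file names, if that matters (I remember Windows having issues with that) (see edits). Here is my syslog right before the crash: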
Jan 29 21:22:49 joe-l502x dbus[910]: [system] Activating via systemd: service name='org.freedesktop.hostname1' unit='dbus-org.freedesktop.hostname1.service'
Jan 29 21:22:49 joe-l502x systemd[1]: Starting Hostname Service...
Jan 29 21:22:49 joe-l502x dbus[910]: [system] Successfully activated service 'org.freedesktop.hostname1'
Jan 29 21:22:49 joe-l502x systemd[1]: Started Hostname Service.
Jan 29 21:22:49 joe-l502x org.gtk.vfs.Daemon[1687]: ** (process:2120): WARNING **: send_done_cb: No such interface 'org.gtk.vfs.Enumerator' on object at path /org/gtk/vfs/client/enumerator/2 (g-dbus-error-quark, 19)
Jan 29 21:22:51 joe-l502x kernel: [ 231.057723] NVRM: GPU at PCI:0000:01:00: GPU-e89c676b-485a-4551-fc4f-e23db62c065d
Jan 29 21:22:51 joe-l502x kernel: [ 231.057736] NVRM: Xid (PCI:0000:01:00): 32, Channel ID 00000003 intr 80804000
Jan 29 21:22:52 joe-l502x kernel: [ 231.059903] NVRM: Xid (PCI:0000:01:00): 62, !18bc(0ed4)
Jan 29 21:22:52 joe-l502x org.gtk.vfs.Daemon[1687]: ** (gvfsd:1758): WARNING **: dbus_mount_reply: Error from org.gtk.vfs.Mountable.mount(): Failed to retrieve share list from server: Connection refused
Jan 29 21:22:52 joe-l502x org.gtk.vfs.Daemon[1687]: ** (process:2374): WARNING **: Couldn't create directory monitor on smb://x-gnome-default-workgroup/. Error: The specified location is not mounted
Jan 29 21:22:55 joe-l502x org.gtk.vfs.Daemon[1687]: ** (gvfsd:1758): WARNING **: dbus_mount_reply: Error from org.gtk.vfs.Mountable.mount(): Failed to retrieve share list from server: Connection refused
Jan 29 21:22:55 joe-l502x org.gtk.vfs.Daemon[1687]: ** (process:2391): WARNING **: Couldn't create directory monitor on smb://x-gnome-default-workgroup/. Error: The specified location is not mounted
Jan 29 21:22:55 joe-l502x kernel: [ 235.057714] NVRM: Xid (PCI:0000:01:00): 32, Channel ID 00000003 intr 80804000
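I'm new to Linux, so I don't really know what is going on in those logs. Does anyone know what is happening here? Does it have something to do with this "dbus"? Are there other logs I should check? Thanks!
Edit 1:
OK, it looks like this is related to those GPU messages. Here is another crash: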
<< FREEZES HERE >>
Jan 29 21:52:22 joe-l502x kernel: [ 90.715448] NVRM: GPU at PCI:0000:01:00: GPU-e89c676b-485a-4551-fc4f-e23db62c065d
Jan 29 21:52:22 joe-l502x kernel: [ 90.715457] NVRM: Xid (PCI:0000:01:00): 13, Graphics SM Warp Exception on (GPC 0, TPC 0): Illegal Instruction Encoding
Jan 29 21:52:22 joe-l502x kernel: [ 90.715466] NVRM: Xid (PCI:0000:01:00): 13, Graphics SM Global Exception on (GPC 0, TPC 0): Physical Multiple Warp Errors
Jan 29 21:52:22 joe-l502x kernel: [ 90.715472] NVRM: Xid (PCI:0000:01:00): 13, Graphics Exception: ESR 0x504648=0x9 0x504650=0x4 0x504644=0x1beff2 0x50464c=0xf
Jan 29 21:52:22 joe-l502x kernel: [ 90.715494] NVRM: Xid (PCI:0000:01:00): 13, Graphics SM Warp Exception on (GPC 0, TPC 1): Illegal Instruction Encoding
Jan 29 21:52:22 joe-l502x kernel: [ 90.715500] NVRM: Xid (PCI:0000:01:00): 13, Graphics SM Global Exception on (GPC 0, TPC 1): Physical Multiple Warp Errors
Jan 29 21:52:22 joe-l502x kernel: [ 90.715506] NVRM: Xid (PCI:0000:01:00): 13, Graphics Exception: ESR 0x504e48=0x9 0x504e50=0x4 0x504e44=0x1beff2 0x504e4c=0xf
Jan 29 21:52:22 joe-l502x kernel: [ 90.715526] NVRM: Xid (PCI:0000:01:00): 13, Graphics Exception: ChID 0001, Class 00009197, Offset 00000100, Data 0fffcc00
Jan 29 21:52:58 joe-l502x systemd[1]: Stopping User Manager for UID 108...
Jan 29 21:52:58 joe-l502x systemd[1163]: Reached target Shutdown.
Jan 29 21:52:58 joe-l502x systemd[1163]: Stopped target Default.
Jan 29 21:52:59 joe-l502x systemd[1163]: Starting Exit the Session...
Jan 29 21:52:59 joe-l502x systemd[1163]: Stopped target Basic System.
Jan 29 21:52:59 joe-l502x systemd[1163]: Stopped target Paths.
Jan 29 21:52:59 joe-l502x systemd[1163]: Stopped target Timers.
Jan 29 21:52:59 joe-l502x systemd[1163]: Stopped target Sockets.
Jan 29 21:52:59 joe-l502x systemd[1163]: Received SIGRTMIN+24 from PID 2399 (kill).
Jan 29 21:52:59 joe-l502x systemd[1]: Stopped User Manager for UID 108.
Jan 29 21:52:59 joe-l502x systemd[1]: Removed slice User Slice of lightdm.
This does not seem to be specific to Nautilus ("Files"). Here are the crash messages I got when opening the folder with the "Thunar" file manager:
Jan 29 22:08:50 joe-l502x kernel: [ 843.694898] NVRM: GPU at PCI:0000:01:00: GPU-e89c676b-485a-4551-fc4f-e23db62c065d
Jan 29 22:08:50 joe-l502x kernel: [ 843.694903] NVRM: Xid (PCI:0000:01:00): 44, Ch 00000001, engmask 00000101, intr 10000000
Jan 29 22:08:50 joe-l502x kernel: [ 844.196602] NVRM: Xid (PCI:0000:01:00): 39, CCMDs 00000007 000090b5
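Navigating to the folder and copying via the terminal works fine, so this does not seem to be specifically related to the file system.
My graphics card is an Nvidia GTX 540M and my driver version is 384.111.
Edit 2:
Actually, I remember a similar freeze a few weeks ago while using the Unity Tweak Tool GUI. So maybe this is related to Unity? Or to Xorg?
Edit 3:
Confirmed that this most likely has nothing to do with the file system. I just got a freeze in the Ubuntu System Settings > Software & Updates GUI. Very similar logs: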
Jan 30 06:33:51 joe-l502x kernel: [30264.899235] NVRM: GPU at PCI:0000:01:00: GPU-e89c676b-485a-4551-fc4f-e23db62c065d
Jan 30 06:33:51 joe-l502x kernel: [30264.899241] NVRM: Xid (PCI:0000:01:00): 13, Graphics Exception: Shader Program Header 6 Error
Jan 30 06:33:51 joe-l502x kernel: [30264.899245] NVRM: Xid (PCI:0000:01:00): 13, Graphics Exception: Shader Program Header 9 Error
Jan 30 06:33:51 joe-l502x kernel: [30264.899248] NVRM: Xid (PCI:0000:01:00): 13, Graphics Exception: Shader Program Header 18 Error
Jan 30 06:33:51 joe-l502x kernel: [30264.899252] NVRM: Xid (PCI:0000:01:00): 13, Graphics Exception: ESR 0x405840=0xa0040240
Jan 30 06:33:51 joe-l502x kernel: [30264.899267] NVRM: Xid (PCI:0000:01:00): 13, Graphics Exception: ChID 0001, Class 00009197, Offset 00000100, Data 0fffcc00
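It looks like this really does have to do with the interaction between my GPU and some Linux GUI library or component.
Edit 4:
Someone pointed me to /var/log/Xorg.0.log.old, which I think pretty much 100% confirms that the nvidia driver is what's acting up: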
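For anyone who wants to tally these kernel messages rather than read them by eye, here is a minimal Python sketch. It only assumes the line shape seen in the logs above ("NVRM: Xid (PCI:...): <number>, <detail>"). The names `parse_xid_events` and `count_xids` are made up for illustration and are not part of any Nvidia tool.

```python
import re
from collections import Counter

# Matches kernel lines shaped like the ones pasted above, e.g.
#   "... kernel: [ 231.057736] NVRM: Xid (PCI:0000:01:00): 32, Channel ID ..."
# The pattern is inferred from these logs only; other driver versions may differ.
XID_RE = re.compile(r"NVRM: Xid \((PCI:[0-9a-fA-F:.]+)\): (\d+),\s*(.*)")


def parse_xid_events(lines):
    """Return a list of (pci_address, xid_number, detail) for each Xid line."""
    events = []
    for line in lines:
        match = XID_RE.search(line)
        if match:
            pci, xid, detail = match.groups()
            events.append((pci, int(xid), detail.strip()))
    return events


def count_xids(lines):
    """Count how many times each Xid number appears in the given lines."""
    return Counter(xid for _, xid, _ in parse_xid_events(lines))
```

Usage would be something like `count_xids(open("/var/log/syslog", errors="replace"))`, assuming the kernel messages end up in /var/log/syslog as they do on my machine. Seeing which Xid numbers repeat makes it easier to search for them.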
[ 30246.723] (--) NVIDIA(GPU-0): DFP-0: disconnected
[ 30246.723] (--) NVIDIA(GPU-0): DFP-0: Internal TMDS
[ 30246.723] (--) NVIDIA(GPU-0): DFP-0: 165.0 MHz maximum pixel clock
[ 30246.723] (--) NVIDIA(GPU-0):
[ 30246.723] (--) NVIDIA(GPU-0): DFP-0: disconnected
[ 30246.723] (--) NVIDIA(GPU-0): DFP-0: Internal TMDS
[ 30246.723] (--) NVIDIA(GPU-0): DFP-0: 165.0 MHz maximum pixel clock
[ 30246.723] (--) NVIDIA(GPU-0):
[ 30267.150] (EE) NVIDIA(0): The NVIDIA X driver has encountered an error; attempting to
[ 30267.150] (EE) NVIDIA(0): recover...
<< but it never recovers :( >>
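To find lines like that "(EE)" one without scrolling through the whole Xorg log, a small Python filter along these lines should work. This is a sketch that assumes the "[ timestamp] (EE) message" layout shown above; `xorg_problems` is a hypothetical helper name, not an existing tool.

```python
import re

# Xorg log lines in the excerpt above look like:
#   "[ 30267.150] (EE) NVIDIA(0): The NVIDIA X driver has encountered an error; ..."
# "(EE)" marks errors and "(WW)" marks warnings.
XORG_RE = re.compile(r"^\[\s*(\d+\.\d+)\]\s+\((EE|WW)\)\s+(.*)$")


def xorg_problems(lines, levels=("EE",)):
    """Return (timestamp_seconds, level, message) for lines at the given levels."""
    found = []
    for line in lines:
        match = XORG_RE.match(line.strip())
        if match and match.group(2) in levels:
            found.append((float(match.group(1)), match.group(2), match.group(3)))
    return found
```

For example, `xorg_problems(open("/var/log/Xorg.0.log.old", errors="replace"))` would list only the error lines. Passing `levels=("EE", "WW")` adds warnings, though the marker legend near the top of the log may then show up as a match too.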