如何查找笔记本电脑死机的原因?

如何查找笔记本电脑死机的原因?

我有一个新笔记本,它经常死机。

$ uname -a
Linux bpgergo-notebook 4.2.0-27-generic #32-Ubuntu SMP Fri Jan 22 04:49:08 UTC 2016 x86_64 x86_64 x86_64 GNU/Linux

$ lsb_release -a
No LSB modules are available.
Distributor ID: Ubuntu
Description:    Ubuntu 15.10
Release:    15.10
Codename:   wily

我无法指定特定于崩溃的任何情况或应用程序。我想找出事故原因。我将描述它崩溃时的样子以及我在系统日志中可以看到的内容。我希望您告诉我如何继续查找原因。

崩溃时的样子

有时它会在重新启动后一小时内冻结,有时会在两天内冻结。例如,当最近一次冻结发生时,重新启动后,我只是启动了一些普通的应用程序,例如浏览器和终端,将其单独放置一个小时,当我回到它时,我注意到它没有响应任何内容。甚至不能按 alt+ctl+F1。此时我唯一能做的就是按住电源按钮直到它关闭。

当冻结发生时,我通常会注意到笔记本电脑的温度比应有的温度要高一些。如果我立即重新启动并检查sensors,我可以看到 70 摄氏度左右的温度,这不是极端的情况,但比正常运行温度(大约 50 摄氏度)高得多。

系统日志

我检查了 /var/log/syslog,这是我发现崩溃之前的最新日志行。 chrash1:

Feb 10 15:01:39 bpgergo-notebook kernel: [26093.242080] nouveau E[   PIBUS][0000:01:00.0] HUB0: 0x6013d4 0xffff5703 (0x1c408200)
Feb 10 15:01:39 bpgergo-notebook kernel: [26093.242132] nouveau E[   PIBUS][0000:01:00.0] HUB0: 0x10ecc0 0xffffffff (0x1a40822c)
Feb 10 15:02:09 bpgergo-notebook kernel: [26123.130129] ACPI Warning: \_SB_.PCI0.PEG0.PEGP._DSM: Argument #4 type mismatch - Found [Buffer], ACPI requires [Package] (20150619/nsarguments-95)
Feb 10 15:02:09 bpgergo-notebook kernel: [26123.130403] ACPI: \_SB_.PCI0.PEG0.PEGP: failed to evaluate _DSM
Feb 10 15:02:09 bpgergo-notebook kernel: [26123.130407] ACPI Warning: \_SB_.PCI0.PEG0.PEGP._DSM: Argument #4 type mismatch - Found [Buffer], ACPI requires [Package] (20150619/nsarguments-95)
Feb 10 15:02:11 bpgergo-notebook kernel: [26124.445525] nouveau E[   PIBUS][0000:01:00.0] HUB0: 0x10ecc0 0xffffffff (0x1c40822c)

崩溃2

Feb 10 16:17:58 bpgergo-notebook kernel: [ 1088.808587] nouveau E[   PIBUS][0000:01:00.0] HUB0: 0x6013d4 0xffff5700 (0x1c408200)
Feb 10 16:18:23 bpgergo-notebook kernel: [ 1113.486503] ACPI Warning: \_SB_.PCI0.PEG0.PEGP._DSM: Argument #4 type mismatch - Found [Buffer], ACPI requires [Package] (20150619/nsarguments-95)
Feb 10 16:18:23 bpgergo-notebook kernel: [ 1113.487291] ACPI: \_SB_.PCI0.PEG0.PEGP: failed to evaluate _DSM
Feb 10 16:18:23 bpgergo-notebook kernel: [ 1113.487305] ACPI Warning: \_SB_.PCI0.PEG0.PEGP._DSM: Argument #4 type mismatch - Found [Buffer], ACPI requires [Package] (20150619/nsarguments-95)
Feb 10 16:18:26 bpgergo-notebook kernel: [ 1116.831356] nouveau E[    PBUS][0000:01:00.0] MMIO read of 0x00000000 FAULT at 0x122130 [ IBUS ]
Feb 10 16:18:26 bpgergo-notebook kernel: [ 1116.835021] nouveau E[   PIBUS][0000:01:00.0] HUB0: 0xbad00100 0xbadf1002 (0xbad00100)
Feb 10 16:18:26 bpgergo-notebook kernel: [ 1116.835247] nouveau E[   PIBUS][0000:01:00.0] ROP4: 0xbadf1100 0xbadf1100 (0xbadf1100)
Feb 10 16:18:26 bpgergo-notebook kernel: [ 1116.835252] nouveau E[   PIBUS][0000:01:00.0] ROP6: 0xbadf1100 0xbadf1100 (0xbadf1100)
Feb 10 16:18:26 bpgergo-notebook kernel: [ 1116.835257] nouveau E[   PIBUS][0000:01:00.0] ROP7: 0xbadf1100 0xbadf1100 (0xbadf1100)
Feb 10 16:18:26 bpgergo-notebook kernel: [ 1116.835262] nouveau E[   PIBUS][0000:01:00.0] ROP9: 0xbadf1100 0xbadf1100 (0xbadf1100)
Feb 10 16:18:26 bpgergo-notebook kernel: [ 1116.835267] nouveau E[   PIBUS][0000:01:00.0] ROP11: 0xbadf1100 0xbadf1100 (0xbadf1100)
Feb 10 16:18:26 bpgergo-notebook kernel: [ 1116.835271] nouveau E[   PIBUS][0000:01:00.0] ROP12: 0xbadf1100 0xbadf1100 (0xbadf1100)
Feb 10 16:18:26 bpgergo-notebook kernel: [ 1116.835276] nouveau E[   PIBUS][0000:01:00.0] ROP13: 0xbadf1100 0xbadf1100 (0xbadf1100)
Feb 10 16:18:26 bpgergo-notebook kernel: [ 1116.835281] nouveau E[   PIBUS][0000:01:00.0] ROP15: 0xbadf1100 0xbadf1100 (0xbadf1100)
Feb 10 16:18:26 bpgergo-notebook kernel: [ 1116.835286] nouveau E[   PIBUS][0000:01:00.0] GPC8: 0xbadf1100 0xbadf1100 (0xbadf1100)
Feb 10 16:18:26 bpgergo-notebook kernel: [ 1116.835293] nouveau E[   PIBUS][0000:01:00.0] GPC20: 0x000000 0x00000000 (0x00000000)
Feb 10 16:18:26 bpgergo-notebook kernel: [ 1116.835298] nouveau E[   PIBUS][0000:01:00.0] GPC22: 0xbadf1100 0xbadf1100 (0xbadf1100)
Feb 10 16:18:26 bpgergo-notebook kernel: [ 1116.835303] nouveau E[   PIBUS][0000:01:00.0] GPC23: 0xbadf1100 0xbadf1100 (0xbadf1100)
Feb 10 16:18:26 bpgergo-notebook kernel: [ 1116.835308] nouveau E[   PIBUS][0000:01:00.0] GPC25: 0xbadf1100 0xbadf1100 (0xbadf1100)
Feb 10 16:18:26 bpgergo-notebook kernel: [ 1116.835312] nouveau E[   PIBUS][0000:01:00.0] GPC27: 0xbadf1100 0xbadf1100 (0xbadf1100)
Feb 10 16:18:26 bpgergo-notebook kernel: [ 1116.854481] nouveau E[   PIBUS][0000:01:00.0] GPC28: 0xbad00100 0xbad00100 (0xbad00100)
Feb 10 16:18:26 bpgergo-notebook kernel: [ 1116.877204] nouveau E[   PIBUS][0000:01:00.0] GPC29: 0xbad00100 0xbad00100 (0xbad00100)
Feb 10 16:18:26 bpgergo-notebook kernel: [ 1116.900634] nouveau E[   PIBUS][0000:01:00.0] GPC31: 0xbad00100 0xbad00100 (0xbad00100)
Feb 10 16:18:26 bpgergo-notebook kernel: [ 1116.992570] nouveau E[    PBUS][0000:01:00.0] MMIO read of 0x00000000 FAULT at 0x120058 [ IBUS TIMEOUT ]
Feb 10 16:18:26 bpgergo-notebook kernel: [ 1117.072344] nouveau E[   PIBUS][0000:01:00.0] HUB0: 0xbad00100 0xbad00100 (0xbad00100)
Feb 10 16:18:26 bpgergo-notebook kernel: [ 1117.078887] nouveau E[   PIBUS][0000:01:00.0] ROP4: 0xbad00100 0xbadf1100 (0xbadf1100)
Feb 10 16:18:26 bpgergo-notebook kernel: [ 1117.078904] nouveau E[   PIBUS][0000:01:00.0] ROP6: 0xbadf1100 0xbadf1100 (0xbadf1100)
Feb 10 16:18:26 bpgergo-notebook kernel: [ 1117.078910] nouveau E[   PIBUS][0000:01:00.0] ROP7: 0xbadf1100 0xbadf1100 (0xbadf1100)
Feb 10 16:18:26 bpgergo-notebook kernel: [ 1117.078917] nouveau E[   PIBUS][0000:01:00.0] ROP9: 0xbadf1100 0xbadf1100 (0xbadf1100)
Feb 10 16:18:26 bpgergo-notebook kernel: [ 1117.078923] nouveau E[   PIBUS][0000:01:00.0] ROP11: 0xbadf1100 0xbadf1100 (0xbadf1100)
Feb 10 16:18:26 bpgergo-notebook kernel: [ 1117.078939] nouveau E[   PIBUS][0000:01:00.0] ROP12: 0xbadf1100 0xbadf1100 (0xbadf1100)
Feb 10 16:18:26 bpgergo-notebook kernel: [ 1117.078945] nouveau E[   PIBUS][0000:01:00.0] ROP13: 0xbadf1100 0xbadf1100 (0xbadf1100)
Feb 10 16:18:26 bpgergo-notebook kernel: [ 1117.078950] nouveau E[   PIBUS][0000:01:00.0] ROP15: 0xbadf1100 0xbadf1100 (0xbadf1100)
Feb 10 16:18:26 bpgergo-notebook kernel: [ 1117.078954] nouveau E[   PIBUS][0000:01:00.0] GPC8: 0xbadf1100 0xbadf1100 (0xbadf1100)
Feb 10 16:18:26 bpgergo-notebook kernel: [ 1117.078960] nouveau E[   PIBUS][0000:01:00.0] GPC20: 0x000000 0x00000000 (0x00000000)
Feb 10 16:18:26 bpgergo-notebook kernel: [ 1117.078964] nouveau E[   PIBUS][0000:01:00.0] GPC22: 0xbadf1100 0xbadf1100 (0xbadf1100)
Feb 10 16:18:26 bpgergo-notebook kernel: [ 1117.078968] nouveau E[   PIBUS][0000:01:00.0] GPC23: 0xbadf1100 0xbadf1100 (0xbadf1100)
Feb 10 16:18:26 bpgergo-notebook kernel: [ 1117.078971] nouveau E[   PIBUS][0000:01:00.0] GPC25: 0xbadf1100 0xbadf1100 (0xbadf1100)
Feb 10 16:18:26 bpgergo-notebook kernel: [ 1117.078975] nouveau E[   PIBUS][0000:01:00.0] GPC27: 0xbadf1100 0xbadf1100 (0xbadf1100)
Feb 10 16:18:26 bpgergo-notebook kernel: [ 1117.078978] nouveau E[   PIBUS][0000:01:00.0] GPC28: 0xbadf1100 0xbadf1100 (0xbadf1100)
Feb 10 16:18:26 bpgergo-notebook kernel: [ 1117.078982] nouveau E[   PIBUS][0000:01:00.0] GPC29: 0xbadf1100 0xbadf1100 (0xbadf1100)
Feb 10 16:18:26 bpgergo-notebook kernel: [ 1117.078987] nouveau E[   PIBUS][0000:01:00.0] GPC31: 0x000000 0x00000000 (0x00000000)
Feb 10 16:18:26 bpgergo-notebook kernel: [ 1117.078997] nouveau E[    PBUS][0000:01:00.0] MMIO read of 0x00000000 FAULT at 0x120058 [ IBUS TIMEOUT ]
Feb 10 16:18:26 bpgergo-notebook kernel: [ 1117.079008] nouveau E[   PIBUS][0000:01:00.0] HUB0: 0x136928 0xbadf1100 (0x19400200)
Feb 10 16:18:26 bpgergo-notebook kernel: [ 1117.079014] nouveau E[   PIBUS][0000:01:00.0] ROP0: 0x10f904 0xffffffff (0x1e408201)

答案1

所以为了回答我的问题,

  1. 找出崩溃前在系统日志中写入最后几行的程序或程序包
  2. 还应该检查这个目录:/var/crash/

正如评论者指出的那样,nouveau 是开源的 nvidia 驱动程序。

关于具体问题,我安装了专有的 nvidia 驱动程序,从那以后我就没有再遇到过崩溃。

相关内容