I have a new laptop, and it crashes frequently.
$ uname -a
Linux bpgergo-notebook 4.2.0-27-generic #32-Ubuntu SMP Fri Jan 22 04:49:08 UTC 2016 x86_64 x86_64 x86_64 GNU/Linux
$ lsb_release -a
No LSB modules are available.
Distributor ID: Ubuntu
Description: Ubuntu 15.10
Release: 15.10
Codename: wily
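I cannot tie the crashes to any particular situation or application. I want to find out what causes them. Below I describe what a crash looks like and what I see in the system log. I would like you to tell me how to proceed in finding the cause.
What a crash looks like
Sometimes it freezes within an hour after a reboot, sometimes within two days. For example, the most recent time, I had just started a few ordinary applications after rebooting, such as a browser and a terminal, and left the machine idle for an hour. When I came back, it did not respond to anything, not even Ctrl+Alt+F1. At that point the only thing I can do is hold down the power button until it switches off.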
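When a freeze occurs, I usually notice that the laptop is a bit warmer than normal. If I reboot immediately and check with sensors, I see a temperature of 70 °C, which is not extreme but is much higher than the normal operating temperature (about 50 °C).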
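Because the machine has to be powered off hard after a freeze, one way to learn how hot it was just before locking up is to log temperatures to a file at regular intervals. The following is only a minimal sketch and not something from the original post: the function names read_zone_temps and log_zone_temps and the log path are made up for illustration, and it assumes the kernel exposes thermal zones under /sys/class/thermal with values in millidegrees Celsius.

```shell
#!/bin/sh
# Print one line per thermal zone: "<zone> <degrees C>".
# $1 is the sysfs directory to scan (assumed default: /sys/class/thermal).
read_zone_temps() {
    base=${1:-/sys/class/thermal}
    for zone in "$base"/thermal_zone*; do
        [ -r "$zone/temp" ] || continue
        # sysfs reports millidegrees Celsius; convert to whole degrees.
        millideg=$(cat "$zone/temp")
        echo "$(basename "$zone") $((millideg / 1000))"
    done
}

# Append a timestamped snapshot to the log file given as $1, then sync
# so the last reading is more likely to survive a hard power-off.
# $2 optionally overrides the sysfs directory (hypothetical, for testing).
log_zone_temps() {
    logfile=$1
    read_zone_temps "$2" | while read -r zone temp; do
        echo "$(date '+%F %T') $zone ${temp}C"
    done >> "$logfile"
    sync
}
```

It could be left running in a terminal with something like `while true; do log_zone_temps "$HOME/temps.log"; sleep 60; done`; after the next freeze and reboot, the tail of that file shows the last temperatures recorded.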
System log
I checked /var/log/syslog and found these to be the last log lines before each crash.
crash1
Feb 10 15:01:39 bpgergo-notebook kernel: [26093.242080] nouveau E[ PIBUS][0000:01:00.0] HUB0: 0x6013d4 0xffff5703 (0x1c408200)
Feb 10 15:01:39 bpgergo-notebook kernel: [26093.242132] nouveau E[ PIBUS][0000:01:00.0] HUB0: 0x10ecc0 0xffffffff (0x1a40822c)
Feb 10 15:02:09 bpgergo-notebook kernel: [26123.130129] ACPI Warning: \_SB_.PCI0.PEG0.PEGP._DSM: Argument #4 type mismatch - Found [Buffer], ACPI requires [Package] (20150619/nsarguments-95)
Feb 10 15:02:09 bpgergo-notebook kernel: [26123.130403] ACPI: \_SB_.PCI0.PEG0.PEGP: failed to evaluate _DSM
Feb 10 15:02:09 bpgergo-notebook kernel: [26123.130407] ACPI Warning: \_SB_.PCI0.PEG0.PEGP._DSM: Argument #4 type mismatch - Found [Buffer], ACPI requires [Package] (20150619/nsarguments-95)
Feb 10 15:02:11 bpgergo-notebook kernel: [26124.445525] nouveau E[ PIBUS][0000:01:00.0] HUB0: 0x10ecc0 0xffffffff (0x1c40822c)
crash2
Feb 10 16:17:58 bpgergo-notebook kernel: [ 1088.808587] nouveau E[ PIBUS][0000:01:00.0] HUB0: 0x6013d4 0xffff5700 (0x1c408200)
Feb 10 16:18:23 bpgergo-notebook kernel: [ 1113.486503] ACPI Warning: \_SB_.PCI0.PEG0.PEGP._DSM: Argument #4 type mismatch - Found [Buffer], ACPI requires [Package] (20150619/nsarguments-95)
Feb 10 16:18:23 bpgergo-notebook kernel: [ 1113.487291] ACPI: \_SB_.PCI0.PEG0.PEGP: failed to evaluate _DSM
Feb 10 16:18:23 bpgergo-notebook kernel: [ 1113.487305] ACPI Warning: \_SB_.PCI0.PEG0.PEGP._DSM: Argument #4 type mismatch - Found [Buffer], ACPI requires [Package] (20150619/nsarguments-95)
Feb 10 16:18:26 bpgergo-notebook kernel: [ 1116.831356] nouveau E[ PBUS][0000:01:00.0] MMIO read of 0x00000000 FAULT at 0x122130 [ IBUS ]
Feb 10 16:18:26 bpgergo-notebook kernel: [ 1116.835021] nouveau E[ PIBUS][0000:01:00.0] HUB0: 0xbad00100 0xbadf1002 (0xbad00100)
Feb 10 16:18:26 bpgergo-notebook kernel: [ 1116.835247] nouveau E[ PIBUS][0000:01:00.0] ROP4: 0xbadf1100 0xbadf1100 (0xbadf1100)
Feb 10 16:18:26 bpgergo-notebook kernel: [ 1116.835252] nouveau E[ PIBUS][0000:01:00.0] ROP6: 0xbadf1100 0xbadf1100 (0xbadf1100)
Feb 10 16:18:26 bpgergo-notebook kernel: [ 1116.835257] nouveau E[ PIBUS][0000:01:00.0] ROP7: 0xbadf1100 0xbadf1100 (0xbadf1100)
Feb 10 16:18:26 bpgergo-notebook kernel: [ 1116.835262] nouveau E[ PIBUS][0000:01:00.0] ROP9: 0xbadf1100 0xbadf1100 (0xbadf1100)
Feb 10 16:18:26 bpgergo-notebook kernel: [ 1116.835267] nouveau E[ PIBUS][0000:01:00.0] ROP11: 0xbadf1100 0xbadf1100 (0xbadf1100)
Feb 10 16:18:26 bpgergo-notebook kernel: [ 1116.835271] nouveau E[ PIBUS][0000:01:00.0] ROP12: 0xbadf1100 0xbadf1100 (0xbadf1100)
Feb 10 16:18:26 bpgergo-notebook kernel: [ 1116.835276] nouveau E[ PIBUS][0000:01:00.0] ROP13: 0xbadf1100 0xbadf1100 (0xbadf1100)
Feb 10 16:18:26 bpgergo-notebook kernel: [ 1116.835281] nouveau E[ PIBUS][0000:01:00.0] ROP15: 0xbadf1100 0xbadf1100 (0xbadf1100)
Feb 10 16:18:26 bpgergo-notebook kernel: [ 1116.835286] nouveau E[ PIBUS][0000:01:00.0] GPC8: 0xbadf1100 0xbadf1100 (0xbadf1100)
Feb 10 16:18:26 bpgergo-notebook kernel: [ 1116.835293] nouveau E[ PIBUS][0000:01:00.0] GPC20: 0x000000 0x00000000 (0x00000000)
Feb 10 16:18:26 bpgergo-notebook kernel: [ 1116.835298] nouveau E[ PIBUS][0000:01:00.0] GPC22: 0xbadf1100 0xbadf1100 (0xbadf1100)
Feb 10 16:18:26 bpgergo-notebook kernel: [ 1116.835303] nouveau E[ PIBUS][0000:01:00.0] GPC23: 0xbadf1100 0xbadf1100 (0xbadf1100)
Feb 10 16:18:26 bpgergo-notebook kernel: [ 1116.835308] nouveau E[ PIBUS][0000:01:00.0] GPC25: 0xbadf1100 0xbadf1100 (0xbadf1100)
Feb 10 16:18:26 bpgergo-notebook kernel: [ 1116.835312] nouveau E[ PIBUS][0000:01:00.0] GPC27: 0xbadf1100 0xbadf1100 (0xbadf1100)
Feb 10 16:18:26 bpgergo-notebook kernel: [ 1116.854481] nouveau E[ PIBUS][0000:01:00.0] GPC28: 0xbad00100 0xbad00100 (0xbad00100)
Feb 10 16:18:26 bpgergo-notebook kernel: [ 1116.877204] nouveau E[ PIBUS][0000:01:00.0] GPC29: 0xbad00100 0xbad00100 (0xbad00100)
Feb 10 16:18:26 bpgergo-notebook kernel: [ 1116.900634] nouveau E[ PIBUS][0000:01:00.0] GPC31: 0xbad00100 0xbad00100 (0xbad00100)
Feb 10 16:18:26 bpgergo-notebook kernel: [ 1116.992570] nouveau E[ PBUS][0000:01:00.0] MMIO read of 0x00000000 FAULT at 0x120058 [ IBUS TIMEOUT ]
Feb 10 16:18:26 bpgergo-notebook kernel: [ 1117.072344] nouveau E[ PIBUS][0000:01:00.0] HUB0: 0xbad00100 0xbad00100 (0xbad00100)
Feb 10 16:18:26 bpgergo-notebook kernel: [ 1117.078887] nouveau E[ PIBUS][0000:01:00.0] ROP4: 0xbad00100 0xbadf1100 (0xbadf1100)
Feb 10 16:18:26 bpgergo-notebook kernel: [ 1117.078904] nouveau E[ PIBUS][0000:01:00.0] ROP6: 0xbadf1100 0xbadf1100 (0xbadf1100)
Feb 10 16:18:26 bpgergo-notebook kernel: [ 1117.078910] nouveau E[ PIBUS][0000:01:00.0] ROP7: 0xbadf1100 0xbadf1100 (0xbadf1100)
Feb 10 16:18:26 bpgergo-notebook kernel: [ 1117.078917] nouveau E[ PIBUS][0000:01:00.0] ROP9: 0xbadf1100 0xbadf1100 (0xbadf1100)
Feb 10 16:18:26 bpgergo-notebook kernel: [ 1117.078923] nouveau E[ PIBUS][0000:01:00.0] ROP11: 0xbadf1100 0xbadf1100 (0xbadf1100)
Feb 10 16:18:26 bpgergo-notebook kernel: [ 1117.078939] nouveau E[ PIBUS][0000:01:00.0] ROP12: 0xbadf1100 0xbadf1100 (0xbadf1100)
Feb 10 16:18:26 bpgergo-notebook kernel: [ 1117.078945] nouveau E[ PIBUS][0000:01:00.0] ROP13: 0xbadf1100 0xbadf1100 (0xbadf1100)
Feb 10 16:18:26 bpgergo-notebook kernel: [ 1117.078950] nouveau E[ PIBUS][0000:01:00.0] ROP15: 0xbadf1100 0xbadf1100 (0xbadf1100)
Feb 10 16:18:26 bpgergo-notebook kernel: [ 1117.078954] nouveau E[ PIBUS][0000:01:00.0] GPC8: 0xbadf1100 0xbadf1100 (0xbadf1100)
Feb 10 16:18:26 bpgergo-notebook kernel: [ 1117.078960] nouveau E[ PIBUS][0000:01:00.0] GPC20: 0x000000 0x00000000 (0x00000000)
Feb 10 16:18:26 bpgergo-notebook kernel: [ 1117.078964] nouveau E[ PIBUS][0000:01:00.0] GPC22: 0xbadf1100 0xbadf1100 (0xbadf1100)
Feb 10 16:18:26 bpgergo-notebook kernel: [ 1117.078968] nouveau E[ PIBUS][0000:01:00.0] GPC23: 0xbadf1100 0xbadf1100 (0xbadf1100)
Feb 10 16:18:26 bpgergo-notebook kernel: [ 1117.078971] nouveau E[ PIBUS][0000:01:00.0] GPC25: 0xbadf1100 0xbadf1100 (0xbadf1100)
Feb 10 16:18:26 bpgergo-notebook kernel: [ 1117.078975] nouveau E[ PIBUS][0000:01:00.0] GPC27: 0xbadf1100 0xbadf1100 (0xbadf1100)
Feb 10 16:18:26 bpgergo-notebook kernel: [ 1117.078978] nouveau E[ PIBUS][0000:01:00.0] GPC28: 0xbadf1100 0xbadf1100 (0xbadf1100)
Feb 10 16:18:26 bpgergo-notebook kernel: [ 1117.078982] nouveau E[ PIBUS][0000:01:00.0] GPC29: 0xbadf1100 0xbadf1100 (0xbadf1100)
Feb 10 16:18:26 bpgergo-notebook kernel: [ 1117.078987] nouveau E[ PIBUS][0000:01:00.0] GPC31: 0x000000 0x00000000 (0x00000000)
Feb 10 16:18:26 bpgergo-notebook kernel: [ 1117.078997] nouveau E[ PBUS][0000:01:00.0] MMIO read of 0x00000000 FAULT at 0x120058 [ IBUS TIMEOUT ]
Feb 10 16:18:26 bpgergo-notebook kernel: [ 1117.079008] nouveau E[ PIBUS][0000:01:00.0] HUB0: 0x136928 0xbadf1100 (0x19400200)
Feb 10 16:18:26 bpgergo-notebook kernel: [ 1117.079014] nouveau E[ PIBUS][0000:01:00.0] ROP0: 0x10f904 0xffffffff (0x1e408201)
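Edit
It was suggested that this may be related to the graphics card. I have not installed any graphics-related drivers or software. Here is the relevant part of the lspci output: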
01:00.0 3D controller: NVIDIA Corporation GM107M [GeForce GTX 960M] (rev a2)
Subsystem: ASUSTeK Computer Inc. Device 18dd
Flags: bus master, fast devsel, latency 0, IRQ 31
Memory at eb000000 (32-bit, non-prefetchable) [size=16M]
Memory at c0000000 (64-bit, prefetchable) [size=256M]
Memory at d0000000 (64-bit, prefetchable) [size=32M]
I/O ports at e000 [size=128]
Expansion ROM at ec000000 [disabled] [size=512K]
Capabilities: [60] Power Management version 3
Capabilities: [68] MSI: Enable+ Count=1/1 Maskable- 64bit+
Capabilities: [78] Express Endpoint, MSI 00
Capabilities: [100] Virtual Channel
Capabilities: [250] Latency Tolerance Reporting
Capabilities: [258] L1 PM Substates
Capabilities: [128] Power Budgeting <?>
Capabilities: [600] Vendor Specific Information: ID=0001 Rev=1 Len=024 <?>
Capabilities: [900] #19
Kernel driver in use: nouveau
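Answer 1
So, to answer my own question:
- Find out which program or package wrote the last few lines to syslog before the crash.
- Also check this directory:
/var/crash/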
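The first point can be sketched as a small shell function that tallies which kernel component wrote each line of a syslog excerpt. This is an illustration only: the name summarize_kernel_sources is made up, and it assumes the line layout seen in the excerpts in the question ("... kernel: [timestamp] component ...").

```shell
#!/bin/sh
# Tally which kernel component wrote each line of a syslog excerpt.
# Reads syslog-formatted lines on stdin; prints "<count> <component>",
# most frequent first.
summarize_kernel_sources() {
    awk '
        / kernel: / {
            # Drop everything up to and including the "[seconds.micros] " stamp.
            sub(/^.* kernel: \[[ 0-9.]*\] /, "")
            # The first remaining word names the component,
            # e.g. "nouveau" or "ACPI:" (strip a trailing colon).
            sub(/:$/, "", $1)
            count[$1]++
        }
        END { for (name in count) print count[name], name }
    ' | sort -rn
}
```

For example, `grep 'Feb 10 16:18' /var/log/syslog | summarize_kernel_sources` would summarize the minute before the second crash. A component that dominates the final lines is a reasonable first suspect, though it is not proof of the cause.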
As commenters pointed out, nouveau is the open-source NVIDIA driver.
As for the specific problem: I installed the proprietary NVIDIA driver, and I have not had a single crash since.
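To confirm which driver is actually bound to the NVIDIA device before and after switching, the "Kernel driver in use" line of lspci -k can be pulled out with a short filter. This is a rough sketch: the function name gpu_driver_in_use is made up, and it assumes the lspci -k layout shown in the question (an unindented device line followed by indented detail lines).

```shell
#!/bin/sh
# Print the kernel driver bound to each NVIDIA device.
# Reads `lspci -k` (or `lspci -v`) output on stdin.
gpu_driver_in_use() {
    awk '
        # Device header lines start at column 0 with the hex bus address.
        /^[0-9a-f]/ { is_nvidia = ($0 ~ /NVIDIA/) }
        # Detail lines belong to the most recent header.
        is_nvidia && /Kernel driver in use:/ { print $NF }
    '
}
```

Running `lspci -k | gpu_driver_in_use` on the system from the question would print nouveau; with the proprietary driver loaded it should print nvidia instead.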
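Answer 2
IBUS: https://en.m.wikipedia.org/wiki/Intelligent_Input_Bus
In my case, I had to unplug and replug my USB keyboard to end the freeze.
After upgrading all packages (sudo apt-get upgrade), the problem went away, so I do not know what caused it...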