我的系统经常但间歇性地锁定。我不确定下一步该做什么来排除故障和/或是否应该将此报告为错误。希望得到建议。请注意,我的问题与之前在radeon 0000:02:00.0:ring 0 停滞超过 10492 毫秒尚未得到回应。
我的系统每隔几天就会间歇性地锁定。发生这种情况时,屏幕会变黑,然后恢复。此时,时钟上的时间显示屏幕已冻结,但音频会继续播放 30 秒到 1 分钟。键盘和鼠标不会影响显示的内容,但我可以按 Alt - SysRq - REISUB 重新启动。
每次(6 次崩溃并且还在增加)时,/var/log/kern.log 都会显示
kernel: [353692.378886] radeon 0000:03:00.0: ring 0 stalled for more than 10280msec
kernel: [353692.378896] radeon 0000:03:00.0: GPU lockup (current fence id 0x00000000006e96e5 last fence id 0x00000000006e96e9 on ring 0)
系统锁定前。不同振铃次数报告的停顿时间不同。
我正在运行 16.04 LTS。我有一张 Radeon HD 显卡。
:~$ lspci | grep VGA
03:00.0 VGA compatible controller: Advanced Micro Devices, Inc. [AMD/ATI] Cedar [Radeon HD 5000/6000/7350/8350 Series]
:~$ lspci -v -s 03:00.0
03:00.0 VGA compatible controller: Advanced Micro Devices, Inc. [AMD/ATI] Cedar [Radeon HD 5000/6000/7350/8350 Series] (prog-if 00 [VGA controller])
Subsystem: Gigabyte Technology Co., Ltd Cedar [Radeon HD 5000/6000/7350/8350 Series]
Flags: bus master, fast devsel, latency 0, IRQ 53
Memory at c0000000 (64-bit, prefetchable) [size=256M]
Memory at d3d20000 (64-bit, non-prefetchable) [size=128K]
I/O ports at 7000 [size=256]
Expansion ROM at d3d00000 [disabled] [size=128K]
Capabilities: <access denied>
Kernel driver in use: radeon
Kernel modules: radeon
在寻找解决方案时,我发现了 14.04 的一个旧错误报告,目前尚未分配https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1409393。
[编辑:] 根据@MichaelBay 的建议,我运行了 Checkbox 显卡测试,一切顺利。我还运行了 MemoryTest。经过 3 次测试后未发现任何错误。最后,我加载了 Psensor 来检查 GPU 温度,但没有 GPU 读数。
答案1
我上radeon 0000:04:00.0: ring 0 stalled for more than 10240msec
我的[AMD/ATI] RV620 GL [FirePro 2450]当我在Ubuntu 20.04.5 LTS几分钟后,Firefox 或任何其他程序都没有问题,只有 Opera 有问题。
[128524.943553] radeon 0000:04:00.0: ring 0 stalled for more than 10240msec
[128524.943565] radeon 0000:04:00.0: GPU lockup (current fence id 0x000000000029caf6 last fence id 0x000000000029cafc on ring 0)
[128524.955392] radeon 0000:04:00.0: Saved 185 dwords of commands on ring 0.
[128524.955409] radeon 0000:04:00.0: GPU softreset: 0x00000009
[128524.955413] radeon 0000:04:00.0: R_008010_GRBM_STATUS = 0xA2303030
[128524.955417] radeon 0000:04:00.0: R_008014_GRBM_STATUS2 = 0x00000003
[128524.955420] radeon 0000:04:00.0: R_000E50_SRBM_STATUS = 0x200010C0
[128524.955423] radeon 0000:04:00.0: R_008674_CP_STALLED_STAT1 = 0x00000000
[128524.955426] radeon 0000:04:00.0: R_008678_CP_STALLED_STAT2 = 0x00008002
[128524.955429] radeon 0000:04:00.0: R_00867C_CP_BUSY_STAT = 0x00008086
[128524.955432] radeon 0000:04:00.0: R_008680_CP_STAT = 0x80018645
[128524.955435] radeon 0000:04:00.0: R_00D034_DMA_STATUS_REG = 0x44C83D57
[128525.013038] radeon 0000:04:00.0: R_008020_GRBM_SOFT_RESET=0x00007FEF
[128525.013097] radeon 0000:04:00.0: SRBM_SOFT_RESET=0x00000100
[128525.015187] radeon 0000:04:00.0: R_008010_GRBM_STATUS = 0xA0003030
[128525.015191] radeon 0000:04:00.0: R_008014_GRBM_STATUS2 = 0x00000003
[128525.015195] radeon 0000:04:00.0: R_000E50_SRBM_STATUS = 0x200080C0
[128525.015198] radeon 0000:04:00.0: R_008674_CP_STALLED_STAT1 = 0x00000000
[128525.015201] radeon 0000:04:00.0: R_008678_CP_STALLED_STAT2 = 0x00000000
[128525.015204] radeon 0000:04:00.0: R_00867C_CP_BUSY_STAT = 0x00000000
[128525.015207] radeon 0000:04:00.0: R_008680_CP_STAT = 0x80100000
[128525.015210] radeon 0000:04:00.0: R_00D034_DMA_STATUS_REG = 0x44C83D57
[128525.015220] radeon 0000:04:00.0: GPU reset succeeded, trying to resume
[128525.031584] [drm] PCIE gen 2 link speeds already enabled
[128525.034184] [drm] PCIE GART of 512M enabled (table at 0x0000000000142000).
[128525.034222] radeon 0000:04:00.0: WB enabled
[128525.034224] radeon 0000:04:00.0: fence driver on ring 0 use gpu addr 0x0000000010000c00
[128525.034579] radeon 0000:04:00.0: fence driver on ring 5 use gpu addr 0x00000000000521d0
[128525.034797] debugfs: File 'radeon_ring_gfx' in directory '0' already present!
[128525.066237] [drm] ring test on 0 succeeded in 1 usecs
[128525.066242] debugfs: File 'radeon_ring_uvd' in directory '0' already present!
[128525.240884] [drm] ring test on 5 succeeded in 1 usecs
[128525.240893] [drm] UVD initialized successfully.
[128535.695467] radeon 0000:04:00.0: ring 0 stalled for more than 10456msec
[128535.695479] radeon 0000:04:00.0: GPU lockup (current fence id 0x000000000029caf8 last fence id 0x000000000029cafc on ring 0)
[128535.697433] [drm:r600_ib_test [radeon]] *ERROR* radeon: fence wait failed (-35).
[128535.697551] [drm:radeon_ib_ring_tests [radeon]] *ERROR* radeon: failed testing IB on GFX ring (-35).