Ubuntu 20.04 随机冻结/崩溃 - 20 分钟到约 3 小时,当前 NVIDIA 驱动程序 - 535.54.03(如果我不使用 GPU 就不会冻结)
我尝试了不同的 Nvidia 驱动程序 530.30.02、525.85 和 Ubuntu 22.04。
free -h
total used free shared buff/cache available
Mem: 62Gi 6.1Gi 413Mi 1.0Gi 55Gi 54Gi
Swap: 2.0Gi 1.4Gi 597Mi
cat /proc/sys/vm/swappiness
60
sudo lshw -C memory
*-firmware
description: BIOS
vendor: American Megatrends Inc.
physical id: 0
version: 0603
date: 09/08/2022
size: 64KiB
capacity: 32MiB
capabilities: pci upgrade shadowing cdboot bootselect socketedrom edd int13floppynec int13floppytoshiba int13floppy360 int13floppy1200 int13floppy720 int13floppy2880 int5printscreen int14serial int17printer int10video acpi usb biosbootspecification uefi
*-cache:0
description: L1 cache
physical id: c
slot: L1 - Cache
size: 768KiB
capacity: 768KiB
clock: 1GHz (1.0ns)
capabilities: pipeline-burst internal write-back unified
configuration: level=1
*-cache:1
description: L2 cache
physical id: d
slot: L2 - Cache
size: 12MiB
capacity: 12MiB
clock: 1GHz (1.0ns)
capabilities: pipeline-burst internal write-back unified
configuration: level=2
*-cache:2
description: L3 cache
physical id: e
slot: L3 - Cache
size: 64MiB
capacity: 64MiB
clock: 1GHz (1.0ns)
capabilities: pipeline-burst internal write-back unified
configuration: level=3
*-memory
description: System Memory
physical id: 11
slot: System board or motherboard
size: 64GiB
*-bank:0
description: [empty]
product: Unknown
vendor: Unknown
physical id: 0
serial: Unknown
slot: DIMM 0
*-bank:1
description: DIMM Synchronous Unbuffered (Unregistered) 4800 MHz (0.2 ns)
product: F5-5600J3636D32G
vendor: Unknown
physical id: 1
serial: 00000000
slot: DIMM 1
size: 32GiB
width: 64 bits
clock: 505MHz (2.0ns)
*-bank:2
description: [empty]
product: Unknown
vendor: Unknown
physical id: 2
serial: Unknown
slot: DIMM 0
*-bank:3
description: DIMM Synchronous Unbuffered (Unregistered) 4800 MHz (0.2 ns)
product: F5-5600J3636D32G
vendor: Unknown
physical id: 3
serial: 00000000
slot: DIMM 1
size: 32GiB
width: 64 bits
clock: 505MHz (2.0ns)
期刊CTL- 这是一个大文件,请搜索“BUG:处理过程中的页面状态不良”
我也尝试过memtester
- 它没有返回任何错误
我已经为此苦苦挣扎了一个多星期,我真的很感激任何建议/帮助。我也在这个论坛上遇到过许多类似的问题。