Ubuntu 20.04 每隔 1-2 天崩溃一次

Ubuntu 20.04 每隔 1-2 天崩溃一次

大约一周前,我将用作 NAS 的计算机从 Windows 切换到 Ubuntu 20.04 LTS,从那时起,我就遇到 NAS 每隔 1-2 天崩溃一次的问题。

当 NAS 崩溃时,整个屏幕都会冻结,包括鼠标移动。即使 REISUB 也无法重新启动系统。我总是不得不按下计算机上的重置按钮。

NAS 运行的是 Ryzen 7 1700 处理器,所以我认为这可能是电源状态问题。我发现的大多数建议都说要禁用主板 BIOS 中的 C6 电源状态,但我的主板(Asus x370-f strix)似乎没有这个选项。我确实找到了一个使用 MX Linux 的人,他说他们用命令禁用了 C6 电源状态cpupower idle-set --disable c6。我试过了,但问题仍然存在。

我在上次崩溃后运行了该命令journalctl | grep error ,这些是上次崩溃当天唯一的错误消息。

Jun 03 21:22:22 NAS-PC kernel: EXT4-fs (sdb2): re-mounted. Opts: errors=remount-ro
Jun 03 21:22:25 NAS-PC systemd[1]: Condition check resulted in Process error reports when automatic reporting is enabled (file watch) being skipped.
Jun 03 21:22:26 NAS-PC gnome-session[1435]: gnome-session-binary[1435]: WARNING: Falling back to non-systemd startup procedure due to error: GDBus.Error:org.freedesktop.DBus.Error.Spawn.ChildExited: Process org.freedesktop.systemd1 exited with status 1
Jun 03 21:22:26 NAS-PC gnome-session-binary[1435]: WARNING: Falling back to non-systemd startup procedure due to error: GDBus.Error:org.freedesktop.DBus.Error.Spawn.ChildExited: Process org.freedesktop.systemd1 exited with status 1
Jun 03 21:22:33 NAS-PC /usr/lib/gdm3/gdm-x-session[2177]:         (WW) warning, (EE) error, (NI) not implemented, (??) unknown.

我对 Linux 还不太熟悉,所以我不知道这是否是我应该用于错误日志的命令,如果有更好的方法来检查错误,请告诉我。

任何帮助都将不胜感激,谢谢。

编辑:这是我被要求运行的命令的结果。

nas@NAS-PC:~$ sudo dmidecode -s bios-version
3803
nas@NAS-PC:~$ free -h
              total        used        free      shared  buff/cache   available
Mem:           15Gi       2.0Gi        11Gi       5.0Mi       2.2Gi        13Gi
Swap:         2.0Gi          0B       2.0Gi

nas@NAS-PC:~$ sysctl vm.swappiness
vm.swappiness = 60
nas@NAS-PC:~$ sudo lshw -C memory
  *-firmware                
       description: BIOS
       vendor: American Megatrends Inc.
       physical id: 0
       version: 3803
       date: 01/22/2018
       size: 64KiB
       capacity: 16MiB
       capabilities: pci apm upgrade shadowing cdboot bootselect socketedrom edd int13floppy1200 int13floppy720 int13floppy2880 int5printscreen int9keyboard int14serial int17printer acpi usb biosbootspecification uefi
  *-memory
       description: System Memory
       physical id: b
       slot: System board or motherboard
       size: 16GiB
     *-bank:0
          description: [empty]
          product: Unknown
          vendor: Unknown
          physical id: 0
          serial: Unknown
          slot: DIMM_A1
     *-bank:1
          description: DIMM DDR4 Synchronous Unbuffered (Unregistered) 2133 MHz (0.5 ns)
          product: CMW16GX4M2C3200C16
          vendor: Unknown
          physical id: 1
          serial: 00000000
          slot: DIMM_A2
          size: 8GiB
          width: 64 bits
          clock: 2133MHz (0.5ns)
     *-bank:2
          description: [empty]
          product: Unknown
          vendor: Unknown
          physical id: 2
          serial: Unknown
          slot: DIMM_B1
     *-bank:3
          description: DIMM DDR4 Synchronous Unbuffered (Unregistered) 2133 MHz (0.5 ns)
          product: CMW16GX4M2C3200C16
          vendor: Unknown
          physical id: 3
          serial: 00000000
          slot: DIMM_B2
          size: 8GiB
          width: 64 bits
          clock: 2133MHz (0.5ns)
  *-cache:0
       description: L1 cache
       physical id: d
       slot: L1 - Cache
       size: 768KiB
       capacity: 768KiB
       clock: 1GHz (1.0ns)
       capabilities: pipeline-burst internal write-back unified
       configuration: level=1
  *-cache:1
       description: L2 cache
       physical id: e
       slot: L2 - Cache
       size: 4MiB
       capacity: 4MiB
       clock: 1GHz (1.0ns)
       capabilities: pipeline-burst internal write-back unified
       configuration: level=2
  *-cache:2
       description: L3 cache
       physical id: f
       slot: L3 - Cache
       size: 16MiB
       capacity: 16MiB
       clock: 1GHz (1.0ns)
       capabilities: pipeline-burst internal write-back unified
       configuration: level=3

编辑:主板是华硕 rog stix x370-f gaming。这是grep -i sdb2 /var/log/syslog* 从 6 月 3 日崩溃时没有显示任何内容的结果,但今天我尝试在 USB 记忆棒上设置内存测试时确实遇到了几次崩溃。希望这里的数据有用。

/var/log/syslog:Jun  4 09:55:54 NAS-PC kernel: [    1.566219]  sdb: sdb1 sdb2
/var/log/syslog:Jun  4 09:55:54 NAS-PC kernel: [    4.959148] EXT4-fs (sdb2): mounted filesystem with ordered data mode. Opts: (null)
/var/log/syslog:Jun  4 09:55:54 NAS-PC kernel: [    5.499027] EXT4-fs (sdb2): re-mounted. Opts: errors=remount-ro
/var/log/syslog:Jun  4 10:00:29 NAS-PC kernel: [    1.554147]  sdb: sdb1 sdb2
/var/log/syslog:Jun  4 10:00:29 NAS-PC kernel: [    4.965393] EXT4-fs (sdb2): mounted filesystem with ordered data mode. Opts: (null)
/var/log/syslog:Jun  4 10:00:29 NAS-PC kernel: [    5.441670] EXT4-fs (sdb2): re-mounted. Opts: errors=remount-ro
/var/log/syslog:Jun  4 10:08:32 NAS-PC kernel: [    1.546202]  sdb: sdb1 sdb2
/var/log/syslog:Jun  4 10:08:32 NAS-PC kernel: [    4.973591] EXT4-fs (sdb2): mounted filesystem with ordered data mode. Opts: (null)
/var/log/syslog:Jun  4 10:08:32 NAS-PC kernel: [    5.444522] EXT4-fs (sdb2): re-mounted. Opts: errors=remount-ro
/var/log/syslog:Jun  4 10:18:38 NAS-PC kernel: [    1.542013]  sdb: sdb1 sdb2
/var/log/syslog:Jun  4 10:18:38 NAS-PC kernel: [    4.934979] EXT4-fs (sdb2): mounted filesystem with ordered data mode. Opts: (null)
/var/log/syslog:Jun  4 10:18:38 NAS-PC kernel: [    5.395831] EXT4-fs (sdb2): re-mounted. Opts: errors=remount-ro
/var/log/syslog:Jun  4 10:30:06 NAS-PC kernel: [    1.549958]  sdb: sdb1 sdb2
/var/log/syslog:Jun  4 10:30:06 NAS-PC kernel: [    4.848546] EXT4-fs (sdb2): mounted filesystem with ordered data mode. Opts: (null)
/var/log/syslog:Jun  4 10:30:06 NAS-PC kernel: [    5.322608] EXT4-fs (sdb2): re-mounted. Opts: errors=remount-ro
Binary file /var/log/syslog.1 matches

我的系统中有 3 个驱动器:1 个 120gb ssd,2 个 6tb hdd,使用 mdadm 设置为 RAID 1。它们都没有安装 Windows。以下是结果sudo fdisk -l

Disk /dev/loop0: 55.48 MiB, 58159104 bytes, 113592 sectors
Units: sectors of 1 * 512 = 512 bytes
Sector size (logical/physical): 512 bytes / 512 bytes
I/O size (minimum/optimal): 512 bytes / 512 bytes


Disk /dev/loop1: 138.93 MiB, 145657856 bytes, 284488 sectors
Units: sectors of 1 * 512 = 512 bytes
Sector size (logical/physical): 512 bytes / 512 bytes
I/O size (minimum/optimal): 512 bytes / 512 bytes


Disk /dev/loop2: 55.43 MiB, 58114048 bytes, 113504 sectors
Units: sectors of 1 * 512 = 512 bytes
Sector size (logical/physical): 512 bytes / 512 bytes
I/O size (minimum/optimal): 512 bytes / 512 bytes


Disk /dev/loop3: 218.102 MiB, 229629952 bytes, 448496 sectors
Units: sectors of 1 * 512 = 512 bytes
Sector size (logical/physical): 512 bytes / 512 bytes
I/O size (minimum/optimal): 512 bytes / 512 bytes


Disk /dev/loop4: 51.4 MiB, 53522432 bytes, 104536 sectors
Units: sectors of 1 * 512 = 512 bytes
Sector size (logical/physical): 512 bytes / 512 bytes
I/O size (minimum/optimal): 512 bytes / 512 bytes


Disk /dev/loop5: 162.89 MiB, 170778624 bytes, 333552 sectors
Units: sectors of 1 * 512 = 512 bytes
Sector size (logical/physical): 512 bytes / 512 bytes
I/O size (minimum/optimal): 512 bytes / 512 bytes


Disk /dev/loop6: 32.9 MiB, 33644544 bytes, 65712 sectors
Units: sectors of 1 * 512 = 512 bytes
Sector size (logical/physical): 512 bytes / 512 bytes
I/O size (minimum/optimal): 512 bytes / 512 bytes


Disk /dev/loop7: 64.79 MiB, 67915776 bytes, 132648 sectors
Units: sectors of 1 * 512 = 512 bytes
Sector size (logical/physical): 512 bytes / 512 bytes
I/O size (minimum/optimal): 512 bytes / 512 bytes


Disk /dev/sda: 5.47 TiB, 6001175126016 bytes, 11721045168 sectors
Disk model: WDC WD6002FFWX-6
Units: sectors of 1 * 512 = 512 bytes
Sector size (logical/physical): 512 bytes / 4096 bytes
I/O size (minimum/optimal): 4096 bytes / 4096 bytes
Disklabel type: gpt
Disk identifier: 25D7A7C5-BE1E-4DE1-BE76-AF1CCCA56E55

Device     Start         End     Sectors  Size Type
/dev/sda1   2048 11721043967 11721041920  5.5T Microsoft basic data


Disk /dev/sdb: 111.81 GiB, 120034123776 bytes, 234441648 sectors
Disk model: KINGSTON SA400S3
Units: sectors of 1 * 512 = 512 bytes
Sector size (logical/physical): 512 bytes / 512 bytes
I/O size (minimum/optimal): 512 bytes / 512 bytes
Disklabel type: gpt
Disk identifier: 1FFE61F6-6E4D-45CD-80EB-61F0F8CFF851

Device       Start       End   Sectors   Size Type
/dev/sdb1     2048   1050623   1048576   512M EFI System
/dev/sdb2  1050624 234440703 233390080 111.3G Linux filesystem


Disk /dev/sdc: 5.47 TiB, 6001175126016 bytes, 11721045168 sectors
Disk model: WDC WD6002FFWX-6
Units: sectors of 1 * 512 = 512 bytes
Sector size (logical/physical): 512 bytes / 4096 bytes
I/O size (minimum/optimal): 4096 bytes / 4096 bytes
Disklabel type: gpt
Disk identifier: 78290624-A970-432A-978C-B8D7F9A79561

Device     Start         End     Sectors  Size Type
/dev/sdc1   2048 11721043967 11721041920  5.5T Microsoft basic data


Disk /dev/md0: 5.47 TiB, 6001038196736 bytes, 11720777728 sectors
Units: sectors of 1 * 512 = 512 bytes
Sector size (logical/physical): 512 bytes / 4096 bytes
I/O size (minimum/optimal): 4096 bytes / 4096 bytes
Disklabel type: dos
Disk identifier: 0x00000000


Disk /dev/loop8: 32.1 MiB, 33660928 bytes, 65744 sectors
Units: sectors of 1 * 512 = 512 bytes
Sector size (logical/physical): 512 bytes / 512 bytes
I/O size (minimum/optimal): 512 bytes / 512 bytes


Disk /dev/loop9: 65.1 MiB, 68259840 bytes, 133320 sectors
Units: sectors of 1 * 512 = 512 bytes
Sector size (logical/physical): 512 bytes / 512 bytes
I/O size (minimum/optimal): 512 bytes / 512 bytes


Disk /dev/loop10: 141.4 MiB, 148254720 bytes, 289560 sectors
Units: sectors of 1 * 512 = 512 bytes
Sector size (logical/physical): 512 bytes / 512 bytes
I/O size (minimum/optimal): 512 bytes / 512 bytes

以下是DISK智能数据的截图(RAID阵列本身显然没有任何智能数据)

6.0TB RAID-1 阵列

金士顿 SA400S37120G (SBFK71E0)

WDC WD6002FFWX-68TZ4N0 (83.H0A83)(NCGWXW2V)

WDC WD6002FFWX-68TZ4N0 (83.H0A83)(NCH9XLDS)

编辑:这是 ls -al /var/crash。Cpupower-gui 是我尝试禁用 C6 电源状态的一部分,但我最终没有使用此应用程序。

total 15744
drwxrwsrwt  2 root     whoopsie     4096 Jun  4 07:37  .
drwxr-xr-x 14 root     root         4096 Feb 10 05:56  ..
-rw-r-----  1 nas      whoopsie    66768 Jun  2 11:52  _usr_bin_cpupower-gui.1000.crash
-rw-rw-r--  1 nas      whoopsie        0 Jun  2 11:52  _usr_bin_cpupower-gui.1000.upload
-rw-------  1 whoopsie whoopsie       37 Jun  2 11:52  _usr_bin_cpupower-gui.1000.uploaded
-rw-r-----  1 plex     whoopsie 16036359 May 30 02:15 '_usr_lib_plexmediaserver_Plex Media Server.998.crash'

编辑:这是sudo lshw -C memory新的 RAM 不在 QVL 上

      *-firmware                
       description: BIOS
       vendor: American Megatrends Inc.
       physical id: 0
       version: 5603
       date: 07/28/2020
       size: 64KiB
       capacity: 16MiB
       capabilities: pci apm upgrade shadowing cdboot bootselect socketedrom edd int13floppy1200 int13floppy720 int13floppy2880 int5printscreen int9keyboard int14serial int17printer acpi usb biosbootspecification uefi
  *-memory
       description: System Memory
       physical id: 2c
       slot: System board or motherboard
       size: 16GiB
     *-bank:0
          description: Project-Id-Version: lshwReport-Msgid-Bugs-To: FULL NAME <EMAIL@ADDRESS>PO-Revision-Date: 2012-02-02 13:04+0000Last-Translator: Joel Addison <[email protected]>Language-Team: English (Australia) <[email protected]>MIME-Version: 1.0Content-Type: text/plain; charset=UTF-8Content-Transfer-Encoding: 8bitX-Launchpad-Export-Date: 2021-01-21 18:43+0000X-Generator: Launchpad (build 2d1d5e352f0d063d660df2300e31f66bed027fa5)Project-Id-Version: lshwReport-Msgid-Bugs-To: FULL NAME <EMAIL@ADDRESS>PO-Revision-Date: 2012-02-02 13:04+0000Last-Translator: Joel Addison <[email protected]>Language-Team: English (Australia) <[email protected]>MIME-Version: 1.0Content-Type: text/plain; charset=UTF-8Content-Transfer-Encoding: 8bitX-Launchpad-Export-Date: 2021-01-21 18:43+0000X-Generator: Launchpad (build 2d1d5e352f0d063d660df2300e31f66bed027fa5) [empty]
          product: Unknown
          vendor: Unknown
          physical id: 0
          serial: Unknown
          slot: DIMM_A1
     *-bank:1
          description: DIMM DDR4 Synchronous Unbuffered (Unregistered) 2400 MHz (0.4 ns)
          product: KHX2400C15D4/8G
          vendor: Kingston
          physical id: 1
          serial: 9321D353
          slot: DIMM_A2
          size: 8GiB
          width: 64 bits
          clock: 2400MHz (0.4ns)
     *-bank:2
          description: Project-Id-Version: lshwReport-Msgid-Bugs-To: FULL NAME <EMAIL@ADDRESS>PO-Revision-Date: 2012-02-02 13:04+0000Last-Translator: Joel Addison <[email protected]>Language-Team: English (Australia) <[email protected]>MIME-Version: 1.0Content-Type: text/plain; charset=UTF-8Content-Transfer-Encoding: 8bitX-Launchpad-Export-Date: 2021-01-21 18:43+0000X-Generator: Launchpad (build 2d1d5e352f0d063d660df2300e31f66bed027fa5)Project-Id-Version: lshwReport-Msgid-Bugs-To: FULL NAME <EMAIL@ADDRESS>PO-Revision-Date: 2012-02-02 13:04+0000Last-Translator: Joel Addison <[email protected]>Language-Team: English (Australia) <[email protected]>MIME-Version: 1.0Content-Type: text/plain; charset=UTF-8Content-Transfer-Encoding: 8bitX-Launchpad-Export-Date: 2021-01-21 18:43+0000X-Generator: Launchpad (build 2d1d5e352f0d063d660df2300e31f66bed027fa5) [empty]
          product: Unknown
          vendor: Unknown
          physical id: 2
          serial: Unknown
          slot: DIMM_B1
     *-bank:3
          description: DIMM DDR4 Synchronous Unbuffered (Unregistered) 2400 MHz (0.4 ns)
          product: KHX2400C15D4/8G
          vendor: Kingston
          physical id: 3
          serial: 90216553
          slot: DIMM_B2
          size: 8GiB
          width: 64 bits
          clock: 2400MHz (0.4ns)
  *-cache:0
       description: L1 cache
       physical id: 2e
       slot: L1 - Cache
       size: 768KiB
       capacity: 768KiB
       clock: 1GHz (1.0ns)
       capabilities: pipeline-burst internal write-back unified
       configuration: level=1
  *-cache:1
       description: L2 cache
       physical id: 2f
       slot: L2 - Cache
       size: 4MiB
       capacity: 4MiB
       clock: 1GHz (1.0ns)
       capabilities: pipeline-burst internal write-back unified
       configuration: level=2
  *-cache:2
       description: L3 cache
       physical id: 30
       slot: L3 - Cache
       size: 16MiB
       capacity: 16MiB
       clock: 1GHz (1.0ns)
       capabilities: pipeline-burst internal write-back unified
       configuration: level=3

答案1

BIOS

华硕 ROG STRIX X370-F GAMING

你有一个真的很老自 2018 年 1 月 22 日起,BIOS 版本为 3803。自 2020 年 8 月 10 日起,有更新的 BIOS 版本 5603 可用,可从这里

笔记:确认我拥有您型号主板的正确网页。

笔记:更新 BIOS 之前请做好备份。

笔记:某些 BIOS 允许直接从当前 BIOS 中更新。否则,如果您没有 Windows,请尝试 FreeDOShttp://www.freedos.org/

记忆

锐龙 7 1700

Ryzen 处理器对 RAM 非常挑剔。

Corsair CMW16GX4M2C3200C16 未出现在受支持内存列表文档中。请参阅 CPU/内存文档这里

https://www.memtest86.com/并免费下载/运行它们memtest来测试你的记忆力。至少完成一次所有 4/4 测试以确认记忆力良好。这可能需要几个小时才能完成。

磁盘

金士顿 SA400S37120G (SBFK71E0)

笔记:检查 SSD 的固件更新。获取Kingston SSD Manager 这里

/dev/sdb2 正在以只读方式重新挂载。

SMART 数据看起来不错。

让我们检查 /dev/sdb2 上的 Linux 文件系统...

  • 以“试用 Ubuntu”模式启动 Ubuntu Live DVD/USB
  • terminalCtrl+ Alt+打开窗口T
  • 类型sudo fdisk -l
  • 识别“Linux 文件系统”的 /dev/sdXX 设备名称
  • 输入sudo fsck -f /dev/sdb2,替换sdXX为您之前找到的数字
  • fsck如果有错误则重复命令
  • 类型reboot

交换

您的 2G /swapfile 太小了。让我们将其增加到 4G...

笔记:错误使用rmdd命令可能会导致数据丢失。建议复制/粘贴。

在里面terminal...

sudo swapoff -a           # turn off swap
sudo rm -i /swapfile      # remove old /swapfile

sudo dd if=/dev/zero of=/swapfile bs=1M count=4096

sudo chmod 600 /swapfile  # set proper file protections
sudo mkswap /swapfile     # init /swapfile
sudo swapon /swapfile     # turn on swap
free -h                   # confirm 16G RAM and 4G swap

sudo -H gedit /etc/fstab使用或编辑 /etc/fstab sudo pico /etc/fstab

确认 /etc/fstab 中的此 /swapfile 行...并确认没有其他“交换”行...在此行中使用空格...确认没有制表符...

/swapfile  none  swap  sw  0  0

reboot                    # reboot and verify operation

更新#1:

memtest原始内存确实出现故障。更换内存后,memtest现在运行无错误。

更新 #2:

重新安装 GPU。

相关内容