“无法写入条目……只读文件系统……EXT-FS错误”:Debian 11突然崩溃

“无法写入条目……只读文件系统……EXT-FS错误”:Debian 11突然崩溃
Debian 11 Stable
KDE Plasma 5.20.5

我的系统大约每天崩溃一次。我在纸上写下了似乎最相关的错误:

[83262.955525] systemd-journal [301]: failed to write entry (22 items, 747 bytes), 
ignoring: Read-only file system

EXT-FS Error (device sda1) __ext4_find_entry:1534 inode #1573987: 
com gmain: reading directory lblock 0

我找到了这张照片这里

在此输入图像描述

这几乎就是我的屏幕在崩溃时的样子。

驱动器和分区:

Drives:    Local Storage: total: 1.38 TiB used: 853.2 GiB (60.6%) 
           ID-1: /dev/sda vendor: Samsung model: SSD 860 EVO 1TB size: 931.51 GiB 
           ID-2: /dev/sdb vendor: Samsung model: SSD 850 PRO 512GB size: 476.94 GiB 
Partition: ID-1: / size: 45.53 GiB used: 8.3 GiB (18.2%) fs: ext4 dev: /dev/sda1 
           ID-2: /home size: 869.04 GiB used: 755.96 GiB (87.0%) fs: ext4 dev: /dev/sda3 
Swap:      ID-1: swap-1 type: partition size: 976 MiB used: 0 KiB (0.0%) dev: /dev/sda5 

我还没有尝试在此安装上安装任何专有驱动程序。我使用这台计算机进行工作,非专有驱动程序的性能似乎足以满足我的工作需求(除非以某种方式需要它们来阻止这些崩溃)。

我在日志文件中看到的唯一错误与固件有关,我认为 debian 不需要:

[    0.101567] DMAR: [Firmware Bug]: No firmware reserved region can cover this RMRR [0x00000000bdeac000-0x00000000bdecbfff], contact BIOS vendor for fixes
[    0.101697] DMAR: [Firmware Bug]: Your BIOS is broken; bad RMRR [0x00000000bdeac000-0x00000000bdecbfff]
               BIOS vendor: Hewlett-Packard; Ver: F.25; Product Version: 0499220000241210001040000
[    0.237433] core: CPUID marked event: 'bus cycles' unavailable
[    0.244065] MDS CPU bug present and SMT on, data leak possible. See https://www.kernel.org/doc/html/latest/admin-guide/hw-vuln/mds.html for more details.
[    0.245624]  #5 #6 #7
[    0.266653] mtrr: your CPUs had inconsistent variable MTRR settings
[    1.889198] [Firmware Bug]: Invalid critical threshold (0)
[    2.306443] ACPI Warning: SystemIO range 0x0000000000000428-0x000000000000042F conflicts with OpRegion 0x0000000000000400-0x000000000000047F (\PMIO) (20200925/utaddress-204)
[    2.306458] ACPI Warning: SystemIO range 0x0000000000000540-0x000000000000054F conflicts with OpRegion 0x0000000000000500-0x0000000000000563 (\GPIO) (20200925/utaddress-204)
[    2.306467] ACPI Warning: SystemIO range 0x0000000000000530-0x000000000000053F conflicts with OpRegion 0x0000000000000500-0x0000000000000563 (\GPIO) (20200925/utaddress-204)
[    2.306476] ACPI Warning: SystemIO range 0x0000000000000500-0x000000000000052F conflicts with OpRegion 0x0000000000000500-0x0000000000000563 (\GPIO) (20200925/utaddress-204)
[    2.306484] lpc_ich: Resource conflict(s) found affecting gpio_ich
[    2.354733] r8169 0000:03:00.0: can't disable ASPM; OS doesn't have ASPM control
[    2.740251] nouveau 0000:01:00.0: bios: OOB 1 015f1901 015f1901
[    2.764763] ata5.00: supports DRM functions and may not be fully accessible
[    2.765804] ata1.00: supports DRM functions and may not be fully accessible
[    2.768822] ata1.00: supports DRM functions and may not be fully accessible
[    2.776029] ata5.00: supports DRM functions and may not be fully accessible
[    5.256428] systemd[1]: /lib/systemd/system/plymouth-start.service:16: Unit configured to use KillMode=none. This is unsafe, as it disables systemd's process lifecycle management for the service. Please update your service to use a safer KillMode=, such as 'mixed' or 'control-group'. Support for KillMode=none is deprecated and will eventually be removed.
[    5.599321] systemd-journald[301]: File /var/log/journal/e60bce5c0cc141a5b1ca070182b03357/system.journal corrupted or uncleanly shut down, renaming and replacing.
[    5.631756] i801_smbus 0000:00:1f.3: BIOS is accessing SMBus registers
[    5.631758] i801_smbus 0000:00:1f.3: Driver SMBus register access inhibited
[    5.711693] at24 0-0050: supply vcc not found, using dummy regulator
[    5.765689] rc rc0: nonsensical timing event of duration 0
[    5.765692] rc rc0: two consecutive events of type space
[    5.895968] iwlwifi 0000:02:00.0: can't disable ASPM; OS doesn't have ASPM control
[    5.900275] iwlwifi 0000:02:00.0: firmware: failed to load iwlwifi-6000-4.ucode (-2)
[    5.900277] firmware_class: See https://wiki.debian.org/Firmware for information about missing firmware
[    5.900279] iwlwifi 0000:02:00.0: Direct firmware load for iwlwifi-6000-4.ucode failed with error -2
[    5.900284] iwlwifi 0000:02:00.0: iwlwifi-6000-4 is required
[    5.900286] iwlwifi 0000:02:00.0: check git://git.kernel.org/pub/scm/linux/kernel/git/firmware/linux-firmware.git
[    6.136626] kvm: VM_EXIT_LOAD_IA32_PERF_GLOBAL_CTRL does not work properly. Using workaround
[    6.765931] r8169 0000:03:00.0: firmware: failed to load rtl_nic/rtl8168d-2.fw (-2)
[    6.766420] r8169 0000:03:00.0: Direct firmware load for rtl_nic/rtl8168d-2.fw failed with error -2
[    6.766425] r8169 0000:03:00.0: Unable to load firmware rtl_nic/rtl8168d-2.fw (-2)
[    7.019228] L1TF CPU bug present and SMT on, data leak possible. See CVE-2018-3646 and https://www.kernel.org/doc/html/latest/admin-guide/hw-vuln/l1tf.html for details.
[    7.020441] kvm: KVM_SET_TSS_ADDR need to be called before entering vcpu
[   20.142355] systemd-journald[301]: File /var/log/journal/e60bce5c0cc141a5b1ca070182b03357/user-1000.journal corrupted or uncleanly shut down, renaming and replacing.
[   24.444130] nouveau 0000:01:00.0: firmware: failed to load nouveau/nva5_fuc084 (-2)
[   24.444136] nouveau 0000:01:00.0: Direct firmware load for nouveau/nva5_fuc084 failed with error -2
[   24.444149] nouveau 0000:01:00.0: firmware: failed to load nouveau/nva5_fuc084d (-2)
[   24.444151] nouveau 0000:01:00.0: Direct firmware load for nouveau/nva5_fuc084d failed with error -2
[   24.444154] nouveau 0000:01:00.0: msvld: unable to load firmware data
[   24.444157] nouveau 0000:01:00.0: msvld: init failed, -19
[   24.505830] CE: hpet5 increased min_delta_ns to 20115 nsec
[   25.499297] CE: hpet6 increased min_delta_ns to 20115 nsec
[   34.557568] hrtimer: interrupt took 14722 ns
[ 2760.299762] CE: hpet3 increased min_delta_ns to 20115 nsec
[ 2979.256577] CE: hpet increased min_delta_ns to 20115 nsec
[ 3050.545325] show_signal_msg: 19 callbacks suppressed
[ 3053.040108] CE: hpet7 increased min_delta_ns to 20115 nsec
[ 5509.560255] CE: hpet4 increased min_delta_ns to 20115 nsec

免费驱动一般都稳定吗?这是我的视频卡:

lspci -vnn | grep -A12 'VGA\|Display'
01:00.0 VGA compatible controller [0300]: NVIDIA Corporation GT216M [GeForce GT 230M] [10de:0a28] (rev a2) (prog-if 00 [VGA controller])
    DeviceName: NVIDIA Video Graphics Controller
    Subsystem: Hewlett-Packard Company GT216M [GeForce GT 230M] [103c:7001]
    Flags: bus master, fast devsel, latency 0, IRQ 33, IOMMU group 16
    Memory at d2000000 (32-bit, non-prefetchable) [size=16M]
    Memory at c0000000 (64-bit, prefetchable) [size=256M]
    Memory at d0000000 (64-bit, prefetchable) [size=32M]
    I/O ports at 6000 [size=128]
    Expansion ROM at 000c0000 [disabled] [size=128K]
    Capabilities: <access denied>
    Kernel driver in use: nouveau
    Kernel modules: nouveau

我不擅长确定这些崩溃的确切原因。任何建议表示赞赏。

这看起来像我可以修复的东西吗?如果是这样,怎么办?

更新

LinuxSecurityFreak的回答中,他建议在重新启动时强制进行 fsck 修复。执行此操作后,我在启动日志中发现了这一点:

猫 /var/log/boot.log

------------ Tue Dec 07 06:01:46 CST 2021 ------------
/dev/sda1: recovering journal
/dev/sda1: Clearing orphaned inode 1966113 (uid=1000, gid=1000, mode=0100600, size=3538944)
/dev/sda1: Clearing orphaned inode 1966101 (uid=1000, gid=1000, mode=0100600, size=9830400)
/dev/sda1: Clearing orphaned inode 2802080 (uid=0, gid=0, mode=0100644, size=71592)
/dev/sda1: Clearing orphaned inode 2802077 (uid=0, gid=0, mode=0100644, size=917632)
/dev/sda1: Clearing orphaned inode 2802076 (uid=0, gid=0, mode=0100644, size=191416)
/dev/sda1: Clearing orphaned inode 2802075 (uid=0, gid=0, mode=0100644, size=190368)
/dev/sda1: Clearing orphaned inode 2802073 (uid=0, gid=0, mode=0100644, size=34728)
/dev/sda1: Clearing orphaned inode 2802071 (uid=0, gid=0, mode=0100644, size=18352)
/dev/sda1: Clearing orphaned inode 2802069 (uid=0, gid=0, mode=0100644, size=18352)
/dev/sda1: Clearing orphaned inode 2802067 (uid=0, gid=0, mode=0100644, size=14256)
/dev/sda1: Clearing orphaned inode 2802065 (uid=0, gid=0, mode=0100644, size=14256)
/dev/sda1: Clearing orphaned inode 2802063 (uid=0, gid=0, mode=0100644, size=22448)
/dev/sda1: Clearing orphaned inode 2802061 (uid=0, gid=0, mode=0100644, size=14256)
/dev/sda1: Clearing orphaned inode 2802059 (uid=0, gid=0, mode=0100644, size=14328)
/dev/sda1: Clearing orphaned inode 2802057 (uid=0, gid=0, mode=0100644, size=14256)
/dev/sda1: clean, 255277/3055616 files, 2444203/12206848 blocks

------------ Tue Dec 07 07:08:18 CST 2021 ------------
e2fsck 1.46.2 (28-Feb-2021)
Pass 1: Checking inodes, blocks, and sizes
Pass 2: Checking directory structure                                           
Pass 3: Checking directory connectivity                                        
Pass 4: Checking reference counts
Pass 5: Checking group summary information                                     
/dev/sda1: 255255/3055616 files (0.1% non-contiguous), 2446557/12206848 blocks 

更新2

BIOS 是我在 HP 网站上找到的最新版本(Hewlett-Packard v:F.25)。可能是一个后来的BIOS,但我不确定我是否可以信任该来源。

因西-Fx

System:    Host: sidekick Kernel: 5.10.0-9-amd64 x86_64 bits: 64 compiler: gcc v: 10.2.1 
           Desktop: KDE Plasma 5.20.5 Distro: Debian GNU/Linux 11 (bullseye) 
Machine:   Type: Laptop System: Hewlett-Packard product: HP Pavilion dv8 Notebook PC 
           v: 0499220000241210001040000 serial: CNF02839BM 
           Mobo: Hewlett-Packard model: 7001 v: 35.35 serial: CNF02839BM BIOS: Hewlett-Packard v: F.25 
           date: 05/31/2010 
Battery:   ID-1: BAT0 charge: 0% condition: 93.1/365.8 Wh (25%) model: Hewlett-Packard Primary status: Unknown 
CPU:       Info: Quad Core model: Intel Core i7 Q 740 bits: 64 type: MT MCP arch: Nehalem rev: 5 L2 cache: 6 MiB 
           flags: lm nx pae sse sse2 sse3 sse4_1 sse4_2 ssse3 vmx bogomips: 27668 
           Speed: 931 MHz min/max: 933/1734 MHz boost: enabled Core speeds (MHz): 1: 931 2: 931 3: 931 4: 931 
           5: 931 6: 931 7: 931 8: 931 
Graphics:  Device-1: NVIDIA GT216M [GeForce GT 230M] vendor: Hewlett-Packard driver: nouveau v: kernel 
           bus ID: 01:00.0 
           Device-2: Quanta HP Webcam type: USB driver: uvcvideo bus ID: 2-1.5:5 
           Display: x11 server: X.Org 1.20.11 driver: loaded: modesetting unloaded: fbdev,vesa resolution: 
           1: 1920x1080~60Hz 2: 1920x1080~60Hz 
           OpenGL: renderer: NVA5 v: 3.3 Mesa 20.3.5 direct render: Yes 
Audio:     Device-1: Intel 5 Series/3400 Series High Definition Audio vendor: Hewlett-Packard 
           driver: snd_hda_intel v: kernel bus ID: 00:1b.0 
           Device-2: NVIDIA GT216 HDMI Audio vendor: Hewlett-Packard driver: snd_hda_intel v: kernel 
           bus ID: 01:00.1 
           Sound Server: ALSA v: k5.10.0-9-amd64 
Network:   Device-1: Intel Centrino Advanced-N 6200 driver: N/A port: 6000 bus ID: 02:00.0 
           Device-2: Realtek RTL8111/8168/8411 PCI Express Gigabit Ethernet vendor: Hewlett-Packard 
           driver: r8169 v: kernel port: 4000 bus ID: 03:00.0 
           IF: enp3s0 state: up speed: 1000 Mbps duplex: full mac: c8:0a:a9:eb:14:ba 
Drives:    Local Storage: total: 1.38 TiB used: 853.2 GiB (60.6%) 
           ID-1: /dev/sda vendor: Samsung model: SSD 860 EVO 1TB size: 931.51 GiB 
           ID-2: /dev/sdb vendor: Samsung model: SSD 850 PRO 512GB size: 476.94 GiB 
Partition: ID-1: / size: 45.53 GiB used: 8.3 GiB (18.2%) fs: ext4 dev: /dev/sda1 
           ID-2: /home size: 869.04 GiB used: 755.96 GiB (87.0%) fs: ext4 dev: /dev/sda3 
Swap:      ID-1: swap-1 type: partition size: 976 MiB used: 0 KiB (0.0%) dev: /dev/sda5 
Sensors:   System Temperatures: cpu: 59.0 C mobo: N/A gpu: nouveau temp: 56.0 C 
           Fan Speeds (RPM): N/A 
Info:      Processes: 223 Uptime: 20m Memory: 7.76 GiB used: 1.9 GiB (24.5%) Init: systemd runlevel: 5 
           Compilers: gcc: N/A Packages: 2579 Shell: Bash v: 5.1.4 inxi: 3.3.01

答案1

初步尝试

我建议先这样做:我应该怎么做才能在启动时强制进行根文件系统检查(以及可选的修复)?,或者使用 GRUB 的几乎相同的解决方案:

将其添加到您的/etc/default/grubto 行GRUB_CMDLINE_LINUX_DEFAULT

fsck.mode=force fsck.repair=yes

然后运行update-grub​​,只是reboot为了ext4在启动时修复您的文件系统。

如果这没有帮助,请报告回来。


进一步阐述

由于您的 BIOS 是最新的,我们对 BIOS 无能为力虫子然而,在你的文章中提到dmesg,它们可能只是通知而不是严重的错误,所以我个人会同意这些。

然后是smartmontools如果我拼写正确的话,请运行它smartctl -a /dev/sda- 只需更换设备即可。我在实验室经常使用它,但现在我不在办公室,请注意正确拼写您的启动设备(SATA 磁盘、NVMe 驱动器、记忆棒等)。

相关内容