I'm getting errors accessing part of a RAID array and need help sorting out the problem.
History: several RAID partitions live on 4 disks. Four days ago the workstation started making some ticking noises and the Ubuntu GUI Disk Utility reported a few bad sectors, but everything else looked fine. Yesterday (Thursday, April 18) we had a power failure and a hard reboot. After the hard reboot the system came up and mounted most of the RAID partitions, but one large, critical partition (containing /home) now gives an input/output error.
bpbrown@eguzki:/$ ls home
ls: cannot access home: Input/output error
bpbrown@eguzki:/$
We're on Ubuntu 12.04, and with /home gone we're limited to the command line.
After the reboot, mdadm showed the array resyncing; that appears to have finished, but /home is still inaccessible. Here is the output from mdadm:
bpbrown@eguzki:/$ sudo mdadm -D /dev/md10
/dev/md10:
Version : 0.90
Creation Time : Thu Feb 4 16:49:43 2010
Raid Level : raid5
Array Size : 2868879360 (2735.98 GiB 2937.73 GB)
Used Dev Size : 956293120 (911.99 GiB 979.24 GB)
Raid Devices : 4
Total Devices : 4
Preferred Minor : 10
Persistence : Superblock is persistent
Update Time : Fri Apr 19 10:03:46 2013
State : clean
Active Devices : 4
Working Devices : 4
Failed Devices : 0
Spare Devices : 0
Layout : left-symmetric
Chunk Size : 64K
UUID : 317df11d:4e2edc70:fa3efedc:498284d3
Events : 0.2121101
Number Major Minor RaidDevice State
0 8 10 0 active sync /dev/sda10
1 8 26 1 active sync /dev/sdb10
2 8 42 2 active sync /dev/sdc10
3 8 58 3 active sync /dev/sdd10
bpbrown@eguzki:/$
And here is mdstat:
bpbrown@eguzki:/$ cat /proc/mdstat
Personalities : [raid1] [raid6] [raid5] [raid4] [linear] [multipath] [raid0] [raid10]
md1 : active raid1 sda1[0] sdb1[1] sdc1[2] sdd1[3]
497856 blocks [4/4] [UUUU]
md8 : active raid5 sda8[0] sdb8[1] sdc8[2] sdd8[3]
5301120 blocks level 5, 64k chunk, algorithm 2 [4/4] [UUUU]
md6 : active raid5 sda6[0] sdb6[1] sdc6[2] sdd6[3]
20530752 blocks level 5, 64k chunk, algorithm 2 [4/4] [UUUU]
md7 : active raid5 sda7[0] sdc7[2] sdd7[3] sdb7[1]
5301120 blocks level 5, 64k chunk, algorithm 2 [4/4] [UUUU]
md5 : active raid5 sda5[0] sdd5[3] sdc5[2] sdb5[1]
5301120 blocks level 5, 64k chunk, algorithm 2 [4/4] [UUUU]
md10 : active raid5 sda10[0] sdc10[2] sdd10[3] sdb10[1]
2868879360 blocks level 5, 64k chunk, algorithm 2 [4/4] [UUUU]
unused devices: <none>
bpbrown@eguzki:/$
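Given the ticking noises and bad-sector reports from before the power failure, it may also be worth checking the SMART health of each member disk before doing anything destructive. A minimal sketch, assuming smartmontools is installed (the attribute names grepped for are the usual suspects, not output I captured):

for d in /dev/sda /dev/sdb /dev/sdc /dev/sdd; do
    echo "== $d =="
    sudo smartctl -H "$d"                                              # overall health verdict
    sudo smartctl -A "$d" | grep -iE 'reallocated|pending|uncorrect'   # key failure-related attributes
done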
Unmounting and remounting /dev/md10 doesn't seem to help, though I may be missing a step needed to mount the RAID array properly.
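In case it matters, the remount attempts were roughly along these lines (a sketch, using the device and mount point from the fstab below; I may well be missing an option):

sudo umount /home               # or: sudo umount /dev/md10
sudo mount /dev/md10 /home      # mount the array directly on /home
sudo mount -a                   # or let /etc/fstab drive the mount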
In case it helps, here are the contents of /etc/fstab:
bpbrown@eguzki:/$ more /etc/fstab
# /etc/fstab: static file system information.
#
# <file system> <mount point> <type> <options> <dump> <pass>
proc /proc proc defaults 0 0
/dev/md5 / reiserfs relatime 0 1
/dev/md1 /boot reiserfs notail,relatime 0 2
/dev/md10 /home xfs relatime 0 2
/dev/md8 /tmp reiserfs relatime 0 2
/dev/md6 /usr reiserfs relatime 0 2
/dev/md7 /var reiserfs relatime 0 2
/dev/sda9 none swap pri=1 0 0
/dev/sdb9 none swap pri=1 0 0
/dev/sdc9 none swap pri=1 0 0
/dev/sdd9 none swap pri=1 0 0
/dev/scd0 /media/cdrom0 udf,iso9660 user,noauto,exec,utf8 0 0
bpbrown@eguzki:/$
Update, April 23: tried mounting the filesystem directly again and got an error message that may be useful. Here is a shortened version, with some of the call trace omitted:
bpbrown@eguzki:/$ dmesg | tail
[ 788.335968] XFS (md10): Mounting Filesystem
[ 788.516845] XFS (md10): Starting recovery (logdev: internal)
[ 790.082900] XFS: Internal error XFS_WANT_CORRUPTED_GOTO at line 1503 of file /build/buildd/linux-3.2.0/fs/xfs/xfs_alloc.c. Caller 0xffffffffa0226837
[ 790.082905]
[ 790.083004] Pid: 3211, comm: mount Tainted: P O 3.2.0-38-generic #61-Ubuntu
[ 790.083010] Call Trace:
<omitted for brevity>
[ 790.084139] XFS (md10): xfs_do_force_shutdown(0x8) called from line 3729 of file /build/buildd/linux-3.2.0/fs/xfs/xfs_bmap.c. Return address = 0xffffffffa0236e52
[ 790.217602] XFS (md10): Corruption of in-memory data detected. Shutting down filesystem
[ 790.217654] XFS (md10): Please umount the filesystem and rectify the problem(s)
[ 790.217761] XFS (md10): xfs_imap_to_bp: xfs_trans_read_buf() returned error 5.
[ 790.217775] XFS (md10): xlog_recover_clear_agi_bucket: failed to clear agi 5. Continuing.
<last 2 lines repeat 8 times>
[ 790.388209] XFS (md10): Ending recovery (logdev: internal)
bpbrown@eguzki:/$
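Since the kernel log asks for the filesystem to be unmounted and the problem rectified, a read-only check of the XFS filesystem looks like the obvious next step. A sketch, assuming /dev/md10 is not mounted:

sudo xfs_check /dev/md10        # read-only consistency check from xfsprogs
sudo xfs_repair -n /dev/md10    # no-modify mode: report problems without writing anything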
Thanks in advance for any suggestions on how to proceed,
--Ben
Answer 1
It turns out the root problem here really was corruption of the XFS filesystem during the unclean power loss. Worse, the XFS filesystem had a dirty (unreplayed) log, which produced the following warning:
bpbrown@eguzki:/$ sudo xfs_check /dev/md10
ERROR: The filesystem has valuable metadata changes in a log which needs to
be replayed. Mount the filesystem to replay the log, and unmount it before
re-running xfs_repair. If you are unable to mount the filesystem, then use
the -L option to destroy the log and attempt a repair.
Note that destroying the log may cause corruption -- please attempt a mount
of the filesystem before doing this.
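In other words, the recommended order is: mount the filesystem so the log gets replayed, unmount it, and run a normal xfs_repair; only if the mount itself fails should the log be destroyed with -L. Roughly (a sketch using the device and mount point from the question):

sudo mount /dev/md10 /home && sudo umount /home    # preferred: replay the log via a successful mount
sudo xfs_repair /dev/md10                          # then repair offline
sudo xfs_repair -L /dev/md10                       # last resort: zero the log and repair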
Mounting still failed, so we went ahead with xfs_repair -L. It worked quickly (under 5 minutes), and despite the scary warnings the /home partition could be mounted and read immediately afterwards.
bpbrown@eguzki:/$ sudo xfs_repair -L /dev/md10
Phase 1 - find and verify superblock...
Phase 2 - using internal log
- scan filesystem freespace and inode maps...
agi unlinked bucket 34 is 50978 in ag 1 (inode=536921890)
<...>
Phase 7 - verify and correct link counts...
resetting inode 97329 nlinks from 2 to 3
resetting inode 536921890 nlinks from 0 to 2
done
bpbrown@eguzki:/$
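Remounting and spot-checking the partition afterwards was uneventful; something like the following was enough to confirm it (a sketch):

sudo mount /dev/md10 /home      # or: sudo mount /home, via the fstab entry
df -h /home                     # filesystem mounts with the expected size
ls /home                        # and the directory listing works again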
As far as we can tell, the system is running normally and hasn't suffered any critical data loss.
Cray turned out to have some useful documentation on xfs_check and xfs_repair for newcomers like me, so I'm including the link in case anyone else hits these problems for the first time:
http://docs.cray.com/books/S-2377-22/html-S-2377-22/z1029470303.html
Cheers, and thanks to everyone who read this and offered ideas,
--Ben