昨晚我们的主要网络服务器瘫痪了,3 分钟内就syslog
崩溃了。我真的很难判断这是否syslog
表明我的症状是潜在原因,或者是否有关于直接的原因。有人能理解这一点吗?
首先是致命错误消息:
Jun 15 23:27:04 ywpadmin systemd-udevd[513]: Process '/bin/sh -c 'echo 180 >/sys$DEVPATH/device/timeout'' failed with exit code 2.
Jun 15 23:27:04 ywpadmin systemd-udevd[513]: Process '/bin/sh -c 'echo 180 >/sys$DEVPATH/device/timeout'' failed with exit code 2.
Jun 15 23:27:04 ywpadmin systemd-udevd[505]: Process '/bin/sh -c 'echo 180 >/sys$DEVPATH/device/timeout'' failed with exit code 2.
Jun 15 23:27:04 ywpadmin systemd-udevd[514]: Process '/bin/sh -c 'echo 180 >/sys$DEVPATH/device/timeout'' failed with exit code 2.
Jun 15 23:27:04 ywpadmin systemd-udevd[508]: Process '/bin/sh -c 'echo 180 >/sys$DEVPATH/device/timeout'' failed with exit code 2.
Jun 15 23:27:04 ywpadmin rpcbind[856]: rpcbind: xdr_/run/rpcbind/rpcbind.xdr: failed
Jun 15 23:27:04 ywpadmin rpcbind[856]: rpcbind: xdr_/run/rpcbind/portmap.xdr: failed
Jun 15 23:27:04 ywpadmin kernel: [ 0.436467] pci 0000:00:17.4: BAR 13: no space for [io size 0x1000]
Jun 15 23:27:04 ywpadmin kernel: [ 0.431044] pci 0000:00:15.3: BAR 13: failed to assign [io size 0x1000]
-------------^^^ THIS ONE HAPPENED ABOUT 20 TIME
Jun 15 23:27:04 ywpadmin kernel: [ 2.499847] blk_update_request: I/O error, dev fd0, sector 0
Jun 15 23:27:04 ywpadmin kernel: [ 2.500273] floppy: error -5 while reading block 0
Jun 15 23:27:04 ywpadmin kernel: [ 5.933471] EXT4-fs (sda3): re-mounted. Opts: errors=remount-ro
Jun 15 23:27:05 ywpadmin sm-mta[1254]: gethostbyaddr(10.2.x.x) failed: 1
我附上了这几分钟的完整日志。它很详尽,但这就是我把致命错误放在第一位的原因。这是日志中“异常”的开始,直到我强制重新启动服务器:
我对此感到十分困惑。它只是一个基本的网络服务器...值得一提的是,它意外地/var/log/apache2/error.log
被“跟踪”了tail -f error.log
很长一段时间。有什么想法吗?