消息

消息

我们注意到服务器崩溃并出现以下错误。不确定它与任何有缺陷的硬件有关或完全与

服务器详细信息:Red Hat Enterprise Linux ES 版本 4(Nahant 更新 6)[root@athena log]# uname -a Linux athena.nsdecatur.local 2.6.9-67.0.7.ELsmp #1 SMP 2 月 27 日星期三 04:47: 23 EST 2008 x86_64 x86_64 x86_64 GNU/Linux

消息

Sep 17 15:08:16 athena kernel: EDAC k8 MC0: general bus error: participating processor(local node response), time-out(no timeout) memory transaction type(generic read), mem or i/o(mem access), cache level(generic)
Sep 17 15:08:16 athena kernel: MC0: CE page 0x2c2766, offset 0xb10, grain 8, syndrome 0xac08, row 1, channel 0, label "": k8_edac
Sep 17 15:08:16 athena kernel: MC0: CE - no information available: k8_edac Error Overflow set
Sep 17 15:08:16 athena kernel: EDAC k8 MC0: extended error code: ECC chipkill x4 error
Sep 17 15:08:17 athena su(pam_unix)[19579]: session opened for user oracle by (uid=0)
Sep 17 15:08:17 athena su(pam_unix)[19579]: session closed for user oracle
Sep 17 15:08:17 athena su(pam_unix)[19634]: session opened for user oracle by (uid=0)
Sep 17 15:08:17 athena su(pam_unix)[19634]: session closed for user oracle
Sep 17 15:08:18 athena kernel: EDAC k8 MC0: general bus error: participating processor(local node origin), time-out(no timeout) memory transaction type(generic read), mem or i/o(mem access), cache level(generic)
Sep 17 15:08:18 athena kernel: MC0: CE page 0x39c857, offset 0xd50, grain 8, syndrome 0x1cc8, row 1, channel 0, label "": k8_edac
Sep 17 15:08:18 athena kernel: MC0: CE - no information available: k8_edac Error Overflow set
Sep 17 15:08:18 athena kernel: EDAC k8 MC0: extended error code: ECC chipkill x4 error
Sep 17 15:08:18 athena su(pam_unix)[19715]: session opened for user oracle by (uid=0)
Sep 17 15:08:18 athena su(pam_unix)[19715]: session closed for user oracle
Sep 17 15:08:18 athena su(pam_unix)[19758]: session opened for user oracle by (uid=0)
Sep 17 15:08:19 athena su(pam_unix)[19758]: session closed for user oracle
Sep 17 15:08:20 athena su(pam_unix)[19807]: session opened for user oracle by (uid=0)
Sep 17 15:08:20 athena su(pam_unix)[19807]: session closed for user oracle
Sep 17 15:08:20 athena su(pam_unix)[19850]: session opened for user oracle by (uid=0)
Sep 17 15:08:20 athena su(pam_unix)[19850]: session closed for user oracle
Sep 17 15:08:20 athena kernel: EDAC k8 MC0: general bus error: participating processor(local node origin), time-out(no timeout) memory transaction type(generic read), mem or i/o(mem access), cache level(generic)
Sep 17 15:08:20 athena kernel: MC0: CE page 0x39c857, offset 0xd50, grain 8, syndrome 0x1cc8, row 1, channel 0, label "": k8_edac
Sep 17 15:08:20 athena kernel: EDAC k8 MC0: extended error code: ECC chipkill x4 error
Sep 17 15:08:21 athena su(pam_unix)[19899]: session opened for user oracle by (uid=0)
Sep 17 15:08:21 athena kernel: EDAC k8 MC0: general bus error: participating processor(local node origin), time-out(no timeout) memory transaction type(generic read), mem or i/o(mem access), cache level(generic)
Sep 17 15:23:54 athena syslogd 1.4.1: restart.
Sep 17 15:23:54 athena syslog: syslogd startup succeeded
Sep 17 15:23:54 athena kernel: klogd 1.4.1, log source = /proc/kmsg started.

答案1

这些错误意味着您的 RAM 检测到了 ECC 事件。您的 RAM 有错误。通常,您会继续监视更多的错误,这通常表明您的 RAM 出现故障/有故障,或者 RAM 的控制器出现故障。偶尔出现一两个并不罕见。

无论哪种情况,都是硬件故障。

监控

如果您有兴趣监视这些故障并设置阈值,您可能需要查看该mcelog软件包。触发器的设置及其作用包含在标题为以下的 U&L 问题中:为 mcelog 编写触发器

相关内容