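Sometimes the server appears to freeze. However, we found that FTP on port 21 is still reachable, while ports 80, 8080, 10000, our SSH port (not 22) and 443 are not. Logging stops as well: every log file has a gap for that period. The only way to get everything working again is a hardware reset, which reboots the machine.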
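For anyone who wants to repeat the reachability check during a freeze, here is a minimal sketch. It assumes a plain TCP connect is a good enough test for "available"; `probe_ports` is a hypothetical helper name, not part of any tool we run.

```python
import socket


def probe_ports(host, ports, timeout=3.0):
    """Return {port: bool}; True means a TCP connection could be opened."""
    results = {}
    for port in ports:
        try:
            # create_connection raises OSError (refused, timed out, DNS failure)
            with socket.create_connection((host, port), timeout=timeout):
                results[port] = True
        except OSError:
            results[port] = False
    return results
```

Calling `probe_ports("server4.example.com", [21, 80, 443, 8080, 10000])` from another machine (the host name is a placeholder) would show which services still accept connections. A successful connect only proves the kernel still completes the TCP handshake, not that the service behind the port is responding.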
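mcelog indicates that the cooling is not working well.
Authlog and syslog show the usual suspects, but nothing that would cause this behavior.
Has anyone seen behavior like this before?
Because this is so strange, I really don't know what to show other than the start of the gap in syslog: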
```
May 6 05:48:35 server4 kernel: [74230.825498] [UFW BLOCK] IN=enp0s31f6 OUT= MAC=90:1b:0e:93:23:8f:00:31:46:0d:3b:85:08:00 SRC=216.249.101.105 DST=88.99.44.77 LEN=457 TOS=0x00 PREC=0x00 TTL=54 ID=29755 DF PROTO=UDP SPT=5060 DPT=5060 LEN=437
May 6 05:48:35 server4 postfix/smtpd[31939]: connect from unknown[88.148.12.67]
May 6 05:48:35 server4 kernel: [74230.939036] [UFW BLOCK] IN=enp0s31f6 OUT= MAC=90:1b:0e:93:23:8f:00:31:46:0d:3b:85:08:00 SRC=178.128.220.214 DST=88.99.44.67 LEN=40 TOS=0x00 PREC=0x00 TTL=242 ID=42751 PROTO=TCP SPT=48441 DPT=25579 WINDOW=1024 RES=0x00 SYN URGP=0
May 6 05:48:35 server4 kernel: [74230.947378] [UFW BLOCK] IN=enp0s31f6 OUT= MAC=90:1b:0e:93:23:8f:00:31:46:0d:3b:85:08:00 SRC=80.82.64.146 DST=88.99.44.65 LEN=40 TOS=0x00 PREC=0x00 TTL=250 ID=47541 PROTO=TCP SPT=44343 DPT=3638 WINDOW=1024 RES=0x00 SYN URGP=0
May 6 07:31:42 server4 systemd[1]: Started Create Static Device Nodes in /dev.
May 6 07:31:42 server4 systemd[1]: Starting udev Kernel Device Manager...
May 6 07:31:42 server4 systemd[1]: Started udev Kernel Device Manager.
```
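To locate gaps like the one between 05:48:35 and 07:31:42 in other log files, something along these lines could help. It is only a sketch: `find_gaps` is a hypothetical helper, it assumes classic syslog timestamps (`May 6 05:48:35`, English month names, no year), and the year has to be supplied by the caller.

```python
from datetime import datetime, timedelta


def find_gaps(lines, year, threshold=timedelta(minutes=5)):
    """Yield (previous, current) timestamps that are more than `threshold` apart."""
    previous = None
    for line in lines:
        parts = line.split(None, 3)  # month, day, time, rest of the line
        if len(parts) < 3:
            continue
        try:
            current = datetime.strptime(
                f"{year} {parts[0]} {parts[1]} {parts[2]}", "%Y %b %d %H:%M:%S"
            )
        except ValueError:
            continue  # not a classic syslog line, skip it
        if previous is not None and current - previous > threshold:
            yield previous, current
        previous = current
```

Feeding it the lines of `/var/log/syslog` with `year=2022` would report each pair of neighbouring entries that are more than five minutes apart. The threshold is an arbitrary choice and may need tuning on a quiet machine.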
Output from mcelog:
```
Hardware event. This is not a software error.
MCE 2
CPU 4 THERMAL EVENT TSC 3e4ca482cdde
TIME 1651835101 Fri May 6 13:05:01 2022
Processor 4 below trip temperature. Throttling disabled
Running trigger `unknown-error-trigger'
mcelog: Too many trigger children running already
STATUS 88020a82 MCGSTATUS 0
MCGCAP c0a APICID 1 SOCKETID 0
CPUID Vendor Intel Family 6 Model 94
mcelog: warning: 16 bytes ignored in each record
mcelog: consider an update
Hardware event. This is not a software error.
MCE 0
CPU 4 THERMAL EVENT TSC 3e4ca4583284
TIME 1651835101 Fri May 6 13:05:01 2022
Processor 4 heated above trip temperature. Throttling enabled.
Please check your system cooling. Performance will be impacted
Running trigger `unknown-error-trigger'
mcelog: Too many trigger children running already
```