我在集群成员上启动 corosync 时遇到错误:
May 16 00:53:32 neftis corosync[19741]: [MAIN ] Corosync Cluster Engine ('2.3.4'): started and ready to provide service.
May 16 00:53:32 neftis corosync[19741]: [MAIN ] Corosync built-in features: dbus systemd xmlconf snmp pie relro bindnow
May 16 00:53:32 neftis corosync[19741]: [MAIN ] parse error in config: No interfaces defined
May 16 00:53:32 neftis corosync[19741]: [MAIN ] Corosync Cluster Engine exiting with status 8 at main.c:1278.
May 16 00:53:32 neftis corosync: Starting Corosync Cluster Engine (corosync): [FALL�]
May 16 00:53:32 neftis systemd: corosync.service: control process exited, code=exited status=1
May 16 00:53:32 neftis systemd: Failed to start Corosync Cluster Engine.
May 16 00:53:32 neftis systemd: Unit corosync.service entered failed state.
May 16 00:53:32 neftis systemd: corosync.service failed.
May 16 00:54:06 neftis systemd: Cannot add dependency job for unit firewalld.service, ignoring: Unit firewalld.service is masked.
May 16 00:54:06 neftis systemd: Starting Corosync Cluster Engine...
May 16 00:54:06 neftis corosync[19773]: [MAIN ] Corosync Cluster Engine ('2.3.4'): started and ready to provide service.
May 16 00:54:06 neftis corosync[19773]: [MAIN ] Corosync built-in features: dbus systemd xmlconf snmp pie relro bindnow
May 16 00:54:06 neftis corosync[19773]: [MAIN ] parse error in config: No interfaces defined
May 16 00:54:06 neftis corosync[19773]: [MAIN ] Corosync Cluster Engine exiting with status 8 at main.c:1278.
May 16 00:54:06 neftis corosync: Starting Corosync Cluster Engine (corosync): [FALL�]
May 16 00:54:06 neftis systemd: corosync.service: control process exited, code=exited status=1
May 16 00:54:06 neftis systemd: Failed to start Corosync Cluster Engine.
May 16 00:54:06 neftis systemd: Unit corosync.service entered failed state.
这是我在三个节点上的配置,但仅在我最近添加的 netfis 中失败。
totem {
version: 2
secauth: off
cluster_name: cluster-osiris
transport: udpu
}
nodelist {
node {
ring0_addr: isis.localdoamin
nodeid: 1
}
node {
ring0_addr: horus.localdoamin
nodeid: 2
}
node {
ring0_addr: netfis.localdoamin
nodeid: 3
}
}
quorum {
provider: corosync_votequorum
}
logging {
to_syslog: yes
}
我正在 CentOS 7.1 64 位上运行pacemaker、corosync、pcs 集群。
我在互联网上搜索,但不清楚发生了什么。
你可以帮帮我吗?
答案1
您在 Corosync 配置中列出的主机名是否可以正确解析?我首先要验证这一点。
# host isis.localdoamin
由于“domain”似乎拼写错误(或使用我不知道的语言),我猜该命令会失败? ;-)
此外,您还可以使用短主机名(不带“.localdomain”)或您希望 Corosync 绑定到的接口的 IP 地址。
答案2
出现此错误消息是因为 proxmox 在我的容器主机名的环回 ip 上添加了一条记录,如下/etc/hosts
所示:
# --- BEGIN PVE ---
127.0.1.1 pgmaster
# --- END PVE ---
我还有一张hosts
关于这个名字的记录。注释掉环回记录,现在就可以了。