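I have implemented a simple two-node cluster using DRBD and a few network-based master/slave resources.
Using the ethmonitor RA, I set up failover for the case where the master (primary) node loses link on a specified physical Ethernet device, by restricting the Master role to nodes where the ethmon variable is "1".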
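For context, each ethmonitor clone in the CIB looks roughly like the fragment below. This is a sketch, not a copy of my actual resources section: the clone and primitive ids match the status output in this post, but the nvpair and op ids and the 10s monitor interval are assumptions. By default the agent publishes its result in a node attribute named `ethmon_result-<interface>`, which is the attribute the location rules test.

```xml
<clone id="monitor-eth1-clone">
  <primitive class="ocf" id="monitor-eth1" provider="heartbeat" type="ethmonitor">
    <instance_attributes id="monitor-eth1-instance_attributes">
      <!-- interface to watch; the agent sets ethmon_result-eth1 to 1 (link up) or 0 (link down) -->
      <nvpair id="monitor-eth1-instance_attributes-interface" name="interface" value="eth1"/>
    </instance_attributes>
    <operations>
      <!-- assumption: the interval here is illustrative, not taken from the real cluster -->
      <op id="monitor-eth1-monitor-interval-10s" interval="10s" name="monitor"/>
    </operations>
  </primitive>
</clone>
```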
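However, something is off with my colocation constraints. If I set the DRBDFS resource to monitor the link on eth1, unplugging eth1 on the primary node causes a failover as expected: all master resources are demoted to "slave" and promoted on the opposite node, and the "normal" DRBDFS resource moves to the other node as expected.
But if I put the same ethmonitor constraint on one of the network-based master/slave resources instead, only that specific resource fails over. DRBDFS stays where it is (although it is stopped), and so do the other master/slave resources.
This smells like a constraint issue to me. Does anyone know what I might be doing wrong?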
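For comparison, the DRBDFS variant that fails over correctly is not part of the CIB excerpt in this post. It would presumably mirror the location rules on the interface resources, along these lines (a sketch; the ids are hypothetical placeholders, not taken from the real CIB):

```xml
<rsc_location id="location-drbdfs" rsc="drbdfs">
  <!-- ban drbdfs from any node whose eth1 link is not reported as up -->
  <rule id="location-drbdfs-rule" score="-INFINITY">
    <expression attribute="ethmon_result-eth1" id="location-drbdfs-rule-expr" operation="ne" value="1"/>
  </rule>
</rsc_location>
```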
PCS before:
Cluster name: node1.hostname.com_node2.hostname.com
Stack: corosync
Current DC: node2.hostname.com_0 (version 1.1.16-12.el7_4.4-94ff4df) - partition with quorum
Last updated: Tue Mar 20 16:25:47 2018
Last change: Tue Mar 20 16:00:33 2018 by hacluster via crmd on node2.hostname.com_0
2 nodes configured
11 resources configured
Online: [ node1.hostname.com_0 node2.hostname.com_0 ]
Full list of resources:
 Master/Slave Set: drbd.master [drbd.slave]
     Masters: [ node1.hostname.com_0 ]
     Slaves: [ node2.hostname.com_0 ]
 drbdfs (ocf::heartbeat:Filesystem): Started node1.hostname.com_0
 Master/Slave Set: inside-interface-sameip.master [inside-interface-sameip.slave]
     Masters: [ node1.hostname.com_0 ]
     Slaves: [ node2.hostname.com_0 ]
 Master/Slave Set: outside-interface-sameip.master [outside-interface-sameip.slave]
     Masters: [ node1.hostname.com_0 ]
     Slaves: [ node2.hostname.com_0 ]
 Clone Set: monitor-eth1-clone [monitor-eth1]
     Started: [ node1.hostname.com_0 node2.hostname.com_0 ]
 Clone Set: monitor-eth2-clone [monitor-eth2]
     Started: [ node1.hostname.com_0 node2.hostname.com_0 ]
Daemon Status:
  corosync: active/enabled
  pacemaker: active/enabled
  pcsd: inactive/disabled
PCS after:
Cluster name: node1.hostname.com_node2.hostname.com
Stack: corosync
Current DC: node2.hostname.com_0 (version 1.1.16-12.el7_4.4-94ff4df) - partition with quorum
Last updated: Tue Mar 20 16:29:40 2018
Last change: Tue Mar 20 16:00:33 2018 by hacluster via crmd on node2.hostname.com_0
2 nodes configured
11 resources configured
Online: [ node1.hostname.com_0 node2.hostname.com_0 ]
Full list of resources:
 Master/Slave Set: drbd.master [drbd.slave]
     Masters: [ node1.hostname.com_0 ]
     Slaves: [ node2.hostname.com_0 ]
 drbdfs (ocf::heartbeat:Filesystem): Stopped
 Master/Slave Set: inside-interface-sameip.master [inside-interface-sameip.slave]
     Masters: [ node2.hostname.com_0 ]
     Stopped: [ node1.hostname.com_0 ]
 Master/Slave Set: outside-interface-sameip.master [outside-interface-sameip.slave]
     Masters: [ node1.hostname.com_0 ]
     Slaves: [ node2.hostname.com_0 ]
 Clone Set: monitor-eth1-clone [monitor-eth1]
     Started: [ node1.hostname.com_0 node2.hostname.com_0 ]
 Clone Set: monitor-eth2-clone [monitor-eth2]
     Started: [ node1.hostname.com_0 node2.hostname.com_0 ]
Daemon Status:
  corosync: active/enabled
  pacemaker: active/enabled
  pcsd: inactive/disabled
Here is the "constraints" section of my CIB:
<constraints>
  <rsc_colocation id="pcs_rsc_colocation_set_drbdfs_set_drbd.master_inside-interface-sameip.master_outside-interface-sameip.master" score="INFINITY">
    <resource_set id="pcs_rsc_set_drbdfs" sequential="false">
      <resource_ref id="drbdfs"/>
    </resource_set>
    <resource_set id="pcs_rsc_set_drbd.master_inside-interface-sameip.master_outside-interface-sameip.master" role="Master" sequential="false">
      <resource_ref id="drbd.master"/>
      <resource_ref id="inside-interface-sameip.master"/>
      <resource_ref id="outside-interface-sameip.master"/>
    </resource_set>
  </rsc_colocation>
  <rsc_order id="pcs_rsc_order_set_drbd.master_inside-interface-sameip.master_outside-interface-sameip.master_set_drbdfs" kind="Serialize" symmetrical="false">
    <resource_set action="promote" id="pcs_rsc_set_drbd.master_inside-interface-sameip.master_outside-interface-sameip.master-1" role="Master">
      <resource_ref id="drbd.master"/>
      <resource_ref id="inside-interface-sameip.master"/>
      <resource_ref id="outside-interface-sameip.master"/>
    </resource_set>
    <resource_set id="pcs_rsc_set_drbdfs-1">
      <resource_ref id="drbdfs"/>
    </resource_set>
  </rsc_order>
  <rsc_location id="location-inside-interface-sameip.master" rsc="inside-interface-sameip.master">
    <rule id="location-inside-interface-sameip.master-rule" score="-INFINITY">
      <expression attribute="ethmon_result-eth1" id="location-inside-interface-sameip.master-rule-expr" operation="ne" value="1"/>
    </rule>
  </rsc_location>
  <rsc_location id="location-outside-interface-sameip.master" rsc="outside-interface-sameip.master">
    <rule id="location-outside-interface-sameip.master-rule" score="-INFINITY">
      <expression attribute="ethmon_result-eth2" id="location-outside-interface-sameip.master-rule-expr" operation="ne" value="1"/>
    </rule>
  </rsc_location>
</constraints>