我一直在处理的集群突然开始出现故障...看起来我遇到了 exportfs 资源的问题。
有什么方法可以解决这个问题吗?我找不到“-2”返回代码
============
Last updated: Mon Jan 7 09:18:18 2013
Last change: Fri Jan 4 16:02:13 2013 via crmd on emserver1
Stack: openais
Current DC: emserver1 - partition with quorum
Version: 1.1.6-9971ebba4494012a93c03b40a2c58ec0eb60f50c
2 Nodes configured, 2 expected votes
9 Resources configured.
============
Online: [ emserver1 emserver2 ]
Master/Slave Set: ms_drbd_nfs [p_drbd_nfs]
Masters: [ emserver1 ]
Slaves: [ emserver2 ]
Clone Set: cl_lsb_nfsserver [p_lsb_nfsserver]
Started: [ emserver1 emserver2 ]
Resource Group: g_nfs
p_fs_nfs (ocf::heartbeat:Filesystem): Started emserver1
p_exportfs_nfs (ocf::heartbeat:exportfs): Started emserver1 (unmanaged) FAILED
p_ip_nfs (ocf::heartbeat:IPaddr2): Stopped
Clone Set: cl_exportfs_root [p_exportfs_root]
Started: [ emserver1 ]
Stopped: [ p_exportfs_root:1 ]
Failed actions:
p_drbd_nfs:1_promote_0 (node=emserver2, call=22, rc=-2, status=Timed Out): unknown exec error
p_exportfs_root:1_start_0 (node=emserver2, call=10, rc=-2, status=Timed Out): unknown exec error
p_exportfs_nfs_stop_0 (node=emserver1, call=32, rc=-2, status=Timed Out): unknown exec error
p_drbd_nfs:0_demote_0 (node=emserver1, call=19, rc=1, status=complete): unknown error
答案1
ubuntu 服务器软件包的资源代理已过时。exportfs 资源代理中有一个错误,导致 nfs rmtab 增长到非常大(这就是发生超时的原因)。
我从 github 升级了资源代理并删除了 2GB rmtab。之后一切都很好。