I have a 2 node GFS 6.1 cluster with the following configuration:
<?xml version="1.0"?>
<cluster name="fccrac" config_version="5">
<cman two_node="1" expected_votes="1">
</cman>
<clusternodes>
<clusternode name="fcc1" votes="1">
<fence>
<method name="single">
<device name="human" nodename="fcc1"/>
</method>
</fence>
</clusternode>
<clusternode name="fcc4" votes="1">
<fence>
<method name="single">
<device name="human" nodename="fcc4"/>
</method>
</fence>
</clusternode>
</clusternodes>
<fence_devices>
<fence_device name="human" agent="fence_manual"/>
</fence_devices>
</cluster>
It turns out that manual fencing doest work as expected. When I force power down a node, the other could not fence it and worse, the whole GFS file system is freeze waiting for the downed node to be up again. I got something like below in kernel log
Apr 2 16:46:28 fcc1 fenced[3444]: fencing node "fcc4"
Apr 2 16:46:28 fcc1 fenced[3444]: fence "fcc4" failed
Some information about GFS and kernel:
[root@fcc1 ~]# rpm -qa | grep GFS
GFS-6.1.3-0
GFS-kernel-2.6.9-45.0.2
[root@fcc1 ~]# uname -a
Linux fcc1 2.6.9-22.0.2.EL #1 SMP Thu Jan 5 17:04:58 EST 2006 ia64 ia64 ia64 GNU/Linux
Please help.
TIA,
Thai Duong.
-- Linux-cluster@xxxxxxxxxx https://www.redhat.com/mailman/listinfo/linux-cluster