Manual fencing doest work

"Thai Duong" <thaidn@xxxxxxxxx> · Mon, 3 Apr 2006 17:30:16 +0700

Hi all,

I have a 2 node GFS 6.1 cluster with the following configuration:

<?xml version="1.0"?>

<cluster name="fccrac" config_version="5">

    <cman two_node="1" expected_votes="1">

    </cman>

    <clusternodes>

      <clusternode name="fcc1" votes="1">

       <fence>

        <method name="single">

         <device name="human" nodename="fcc1"/>

        </method>

       </fence>

      </clusternode>

      <clusternode name="fcc4" votes="1">

       <fence>

        <method name="single">

         <device name="human" nodename="fcc4"/>

        </method>

       </fence>

      </clusternode>

   </clusternodes>

  <fence_devices>

   <fence_device name="human" agent="fence_manual"/>

  </fence_devices>

 </cluster>

It turns out that manual fencing doest work as expected. When I force
power down a node, the other could not fence it and worse, the whole
GFS file system is freeze waiting for the downed node to be up again. I
got something like below in kernel log

Apr  2 16:46:28 fcc1 fenced[3444]: fencing node "fcc4"

Apr  2 16:46:28 fcc1 fenced[3444]: fence "fcc4" failed

Some information about GFS and kernel:

[root@fcc1 ~]# rpm -qa | grep GFS

GFS-6.1.3-0

GFS-kernel-2.6.9-45.0.2

[root@fcc1 ~]# uname -a

Linux fcc1 2.6.9-22.0.2.EL #1 SMP Thu Jan 5 17:04:58 EST 2006 ia64 ia64 ia64 GNU/Linux

Please help.

TIA,

Thai Duong.

--

Linux-cluster@xxxxxxxxxx
https://www.redhat.com/mailman/listinfo/linux-cluster