[Linux-cluster] Cluster node hung ( SM: Assertion failed )

Hi!
I have a two-node cluster (CL10 and CL20).
Generally it works well,
but sometimes after rebooting one of the nodes
I get an unstable situation.
For example, after an unclean shutdown of node CL10,
the remaining node CL20 regained quorum and stayed operational until
the other node, CL10, came back up and started joining the cluster.
At that moment the CL10 console showed this message:

kernel: CMAN: no HELLO from cl20, removing from the cluster

On the CL20 side, the console showed:

SM: Assertion Failed on line 52 of file
/usr/local/src/cluster/cman-kernel/src/sm_misc.c

SM: assertion: "!error"

SM: time 1729980

Kernel panic: SM:
    Records message above and reboot.
...
and CL20 hung after that.

My config is:
<?xml version="1.0"?>
<cluster name="alpha" config_version="1">

  <cman two_node="1" expected_votes="1">
  </cman>

  <nodes>
    <node name="cl10" votes="1">
      <altname name="cl010"/>
      <fence>
        <method name="single">
          <device name="human" ipaddr="cl10"/>
        </method>
      </fence>
    </node>
    <node name="cl20" votes="1">
      <altname name="cl020"/>
      <fence>
        <method name="single">
          <device name="human" ipaddr="cl20"/>
        </method>
      </fence>
    </node>
  </nodes>

  <fence_devices>
    <device name="human" agent="fence_manual"/>
  </fence_devices>

</cluster>
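
A configuration like the one above can be sanity-checked before chasing the panic itself. The following is only a rough sketch in Python, not part of the cluster tools: `check_two_node` is a hypothetical helper, the element names (`nodes`, `node`, `fence_devices`) are taken from the posted config and may differ in other versions of the tools, and treating an omitted `votes` attribute as 1 is an assumption. It checks the documented two-node rule (exactly two nodes with one vote each and `expected_votes="1"`) and that every fence device a node refers to is defined.

```python
import xml.etree.ElementTree as ET

# Trimmed copy of the configuration posted above (altname entries omitted).
CONF = """<?xml version="1.0"?>
<cluster name="alpha" config_version="1">
  <cman two_node="1" expected_votes="1"/>
  <nodes>
    <node name="cl10" votes="1">
      <fence><method name="single"><device name="human" ipaddr="cl10"/></method></fence>
    </node>
    <node name="cl20" votes="1">
      <fence><method name="single"><device name="human" ipaddr="cl20"/></method></fence>
    </node>
  </nodes>
  <fence_devices>
    <device name="human" agent="fence_manual"/>
  </fence_devices>
</cluster>
"""


def check_two_node(conf_text):
    """Hypothetical helper: list problems found in a two_node cluster config."""
    root = ET.fromstring(conf_text)
    problems = []

    cman = root.find("cman")
    if cman is None or cman.get("two_node") != "1":
        return ["two_node mode is not enabled"]
    if cman.get("expected_votes") != "1":
        problems.append('two_node="1" requires expected_votes="1"')

    nodes = root.findall("./nodes/node")
    if len(nodes) != 2:
        problems.append("expected exactly 2 nodes, found %d" % len(nodes))
    for node in nodes:
        # Assumption: an omitted votes attribute counts as 1.
        if node.get("votes", "1") != "1":
            problems.append("node %s must have votes=1" % node.get("name"))

    # Every fence device a node refers to must be defined under <fence_devices>.
    defined = {d.get("name") for d in root.findall("./fence_devices/device")}
    for node in nodes:
        for dev in node.findall("./fence/method/device"):
            if dev.get("name") not in defined:
                problems.append("node %s uses undefined fence device %s"
                                % (node.get("name"), dev.get("name")))
    return problems


if __name__ == "__main__":
    print(check_two_node(CONF) or "no problems found")
```

A clean result here only rules out these particular configuration mistakes; it says nothing about the cause of the SM assertion.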

__
Thanks for any information.

