Hello, I have a four node cluster using fence-scsi and when I
purposely fail any node but the primary (node 1) it seems to always “avert”
fencing. If I fail the primary node, the cluster correctly fences the
node. From node1’s ‘group_tool dump fence’
when I fail node4: 1255903014 start default 103 members 2 3 1
The node doesn’t get fenced and it retains it’s
scsi registrations. From the source: http://git.fedorahosted.org/git/fence.git?p=fence.git;a=blob;f=fence/fenced/recover.c It looks like the conditions for failing are: I don’t think either “is_clean_daemon_member”
or “is_fenced”external” should be true. Fenced
isn’t started as a clean daemon and is_fenced_external (I believe) means
that the node is fenced externally by another fenced daemon which shouldn’t
be true either. Any ideas what could be going on here? Help or
suggestions would be appreciated! Sincerely, David Challoner |
-- Linux-cluster mailing list Linux-cluster@xxxxxxxxxx https://www.redhat.com/mailman/listinfo/linux-cluster