On Wed, Jun 03, 2009 at 04:22:30PM +0200, Marco Nietz wrote: > Hi, > > i have a Problem with (propably) the Communication between fenced and > ccsd. After a node-failure, fenced should connect ccsd and then try to > fence the failing node. this does not happen on one of our systems. > > Here's an strace from the fence-daemon. > > socket(PF_FILE, SOCK_STREAM, 0) = 9 > connect(9, {sa_family=AF_FILE, path=@"groupd_socket"}, 16) = 0 > write(9, "get_group -1 groupd\0\0\0\0\0\0\0\0\0\0\0\0\0"..., 2200) = 2200 > read(9, > "\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0\0"..., > 1128) = 1128 > close(9) = 0 > write(7, "start_done default 3\0\0\0\0\0\0\0\0\0\0\0\0"..., 2200) = 2200 > poll([{fd=4, events=POLLIN}, {fd=5, events=POLLIN}, {fd=7, > events=POLLIN}, {fd=-1}], 4, -1) = 1 ([{fd=7, revents=POLLIN}]) > read(7, "finish default 3\0\0\0\0\0\0\0\0\350\37Y\21\377\177\0\0"..., > 2200) = 2200 > poll([{fd=4, events=POLLIN}, {fd=5, events=POLLIN}, {fd=7, > events=POLLIN}, {fd=-1}], 4, -1 > > At the Connect-Line i expect the Path to the ccsd-socket > (/var/run/cluster/ccsd.sock). > > How can i tell fenced where to find the Socket. It's not clear from this that fenced/ccsd communication is the problem. After the node failure, please collect from all nodes the output of - cman_tool nodes - group_tool -v - group_tool dump fence - any messages in /var/log/messages Dave -- Linux-cluster mailing list Linux-cluster@xxxxxxxxxx https://www.redhat.com/mailman/listinfo/linux-cluster