Hi Bob, Yeah the communication is all good, can ping each other, in fact I'm scp'ing the cluster.conf file to svr01 and there's nothing else on that network, they might as well be connected through a x-over cable as these two machines are the only machines on the network. I got around the startup hang by starting the OS with the "I" keypress and said No to all cluster services. Once in the OS, I did # service ccsd start # service cman start This works fine on svr00, on svr01 it comes back with [FAILED]. When I do, 'service cman status' it says the following [root@svr01 ~]# service cman status Protocol version: 5.0.1 Config version: 4 Cluster name: testcluster Cluster ID: 27453 Cluster Member: No Membership state: Joining When I do a manual: cman_tool join -w I get back a "cman_tool: Node is already active" Also, I load my modules automatically during system startup with S99local. BTW, I do now see the following messages in my /var/log/messages file Jun 29 14:36:39 svr01 ccsd[3685]: Unable to perform sendto: Cannot assign requested address Jun 29 14:36:40 svr01 kernel: CMAN: sending membership request Jun 29 14:36:41 svr01 ccsd[3685]: Unable to perform sendto: Cannot assign requested address Jun 29 14:36:45 svr01 last message repeated 2 times Jun 29 14:36:45 svr01 kernel: CMAN: sending membership request Jun 29 14:36:47 svr01 ccsd[3685]: Unable to perform sendto: Cannot assign requested address Jun 29 14:36:50 svr01 kernel: CMAN: sending membership request Jun 29 14:37:25 svr01 last message repeated 7 times Does this help? Can I have the application generate more detailed logging? Thanks in advance \R -----Original Message----- From: linux-cluster-bounces@xxxxxxxxxx [mailto:linux-cluster-bounces@xxxxxxxxxx] On Behalf Of Robert Peterson Sent: Friday, June 30, 2006 12:36 AM To: linux clustering Subject: Re: two node cluster not coming up RR wrote: > Hi Bob, > > Attached is the cluster.conf file, and below is the status I get from > the command "service cman status" on the working node: > > Svr00# service cman status > Protocol version: 5.0.1 > Config version: 4 > Cluster name: testcluster > Cluster ID: 27453 > Cluster Member: Yes > Membership state: Cluster-Member > Nodes: 1 > Expected_votes: 1 > Total_votes: 1 > Quorum: 1 > Active subsystems: 0 > Node name: svr00 > Node addresses: 10.1.3.64 > > svr00# clustat > Member Status: Quorate > > Resource Group Manager not running; no service information available. > > Member Name Status > ------ ---- ------ > svr00 Online, Local > svr01 Offline > > > I rebooted svr01 and now it just sits there at Starting clvmd: during > bootup. > > Hope this helps in anyone understanding my issue? Do I need all the > other services configured for this to work properly? i.e. clvmd, fenced, etc. etc. > I just wanted to see two nodes in a cluster first before I configured > any resources, services, fencing etc etc. > > \R > Hi RR, Hm. I didn't see anything obviously wrong with your cluster.conf file. I guess I'd reboot svr01 and try to bring it into the cluster manually, and see if it complains about anything along the way. (You may need to bring it up in single-user mode so that it doesn't hang at the service script that starts clvmd) Something like this: modprobe lock_dlm modprobe gfs ccsd cman_tool join -w fence_tool join -w clvmd I'd verify that your communications are sound, that you can ping svr00 from svr01, and that multicast is working. Any reason you went with multicast rather than broadcast? You could see if a broadcast ping (ping -b) would work from svr01 to svr00. Also, you could test to see if your firewall is blocking the IO by temporarily doing "service iptables stop" on both nodes. I'd hope that selinux isn't interfering either, but you could try doing "setenforce 0" just as an experiment to make sure. These are just some ideas. Regards, Bob Peterson Red Hat Cluster Suite -- Linux-cluster@xxxxxxxxxx https://www.redhat.com/mailman/listinfo/linux-cluster -- Linux-cluster@xxxxxxxxxx https://www.redhat.com/mailman/listinfo/linux-cluster