Hi all,
I've been fussing with a two-node test cluster for a while now. I had
it working, but I had very little luck when testing failure and recovery.
So I decided to start over and follow the "Red Hat" way. Specifically, I
followed their "Configuring and Managing a Red Hat
Cluster; Red Hat Cluster for Red Hat Enterprise 5" PDF.
I've gotten to the point where the cluster has been built using luci.
However, the nodes haven't joined, and trying to use 'have node join
cluster' fails and generates the following in '/var/log/messages':
---------------------------------------------
Oct 9 13:15:45 vsh02 luci[22301]: Unable to retrieve batch 531050721
status from vsh02.canadaequity.com:11111: module scheduled for execution
Oct 9 13:15:46 vsh02 ccsd[24724]: Unable to connect to cluster
infrastructure after 154350 seconds.
Oct 9 13:15:47 vsh02 openais[31632]: [MAIN ] AIS Executive Service
RELEASE 'subrev 1358 version 0.80.3'
Oct 9 13:15:47 vsh02 openais[31632]: [MAIN ] Copyright (C) 2002-2006
MontaVista Software, Inc and contributors.
Oct 9 13:15:47 vsh02 openais[31632]: [MAIN ] Copyright (C) 2006 Red
Hat, Inc.
Oct 9 13:15:47 vsh02 openais[31632]: [MAIN ] AIS Executive Service:
started and ready to provide service.
Oct 9 13:15:47 vsh02 openais[31632]: [MAIN ] Using default multicast
address of 239.192.119.37
Oct 9 13:15:47 vsh02 openais[31632]: [MAIN ] openais component
openais_cpg loaded.
Oct 9 13:15:47 vsh02 openais[31632]: [MAIN ] Registering service
handler 'openais cluster closed process group service v1.01'
Oct 9 13:15:47 vsh02 openais[31632]: [MAIN ] openais component
openais_cfg loaded.
Oct 9 13:15:47 vsh02 openais[31632]: [MAIN ] Registering service
handler 'openais configuration service'
Oct 9 13:15:47 vsh02 openais[31632]: [MAIN ] openais component
openais_msg loaded.
Oct 9 13:15:47 vsh02 openais[31632]: [MAIN ] Registering service
handler 'openais message service B.01.01'
Oct 9 13:15:47 vsh02 openais[31632]: [MAIN ] openais component
openais_lck loaded.
Oct 9 13:15:47 vsh02 openais[31632]: [MAIN ] Registering service
handler 'openais distributed locking service B.01.01'
Oct 9 13:15:47 vsh02 openais[31632]: [MAIN ] openais component
openais_evt loaded.
Oct 9 13:15:47 vsh02 openais[31632]: [MAIN ] Registering service
handler 'openais event service B.01.01'
Oct 9 13:15:47 vsh02 openais[31632]: [MAIN ] openais component
openais_ckpt loaded.
Oct 9 13:15:47 vsh02 openais[31632]: [MAIN ] Registering service
handler 'openais checkpoint service B.01.01'
Oct 9 13:15:47 vsh02 openais[31632]: [MAIN ] openais component
openais_amf loaded.
Oct 9 13:15:47 vsh02 openais[31632]: [MAIN ] Registering service
handler 'openais availability management framework B.01.01'
Oct 9 13:15:47 vsh02 openais[31632]: [MAIN ] openais component
openais_clm loaded.
Oct 9 13:15:47 vsh02 openais[31632]: [MAIN ] Registering service
handler 'openais cluster membership service B.01.01'
Oct 9 13:15:47 vsh02 openais[31632]: [MAIN ] openais component
openais_evs loaded.
Oct 9 13:15:47 vsh02 openais[31632]: [MAIN ] Registering service
handler 'openais extended virtual synchrony service'
Oct 9 13:15:47 vsh02 openais[31632]: [MAIN ] openais component
openais_cman loaded.
Oct 9 13:15:49 vsh02 openais[31682]: [MAIN ] AIS Executive Service
RELEASE 'subrev 1358 version 0.80.3'
Oct 9 13:15:49 vsh02 openais[31682]: [MAIN ] Copyright (C) 2002-2006
MontaVista Software, Inc and contributors.
Oct 9 13:15:49 vsh02 openais[31682]: [MAIN ] Copyright (C) 2006 Red
Hat, Inc.
Oct 9 13:15:49 vsh02 openais[31682]: [MAIN ] AIS Executive Service:
started and ready to provide service.
Oct 9 13:15:49 vsh02 openais[31682]: [MAIN ] Using default multicast
address of 239.192.119.37
Oct 9 13:15:49 vsh02 openais[31682]: [MAIN ] openais component
openais_cpg loaded.
Oct 9 13:15:49 vsh02 openais[31682]: [MAIN ] Registering service
handler 'openais cluster closed process group service v1.01'
Oct 9 13:15:49 vsh02 openais[31682]: [MAIN ] openais component
openais_cfg loaded.
Oct 9 13:15:49 vsh02 openais[31682]: [MAIN ] Registering service
handler 'openais configuration service'
Oct 9 13:15:49 vsh02 openais[31682]: [MAIN ] openais component
openais_msg loaded.
Oct 9 13:15:49 vsh02 openais[31682]: [MAIN ] Registering service
handler 'openais message service B.01.01'
Oct 9 13:15:49 vsh02 openais[31682]: [MAIN ] openais component
openais_lck loaded.
Oct 9 13:15:49 vsh02 openais[31682]: [MAIN ] Registering service
handler 'openais distributed locking service B.01.01'
Oct 9 13:15:49 vsh02 openais[31682]: [MAIN ] openais component
openais_evt loaded.
Oct 9 13:15:49 vsh02 openais[31682]: [MAIN ] Registering service
handler 'openais event service B.01.01'
Oct 9 13:15:49 vsh02 openais[31682]: [MAIN ] openais component
openais_ckpt loaded.
Oct 9 13:15:49 vsh02 openais[31682]: [MAIN ] Registering service
handler 'openais checkpoint service B.01.01'
Oct 9 13:15:49 vsh02 openais[31682]: [MAIN ] openais component
openais_amf loaded.
Oct 9 13:15:49 vsh02 openais[31682]: [MAIN ] Registering service
handler 'openais availability management framework B.01.01'
Oct 9 13:15:49 vsh02 openais[31682]: [MAIN ] openais component
openais_clm loaded.
Oct 9 13:15:49 vsh02 openais[31682]: [MAIN ] Registering service
handler 'openais cluster membership service B.01.01'
Oct 9 13:15:49 vsh02 openais[31682]: [MAIN ] openais component
openais_evs loaded.
Oct 9 13:15:49 vsh02 openais[31682]: [MAIN ] Registering service
handler 'openais extended virtual synchrony service'
Oct 9 13:15:49 vsh02 openais[31682]: [MAIN ] openais component
openais_cman loaded.
Oct 9 13:15:51 vsh02 luci[22301]: Unable to retrieve batch 531050721
status from vsh02.canadaequity.com:11111: service cman start failed:
---------------------------------------------
When I try to start 'cman' from the command line, I get this error:
---------------------------------------------
# service cman start
Starting cluster:
Enabling workaround for Xend bridged networking... done
Loading modules... done
Mounting configfs... done
Starting ccsd... done
Starting cman... failed
/usr/sbin/cman_tool: aisexec daemon didn't start
---------------------------------------------
This generates the same openais [MAIN ] messages in the log as shown above.
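In case it helps anyone reading the log, here is a rough sketch (my own, not from the Red Hat guide) that counts how many separate openais instances announce startup in a syslog file, keyed by PID. It assumes each message sits on one unwrapped line, as in the real '/var/log/messages' rather than the mail-wrapped excerpt above. The function name count_openais_starts is hypothetical.

```shell
#!/bin/sh
# Sketch: count distinct openais PIDs that logged the
# "AIS Executive Service: started" message in a syslog file.
# Assumption: one log message per line (not mail-wrapped).
# count_openais_starts is a made-up helper name, not part of any package.

count_openais_starts() {
    # $1 = path to the log file to scan
    awk '
        /openais\[[0-9]+\]:/ && /AIS Executive Service: started/ {
            pid = $0
            sub(/^.*openais\[/, "", pid)   # drop everything up to the PID
            sub(/\].*$/, "", pid)          # drop everything after the PID
            seen[pid]++
        }
        END {
            n = 0
            for (p in seen) n++            # number of distinct PIDs
            print n
        }
    ' "$1"
}

# Example call (path is an assumption; adjust for your system):
# count_openais_starts /var/log/messages
```

In the excerpt above, two different PIDs (31632 and 31682) log the startup message about two seconds apart, so openais was launched twice during a single join attempt.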
My cluster is pretty simple:
- Two ASUS servers with three NICs each, one of which is dedicated to a DRBD link.
- IPMI for fencing
- LVM running on DRBD (no SAN, I'm afraid)
Any insight into what I might be doing wrong?
Thanks!
Madi
--
Linux-cluster mailing list
Linux-cluster@xxxxxxxxxx
https://www.redhat.com/mailman/listinfo/linux-cluster