2008/6/25 GS R <gsrlinux@xxxxxxxxx>:
On 6/24/08, Gian Paolo Buono <gpbuono@xxxxxxxxx> wrote:Hi,
We have two RHEL5.1 boxes installed sharing a
single iscsi emc2 SAN, whitout fence devices. System is configured
as a high-availability system of xen guest.
One of the most repeating problems are fence_tool related.
# service cman start
Starting cluster:
Loading modules... done
Mounting configfs... done
Starting ccsd... done
Starting cman... done
Starting daemons... done
Starting fencing... fence_tool: can't communicate with fenced -1
# fenced -D
1204556546 cman_init error 0 111
# clustat
CMAN is not running.
# cman_tool join
# clustat
msg_open: Connection refused
Member Status: Quorate
Member Name ID Status
------ ---- ---- ------
yoda1 1 Online, Local
yoda2 2 Offline
Sometimes this problem gets solved if the two machines are rebooted at
the same time. But in the current HA configuration, I cannot guarantee
two systems will be rebooted at the same time for every problem we
face. This is my config file:
###################################cluster.conf####################################
<?xml version="1.0"?>
<cluster alias="yoda-cl" config_version="2" name="yoda-cl">
<fence_daemon clean_start="0" post_fail_delay="0" post_join_delay="3"/>
<clusternodes>
<clusternode name="yoda2" nodeid="1" votes="1">
<fence/>
</clusternode>
<clusternode name="yoda1" nodeid="2" votes="1">
<fence/>
</clusternode>
</clusternodes>
<cman expected_votes="1" two_node="1"/>
<rm>
<failoverdomains/>
<resources/>
</rm>
<fencedevices/>
</cluster>
###################################cluster.conf####################################
Regards.HiI configured a two node cluster with no fence device on RHEL5.1.The cluster started and stopped with no issues. The only difference that I see is that I have used FQDN in my cluster.confi.e., <clusternode name="yoda2.gsr.com" nodeid="1" votes="1">Check your /etc/hosts if it has the FQDN in it.ThanksGowrishankar Rajaiyan
On 6/25/08, Gian Paolo Buono <gpbuono@xxxxxxxxx> wrote:
Hi,
the problem of my cluster is that it start-up weel but after two days the problem that I have described is running, and this problem gets solved if the two machines are rebooted at the same time.
Thanks
Gian Paolo
Hi Gian
Could you please attach the logs.
Thanks
Gowrishankar Rajaiyan
-- Linux-cluster mailing list Linux-cluster@xxxxxxxxxx https://www.redhat.com/mailman/listinfo/linux-cluster