Ok i think that it works but now i have another problem
On the node0 messages i can see the follow message
fence_manual: Node node1 needs to be reset before recovery can procede. Waiting for node1 to rejoin the cluster or for manual acknowledgement that it has been reset (i.e. fence_ack_manual -n node1)
so i try to fence_ack_manual -n node1
and i recived this message
fence_ack_manual -n node1
Warning: If the node "node1" has not been manually fenced
(i.e. power cycled or disconnected from shared storage devices)
the GFS file system may become corrupted and all its data
unrecoverable! Please verify that the node shown above has
been reset or disconnected from storage.
(i.e. power cycled or disconnected from shared storage devices)
the GFS file system may become corrupted and all its data
unrecoverable! Please verify that the node shown above has
been reset or disconnected from storage.
Are you certain you want to continue? [yN] y
can't open /tmp/fence_manual.fifo: No such file or directory
Thank you for your help
Best Regards
Mauro Casiraghi
On Thu, Nov 13, 2008 at 3:58 PM, John Ruemker <jruemker@xxxxxxxxxx> wrote:
Try adding the nodename attribute to each device as seen here:<device name="Manual-0" nodename="node0"/>
Mauro Casiraghi wrote:
I have two cluster nodes with the follow configuration
For each node i had setup manual fencing
<?xml version="1.0"?>
<cluster alias="rhcs" config_version="13" name="mauro">
<fence_daemon clean_start="0" post_fail_delay="0" post_join_delay="3"/>
<clusternodes>
<clusternode name="node0" nodeid="1" votes="1">
<fence>
<method name="1"><device name="Manual-1" nodename="node1"/>
</method>
</fence>
</clusternode>
<clusternode name="node1" nodeid="2" votes="1">
<fence>
<method name="1">-John
</method>
</fence>
</clusternode>
</clusternodes>
<cman expected_votes="1" two_node="1"/>
<fencedevices>
<fencedevice agent="fence_manual" name="Manual-0" />
<fencedevice agent="fence_manual" name="Manual-1" />
</fencedevices>
<rm>
<failoverdomains>
<failoverdomain name="rhcs-domain" ordered="0" restricted="1">
<failoverdomainnode name="node0" priority="1"/>
<failoverdomainnode name="node1" priority="1"/>
</failoverdomain>
</failoverdomains>
<resources>
<ip address="xx.xxx.xx.78" monitor_link="1"/>
</resources>
<service autostart="1" domain="rhcs-domain" exclusive="0" name="rhcs-web" recovery="relocate">
<ip ref="xx.xxx.xx..78"/>
</service>
</rm>
</cluster>
On my messages (node0) i had recived this message
Nov 13 12:06:34 lxxxxxxx fenced[2002]: fencing node "node1"
Nov 13 12:06:34 lxxxxxxx fenced[2002]: agent "fence_manual" reports: failed: fence_manual no node name
How can i fix this problem
--
Linux-cluster mailing list
Linux-cluster@xxxxxxxxxx
https://www.redhat.com/mailman/listinfo/linux-cluster
-- Linux-cluster mailing list Linux-cluster@xxxxxxxxxx https://www.redhat.com/mailman/listinfo/linux-cluster