Guest is not relocating under cluster

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



Hi 

I have six node test cluster, running on rhel5.7 86_64 bit OS. 

The nodes are under the xen environment. Trying to relocate the guest if the node fails
where the guest is running. But the guest is not relocating, it is getting stopped.

The version of cman and rgmanger are :

cman-2.0.115-85.el5
rgmanager-2.0.52-21.el5


Here is the cluster.conf
--------------------------------------

<?xml version="1.0"?>
<cluster alias="newtest" config_version="26" name="newtest">
        <fence_daemon clean_start="0" post_fail_delay="0" post_join_delay="3"/>
        <clusternodes>
                <clusternode name="node1" nodeid="1" votes="1">
                        <fence>
                                <method name="1">
                                        <device action="reboot" name="ilo-node1"/>
                                </method>
                        </fence>
                </clusternode>
........
<snip>
        </clusternodes>
<cman>
    <multicast addr="xxx.1.5.1"/>
</cman>
<totem token="20000"/>
        <fencedevices>
                <fencedevice agent="fence_ilo" hostname="node1r" login="Admin" name="ilo-node1" passwd="xxxxx"/>
........
<snip>
  </fencedevices>
        <rm log_level="7" log_facility="local4">
                <failoverdomains>
                   <failoverdomain name="nd1-nd2-nd3-nd4-nd5-nd6" nofailback="1" ordered="1" restricted="1">
                        <failoverdomainnode name="node1" priority="1"/>
                        <failoverdomainnode name="node2" priority="2"/>
                        <failoverdomainnode name="node3" priority="3"/>
                        <failoverdomainnode name="node4" priority="4"/>
                        <failoverdomainnode name="node5" priority="5"/>
                        <failoverdomainnode name="node6" priority="6"/>
                </failoverdomain>
                </failoverdomains>
                <resources/>
                <vm autostart="1" name="guest1" migrate="live" recovery="relocate"/>
        </rm>
        <cman/>
</cluster>

Here are  few lines  from the log file..
--------------------------------------------------------------

Aug 20 18:51:09 node clurgmgrd[7431]: <debug> Event: Port Opened 
Aug 20 18:51:09 node clurgmgrd[7431]: <info> State change: node3 UP 
Aug 20 18:51:14 node clurgmgrd[7431]: <debug> Evaluating RG vm:guest1, state stopped, owner none 
Aug 20 18:51:14 node clurgmgrd[7431]: <debug> Event (0:3:1) Processed 
Aug 20 18:51:19 node clurgmgrd[7431]: <debug> 1 events processed 
Aug 20 18:51:35 node clurgmgrd[7431]: <debug> No other nodes have seen vm:guest1 
Aug 20 18:51:35 node clurgmgrd[7431]: <notice> Starting stopped service vm:guest1 
Aug 20 18:51:36 node clurgmgrd: [7431]: <debug> virsh -c xen:/// start guest1 
Aug 20 18:51:37 node clurgmgrd[7431]: <notice> start on vm "guest1" returned 1 (generic error) 
Aug 20 18:51:37 node clurgmgrd[7431]: <warning> #68: Failed to start vm:guest1; return value: 1 
Aug 20 18:51:37 node clurgmgrd[7431]: <debug> Stopping failed service vm:guest1 
Aug 20 18:51:37 node clurgmgrd[7431]: <notice> Stopping service vm:guest1 
Aug 20 18:51:37 node clurgmgrd: [7431]: <debug> Virtual machine guest1 is  
Aug 20 18:51:38 node clurgmgrd[7431]: <notice> Service vm:guest1 is recovering 
Aug 20 18:51:38 node clurgmgrd[7431]: <warning> #71: Relocating failed service vm:guest1 
Aug 20 18:51:38 node clurgmgrd[7431]: <debug> Sent remote-start request to 6 
Aug 20 18:51:49 node clurgmgrd[7431]: <debug> 4 events processed 

Any advice is really appreciated.

Thanks in advance.

--
Linux-cluster mailing list
Linux-cluster@xxxxxxxxxx
https://www.redhat.com/mailman/listinfo/linux-cluster



[Index of Archives]     [Corosync Cluster Engine]     [GFS]     [Linux Virtualization]     [Centos Virtualization]     [Centos]     [Linux RAID]     [Fedora Users]     [Fedora SELinux]     [Big List of Linux Books]     [Yosemite Camping]

  Powered by Linux