Re: restart or relocate?

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



Carlo Mandelli wrote:
Hi all,

I'm trying to test a 2 nodes cluster (RHCS U4) with apache and one
monitored ip on eth1 (VIP 192.168.0.3), the hearthbeat is on eth0.

When I unplug the cable (eth1) on active node, I get these errors:

Nov 29 17:03:54 node1 clurgmgrd: [4368]: <info> Executing
/etc/init.d/httpd status
Nov 29 17:04:24 node1 clurgmgrd: [4368]: <info> Executing
/etc/init.d/httpd status
Nov 29 17:04:25 node1 kernel: tg3: eth1: Link is down.
Nov 29 17:04:44 node1 clurgmgrd: [4368]: <warning> Link for eth1: Not
detected
Nov 29 17:04:44 node1 clurgmgrd: [4368]: <warning> No link on eth1...
Nov 29 17:04:44 node1 clurgmgrd[4368]: <notice> status on ip
"192.168.0.3" returned 1 (generic error)
Nov 29 17:04:44 node1 clurgmgrd[4368]: <notice> Stopping service http
Nov 29 17:04:44 node1 clurgmgrd: [4368]: <info> Executing
/etc/init.d/httpd stop
Nov 29 17:04:44 node1 httpd: httpd shutdown succeeded
Nov 29 17:04:44 node1 clurgmgrd: [4368]: <info> Removing IPv4 address
192.168.0.3 from eth1
Nov 29 17:04:54 node1 clurgmgrd[4368]: <notice> Service http is recovering
Nov 29 17:04:54 node1 clurgmgrd[4368]: <notice> Recovering failed
service http
Nov 29 17:04:54 node1 clurgmgrd: [4368]: <warning> Link for eth1: Not
detected
Nov 29 17:04:54 node1 clurgmgrd: [4368]: <info> Executing
/etc/init.d/httpd start
Nov 29 17:04:54 node1 httpd: httpd startup succeeded
Nov 29 17:04:54 node1 clurgmgrd[4368]: <notice> Service http started
Nov 29 17:05:04 node1 clurgmgrd: [4368]: <warning> 192.168.0.3 is not
configured
Nov 29 17:05:04 node1 clurgmgrd[4368]: <notice> status on ip
"192.168.0.3" returned 1 (generic error)
Nov 29 17:05:04 node1 clurgmgrd[4368]: <notice> Stopping service http
Nov 29 17:05:04 node1 clurgmgrd: [4368]: <info> Executing
/etc/init.d/httpd stop
Nov 29 17:05:04 node1 httpd: httpd shutdown succeeded
Nov 29 17:05:04 node1 clurgmgrd[4368]: <notice> Service http is recovering
Nov 29 17:05:04 node1 clurgmgrd[4368]: <notice> Recovering failed
service http
Nov 29 17:05:04 node1 clurgmgrd: [4368]: <warning> Link for eth1: Not
detected
Nov 29 17:05:04 node1 clurgmgrd: [4368]: <info> Executing
/etc/init.d/httpd start
Nov 29 17:05:04 node1 httpd: httpd startup succeeded
<...>

and it restarts the service continously.

It performs failover only if I modify recovery mode in cluster.conf:

<service autostart="1" name="http" recovery="relocate">

Is there any way to set max number of retries before relocate service?

Thanks
Carlo
Hi Carlo,

You're probably the victim of the init-script-not-returning-zero issue. See:
http://sources.redhat.com/cluster/faq.html#rgm_wontrestart

Regards,

Bob Peterson
Red Hat Cluster Suite

--
Linux-cluster mailing list
Linux-cluster@xxxxxxxxxx
https://www.redhat.com/mailman/listinfo/linux-cluster

[Index of Archives]     [Corosync Cluster Engine]     [GFS]     [Linux Virtualization]     [Centos Virtualization]     [Centos]     [Linux RAID]     [Fedora Users]     [Fedora SELinux]     [Big List of Linux Books]     [Yosemite Camping]

  Powered by Linux