Hello list,

I'm trying to set up a 3-node cluster with 2 failover domains for an HA mail solution.
I want 1 node active for the IMAP server in the IMAP failover domain, 1 node active for SMTP in the SMTP failover domain, and the 3rd node in both failover domains as a backup node.

I run CentOS 5.3.
My fence device is a WTI power switch.
My cluster.conf is attached.

My SMTP service is composed of:
1 IP
1 amavisd script
1 postfix script
2 NFS mounts for postfix and amavis

If I manually kill the postfix master process (to simulate a crash), my node is not fenced and the logs say:

Jul 6 10:00:40 centos-smtp1 clurgmgrd: [4228]: <info> Executing /etc/init.d/postfix status
Jul 6 10:00:40 centos-smtp1 clurgmgrd: [4228]: <err> script:postfix: status of /etc/init.d/postfix failed (returned 3)
Jul 6 10:00:40 centos-smtp1 clurgmgrd[4228]: <notice> status on script "postfix" returned 1 (generic error)
Jul 6 10:00:40 centos-smtp1 clurgmgrd[4228]: <notice> Stopping service service:Postfix
Jul 6 10:00:40 centos-smtp1 clurgmgrd: [4228]: <info> Executing /etc/init.d/amavisd stop
Jul 6 10:00:40 centos-smtp1 kernel: do_vfs_lock: VFS is out of sync with lock manager!
Jul 6 10:00:40 centos-smtp1 last message repeated 8 times
Jul 6 10:00:41 centos-smtp1 clurgmgrd: [4228]: <info> Executing /etc/init.d/postfix stop
Jul 6 10:00:41 centos-smtp1 clurgmgrd: [4228]: <err> script:postfix: stop of /etc/init.d/postfix failed (returned 1)
Jul 6 10:00:41 centos-smtp1 clurgmgrd[4228]: <notice> stop on script "postfix" returned 1 (generic error)
Jul 6 10:00:41 centos-smtp1 clurgmgrd: [4228]: <info> Removing IPv4 address 195.83.126.201/24 from bond0
Jul 6 10:00:41 centos-smtp1 avahi-daemon[3552]: Withdrawing address record for 195.83.126.201 on bond0.
Jul 6 10:00:51 centos-smtp1 clurgmgrd: [4228]: <info> unmounting /var/lib/amavis
Jul 6 10:00:51 centos-smtp1 clurgmgrd: [4228]: <info> unmounting /var/spool/postfix
Jul 6 10:00:51 centos-smtp1 clurgmgrd[4228]: <crit> #12: RG service:Postfix failed to stop; intervention required
Jul 6 10:00:51 centos-smtp1 clurgmgrd[4228]: <notice> Service service:Postfix is failed
Jul 6 10:00:52 centos-smtp1 ntpd[3322]: synchronized to 195.83.126.119, stratum 1

clustat says:

Cluster Status for cluster-test @ Mon Jul 6 10:02:39 2009
Member Status: Quorate

 Member Name                                                    ID   Status
 ------ ----                                                    ---- ------
 centos-imap1.ill.fr                                               1 Online, Local, rgmanager
 centos-imap2.ill.fr                                               2 Online, rgmanager
 centos-smtp1.ill.fr                                               3 Online, rgmanager
 /dev/disk/by-id/scsi-360a98000567247514634507447594661-part1      0 Online, Quorum Disk

 Service Name         Owner (Last)              State
 ------- ----         ----- ------              -----
 service:Imap         centos-imap2.ill.fr       started
 service:Postfix      (centos-smtp1.ill.fr)     failed

So I have to disable the Postfix service with:
clusvcadm -d Postfix
and re-enable it with:
clusvcadm -e Postfix

Could you explain why my original SMTP node is not fenced, and why my service is not started on the 2nd node?
Is there a way to force the fencing?

--
ARMANET Stephane
Division Projet Technique
Service Informatique
Groupe Infrastructure
Institut Laue Langevin
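
For anyone reproducing this: the return codes that clurgmgrd reports in the log above can be checked by hand on the node. This is a minimal sketch, not part of the original setup. It assumes the init script path shown in the log; `show_rc` is a hypothetical helper name, and `INIT` is an illustrative variable for pointing it at another script.

```shell
#!/bin/sh
# show_rc (hypothetical helper): run one init-script action quietly and
# print the exit code, i.e. the value rgmanager logs as "returned N".
show_rc() {
    script="$1"
    action="$2"
    "$script" "$action" >/dev/null 2>&1
    rc=$?
    echo "$action -> $rc"
}

# Path taken from the log above; set INIT to check a different script.
INIT="${INIT:-/etc/init.d/postfix}"

# Only run the checks if the script exists on this machine.
if [ -x "$INIT" ]; then
    show_rc "$INIT" status
    show_rc "$INIT" stop
fi
```

In the log above, with the postfix master process killed, status returned 3 and stop returned 1, after which rgmanager logged "failed to stop; intervention required".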
<?xml version="1.0"?>
<cluster alias="cluster-test" config_version="57" name="cluster-test">
  <fence_daemon clean_start="0" post_fail_delay="0" post_join_delay="3"/>
  <clusternodes>
    <clusternode name="centos-imap1.test.fr" nodeid="1" votes="1">
      <fence>
        <method name="1">
          <device name="psu2" option="off" port="1"/>
          <device name="psu2" option="off" port="5"/>
        </method>
        <method name="2">
          <device name="centos_manual-fence" nodename="centos-imap1.test.fr"/>
        </method>
      </fence>
    </clusternode>
    <clusternode name="centos-imap2.test.fr" nodeid="2" votes="1">
      <fence>
        <method name="1">
          <device name="psu2" option="off" port="2"/>
          <device name="psu2" option="off" port="6"/>
        </method>
        <method name="2">
          <device name="centos_manual-fence" nodename="centos-imap2.test.fr"/>
        </method>
      </fence>
    </clusternode>
    <clusternode name="centos-smtp1.test.fr" nodeid="3" votes="1">
      <fence>
        <method name="1">
          <device name="psu1" option="off" port="1"/>
          <device name="psu1" option="off" port="5"/>
        </method>
        <method name="2">
          <device name="centos_manual-fence" nodename="centos-smtp1.test.fr"/>
        </method>
      </fence>
    </clusternode>
  </clusternodes>
  <cman/>
  <fencedevices>
    <fencedevice agent="fence_manual" name="centos_manual-fence"/>
    <fencedevice agent="fence_wti" ipaddr="192.168.0.215" name="psu1" passwd="passwd"/>
    <fencedevice agent="fence_wti" ipaddr="192.168.0.216" name="psu2" passwd="passwd"/>
  </fencedevices>
  <rm log_facility="local4" log_level="7">
    <failoverdomains>
      <failoverdomain name="imap-FOD" nofailback="0" ordered="1" restricted="1">
        <failoverdomainnode name="centos-imap1.test.fr" priority="1"/>
        <failoverdomainnode name="centos-imap2.test.fr" priority="2"/>
      </failoverdomain>
      <failoverdomain name="smtp-FOD" ordered="1" restricted="1">
        <failoverdomainnode name="centos-smtp1.test.fr" priority="1"/>
        <failoverdomainnode name="centos-imap2.test.fr" priority="2"/>
      </failoverdomain>
    </failoverdomains>
    <resources>
      <netfs export="/vol/volSMTP/postfix" force_unmount="1" fstype="nfs" host="romulus.test.fr" mountpoint="/var/spool/postfix" name="NFS-postfix" options="rw,nolock"/>
      <fs device="/dev/mapper/vgMail-lvMailboxes" force_fsck="1" force_unmount="1" fsid="34650" fstype="ext3" mountpoint="/var/spool/imap" name="lvMailboxes" options="commit=1" self_fence="1"/>
      <fs device="/dev/mapper/vgMail-lvDBMail" force_fsck="1" force_unmount="1" fsid="4277" fstype="ext3" mountpoint="/var/lib/imap" name="lvDBMail" options="commit=1" self_fence="1"/>
      <netfs export="/vol/volSMTP/amavis" force_unmount="1" fstype="nfs" host="romulus.test.fr" mountpoint="/var/lib/amavis" name="NFS Amavis" options=""/>
    </resources>
    <service autostart="1" domain="imap-FOD" name="Imap" recovery="relocate">
      <ip address="192.168.0.198/24" monitor_link="1">
        <script file="/etc/init.d/cyrus-imapd" name="Cyrus-imapd"/>
      </ip>
      <fs ref="lvMailboxes"/>
      <fs ref="lvDBMail"/>
    </service>
    <service autostart="1" domain="smtp-FOD" name="Postfix" recovery="relocate">
      <ip address="192.168.0.201" monitor_link="1">
        <script file="/etc/init.d/postfix" name="postfix"/>
        <script file="/etc/init.d/amavisd" name="amavisd"/>
      </ip>
      <netfs ref="NFS-postfix"/>
      <netfs ref="NFS Amavis"/>
    </service>
  </rm>
  <quorumd interval="2" label="QDISK" min_score="1" tko="5" votes="2">
    <heuristic interval="2" program="/bin/ping -c 1 -t 1 192.168.0.254" score="1"/>
    <heuristic interval="5" program="/bin/ping -c 3 -t 1 192.168.0.254" score="1"/>
  </quorumd>
  <totem consensus="4800" join="60" token="25000" token_retransmits_before_loss_const="20"/>
</cluster>
--
Linux-cluster mailing list
Linux-cluster@xxxxxxxxxx
https://www.redhat.com/mailman/listinfo/linux-cluster