Hi It seems you are using hostname of cluster nodes at the place of hostname of ilo (ILO should have separate ip and hostname in DNS) In below config is node1.drctmb.com assigned as hostname of node or the hostname of ILO device? It should be hostname of ilo device.. <fencedevices> > <fencedevice agent="fence_ilo" hostname="node1.drctmb.com" > login="root" name="NODE1" passwd="redhat123"/> > <fencedevice agent="fence_ilo" hostname="node2.drctmb.com" > login="root" name="NODE2" passwd="redhat123"/> > </fencedevices> Thanks Anoop -----Original Message----- From: linux-cluster-bounces@xxxxxxxxxx [mailto:linux-cluster-bounces@xxxxxxxxxx] On Behalf Of linux-cluster-request@xxxxxxxxxx Sent: Thursday, September 09, 2010 12:00 PM To: linux-cluster@xxxxxxxxxx Subject: Linux-cluster Digest, Vol 77, Issue 5 Send Linux-cluster mailing list submissions to linux-cluster@xxxxxxxxxx To subscribe or unsubscribe via the World Wide Web, visit https://www.redhat.com/mailman/listinfo/linux-cluster or, via email, send a message with subject or body 'help' to linux-cluster-request@xxxxxxxxxx You can reach the person managing the list at linux-cluster-owner@xxxxxxxxxx When replying, please edit your Subject line so it is more specific than "Re: Contents of Linux-cluster digest..." Today's Topics: 1. Re: need help - Fencing problem (ESGLinux) 2. Re: need help - Fencing problem (rhurst@xxxxxxxxxxxxxxxxx) 3. Re: need help - Fencing problem (Nehemias Jahcob) 4. Re: need help - Fencing problem (Ben Turner) ---------------------------------------------------------------------- Message: 1 Date: Thu, 9 Sep 2010 10:51:47 +0200 From: ESGLinux <esggrupos@xxxxxxxxx> To: linux clustering <linux-cluster@xxxxxxxxxx> Subject: Re: need help - Fencing problem Message-ID: <AANLkTinvwGjTYc01AgdghwTszSm2j5TJXyw+nEmEwGFe@xxxxxxxxxxxxxx> Content-Type: text/plain; charset="iso-8859-1" Hi, the only reason was that when I used as shared the speed of this device was very very low. Marked it as non-shared it works fine. I don?t know the reason. It was a try-error test, Greetings, ESG 2010/9/9 Jankowski, Chris <Chris.Jankowski@xxxxxx> > Why did you have to set iLO as non-shared? > > Thank and regards, > > Chris > > ------------------------------ > *From:* linux-cluster-bounces@xxxxxxxxxx [mailto: > linux-cluster-bounces@xxxxxxxxxx] *On Behalf Of *ESGLinux > *Sent:* Wednesday, 8 September 2010 22:57 > *To:* linux clustering > > *Subject:* Re: need help - Fencing problem > > Hello, > > Have you configured the iLO devices entering in the BIOS? > > I remenber I have to set up the user/pass in the iLO and marked the iLo as > not shared > > > HTH, > > ESG > > 2010/9/8 Girish Prajapati <girishpati@xxxxxxxxx> > >> Hello Everybody, >> i am having problem of fencing a cluster node let me explain indetail : >> I have installed RHEL 5.4 on HP Prolaint DL280 G5 servers and iLO 2as >> fencing device. Am managing cluster through Luci - (Conga). itseems >> everything is working fine. I can reboot cluster nodes through Luci and >> service get transfer to another node. After rebooting node connect to >> cluster automatically without any error. >> Problem is i can not do Fence this node through Luci, when i try to fence >> any node i get following error : >> >> Sep 8 14:51:16 node2 fence_node[9106]: agent "fence_ilo" reports: Unable >> to connect/login to fencing device >> Sep 8 14:51:16 node2 fence_node[9106]: Fence of "node1.drctmb.com" was >> unsuccessful >> >> my iLO license is : iLO 2 Advanced Evaluation >> Do i need to have license of iLO or there is problem in configuration of >> cluster ? >> how i can check cluster log in details. >> >> Appreciate your help. >> Thank you in advance. >> >> Regards, >> Girishkumar R Prajapati >> >> >> -- >> Linux-cluster mailing list >> Linux-cluster@xxxxxxxxxx >> https://www.redhat.com/mailman/listinfo/linux-cluster >> > > > -- > Linux-cluster mailing list > Linux-cluster@xxxxxxxxxx > https://www.redhat.com/mailman/listinfo/linux-cluster > -------------- next part -------------- An HTML attachment was scrubbed... URL: <https://www.redhat.com/archives/linux-cluster/attachments/20100909/4398 a868/attachment.html> ------------------------------ Message: 2 Date: Thu, 9 Sep 2010 09:34:20 -0400 From: <rhurst@xxxxxxxxxxxxxxxxx> To: <linux-cluster@xxxxxxxxxx> Subject: Re: need help - Fencing problem Message-ID: <50168EC934B8D64AA8D8DD37F840F3DE05640628E6@xxxxxxxxxxxxxxxxxxxxxxxxx> Content-Type: text/plain; charset="us-ascii" For what it is worth, our experiences with HP iLO management cards: iLO found on G1 servers does not need to be licensed, AFAIK, it does not have the option to do so anyways. iLO2 found on G2 and beyond does not need to be licensed either, if you are only using it as a fencing device. We licensed all of ours, because it enabled useful KVM with remote media capabilities that are superior than our Raritan KVM infrastructure. Both management cards should have their firmware updated -- they were both problematic to us as factory-shipped, but applying their update packs allowed them to work as advertised. Also, can't you add "-v" for verbose output and also something like "-D /tmp/fence.out" to save debugging info to an output file? It might help some to see where exactly the failure is occuring. Good luck. ________________________________ From: linux-cluster-bounces@xxxxxxxxxx [mailto:linux-cluster-bounces@xxxxxxxxxx] On Behalf Of Girish Prajapati Sent: Wednesday, September 08, 2010 6:06 AM To: Linux-cluster@xxxxxxxxxx Subject: need help - Fencing problem Hello Everybody, i am having problem of fencing a cluster node let me explain indetail : I have installed RHEL 5.4 on HP Prolaint DL280 G5 servers and iLO 2as fencing device. Am managing cluster through Luci - (Conga). itseems everything is working fine. I can reboot cluster nodes through Luci and service get transfer to another node. After rebooting node connect to cluster automatically without any error. Problem is i can not do Fence this node through Luci, when i try to fence any node i get following error : Sep 8 14:51:16 node2 fence_node[9106]: agent "fence_ilo" reports: Unable to connect/login to fencing device Sep 8 14:51:16 node2 fence_node[9106]: Fence of "node1.drctmb.com" was unsuccessful my iLO license is : iLO 2 Advanced Evaluation Do i need to have license of iLO or there is problem in configuration of cluster ? how i can check cluster log in details. Appreciate your help. Thank you in advance. Regards, Girishkumar R Prajapati -------------- next part -------------- An HTML attachment was scrubbed... URL: <https://www.redhat.com/archives/linux-cluster/attachments/20100909/d58c 6bb4/attachment.html> ------------------------------ Message: 3 Date: Thu, 9 Sep 2010 10:18:31 -0400 From: Nehemias Jahcob <nehemiasjahcob@xxxxxxxxx> To: linux clustering <linux-cluster@xxxxxxxxxx> Subject: Re: need help - Fencing problem Message-ID: <AANLkTim-nS3c8e67kPycd-u0XFOMERR8EJorG6+xHn4M@xxxxxxxxxxxxxx> Content-Type: text/plain; charset="iso-8859-1" 1. ) You can increase the verbosity level for troubleshooting?? <cluster alias="girish" config_version="*n+1*" name="girish"> ---- <fence_daemon clean_start="0" post_fail_delay="0" post_join_delay="3" * log_level="7"/>* <rm *log_level="7"*> ----- #ccs_tool update /etc/cluster/cluster.conf Copy-paste /var/log/messages 2.) What version of PSP you have installed?? 3.) If nothing works, I recommend using fence_ipmi Greetings! 2010/9/9 <rhurst@xxxxxxxxxxxxxxxxx> > For what it is worth, our experiences with HP iLO management cards: > > iLO found on G1 servers does not need to be licensed, AFAIK, it does not > have the option to do so anyways. > > iLO2 found on G2 and beyond does not need to be licensed either, if you are > only using it as a fencing device. We licensed all of ours, because it > enabled useful KVM with remote media capabilities that are superior than our > Raritan KVM infrastructure. > > Both management cards should have their firmware updated -- they were both > problematic to us as factory-shipped, but applying their update > packs allowed them to work as advertised. > > Also, can't you add "-v" for verbose output and also something like "-D > /tmp/fence.out" to save debugging info to an output file? It might help > some to see where exactly the failure is occuring. Good luck. > > ------------------------------ > *From:* linux-cluster-bounces@xxxxxxxxxx [mailto: > linux-cluster-bounces@xxxxxxxxxx] *On Behalf Of *Girish Prajapati > *Sent:* Wednesday, September 08, 2010 6:06 AM > *To:* Linux-cluster@xxxxxxxxxx > *Subject:* need help - Fencing problem > > Hello Everybody, > i am having problem of fencing a cluster node let me explain indetail : > I have installed RHEL 5.4 on HP Prolaint DL280 G5 servers and iLO 2as > fencing device. Am managing cluster through Luci - (Conga). itseems > everything is working fine. I can reboot cluster nodes through Luci and > service get transfer to another node. After rebooting node connect to > cluster automatically without any error. > Problem is i can not do Fence this node through Luci, when i try to fence > any node i get following error : > > Sep 8 14:51:16 node2 fence_node[9106]: agent "fence_ilo" reports: Unable > to connect/login to fencing device > Sep 8 14:51:16 node2 fence_node[9106]: Fence of "node1.drctmb.com" was > unsuccessful > > my iLO license is : iLO 2 Advanced Evaluation > Do i need to have license of iLO or there is problem in configuration of > cluster ? > how i can check cluster log in details. > > Appreciate your help. > Thank you in advance. > > Regards, > Girishkumar R Prajapati > > > -- > Linux-cluster mailing list > Linux-cluster@xxxxxxxxxx > https://www.redhat.com/mailman/listinfo/linux-cluster > -------------- next part -------------- An HTML attachment was scrubbed... URL: <https://www.redhat.com/archives/linux-cluster/attachments/20100909/ba94 0c01/attachment.html> ------------------------------ Message: 4 Date: Thu, 9 Sep 2010 11:58:45 -0400 (EDT) From: Ben Turner <bturner@xxxxxxxxxx> To: linux clustering <linux-cluster@xxxxxxxxxx> Subject: Re: need help - Fencing problem Message-ID: <155361964.174311284047925612.JavaMail.root@xxxxxxxxxxxxxxxxxxxxxxxxxxxx .redhat.com> Content-Type: text/plain; charset=utf-8 Judging from: "Sep 8 14:51:16 node2 fence_node[9106]: agent "fence_ilo" reports: Unable to connect/login to fencing device" Chances are you are not using the correct username/password/IP or the ilo is not configured for telnet logins. Try the following: 1. Login to the ilo via telnet from the command line. Be sure to use the username/password/IP you have in cluster.conf. 2. If that is successful try: # fence_ilo -v -a "Ilo IP from cluster.conf" -l "Ilo user from cluster.conf" -p "Ilo passwd from cluster.conf" -o status The -v will display exactly what the fence agent sees and is very useful for debugging failing fences. If the status fails send me the output. 3. If the fence_ilo successful try: # fence_node <node name from cluster.conf> If all 3 are successful then fencing is setup properly and there may be a problem running it from Luci, if any of the 3 fail post the error back to the list and I'll look at it. -Ben ----- "Girish Prajapati" <girishpati@xxxxxxxxx> wrote: > Hello, > i can run following command successfully from another node but still > getting same error message : > > fence_ilo -a "Ilo IP" -l "Ilo user" -p "Ilo passwd" -o reboot > > Sep 9 14:37:00 node2 openais[2904]: [CLM ] Members Joined: > Sep 9 14:37:00 node2 openais[2904]: [SYNC ] This node is within the > primary component and will provide service. > Sep 9 14:37:00 node2 openais[2904]: [TOTEM] entering OPERATIONAL > state. > Sep 9 14:37:00 node2 openais[2904]: [CLM ] got nodejoin message > 192.168.0.28 > Sep 9 14:37:00 node2 openais[2904]: [CPG ] got joinlist message from > node 1 > Sep 9 14:37:00 node2 fenced[2923]: node1.drctmb.com not a cluster > member after 0 sec post_fail_delay > Sep 9 14:37:00 node2 fenced[2923]: fencing node "node1.drctmb.com" > Sep 9 14:37:10 node2 fenced[2923]: agent "fence_ilo" reports: Unable > to connect/login to fencing device > Sep 9 14:37:10 node2 fenced[2923]: fence "node1.drctmb.com" failed > Sep 9 14:37:15 node2 fenced[2923]: fencing node "node1.drctmb.com" > Sep 9 14:37:26 node2 fenced[2923]: agent "fence_ilo" reports: Unable > to connect/login to fencing device > > node1 rebooted and get connect to the cluster but now my webby service > not working see below log : > > Broadcast message from root (Thu Sep 9 14:32:41 2010): > The system is going down for system halt NOW! > Sep 9 14:19:22 node1 last message repeated 17 times > Sep 9 14:32:41 node1 shutdown[25506]: shutting down for system halt > Sep 9 14:32:41 node1 pcscd: winscard.c:304:SCardConnect() Reader > E-Gate 0 0 Not Found > Sep 9 14:32:43 node1 modclusterd: shutdown succeeded > Sep 9 14:32:43 node1 rgmanager: [25593]: <notice> Shutting down > Cluster Service Manager... > Sep 9 14:32:43 node1 clurgmgrd[3457]: <notice> Shutting down > Sep 9 14:32:43 node1 clurgmgrd[3457]: <notice> Shutting down > Sep 9 14:32:43 node1 clurgmgrd[3457]: <notice> Stopping service > service:webby > Sep 9 14:32:44 node1 avahi-daemon[3378]: Withdrawing address record > for 192.168.0.30 on eth0. > Read from remote host node1: Connection reset by peer > . > . > . > Sep 9 14:35:42 node1 smartd[3585]: Device: /dev/hda, packet devices > [this device CD/DVD] not SMART capable > Sep 9 14:35:42 node1 smartd[3585]: Device: /dev/sda, opened > Sep 9 14:35:42 node1 smartd[3585]: Device: /dev/sda, IE (SMART) not > enabled, skip device Try 'smartctl -s on /dev/sda' to turn on SMART > features > Sep 9 14:35:42 node1 smartd[3585]: Monitoring 0 ATA and 0 SCSI devices > Sep 9 14:35:42 node1 smartd[3604]: smartd has fork()ed into background > mode. New PID=3604. > Sep 9 14:35:42 node1 avahi-daemon[3412]: Service "SFTP File Transfer > on node1" (/services/sftp-ssh.service) successfully established. > Sep 9 14:35:45 node1 pcscd: winscard.c:304:SCardConnect() Reader > E-Gate 0 0 Not Found > Sep 9 14:35:45 node1 last message repeated 3 times > Sep 9 14:35:45 node1 kernel: mtrr: type mismatch for d8000000,2000000 > old: uncachable new: write-combining > Sep 9 14:35:46 node1 clurgmgrd: [3491]: <err> Checking Existence Of > File /var/run/cluster/apache/apache:httpd.pid [apache:httpd] > Failed > - File Doesn't Exist > > > > It seems that there problem in fencing device configuration. > Please find here my cluster.conf : > > > <?xml version="1.0"?> > <cluster alias="girish" config_version="21" name="girish"> > <fence_daemon clean_start="0" post_fail_delay="0" > post_join_delay="3"/> > <clusternodes> > <clusternode name=" node2.drctmb.com " nodeid="1" votes="1"> > <fence> > <method name="1"> > <device name="NODE2"/> > </method> > </fence> > </clusternode> > <clusternode name="node1.drctmb.com" nodeid="2" votes="1"> > <fence> > <method name="1"> > <device name="NODE1"/> > </method> > </fence> > </clusternode> > </clusternodes> > <cman expected_votes="1" two_node="1"/> > <fencedevices> > <fencedevice agent="fence_ilo" hostname="node1.drctmb.com" > login="root" name="NODE1" passwd="redhat123"/> > <fencedevice agent="fence_ilo" hostname="node2.drctmb.com" > login="root" name="NODE2" passwd="redhat123"/> > </fencedevices> > <rm> > <failoverdomains> > <failoverdomain name="prefer_node1" nofailback="0" ordered="1" > restricted="1"> > <failoverdomainnode name="node2.drctmb.com" priority="2"/> > <failoverdomainnode name="node1.drctmb.com" priority="1"/> > </failoverdomain> > </failoverdomains> > <resources> > <fs device="/dev/sda1" force_fsck="0" force_unmount="0" fsid="8669" > fstype="ext3" mountpoint="/var/www/html" name="docroot" > self_fence="0"/> > <ip address="192.168.0.30" monitor_link="1"/> > <apache config_file="conf/httpd.conf" name="httpd" > server_root="/etc/httpd" shutdown_wait="5"/> > </resources> > <service autostart="1" domain="prefer_node1" exclusive="0" > name="webby" recovery="relocate"> > <ip ref="192.168.0.30"/> > <fs ref="docroot"/> > <apache ref="httpd"/> > </service> > </rm> > <fence_xvmd/> > </cluster> > ~ > > This is first time am working on Clustering so please help me. > Appreciate your help. > > Thank you. > > > > From: Brem Belguebli <brem.belguebli@xxxxxxxxx> > To: linux clustering <linux-cluster@xxxxxxxxxx> > Sent: Thu, September 9, 2010 11:30:28 AM > Subject: Re: need help - Fencing problem > > try run this from another node of the cluster > > fence_ilo -a "Ilo IP" -l "Ilo user" -p "Ilo passwd" -o reboot > > > Additionnally, by connecting thru http to the Ilo, you should be able > to > see Ilo logs (in the general tab) and see if it is due to a lack of > licensing > > > On Wed, 2010-09-08 at 22:29 -0700, Girish Prajapati wrote: > > Hello... > > > > I have already configure BIOS for iLO.. but am not sure why i don > need > > to shared ?? > > please anybody can help me out for this problem. > > Do i need any extra setup for fencing device ? > > thanks > > > > > > > > > ______________________________________________________________________ > > From: ESGLinux < esggrupos@xxxxxxxxx > > > To: linux clustering < linux-cluster@xxxxxxxxxx > > > Sent: Wed, September 8, 2010 2:57:25 PM > > Subject: Re: need help - Fencing problem > > > > Hello, > > > > > > Have you configured the iLO devices entering in the BIOS? > > > > > > I remenber I have to set up the user/pass in the iLO and marked the > > iLo as not shared > > > > > > > > > > HTH, > > > > > > ESG > > > > 2010/9/8 Girish Prajapati < girishpati@xxxxxxxxx > > > Hello Everybody, > > i am having problem of fencing a cluster node let me explain > > indetail : > > I have installed RHEL 5.4 on HP Prolaint DL280 G5 servers and > > iLO 2as fencing device. Am managing cluster through Luci - > > (Conga). itseems everything is working fine. I can reboot > > cluster nodes through Luci and service get transfer to another > > node. After rebooting node connect to cluster automatically > > without any error. > > Problem is i can not do Fence this node through Luci, when i > > try to fence any node i get following error : > > > > Sep 8 14:51:16 node2 fence_node[9106]: agent "fence_ilo" > > reports: Unable to connect/login to fencing device > > Sep 8 14:51:16 node2 fence_node[9106]: Fence of > > " node1.drctmb.com " was unsuccessful > > > > my iLO license is : iLO 2 Advanced Evaluation > > Do i need to have license of iLO or there is problem in > > configuration of cluster ? > > how i can check cluster log in details. > > > > Appreciate your help. > > Thank you in advance. > > > > Regards, > > Girishkumar R Prajapati > > > > > > > > -- > > Linux-cluster mailing list > > Linux-cluster@xxxxxxxxxx > > https://www.redhat.com/mailman/listinfo/linux-cluster > > > > > > > > -- > > Linux-cluster mailing list > > Linux-cluster@xxxxxxxxxx > > https://www.redhat.com/mailman/listinfo/linux-cluster > > > -- > Linux-cluster mailing list > Linux-cluster@xxxxxxxxxx > https://www.redhat.com/mailman/listinfo/linux-cluster > > > > -- > Linux-cluster mailing list > Linux-cluster@xxxxxxxxxx > https://www.redhat.com/mailman/listinfo/linux-cluster > -- > Linux-cluster mailing list > Linux-cluster@xxxxxxxxxx > https://www.redhat.com/mailman/listinfo/linux-cluster ------------------------------ -- Linux-cluster mailing list Linux-cluster@xxxxxxxxxx https://www.redhat.com/mailman/listinfo/linux-cluster End of Linux-cluster Digest, Vol 77, Issue 5 ******************************************** Notice: This e-mail message, together with any attachments, contains information of Merck & Co., Inc. (One Merck Drive, Whitehouse Station, New Jersey, USA 08889), and/or its affiliates Direct contact information for affiliates is available at http://www.merck.com/contact/contacts.html) that may be confidential, proprietary copyrighted and/or legally privileged. It is intended solely for the use of the individual or entity named on this message. If you are not the intended recipient, and have received this message in error, please notify us immediately by reply e-mail and then delete it from your system. -- Linux-cluster mailing list Linux-cluster@xxxxxxxxxx https://www.redhat.com/mailman/listinfo/linux-cluster