Hello,
i can run following command successfully from another node but still getting same error message :
fence_ilo -a "Ilo IP" -l "Ilo user" -p "Ilo passwd" -o reboot
Sep 9 14:37:00 node2 openais[2904]: [CLM ] Members Joined:
Sep 9 14:37:00 node2 openais[2904]: [SYNC ] This node is within the primary component and will provide service.
Sep 9 14:37:00 node2 openais[2904]: [TOTEM] entering OPERATIONAL state.
Sep 9 14:37:00 node2 openais[2904]: [CLM ] got nodejoin message 192.168.0.28
Sep 9 14:37:00 node2 openais[2904]: [CPG ] got joinlist message from node 1
Sep 9 14:37:00 node2 fenced[2923]: node1.drctmb.com not a cluster member after 0 sec post_fail_delay
Sep 9 14:37:00 node2 fenced[2923]: fencing node "node1.drctmb.com"
Sep 9 14:37:10 node2 fenced[2923]: agent "fence_ilo" reports: Unable to connect/login to fencing device
Sep 9 14:37:10 node2 fenced[2923]: fence "node1.drctmb.com" failed
Sep 9 14:37:15 node2 fenced[2923]: fencing node "node1.drctmb.com"
Sep 9 14:37:26 node2 fenced[2923]: agent "fence_ilo" reports: Unable to connect/login to fencing device
node1 rebooted and get connect to the cluster but now my webby service not working see below log :
Broadcast message from root (Thu Sep 9 14:32:41 2010):
The system is going down for system halt NOW!
Sep 9 14:19:22 node1 last message repeated 17 times
Sep 9 14:32:41 node1 shutdown[25506]: shutting down for system halt
Sep 9 14:32:41 node1 pcscd: winscard.c:304:SCardConnect() Reader E-Gate 0 0 Not Found
Sep 9 14:32:43 node1 modclusterd: shutdown succeeded
Sep 9 14:32:43 node1 rgmanager: [25593]: <notice> Shutting down Cluster Service Manager...
Sep 9 14:32:43 node1 clurgmgrd[3457]: <notice> Shutting down
Sep 9 14:32:43 node1 clurgmgrd[3457]: <notice> Shutting down
Sep 9 14:32:43 node1 clurgmgrd[3457]: <notice> Stopping service service:webby
Sep 9 14:32:44 node1 avahi-daemon[3378]: Withdrawing address record for 192.168.0.30 on eth0.
Read from remote host node1: Connection reset by peer
.
.
.
Sep 9 14:35:42 node1 smartd[3585]: Device: /dev/hda, packet devices [this device CD/DVD] not SMART capable
Sep 9 14:35:42 node1 smartd[3585]: Device: /dev/sda, opened
Sep 9 14:35:42 node1 smartd[3585]: Device: /dev/sda, IE (SMART) not enabled, skip device Try 'smartctl -s on /dev/sda' to turn on SMART features
Sep 9 14:35:42 node1 smartd[3585]: Monitoring 0 ATA and 0 SCSI devices
Sep 9 14:35:42 node1 smartd[3604]: smartd has fork()ed into background mode. New PID=3604.
Sep 9 14:35:42 node1 avahi-daemon[3412]: Service "SFTP File Transfer on node1" (/services/sftp-ssh.service) successfully established.
Sep 9 14:35:45 node1 pcscd: winscard.c:304:SCardConnect() Reader E-Gate 0 0 Not Found
Sep 9 14:35:45 node1 last message repeated 3 times
Sep 9 14:35:45 node1 kernel: mtrr: type mismatch for d8000000,2000000 old: uncachable new: write-combining
Sep 9 14:35:46 node1 clurgmgrd: [3491]: <err> Checking Existence Of File /var/run/cluster/apache/apache:httpd.pid [apache:httpd] > Failed - File Doesn't Exist
It seems that there problem in fencing device configuration.
Please find here my cluster.conf :
<?xml version="1.0"?>
<cluster alias="girish" config_version="21" name="girish">
<fence_daemon clean_start="0" post_fail_delay="0" post_join_delay="3"/>
<clusternodes>
<clusternode name="node2.drctmb.com" nodeid="1" votes="1">
<fence>
<method name="1">
<device name="NODE2"/>
</method>
</fence>
</clusternode>
<clusternode name="node1.drctmb.com" nodeid="2" votes="1">
<fence>
<method name="1">
<device name="NODE1"/>
</method>
</fence>
</clusternode>
</clusternodes>
<cman expected_votes="1" two_node="1"/>
<fencedevices>
<fencedevice agent="fence_ilo" hostname="node1.drctmb.com" login="root" name="NODE1" passwd="redhat123"/>
<fencedevice agent="fence_ilo" hostname="node2.drctmb.com" login="root" name="NODE2" passwd="redhat123"/>
</fencedevices>
<rm>
<failoverdomains>
<failoverdomain name="prefer_node1" nofailback="0" ordered="1" restricted="1">
<failoverdomainnode name="node2.drctmb.com" priority="2"/>
<failoverdomainnode name="node1.drctmb.com" priority="1"/>
</failoverdomain>
</failoverdomains>
<resources>
<fs device="/dev/sda1" force_fsck="0" force_unmount="0" fsid="8669" fstype="ext3" mountpoint="/var/www/html" name="docroot" self_fence="0"/>
<ip address="192.168.0.30" monitor_link="1"/>
<apache config_file="conf/httpd.conf" name="httpd" server_root="/etc/httpd" shutdown_wait="5"/>
</resources>
<service autostart="1" domain="prefer_node1" exclusive="0" name="webby" recovery="relocate">
<ip ref="192.168.0.30"/>
<fs ref="docroot"/>
<apache ref="httpd"/>
</service>
</rm>
<fence_xvmd/>
</cluster>
~
This is first time am working on Clustering so please help me.
Appreciate your help.
Thank you.
From: Brem Belguebli <brem.belguebli@xxxxxxxxx>
To: linux clustering <linux-cluster@xxxxxxxxxx>
Sent: Thu, September 9, 2010 11:30:28 AM
Subject: Re: need help - Fencing problem
try run this from another node of the cluster
fence_ilo -a "Ilo IP" -l "Ilo user" -p "Ilo passwd" -o reboot
Additionnally, by connecting thru http to the Ilo, you should be able to
see Ilo logs (in the general tab) and see if it is due to a lack of
licensing
On Wed, 2010-09-08 at 22:29 -0700, Girish Prajapati wrote:
> Hello...
>
> I have already configure BIOS for iLO.. but am not sure why i don need
> to shared ??
> please anybody can help me out for this problem.
> Do i need any extra setup for fencing device ?
> thanks
>
>
>
> ______________________________________________________________________
> From: ESGLinux <esggrupos@xxxxxxxxx>
> To: linux clustering <linux-cluster@xxxxxxxxxx>
> Sent: Wed, September 8, 2010 2:57:25 PM
> Subject: Re: need help - Fencing problem
>
> Hello,
>
>
> Have you configured the iLO devices entering in the BIOS?
>
>
> I remenber I have to set up the user/pass in the iLO and marked the
> iLo as not shared
>
>
>
>
> HTH,
>
>
> ESG
>
> 2010/9/8 Girish Prajapati <girishpati@xxxxxxxxx>
> Hello Everybody,
> i am having problem of fencing a cluster node let me explain
> indetail :
> I have installed RHEL 5.4 on HP Prolaint DL280 G5 servers and
> iLO 2as fencing device. Am managing cluster through Luci -
> (Conga). itseems everything is working fine. I can reboot
> cluster nodes through Luci and service get transfer to another
> node. After rebooting node connect to cluster automatically
> without any error.
> Problem is i can not do Fence this node through Luci, when i
> try to fence any node i get following error :
>
> Sep 8 14:51:16 node2 fence_node[9106]: agent "fence_ilo"
> reports: Unable to connect/login to fencing device
> Sep 8 14:51:16 node2 fence_node[9106]: Fence of
> "node1.drctmb.com" was unsuccessful
>
> my iLO license is : iLO 2 Advanced Evaluation
> Do i need to have license of iLO or there is problem in
> configuration of cluster ?
> how i can check cluster log in details.
>
> Appreciate your help.
> Thank you in advance.
>
> Regards,
> Girishkumar R Prajapati
>
>
>
> --
> Linux-cluster mailing list
> Linux-cluster@xxxxxxxxxx
> https://www.redhat.com/mailman/listinfo/linux-cluster
>
>
>
> --
> Linux-cluster mailing list
> Linux-cluster@xxxxxxxxxx
> https://www.redhat.com/mailman/listinfo/linux-cluster
--
Linux-cluster mailing list
Linux-cluster@xxxxxxxxxx
https://www.redhat.com/mailman/listinfo/linux-cluster
i can run following command successfully from another node but still getting same error message :
fence_ilo -a "Ilo IP" -l "Ilo user" -p "Ilo passwd" -o reboot
Sep 9 14:37:00 node2 openais[2904]: [CLM ] Members Joined:
Sep 9 14:37:00 node2 openais[2904]: [SYNC ] This node is within the primary component and will provide service.
Sep 9 14:37:00 node2 openais[2904]: [TOTEM] entering OPERATIONAL state.
Sep 9 14:37:00 node2 openais[2904]: [CLM ] got nodejoin message 192.168.0.28
Sep 9 14:37:00 node2 openais[2904]: [CPG ] got joinlist message from node 1
Sep 9 14:37:00 node2 fenced[2923]: node1.drctmb.com not a cluster member after 0 sec post_fail_delay
Sep 9 14:37:00 node2 fenced[2923]: fencing node "node1.drctmb.com"
Sep 9 14:37:10 node2 fenced[2923]: agent "fence_ilo" reports: Unable to connect/login to fencing device
Sep 9 14:37:10 node2 fenced[2923]: fence "node1.drctmb.com" failed
Sep 9 14:37:15 node2 fenced[2923]: fencing node "node1.drctmb.com"
Sep 9 14:37:26 node2 fenced[2923]: agent "fence_ilo" reports: Unable to connect/login to fencing device
node1 rebooted and get connect to the cluster but now my webby service not working see below log :
Broadcast message from root (Thu Sep 9 14:32:41 2010):
The system is going down for system halt NOW!
Sep 9 14:19:22 node1 last message repeated 17 times
Sep 9 14:32:41 node1 shutdown[25506]: shutting down for system halt
Sep 9 14:32:41 node1 pcscd: winscard.c:304:SCardConnect() Reader E-Gate 0 0 Not Found
Sep 9 14:32:43 node1 modclusterd: shutdown succeeded
Sep 9 14:32:43 node1 rgmanager: [25593]: <notice> Shutting down Cluster Service Manager...
Sep 9 14:32:43 node1 clurgmgrd[3457]: <notice> Shutting down
Sep 9 14:32:43 node1 clurgmgrd[3457]: <notice> Shutting down
Sep 9 14:32:43 node1 clurgmgrd[3457]: <notice> Stopping service service:webby
Sep 9 14:32:44 node1 avahi-daemon[3378]: Withdrawing address record for 192.168.0.30 on eth0.
Read from remote host node1: Connection reset by peer
.
.
.
Sep 9 14:35:42 node1 smartd[3585]: Device: /dev/hda, packet devices [this device CD/DVD] not SMART capable
Sep 9 14:35:42 node1 smartd[3585]: Device: /dev/sda, opened
Sep 9 14:35:42 node1 smartd[3585]: Device: /dev/sda, IE (SMART) not enabled, skip device Try 'smartctl -s on /dev/sda' to turn on SMART features
Sep 9 14:35:42 node1 smartd[3585]: Monitoring 0 ATA and 0 SCSI devices
Sep 9 14:35:42 node1 smartd[3604]: smartd has fork()ed into background mode. New PID=3604.
Sep 9 14:35:42 node1 avahi-daemon[3412]: Service "SFTP File Transfer on node1" (/services/sftp-ssh.service) successfully established.
Sep 9 14:35:45 node1 pcscd: winscard.c:304:SCardConnect() Reader E-Gate 0 0 Not Found
Sep 9 14:35:45 node1 last message repeated 3 times
Sep 9 14:35:45 node1 kernel: mtrr: type mismatch for d8000000,2000000 old: uncachable new: write-combining
Sep 9 14:35:46 node1 clurgmgrd: [3491]: <err> Checking Existence Of File /var/run/cluster/apache/apache:httpd.pid [apache:httpd] > Failed - File Doesn't Exist
It seems that there problem in fencing device configuration.
Please find here my cluster.conf :
<?xml version="1.0"?>
<cluster alias="girish" config_version="21" name="girish">
<fence_daemon clean_start="0" post_fail_delay="0" post_join_delay="3"/>
<clusternodes>
<clusternode name="node2.drctmb.com" nodeid="1" votes="1">
<fence>
<method name="1">
<device name="NODE2"/>
</method>
</fence>
</clusternode>
<clusternode name="node1.drctmb.com" nodeid="2" votes="1">
<fence>
<method name="1">
<device name="NODE1"/>
</method>
</fence>
</clusternode>
</clusternodes>
<cman expected_votes="1" two_node="1"/>
<fencedevices>
<fencedevice agent="fence_ilo" hostname="node1.drctmb.com" login="root" name="NODE1" passwd="redhat123"/>
<fencedevice agent="fence_ilo" hostname="node2.drctmb.com" login="root" name="NODE2" passwd="redhat123"/>
</fencedevices>
<rm>
<failoverdomains>
<failoverdomain name="prefer_node1" nofailback="0" ordered="1" restricted="1">
<failoverdomainnode name="node2.drctmb.com" priority="2"/>
<failoverdomainnode name="node1.drctmb.com" priority="1"/>
</failoverdomain>
</failoverdomains>
<resources>
<fs device="/dev/sda1" force_fsck="0" force_unmount="0" fsid="8669" fstype="ext3" mountpoint="/var/www/html" name="docroot" self_fence="0"/>
<ip address="192.168.0.30" monitor_link="1"/>
<apache config_file="conf/httpd.conf" name="httpd" server_root="/etc/httpd" shutdown_wait="5"/>
</resources>
<service autostart="1" domain="prefer_node1" exclusive="0" name="webby" recovery="relocate">
<ip ref="192.168.0.30"/>
<fs ref="docroot"/>
<apache ref="httpd"/>
</service>
</rm>
<fence_xvmd/>
</cluster>
~
This is first time am working on Clustering so please help me.
Appreciate your help.
Thank you.
From: Brem Belguebli <brem.belguebli@xxxxxxxxx>
To: linux clustering <linux-cluster@xxxxxxxxxx>
Sent: Thu, September 9, 2010 11:30:28 AM
Subject: Re: need help - Fencing problem
try run this from another node of the cluster
fence_ilo -a "Ilo IP" -l "Ilo user" -p "Ilo passwd" -o reboot
Additionnally, by connecting thru http to the Ilo, you should be able to
see Ilo logs (in the general tab) and see if it is due to a lack of
licensing
On Wed, 2010-09-08 at 22:29 -0700, Girish Prajapati wrote:
> Hello...
>
> I have already configure BIOS for iLO.. but am not sure why i don need
> to shared ??
> please anybody can help me out for this problem.
> Do i need any extra setup for fencing device ?
> thanks
>
>
>
> ______________________________________________________________________
> From: ESGLinux <esggrupos@xxxxxxxxx>
> To: linux clustering <linux-cluster@xxxxxxxxxx>
> Sent: Wed, September 8, 2010 2:57:25 PM
> Subject: Re: need help - Fencing problem
>
> Hello,
>
>
> Have you configured the iLO devices entering in the BIOS?
>
>
> I remenber I have to set up the user/pass in the iLO and marked the
> iLo as not shared
>
>
>
>
> HTH,
>
>
> ESG
>
> 2010/9/8 Girish Prajapati <girishpati@xxxxxxxxx>
> Hello Everybody,
> i am having problem of fencing a cluster node let me explain
> indetail :
> I have installed RHEL 5.4 on HP Prolaint DL280 G5 servers and
> iLO 2as fencing device. Am managing cluster through Luci -
> (Conga). itseems everything is working fine. I can reboot
> cluster nodes through Luci and service get transfer to another
> node. After rebooting node connect to cluster automatically
> without any error.
> Problem is i can not do Fence this node through Luci, when i
> try to fence any node i get following error :
>
> Sep 8 14:51:16 node2 fence_node[9106]: agent "fence_ilo"
> reports: Unable to connect/login to fencing device
> Sep 8 14:51:16 node2 fence_node[9106]: Fence of
> "node1.drctmb.com" was unsuccessful
>
> my iLO license is : iLO 2 Advanced Evaluation
> Do i need to have license of iLO or there is problem in
> configuration of cluster ?
> how i can check cluster log in details.
>
> Appreciate your help.
> Thank you in advance.
>
> Regards,
> Girishkumar R Prajapati
>
>
>
> --
> Linux-cluster mailing list
> Linux-cluster@xxxxxxxxxx
> https://www.redhat.com/mailman/listinfo/linux-cluster
>
>
>
> --
> Linux-cluster mailing list
> Linux-cluster@xxxxxxxxxx
> https://www.redhat.com/mailman/listinfo/linux-cluster
--
Linux-cluster mailing list
Linux-cluster@xxxxxxxxxx
https://www.redhat.com/mailman/listinfo/linux-cluster
-- Linux-cluster mailing list Linux-cluster@xxxxxxxxxx https://www.redhat.com/mailman/listinfo/linux-cluster