RE: Network failure results cluster environment unstable & fragile

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



Hi Javier 
Thanks for your input.
I wanted to know that, Is your issue get resolved using new fence package ?

With regard

Deval K.


-----Original Message-----
From: Pena, Francisco Javier [mailto:francisco_javier.pena@xxxxxxxxx] 
Sent: Monday, February 27, 2006 1:52 PM
To: Deval kulshrestha; linux clustering
Subject: RE:  Network failure results cluster
environmentunstable & fragile

Hi Deval,

If you are using iLO fencing, you could try the latest fence package
(1.32.10). I have seen a similar problem, and it is because recent iLO
firmware versions behave a little different (they try to make a soft
restart instead of a hard reboot). 

At least one of the nodes should get properly killed, and the surviving
one should keep all services.

Hope this helps. Regards,

Javier

> -----Original Message-----
> From: linux-cluster-bounces@xxxxxxxxxx 
> [mailto:linux-cluster-bounces@xxxxxxxxxx] On Behalf Of Deval 
> kulshrestha
> Sent: Saturday, February 25, 2006 6:33 AM
> To: 'linux clustering'
> Subject: RE:  Network failure results cluster 
> environmentunstable & fragile
> 
> 
> Please help me to resolve my problem 
> 
> 
> If network goes off on node1, and service which were not 
> running on node1  are started by node1 with shared storage 
> mount point, which was already  running on node 2 but both of 
> these nodes are not  able to communicate to  each other, 
> node2 anyway already running the same service with shared  
> storage mount point. Because of Fencing both of  these nodes 
> try to kill each other. Both of they got hanged up at 
> "Stoping Cluster manager Services.".In /var/log/messages, it 
> shows fencing s1, fence successful. 
>  
>  If we disable fencing than 
>  
>  If network comes back nodes don't synchronize with each 
> other. Shared  storage mount point is available to both the 
> servers. If they try to access  storage at same storage gives 
> IO errors. Hence this entire setup become very unstable, fragile.
> 
> --- Deval kulshrestha
> <deval.kulshrestha@xxxxxxxxxxxxxxx> wrote:
> 
> > Hi
> > 
> > I am struggling to get some help on following
> > configuration. This setup is
> > intended to put live in a data center for 24 x 7
> > x365, any issue that makes
> > my environment unstable is very critical here.
> > 
> > My HA Cluster Setup details
> > 
> > 1.	HP DL 360 G4p Server                       2nos.
> > 2.	HP MSA 500 G2 (SAN)                     1nos.
> > 3.	RedHat Enterprise Linux 4 ES  
> > 4.	Red Hat Cluster Suite 4
> > 
> > 
> > Server does have a HP SCSI HBA. MSA 500G2 is a scsi
> > based SAN. Both of these
> > server are connected to SAN using SCSI VHDCI cable.
> > I used a network switch
> > to establish network connectivity for the server.
> > created a disk array of
> > three HDD on SAN with two logical volumes than  I
> > have installed RHEL 4
> > Update 1 on both server(Servers are configured with
> > RAID 1) than installed
> > all HP drivers and management agents. After server 
> configuration and 
> > OS installation I have installed Red Hat Cluster Suite
> > v 4 on both the machine.
> > 
> >  
> > 
> > Than I have configured Cluster using Cluster
> > Configuration Manager. Added
> > member hosts, configured fence device and assigned
> > to member host(HP iLO is
> > certified as an fence device), Configured Failover
> > domain with node
> > priority, configured resources such as floating IP
> > address, File System,
> > Script, than configured service which need to be run
> > in HA mode.
> > 
> >  
> > 
> > After configuring this I have tested with various
> > scenario HA is working
> > properly, when ever powered off any machine ,
> > services fail over on
> > available node.
> > 
> > Problem:
> > 
> > 
> > If network goes off on node1, and service which were
> > not running on node1
> > are started by node1 with shared storage mount
> > point, which was already
> > running on node 2 but both of these nodes are not
> > able to communicate to
> > each other, node2 anyway already running the same
> > service with shared
> > storage mount point. Because of Fencing both of
> > these nodes try to kill each
> > other. Both of they got hanged up at "Stoping
> > Cluster manager Services.".In
> > /var/log/messages, it shows fencing s1, fence
> > successful.
> > 
> > If we disable fencing than
> > 
> > If network comes back nodes don't synchronize with
> > each other. Shared
> > storage mount point is available to both the
> > servers. If they try to access
> > storage at same storage gives IO errors. Hence this
> > entire setup become very
> > unstable, fragile.
> > 
> >  
> > 
> >  
> > 
> >  
> > 
> >  
> > 
> >  
> > 
> >  
> > 
> > With Regard
> > 
> > Deval
> > 
> > Progression Infonet Pvt. Ltd.
> > 55, Independent Electronic Modules, 
> > Sector - 18, Electronic City, 
> > Gurgaon - 122015
> > 
> > India
> > Tel          : - 0124 - 2455070, Ext. 215, Fax:
> > 91-124-2398647
> > Mobile   : - 98186 -82509 
> > URL        : - www.progression.com 
> > 
> >  
> > 
> >
> ===========================================================
> > Privileged or confidential information may be
> > contained
> > in this message. If you are not the addressee
> > indicated
> > in this message (or responsible for delivery of the
> > message to such person), please delete this message
> > and
> > kindly notify the sender by an emailed reply.
> > Opinions,
> > conclusions and other information in this message
> > that
> > do not relate to the official business of
> > Progression
> > and its associate entities shall be understood as
> > neither
> > given nor endorsed by them.
> >   
> > 
> >
> -------------------------------------------------------------
> > Progression Infonet Private Limited, Gurgaon
> > (Haryana), India
> > > --
> > 
> > Linux-cluster@xxxxxxxxxx
> >
> https://www.redhat.com/mailman/listinfo/linux-cluster
> 
> 
> __________________________________________________
> Do You Yahoo!?
> Tired of spam?  Yahoo! Mail has the best spam protection around 
> http://mail.yahoo.com 
> 
> 
> 
> ===========================================================
> Privileged or confidential information may be contained
> in this message. If you are not the addressee indicated
> in this message (or responsible for delivery of the 
> message to such person), please delete this message and
> kindly notify the sender by an emailed reply. Opinions, 
> conclusions and other information in this message that do not 
> relate to the official business of Progression and its 
> associate entities shall be understood as neither given nor 
> endorsed by them.
>   
> 
> -------------------------------------------------------------
> Progression Infonet Private Limited, Gurgaon (Haryana), India
> 
> --
> 
> Linux-cluster@xxxxxxxxxx 
> https://www.redhat.com/mailman/listinfo/linux-cluster
> 
> 
> 
> ===========================================================
> Privileged or confidential information may be contained
> in this message. If you are not the addressee indicated
> in this message (or responsible for delivery of the 
> message to such person), please delete this message and
> kindly notify the sender by an emailed reply. Opinions, 
> conclusions and other information in this message that do not 
> relate to the official business of Progression and its 
> associate entities shall be understood as neither given nor 
> endorsed by them.
>   
> 
> -------------------------------------------------------------
> Progression Infonet Private Limited, Gurgaon (Haryana), India
> 
> 
> --
> 
> Linux-cluster@xxxxxxxxxx 
> https://www.redhat.com/mailman/listinfo/linux-cluster
> 



===========================================================
Privileged or confidential information may be contained
in this message. If you are not the addressee indicated
in this message (or responsible for delivery of the 
message to such person), please delete this message and
kindly notify the sender by an emailed reply. Opinions,
conclusions and other information in this message that
do not relate to the official business of Progression
and its associate entities shall be understood as neither
given nor endorsed by them.
  

-------------------------------------------------------------
Progression Infonet Private Limited, Gurgaon (Haryana), India

--

Linux-cluster@xxxxxxxxxx
https://www.redhat.com/mailman/listinfo/linux-cluster

[Index of Archives]     [Corosync Cluster Engine]     [GFS]     [Linux Virtualization]     [Centos Virtualization]     [Centos]     [Linux RAID]     [Fedora Users]     [Fedora SELinux]     [Big List of Linux Books]     [Yosemite Camping]

  Powered by Linux