While this explains the situation somewhat, I was trying to bring a bit more clarity to the problem (without examining th esource). What exactly is happening when a 'initiates transition'? Shawn On Thu, Feb 21, 2008 at 3:30 AM, Christine Caulfield <ccaulfie@xxxxxxxxxx> wrote: > > Shawn Hood wrote: > > Though one instance of 'Initating transition' message seems to be > > normal , what could the behavior shown in the following log indicate? > > What exactly is happening during an 'Initating transition' message? > > > > Shawn > > > > Feb 14 15:25:55 odin kernel: CMAN: Initiating transition, generation 7 > > Feb 14 15:26:01 odin kernel: CMAN: removing node munin from the > > cluster : No response to messages > > Feb 14 15:26:01 odin kernel: CMAN: Initiating transition, generation 8 > > Feb 14 15:26:16 odin kernel: CMAN: Initiating transition, generation 9 > > Feb 14 15:26:31 odin kernel: CMAN: Initiating transition, generation 10 > > Feb 14 15:26:40 odin su(pam_unix)[20082]: session opened for user root > > by shood(uid=0) > > Feb 14 15:26:46 odin kernel: CMAN: Initiating transition, generation 11 > > Feb 14 15:27:01 odin kernel: CMAN: Initiating transition, generation 12 > > Feb 14 15:27:16 odin kernel: CMAN: Initiating transition, generation 13 > > Feb 14 15:27:31 odin kernel: CMAN: Initiating transition, generation 14 > > Feb 14 15:27:46 odin kernel: CMAN: Initiating transition, generation 15 > > Feb 14 15:28:01 odin kernel: CMAN: Initiating transition, generation 16 > > Feb 14 15:28:16 odin kernel: CMAN: Initiating transition, generation 17 > > Feb 14 15:28:31 odin kernel: CMAN: Initiating transition, generation 18 > > Feb 14 15:28:46 odin kernel: CMAN: too many transition restarts - will die > > Feb 14 15:28:46 odin kernel: CMAN: we are leaving the cluster. > > Inconsistent cluster view > > Feb 14 15:28:46 odin kernel: WARNING: dlm_emergency_shutdown > > Feb 14 15:28:46 odin kernel: WARNING: dlm_emergency_shutdown > > Feb 14 15:28:46 odin kernel: SM: 00000002 sm_stop: SG still joined > > Feb 14 15:28:46 odin kernel: SM: 01000004 sm_stop: SG still joined > > Feb 14 15:28:46 odin kernel: SM: 02000014 sm_stop: SG still joined > > Feb 14 15:28:46 odin ccsd[17392]: Cluster manager shutdown. Attemping > > > The usual cause of all those messages (not that it's usual!) is network > problems. Often a one-way connection can cause it, eg the node can send > messages but not receive them. There are pathological iptables rules > that can make that happen too. > > It's hard to be specific without knowing more, but I would investigate > the network connections, routers/switches and routing/iptables rules > > Chrissie > > -- > Linux-cluster mailing list > Linux-cluster@xxxxxxxxxx > https://www.redhat.com/mailman/listinfo/linux-cluster > -- Shawn Hood (910) 670-1819 Mobile -- Linux-cluster mailing list Linux-cluster@xxxxxxxxxx https://www.redhat.com/mailman/listinfo/linux-cluster