On Mon, Nov 17, 2008 at 09:32:46AM +0000, Nuno Fernandes wrote: > On Friday 14 November 2008 22:05:15 David Teigland wrote: > > On Fri, Nov 14, 2008 at 09:53:13PM +0000, Nuno Fernandes wrote: > > > > On Fri, Nov 14, 2008 at 10:00:13AM +0000, Nuno Fernandes wrote: > > > > dlm recovery appears to be stuck; this is usually due to a problem at > > > > the network level. The recovery seems to be caused by a node starting > > > > clvmd. > > > > > > Hi, > > > > > > I don't know if it helps, but groupd is using all available CPU, but > > > only in 2 of the nodes. > > > > That sounds like https://bugzilla.redhat.com/show_bug.cgi?id=444529 > > which is fixed in 5.3. I suspect that's the cause of you're problems. > > > > Dave > > Hi, > > Is there anyway i can unstuck the servers without rebooting all the > servers at the same time? Reboot just the nodes where groupd (or dlm_controld or gfs_controld) are running at 100% cpu. Dave -- Linux-cluster mailing list Linux-cluster@xxxxxxxxxx https://www.redhat.com/mailman/listinfo/linux-cluster