Re: Problem in clvmd/dlm_recoverd

On 15/11/2008, at 8:35 AM, David Teigland wrote:

On Fri, Nov 14, 2008 at 09:53:13PM +0000, Nuno Fernandes wrote:
On Fri, Nov 14, 2008 at 10:00:13AM +0000, Nuno Fernandes wrote:
dlm recovery appears to be stuck; this is usually due to a problem at the network level. The recovery seems to be caused by a node starting clvmd.
Hi,

I don't know if it helps, but groupd is using all available CPU, though
only on 2 of the nodes.

That sounds like https://bugzilla.redhat.com/show_bug.cgi?id=444529
which is fixed in 5.3.  I suspect that's the cause of your problems.

Dave


We seem to be having the same problem on a 5-node virtual cluster where 3 of the nodes share a GFS mount.

A backup script runs on one node and does heavy reads and writes to this mount. At that point all three nodes jump to 100% CPU (90% iowait on the machine doing the backup, 100% system on the other two), and all LVM VGs, LVs and GFS mounts lock up.

Is there anything that could be tuned here to avoid this issue until a bug fix is released?

Regards,
Tom

--
Linux-cluster mailing list
Linux-cluster@xxxxxxxxxx
https://www.redhat.com/mailman/listinfo/linux-cluster
