On Fri, 2005-03-11 at 00:27, David Teigland wrote: > On Wed, Mar 09, 2005 at 05:26:34PM -0800, Daniel McNeil wrote: > > I upgraded to 2.6.11 and the latest cvs a few days ago. > > I started my tests on Mar 7 16:01 and they hung on Mar 9 12:34. > > This is a 3 node cluster, but the test that hung only has 1 > > node with gfs mounted and it is trying to unmount: > > > > root 12500 12494 0 12:34 ? 00:00:01 umount /gfs_stripe5 > > > > $ cat /proc/12500/wchan > > .text.lock.ast > > > > dlm_astd is spinning as top shows: > > > > 12302 root 20 -5 0 0 0 R 99.9 0.0 280:28.23 dlm_astd > > I've found and fixed a missing lock around ast_queue. If that list > was corrupted it could explain this problem which looks like infinite > looping through that list. I upgrade to the latest cvs and I hit the same problem again. umount is hung: root 24099 24093 0 Mar14 ? 00:00:02 umount /gfs_stripe5 and dlm_astd is spinning: 23895 root 20 -5 0 0 0 R 99.9 0.0 1479:34 dlm_astd Any ideas? Is there any debug info that would be useful? Daniel