On Wed, Mar 09, 2005 at 05:26:34PM -0800, Daniel McNeil wrote: > I upgraded to 2.6.11 and the latest cvs a few days ago. > I started my tests on Mar 7 16:01 and they hung on Mar 9 12:34. > This is a 3 node cluster, but the test that hung only has 1 > node with gfs mounted and it is trying to unmount: > > root 12500 12494 0 12:34 ? 00:00:01 umount /gfs_stripe5 > > $ cat /proc/12500/wchan > .text.lock.ast > > dlm_astd is spinning as top shows: > > 12302 root 20 -5 0 0 0 R 99.9 0.0 280:28.23 dlm_astd I've found and fixed a missing lock around ast_queue. If that list was corrupted it could explain this problem which looks like infinite looping through that list. -- Dave Teigland <teigland@xxxxxxxxxx>