On Tue, Feb 23, 2016 at 05:13:35PM +0100, Jean-Tiare Le Bigot wrote: > Hi, > > We've hit kernel hang related to XFS reclaim under heavy I/O load on a > couple of storage servers using XFS over flashcache over a 3.13.y kernel. > > On the crash dumps, kthreadd is blocked, waiting for XFS to reclaim some > memory but the related reclaim job is queued on a worker_pool stuck > waiting for some I/O, itself depending on other jobs on other queues > which would require additional threads to go forward. Unfortunately > kthreadd is blocked. > The host has plenty of memory (~128GB), about 80% of which being used > for the page cache. > > It looks like this is fixed by commit > 7a29ac474a47eb8cf212b45917683ae89d6fa13b. That commit fixed a regression introduced, IIRC, in 3.19. The problem it fixed didn't exist before then, so I doubt you are seeing the problem that the above commit fixed. Perhaps you'd like to describe your problem along with the stack traces, etc so we have some idea of what you are trying to to fix? Cheers, Dave. -- Dave Chinner david@xxxxxxxxxxxxx _______________________________________________ xfs mailing list xfs@xxxxxxxxxxx http://oss.sgi.com/mailman/listinfo/xfs