Hello, On Mon, Mar 10, 2014 at 11:50:53AM -0400, Dave Jones wrote: > On Mon, Mar 10, 2014 at 11:01:06AM -0400, Tejun Heo wrote: > > > > On Sat, Mar 8, 2014 at 2:00 PM, Dave Jones <davej@xxxxxxxxxx> wrote: > > > > I left my fuzzing box running for the weekend, and checked in on it this evening, > > > > to find that none of the child processes were making any progress. > > > > cat'ing /proc/n/stack shows them all stuck in the same place.. > > > > Some examples: > > > > Dave, any chance you can post full sysrq-t dump? > > It's too big to fit in the ring-buffer, so some of it gets lost before > it hits syslog, but hopefully what made it to disk is enough. > http://codemonkey.org.uk/junk/sysrq-t Hmmm... this is puzzling. At least according to the slightly truncated (pids < 13) sysrq-t output, there's no kworker running lru_add_drain_per_cpu() and nothing blocked on lru_add_drain_all::lock can introduce any complex dependency. Also, at least from glancing over, I don't see anything behind lru_add_rain_per_cpu() which can get involved in a complex dependency chain. Assuming that the handful lost traces didn't reveal serious ah-has, it almost looks like workqueue either failed to initiate execution of a queued work item or flush_work() somehow got confused on a work item which already finished, both of which are quite unlikely given that we haven't had any simliar report on any other work items. I think it'd be wise to extend sysrq-t output to include the states of workqueue if for nothing else to easily rule out doubts about basic wq functions. Dave, is this as much information we're gonna get from the trinity instance? I assume trying to reproduce the case isn't likely to work? Thanks. -- tejun -- To unsubscribe, send a message with 'unsubscribe linux-mm' in the body to majordomo@xxxxxxxxx. For more info on Linux MM, see: http://www.linux-mm.org/ . Don't email: <a href=mailto:"dont@xxxxxxxxx"> email@xxxxxxxxx </a>