On Wed, Dec 21, 2016 at 08:36:59AM +0100, Michal Hocko wrote:
> TL;DR
> there is another version of the debugging patch. Just revert the
> previous one and apply this one instead. It's still not clear what
> is going on but I suspect either some misaccounting or unexpected
> pages on the LRU lists. I have added one more tracepoint, so please
> enable also mm_vmscan_inactive_list_is_low.

Right, I did just that and can provide a new log. This time I was also
able to reproduce the OOM issues again, and not just the "page
allocation stalls" that were the only thing visible in the previous log.

However, the log comes from machine #2 again, as I'm unfortunately
forced to try this via VPN from work to home today, so I have exactly
one attempt per machine before it goes down and locks up (and I can
only restart it later tonight). Machine #1 failed to produce
good-looking results during its one attempt, but what machine #2
produced seems to be exactly what we've been trying to track down, so
its log is now up at:

http://ftp.tisys.org/pub/misc/boerne_2016-12-22.log.xz

Greetings
Nils