On Sat, Mar 14, 2015 at 8:25 PM, <jesper@xxxxxxxx> wrote: >> On Sat, Mar 14, 2015 at 8:05 PM, <jesper@xxxxxxxx> wrote: >>> Hi >>> I have a 3.13 (ubuntu LTS) server with 3TB of memory and under certain >>> load >>> conditions it can spiral off to 80+% system load. Per recommendation on >>> IRC >>> yesterday I have captured 2 perf reports (I'm new to perf, so I'm not >>> sure they tell precisely whats needed. >>> >>> Bad situation (high sysload 80%+) > > >> Hi Jesper, please take a look on >> http://marc.info/?l=linux-mm&m=141605213522925&w=2, there is a long >> and unfinished discussion as it seems very problematic to make a >> deterministic reproduction of the bug in our environments. If you can >> observe same lockups with more ease, it`ll help a lot in the issue >> pinning and fixing. > > > Hi Andrey. > > Yes it looks indeed familiar. I can do a fair amount of testing and our > normal production load triggers the problem 6-10 times per day and I'm > willing to garther data to help move forward. What do you suggest is next? > > Jesper > > There is a couple of patches suggested by Vlastimil and others through discussion, not me neither Christian was able to test them properly due to kind of environment where bug primarily live (production envs for both of us). The bare test-env reproducer is a big step forward indeed. Since then bug was reported a couple of times and workarounded (by setting ridiculously large amount of memory for vm.min_free), the larger memory room is (given intensive disk i/o which is able to fill all memory with certain ratio of active/inactive pages I suppose), the easier it is to catch the issue. -- To unsubscribe, send a message with 'unsubscribe linux-mm' in the body to majordomo@xxxxxxxxx. For more info on Linux MM, see: http://www.linux-mm.org/ . Don't email: <a href=mailto:"dont@xxxxxxxxx"> email@xxxxxxxxx </a>