On Wed, 23 Nov 2011, Pekka Enberg wrote:

> 2011/11/23 Christian Kujau <lists@xxxxxxxxxxxxxxx>:
> > OK, with Christoph's patch applied, 3.2.0-rc2-00274-g6fe4c6d-dirty survives
> > on this machine, with the disk & cpu workload that caused the machine to
> > panic w/o the patch. Load was at 4-5 this time, which is expected for this
> > box. I'll run a few more tests later on, but it seems ok for now.
> >
> > I couldn't resist and ran "slabinfo" anyway (after the workload!) - the
> > box survived, nothing was printed in syslog either. Output attached.
>
> Christoph, Eric, would you mind sending me the final patches that
> Christian tested? Maybe CC David too for extra pair of eyes.

I think he only tested the patch that he showed us. Here is the patch
cleaned up. Do you want me to feed you the debug fixes patch by patch as well?


Subject: slub: use irqsafe_cpu_cmpxchg for put_cpu_partial

The cmpxchg must be irq safe. The fallback for this_cpu_cmpxchg only
disables preemption which results in per cpu partial page operation
potentially failing on non x86 platforms.

Signed-off-by: Christoph Lameter <cl@xxxxxxxxx>

---
 mm/slub.c |    2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

Index: linux-2.6/mm/slub.c
===================================================================
--- linux-2.6.orig/mm/slub.c	2011-11-23 09:10:48.000000000 -0600
+++ linux-2.6/mm/slub.c	2011-11-23 09:10:57.000000000 -0600
@@ -1969,7 +1969,7 @@ int put_cpu_partial(struct kmem_cache *s
 		page->pobjects = pobjects;
 		page->next = oldpage;
 
-	} while (this_cpu_cmpxchg(s->cpu_slab->partial, oldpage, page) != oldpage);
+	} while (irqsafe_cpu_cmpxchg(s->cpu_slab->partial, oldpage, page) != oldpage);
 	stat(s, CPU_PARTIAL_FREE);
 	return pobjects;
 }
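For anyone following along who has not looked at the loop the patch touches: the sketch below is a userspace analogy of that compare-and-exchange retry pattern, not the kernel code. It uses C11 atomics in place of the per-cpu cmpxchg primitives, and `fake_page`, `partial_head` and `push_partial` are hypothetical names made up for illustration. It shows only why the loop retries: if the list head changes between reading `oldpage` and publishing `page`, the exchange fails and the fields are recomputed. It does not model interrupts or preemption, which is what the patch itself is about.

```c
#include <assert.h>
#include <stdatomic.h>
#include <stddef.h>

/* Hypothetical stand-in for struct page: only the fields the loop touches. */
struct fake_page {
	struct fake_page *next;
	int pobjects;
};

/* Hypothetical stand-in for s->cpu_slab->partial. */
static _Atomic(struct fake_page *) partial_head;

/*
 * Push 'page' onto the head of the list and return the accumulated
 * object count. If another context changed the head after we read it,
 * the compare-exchange fails, 'oldpage' is refreshed with the current
 * head, and the loop body recomputes the fields before retrying.
 */
static int push_partial(struct fake_page *page, int objects)
{
	struct fake_page *oldpage;

	assert(page != NULL);
	oldpage = atomic_load(&partial_head);
	do {
		page->pobjects = (oldpage ? oldpage->pobjects : 0) + objects;
		page->next = oldpage;
	} while (!atomic_compare_exchange_weak(&partial_head, &oldpage, page));

	return page->pobjects;
}
```

Calling `push_partial()` on two pages in turn leaves the second page at the head with its `next` pointing at the first. In the kernel the exchange has to be atomic with respect to interrupts on the local cpu, which a userspace atomic does not capture.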