On Mon, Jun 15, 2015 at 05:52:07PM +0200, Jesper Dangaard Brouer wrote: > From: Christoph Lameter <cl@xxxxxxxxx> > > [NOTICE: Already in AKPM's quilt-queue] > > First piece: acceleration of retrieval of per cpu objects > > If we are allocating lots of objects then it is advantageous to disable > interrupts and avoid the this_cpu_cmpxchg() operation to get these objects > faster. > > Note that we cannot do the fast operation if debugging is enabled, because > we would have to add extra code to do all the debugging checks. And it > would not be fast anyway. > > Note also that the requirement of having interrupts disabled > avoids having to do processor flag operations. > > Allocate as many objects as possible in the fast way and then fall back to > the generic implementation for the rest of the objects. > > Signed-off-by: Christoph Lameter <cl@xxxxxxxxx> > Cc: Jesper Dangaard Brouer <brouer@xxxxxxxxxx> > Cc: Pekka Enberg <penberg@xxxxxxxxxx> > Cc: David Rientjes <rientjes@xxxxxxxxxx> > Cc: Joonsoo Kim <iamjoonsoo.kim@xxxxxxx> > Signed-off-by: Andrew Morton <akpm@xxxxxxxxxxxxxxxxxxxx> > --- > mm/slub.c | 27 ++++++++++++++++++++++++++- > 1 file changed, 26 insertions(+), 1 deletion(-) > > diff --git a/mm/slub.c b/mm/slub.c > index 80f17403e503..d18f8e195ac4 100644 > --- a/mm/slub.c > +++ b/mm/slub.c > @@ -2759,7 +2759,32 @@ EXPORT_SYMBOL(kmem_cache_free_bulk); > bool kmem_cache_alloc_bulk(struct kmem_cache *s, gfp_t flags, size_t size, > void **p) > { > - return kmem_cache_alloc_bulk(s, flags, size, p); > + if (!kmem_cache_debug(s)) { > + struct kmem_cache_cpu *c; > + > + /* Drain objects in the per cpu slab */ > + local_irq_disable(); > + c = this_cpu_ptr(s->cpu_slab); > + > + while (size) { > + void *object = c->freelist; > + > + if (!object) > + break; > + > + c->freelist = get_freepointer(s, object); > + *p++ = object; > + size--; > + > + if (unlikely(flags & __GFP_ZERO)) > + memset(object, 0, s->object_size); > + } > + c->tid = next_tid(c->tid); > + > + local_irq_enable(); > + } > + > + return __kmem_cache_alloc_bulk(s, flags, size, p); > } > EXPORT_SYMBOL(kmem_cache_alloc_bulk); Now I found that we need to call slab_pre_alloc_hook() before any operation on kmem_cache to support kmemcg accounting. And, we need to call slab_post_alloc_hook() on every allocated objects to support many debugging features like as kasan and kmemleak Thanks. -- To unsubscribe, send a message with 'unsubscribe linux-mm' in the body to majordomo@xxxxxxxxx. For more info on Linux MM, see: http://www.linux-mm.org/ . Don't email: <a href=mailto:"dont@xxxxxxxxx"> email@xxxxxxxxx </a>