Re: [PATCH] mm: use vmalloc fallback path for certain memcg allocations

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



On Fri 02-12-16 09:54:17, Anatoly Stepanov wrote:
> Alex, Vlasimil, Michal, thanks for your responses!
> 
> On Fri, Dec 02, 2016 at 10:19:33AM +0100, Michal Hocko wrote:
> > Thanks for CCing me Vlastimil
> > 
> > On Fri 02-12-16 09:44:23, Vlastimil Babka wrote:
> > > On 12/01/2016 02:16 AM, Anatoly Stepanov wrote:
> > > > As memcg array size can be up to:
> > > > sizeof(struct memcg_cache_array) + kmemcg_id * sizeof(void *);
> > > > 
> > > > where kmemcg_id can be up to MEMCG_CACHES_MAX_SIZE.
> > > > 
> > > > When a memcg instance count is large enough it can lead
> > > > to high order allocations up to order 7.
> > 
> > This is definitely not nice and worth fixing! I am just wondering
> > whether this is something you have encountered in the real life. Having
> > thousands of memcgs sounds quite crazy^Wscary to me. I am not at all
> > sure we are prepared for that and some controllers would have real
> > issues with it AFAIR.
> 
> In our company we use custom-made lightweight container technology, the thing is
> we can have up to several thousands of them on a server.
> So those high-order allocations were observed on a real production workload.

OK, this is interesting. Definitely worth mentioning in the changelog!

[...]
> > 	/*
> > 	 * Do not invoke OOM killer for larger requests as we can fall
> > 	 * back to the vmalloc
> > 	 */
> > 	if (size > PAGE_SIZE)
> > 		gfp_mask |= __GFP_NORETRY | __GFP_NOWARN;
> 
> I think we should check against PAGE_ALLOC_COSTLY_ORDER anyway, as
> there's no big need to allocate large contiguous chunks here, at the
> same time someone in the kernel might really need them.

PAGE_ALLOC_COSTLY_ORDER is and should remain the page allocator internal
implementation detail and shouldn't spread out much outside. GFP_NORETRY
will already make sure we do not push hard here.

> 
> > 
> > 	ret = kzalloc(size, gfp_mask);
> > 	if (ret)
> > 		return ret;
> > 	return vzalloc(size);
> > 
> 
> > I also do not like memcg_alloc helper name. It suggests we are
> > allocating a memcg while it is used for cache arrays and slab LRUS.
> > Anyway this pattern is quite widespread in the kernel so I would simply
> > suggest adding kvmalloc function instead.
> 
> Agreed, it would be nice to have a generic call.
> I would suggest an impl. like this:
> 
> void *kvmalloc(size_t size)

gfp_t gfp_mask should be a parameter as this should be a generic helper.

> {
> 	gfp_t gfp_mask = GFP_KERNEL;


> 	void *ret;
> 
>  	if (size > PAGE_SIZE)
>  		gfp_mask |= __GFP_NORETRY | __GFP_NOWARN;
> 
> 
> 	if (size <= (PAGE_SIZE << PAGE_ALLOC_COSTLY_ORDER)) {
> 		ret = kzalloc(size, gfp_mask);
> 		if (ret)
> 			return ret;
> 	}

No, please just do as suggested above. Tweak the gfp_mask for higher
order requests and do kmalloc first with vmalloc as a  fallback.

-- 
Michal Hocko
SUSE Labs

--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@xxxxxxxxx.  For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: <a href=mailto:"dont@xxxxxxxxx";> email@xxxxxxxxx </a>



[Index of Archives]     [Linux ARM Kernel]     [Linux ARM]     [Linux Omap]     [Fedora ARM]     [IETF Annouce]     [Bugtraq]     [Linux OMAP]     [Linux MIPS]     [eCos]     [Asterisk Internet PBX]     [Linux API]