Re: Number of arguments in vmalloc.c

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



> On Dec 3, 2018, at 2:49 PM, Matthew Wilcox <willy@xxxxxxxxxxxxx> wrote:
> 
> On Mon, Dec 03, 2018 at 02:04:41PM -0800, Nadav Amit wrote:
>> On Dec 3, 2018, at 8:13 AM, Matthew Wilcox <willy@xxxxxxxxxxxxx> wrote:
>>> On Mon, Dec 03, 2018 at 02:59:36PM +0100, Vlastimil Babka wrote:
>>>> On 11/28/18 3:01 PM, Matthew Wilcox wrote:
>>>>> Some of the functions in vmalloc.c have as many as nine arguments.
>>>>> So I thought I'd have a quick go at bundling the ones that make sense
>>>>> into a struct and pass around a pointer to that struct.  Well, it made
>>>>> the generated code worse,
>>>> 
>>>> Worse in which metric?
>>> 
>>> More instructions to accomplish the same thing.
>>> 
>>>>> so I thought I'd share my attempt so nobody
>>>>> else bothers (or soebody points out that I did something stupid).
>>>> 
>>>> I guess in some of the functions the args parameter could be const?
>>>> Might make some difference.
>>>> 
>>>> Anyway this shouldn't be a fast path, so even if the generated code is
>>>> e.g. somewhat larger, then it still might make sense to reduce the
>>>> insane parameter lists.
>>> 
>>> It might ... I'm not sure it's even easier to program than the original
>>> though.
>> 
>> My intuition is that if all the fields of vm_args were initialized together
>> (in the same function), and a 'const struct vm_args *' was provided as
>> an argument to other functions, code would be better (at least better than
>> what you got right now).
>> 
>> I’m not saying it is easily applicable in this use-case (since I didn’t
>> check).
> 
> Your intuition is wrong ...
> 
>   text	   data	    bss	    dec	    hex	filename
>   9466	     81	     32	   9579	   256b	before.o
>   9546	     81	     32	   9659	   25bb	.build-tiny/mm/vmalloc.o
>   9546	     81	     32	   9659	   25bb	const.o
> 
> indeed, there's no difference between with or without the const, according
> to 'cmp'.
> 
> Now, only alloc_vmap_area() gets to take a const argument.
> __get_vm_area_node() intentionally modifies the arguments.  But feel
> free to play around with this; you might be able to make it do something
> worthwhile.

I was playing with it (a bit). What I suggested (modifying
__get_vm_area_node() so it will not change arguments) helps a bit, but not
much.

One insight that I got is that at least part of the overhead comes from the
the stack protector code that gcc emits.




[Index of Archives]     [Linux ARM Kernel]     [Linux ARM]     [Linux Omap]     [Fedora ARM]     [IETF Annouce]     [Bugtraq]     [Linux OMAP]     [Linux MIPS]     [eCos]     [Asterisk Internet PBX]     [Linux API]

  Powered by Linux