Re: [PATCH] kallsyms: new /proc/kallmodsyms with builtin modules and symbol sizes

Eugene Loh <eugene.loh@xxxxxxxxxx> · Sat, 16 Nov 2019 09:58:04 -0800

On 11/15/2019 09:26 AM, Linus Torvalds wrote:

Is a new config option even needed?

I suppose not.  The original motivations for this option no longer 
matter.  I can amend the patch if that makes sense.

Honestly, I think the "add the module name even when built-in" could
be done unconditionally with no backwards compatibility issues.  It's
not a new syntax, and shouldn't break anything, and looks like a
useful extension of the existing format - and one that existing tools
already have to be aware of.

The size thing is obviously different, but I find that much more
questionable. What's the use-case? If it's just about the occasional
big jumps, then adding a dummy entry for those (rare) cases sounds
like a much better option, and wouldn't break any existing code.

I don't see any upside at all in showing the "exact" function size
instead of a size rounded up to the usual 16 bytes or whatever.
Padding is real, and doesn't change anything.

Since there are very many gaps, adding dummy entries makes sense only 
for "big" jumps.  I don't know where one would want to draw the line for 
"big."  In any case, to identify such gaps, one would still need the "nm 
-S" information provided by this patch.

Meanwhile, there are some symbols that encompass others.  E.g.,
    ffffffff8a001000 1000 T hypercall_page
    ffffffff8a001000 20 t xen_hypercall_set_trap_table
    ffffffff8a001020 20 t xen_hypercall_mmu_update
    [...]
    ffffffff8a0016c0 20 t xen_hypercall_arch_6
    ffffffff8a0016e0 20 t xen_hypercall_arch_7
    ffffffff8a002000 8 T __startup_secondary_64
Something between ffffffff8a001700 and ffffffff8a002000 maps to 
hypercall_page, but you'd never know that from the "up until the next 
address" approach.

Symbols of zero size pose interesting questions.  E.g., do we want these 
zero-size symbols to have nonzero size?
    ffffffff8ac00e00 19 T __x86_indirect_thunk_r15
    ffffffff8ac00e19 0 T _etext
    ffffffff8ac00e19 0 T __indirect_thunk_end
    ffffffff8ac00e1c 0 R __start_notes
    ffffffff8ac00fd8 18 r _note_55
    ffffffff8ac01008 0 R __stop_notes

Size information also helps us distinguish symbol ranges when a 
zero-size symbol has the same address as another symbol.  E.g.,
    ffffffff8a9593e0 0 T __sched_text_start
    ffffffff8a9593e0 6b9 t __schedule
    ffffffff8a959aa0 9a T schedule
If we have an address above ffffffff8a9593e0, we see it belongs to 
__schedule.   Without size information, we might have assigned such an 
address to __sched_text_start.

Or how would one distinguish the symbols at ffffffff8b37d840 without 
size info?
    ffffffff8b37a870 0 R __start___param
    ffffffff8b37a870 28 r __param_initcall_debug
    ffffffff8b37a898 28 r __param_action
    [...]
    ffffffff8b37d7f0 28 r __param_disable_ipv6
    ffffffff8b37d818 28 r __param_disable
    ffffffff8b37d840 0 r __stop___param
    ffffffff8b37d840 0 r __start___modver   [configfs]
    ffffffff8b37d840 8 r __modver_attr      [configfs]

There are also symbols of zero size that mark the end of a segment and 
share the same address as the first symbol in the next segment. This 
means that the "up to the next symbol" algorithm leads to some 
confusion, e.g., overlapping segments.

Anyhow, given that we know real symbol sizes, why not just use them?