On Fri, Oct 7, 2022 at 2:43 PM Arnd Bergmann <arnd@xxxxxxxx> wrote: > > On Fri, Oct 7, 2022, at 9:04 PM, Nick Desaulniers wrote: > > On Fri, Oct 7, 2022 at 1:28 AM Arnd Bergmann <arnd@xxxxxxxx> wrote: > >> On Fri, Oct 7, 2022, at 12:21 AM, Nick Desaulniers wrote: > >> > On Thu, Mar 07, 2019 at 10:01:36AM +0100, Arnd Bergmann wrote: > >> > >> - If I mark 'do_select' as noinline_for_stack, the reported frame > >> size is decreased a lot and is suddenly independent of > >> -fsanitize=local-bounds: > >> fs/select.c:625:5: error: stack frame size (336) exceeds limit (100) in 'core_sys_select' [-Werror,-Wframe-larger-than] > >> int core_sys_select(int n, fd_set __user *inp, fd_set __user *outp, > >> fs/select.c:479:21: error: stack frame size (684) exceeds limit (100) in 'do_select' [-Werror,-Wframe-larger-than] > >> static noinline int do_select(int n, fd_set_bits *fds, struct timespec64 *end_time) > > > > I think this approach makes the most sense to me; the caller > > core_sys_select() has a large stack allocation `stack_fds`, and so > > does the callee do_select with `table`. Add in inlining and long live > > ranges and it makes sense that stack spills are going to tip us over > > the threshold set by -Wframe-larger-than. > > > > Whether you make do_select() `noinline_for_stack` conditional on > > additional configs like CC_IS_CLANG or CONFIG_UBSAN_LOCAL_BOUNDS is > > perhaps also worth considering. > > > > How would you feel about a patch that: > > 1. reverts commit ad312f95d41c ("fs/select: avoid clang stack usage warning") > > 2. marks do_select noinline_for_stack > > > > ? > > That is probably ok, but it does need proper testing to ensure that > there are no performance regressions. Any recommendations on how to do so? > Do you know if gcc inlines the > function by default? If not, we probably don't need to make it > conditional. Ah good idea. For i386 defconfig and x86_64 defconfig, it does not! Here's how I tested that: $ make -j128 defconfig fs/select.o $ llvm-objdump -Dr --disassemble-symbols=core_sys_select fs/select.o | grep do_select This seems to be affected by -fno-conserve-stack, a currently gcc-only command line flag. If I remove that, then i386 defconfig will inline do_select but x86_64 defconfig will not. I have a sneaking suspicion that -fno-conserve-stack and -Wframe-larger-than conspire in GCC to avoid inlining when doing so would trip `-Wframe-larger-than` warnings, but it's just a conspiracy theory; I haven't read the source. Probably should implement exactly that behavior in LLVM. I'll triple check 32b+64b arm configs next week to verify. But if GCC is not inlining do_select into core_sys_select then I think my patch https://lore.kernel.org/llvm/20221007201140.1744961-1-ndesaulniers@xxxxxxxxxx/ is on the right track; probably could drop the 32b-only condition and make a note of GCC in the commit message. Also, my colleague Paul just whipped up a neat tool to help debug -Wframe-larger-than. https://reviews.llvm.org/D135488 See the output from my run here: https://paste.debian.net/1256338/ It's a very early WIP, but I think it would be incredibly helpful to have this, and will probably help us improve Clang's stack usage. -- Thanks, ~Nick Desaulniers