Re: [PATCH bpf-next 1/2] bpf: Add kfuncs for read-only string operations

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



On Wed, Oct 2, 2024 at 9:51 PM Viktor Malik <vmalik@xxxxxxxxxx> wrote:
>
> On 10/2/24 18:55, Alexei Starovoitov wrote:
> > On Tue, Oct 1, 2024 at 11:12 PM Viktor Malik <vmalik@xxxxxxxxxx> wrote:
> >>
> >> On 10/1/24 19:40, Andrii Nakryiko wrote:
> >>> On Tue, Oct 1, 2024 at 10:34 AM Alexei Starovoitov
> >>> <alexei.starovoitov@xxxxxxxxx> wrote:
> >>>>
> >>>> On Tue, Oct 1, 2024 at 10:04 AM Andrii Nakryiko
> >>>> <andrii.nakryiko@xxxxxxxxx> wrote:
> >>>>>
> >>>>> On Tue, Oct 1, 2024 at 7:48 AM Alexei Starovoitov
> >>>>> <alexei.starovoitov@xxxxxxxxx> wrote:
> >>>>>>
> >>>>>> On Tue, Oct 1, 2024 at 4:26 AM Eduard Zingerman <eddyz87@xxxxxxxxx> wrote:
> >>>>>>>
> >>>>>>> On Mon, 2024-09-30 at 15:00 -0700, Andrii Nakryiko wrote:
> >>>>>>>
> >>>>>>> [...]
> >>>>>>>
> >>>>>>>> Right now, the only way to pass dynamically sized anything is through
> >>>>>>>> dynptr, AFAIU.
> >>>>>>>
> >>>>>>> But we do have 'is_kfunc_arg_mem_size()' that checks for __sz suffix,
> >>>>>>> e.g. used for bpf_copy_from_user_str():
> >>>>>>>
> >>>>>>> /**
> >>>>>>>  * bpf_copy_from_user_str() - Copy a string from an unsafe user address
> >>>>>>>  * @dst:             Destination address, in kernel space.  This buffer must be
> >>>>>>>  *                   at least @dst__sz bytes long.
> >>>>>>>  * @dst__sz:         Maximum number of bytes to copy, includes the trailing NUL.
> >>>>>>>  * ...
> >>>>>>>  */
> >>>>>>> __bpf_kfunc int bpf_copy_from_user_str(void *dst, u32 dst__sz, const void __user *unsafe_ptr__ign, u64 flags)
> >>>>>>>
> >>>>>>> However, this suffix won't work for strnstr because of the arguments order.
> >>>>>>
> >>>>>> Stating the obvious... we don't need to keep the order exactly the same.
> >>>>>>
> >>>>>> Regarding all of these kfuncs... as Andrii pointed out 'const char *s'
> >>>>>> means that the verifier will check that 's' points to a valid byte.
> >>>>>> I think we can do a hybrid static + dynamic safety scheme here.
> >>>>>> All of the kfunc signatures can stay the same, but we'd have to
> >>>>>> open code all string helpers with __get_kernel_nofault() instead of
> >>>>>> direct memory access.
> >>>>>> Since the first byte is guaranteed to be valid by the verifier
> >>>>>> we only need to make sure that the s+N bytes won't cause page faults
> >>>>>
> >>>>> You mean to just check that s[N-1] can be read? Given a large enough
> >>>>> N, couldn't it be that some page between s[0] and s[N-1] still can be
> >>>>> unmapped, defeating this check?
> >>>>
> >>>> Just checking s[0] and s[N-1] is not enough, obviously, and especially,
> >>>> since the logic won't know where nul byte is, so N is unknown.
> >>>> I meant to that all of str* kfuncs will be reading all bytes
> >>>> via __get_kernel_nofault() until they find \0.
> >>>
> >>> Ah, ok, I see what you mean now.
> >>>
> >>>> It can be optimized to 8 byte access.
> >>>> The open coding (aka copy-paste) is unfortunate, of course.
> >>>
> >>> Yep, this sucks.
> >>
> >> Yeah, that's quite annoying. I really wanted to avoid doing that. Also,
> >> we won't be able to use arch-optimized versions of the functions.
> >>
> >> Just to make sure I understand things correctly - can we do what Eduard
> >> suggested and add explicit sizes for all arguments using the __sz
> >> suffix? So something like:
> >>
> >>     const char *bpf_strnstr(const char *s1, u32 s1__sz, const char *s2, u32 s2__sz);
> >
> > That's ok-ish, but you probably want:
> >
> > const char *bpf_strnstr(void *s1, u32 s1__sz, void *s2, u32 s2__sz);
> >
> > and then to call strnstr() you still need to strnlen(s2, s2__sz).
> >
> > But a more general question... how always passing size will work
> > for bpftrace ? Does it always know the upper bound of storage where
> > strings are stored?
>
> Yes, it does. The strings must be read via the str() call (which
> internally calls bpf_probe_read_str) and there's an upper bound on the
> size of each string.

which sounds like a bpftrace current limitation and not something to
depend on from kfunc design pov.
Wouldn't you want strings to be arbitrary length?

> > I would think __get_kernel_nofault() approach is user friendlier.
>
> That's probably true but isn't there still the problem that strings are
> not necessarily null-terminated? And in such case, unbounded string
> functions may not terminate which is not allowed in BPF?

kfuncs that are searching for nul via loop with __get_kernel_nofault()
would certainly need an upper bound.
Something like PATH_MAX (4k) or XATTR_SIZE_MAX (64k)
would cover 99.99% of use cases.





[Index of Archives]     [Linux Samsung SoC]     [Linux Rockchip SoC]     [Linux Actions SoC]     [Linux for Synopsys ARC Processors]     [Linux NFS]     [Linux NILFS]     [Linux USB Devel]     [Video for Linux]     [Linux Audio Users]     [Yosemite News]     [Linux Kernel]     [Linux SCSI]


  Powered by Linux