Re: [PATCH bpf-next v4 04/13] bpf: Tag argument to be released in bpf_func_proto

Joanne Koong <joannelkoong@xxxxxxxxx> · Wed, 13 Apr 2022 11:39:32 -0700



On Wed, Apr 13, 2022 at 11:33 AM Joanne Koong <joannelkoong@xxxxxxxxx> wrote:
>
> On Tue, Apr 12, 2022 at 1:11 PM Kumar Kartikeya Dwivedi
> <memxor@xxxxxxxxx> wrote:
> >
> > On Tue, Apr 12, 2022 at 11:46:14PM IST, Joanne Koong wrote:
> > > On Sun, Apr 10, 2022 at 11:58 PM Kumar Kartikeya Dwivedi
> > > <memxor@xxxxxxxxx> wrote:
> > > >
> > > > Add a new type flag for bpf_arg_type that when set tells verifier that
> > > > for a release function, that argument's register will be the one for
> > > > which meta.ref_obj_id will be set, and which will then be released
> > > > using release_reference. To capture the regno, introduce a new field
> > > > release_regno in bpf_call_arg_meta.
> > > >
> > > > This would be required in the next patch, where we may either pass NULL
> > > > or a refcounted pointer as an argument to the release function
> > > > bpf_kptr_xchg. Just releasing only when meta.ref_obj_id is set is not
> > > > enough, as there is a case where the type of argument needed matches,
> > > > but the ref_obj_id is set to 0. Hence, we must enforce that whenever
> > > > meta.ref_obj_id is zero, the register that is to be released can only
> > > > be NULL for a release function.
> > > >
> > > > Signed-off-by: Kumar Kartikeya Dwivedi <memxor@xxxxxxxxx>
> > > > ---
> > > >  include/linux/bpf.h   |  5 ++++-
> > > >  kernel/bpf/ringbuf.c  |  4 ++--
> > > >  kernel/bpf/verifier.c | 46 ++++++++++++++++++++++++++++++++++++-------
> > > >  net/core/filter.c     |  2 +-
> > > >  4 files changed, 46 insertions(+), 11 deletions(-)
> > > >
> > > > diff --git a/include/linux/bpf.h b/include/linux/bpf.h
> > > > index e267db260cb7..a6d1982e8118 100644
> > > > --- a/include/linux/bpf.h
> > > > +++ b/include/linux/bpf.h
> > > > @@ -364,7 +364,10 @@ enum bpf_type_flag {
> > > >          */
> > > >         MEM_PERCPU              = BIT(4 + BPF_BASE_TYPE_BITS),
> > > >
> > > > -       __BPF_TYPE_LAST_FLAG    = MEM_PERCPU,
> > > > +       /* Indicates that the pointer argument will be released. */
> > > > +       PTR_RELEASE             = BIT(5 + BPF_BASE_TYPE_BITS),
> > > > +
> > > > +       __BPF_TYPE_LAST_FLAG    = PTR_RELEASE,
> > > >  };
> > > >
> > > >  /* Max number of base types. */
> > > > diff --git a/kernel/bpf/ringbuf.c b/kernel/bpf/ringbuf.c
> > > > index 710ba9de12ce..a22c21c0a7ef 100644
> > > > --- a/kernel/bpf/ringbuf.c
> > > > +++ b/kernel/bpf/ringbuf.c
> > > > @@ -404,7 +404,7 @@ BPF_CALL_2(bpf_ringbuf_submit, void *, sample, u64, flags)
> > > >  const struct bpf_func_proto bpf_ringbuf_submit_proto = {
> > > >         .func           = bpf_ringbuf_submit,
> > > >         .ret_type       = RET_VOID,
> > > > -       .arg1_type      = ARG_PTR_TO_ALLOC_MEM,
> > > > +       .arg1_type      = ARG_PTR_TO_ALLOC_MEM | PTR_RELEASE,
> > > >         .arg2_type      = ARG_ANYTHING,
> > > >  };
> > > >
> > > > @@ -417,7 +417,7 @@ BPF_CALL_2(bpf_ringbuf_discard, void *, sample, u64, flags)
> > > >  const struct bpf_func_proto bpf_ringbuf_discard_proto = {
> > > >         .func           = bpf_ringbuf_discard,
> > > >         .ret_type       = RET_VOID,
> > > > -       .arg1_type      = ARG_PTR_TO_ALLOC_MEM,
> > > > +       .arg1_type      = ARG_PTR_TO_ALLOC_MEM | PTR_RELEASE,
> > > >         .arg2_type      = ARG_ANYTHING,
> > > >  };
> > > >
> > > > diff --git a/kernel/bpf/verifier.c b/kernel/bpf/verifier.c
> > > > index 01d45c5010f9..6cc08526e049 100644
> > > > --- a/kernel/bpf/verifier.c
> > > > +++ b/kernel/bpf/verifier.c
> > > > @@ -245,6 +245,7 @@ struct bpf_call_arg_meta {
> > > >         struct bpf_map *map_ptr;
> > > >         bool raw_mode;
> > > >         bool pkt_access;
> > > > +       u8 release_regno;
> > > >         int regno;
> > > >         int access_size;
> > > >         int mem_size;
> > > > @@ -5300,6 +5301,11 @@ static bool arg_type_is_int_ptr(enum bpf_arg_type type)
> > > >                type == ARG_PTR_TO_LONG;
> > > >  }
> > > >
> > > > +static bool arg_type_is_release_ptr(enum bpf_arg_type type)
> > > > +{
> > > > +       return type & PTR_RELEASE;
> > > > +}
> > > > +
> > > Now that we have PTR_RELEASE as a bpf arg type descriptor, why do we
> > > still need is_release_function() in the verifier? I think we should
> > > just remove is_release_function() altogether - is_release_function()
> > > isn't functionally necessary now that we have PTR_RELEASE, and I don't
> > > think it's great that is_release_function() hardcodes specific
> > > functions into the verifier. What are your thoughts?
> >
> > We need it to (atleast) guard the meta.ref_obj_id release, otherwise you have to
> > check for PTR_RELEASE in all arguments to determine it is a release function.
> > I guess we could record whether function is release function in meta, then
> > looping over arguments won't be needed each time (probably best to do in
> > check_release_regno, and set it there).
> >
> I elaborated a bit more on this in my next comment, but I think we
> should just get rid of is_release_function() and use
> meta.release_regno to track in check_func_arg() if the function is a
> release function.
> > >
> > > >  static int int_ptr_type_to_size(enum bpf_arg_type type)
> > > >  {
> > > >         if (type == ARG_PTR_TO_INT)
> > > > @@ -5532,7 +5538,7 @@ int check_func_arg_reg_off(struct bpf_verifier_env *env,
> > > >                 /* Some of the argument types nevertheless require a
> > > >                  * zero register offset.
> > > >                  */
> > > > -               if (arg_type != ARG_PTR_TO_ALLOC_MEM)
> > > > +               if (base_type(arg_type) != ARG_PTR_TO_ALLOC_MEM)
> > > >                         return 0;
> > > >                 break;
> > > >         /* All the rest must be rejected, except PTR_TO_BTF_ID which allows
> > >
> > > Later on in this check_func_arg_reg_off() function, I think we can get
> > > rid of the hacky workaround for the PTR_TO_BTF_ID case where it relies
> > > on whether the function is a release function and reg->ref_obj_id is
> > > set, to determine whether the argument is a release arg or not. The
> > > arg type is passed directly to check_func_arg_reg_off(), so I think we
> > > could just use arg_type_is_release_ptr(arg_type) instead, which will
> > > also be more robust when/if we support having multiple release args in
> > > the future.
> >
> > Ok, sounds good.
> >
> > >
> > > > @@ -6124,12 +6130,31 @@ static bool check_btf_id_ok(const struct bpf_func_proto *fn)
> > > >         return true;
> > > >  }
> > > >
> > > > -static int check_func_proto(const struct bpf_func_proto *fn, int func_id)
> > > > +static bool check_release_regno(const struct bpf_func_proto *fn, int func_id,
> > > > +                               struct bpf_call_arg_meta *meta)
> > > > +{
> > > > +       int i;
> > > > +
> > > > +       for (i = 0; i < ARRAY_SIZE(fn->arg_type); i++) {
> > > > +               if (arg_type_is_release_ptr(fn->arg_type[i])) {
> > > > +                       if (!is_release_function(func_id))
> > > > +                               return false;
> > > > +                       if (meta->release_regno)
> > > > +                               return false;
> > > > +                       meta->release_regno = i + 1;
> > > > +               }
> > > > +       }
> > > > +       return !is_release_function(func_id) || meta->release_regno;
> > > > +}
> > > Is this check needed? There's already a check in check_func_arg that
> > > there can't be two arg registers with ref_obj_ids set. I think this
> > > already checks against the case where the user tries to pass in two
> > > release registers as arguments.
> >
> > This is different, this is about preventing the case where some func_id is
> > listed as release function, but none of its arguments were tagged as
> > PTR_RELEASE. It also doubles as a way to record the regno being released,
> > since we need to loop anyway.
> Why do we need to prevent the case where a release kernel helper
> function doesn't have any of its arguments tagged as PTR_RELEASE or
> conversely, that a non-release helper function has one of its
> arguments tagged with PTR_RELEASE? That would be a bug in the kernel
> then. I think we can just assume that this will never be the case.
>
> Given that, I'm in favor of just removing check_release_regno()
> altogether, and doing the meta->release_regno assignment + check for
> multiple PTR_MEM args in check_func_arg() right after the
> skip_type_check: goto. We already do the assignment + multiple
> instances check there for meta->ref_obj_id. That to me looks like the
> cleanest approach.
> >
> > If we are removing is_release_function, we can just make sure PTR_RELEASE is
> > only seen once, and consider such functions as release functions (and set
> > meta.release_function to true).
> I don't think you even need meta->release_function, since you already
> have meta->release_regno, no? You can just check whether
> meta->release_regno is non-zero.
> >
> > > > +
> > > > +static int check_func_proto(const struct bpf_func_proto *fn, int func_id,
> > > > +                           struct bpf_call_arg_meta *meta)
> > > >  {
> > > >         return check_raw_mode_ok(fn) &&
> > > >                check_arg_pair_ok(fn) &&
> > > >                check_btf_id_ok(fn) &&
> > > > -              check_refcount_ok(fn, func_id) ? 0 : -EINVAL;
> > > > +              check_refcount_ok(fn, func_id) &&
> > > > +              check_release_regno(fn, func_id, meta) ? 0 : -EINVAL;
> > > >  }
> > > >
> > > >  /* Packet data might have moved, any old PTR_TO_PACKET[_META,_END]
> > > > @@ -6808,7 +6833,7 @@ static int check_helper_call(struct bpf_verifier_env *env, struct bpf_insn *insn
> > > >         memset(&meta, 0, sizeof(meta));
> > > >         meta.pkt_access = fn->pkt_access;
> > > >
> > > > -       err = check_func_proto(fn, func_id);
> > > > +       err = check_func_proto(fn, func_id, &meta);
> > > >         if (err) {
> > > >                 verbose(env, "kernel subsystem misconfigured func %s#%d\n",
> > > >                         func_id_name(func_id), func_id);
> > > > @@ -6841,8 +6866,17 @@ static int check_helper_call(struct bpf_verifier_env *env, struct bpf_insn *insn
> > > >                         return err;
> > > >         }
> > > >
> > > > +       regs = cur_regs(env);
> > > > +
> > > >         if (is_release_function(func_id)) {
> > > > -               err = release_reference(env, meta.ref_obj_id);
> > > > +               err = -EINVAL;
> > > > +               if (meta.ref_obj_id)
> > > > +                       err = release_reference(env, meta.ref_obj_id);
> > > > +               /* meta.ref_obj_id can only be 0 if register that is meant to be
> > > > +                * released is NULL, which must be > R0.
> > > > +                */
> > > > +               else if (meta.release_regno && register_is_null(&regs[meta.release_regno]))
> > > > +                       err = 0;
> > > >                 if (err) {
> > > >                         verbose(env, "func %s#%d reference has not been acquired before\n",
> > > >                                 func_id_name(func_id), func_id);
>
> Also, I forgot to mention this earlier, but I think we also need to
> check here that meta.release_regno == meta.ref_obj_id; otherwise there
> could be the case where if a helper function takes in at least two
> parameters one of which is PTR_RELEASE, the program could pass in
> something with no ref obj id as the PTR_RELEASE arg, and a ref obj id
> arg as one of the other args.
Not meta.release_regno == meta.ref_obj_id, but some way of checking
that the release arg is the one that has the ref obj id set. I think
the easiest way to do this is to just check that the reg also has a
valid ref obj id when we do the meta.release_regno assignment.
>
> > > > @@ -6850,8 +6884,6 @@ static int check_helper_call(struct bpf_verifier_env *env, struct bpf_insn *insn
> > > >                 }
> > > >         }
> > > >
> > > > -       regs = cur_regs(env);
> > > > -
> > > >         switch (func_id) {
> > > >         case BPF_FUNC_tail_call:
> > > >                 err = check_reference_leak(env);
> > > > diff --git a/net/core/filter.c b/net/core/filter.c
> > > > index 143f442a9505..8eb01a997476 100644
> > > > --- a/net/core/filter.c
> > > > +++ b/net/core/filter.c
> > > > @@ -6621,7 +6621,7 @@ static const struct bpf_func_proto bpf_sk_release_proto = {
> > > >         .func           = bpf_sk_release,
> > > >         .gpl_only       = false,
> > > >         .ret_type       = RET_INTEGER,
> > > > -       .arg1_type      = ARG_PTR_TO_BTF_ID_SOCK_COMMON,
> > > > +       .arg1_type      = ARG_PTR_TO_BTF_ID_SOCK_COMMON | PTR_RELEASE,
> > > >  };
> > > >
> > > >  BPF_CALL_5(bpf_xdp_sk_lookup_udp, struct xdp_buff *, ctx,
> > > > --
> > > > 2.35.1
> > > >
> >
> > --
> > Kartikeya