On Wed, Feb 16, 2022 at 11:59:54PM +0530, Kumar Kartikeya Dwivedi wrote:
> > > > Just use reg2btf_ids[base_type(reg->type)] instead?
> > >
> > > That would be incorrect I think, then we'd allow e.g. PTR_TO_TCP_SOCK_OR_NULL
> > > and treat it as PTR_TO_TCP_SOCK, while current code only wants to permit
> > > non-NULL variants for these three.
> >
> > add && !type_flag(reg->type) ?
> >
> Ok, we can do that. WRT Hao's suggestion, we do allow NULL for the
> ptr_to_mem_ok case, so doing it for all of check_func_arg_match won't work.

Right.

> > But, first, please describe how you found it.
> > Tried to pass PTR_TO_BTF_ID_OR_NULL into kfunc?
> > Do you have some other changes in the verifier?
>
> Yes, I was working on [0] and tried to come up with a test case where the
> verifier printed the bad register type being passed (one marked with a new
> flag), but noticed that it would fall outside the __BPF_REG_TYPE_MAX limit
> during the kfunc check. Also, it seems that a non-KASAN build sometimes
> doesn't actually crash; it depends on what the value at that offset is.
>
> [0]: https://github.com/kkdwivedi/linux/commits/btf-ptr-in-map

Nice. Thanks a ton. I did a quick look.
Seems to be ready to post on the list?
Can you add .c tests and post?

Only one suggestion so far:
could you get rid of the callback in btf_find_struct_field?
The callbacks obscure the control flow and make the code harder to follow.
Do refactor btf_find_struct_field, but call btf_find_ptr_to_btf_id
directly from it (see the rough sketch at the end of this mail).
Then you wouldn't need this ugly part:

/* While callback cleans up after itself, later iterations can still
 * return error without calling cb, so call free function again.
 */
if (ret < 0)
	bpf_map_free_ptr_off_tab(map);

I've been thinking about the API for refcnted PTR_TO_BTF_ID in the map too,
regarding your issue with cmpxchg. I've come up with two helpers:

BPF_CALL_3(bpf_kptr_try_set, void **, kptr, void *, ptr, int, refcnt_off)
{
	/* ptr is a ptr_to_btf_id returned from bpf_*_lookup() with
	 * ptr->refcnt >= 1. refcount_inc() has to be done before cmpxchg()
	 * because another cpu might do bpf_kptr_xchg + release.
	 */
	refcount_inc((refcount_t *)(ptr + refcnt_off));
	if (cmpxchg(kptr, NULL, ptr)) {
		/* refcnt >= 2 here */
		refcount_dec((refcount_t *)(ptr + refcnt_off));
		return -EBUSY;
	}
	return 0;
}

and

BPF_CALL_2(bpf_kptr_get, void **, kptr, int, refcnt_off)
{
	void *ptr = READ_ONCE(*kptr);

	if (!ptr)
		return 0;
	/* ptr->refcnt could be == 0 if another cpu did
	 * ptr2 = bpf_kptr_xchg();
	 * bpf_*_release(ptr2);
	 */
	if (!refcount_inc_not_zero((refcount_t *)(ptr + refcnt_off)))
		return 0;
	return (long) ptr;
}

and, instead of recognizing the xchg insn directly in the verifier, I went
with a helper:

BPF_CALL_2(bpf_kptr_xchg, void **, kptr, void *, ptr)
{
	/* ptr is a ptr_to_btf_id returned from bpf_*_lookup() with
	 * ptr->refcnt >= 1, or ptr == NULL.
	 * Returns a ptr_to_btf_id with refcnt >= 1, or NULL.
	 */
	return (long) xchg(kptr, ptr);
}

It was easier to deal with.
I don't have a strong preference for helper vs insn though.
Let's brainstorm after you post the patches.

An xchg-only (bpf_kptr_xchg) operation seems to be too limiting.
For example, we might want to remember struct task_struct or struct net
in a map for faster access. __bpf_nf_ct_lookup() is doing
get_net_ns_by_id(), and idr_find() isn't fast enough for XDP. We could
store a refcnted 'struct net *' for faster lookup. Then multiple cpus
will be reading that pointer from the map, but if only 'xchg' is
available, that use case won't work: one cpu will win the race.
So we need bpf_kptr_get(); it will be an acquire helper.
The bpf prog will do an explicit put_net(ptr), which will be a release kfunc.
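To make the intended flow concrete, here is a rough program-side sketch.
Everything below is illustrative only: the netns_map layout, the
program-facing declaration of bpf_kptr_get(), the bpf_put_net() release
kfunc, and the refcnt_off plumbing are made-up stand-ins, not merged API.

#include "vmlinux.h"
#include <bpf/bpf_helpers.h>

/* Hypothetical program-side declarations for the proposed acquire helper
 * and a put_net-style release kfunc; neither exists today.
 */
extern void *bpf_kptr_get(void *kptr, int refcnt_off);
extern void bpf_put_net(struct net *net) __ksym;

struct map_value {
	struct net *net; /* refcnted kptr stored in the map value */
};

struct {
	__uint(type, BPF_MAP_TYPE_ARRAY);
	__uint(max_entries, 1);
	__type(key, __u32);
	__type(value, struct map_value);
} netns_map SEC(".maps");

SEC("xdp")
int use_cached_net(struct xdp_md *ctx)
{
	struct map_value *v;
	struct net *net;
	__u32 key = 0;

	v = bpf_map_lookup_elem(&netns_map, &key);
	if (!v)
		return XDP_PASS;

	/* acquire: refcount_inc_not_zero() under the hood; can return NULL
	 * if another cpu raced with bpf_kptr_xchg() + release.
	 * Assumes the netns refcount lives at ns.count.
	 */
	net = bpf_kptr_get(&v->net, offsetof(struct net, ns.count));
	if (!net)
		return XDP_PASS;

	/* ... fast-path lookups using net instead of get_net_ns_by_id() ... */

	bpf_put_net(net); /* explicit release */
	return XDP_PASS;
}

Whether refcnt_off is spelled out by the program like this or derived by the
verifier from BTF is exactly the kind of API question to brainstorm; I wrote
it explicitly here just to show the refcount flow.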
Simply reading a refcnted ptr_to_btf_id from a map is PTR_UNTRUSTED too
for sleepable bpf progs. Such pointers are under rcu_read_lock in
non-sleepable progs, but it still feels safer to always enforce the
refcnt inc/dec when accessing them to pass into helpers.

> I was planning to send a verifier test exercising this but it seems
> fixup_kfunc_btf_id support for test_verifier.c is not in bpf tree yet, so when

Obviously we have to wait until the merge window.

> it is merged I will provide a small test case, it is basically this on bpf-next:
>
> diff --git a/tools/testing/selftests/bpf/verifier/calls.c b/tools/testing/selftests/bpf/verifier/calls.c
> index 829be2b9e08e..5f26007ceef1 100644
> --- a/tools/testing/selftests/bpf/verifier/calls.c
> +++ b/tools/testing/selftests/bpf/verifier/calls.c
> @@ -1,3 +1,22 @@
> +{
> +	"calls: trigger reg->type > __BPF_REG_TYPE_MAX",
> +	.insns = {
> +	BPF_MOV64_REG(BPF_REG_1, BPF_REG_10),
> +	BPF_ALU64_IMM(BPF_ADD, BPF_REG_1, -8),
> +	BPF_ST_MEM(BPF_DW, BPF_REG_1, 0, 0),
> +	BPF_RAW_INSN(BPF_JMP | BPF_CALL, 0, BPF_PSEUDO_KFUNC_CALL, 0, 0),
> +	BPF_MOV64_REG(BPF_REG_1, BPF_REG_0),
> +	BPF_RAW_INSN(BPF_JMP | BPF_CALL, 0, BPF_PSEUDO_KFUNC_CALL, 0, 0),
> +	BPF_EXIT_INSN(),
> +	},
> +	.prog_type = BPF_PROG_TYPE_SCHED_CLS,
> +	.result = REJECT,
> +	.errstr = "XXX",
> +	.fixup_kfunc_btf_id = {
> +		{ "bpf_kfunc_call_test_acquire", 3 },
> +		{ "bpf_kfunc_call_test_release", 5 },
> +	},
> +},
> {
>
> Sending it right now I think may lead to flaky CI, so we can wait.

Right. Let's wait until bpf tree merges into bpf-next.

> If all of this makes sense, should I respin?

Please do. We can get the bpf PR out asap after that.
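P.S. Re the btf_find_struct_field suggestion above, here is roughly the
shape I have in mind. This is an untested sketch: the signature and
btf_find_ptr_to_btf_id() are stand-ins for whatever your patch actually
defines. The point is only that with a direct call the function can clean
up after itself in one place, so the caller never frees again:

static int btf_find_struct_field(const struct btf *btf,
				 const struct btf_type *t,
				 struct bpf_map *map)
{
	const struct btf_member *member;
	int i, ret;

	for_each_member(i, t, member) {
		const struct btf_type *mt = btf_type_by_id(btf, member->type);

		if (!btf_type_is_ptr(mt))
			continue;
		/* direct call instead of going through a callback */
		ret = btf_find_ptr_to_btf_id(btf, t, member, map);
		if (ret < 0)
			goto err;
	}
	return 0;
err:
	/* single cleanup point; no "call free function again" in the caller */
	bpf_map_free_ptr_off_tab(map);
	return ret;
}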