> On Aug 29, 2019, at 8:47 AM, Daniel Borkmann <daniel@xxxxxxxxxxxxx> wrote: > >> On 8/29/19 7:12 AM, Alexei Starovoitov wrote: >> [...] >> +/* >> + * CAP_BPF allows the following BPF operations: >> + * - Loading all types of BPF programs >> + * - Creating all types of BPF maps except: >> + * - stackmap that needs CAP_TRACING >> + * - devmap that needs CAP_NET_ADMIN >> + * - cpumap that needs CAP_SYS_ADMIN >> + * - Advanced verifier features >> + * - Indirect variable access >> + * - Bounded loops >> + * - BPF to BPF function calls >> + * - Scalar precision tracking >> + * - Larger complexity limits >> + * - Dead code elimination >> + * - And potentially other features >> + * - Use of pointer-to-integer conversions in BPF programs >> + * - Bypassing of speculation attack hardening measures >> + * - Loading BPF Type Format (BTF) data >> + * - Iterate system wide loaded programs, maps, BTF objects >> + * - Retrieve xlated and JITed code of BPF programs >> + * - Access maps and programs via id >> + * - Use bpf_spin_lock() helper > > This is still very wide. Consider following example: app has CAP_BPF + > CAP_NET_ADMIN. Why can't we in this case *only* allow loading networking > related [plus generic] maps and programs? If it doesn't have CAP_TRACING, > what would be a reason to allow loading it? Same vice versa. There are > some misc program types like the infraread stuff, but they could continue > to live under [CAP_BPF +] CAP_SYS_ADMIN as fallback. I think categorizing > a specific list of prog and map types might be more clear than disallowing > some helpers like below (e.g. why choice of bpf_probe_read() but not > bpf_probe_write_user() etc). Wow, I didn’t notice that bpf_probe_write_user() existed. That should need something like CAP_PTRACE or CAP_SYS_ADMIN. I'm starting to think that something like this: https://lore.kernel.org/bpf/968f3551247a43e1104b198f2e58fb0595d425e7.1565040372.git.luto@xxxxxxxxxx/ should maybe be finished before CAP_BPF happens at all. It really looks like the bpf operations that need privilege need to get fully catalogued and dealt with rather than just coming up with a new capability that covers a huge swath. (bpf_probe_write_user() is also terminally broken on architectures like s390x, but that's not really relevant right now. I'm a bit surprised it works on x86 with SMAP, though.)