On Tue, Sep 8, 2020 at 8:20 AM Stanislav Fomichev <sdf@xxxxxxxxxx> wrote: > > On Mon, Sep 7, 2020 at 1:49 AM Toke Høiland-Jørgensen <toke@xxxxxxxxxx> wrote: > > > > Andrii Nakryiko <andrii.nakryiko@xxxxxxxxx> writes: > > > > >> May be we should talk about problem statement and goals. > > >> Do we actually need metadata per program or metadata per single .o > > >> or metadata per final .o with multiple .o linked together? > > >> What is this metadata? > > > > > > Yep, that's a very valid question. I've also CC'ed Andrey. > > > > For the libxdp use case, I need metadata per program. But I'm already > > sticking that in a single section and disambiguating by struct name > > (just prefixing the function name with a _ ), so I think it's fine to > > have this kind of "concatenated metadata" per elf file and parse out the > > per-program information from that. This is similar to the BTF-encoded > > "metadata" we can do today. > We've come full circle :-) > I think we discussed that approach originally - to stick everything > into existing global .data/.rodata and use some variable prefix for > the metadata. I'm fine with that approach. The only thing I don't That's what we wanted all along, but the problem was with keeping reference to bpf_map from bpf_prog. We eventually gave up and concluded that extra BPF command is necessary. But somewhere along the road we somehow concluded we need an entire new special map/section, and I didn't realize at first (and it seems it wasn't just me) that the latter part is unnecessary. > understand is - why bother with the additional .rodata.metadata > section and merging? > Can we unconditionally do BPF_PROG_BIND_MAP(.rodata) from libbpf (and > ignore the error) and be done? That's exactly what we are proposing, to stick to .rodata, instead of having extra .metadata section. Multiple .rodata/.data sections are orthogonal concerns, which we need to solve as well, because the compiler does emit many of them in some cases. So in that context, once we support multiple .rodata's, it would be possible to have metadata-only "sub-sections". But we don't have to do that, keeping everything simple and put into .rodata works just fine. > > Sticking to the original question: for our use-case, the metadata is > per .o file. I'm not sure how it would work in the 'multiple .o linked > together' use case. Ideally, we'd need to preserve all metadata? Just like in user-space, when you have multiple .c files compiled into .o files and later linked into a final library or binary, all the .data and .rodata sections are combined. That's what will happen with BPF .o files as well. So it will be automatically preserved, as you seem to want. > > > >> If it's just unreferenced by program read only data then no special names or > > >> prefixes are needed. We can introduce BPF_PROG_BIND_MAP to bind any map to any > > >> program and it would be up to tooling to decide the meaning of the data in the > > >> map. For example, bpftool can choose to print all variables from all read only > > >> maps that match "bpf_metadata_" prefix, but it will be bpftool convention only > > >> and not hard coded in libbpf. > > > > > > Agree as well. It feels a bit odd for libbpf to handle ".metadata" > > > specially, given libbpf itself doesn't care about its contents at all. > > > > > > So thanks for bringing this up, I think this is an important > > > discussion to have. > > > > I'm fine with having this be part of .rodata. One drawback, though, is > > that if any metadata is defined, it becomes a bit more complicated to > > use bpf_map__set_initial_value() because that now also has to include > > the metadata. Any way we can improve upon that? > Right. One additional thing we wanted this metadata to have is the > comm of the process who loaded this bpf program (to be filled/added by > libbpf). > I suppose .rodata.metadata section can help with that? .rodata.metadata has nothing to do with this. I'm also not sure whether it's a responsibility of libbpf to provide process's comm as a metadata, to be honest. Next thing it will be user name/user id, then cgroup name, then some other application-level concept and so on. I'd prefer to keep it simple and let applications handle that for themselves. Luckily, using a BPF skeleton this is **extremely** easy.