On Fri, Jan 28, 2022 at 2:33 PM Mauricio Vásquez <mauricio@xxxxxxxxxx> wrote: > > This commit implements the logic to record the relocation information > for the different kind of relocations. > > btfgen_record_field_relo() uses the target specification to save all the > types that are involved in a field-based CO-RE relocation. In this case > types resolved and added recursively (using btfgen_put_type()). > Only the struct and union members and their types) involved in the > relocation are added to optimize the size of the generated BTF file. > > On the other hand, btfgen_record_type_relo() saves the types involved in > a type-based CO-RE relocation. In this case all the members for the > struct and union types are added. This is not strictly required since > libbpf doesn't use them while performing this kind of relocation, > however that logic could change on the future. Additionally, we expect > that the number of this kind of relocations in an BPF object to be very > low, hence the impact on the size of the generated BTF should be > negligible. > > Finally, btfgen_record_enumval_relo() saves the whole enum type for > enum-based relocations. > > Signed-off-by: Mauricio Vásquez <mauricio@xxxxxxxxxx> > Signed-off-by: Rafael David Tinoco <rafael.tinoco@xxxxxxxxxxx> > Signed-off-by: Lorenzo Fontana <lorenzo.fontana@xxxxxxxxxx> > Signed-off-by: Leonardo Di Donato <leonardo.didonato@xxxxxxxxxx> > --- I've been thinking about this in background. This proliferation of hashmaps to store used types and their members really adds to complexity (and no doubt to memory usage and CPU utilization, even though I don't think either is too big for this use case). What if instead of keeping track of used types and members separately, we initialize the original struct btf and its btf_type, btf_member, btf_enum, etc types. We can carve out one bit in them to mark whether that specific entity was used. That way you don't need any extra hashmap maintenance. You just set or check bit on each type or its member to figure out if it has to be in the resulting BTF. This can be highest bit of name_off or type fields, depending on specific case. This will work well because type IDs never use highest bit and string offset can never be as high as to needing full 32 bits. You'll probably want to have two copies of target BTF for this, of course, but I think simplicity of bookkeeping trumps this inefficiency. WDYT? > tools/bpf/bpftool/gen.c | 260 +++++++++++++++++++++++++++++++++++++++- > 1 file changed, 257 insertions(+), 3 deletions(-) > [...]