On Tue, 2023-02-28 at 03:55 +0100, KP Singh wrote: > On Mon, Feb 27, 2023 at 9:48 PM Eduard Zingerman <eddyz87@xxxxxxxxx> wrote: > > > > On Mon, 2023-02-27 at 11:31 -0800, Andrii Nakryiko wrote: > > [...] > > > > > I'd start with understanding what BTF and DWARF differences are > > > > > causing the issue before trying to come up with the fix. For that we > > > > > don't even need config or repro steps, it should be enough to share > > > > > vmlinux with BTF and DWARF, and start from there. > > > > > > > > > > > > > Yes, I suspect that there is some kind of unanticipated > > > > anomaly for some DWARF encoding for some kind of objects, > > > > just need to find the root for the diverging type hierarchies. > > > > > > > > > But I'm sure Eduard is on top of this already (especially that he can > > > > > repro the issue now). > > > > > > > > I'm working on it, nothing to report yet, but I'm working on it. > > > > > > > > > > Thanks, please keep us posted! > > > > It is interesting how everything is interconnected. The patch for > > pahole below happens to help. I prepared it last week while working on > > new DWARF encoding scheme for btf_type_tag. > > > > I still need to track down which "unspecified_type" entries caused the > > issue in this particular case. Will post an update tomorrow. > > > > Meanwhile, Matt, KP, could you please verify the patch on your side? > > It is for the "next" branch of pahole. > > > > --- > > > > From 09fac63ca08e25aea499f827283b07cc87a7daab Mon Sep 17 00:00:00 2001 > > From: Eduard Zingerman <eddyz87@xxxxxxxxx> > > Date: Tue, 21 Feb 2023 19:23:00 +0200 > > Subject: [PATCH] dwarf_loader: Fix for BTF id drift caused by adding > > unspecified types > > > > Recent changes to handle unspecified types (see [1]) cause BTF ID drift. > > > > Specifically, the intent of commits [2], [3] and [4] is to render > > references to unspecified types as void type. > > However, as a consequence: > > - in `die__process_unit()` call to `cu__add_tag()` allocates `small_id` > > for unspecified type tags and adds these tags to `cu->types_table`; > > - `btf_encoder__encode_tag()` skips generation of BTF entries for > > `DW_TAG_unspecified_type` tags. > > > > Such logic causes ID drift if unspecified type is not the last type > > processed for compilation unit. `small_id` of each type following > > unspecified type in the `cu->types_table` would have its BTF id off by -1. > > Thus renders references established on recode phase invalid. > > > > This commit reverts `unspecified_type` id/tag tracking, instead: > > - `small_id` for unspecified type tags is set to 0, thus reference to > > unspecified type tag would render BTF id of a `void` on recode phase; > > - unspecified type tags are not added to `cu->types_table`. > > > > [1] https://lore.kernel.org/all/Y0R7uu3s%2FimnvPzM@xxxxxxxxxx/ > > [2] bcc648a10cbc ("btf_encoder: Encode DW_TAG_unspecified_type returning routines as void") > > [3] cffe5e1f75e1 ("core: Record if a CU has a DW_TAG_unspecified_type") > > [4] 75e0fe28bb02 ("core: Add DW_TAG_unspecified_type to tag__is_tag_type() set") > > > > Fixes: bcc648a10cbc ("btf_encoder: Encode DW_TAG_unspecified_type returning routines as void") > > Signed-off-by: Eduard Zingerman <eddyz87@xxxxxxxxx> > > Tested-by: KP Singh <kpsingh@xxxxxxxxxx> > Reported-by: Matt Bobrowski <mattbobrowski@xxxxxxxxxx> > > Thank you so much Eduard, this worked: > > * No duplicate BTF ID warnings > * No 15 minute BTF ID generation > * Matt's reproducer loads successfully. > > I had a sneaky suspicion that it was these unspecified types, which is > why my hacky patch which got unspecified types out of the way got > things to *mostly* work. Hi KP, Thanks a lot for testing! I found the root cause for the bug (took me longer than I would like to admit...). Using the patch below the reproducer from Matt works as expected and warnings are gone. Still, I think that my patch from yesterday is a more general approach, as it correctly handles unspecified types that occur in non-tail position, so I'll post that one. Thanks, Eduard --- >From daa53248e8a5087edbceaffe1fad51f9eb06e922 Mon Sep 17 00:00:00 2001 From: Eduard Zingerman <eddyz87@xxxxxxxxx> Date: Tue, 28 Feb 2023 19:44:22 +0200 Subject: [PATCH] btf_encoder: reset encoder->unspecified_type for each CU The field `encoder->unspecified_type` is set but not reset by function `btf_encoder__encode_cu()` when processed `cu` has unspecified type. The following sequence of events might occur when BTF encoding is requested: - CU with unspecified type is processed: - unspecified type id is 42 - encoder->unspecified_type is set to 42 - CU without unspecified type is processed next using the same `encoder` object: - some `struct foo` has id 42 in this CU - the references to `struct foo` are set 0 by function `btf_encoder__tag_type()`. This commit sets `encoder->unspecified_type` to 0 when CU does not have unspecified type. This issue was reported in thread [1]. See also [2]. [1] https://lore.kernel.org/bpf/Y%2FP1yxAuV6Wj3A0K@xxxxxxxxxx/ [2] https://lore.kernel.org/all/Y0R7uu3s%2FimnvPzM@xxxxxxxxxx/ Fixes: 52b25808e44a ("btf_encoder: Store type_id_off, unspecified type in encoder") Reported-by: Matt Bobrowski <mattbobrowski@xxxxxxxxxx> Signed-off-by: Eduard Zingerman <eddyz87@xxxxxxxxx> --- btf_encoder.c | 2 ++ 1 file changed, 2 insertions(+) diff --git a/btf_encoder.c b/btf_encoder.c index da776f4..24f4c65 100644 --- a/btf_encoder.c +++ b/btf_encoder.c @@ -1748,6 +1748,8 @@ int btf_encoder__encode_cu(struct btf_encoder *encoder, struct cu *cu, struct co encoder->type_id_off = btf__type_cnt(encoder->btf) - 1; if (encoder->cu->unspecified_type.tag) encoder->unspecified_type = encoder->cu->unspecified_type.type; + else + encoder->unspecified_type = 0; if (!encoder->has_index_type) { /* cu__find_base_type_by_name() takes "type_id_t *id" */ -- 2.39.1