Re: Encountered error while encoding BTF due to Unsupported DW_TAG_unspecified_type(0x3b)

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



Em Mon, Oct 10, 2022 at 05:08:26PM -0300, Arnaldo Carvalho de Melo escreveu:
> Em Mon, Oct 10, 2022 at 09:06:46AM -0300, Arnaldo Carvalho de Melo escreveu:
> > Em Fri, Oct 07, 2022 at 05:25:21PM -0700, Yonghong Song escreveu:
> > > For function entry_ibpb, actually the kernel has a prorotype,
 
> > > include/asm/nospec-branch.h:extern void entry_ibpb(void);
 
> > > But unfortunately since the function is defined in asm code.
> > > The actual type information is not encoded in dwarf so pahole
> > > does not have enough info from vmlinux dwarf to get func
> > > types. The compiler does not generate func types based on
> > > declarations.
 
> > Right
> 
> > > This may prevent people from tracing functions defined in
> > > asm code but this should be extremely rare.
> > > So I agree that BTF type id 0 is probably the best choice.
> > 
> > Thanks for your comments, that is the way I'll do it.
> 
> Done, some prep patches then the patch below.
> 
> If possible, please ack :-)

Its all now on my 'next' branch at:

git://git.kernel.org/pub/scm/devel/pahole/pahole.git
https://github.com/acmel/dwarves.git

https://git.kernel.org/pub/scm/devel/pahole/pahole.git/log/?h=next
 
> - Arnaldo
> 
> commit 3abc72d9d56ec0cbfffc1794d2bf5d527d1e88ba
> Author: Arnaldo Carvalho de Melo <acme@xxxxxxxxxx>
> Date:   Mon Oct 10 11:20:07 2022 -0300
> 
>     btf_encoder: Encode DW_TAG_unspecified_type returning routines as void
>     
>     Since we don´t have how to encode this info in BTF, and from what we
>     saw, at least in this case:
>     
>     Built binutils from git://sourceware.org/git/binutils-gdb.git, then used
>     gcc's -B option to point to the directory with the new as, that is built
>     as as-new, so make a symlink, ending up with:
>     
>       15e20ce2324a:~/git/linux # readelf -wi ./arch/x86/entry/entry.o
>       Contents of the .debug_info section:
>     
>         Compilation Unit @ offset 0:
>          Length:        0x35 (32-bit)
>          Version:       5
>          Unit Type:     DW_UT_compile (1)
>          Abbrev Offset: 0
>          Pointer Size:  8
>        <0><c>: Abbrev Number: 1 (DW_TAG_compile_unit)
>           <d>   DW_AT_stmt_list   : 0
>           <11>   DW_AT_low_pc      : 0
>           <19>   DW_AT_high_pc     : 19
>           <1a>   DW_AT_name        : (indirect string, offset: 0): arch/x86/entry/entry.S
>           <1e>   DW_AT_comp_dir    : (indirect string, offset: 0x17): /root/git/linux
>           <22>   DW_AT_producer    : (indirect string, offset: 0x27): GNU AS 2.39.50
>           <26>   DW_AT_language    : 32769  (MIPS assembler)
>        <1><28>: Abbrev Number: 2 (DW_TAG_subprogram)
>           <29>   DW_AT_name        : (indirect string, offset: 0x36): entry_ibpb
>           <2d>   DW_AT_external    : 1
>           <2d>   DW_AT_type        : <0x37>
>           <2e>   DW_AT_low_pc      : 0
>           <36>   DW_AT_high_pc     : 19
>        <1><37>: Abbrev Number: 3 (DW_TAG_unspecified_type)
>        <1><38>: Abbrev Number: 0
>     
>     So we have that asm label encoded by GNU AS 2.39.50 as a
>     DW_TAG_subprogram that has as its DW_AT_type the DW_TAG_unspecified_type
>     0x37 that we convert to 0 (void):
>     
>       15e20ce2324a:~/git/linux # pahole -J ./arch/x86/entry/entry.o
>       15e20ce2324a:~/git/linux # pahole -JV ./arch/x86/entry/entry.o
>       btf_encoder__new: 'entry.o' doesn't have '.data..percpu' section
>       Found 0 per-CPU variables!
>       Found 1 functions!
>       File entry.o:
>       [1] FUNC_PROTO (anon) return=0 args=(void)
>       [2] FUNC entry_ibpb type_id=1
>       15e20ce2324a:~/git/linux # pfunct -F btf ./arch/x86/entry/entry.o
>       entry_ibpb
>       15e20ce2324a:~/git/linux # pfunct --proto -F btf ./arch/x86/entry/entry.o
>       void  entry_ibpb(void);
>       15e20ce2324a:~/git/linux #
>     
>       15e20ce2324a:~/git/linux # tools/bpf/bpftool/bpftool btf dump file ./arch/x86/entry/entry.o format raw
>       [1] FUNC_PROTO '(anon)' ret_type_id=0 vlen=0
>       [2] FUNC 'entry_ibpb' type_id=1 linkage=static
>       15e20ce2324a:~/git/linux #
>     
>     I think this is what can be done to avoid having to skip ASM DWARF when
>     gets widely used, i.e. binutils gets updated.
>     
>     Cc: Andrii Nakryiko <andrii.nakryiko@xxxxxxxxx>,
>     Cc: Martin Liška <mliska@xxxxxxx>
>     Cc: Yonghong Song <yhs@xxxxxxxx>
>     Signed-off-by: Arnaldo Carvalho de Melo <acme@xxxxxxxxxx>
> 
> diff --git a/btf_encoder.c b/btf_encoder.c
> index fb2ca77e2e9bf144..a5fa04a84ee246ee 100644
> --- a/btf_encoder.c
> +++ b/btf_encoder.c
> @@ -593,6 +593,19 @@ static int32_t btf_encoder__add_func_param(struct btf_encoder *encoder, const ch
>  	}
>  }
>  
> +static int32_t btf_encoder__tag_type(struct btf_encoder *encoder, uint32_t type_id_off, uint32_t tag_type)
> +{
> +	if (tag_type == 0)
> +		return 0;
> +
> +	if (encoder->cu->unspecified_type.tag && tag_type == encoder->cu->unspecified_type.type) {
> +		// No provision for encoding this, turn it into void.
> +		return 0;
> +	}
> +
> +	return type_id_off + tag_type;
> +}
> +
>  static int32_t btf_encoder__add_func_proto(struct btf_encoder *encoder, struct ftype *ftype, uint32_t type_id_off)
>  {
>  	struct btf *btf = encoder->btf;
> @@ -603,7 +616,7 @@ static int32_t btf_encoder__add_func_proto(struct btf_encoder *encoder, struct f
>  
>  	/* add btf_type for func_proto */
>  	nr_params = ftype->nr_parms + (ftype->unspec_parms ? 1 : 0);
> -	type_id = ftype->tag.type == 0 ? 0 : type_id_off + ftype->tag.type;
> +	type_id = btf_encoder__tag_type(encoder, type_id_off, ftype->tag.type);
>  
>  	id = btf__add_func_proto(btf, type_id);
>  	if (id > 0) {
> @@ -966,6 +979,15 @@ static int btf_encoder__encode_tag(struct btf_encoder *encoder, struct tag *tag,
>  		return btf_encoder__add_enum_type(encoder, tag, conf_load);
>  	case DW_TAG_subroutine_type:
>  		return btf_encoder__add_func_proto(encoder, tag__ftype(tag), type_id_off);
> +        case DW_TAG_unspecified_type:
> +		/* Just don't encode this for now, converting anything with this type to void (0) instead.
> +		 *
> +		 * If we end up needing to encode this, one possible hack is to do as follows, as "const void".
> +		 *
> +		 * Returning zero means we skipped encoding a DWARF type.
> +		 */
> +               // btf_encoder__add_ref_type(encoder, BTF_KIND_CONST, 0, NULL, false);
> +               return 0;
>  	default:
>  		fprintf(stderr, "Unsupported DW_TAG_%s(0x%x): type: 0x%x\n",
>  			dwarf_tag_name(tag->tag), tag->tag, ref_type_id);
> @@ -1487,7 +1509,7 @@ int btf_encoder__encode_cu(struct btf_encoder *encoder, struct cu *cu, struct co
>  {
>  	uint32_t type_id_off = btf__type_cnt(encoder->btf) - 1;
>  	struct llvm_annotation *annot;
> -	int btf_type_id, tag_type_id;
> +	int btf_type_id, tag_type_id, skipped_types = 0;
>  	uint32_t core_id;
>  	struct function *fn;
>  	struct tag *pos;
> @@ -1510,8 +1532,13 @@ int btf_encoder__encode_cu(struct btf_encoder *encoder, struct cu *cu, struct co
>  	cu__for_each_type(cu, core_id, pos) {
>  		btf_type_id = btf_encoder__encode_tag(encoder, pos, type_id_off, conf_load);
>  
> +		if (btf_type_id == 0) {
> +			++skipped_types;
> +			continue;
> +		}
> +
>  		if (btf_type_id < 0 ||
> -		    tag__check_id_drift(pos, core_id, btf_type_id, type_id_off)) {
> +		    tag__check_id_drift(pos, core_id, btf_type_id + skipped_types, type_id_off)) {
>  			err = -1;
>  			goto out;
>  		}

-- 

- Arnaldo



[Index of Archives]     [Linux USB Devel]     [Linux Audio Users]     [Yosemite News]     [Linux Kernel]     [Linux SCSI]

  Powered by Linux