hi, Suren, On Thu, Nov 14, 2024 at 03:40:08PM -0800, Suren Baghdasaryan wrote: > On Wed, Nov 13, 2024 at 1:34 PM Suren Baghdasaryan <surenb@xxxxxxxxxx> wrote: > > > > On Wed, Nov 13, 2024 at 2:07 PM kernel test robot <oliver.sang@xxxxxxxxx> wrote: > > > > > > > > > > > > Hello, > > > > > > > > > we reported > > > "[linux-next:master] [alloc_tag] a9c60bb0d0: BUG:KASAN:vmalloc-out-of-bounds_in_load_module" > > > in > > > https://lore.kernel.org/all/202410281441.216670ac-lkp@xxxxxxxxx/ > > > > > > we noticed it seems there is following patch. > > > > > > we made below report just FYI that the commit still cause similar issue on > > > linux-next/master and not fixed on tip of linux-next/master when this bisect > > > is done. > > > > > > > > > kernel test robot noticed "BUG:KASAN:vmalloc-out-of-bounds_in_move_module" on: > > > > > > commit: 0f9b685626daa2f8e19a9788625c9b624c223e45 ("alloc_tag: populate memory for module tags as needed") > > > https://git.kernel.org/cgit/linux/kernel/git/next/linux-next.git master > > > > > > [test failed on linux-next/master 929beafbe7acce3267c06115e13e03ff6e50548a] > > > > > > in testcase: rcuscale > > > version: > > > with following parameters: > > > > > > runtime: 300s > > > scale_type: srcu > > > > > > > > > > > > config: x86_64-randconfig-014-20241107 > > > compiler: gcc-12 > > > test machine: qemu-system-x86_64 -enable-kvm -cpu SandyBridge -smp 2 -m 16G > > > > > > (please refer to attached dmesg/kmsg for entire log/backtrace) > > > > > > > > > +------------------------------------------------+------------+------------+ > > > | | 0db6f8d782 | 0f9b685626 | > > > +------------------------------------------------+------------+------------+ > > > | boot_successes | 18 | 0 | > > > | boot_failures | 0 | 18 | > > > | BUG:KASAN:vmalloc-out-of-bounds_in_move_module | 0 | 18 | > > > | BUG:unable_to_handle_page_fault_for_address | 0 | 18 | > > > | Oops | 0 | 18 | > > > | RIP:kasan_metadata_fetch_row | 0 | 18 | > > > | Kernel_panic-not_syncing:Fatal_exception | 0 | 18 | > > > +------------------------------------------------+------------+------------+ > > > > > > > > > If you fix the issue in a separate patch/commit (i.e. not just a new version of > > > the same patch/commit), kindly add following tags > > > | Reported-by: kernel test robot <oliver.sang@xxxxxxxxx> > > > | Closes: https://lore.kernel.org/oe-lkp/202411132111.6a221562-lkp@xxxxxxxxx > > > > > > Thanks for the report! I'm looking into this but so far could not find > > an obvious issue. Will try to reproduce. > > For some reason I'm getting this panic when trying to follow the repro steps: sorry about this. kernel test robot run into some cluster issues these days and most of my time is occupied to solve them. I will look at this later when our cluster back to normal and give you an update sorry again for any inconvenience. > > # bin/lkp qemu -k /home/suren/linux-next/arch/x86_64/boot/bzImage -m > /home/suren/linux-next/out/modules.cgz job-script > ... > [ 22.813623][ T1] Kernel panic - not syncing: VFS: Unable to > mount root fs on unknown-block(0,0) > [ 22.815461][ T1] CPU: 0 UID: 0 PID: 1 Comm: swapper Not tainted > 6.12.0-rc7-next-20241113 #1 060e60d2378c08a3d0121faf43856b671a45697c > [ 22.817822][ T1] Hardware name: QEMU Standard PC (i440FX + PIIX, > 1996), BIOS 1.15.0-1 04/01/2014 > [ 22.819655][ T1] Call Trace: > [ 22.820399][ T1] <TASK> > [ 22.821077][ T1] panic+0x243/0x486 > [ 22.821965][ T1] ? crash_smp_send_stop+0x1c/0x1c > [ 22.823019][ T1] ? lock_release+0x17c/0x1b1 > [ 22.824006][ T1] mount_root_generic+0x31d/0x3d0 > [ 22.825064][ T1] ? init_rootfs+0x4c/0x4c > [ 22.826011][ T1] ? init_stat+0xd8/0xd8 > [ 22.826920][ T1] ? __asan_memcpy+0x3c/0x65 > [ 22.827880][ T1] ? getname_kernel+0x3dc/0x41e > [ 22.828887][ T1] prepare_namespace+0x21e/0x289 > [ 22.829895][ T1] ? mount_root+0xc6/0xc6 > [ 22.830819][ T1] ? fput+0x1b/0x194 > [ 22.831682][ T1] ? rest_init+0x183/0x183 > [ 22.832624][ T1] kernel_init+0x17/0x138 > [ 22.833535][ T1] ? rest_init+0x183/0x183 > [ 22.834490][ T1] ret_from_fork+0x20/0x54 > [ 22.835419][ T1] ? rest_init+0x183/0x183 > [ 22.836343][ T1] ret_from_fork_asm+0x11/0x20 > [ 22.837335][ T1] </TASK> > [ 22.838082][ T1] Kernel Offset: disabled > > I see that PeterZ had the same issue back in September: > https://lore.kernel.org/lkml/20240909091531.GA4723@xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx/ > , so might this be a known issue? If anyone has an idea what I'm doing > wrong I would appreciate your help. > Thanks, > Suren. > > > > > > > > > > > > > [ 153.897376][ T402] BUG: KASAN: vmalloc-out-of-bounds in move_module (kernel/module/main.c:2357) > > > [ 153.899141][ T402] Write of size 40 at addr ffffffffa0000000 by task modprobe/402 > > > [ 153.900837][ T402] > > > [ 153.901496][ T402] CPU: 0 UID: 0 PID: 402 Comm: modprobe Tainted: G T 6.12.0-rc6-00146-g0f9b685626da #1 87c8486a909ba2f90eff061a4c9c1fa5c9cd90ea > > > [ 153.904537][ T402] Tainted: [T]=RANDSTRUCT > > > [ 153.905500][ T402] Hardware name: QEMU Standard PC (i440FX + PIIX, 1996), BIOS 1.16.2-debian-1.16.2-1 04/01/2014 > > > [ 153.907702][ T402] Call Trace: > > > [ 153.908510][ T402] <TASK> > > > [ 153.909241][ T402] print_address_description+0x65/0x2fa > > > [ 153.910663][ T402] print_report (mm/kasan/report.c:489) > > > [ 153.911771][ T402] ? move_module (kernel/module/main.c:2357) > > > [ 153.912825][ T402] kasan_report (mm/kasan/report.c:603) > > > [ 153.913821][ T402] ? move_module (kernel/module/main.c:2357) > > > [ 153.914904][ T402] kasan_check_range (mm/kasan/generic.c:183 mm/kasan/generic.c:189) > > > [ 153.916029][ T402] __asan_memcpy (mm/kasan/shadow.c:105 (discriminator 1)) > > > [ 153.917057][ T402] move_module (kernel/module/main.c:2357) > > > [ 153.918071][ T402] layout_and_allocate+0x446/0x523 > > > [ 153.919459][ T402] load_module (kernel/module/main.c:2985) > > > [ 153.920457][ T402] ? mode_strip_umask (fs/namei.c:3248) > > > [ 153.921557][ T402] init_module_from_file (kernel/module/main.c:3266) > > > [ 153.922825][ T402] ? __ia32_sys_init_module (kernel/module/main.c:3266) > > > [ 153.923992][ T402] ? __lock_release+0x106/0x38c > > > [ 153.925173][ T402] ? idempotent_init_module (kernel/module/main.c:3301) > > > [ 153.926364][ T402] ? lock_release (kernel/locking/lockdep.c:467 kernel/locking/lockdep.c:5848) > > > [ 153.944053][ T402] idempotent_init_module (kernel/module/main.c:3302) > > > [ 153.945164][ T402] ? init_module_from_file (kernel/module/main.c:3294) > > > [ 153.946268][ T402] ? security_capable (security/security.c:1143) > > > [ 153.947421][ T402] __do_sys_finit_module (include/linux/file.h:68 kernel/module/main.c:3330) > > > [ 153.948495][ T402] do_syscall_64 (arch/x86/entry/common.c:52 arch/x86/entry/common.c:83) > > > [ 153.949540][ T402] entry_SYSCALL_64_after_hwframe (arch/x86/entry/entry_64.S:130) > > > [ 153.950855][ T402] RIP: 0033:0x7f0f37df7719 > > > [ 153.951869][ T402] Code: 08 89 e8 5b 5d c3 66 2e 0f 1f 84 00 00 00 00 00 90 48 89 f8 48 89 f7 48 89 d6 48 89 ca 4d 89 c2 4d 89 c8 4c 8b 4c 24 08 0f 05 <48> 3d 01 f0 ff ff 73 01 c3 48 8b 0d b7 06 0d 00 f7 d8 64 89 01 48 > > > All code > > > ======== > > > 0: 08 89 e8 5b 5d c3 or %cl,-0x3ca2a418(%rcx) > > > 6: 66 2e 0f 1f 84 00 00 cs nopw 0x0(%rax,%rax,1) > > > d: 00 00 00 > > > 10: 90 nop > > > 11: 48 89 f8 mov %rdi,%rax > > > 14: 48 89 f7 mov %rsi,%rdi > > > 17: 48 89 d6 mov %rdx,%rsi > > > 1a: 48 89 ca mov %rcx,%rdx > > > 1d: 4d 89 c2 mov %r8,%r10 > > > 20: 4d 89 c8 mov %r9,%r8 > > > 23: 4c 8b 4c 24 08 mov 0x8(%rsp),%r9 > > > 28: 0f 05 syscall > > > 2a:* 48 3d 01 f0 ff ff cmp $0xfffffffffffff001,%rax <-- trapping instruction > > > 30: 73 01 jae 0x33 > > > 32: c3 ret > > > 33: 48 8b 0d b7 06 0d 00 mov 0xd06b7(%rip),%rcx # 0xd06f1 > > > 3a: f7 d8 neg %eax > > > 3c: 64 89 01 mov %eax,%fs:(%rcx) > > > 3f: 48 rex.W > > > > > > Code starting with the faulting instruction > > > =========================================== > > > 0: 48 3d 01 f0 ff ff cmp $0xfffffffffffff001,%rax > > > 6: 73 01 jae 0x9 > > > 8: c3 ret > > > 9: 48 8b 0d b7 06 0d 00 mov 0xd06b7(%rip),%rcx # 0xd06c7 > > > 10: f7 d8 neg %eax > > > 12: 64 89 01 mov %eax,%fs:(%rcx) > > > 15: 48 rex.W > > > [ 153.955810][ T402] RSP: 002b:00007ffccd7f7198 EFLAGS: 00000246 ORIG_RAX: 0000000000000139 > > > [ 153.957666][ T402] RAX: ffffffffffffffda RBX: 000055cc9f9fddd0 RCX: 00007f0f37df7719 > > > [ 153.959411][ T402] RDX: 0000000000000000 RSI: 000055cc9f9f24a0 RDI: 0000000000000004 > > > [ 153.961142][ T402] RBP: 000055cc9f9f24a0 R08: 0000000000000000 R09: 000055cc9f9ff250 > > > [ 153.962910][ T402] R10: 0000000000000004 R11: 0000000000000246 R12: 0000000000040000 > > > [ 153.964665][ T402] R13: 0000000000000000 R14: 000055cc9f9fdd80 R15: 0000000000000000 > > > [ 153.966393][ T402] </TASK> > > > [ 153.967209][ T402] > > > [ 153.967856][ T402] Memory state around the buggy address: > > > [ 153.969123][ T402] BUG: unable to handle page fault for address: fffffbfff3ffffe0 > > > [ 153.970807][ T402] #PF: supervisor read access in kernel mode > > > [ 153.972036][ T402] #PF: error_code(0x0000) - not-present page > > > [ 153.973220][ T402] PGD 417fdb067 P4D 417fdb067 PUD 417fd7067 PMD 0 > > > [ 153.974560][ T402] Oops: Oops: 0000 [#1] PREEMPT KASAN > > > [ 153.975758][ T402] CPU: 0 UID: 0 PID: 402 Comm: modprobe Tainted: G T 6.12.0-rc6-00146-g0f9b685626da #1 87c8486a909ba2f90eff061a4c9c1fa5c9cd90ea > > > [ 153.978853][ T402] Tainted: [T]=RANDSTRUCT > > > [ 153.979851][ T402] Hardware name: QEMU Standard PC (i440FX + PIIX, 1996), BIOS 1.16.2-debian-1.16.2-1 04/01/2014 > > > [ 153.982008][ T402] RIP: 0010:kasan_metadata_fetch_row (mm/kasan/report_generic.c:186) > > > [ 153.983368][ T402] Code: 40 08 48 89 43 58 5b 31 c0 31 d2 31 c9 31 f6 31 ff c3 cc cc cc cc 66 0f 1f 00 b8 ff ff 37 00 48 c1 ee 03 48 c1 e0 2a 48 01 c6 <48> 8b 06 48 89 07 48 8b 46 08 48 89 47 08 31 c0 31 f6 31 ff c3 cc > > > All code > > > ======== > > > 0: 40 08 48 89 rex or %cl,-0x77(%rax) > > > 4: 43 58 rex.XB pop %r8 > > > 6: 5b pop %rbx > > > 7: 31 c0 xor %eax,%eax > > > 9: 31 d2 xor %edx,%edx > > > b: 31 c9 xor %ecx,%ecx > > > d: 31 f6 xor %esi,%esi > > > f: 31 ff xor %edi,%edi > > > 11: c3 ret > > > 12: cc int3 > > > 13: cc int3 > > > 14: cc int3 > > > 15: cc int3 > > > 16: 66 0f 1f 00 nopw (%rax) > > > 1a: b8 ff ff 37 00 mov $0x37ffff,%eax > > > 1f: 48 c1 ee 03 shr $0x3,%rsi > > > 23: 48 c1 e0 2a shl $0x2a,%rax > > > 27: 48 01 c6 add %rax,%rsi > > > 2a:* 48 8b 06 mov (%rsi),%rax <-- trapping instruction > > > 2d: 48 89 07 mov %rax,(%rdi) > > > 30: 48 8b 46 08 mov 0x8(%rsi),%rax > > > 34: 48 89 47 08 mov %rax,0x8(%rdi) > > > 38: 31 c0 xor %eax,%eax > > > 3a: 31 f6 xor %esi,%esi > > > 3c: 31 ff xor %edi,%edi > > > 3e: c3 ret > > > 3f: cc int3 > > > > > > Code starting with the faulting instruction > > > =========================================== > > > 0: 48 8b 06 mov (%rsi),%rax > > > 3: 48 89 07 mov %rax,(%rdi) > > > 6: 48 8b 46 08 mov 0x8(%rsi),%rax > > > a: 48 89 47 08 mov %rax,0x8(%rdi) > > > e: 31 c0 xor %eax,%eax > > > 10: 31 f6 xor %esi,%esi > > > 12: 31 ff xor %edi,%edi > > > 14: c3 ret > > > 15: cc int3 > > > [ 153.987254][ T402] RSP: 0018:ffffc9000218f9f8 EFLAGS: 00010082 > > > [ 153.988595][ T402] RAX: dffffc0000000000 RBX: ffffffff9fffff00 RCX: 0000000000000000 > > > [ 153.990325][ T402] RDX: 0000000000000000 RSI: fffffbfff3ffffe0 RDI: ffffc9000218fa04 > > > [ 153.992086][ T402] RBP: 00000000fffffffe R08: 0000000000000000 R09: 0000000000000000 > > > [ 153.993786][ T402] R10: 0000000000000000 R11: 0000000000000000 R12: ffffffffa0000000 > > > [ 153.995554][ T402] R13: ffffffff864b4994 R14: ffffffff9fffff80 R15: 0000000000000028 > > > [ 153.997305][ T402] FS: 00007f0f37cf5040(0000) GS:ffffffff86989000(0000) knlGS:0000000000000000 > > > [ 153.999133][ T402] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 > > > [ 154.000578][ T402] CR2: fffffbfff3ffffe0 CR3: 0000000128853000 CR4: 00000000000006b0 > > > [ 154.002367][ T402] Call Trace: > > > [ 154.003318][ T402] <TASK> > > > [ 154.004087][ T402] ? __die_body (arch/x86/kernel/dumpstack.c:421) > > > [ 154.005074][ T402] ? page_fault_oops (arch/x86/mm/fault.c:710) > > > [ 154.006242][ T402] ? show_fault_oops (arch/x86/mm/fault.c:643) > > > [ 154.007368][ T402] ? search_module_extables (kernel/module/main.c:3369) > > > [ 154.008525][ T402] ? fixup_exception (arch/x86/mm/extable.c:321) > > > [ 154.009629][ T402] ? exc_page_fault (arch/x86/mm/fault.c:1479 arch/x86/mm/fault.c:1539) > > > [ 154.010771][ T402] ? asm_exc_page_fault (arch/x86/include/asm/idtentry.h:623) > > > [ 154.011853][ T402] ? kasan_metadata_fetch_row (mm/kasan/report_generic.c:186) > > > [ 154.013072][ T402] print_report (mm/kasan/report.c:466 mm/kasan/report.c:489) > > > [ 154.014122][ T402] ? move_module (kernel/module/main.c:2357) > > > [ 154.015238][ T402] kasan_report (mm/kasan/report.c:603) > > > [ 154.016231][ T402] ? move_module (kernel/module/main.c:2357) > > > [ 154.017255][ T402] kasan_check_range (mm/kasan/generic.c:183 mm/kasan/generic.c:189) > > > [ 154.018379][ T402] __asan_memcpy (mm/kasan/shadow.c:105 (discriminator 1)) > > > [ 154.019400][ T402] move_module (kernel/module/main.c:2357) > > > [ 154.020435][ T402] layout_and_allocate+0x446/0x523 > > > [ 154.021792][ T402] load_module (kernel/module/main.c:2985) > > > [ 154.022822][ T402] ? mode_strip_umask (fs/namei.c:3248) > > > [ 154.023928][ T402] init_module_from_file (kernel/module/main.c:3266) > > > [ 154.025069][ T402] ? __ia32_sys_init_module (kernel/module/main.c:3266) > > > [ 154.026265][ T402] ? __lock_release+0x106/0x38c > > > [ 154.027496][ T402] ? idempotent_init_module (kernel/module/main.c:3301) > > > [ 154.028688][ T402] ? lock_release (kernel/locking/lockdep.c:467 kernel/locking/lockdep.c:5848) > > > [ 154.029766][ T402] idempotent_init_module (kernel/module/main.c:3302) > > > [ 154.030985][ T402] ? init_module_from_file (kernel/module/main.c:3294) > > > [ 154.032192][ T402] ? security_capable (security/security.c:1143) > > > [ 154.033310][ T402] __do_sys_finit_module (include/linux/file.h:68 kernel/module/main.c:3330) > > > [ 154.034478][ T402] do_syscall_64 (arch/x86/entry/common.c:52 arch/x86/entry/common.c:83) > > > [ 154.035532][ T402] entry_SYSCALL_64_after_hwframe (arch/x86/entry/entry_64.S:130) > > > [ 154.036819][ T402] RIP: 0033:0x7f0f37df7719 > > > [ 154.037865][ T402] Code: 08 89 e8 5b 5d c3 66 2e 0f 1f 84 00 00 00 00 00 90 48 89 f8 48 89 f7 48 89 d6 48 89 ca 4d 89 c2 4d 89 c8 4c 8b 4c 24 08 0f 05 <48> 3d 01 f0 ff ff 73 01 c3 48 8b 0d b7 06 0d 00 f7 d8 64 89 01 48 > > > All code > > > ======== > > > 0: 08 89 e8 5b 5d c3 or %cl,-0x3ca2a418(%rcx) > > > 6: 66 2e 0f 1f 84 00 00 cs nopw 0x0(%rax,%rax,1) > > > d: 00 00 00 > > > 10: 90 nop > > > 11: 48 89 f8 mov %rdi,%rax > > > 14: 48 89 f7 mov %rsi,%rdi > > > 17: 48 89 d6 mov %rdx,%rsi > > > 1a: 48 89 ca mov %rcx,%rdx > > > 1d: 4d 89 c2 mov %r8,%r10 > > > 20: 4d 89 c8 mov %r9,%r8 > > > 23: 4c 8b 4c 24 08 mov 0x8(%rsp),%r9 > > > 28: 0f 05 syscall > > > 2a:* 48 3d 01 f0 ff ff cmp $0xfffffffffffff001,%rax <-- trapping instruction > > > 30: 73 01 jae 0x33 > > > 32: c3 ret > > > 33: 48 8b 0d b7 06 0d 00 mov 0xd06b7(%rip),%rcx # 0xd06f1 > > > 3a: f7 d8 neg %eax > > > 3c: 64 89 01 mov %eax,%fs:(%rcx) > > > 3f: 48 rex.W > > > > > > Code starting with the faulting instruction > > > =========================================== > > > 0: 48 3d 01 f0 ff ff cmp $0xfffffffffffff001,%rax > > > 6: 73 01 jae 0x9 > > > 8: c3 ret > > > 9: 48 8b 0d b7 06 0d 00 mov 0xd06b7(%rip),%rcx # 0xd06c7 > > > 10: f7 d8 neg %eax > > > 12: 64 89 01 mov %eax,%fs:(%rcx) > > > 15: 48 rex.W > > > > > > > > > The kernel config and materials to reproduce are available at: > > > https://download.01.org/0day-ci/archive/20241113/202411132111.6a221562-lkp@xxxxxxxxx > > > > > > > > > > > > -- > > > 0-DAY CI Kernel Test Service > > > https://github.com/intel/lkp-tests/wiki > > >