Re: [REGRESSION] Kernel booted via kexec fails to resume from hibernation

Hi,
On 01/13/25 at 10:28pm, Roberto Ricci wrote:
> After rebooting the system via kexec, hibernating and rebooting the machine, this oops occurs:
> 
[snip]
> 
> I will send the kernel config and dmesg in replies to this email.
> 

I tried your config (with some irrelevant driver options removed), but it cannot boot on my KVM guest.
First I saw a panic in the ftrace path. I then rebuilt the kernel without ftrace, and it panicked again, this time in a KVM-related code path.
Neither panic is related to kexec at all, so I suspect your bug is not kexec-specific.

Ftrace panic:
----
[    0.215515] software IO TLB: area num 4.
[    0.298815] SLUB: HWalign=64, Order=0-3, MinObjects=0, CPUs=4, Nodes=1
Poking KASLR using RDRAND RDTSC...
[    0.299813] ftrace: allocating 40381 entries in 158 pages
[    0.300603] Oops: general protection fault, probably for non-canonical address 0xdffffc0000008a30: 0000 [#1] PREEMPT SMP KASAN NOPTI
[    0.301877] KASAN: probably user-memory-access in range [0x0000000000045180-0x0000000000045187]
[    0.302734] CPU: 0 UID: 0 PID: 0 Comm: swapper Not tainted 6.13.0_ricci+ #913
[    0.303401] Hardware name: QEMU Standard PC (Q35 + ICH9, 2009), BIOS 1.16.3-2.fc40 04/01/2014
[    0.304187] RIP: 0010:__text_poke+0x370/0x840
[    0.304591] Code: c6 48 8b 04 24 48 8d 78 08 e8 cc fb ff ff 48 c7 c0 80 51 04 00 48 8b 0d 96 32 d9 02 48 ba 00 00 00 00 00 fc ff df 48 c1 e8 03 <0f> b6 04 10 84 c0 74 06 0f 8e cc 03 00 00 65 0f b6 1d aa 3c ed 6e
[    0.306646] RSP: 0000:ffffffff94007d88 EFLAGS: 00010006
[    0.307139] RAX: 0000000000008a30 RBX: ffffffff93f0e070 RCX: ffff8881000744c0
[    0.307839] RDX: dffffc0000000000 RSI: 8000000072001063 RDI: ffff88810006d0d8
[    0.308634] RBP: ffffffff91001b24 R08: 0000000000000001 R09: fffffbfff2800f91
[    0.309359] R10: 0000000000000003 R11: 0000000000000100 R12: 0000000000000005
[    0.309994] R13: 0000000000000b24 R14: 0000000000000b29 R15: 8000000000000063
[    0.310631] FS:  0000000000000000(0000) GS:ffff88815b400000(0000) knlGS:0000000000000000
[    0.311351] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[    0.311867] CR2: ffff88817ffff000 CR3: 0000000075070001 CR4: 00000000000200b0
[    0.312510] Call Trace:
[    0.312731]  <TASK>
[    0.312922]  ? __die_body.cold+0x19/0x2b
[    0.313280]  ? die_addr+0x46/0x70
[    0.313582]  ? exc_general_protection+0x150/0x240
[    0.314010]  ? asm_exc_general_protection+0x26/0x30
[    0.314454]  ? trace_initcall_start_cb+0x4/0x90
[    0.314868]  ? __text_poke+0x370/0x840
[    0.315211]  ? __text_poke+0x2ab/0x840
[    0.315552]  ? __pfx_text_poke_memcpy+0x10/0x10
[    0.315965]  ? __pfx___text_poke+0x10/0x10
[    0.316345]  ? mutex_lock+0x87/0xe0
[    0.316662]  ? __pfx_mutex_lock+0x10/0x10
[    0.317096]  ? ftrace_now+0xc0/0xd0
[    0.317431]  ? __pfx_ftrace_now+0x10/0x10
[    0.317813]  ? trace_initcall_start_cb+0x4/0x90
[    0.318245]  ftrace_modify_code_direct+0x58/0x70
[    0.318696]  ftrace_process_locs+0x514/0xb70
[    0.319105]  ftrace_init+0x80/0x150
[    0.319458]  start_kernel+0x181/0x3c0
[    0.319802]  x86_64_start_reservations+0x24/0x30
[    0.320219]  x86_64_start_kernel+0x84/0x90
[    0.320584]  common_startup_64+0x13e/0x141
[    0.320952]  </TASK>
[    0.321149] Modules linked in:
[    0.321423] ---[ end trace 0000000000000000 ]---
[    0.321832] RIP: 0010:__text_poke+0x370/0x840
[    0.322227] Code: c6 48 8b 04 24 48 8d 78 08 e8 cc fb ff ff 48 c7 c0 80 51 04 00 48 8b 0d 96 32 d9 02 48 ba 00 00 00 00 00 fc ff df 48 c1 e8 03 <0f> b6 04 10 84 c0 74 06 0f 8e cc 03 00 00 65 0f b6 1d aa 3c ed 6e
[    0.323906] RSP: 0000:ffffffff94007d88 EFLAGS: 00010006
[    0.324383] RAX: 0000000000008a30 RBX: ffffffff93f0e070 RCX: ffff8881000744c0
[    0.325025] RDX: dffffc0000000000 RSI: 8000000072001063 RDI: ffff88810006d0d8
[    0.325681] RBP: ffffffff91001b24 R08: 0000000000000001 R09: fffffbfff2800f91
[    0.326332] R10: 0000000000000003 R11: 0000000000000100 R12: 0000000000000005
[    0.326986] R13: 0000000000000b24 R14: 0000000000000b29 R15: 8000000000000063
[    0.327658] FS:  0000000000000000(0000) GS:ffff88815b400000(0000) knlGS:0000000000000000
[    0.328411] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[    0.328931] CR2: ffff88817ffff000 CR3: 0000000075070001 CR4: 00000000000200b0
[    0.329595] Kernel panic - not syncing: Attempted to kill the idle task!
[    0.330278] ---[ end Kernel panic - not syncing: Attempted to kill the idle task! ]---

kvm panic:
-------
[    0.222231] rcu: srcu_init: Setting srcu_struct sizes based on contention.
[    0.222824] kfence: initialized - using 2097152 bytes for 255 objects at 0x(____ptrval____)-0x(____ptrval____)
[    0.223470] Oops: general protection fault, probably for non-canonical address 0xdffffc0000005356: 0000 [#1] PREEMPT SMP KASAN NOPTI
[    0.224200] KASAN: probably user-memory-access in range [0x0000000000029ab0-0x0000000000029ab7]
[    0.224747] CPU: 0 UID: 0 PID: 0 Comm: swapper/0 Not tainted 6.13.0_ricci+ #917
[    0.225223] Hardware name: QEMU Standard PC (Q35 + ICH9, 2009), BIOS 1.16.3-2.fc40 04/01/2014
[    0.225779] RIP: 0010:__sysvec_kvm_asyncpf_interrupt+0x64/0x110
[    0.226169] Code: 7c e4 4a 48 c7 c7 40 35 27 b7 e8 67 c2 cd 01 48 c7 c0 b0 9a 02 00 48 ba 00 00 00 00 00 fc ff df 48 89 c1 83 e0 07 48 c1 e9 03 <0f> b6 14 11 38 c2 7f 08 84 d2 0f 85 80 00 00 00 65 0f b6 1d 54 61
[    0.227327] RSP: 0000:ffff888157009fe0 EFLAGS: 00010006
[    0.227657] RAX: 0000000000000000 RBX: ffffffffb7a07e68 RCX: 0000000000005356
[    0.228088] RDX: dffffc0000000000 RSI: ffffffffb7273540 RDI: ffffffffb740be40
[    0.228523] RBP: 0000000000000000 R08: 0000000000000001 R09: ffffed102ae07c08
[    0.228960] R10: ffff88815703e047 R11: ffff888157009ff8 R12: 0000000000000000
[    0.229390] R13: 0000000000000000 R14: 0000000000000000 R15: 0000000000000000
[    0.229848] FS:  0000000000000000(0000) GS:ffff888157000000(0000) knlGS:0000000000000000
[    0.230375] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[    0.231089] CR2: ffff88815d201000 CR3: 000000015ba66001 CR4: 00000000000200b0
[    0.231567] Call Trace:
[    0.231727]  <IRQ>
[    0.231859]  ? __die_body.cold+0x19/0x26
[    0.232125]  ? die_addr+0x41/0x70
[    0.232354]  ? exc_general_protection+0x150/0x240
[    0.232663]  ? asm_exc_general_protection+0x26/0x30
[    0.232988]  ? __sysvec_kvm_asyncpf_interrupt+0x64/0x110
[    0.233341]  sysvec_kvm_asyncpf_interrupt+0x9b/0xe0
[    0.233668]  </IRQ>
[    0.233805]  <TASK>
[    0.233942]  asm_sysvec_kvm_asyncpf_interrupt+0x1a/0x20
[    0.234295] RIP: 0010:__x86_return_thunk+0x0/0x10
[    0.234606] Code: 01 00 00 00 cc e8 01 00 00 00 cc 48 81 c4 80 00 00 00 65 48 c7 05 3c 54 14 49 ff ff ff ff c3 cc 66 2e 0f 1f 84 00 00 00 00 00 <c3> 90 90 90 90 cc cc cc cc cc cc cc cc cc cc cc e9 eb ff ff ff 0f
[    0.235828] RSP: 0000:ffffffffb7a07f18 EFLAGS: 00000296
[    0.236168] RAX: 0000000000000096 RBX: 0000000000000000 RCX: 1ffff1102ae07af0
[    0.236677] RDX: dffffc0000000000 RSI: 0000000000000004 RDI: 000000000000000f
[    0.237148] RBP: 00000000000000b0 R08: 0000000000000001 R09: 0000000000000000
[    0.237610] R10: ffff8881571c1e40 R11: 0000000000000007 R12: 0000000000000000
[    0.238071] R13: 0000000000000000 R14: 0000000000000000 R15: 0000000000094770
[    0.238547]  start_kernel+0x263/0x3a0
[    0.238787]  x86_64_start_reservations+0x24/0x30
[    0.239088]  x86_64_start_kernel+0x84/0x90
[    0.239355]  common_startup_64+0x13e/0x141
[    0.239637]  </TASK>
[    0.239786] Modules linked in:
[    0.239987] ---[ end trace 0000000000000000 ]---
[    0.240287] RIP: 0010:__sysvec_kvm_asyncpf_interrupt+0x64/0x110
[    0.240676] Code: 7c e4 4a 48 c7 c7 40 35 27 b7 e8 67 c2 cd 01 48 c7 c0 b0 9a 02 00 48 ba 00 00 00 00 00 fc ff df 48 89 c1 83 e0 07 48 c1 e9 03 <0f> b6 14 11 38 c2 7f 08 84 d2 0f 85 80 00 00 00 65 0f b6 1d 54 61
[    0.241896] RSP: 0000:ffff888157009fe0 EFLAGS: 00010006
[    0.242242] RAX: 0000000000000000 RBX: ffffffffb7a07e68 RCX: 0000000000005356
[    0.242708] RDX: dffffc0000000000 RSI: ffffffffb7273540 RDI: ffffffffb740be40
[    0.243194] RBP: 0000000000000000 R08: 0000000000000001 R09: ffffed102ae07c08
[    0.243659] R10: ffff88815703e047 R11: ffff888157009ff8 R12: 0000000000000000
[    0.244122] R13: 0000000000000000 R14: 0000000000000000 R15: 0000000000000000
[    0.244595] FS:  0000000000000000(0000) GS:ffff888157000000(0000) knlGS:0000000000000000
[    0.245121] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[    0.245502] CR2: ffff88815d201000 CR3: 000000015ba66001 CR4: 00000000000200b0
[    0.245968] Kernel panic - not syncing: Fatal exception in interrupt
[    0.246411] ---[ end Kernel panic - not syncing: Fatal exception in interrupt ]---
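
A note on reading the two oopses above: in both, the faulting "non-canonical address" looks like the KASAN shadow address of the small address that KASAN reports in its "user-memory-access" range. The minimal sketch below checks that arithmetic. It assumes the usual x86_64 generic-KASAN mapping, shadow = (addr >> 3) + 0xdffffc0000000000 (the same constant is in RDX in both dumps); the function names are made up for illustration and are not kernel APIs.

```python
# Sketch only: reproduces the address arithmetic seen in the oops output.
# Assumption: x86_64 generic KASAN, shadow offset 0xdffffc0000000000,
# one shadow byte per 8 bytes of memory (scale shift 3).
KASAN_SHADOW_OFFSET = 0xDFFFFC0000000000
KASAN_SHADOW_SCALE_SHIFT = 3
MASK64 = (1 << 64) - 1


def shadow_addr(addr: int) -> int:
    """Shadow byte address that KASAN would read for `addr`."""
    return ((addr >> KASAN_SHADOW_SCALE_SHIFT) + KASAN_SHADOW_OFFSET) & MASK64


def shadowed_range(shadow: int) -> tuple[int, int]:
    """Inclusive 8-byte range of memory covered by one shadow byte."""
    start = ((shadow - KASAN_SHADOW_OFFSET) & MASK64) << KASAN_SHADOW_SCALE_SHIFT
    start &= MASK64
    return start, start + (1 << KASAN_SHADOW_SCALE_SHIFT) - 1


if __name__ == "__main__":
    # Faulting addresses taken from the ftrace and kvm oopses respectively.
    for fault in (0xDFFFFC0000008A30, 0xDFFFFC0000005356):
        lo, hi = shadowed_range(fault)
        print(f"fault {fault:#018x} covers [{lo:#018x}-{hi:#018x}]")
```

If that reading is right, both panics are KASAN shadow lookups for a very low address (0x45180 and 0x29ab0), which would fit with neither of them being kexec-specific.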


You can find the kernel config here (with ftrace enabled):
https://people.redhat.com/~ruyang/snakeyear/panic-ftrace.config

Thanks
Dave




