On 3/3/24 05:30, Steve Wahl wrote: >
The machines I work on are large, though. Can you give specifics on exactly how you are performing your kexec, and what hardware you are using when you hit this (especially memory size)? Have you made any special arrangements for the size of memory reserved for kexec on your system?
Hi Steve, I'm using a mainstream Lenovo laptop with an AMD APU (Ryzen 3 5300U), this is my secondary/testing machine using which I've built the kernels and performed the git bisection. I've attached the relevant journal logs and inxi output.
My primary laptop which I'm typing this from is of the same build but with a slightly better APU (Ryzen 5 5500U) in which I have replicated the problem using kernels from OpenSuse repos, both patched and vanilla but not ones I've built myself.
The only peculiarity with these machines I can think of is that its onboard graphics reserves/uses a portion of the normal RAM as its VRAM.
I have reproduced the issue by calling kexec directly using "kexec -l" & "kexec -e", systemctl kexec, and also using the default systemd service/script provided by OpenSuse. The exact command it uses is as follows.
Kexec call: kexec --kexec-syscall-auto --load '/usr/lib/modules/6.7.4-1-default/vmlinuz' --initrd='/boot/initrd-6.7.4-1-default' --append='root=/dev/mapper/suse-system splash=silent mitigations=auto quiet crashkernel=421M,high crashkernel=72M,low security=apparmor'
Note: when I used "kexec -l", I only included the root fs path in append and none of the other options to rule any side effects.
The problem can be reliably reproduced when kexec'ing from the faulty kernel into the same kernel. This why there are two boot entries for each kernel (6.7.7 and 6.7.5) in the attached journal logs.
Let me know if you need any further clarifications. Kind regards, Pavin Joseph.
System: Kernel: 6.7.4-1-default arch: x86_64 bits: 64 compiler: gcc v: 13.2.1 clocksource: hpet Console: pty pts/1 DM: GDM v: 45.0.1 Distro: openSUSE Tumbleweed-Slowroll 20240213 Machine: Type: Laptop System: LENOVO product: 82KD v: Lenovo V15 G2 ALC Ua serial: <filter> Chassis: type: 10 v: Lenovo V15 G2 ALC Ua serial: <filter> Mobo: LENOVO model: LNVNB161216 v: SDK0T76486WIN serial: <filter> part-nu: LENOVO_MT_82KD_BU_idea_FM_V15 G2 ALC Ua uuid: dd35a9a6-b90d-4f65-8723-88a4c29167c5 UEFI: LENOVO v: GLCN46WW date: 03/23/2022 Battery: ID-1: BAT0 charge: 36.4 Wh (94.8%) condition: 38.4/38.0 Wh (101.0%) power: 4.4 W volts: 8.4 min: 7.7 model: BYD L20B2PF0 type: Li-poly serial: <filter> status: discharging cycles: 59 Memory: System RAM: total: 16 GiB available: 13.11 GiB used: 3.32 GiB (25.3%) Array-1: capacity: 16 GiB slots: 2 modules: 2 EC: None max-module-size: 8 GiB note: est. Device-1: Channel-A DIMM 0 type: DDR4 detail: synchronous unbuffered (unregistered) size: 8 GiB speed: 2133 MT/s volts: 1.2 width (bits): data: 64 total: 64 manufacturer: Hynix part-no: HMA41GS6AFR8N-TF serial: <filter> Device-2: Channel-B DIMM 0 type: DDR4 detail: synchronous unbuffered (unregistered) size: 8 GiB speed: spec: 3200 MT/s actual: 2133 MT/s volts: 1.2 width (bits): data: 64 total: 64 manufacturer: Samsung part-no: M471A1G44AB0-CWE serial: N/A CPU: Info: quad core model: AMD Ryzen 3 5300U with Radeon Graphics bits: 64 type: MT MCP smt: enabled arch: Zen 2 rev: 1 cache: L1: 256 KiB L2: 2 MiB L3: 4 MiB Speed (MHz): avg: 838 high: 1928 min/max: 400/3900 volts: 1.2 V ext-clock: 100 MHz cores: 1: 1386 2: 400 3: 400 4: 1397 5: 400 6: 1928 7: 400 8: 400 bogomips: 41537 Flags: avx avx2 ht lm nx pae sse sse2 sse3 sse4_1 sse4_2 sse4a ssse3 svm Graphics: Device-1: AMD Lucienne vendor: Lenovo driver: amdgpu v: kernel arch: GCN-5 pcie: speed: 8 GT/s lanes: 16 ports: active: eDP-1 empty: HDMI-A-1 bus-ID: 04:00.0 chip-ID: 1002:164c class-ID: 0300 temp: 32.0 C Device-2: Syntek Integrated Camera driver: uvcvideo type: USB rev: 2.0 speed: 480 Mb/s lanes: 1 bus-ID: 1-3:2 chip-ID: 174f:2459 class-ID: fe01 serial: <filter> Display: server: X.org v: 1.21.1.11 with: Xwayland v: 23.2.4 compositor: gnome-shell driver: X: loaded: modesetting unloaded: fbdev,vesa dri: radeonsi gpu: amdgpu tty: 174x43 Monitor-1: eDP-1 model: ChiMei InnoLux 0x15f5 res: 1920x1080 dpi: 142 size: 344x193mm (13.54x7.6") diag: 394mm (15.5") modes: max: 1920x1080 min: 640x480 API: OpenGL Message: GL data unavailable in console for root. API: EGL Message: EGL data unavailable in console, eglinfo missing. Audio: Device-1: AMD Renoir Radeon High Definition Audio vendor: Lenovo driver: snd_hda_intel v: kernel pcie: speed: 8 GT/s lanes: 16 bus-ID: 04:00.1 chip-ID: 1002:1637 class-ID: 0403 Device-2: AMD ACP/ACP3X/ACP6x Audio Coprocessor vendor: Lenovo driver: N/A pcie: speed: 8 GT/s lanes: 16 bus-ID: 04:00.5 chip-ID: 1022:15e2 class-ID: 0480 Device-3: AMD Family 17h/19h HD Audio vendor: Lenovo driver: snd_hda_intel v: kernel pcie: speed: 8 GT/s lanes: 16 bus-ID: 04:00.6 chip-ID: 1022:15e3 class-ID: 0403 API: ALSA v: k6.7.4-1-default status: kernel-api with: aoss type: oss-emulator Server-1: PipeWire v: 1.0.3 status: n/a (root, process) with: 1: pipewire-pulse status: active 2: wireplumber status: active 3: pipewire-alsa type: plugin 4: pw-jack type: plugin Network: Device-1: Realtek RTL8111/8168/8411 PCI Express Gigabit Ethernet vendor: Lenovo driver: r8169 v: kernel pcie: speed: Unknown lanes: 63 port: 3000 bus-ID: 01:00.0 chip-ID: 10ec:8168 class-ID: 0200 IF: enp1s0 state: down mac: <filter> Device-2: Realtek RTL8822CE 802.11ac PCIe Wireless Network Adapter vendor: Lenovo driver: rtw_8822ce v: N/A pcie: speed: 2.5 GT/s lanes: 1 port: 2000 bus-ID: 02:00.0 chip-ID: 10ec:c822 class-ID: 0280 IF: wlp2s0 state: up mac: <filter> IF-ID-1: wg-suse-laptop state: unknown speed: N/A duplex: N/A mac: N/A Bluetooth: Device-1: Realtek Bluetooth Radio driver: btusb v: 0.8 type: USB rev: 1.0 speed: 12 Mb/s lanes: 1 bus-ID: 3-4:2 chip-ID: 0bda:c123 class-ID: e001 serial: <filter> Report: btmgmt ID: hci0 rfk-id: 4 state: down bt-service: enabled,running rfk-block: hardware: no software: yes address: <filter> bt-v: 5.1 lmp-v: 10 Drives: Local Storage: total: 238.47 GiB used: 18.95 GiB (7.9%) ID-1: /dev/nvme0n1 vendor: Samsung model: MZALQ256HBJD-00BL2 size: 238.47 GiB speed: 31.6 Gb/s lanes: 4 tech: SSD serial: <filter> fw-rev: 5L2QFXM7 temp: 23.9 C scheme: GPT Partition: ID-1: / size: 219.97 GiB used: 18.83 GiB (8.6%) fs: btrfs dev: /dev/dm-1 mapped: suse-system ID-2: /boot/efi size: 511 MiB used: 4.7 MiB (0.9%) fs: vfat dev: /dev/nvme0n1p1 ID-3: /home size: 219.97 GiB used: 18.83 GiB (8.6%) fs: btrfs dev: /dev/dm-1 mapped: suse-system ID-4: /opt size: 219.97 GiB used: 18.83 GiB (8.6%) fs: btrfs dev: /dev/dm-1 mapped: suse-system ID-5: /var size: 219.97 GiB used: 18.83 GiB (8.6%) fs: btrfs dev: /dev/dm-1 mapped: suse-system Swap: ID-1: swap-1 type: partition size: 2 GiB used: 0 KiB (0.0%) priority: -2 dev: /dev/dm-2 mapped: suse-swap Sensors: System Temperatures: cpu: 33.6 C mobo: N/A gpu: amdgpu temp: 33.0 C Fan Speeds (rpm): N/A Info: Processes: 310 Power: uptime: 11h 49m states: freeze,mem,disk suspend: deep wakeups: 1 hibernate: platform Init: systemd v: 254 default: graphical Packages: pm: flatpak pkgs: 8 Compilers: gcc: 13.2.1 Shell: Sudo (sudo) v: 1.9.15p5 default: Bash v: 5.2.26 running-in: pty pts/1 inxi: 3.3.33
Attachment:
relevant_journal.log.xz
Description: application/xz