[REGRESSION][BISECTED] Scheduling errors with the AMD FX 8300 CPU

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



Hello Thomas,

Tim reports a regression on his AMD FX 8300 CPU that causes him
scheduling errors (see dmesg below), initially on the latest stable
kernel from Arch Linux. The issue reproduces by simply booting the
kernel on his hardware. He also reported some ATA related errors (also
attached below the dmesg), of which I dont know whether they are
relevant or not.

We have debugged the issue together in the ticket in our bugtracker[0]
and collectively bisected it down to the following commit:

    c749ce393b8f ("x86/cpu: Use common topology code for AMD")

Tim (in CC) offered to be available for further debugging in this
thread.

Reported-by: Tim Teichmann <teichmanntim@xxxxxxxxxx>
Bisected-by: Christian Heusel <christian@xxxxxxxxx>

Cheers,
Chris


[0]: https://gitlab.archlinux.org/archlinux/packaging/packages/linux/-/issues/56

---

#regzbot link: https://gitlab.archlinux.org/archlinux/packaging/packages/linux/-/issues/56
#regzbot introduced: c749ce393b8f 
#regzbot title: Scheduling errors with the AMD FX 8300 CPU

---

dmesg output:

May 23 23:36:49 archlinux kernel: smp: Bringing up secondary CPUs ...
May 23 23:36:49 archlinux kernel: smpboot: x86: Booting SMP configuration:
May 23 23:36:49 archlinux kernel: .... node  #0, CPUs:      #2 #4 #6
May 23 23:36:49 archlinux kernel: __common_interrupt: 2.55 No irq handler for vector
May 23 23:36:49 archlinux kernel: __common_interrupt: 4.55 No irq handler for vector
May 23 23:36:49 archlinux kernel: __common_interrupt: 6.55 No irq handler for vector
May 23 23:36:49 archlinux kernel:  #1 #3 #5 #7
May 23 23:36:49 archlinux kernel: ------------[ cut here ]------------
May 23 23:36:49 archlinux kernel: WARNING: CPU: 1 PID: 0 at kernel/sched/core.c:6482 sched_cpu_starting+0x183/0x250
May 23 23:36:49 archlinux kernel: Modules linked in:
May 23 23:36:49 archlinux kernel: CPU: 1 PID: 0 Comm: swapper/1 Not tainted 6.9.1-arch1-2 #1 06928436e5a6b4805e171d14d8efa397d7db9ad0
May 23 23:36:49 archlinux kernel: Hardware name: Gigabyte Technology Co., Ltd. GA-78LMT-USB3 R2/GA-78LMT-USB3 R2, BIOS F1 11/08/2017
May 23 23:36:49 archlinux kernel: RIP: 0010:sched_cpu_starting+0x183/0x250
May 23 23:36:49 archlinux kernel: Code: 00 8b 0d b0 e0 10 02 39 c8 0f 83 71 ff ff ff 48 63 d0 48 8b 3c d5 20 53 71 a6 4c 01 e7 39 c3 75 c7 4c 89 bf 48 0d 00 00 eb c7 <0f> 0b eb c3 be 04 00>
May 23 23:36:49 archlinux kernel: RSP: 0000:ffffa6a3c00cfe38 EFLAGS: 00010087
May 23 23:36:49 archlinux kernel: RAX: 0000000000000002 RBX: 0000000000000001 RCX: 0000000000000008
May 23 23:36:49 archlinux kernel: RDX: 0000000000000002 RSI: fffffffffffffffc RDI: ffff9367aeb36540
May 23 23:36:49 archlinux kernel: RBP: ffff9367aea99b00 R08: ffff9367aea99b00 R09: 0000000000000003
May 23 23:36:49 archlinux kernel: R10: ffff9367aea99b00 R11: 0000000000000006 R12: 0000000000036540
May 23 23:36:49 archlinux kernel: R13: 0000000000036540 R14: 0000000000000001 R15: ffff9367aea36540
May 23 23:36:49 archlinux kernel: FS:  0000000000000000(0000) GS:ffff9367aea80000(0000) knlGS:0000000000000000
May 23 23:36:49 archlinux kernel: CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
May 23 23:36:49 archlinux kernel: CR2: 0000000000000000 CR3: 000000036dc20000 CR4: 00000000000406f0
May 23 23:36:49 archlinux kernel: Call Trace:
May 23 23:36:49 archlinux kernel:  <TASK>
May 23 23:36:49 archlinux kernel:  ? sched_cpu_starting+0x183/0x250
May 23 23:36:49 archlinux kernel:  ? __warn.cold+0x8e/0xe8
May 23 23:36:49 archlinux kernel:  ? sched_cpu_starting+0x183/0x250
May 23 23:36:49 archlinux kernel:  ? report_bug+0xff/0x140
May 23 23:36:49 archlinux kernel:  ? handle_bug+0x3c/0x80
May 23 23:36:49 archlinux kernel:  ? exc_invalid_op+0x17/0x70
May 23 23:36:49 archlinux kernel:  ? asm_exc_invalid_op+0x1a/0x20
May 23 23:36:49 archlinux kernel:  ? sched_cpu_starting+0x183/0x250
May 23 23:36:49 archlinux kernel:  ? sched_cpu_starting+0x15a/0x250
May 23 23:36:49 archlinux kernel:  ? __pfx_sched_cpu_starting+0x10/0x10
May 23 23:36:49 archlinux kernel:  cpuhp_invoke_callback+0x122/0x410
May 23 23:36:49 archlinux kernel:  __cpuhp_invoke_callback_range+0x64/0xc0
May 23 23:36:49 archlinux kernel:  start_secondary+0x9c/0x140
May 23 23:36:49 archlinux kernel:  common_startup_64+0x13e/0x141
May 23 23:36:49 archlinux kernel:  </TASK>
May 23 23:36:49 archlinux kernel: ---[ end trace 0000000000000000 ]---
May 23 23:36:49 archlinux kernel: __common_interrupt: 1.55 No irq handler for vector
May 23 23:36:49 archlinux kernel: __common_interrupt: 3.55 No irq handler for vector
May 23 23:36:49 archlinux kernel: __common_interrupt: 5.55 No irq handler for vector
May 23 23:36:49 archlinux kernel: __common_interrupt: 7.55 No irq handler for vector
May 23 23:36:49 archlinux kernel: smp: Brought up 1 node, 8 CPUs
May 23 23:36:49 archlinux kernel: smpboot: Total of 8 processors activated (53173.28 BogoMIPS)
May 23 23:36:49 archlinux kernel: ------------[ cut here ]------------
May 23 23:36:49 archlinux kernel: WARNING: CPU: 0 PID: 1 at kernel/sched/topology.c:2408 build_sched_domains+0x76b/0x12b0
May 23 23:36:49 archlinux kernel: Modules linked in:
May 23 23:36:49 archlinux kernel: CPU: 0 PID: 1 Comm: swapper/0 Tainted: G        W          6.9.1-arch1-2 #1 06928436e5a6b4805e171d14d8efa397d7db9ad0
May 23 23:36:49 archlinux kernel: Hardware name: Gigabyte Technology Co., Ltd. GA-78LMT-USB3 R2/GA-78LMT-USB3 R2, BIOS F1 11/08/2017
May 23 23:36:49 archlinux kernel: RIP: 0010:build_sched_domains+0x76b/0x12b0
May 23 23:36:49 archlinux kernel: Code: 63 4d 14 39 34 8a 0f 8e 73 fe ff ff 25 e9 ef ff ff 80 cc 04 41 89 46 3c e9 62 fe ff ff 41 c7 46 30 01 00 00 00 e9 55 fe ff ff <0f> 0b bb f4 ff ff ff>
May 23 23:36:49 archlinux kernel: RSP: 0018:ffffa6a3c001fe10 EFLAGS: 00010202
May 23 23:36:49 archlinux kernel: RAX: 00000000ffffff01 RBX: 0000000000000000 RCX: 00000000ffffff01
May 23 23:36:49 archlinux kernel: RDX: 00000000fffffff8 RSI: 0000000000000003 RDI: ffff9367aea19b00
May 23 23:36:49 archlinux kernel: RBP: ffff936480367a00 R08: ffff9367aea19b00 R09: 0000000000000000
May 23 23:36:49 archlinux kernel: R10: ffffa6a3c001fdd8 R11: 0000000000000000 R12: 0000000000000001
May 23 23:36:49 archlinux kernel: R13: ffff9367aea99b00 R14: 0000000000000001 R15: ffff936480942e80
May 23 23:36:49 archlinux kernel: FS:  0000000000000000(0000) GS:ffff9367aea00000(0000) knlGS:0000000000000000
May 23 23:36:49 archlinux kernel: CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
May 23 23:36:49 archlinux kernel: CR2: ffff9366eea01000 CR3: 000000036dc20000 CR4: 00000000000406f0
May 23 23:36:49 archlinux kernel: Call Trace:
May 23 23:36:49 archlinux kernel:  <TASK>
May 23 23:36:49 archlinux kernel:  ? build_sched_domains+0x76b/0x12b0
May 23 23:36:49 archlinux kernel:  ? __warn.cold+0x8e/0xe8
May 23 23:36:49 archlinux kernel:  ? build_sched_domains+0x76b/0x12b0
May 23 23:36:49 archlinux kernel:  ? report_bug+0xff/0x140
May 23 23:36:49 archlinux kernel:  ? handle_bug+0x3c/0x80
May 23 23:36:49 archlinux kernel:  ? exc_invalid_op+0x17/0x70
May 23 23:36:49 archlinux kernel:  ? asm_exc_invalid_op+0x1a/0x20
May 23 23:36:49 archlinux kernel:  ? build_sched_domains+0x76b/0x12b0
May 23 23:36:49 archlinux kernel:  ? kmalloc_trace+0x13a/0x320
May 23 23:36:49 archlinux kernel:  sched_init_smp+0x3e/0xc0
May 23 23:36:49 archlinux kernel:  ? stop_machine+0x30/0x40
May 23 23:36:49 archlinux kernel:  kernel_init_freeable+0x109/0x250
May 23 23:36:49 archlinux kernel:  ? __pfx_kernel_init+0x10/0x10
May 23 23:36:49 archlinux kernel:  kernel_init+0x1a/0x140
May 23 23:36:49 archlinux kernel:  ret_from_fork+0x34/0x50
May 23 23:36:49 archlinux kernel:  ? __pfx_kernel_init+0x10/0x10
May 23 23:36:49 archlinux kernel:  ret_from_fork_asm+0x1a/0x30
May 23 23:36:49 archlinux kernel:  </TASK>
May 23 23:36:49 archlinux kernel: ---[ end trace 0000000000000000 ]---

---

ATA stuff:

May 23 23:36:59 archlinux kernel: ata2.00: exception Emask 0x10 SAct 0x1fffe000 SErr 0x40d0002 action 0xe frozen
May 23 23:36:59 archlinux kernel: ata2.00: irq_stat 0x00000040, connection status changed
May 23 23:36:59 archlinux kernel: ata2: SError: { RecovComm PHYRdyChg CommWake 10B8B DevExch }
May 23 23:36:59 archlinux kernel: ata2.00: failed command: WRITE FPDMA QUEUED
May 23 23:36:59 archlinux kernel: ata2.00: cmd 61/10:68:10:f8:07/00:00:00:00:00/40 tag 13 ncq dma 8192 out
                                           res 40/00:01:00:00:00/00:00:00:00:00/00 Emask 0x10 (ATA bus error)
May 23 23:36:59 archlinux kernel: ata2.00: status: { DRDY }
May 23 23:36:59 archlinux kernel: ata2.00: failed command: WRITE FPDMA QUEUED
May 23 23:36:59 archlinux kernel: ata2.00: cmd 61/08:70:28:f8:07/00:00:00:00:00/40 tag 14 ncq dma 4096 out
                                           res 40/00:00:00:00:00/00:00:00:00:00/00 Emask 0x10 (ATA bus error)
May 23 23:36:59 archlinux kernel: ata2.00: status: { DRDY }
May 23 23:36:59 archlinux kernel: ata2.00: failed command: WRITE FPDMA QUEUED
May 23 23:36:59 archlinux kernel: ata2.00: cmd 61/18:78:50:f8:07/00:00:00:00:00/40 tag 15 ncq dma 12288 out
                                           res 40/00:68:00:00:00/00:00:00:00:00/40 Emask 0x10 (ATA

Attachment: signature.asc
Description: PGP signature


[Index of Archives]     [Linux Kernel]     [Kernel Development Newbies]     [Linux USB Devel]     [Video for Linux]     [Linux Audio Users]     [Yosemite Hiking]     [Linux Kernel]     [Linux SCSI]

  Powered by Linux