Re: [bug report] RIP: 0010:throtl_trim_slice+0xc6/0x320 caused kernel panic

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



Hello, Yu , Ming

I check again and confirmed that the patch has been merged into the
latest master branch, and re-ran the test,this issue can no longer be
triggered

Thanks you!

On Tue, Oct 31, 2023 at 8:30 PM Yu Kuai <yukuai1@xxxxxxxxxxxxxxx> wrote:
>
> Hi,
>
> 在 2023/10/31 18:28, Changhui Zhong 写道:
> > Hi,Yu Kuai
> >
> > the "master" branch I use, already contain the patch you provide, but
> > I  still can triggered this issue
> >
> > Thanks,
> >
> >
> > On Tue, Oct 31, 2023 at 11:29 AM Yu Kuai <yukuai1@xxxxxxxxxxxxxxx> wrote:
> >>
> >> Hi,
> >>
> >> 在 2023/10/31 11:08, Changhui Zhong 写道:
> >>> Hello,
> >>>
> >>> triggered below issue with branch 'master',please help check,
> >>>
> >>> INFO: HEAD of cloned kernel
> >>> commit 888cf78c29e223fd808682f477c18cf8f61ad995
>
> It's right latest master contain the fix patch, but this commit doesn't
> contain it, can you please check this?
>
> Thanks,
> Kuai
>
> >>> Merge: 09a4a03c073b 6e6c6d6bc6c9
> >>> Author: Linus Torvalds <torvalds@xxxxxxxxxxxxxxxxxxxx>
> >>> Date:   Fri Oct 27 05:43:05 2023 -1000
> >>>
> >>>       Merge tag 'iommu-fix-v6.6-rc7' of
> >>> git://git.kernel.org/pub/scm/linux/kernel/git/joro/iommu
> >>>
> >>>       Pull iommu fix from Joerg Roedel:
> >>>
> >>>        - Fix boot regression for Sapphire Rapids with Intel VT-d driver
> >>>
> >>>       * tag 'iommu-fix-v6.6-rc7' of
> >>> git://git.kernel.org/pub/scm/linux/kernel/git/joro/iommu:
> >>>         iommu: Avoid unnecessary cache invalidations
> >>>
> >>> steps:
> >>> echo "+cpuset +cpu +io" > /sys/fs/cgroup/cgroup.subtree_control
> >>> mkdir /sys/fs/cgroup/test
> >>> MAJ=$(ls -l /dev/"$disk" | awk -F ',' '{print $1}' | awk -F ' ' '{print $NF}')
> >>> MIN=$(ls -l /dev/"$disk" | awk -F ',' '{print $2}' | awk -F ' ' '{print $1}')
> >>> echo "$MAJ:$MIN wbps=1024" > /sys/fs/cgroup/test/io.max
> >>> echo $$ > /sys/fs/cgroup/test/cgroup.procs
> >>> dd if=/dev/zero of=/dev/$disk bs=10k count=1 oflag=direct &
> >>> dd if=/dev/zero of=/dev/$disk bs=10k count=1 oflag=direct &
> >>> wait
> >>>
> >>> console log:
> >>> [ 6502.907379] divide error: 0000 [#1] PREEMPT SMP NOPTI
> >> This is already fixed by:
> >>
> >> https://lore.kernel.org/all/20231020223617.2739774-1-khazhy@xxxxxxxxxx/
> >>
> >> Thanks,
> >> Kuai
> >>> [ 6502.912447] CPU: 6 PID: 0 Comm: swapper/6 Not tainted 6.6.0-rc7+ #1
> >>> [ 6502.918711] Hardware name: Dell Inc. PowerEdge R650xs/0PPTY2, BIOS
> >>> 1.4.4 10/07/2021
> >>> [ 6502.926364] RIP: 0010:throtl_trim_slice+0xc6/0x320
> >>> [ 6502.931156] Code: 00 00 48 89 e8 48 f7 f1 48 29 d5 74 9f 40 0f b6
> >>> f6 48 89 df 89 34 24 e8 f8 f5 ff ff 8b 34 24 b9 e8 03 00 00 48 89 df
> >>> 48 f7 e5 <48> f7 f1 49 03 85 f8 01 00 00 49 89 c7 e8 78 e0 ff ff ba ff
> >>> ff ff
> >>> [ 6502.949902] RSP: 0018:ffa00000006ccdc0 EFLAGS: 00010887
> >>> [ 6502.955128] RAX: ffffffffffffd8f0 RBX: ff110001061aa000 RCX: 00000000000003e8
> >>> [ 6502.962260] RDX: 000000000000270f RSI: 0000000000000001 RDI: ff110001061aa000
> >>> [ 6502.969394] RBP: 0000000000002710 R08: ffffffffffffffff R09: ff1100010fa74818
> >>> [ 6502.976527] R10: ff110001061aa010 R11: ff1100010fa74830 R12: 0000000000000001
> >>> [ 6502.983660] R13: ff110001061aa008 R14: 0000000000000001 R15: 0000000000000021
> >>> [ 6502.990793] FS:  0000000000000000(0000) GS:ff1100046fd80000(0000)
> >>> knlGS:0000000000000000
> >>> [ 6502.998879] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
> >>> [ 6503.004624] CR2: 000055d60b15b000 CR3: 00000001dce20005 CR4: 0000000000771ee0
> >>> [ 6503.011757] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
> >>> [ 6503.018891] DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400
> >>> [ 6503.026023] PKRU: 55555554
> >>> [ 6503.028735] Call Trace:
> >>> [ 6503.031187]  <IRQ>
> >>> [ 6503.033207]  ? die+0x33/0x90
> >>> [ 6503.036093]  ? do_trap+0xe0/0x110
> >>> [ 6503.039413]  ? throtl_trim_slice+0xc6/0x320
> >>> [ 6503.043599]  ? do_error_trap+0x65/0x80
> >>> [ 6503.047351]  ? throtl_trim_slice+0xc6/0x320
> >>> [ 6503.051538]  ? exc_divide_error+0x36/0x50
> >>> [ 6503.055549]  ? throtl_trim_slice+0xc6/0x320
> >>> [ 6503.059735]  ? asm_exc_divide_error+0x16/0x20
> >>> [ 6503.064098]  ? throtl_trim_slice+0xc6/0x320
> >>> [ 6503.068281]  tg_dispatch_one_bio+0xf0/0x1e0
> >>> [ 6503.072469]  throtl_pending_timer_fn+0x1e5/0x510
> >>> [ 6503.077086]  ? __pfx_throtl_pending_timer_fn+0x10/0x10
> >>> [ 6503.082226]  ? __pfx_throtl_pending_timer_fn+0x10/0x10
> >>> [ 6503.087365]  call_timer_fn+0x24/0x130
> >>> [ 6503.091034]  __run_timers.part.0+0x1ee/0x280
> >>> [ 6503.095304]  ? __hrtimer_run_queues+0x121/0x2b0
> >>> [ 6503.099839]  ? sched_clock+0xc/0x30
> >>> [ 6503.103329]  run_timer_softirq+0x26/0x50
> >>> [ 6503.107256]  __do_softirq+0xc8/0x2ab
> >>> [ 6503.110834]  __irq_exit_rcu+0xa1/0xc0
> >>> [ 6503.114501]  sysvec_apic_timer_interrupt+0x72/0x90
> >>> [ 6503.119295]  </IRQ>
> >>> [ 6503.121400]  <TASK>
> >>> [ 6503.123507]  asm_sysvec_apic_timer_interrupt+0x16/0x20
> >>> [ 6503.128646] RIP: 0010:cpuidle_enter_state+0xc2/0x420
> >>> [ 6503.133611] Code: 00 e8 22 cf 4e ff e8 4d f4 ff ff 8b 53 04 49 89
> >>> c5 0f 1f 44 00 00 31 ff e8 0b a8 4d ff 45 84 ff 0f 85 3a 02 00 00 fb
> >>> 45 85 f6 <0f> 88 6e 01 00 00 49 63 d6 4c 2b 2c 24 48 8d 04 52 48 8d 04
> >>> 82 49
> >>> [ 6503.152357] RSP: 0018:ffa00000001ebe80 EFLAGS: 00000206
> >>> [ 6503.157582] RAX: ff1100046fdb2100 RBX: ff1100046fdbc7e0 RCX: 000000000000001f
> >>> [ 6503.164707] RDX: 0000000000000006 RSI: 000000003d1877c2 RDI: 0000000000000000
> >>> [ 6503.171842] RBP: 0000000000000003 R08: 000005ea137b9428 R09: 0000000000000000
> >>> [ 6503.178972] R10: 00000000000003dc R11: ff1100046fdb0be4 R12: ffffffffbe2b1900
> >>> [ 6503.186104] R13: 000005ea137b9428 R14: 0000000000000003 R15: 0000000000000000
> >>> [ 6503.193232]  cpuidle_enter+0x29/0x40
> >>> [ 6503.196811]  cpuidle_idle_call+0xfa/0x160
> >>> [ 6503.200822]  do_idle+0x7b/0xe0
> >>> [ 6503.203882]  cpu_startup_entry+0x26/0x30
> >>> [ 6503.207808]  start_secondary+0x115/0x140
> >>> [ 6503.211735]  secondary_startup_64_no_verify+0x17d/0x18b
> >>> [ 6503.216961]  </TASK>
> >>> [ 6503.219154] Modules linked in: binfmt_misc dm_crypt raid10 raid1
> >>> raid0 dm_raid raid456 async_raid6_recov async_memcpy async_pq
> >>> async_xor xor async_tx raid6_pq loop nf_tables nfnetlink tls
> >>> rpcsec_gss_krb5 auth_rpcgss nfsv4 dns_resolver nfs lockd grace fscache
> >>> netfs rfkill sunrpc ipmi_ssif vfat fat intel_rapl_msr
> >>> intel_rapl_common intel_uncore_frequency intel_uncore_frequency_common
> >>> i10nm_edac nfit libnvdimm x86_pkg_temp_thermal intel_powerclamp
> >>> coretemp kvm_intel kvm mgag200 irqbypass rapl iTCO_wdt i2c_algo_bit
> >>> acpi_ipmi drm_shmem_helper iTCO_vendor_support dax_hmem intel_cstate
> >>> ipmi_si cxl_acpi drm_kms_helper mei_me dell_smbios i2c_i801
> >>> ipmi_devintf isst_if_mbox_pci isst_if_mmio dcdbas cxl_core
> >>> intel_uncore dell_wmi_descriptor wmi_bmof pcspkr mei isst_if_common
> >>> intel_pch_thermal intel_vsec i2c_smbus ipmi_msghandler
> >>> acpi_power_meter drm fuse xfs libcrc32c sd_mod t10_pi sg ahci libahci
> >>> crct10dif_pclmul crc32_pclmul crc32c_intel libata tg3
> >>> ghash_clmulni_intel wmi dm_mirror dm_region_hash dm_log dm_mod
> >>> [ 6503.219202]  [last unloaded: scsi_debug]
> >>> [ 6503.311377] ---[ end trace 0000000000000000 ]---
> >>> [ 6503.346804] RIP: 0010:throtl_trim_slice+0xc6/0x320
> >>> [ 6503.351597] Code: 00 00 48 89 e8 48 f7 f1 48 29 d5 74 9f 40 0f b6
> >>> f6 48 89 df 89 34 24 e8 f8 f5 ff ff 8b 34 24 b9 e8 03 00 00 48 89 df
> >>> 48 f7 e5 <48> f7 f1 49 03 85 f8 01 00 00 49 89 c7 e8 78 e0 ff ff ba ff
> >>> ff ff
> >>> [ 6503.370344] RSP: 0018:ffa00000006ccdc0 EFLAGS: 00010887
> >>> [ 6503.375569] RAX: ffffffffffffd8f0 RBX: ff110001061aa000 RCX: 00000000000003e8
> >>> [ 6503.382703] RDX: 000000000000270f RSI: 0000000000000001 RDI: ff110001061aa000
> >>> [ 6503.389836] RBP: 0000000000002710 R08: ffffffffffffffff R09: ff1100010fa74818
> >>> [ 6503.396969] R10: ff110001061aa010 R11: ff1100010fa74830 R12: 0000000000000001
> >>> [ 6503.404100] R13: ff110001061aa008 R14: 0000000000000001 R15: 0000000000000021
> >>> [ 6503.411235] FS:  0000000000000000(0000) GS:ff1100046fd80000(0000)
> >>> knlGS:0000000000000000
> >>> [ 6503.419320] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
> >>> [ 6503.425067] CR2: 000055d60b15b000 CR3: 00000001dce20005 CR4: 0000000000771ee0
> >>> [ 6503.432199] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
> >>> [ 6503.439332] DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400
> >>> [ 6503.446463] PKRU: 55555554
> >>> [ 6503.449176] Kernel panic - not syncing: Fatal exception in interrupt
> >>> [ 6503.455610] Kernel Offset: disabled
> >>> [ 6503.488281] ---[ end Kernel panic - not syncing: Fatal exception in
> >>> interrupt ]---
> >>>
> >>> Thanks,
> >>>
> >>> .
> >>>
> >>
> >
> > .
> >
>


-- 

Changhui  Zhong

Quality Engineer,Kernel QE

Red Hat

Red Hat Beijing - Raycom





[Index of Archives]     [Linux RAID]     [Linux SCSI]     [Linux ATA RAID]     [IDE]     [Linux Wireless]     [Linux Kernel]     [ATH6KL]     [Linux Bluetooth]     [Linux Netdev]     [Kernel Newbies]     [Security]     [Git]     [Netfilter]     [Bugtraq]     [Yosemite News]     [MIPS Linux]     [ARM Linux]     [Linux Security]     [Device Mapper]

  Powered by Linux