Re: Fwd: [bug report] WARNING at block/genhd.c:537 device_add_disk+0x1ad/0x390 observed with blktests block/001

On Wed, Sep 15, 2021 at 07:13:55PM +0800, Yi Zhang wrote:
> Hi Ming
> Just want to let you know I found this issue while running blktests
> block/001 on the storage-qe-36 server. I sent this report to linux-block,
> but the mail was blocked. :(

That is sad, :-(

BTW, the following patch can fix the issue:

https://lore.kernel.org/linux-block/20210915092547.990285-1-ming.lei@xxxxxxxxxx/T/#u


> 
> ---------- Forwarded message ---------
> From: Yi Zhang <yi.zhang@xxxxxxxxxx>
> Date: Wed, Sep 15, 2021 at 1:28 PM
> Subject: [bug report] WARNING at block/genhd.c:537
> device_add_disk+0x1ad/0x390 observed with blktests block/001
> To: linux-block <linux-block@xxxxxxxxxxxxxxx>
> 
> 
> Hello
> 
> The following kernel warnings and panic were triggered on 5.15.0-rc1 with
> blktests block/001. Feel free to let me know if you need more info.
> 
> [  138.287801] run blktests block/001 at 2021-09-15 01:15:33
> [  138.310004] scsi_debug:sdebug_driver_probe: scsi_debug: trim poll_queues
> to 0. poll_q/nr_hw = (0/1)
> [  138.319283] sd 15:0:0:0: Power-on or device reset occurred
> [  138.319292] scsi_debug:sdebug_driver_probe: scsi_debug: trim poll_queues
> to 0. poll_q/nr_hw = (0/1)
> [  138.333918] scsi_debug:sdebug_driver_probe: scsi_debug: trim poll_queues
> to 0. poll_q/nr_hw = (0/1)
> [  138.334012] sd 16:0:0:0: Power-on or device reset occurred
> [  138.348549] scsi_debug:sdebug_driver_probe: scsi_debug: trim poll_queues
> to 0. poll_q/nr_hw = (0/1)
> [  138.348629] sd 17:0:0:0: Power-on or device reset occurred
> [  138.363189] sd 18:0:0:0: Power-on or device reset occurred
> [  138.461052] ------------[ cut here ]------------
> [  138.465670] WARNING: CPU: 1 PID: 1161 at block/genhd.c:537
> device_add_disk+0x1ad/0x390
> [  138.473591] Modules linked in: dm_service_time scsi_debug
> rpcsec_gss_krb5 auth_rpcgss nfsv4 dns_resolver nfs lockd grace fscache
> netfs rfkill sunrpc vfat fat dm_multipath intel_rapl_msr intel_rapl_common
> isst_if_common skx_edac x86_pkg_temp_thermal intel_powerclamp coretemp
> kvm_intel kvm mgag200 i2c_algo_bit drm_kms_helper irqbypass iTCO_wdt
> crct10dif_pclmul iTCO_vendor_support crc32_pclmul ipmi_ssif syscopyarea
> sysfillrect sysimgblt ghash_clmulni_intel fb_sys_fops rapl acpi_ipmi drm
> ipmi_si intel_cstate mei_me ipmi_devintf dax_pmem_compat intel_uncore mei
> i2c_i801 pcspkr wmi_bmof nd_pmem device_dax lpc_ich ipmi_msghandler
> intel_pch_thermal i2c_smbus nd_btt dax_pmem_core acpi_power_meter xfs
> libcrc32c sd_mod t10_pi sg ahci libahci libata nfit tg3 megaraid_sas
> crc32c_intel libnvdimm wmi dm_mirror dm_region_hash dm_log dm_mod
> [  138.533963] BUG: kernel NULL pointer dereference, address:
> 0000000000000098
> [  138.546449] CPU: 1 PID: 1161 Comm: multipathd Tainted: G S        I
>   5.15.0-rc1 #1
> [  138.553403] #PF: supervisor read access in kernel mode
> [  138.553404] #PF: error_code(0x0000) - not-present page
> [  138.553405] PGD 0 P4D 0
> [  138.561405] Hardware name: Dell Inc. PowerEdge R640/06NR82, BIOS 2.11.2
> 004/21/2021
> [  138.566542]
> [  138.566543] Oops: 0000 [#1] SMP NOPTI
> [  138.566545] CPU: 2 PID: 0 Comm: swapper/2 Tainted: G S        I
> 5.15.0-rc1 #1
> [  138.571683] RIP: 0010:device_add_disk+0x1ad/0x390
> [  138.574221] Hardware name: Dell Inc. PowerEdge R640/06NR82, BIOS 2.11.2
> 004/21/2021
> [  138.574222] RIP: 0010:wb_timer_fn+0x37/0x320
> [  138.581872] Code: 81 3b 03 01 00 00 0f 85 39 ff ff ff e9 25 ff ff ff 0f
> 0b bd ea ff ff ff e9 30 ff ff ff 4c 89 ef e8 a8 b6 1b 00 e9 f8 fe ff ff
> <0f> 0b bd 01 00 00 00 e9 17 ff ff ff 0f 0b bd ea ff ff ff e9 0b ff
> [  138.583363] Code: 48 8b 5f 60 4c 8b 67 50 8b ab 98 00 00 00 8b 93 b8 00
> 00 00 8b 83 d8 00 00 00 4c 8b 6b 28 01 d5 01 c5 48 8b 43 60 48 8b 40 78
> <4c> 8b b0 98 00 00 00 4d 85 ed 0f 84 c0 00 00 00 48 83 7b 30 00 0f
> [  138.583365] RSP: 0018:ffffb4d7c0300e80 EFLAGS: 00010246
> [  138.587029] RSP: 0018:ffffb4d7c0e2bc40 EFLAGS: 00010282
> [  138.594672] RAX: 0000000000000000 RBX: ffff9fd941d74a00 RCX:
> 0000000000000020
> [  138.594674] RDX: 0000000000000000 RSI: 0000000000000000 RDI:
> ffff9fd9418cfd80
> [  138.594674] RBP: 0000000000000000 R08: 0000000000000000 R09:
> 0000000000000000
> [  138.594675] R10: 000000000000001f R11: 00000000000003a8 R12:
> ffff9fd9420a0cc0
> [  138.599381]
> [  138.607032] R13: 0000000000000000 R14: ffff9fd9418cfd90 R15:
> ffff9fe87fa97f70
> [  138.607033] FS:  0000000000000000(0000) GS:ffff9fe87fa80000(0000)
> knlGS:0000000000000000
> [  138.607035] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
> [  138.611306] RAX: 0000000000000000 RBX: ffff9fd9e4187400 RCX:
> dead000000000122
> [  138.630049] CR2: 0000000000000098 CR3: 000000010371e004 CR4:
> 00000000007706e0
> [  138.630051] DR0: 0000000000000000 DR1: 0000000000000000 DR2:
> 0000000000000000
> [  138.630051] DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7:
> 0000000000000400
> [  138.630052] PKRU: 55555554
> [  138.648797] RDX: ffff9fd98299bf00 RSI: ffffffffa8eac593 RDI:
> 0000000000000000
> [  138.654021] Call Trace:
> [  138.654023]  <IRQ>
> [  138.654025]  ? blk_stat_free_callback_rcu+0x30/0x30
> [  138.659248] RBP: 00000000fffffffe R08: 0000000000000001 R09:
> ffffffffa8dfdb00
> [  138.666378]  call_timer_fn+0x24/0xf0
> [  138.673512] R10: ffff9fd94f701780 R11: 0000000000000001 R12:
> ffff9fda40024800
> [  138.680643]  run_timer_softirq+0x1c7/0x3d0
> [  138.680646]  ? update_process_times+0xb0/0xc0
> [  138.687777] R13: ffff9fda40024840 R14: ffff9fda40024800 R15:
> 0000000000000000
> [  138.689276]  ? tick_sched_handle.isra.24+0x1f/0x60
> [  138.696408] FS:  00007f473d28d700(0000) GS:ffff9fe87fa40000(0000)
> knlGS:0000000000000000
> [  138.704494]  ? timerqueue_add+0x6f/0x80
> [  138.704496]  ? enqueue_hrtimer+0x2f/0x70
> [  138.710242] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
> [  138.717371]  ? ktime_get+0x3b/0xa0
> [  138.717373]  __do_softirq+0xc6/0x285
> [  138.724505] CR2: 00007f347b9e0590 CR3: 000000010c092006 CR4:
> 00000000007706e0
> [  138.731638]  irq_exit_rcu+0xa6/0xc0
> [  138.738771] DR0: 0000000000000000 DR1: 0000000000000000 DR2:
> 0000000000000000
> [  138.741482]  sysvec_apic_timer_interrupt+0x6e/0x90
> [  138.741485]  </IRQ>
> [  138.748614] DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7:
> 0000000000000400
> [  138.751056]  asm_sysvec_apic_timer_interrupt+0x12/0x20
> [  138.753076] PKRU: 55555554
> [  138.757948] RIP: 0010:cpuidle_enter_state+0xd6/0x350
> [  138.765080] Call Trace:
> [  138.768652] Code: 49 89 c4 0f 1f 44 00 00 31 ff e8 05 05 98 ff 45 84 ff
> 74 12 9c 58 f6 c4 02 0f 85 32 02 00 00 31 ff e8 ce a4 9e ff fb 45 85 f6
> <0f> 88 e0 00 00 00 49 63 d6 4c 2b 24 24 48 8d 04 52 48 8d 04 82 49
> [  138.768653] RSP: 0018:ffffb4d7c01bbe80 EFLAGS: 00000202
> [  138.768654] RAX: ffff9fe87faa7c00 RBX: 0000000000000002 RCX:
> 000000000000001f
> [  138.775787]  dm_setup_md_queue+0xbe/0xf0 [dm_mod]
> [  138.779883] RDX: 0000002041447a1c RSI: 000000002c3ddb16 RDI:
> 0000000000000000
> [  138.779884] RBP: ffffd4d7bf680588 R08: 0000000000000002 R09:
> 0000000000027440
> [  138.779884] R10: 000000a2559bce5c R11: ffff9fe87faa6944 R12:
> 0000002041447a1c
> [  138.779885] R13: ffffffffaa4ca6a0 R14: 0000000000000002 R15:
> 0000000000000000
> [  138.784243]  table_load+0x1c0/0x2e0 [dm_mod]
> [  138.791376]  cpuidle_enter+0x29/0x40
> [  138.796169]  ? retrieve_status+0x1e0/0x1e0 [dm_mod]
> [  138.804251]  do_idle+0x257/0x2a0
> [  138.808092]  ctl_ioctl+0x1ad/0x420 [dm_mod]
> [  138.812009]  cpu_startup_entry+0x19/0x20
> [  138.812011]  start_secondary+0x116/0x150
> [  138.817758]  dm_ctl_ioctl+0xa/0x10 [dm_mod]
> [  138.821159]  secondary_startup_64_no_verify+0xc2/0xcb
> [  138.821162] Modules linked in:
> [  138.824740]  __x64_sys_ioctl+0x81/0xc0
> [  138.831872]  dm_service_time scsi_debug rpcsec_gss_krb5 auth_rpcgss
> nfsv4 dns_resolver
> [  138.835366]  do_syscall_64+0x37/0x80
> [  138.842496]  nfs lockd grace fscache netfs rfkill sunrpc vfat
> [  138.847290]  entry_SYSCALL_64_after_hwframe+0x44/0xae
> [  138.849386]  fat dm_multipath intel_rapl_msr intel_rapl_common
> isst_if_common skx_edac x86_pkg_temp_thermal
> [  138.856519] RIP: 0033:0x7f473b4dd62b
> [  138.861650]  intel_powerclamp coretemp kvm_intel kvm mgag200
> i2c_algo_bit drm_kms_helper irqbypass
> [  138.864362] Code: 0f 1e fa 48 8b 05 5d b8 2c 00 64 c7 00 26 00 00 00 48
> c7 c0 ff ff ff ff c3 66 0f 1f 44 00 00 f3 0f 1e fa b8 10 00 00 00 0f 05
> <48> 3d 01 f0 ff ff 73 01 c3 48 8b 0d 2d b8 2c 00 f7 d8 64 89 01 48
> [  138.869319]  iTCO_wdt crct10dif_pclmul iTCO_vendor_support crc32_pclmul
> ipmi_ssif syscopyarea sysfillrect
> [  138.871773] RSP: 002b:00007f473d28b428 EFLAGS: 00000206
> [  138.890517]  sysimgblt ghash_clmulni_intel fb_sys_fops rapl acpi_ipmi
> drm ipmi_si intel_cstate mei_me
> [  138.895745]  ORIG_RAX: 0000000000000010
> [  138.902874]  ipmi_devintf dax_pmem_compat intel_uncore mei i2c_i801
> pcspkr wmi_bmof nd_pmem
> [  138.907582] RAX: ffffffffffffffda RBX: 00007f473bf6b270 RCX:
> 00007f473b4dd62b
> [  138.914713]  device_dax lpc_ich ipmi_msghandler intel_pch_thermal
> i2c_smbus nd_btt dax_pmem_core acpi_power_meter
> [  138.921848] RDX: 00007f472401dbf0 RSI: 00000000c138fd09 RDI:
> 0000000000000005
> [  138.928978]  xfs libcrc32c sd_mod t10_pi sg ahci libahci libata nfit
> [  138.936111] RBP: 00007f473bfa6f83 R08: 00007f473d289180 R09:
> 00007f473c1b5058
> [  138.940373]  tg3 megaraid_sas crc32c_intel libnvdimm wmi dm_mirror
> dm_region_hash
> [  138.943956] R10: 0000000000000000 R11: 0000000000000206 R12:
> 00007f472401dbf0
> [  138.948834]  dm_log dm_mod
> [  138.948836] CR2: 0000000000000098
> [  138.948837] ---[ end trace 3235ef2b3e8a55ec ]---
> [  138.952068] R13: 00007f472401dca0 R14: 0000000000000000 R15:
> 00007f4724007530
> [  138.959447] RIP: 0010:wb_timer_fn+0x37/0x320
> [  138.961114] ---[ end trace 3235ef2b3e8a55ed ]---
> [  138.965030] Code: 48 8b 5f 60 4c 8b 67 50 8b ab 98 00 00 00 8b 93 b8 00
> 00 00 8b 83 d8 00 00 00 4c 8b 6b 28 01 d5 01 c5 48 8b 43 60 48 8b 40 78
> <4c> 8b b0 98 00 00 00 4d 85 ed 0f 84 c0 00 00 00 48 83 7b 30 00 0f
> [  138.965032] RSP: 0018:ffffb4d7c0300e80 EFLAGS: 00010246
> [  138.969219] kobject_add_internal failed for dm (error: -2 parent: dm-3)
> [  138.974269] RAX: 0000000000000000 RBX: ffff9fd941d74a00 RCX:
> 0000000000000020
> [  138.974271] RDX: 0000000000000000 RSI: 0000000000000000 RDI:
> ffff9fd9418cfd80
> [  138.974271] RBP: 0000000000000000 R08: 0000000000000000 R09:
> 0000000000000000
> [  138.974272] R10: 000000000000001f R11: 00000000000003a8 R12:
> ffff9fd9420a0cc0
> [  138.974273] R13: 0000000000000000 R14: ffff9fd9418cfd90 R15:
> ffff9fe87fa97f70
> [  138.975397] scsi 15:0:0:0: Direct-Access     Linux    scsi_debug
> 0190 PQ: 0 ANSI: 7
> [  138.975475] sd 15:0:0:0: Attached scsi generic sg1 type 0
> [  138.975496] sd 15:0:0:0: Power-on or device reset occurred
> [  138.976516] sd 15:0:0:0: [sdc] Read Capacity(16) failed: Result:
> hostbyte=DID_ERROR driverbyte=DRIVER_OK
> [  138.976518] sd 15:0:0:0: [sdc] Sense not available.
> [  138.976520] sd 15:0:0:0: [sdc] Read Capacity(10) failed: Result:
> hostbyte=DID_ERROR driverbyte=DRIVER_OK
> [  138.976521] sd 15:0:0:0: [sdc] Sense not available.
> [  138.976524] sd 15:0:0:0: [sdc] 0 512-byte logical blocks: (0 B/0 B)
> [  138.976525] sd 15:0:0:0: [sdc] 0-byte physical blocks
> [  138.976526] sd 15:0:0:0: [sdc] Write Protect is off
> [  138.976527] sd 15:0:0:0: [sdc] Mode Sense: 00 00 00 00
> [  138.976528] sd 15:0:0:0: [sdc] Asking for cache data failed
> [  138.976529] sd 15:0:0:0: [sdc] Assuming drive cache: write through
> [  138.977337] ------------[ cut here ]------------
> [  138.981081] FS:  0000000000000000(0000) GS:ffff9fe87fa80000(0000)
> knlGS:0000000000000000
> [  138.981082] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
> [  138.981083] CR2: 0000000000000098 CR3: 000000010371e004 CR4:
> 00000000007706e0
> [  138.981084] DR0: 0000000000000000 DR1: 0000000000000000 DR2:
> 0000000000000000
> [  138.988993] WARNING: CPU: 1 PID: 1161 at block/genhd.c:564
> del_gendisk+0x186/0x1c0
> [  138.992562] DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7:
> 0000000000000400
> [  138.992563] PKRU: 55555554
> [  138.992564] Kernel panic - not syncing: Fatal exception in interrupt
> [  139.493526] Kernel Offset: 0x27a00000 from 0xffffffff81000000
> (relocation range: 0xffffffff80000000-0xffffffffbfffffff)
> [  139.872338] ---[ end Kernel panic - not syncing: Fatal exception in
> interrupt ]---
> 
> 
> 
> -- 
> Best Regards,
>   Yi Zhang
> 
> 
> -- 
> Best Regards,
>   Yi Zhang

-- 
Ming



