Search Linux Wireless

Re: [PATCH] ath11k: Avoid false DEADLOCK warning reported by lockdep

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



Baochen Qiang <quic_bqiang@xxxxxxxxxxx> wrote:

> With CONFIG_LOCKDEP=y and CONFIG_DEBUG_SPINLOCK=y, lockdep reports
> below warning:
> 
> [  166.059415] ============================================
> [  166.059416] WARNING: possible recursive locking detected
> [  166.059418] 5.15.0-wt-ath+ #10 Tainted: G        W  O
> [  166.059420] --------------------------------------------
> [  166.059421] kworker/0:2/116 is trying to acquire lock:
> [  166.059423] ffff9905f2083160 (&srng->lock){+.-.}-{2:2}, at: ath11k_hal_reo_cmd_send+0x20/0x490 [ath11k]
> [  166.059440]
>                but task is already holding lock:
> [  166.059442] ffff9905f2083230 (&srng->lock){+.-.}-{2:2}, at: ath11k_dp_process_reo_status+0x95/0x2d0 [ath11k]
> [  166.059491]
>                other info that might help us debug this:
> [  166.059492]  Possible unsafe locking scenario:
> 
> [  166.059493]        CPU0
> [  166.059494]        ----
> [  166.059495]   lock(&srng->lock);
> [  166.059498]   lock(&srng->lock);
> [  166.059500]
>                 *** DEADLOCK ***
> 
> [  166.059501]  May be due to missing lock nesting notation
> 
> [  166.059502] 3 locks held by kworker/0:2/116:
> [  166.059504]  #0: ffff9905c0081548 ((wq_completion)events){+.+.}-{0:0}, at: process_one_work+0x1f6/0x660
> [  166.059511]  #1: ffff9d2400a5fe68 ((debug_obj_work).work){+.+.}-{0:0}, at: process_one_work+0x1f6/0x660
> [  166.059517]  #2: ffff9905f2083230 (&srng->lock){+.-.}-{2:2}, at: ath11k_dp_process_reo_status+0x95/0x2d0 [ath11k]
> [  166.059532]
>                stack backtrace:
> [  166.059534] CPU: 0 PID: 116 Comm: kworker/0:2 Kdump: loaded Tainted: G        W  O      5.15.0-wt-ath+ #10
> [  166.059537] Hardware name: Intel(R) Client Systems NUC8i7HVK/NUC8i7HVB, BIOS HNKBLi70.86A.0059.2019.1112.1124 11/12/2019
> [  166.059539] Workqueue: events free_obj_work
> [  166.059543] Call Trace:
> [  166.059545]  <IRQ>
> [  166.059547]  dump_stack_lvl+0x56/0x7b
> [  166.059552]  __lock_acquire+0xb9a/0x1a50
> [  166.059556]  lock_acquire+0x1e2/0x330
> [  166.059560]  ? ath11k_hal_reo_cmd_send+0x20/0x490 [ath11k]
> [  166.059571]  _raw_spin_lock_bh+0x33/0x70
> [  166.059574]  ? ath11k_hal_reo_cmd_send+0x20/0x490 [ath11k]
> [  166.059584]  ath11k_hal_reo_cmd_send+0x20/0x490 [ath11k]
> [  166.059594]  ath11k_dp_tx_send_reo_cmd+0x3f/0x130 [ath11k]
> [  166.059605]  ath11k_dp_rx_tid_del_func+0x221/0x370 [ath11k]
> [  166.059618]  ath11k_dp_process_reo_status+0x22f/0x2d0 [ath11k]
> [  166.059632]  ? ath11k_dp_service_srng+0x2ea/0x2f0 [ath11k]
> [  166.059643]  ath11k_dp_service_srng+0x2ea/0x2f0 [ath11k]
> [  166.059655]  ath11k_pci_ext_grp_napi_poll+0x1c/0x70 [ath11k_pci]
> [  166.059659]  __napi_poll+0x28/0x230
> [  166.059664]  net_rx_action+0x285/0x310
> [  166.059668]  __do_softirq+0xe6/0x4d2
> [  166.059672]  irq_exit_rcu+0xd2/0xf0
> [  166.059675]  common_interrupt+0xa5/0xc0
> [  166.059678]  </IRQ>
> [  166.059679]  <TASK>
> [  166.059680]  asm_common_interrupt+0x1e/0x40
> [  166.059683] RIP: 0010:_raw_spin_unlock_irqrestore+0x38/0x70
> [  166.059686] Code: 83 c7 18 e8 2a 95 43 ff 48 89 ef e8 22 d2 43 ff 81 e3 00 02 00 00 75 25 9c 58 f6 c4 02 75 2d 48 85 db 74 01 fb bf 01 00 00 00 <e8> 63 2e 40 ff 65 8b 05 8c 59 97 5c 85 c0 74 0a 5b 5d c3 e8 00 6a
> [  166.059689] RSP: 0018:ffff9d2400a5fca0 EFLAGS: 00000206
> [  166.059692] RAX: 0000000000000002 RBX: 0000000000000200 RCX: 0000000000000006
> [  166.059694] RDX: 0000000000000000 RSI: ffffffffa404879b RDI: 0000000000000001
> [  166.059696] RBP: ffff9905c0053000 R08: 0000000000000001 R09: 0000000000000001
> [  166.059698] R10: ffff9d2400a5fc50 R11: 0000000000000001 R12: ffffe186c41e2840
> [  166.059700] R13: 0000000000000001 R14: ffff9905c78a1c68 R15: 0000000000000001
> [  166.059704]  free_debug_processing+0x257/0x3d0
> [  166.059708]  ? free_obj_work+0x1f5/0x250
> [  166.059712]  __slab_free+0x374/0x5a0
> [  166.059718]  ? kmem_cache_free+0x2e1/0x370
> [  166.059721]  ? free_obj_work+0x1f5/0x250
> [  166.059724]  kmem_cache_free+0x2e1/0x370
> [  166.059727]  free_obj_work+0x1f5/0x250
> [  166.059731]  process_one_work+0x28b/0x660
> [  166.059735]  ? process_one_work+0x660/0x660
> [  166.059738]  worker_thread+0x37/0x390
> [  166.059741]  ? process_one_work+0x660/0x660
> [  166.059743]  kthread+0x176/0x1a0
> [  166.059746]  ? set_kthread_struct+0x40/0x40
> [  166.059749]  ret_from_fork+0x22/0x30
> [  166.059754]  </TASK>
> 
> Since these two lockes are both initialized in ath11k_hal_srng_setup,
> they are assigned with the same key. As a result lockdep suspects that
> the task is trying to acquire the same lock (due to same key) while
> already holding it, and thus reports the DEADLOCK warning. However as
> they are different spinlock instances, the warning is false positive.
> 
> On the other hand, even no dead lock indeed, this is a major issue for
> upstream regression testing as it disables lockdep functionality.
> 
> Fix it by assigning separate lock class key for each srng->lock.
> 
> Tested-on: WCN6855 hw2.0 PCI WLAN.HSP.1.1-01720.1-QCAHSPSWPL_V1_V2_SILICONZ_LITE-1
> Signed-off-by: Baochen Qiang <quic_bqiang@xxxxxxxxxxx>
> Signed-off-by: Kalle Valo <quic_kvalo@xxxxxxxxxxx>

Patch applied to ath-next branch of ath.git, thanks.

767c94caf0ef ath11k: Avoid false DEADLOCK warning reported by lockdep

-- 
https://patchwork.kernel.org/project/linux-wireless/patch/20211209011949.151472-1-quic_bqiang@xxxxxxxxxxx/

https://wireless.wiki.kernel.org/en/developers/documentation/submittingpatches




[Index of Archives]     [Linux Host AP]     [ATH6KL]     [Linux Wireless Personal Area Network]     [Linux Bluetooth]     [Wireless Regulations]     [Linux Netdev]     [Kernel Newbies]     [Linux Kernel]     [IDE]     [Git]     [Netfilter]     [Bugtraq]     [Yosemite Hiking]     [MIPS Linux]     [ARM Linux]     [Linux RAID]

  Powered by Linux