John W. Linville wrote: > On Wed, Oct 07, 2009 at 10:06:05AM -0500, Larry Finger wrote: >> In commit 26e5ab35b4c7b1d4cb487a11084520aed9a8d05e entitled "b43: Fix PPC >> crash in rfkill polling on unload", the call to stop polling should not have >> been placed inside the wl->mutex. The result was incorrect locking messages. >> >> Signed-off-by: Larry Finger <Larry.Finger@xxxxxxxxxxxx> >> --- >> >> John, >> >> I had not intended for the previous patch to be applied as I was waiting for >> the Bugzilla OP to test. He promised to do that today. In any case, that patch >> introduced a locking problem that needs to be fixed. >> >> Why do the one-liners cause so many problems? >> >> Larry >> --- >> >> Index: wireless-testing/drivers/net/wireless/b43/main.c >> =================================================================== >> --- wireless-testing.orig/drivers/net/wireless/b43/main.c >> +++ wireless-testing/drivers/net/wireless/b43/main.c >> @@ -4501,8 +4501,8 @@ static void b43_op_stop(struct ieee80211 >> >> cancel_work_sync(&(wl->beacon_update_trigger)); >> >> - mutex_lock(&wl->mutex); >> wiphy_rfkill_stop_polling(hw->wiphy); >> + mutex_lock(&wl->mutex); >> if (b43_status(dev) >= B43_STAT_STARTED) { >> dev = b43_wireless_core_stop(dev); >> if (!dev) > > OK, but why do we start polling under the lock but stop polling without > the lock? Should we start polling without holding the lock too? I'll test that, but I suspect it doesn't matter. Of course, the reason I put the stop under the lock was for symmetry, but then I got the following when shutting down: b43-phy0 debug: Removing Interface type 2 ======================================================= [ INFO: possible circular locking dependency detected ] 2.6.32-rc3-wl #225 ------------------------------------------------------- modprobe/25391 is trying to acquire lock: (&(&rfkill->poll_work)->work){+.+...}, at: [<ffffffff81054a7f>] __cancel_work_timer+0xd9/0x224 but task is already holding lock: (&wl->mutex){+.+.+.}, at: [<ffffffffa02ff3d0>] b43_op_stop+0x30/0x7f [b43] which lock already depends on the new lock. the existing dependency chain (in reverse order) is: -> #1 (&wl->mutex){+.+.+.}: [<ffffffff81069790>] __lock_acquire+0x140e/0x174d [<ffffffff81069b8b>] lock_acquire+0xbc/0xd9 [<ffffffff8128d420>] mutex_lock_nested+0x58/0x29c [<ffffffffa03150ea>] b43_rfkill_poll+0x3a/0xfc [b43] [<ffffffffa02c2f33>] ieee80211_rfkill_poll+0x26/0x28 [mac80211] [<ffffffffa027c028>] cfg80211_rfkill_poll+0x14/0x16 [cfg80211] [<ffffffffa0271081>] rfkill_poll+0x23/0x3d [rfkill] [<ffffffff81054224>] worker_thread+0x22c/0x332 [<ffffffff81057fd8>] kthread+0x7d/0x85 [<ffffffff8100caba>] child_rip+0xa/0x20 Moving the stop ooutside the lock cured the problem. Larry -- To unsubscribe from this list: send the line "unsubscribe linux-wireless" in the body of a message to majordomo@xxxxxxxxxxxxxxx More majordomo info at http://vger.kernel.org/majordomo-info.html