On Mon, Nov 1, 2010 at 4:43 PM, Ben Gamari <bgamari@xxxxxxxxx> wrote: > On Mon, 1 Nov 2010 16:17:23 +0100, Björn Smedman <bjorn.smedman@xxxxxxxxxxx> wrote: >> Hi all, >> >> I have an application that creates and destroys a lot of ap vifs and >> does a lot of monitor frame injection. The recent ath9k rx locking >> fixes have helped with stability in this use-case but there still >> seems to be some tx/beacon related race condition(s). These manifests >> themselves as follows on an AR913x based router running >> compat-wireless-2010-10-19 (with locking fixes etc from openwrt): >> >> 1. TX DMA hangs under simultaneous high RX and TX load >> 2. TX is completely hung but chip is never reset > > I have also observed both of these behaviors with just a standard > hostapd single VIF configuration. Quite annoying. It seems to be better > with recent wireless-testing trees. > > - Ben Looking at the code here is the first passage that triggers a bad fuzzy feeling for me (beacon.c): skb = ieee80211_get_buffered_bc(hw, vif); /* * if the CABQ traffic from previous DTIM is pending and the current * beacon is also a DTIM. * 1) if there is only one vif let the cab traffic continue. * 2) if there are more than one vif and we are using staggered * beacons, then drain the cabq by dropping all the frames in * the cabq so that the current vifs cab traffic can be scheduled. */ spin_lock_bh(&cabq->axq_lock); cabq_depth = cabq->axq_depth; spin_unlock_bh(&cabq->axq_lock); if (skb && cabq_depth) { if (sc->nvifs > 1) { ath_print(common, ATH_DBG_BEACON, "Flushing previous cabq traffic\n"); ath_draintxq(sc, cabq, false); } } ath_beacon_setup(sc, avp, bf, info->control.rates[0].idx); while (skb) { ath_tx_cabq(hw, skb); skb = ieee80211_get_buffered_bc(hw, vif); } >From what I can tell there is no guarantee that CABQ TX DMA is stopped when ath_draintxq() is called. From ath_draintxq() point of view that looks like a bad idea (race between CPU and DMA). Also, that looking around "cabq_depth = cabq->axq_depth;" looks very peculiar. I believe it's correct (because nobody else puts anything into this queue and we don't care if it's shorter later on when we drain it) but I think it would be nice with a comment. Any thoughts? I can whip up and test a patch if there are no objections. /Björn -- To unsubscribe from this list: send the line "unsubscribe linux-wireless" in the body of a message to majordomo@xxxxxxxxxxxxxxx More majordomo info at http://vger.kernel.org/majordomo-info.html