On Mon, Apr 10, 2017 at 09:53:17PM +0800, Liping Zhang wrote: > Hi Pablo, > > 2017-04-10 20:02 GMT+08:00 Pablo Neira Ayuso <pablo@xxxxxxxxxxxxx>: > [...] > >> Since ctnetlink_change_conntrack is unrelated to nf_conntrack_expect_lock, > >> so remove it can fix this issue. > > > > But packets may be updating a conntrack at the same time that we're > > mangling via ctnetlink, right? > > Yes, but in packets processing path, we use rcu_read_lock(), so using > spin_lock_bh(&nf_conntrack_expect_lock) here won't help anything. > > As a quick summary(just a reference): > 1. For CTA_TIMEOUT, there's no problem > 2. For CTA_MARK, no problem too > 3. For CTA_PROTOINFO, spin_lock_bh(&ct->lock) will be held, so no problem too > 4. For CTA_LABELS, it may race with packets path, but it seems not a big problem > 5. For CTA_SEQ_ADJ_ORIG... we should hold &ct->lock when do updating seqadj > (this one should require a new patch) > 6. For CTA_HELP, updating helpinfo may be a problem(I am not sure > about this part) > 7. For CTA_STATUS, I think it may cause a big problem, the bit set operation via > ctnetlink_change_status is not atomic, so it may clear the > IPS_DYING_BIT, for example: > CPU0(update CTA_STATUS) CPU1(packet path, set _DYING_) > ctnetlink_change_status -- > olds = ct->status -- > -- set_bit(IPS_DYING_BIT... > ct->status = olds | new status --> Here DYING_BIT will be cleared! > > But I think we can convert "ct->status |= status & ~(IPS_NAT_DONE_MASK > | IPS_NAT_MASK);" > to a series of atomic bit set operations to solve the 7th issue. > > And the issues listed above won't be solved by holding _expect_lock, > so I think we should get rid of the _expect_lock at first. I'm tossing this. I would like to see a patch series to address all issues with conntrack updates in one go. By when the central spinlock was removed, this was incorrectly converted to be safe. Since then on this has been broken. This should refer to patch ca7433df3a67 when fixing this. -- To unsubscribe from this list: send the line "unsubscribe netfilter-devel" in the body of a message to majordomo@xxxxxxxxxxxxxxx More majordomo info at http://vger.kernel.org/majordomo-info.html