On 9/8/21 9:15 PM, Jens Axboe wrote: > On 9/8/21 2:09 PM, Pavel Begunkov wrote: >> On 9/8/21 8:57 PM, Jens Axboe wrote: >>> On 9/8/21 1:49 PM, Pavel Begunkov wrote: >>>> In case of !SQPOLL, io_cqring_ev_posted_iopoll() doesn't provide a >>>> memory barrier required by waitqueue_active(&ctx->poll_wait). There is >>>> a wq_has_sleeper(), which does smb_mb() inside, but it's called only for >>>> SQPOLL. >>> >>> We can probably get rid of the need to even do so by having the slow >>> path (eg someone waiting on cq_wait or poll_wait) a bit more expensive, >>> but this should do for now. >> >> You have probably seen smp_mb__after_spin_unlock() trick [1], easy way >> to get rid of it for !IOPOLL. Haven't figured it out for IOPOLL, though >> >> [1] https://github.com/isilence/linux/commit/bb391b10d0555ba2d55aa8ee0a08dff8701a6a57 > > We can just synchronize the poll_wait() with a spinlock. It's kind of silly, > and it's especially silly since I bet nobody does poll(2) on the ring fd for > IOPOLL, but... fwiw, for the ebpf cat ev_posted() -> smb_mb() for taking ~3-5%. And there are non-bpf cases that may benefit from it. On my list to publish a refined version of the patch. -- Pavel Begunkov