Bart, On 10/3/17 08:44, Bart Van Assche wrote: > On Mon, 2017-10-02 at 16:15 +0900, Damien Le Moal wrote: >> When dispatching write requests to a zoned block device, only allow >> requests targeting an unlocked zone. Requests targeting a locked zone >> are left in the scheduler queue to preserve the initial write order. >> If no write request can be dispatched, allow reads to be dispatched >> even if the write batch is not done. >> >> To ensure that the search for an appropriate write request is atomic >> in deadline_fifo_request() and deadline_next_request() with reagrd to > ^^^^^^ > regard? Will fix. >> write requests zone lock state, introduce the spinlock zone_lock. >> Holding this lock while doing the search in these functions as well as >> when unlocking the target zone of a completed write request in >> dd_completed_request() ensure that the search does not pickup a write >> request in the middle of a zone queued write sequence. > > Since there is already a spinlock in the mq-deadline scheduler that serializes > most operations, do we really need a new spinlock? The dd->lock spinlock is used without IRQ disabling. So it does not protect against request completion method calls. That spinlock being a "big lock", I did not want to use it with IRQ disabled. >> /* >> * Write unlock the target zone of a write request. >> + * Clearing the target zone write lock bit is done with the scheduler zone_lock >> + * spinlock held so that deadline_next_request() and deadline_fifo_request() >> + * cannot see the lock state of a zone change due to a request completion during >> + * their eventual search for an appropriate write request. Otherwise, for a zone >> + * with multiple write requests queued, a non sequential write request >> + * can be chosen. >> */ >> static void deadline_wunlock_zone(struct deadline_data *dd, >> struct request *rq) >> { >> + unsigned long flags; >> + >> + spin_lock_irqsave(&dd->zone_lock, flags); >> + >> WARN_ON_ONCE(!test_and_clear_bit(blk_rq_zone_no(rq), dd->zones_wlock)); >> deadline_clear_request_zone_wlock(rq); >> + >> + spin_unlock_irqrestore(&dd->zone_lock, flags); >> } > > Is a request removed from the sort and fifo lists before being dispatched? If so, > does that mean that obtaining zone_lock in the above function is not necessary? Yes, a request is removed from the sort tree and fifo list before dispatching. But the dd->zone_lock spinlock is not there to protect that, dd->lock protects the sort tree and fifo list. dd->zone_lock was added to prevent the completed_request() method from changing a zone lock state while deadline_fifo_requests() or deadline_next_request() are running. Ex: Imagine this case: write request A for a zone Z is being executed (it was dispatched) so Z is locked. Dispatch runs and inspects the next requests in sort order. Let say we have the sequential writes B, C, D, E queued for the same zone Z. First B is inspected and cannot be dispatched (zone locked). Inspection move on to C, but before the that, A completes and Z is unlocked. Then C will be OK to go as the zone is now unlocked. But it is the wrong choice as it will result in out of order write. B must be the next request dispatched after A. dd->zone_lock prevents this from happening. Without this spinlock, the bad example case above happens very easily. >> static struct request * >> deadline_fifo_request(struct deadline_data *dd, int data_dir) >> { >> + struct request *rq; >> + unsigned long flags; >> + >> if (WARN_ON_ONCE(data_dir != READ && data_dir != WRITE)) >> return NULL; >> >> if (list_empty(&dd->fifo_list[data_dir])) >> return NULL; >> >> - return rq_entry_fifo(dd->fifo_list[data_dir].next); >> + if (!dd->zones_wlock || data_dir == READ) >> + return rq_entry_fifo(dd->fifo_list[data_dir].next); >> + >> + spin_lock_irqsave(&dd->zone_lock, flags); >> + >> + list_for_each_entry(rq, &dd->fifo_list[WRITE], queuelist) { >> + if (deadline_can_dispatch_request(dd, rq)) >> + goto out; >> + } >> + rq = NULL; >> + >> +out: >> + spin_unlock_irqrestore(&dd->zone_lock, flags); >> + >> + return rq; >> } > > Same question here: is it really necessary to obtain zone_lock? See above. Same comment. Best regards. -- Damien Le Moal, Western Digital