Javier I somehow missed these patches in the mailing list. Sorry for coming with feedback this late. I'll look at my filters - in any case, would you mind Cc'ing me in the future? > On 28 May 2018, at 10.58, Matias Bjørling <mb@xxxxxxxxxxx> wrote: > > From: Igor Konopko <igor.j.konopko@xxxxxxxxx> > > During sequential workloads we can met the case when almost all the > lines are fully written with data. In that case rate limiter will > significantly reduce the max number of requests for user IOs. Do you mean random writes? On fully sequential, a line will either be fully written, fully invalidated or on its way to be written. When invalidating the line, then the whole line will be invalidated and GC will free it without having to move valid data. > > Unfortunately in the case when round buffer is flushed to drive and > the entries are not yet removed (which is ok, since there is still > enough free entries in round buffer for user IO) we hang on user > IO due to not enough entries in rate limiter. The reason is that > rate limiter user entries are decreased after freeing the round > buffer entries, which does not happen if there is still plenty of > space in round buffer. > > This patch forces freeing the round buffer by calling > pblk_rb_sync_l2p and thus making new free entries in rate limiter, > when there is no enough of them for user IO. I can see why you might have problems with very low OP due to the rate limiter, but unfortunately this is not a good way of solving the problem. When you do this, you basically make the L2P to point to the device instead of pointing to the write cache, which in essence bypasses mw_cuints. As a result, if a read comes in to one of the synced entries, it will violate the device-host contract and most probably fail (for sure fail on > SLC). I think that the right way of solving this problem is separating the write and GC buffers and then assigning tokens to them. The write thread will then consume both buffers based on these tokens. In this case, user I/O will have a buffer for itself, which will be guaranteed to advance at the rate the rate-limiter is allowing it to. Note that the 2 buffers can be a single buffer with a new set of pointers so that the lookup can be done with a single bit. I have been looking for time to implement this for a while. If you want to give it a go, we can talk and I can give you some pointers on potential issues I have thought about. Javier
Attachment:
signature.asc
Description: Message signed with OpenPGP