Re: trying to understand READ_META, READ_SYNC, WRITE_SYNC & co

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



On Mon, Jun 21, 2010 at 12:04:06PM +0200, Jens Axboe wrote:

[..]
> > Now how do we use these flags in the block layer?
> >  
> >  - REQ_META
> > 
> > 	The only place where we ever use this flag is inside the
> > 	cfq scheduler. In cfq_choose_req we use it to give a meta
> > 	request priority over one that doesn't have it.  But before
> > 	that we already do the same preference check with rw_is_sync,
> > 	which evaluates to true for requests with that are either
> > 	reads or have REQ_SYNC set.  So for reads the REQ_META flag
> > 	here effectively is a no-op, and for writes it gives less
> > 	priority than REQ_SYNC.
> > 	In addition to that we use it to account for pending metadata
> > 	requests in cfq_rq_enqueued/cfq_remove_request which gets
> > 	checked in cfq_should_preempt to give priority to a meta
> > 	request if the other queue doesn't have any pending meta
> > 	requests.  But again this priority comes after a similar
> > 	check for sync requests that checks if the other queue has
> > 	been marked to have sync requests pending.
> 
> It's also annotation for blktrace, so you can tell which parts of the IO
> is meta data etc. The scheduler impact is questionable, I doubt it makes
> a whole lot of difference.

In my testing in the past, this was helping if lots of sequential readers
are running in a system (with 100ms slice each) and if there is another
reader doing small file reads. Without meta data based preemption check,
latency of opening small file used to be very high (Because all the sync
sequntial reader gets to consume their 100ms slices first).

Situation should be somewhat better now after corrado's changes of
reducing slice length dynamically. But I suspect that we will still
experience significantly reduced latency for small file reads in presece
of multiple sequntial reads going on.

Thanks
Vivek
--
To unsubscribe from this list: send the line "unsubscribe linux-fsdevel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at  http://vger.kernel.org/majordomo-info.html


[Index of Archives]     [Linux Ext4 Filesystem]     [Union Filesystem]     [Filesystem Testing]     [Ceph Users]     [Ecryptfs]     [AutoFS]     [Kernel Newbies]     [Share Photos]     [Security]     [Netfilter]     [Bugtraq]     [Yosemite News]     [MIPS Linux]     [ARM Linux]     [Linux Security]     [Linux Cachefs]     [Reiser Filesystem]     [Linux RAID]     [Samba]     [Device Mapper]     [CEPH Development]
  Powered by Linux