On Thu 01-11-12 15:56:34, Boaz Harrosh wrote: > On 11/01/2012 11:57 AM, Darrick J. Wong wrote: > > On Thu, Nov 01, 2012 at 11:21:22AM -0700, Boaz Harrosh wrote: > >> On 11/01/2012 12:58 AM, Darrick J. Wong wrote: > >>> This creates a per-backing-device counter that tracks the number of users which > >>> require pages to be held immutable during writeout. Eventually it will be used > >>> to waive wait_for_page_writeback() if nobody requires stable pages. > >>> > >> > >> There is two things I do not like: > >> 1. Please remind me why we need the users counter? > >> If the device needs it, it will always be needed. The below > >> queue_unrequire_stable_pages call at blk_integrity_unregister > >> only happens at device destruction time, no? > >> > >> It was also said that maybe individual filesystems would need > >> stable pages where other FSs using the same BDI do not. But > >> since the FS is at the driving seat in any case, it can just > >> do wait_for_writeback() regardless and can care less about > >> the bdi flag and the other FSs. Actually all those FSs already > >> do this this, and do not need any help. So this reason is > >> mute. > > > > The counter exists so that a filesystem can forcibly enable stable page writes > > even if the underlying device doesn't require it, because the generic fs/mm > > waiting only happens if stable_pages_required=1. The idea here was to allow a > > filesystem that needs stable page writes for its own purposes (i.e. data block > > checksumming) to be able to enforce the requirement even if the disk doesn't > > care (or doesn't exist). > > > > But the filesystem does not need BDI flag to do that, It can just call > wait_on_page_writeback() directly and or any other waiting like cifs does, > and this way will not affect any other partitions of the same BDI. So this flag > is never needed by the FS, it is always to service the device. But if they use some generic VFS functions, these VFS functions need to know whether to wait or not - e.g. grab_cache_page_write_begin() or block_page_mkwrite(). > > But maybe there are no such filesystems? > > > > Exactly, all the FSs that do care, already take care of it. I'm not exactly sure whether FSs that do care didn't start to use generic functions which now always call wait_on_page_writeback() for quite a few releases... Honza -- Jan Kara <jack@xxxxxxx> SUSE Labs, CR -- To unsubscribe from this list: send the line "unsubscribe ceph-devel" in the body of a message to majordomo@xxxxxxxxxxxxxxx More majordomo info at http://vger.kernel.org/majordomo-info.html