On Mon, Dec 31, 2012 at 10:58:15PM +0100, Jan Kara wrote: > On Mon 24-12-12 15:55:41, Zheng Liu wrote: > > From: Zheng Liu <wenqing.lz@xxxxxxxxxx> > > > > Currently all unwritten extent conversion work is pushed into a workqueue to be > > done because DIO end_io is in a irq context and this conversion needs to take > > i_data_sem locking. But we couldn't take a semaphore in a irq context. After > > tracking all extent status, we can first convert this unwritten extent in extent > > status tree, and call aio_complete() and inode_dio_done() to notify upper level > > that this dio has done. We don't need to be worried about exposing a stale data > > because we first try to lookup in extent status tree. So it makes us to see the > > latest extent status. Meanwhile we queue a work to convert this unwritten > > extent in extent tree. After this improvement, reader also needn't wait this > > conversion to be done when dioread_nolock is enabled. > > > > CC: Jan Kara <jack@xxxxxxx> > > CC: "Darrick J. Wong" <darrick.wong@xxxxxxxxxx> > > CC: Christoph Hellwig <hch@xxxxxxxxxxxxx> > > Signed-off-by: Zheng Liu <wenqing.lz@xxxxxxxxxx> > OK, so after reading other patches this should work fine. Just I think we > should somehow mark in the extent status tree that the extent tree is > inconsistent with what's on disk - something like extent dirty bit. It will > be set for UNWRITTEN extents where conversion is pending logically it would > also make sence to have it set for DELAYED extents. Then if we need to > reclaim some extents due to memory pressure we know we have to keep dirty > extents because those cache irreplacible information. What do you think? Dirty bit is a good idea for UNWRITTEN extent because we can feel free to reclaim all WRITTEN extents and all UNWRITTEN extents that are without dirty bit. But we can not reclaim DEALYED extents no matter whether they are dirty or not because they are used to lookup an delayed extent in fiemap, seek_data/hole, and bigalloc. So at least DEALYED extent must be kept in status tree. That is why in step 1 status tree only tracks all DELAYED extents in the tree. Thanks, - Zheng -- To unsubscribe from this list: send the line "unsubscribe linux-ext4" in the body of a message to majordomo@xxxxxxxxxxxxxxx More majordomo info at http://vger.kernel.org/majordomo-info.html