On Mon, Aug 12, 2019 at 11:01:50PM +0530, RITESH HARJANI wrote: > > This patch series converts the ext4 direct IO code paths to make use of the > > iomap infrastructure and removes the old buffer_head direct-io based > > implementation. The result is that ext4 is converted to the newer framework > > and that it may _possibly_ gain a performance boost for O_SYNC | O_DIRECT IO. > > > > These changes have been tested using xfstests in both DAX and non-DAX modes > > using various configurations i.e. 4k, dioread_nolock, dax. > > I had some minor review comments posted on Patch-4. > But the rest of the patch series looks good to me. Thanks for the review, much appreciated! Also, apologies about any delayed response to your queries, I predominantly do all this work in my personal time. > I will also do some basic testing of xfstests which I did for my patches and > will revert back. Sounds good! > One query, could you please help answering below for my understanding :- > > I was under the assumption that we need to maintain > ext4_test_inode_state(inode, EXT4_STATE_DIO_UNWRITTEN) or > atomic_read(&EXT4_I(inode)->i_unwritten)) > in case of non-AIO directIO or AIO directIO case as well (when we may > allocate unwritten extents), > to protect with some kind of race with other parts of code(maybe > truncate/bufferedIO/fallocate not sure?) which may call for > ext4_can_extents_be_merged() > to check if extents can be merged or not. > > Is it not the case? > Now that directIO code has no way of specifying that this inode has > unwritten extent, will it not race with any other path, where this info was > necessary (like > in above func ext4_can_extents_be_merged())? Ah yes, I was under the same assumption when reviewing the code initially and one of my first solutions was to also use this dynamic 'state' flag in the ->end_io() handler. But, I fell flat on my face as that deemed to be problematic... This is because there can be multiple direct IOs to unwritten extents against the same inode, so you cannot possibly get away with tracking them using this single inode flag. So, hence the reason why we drop using EXT4_STATE_DIO_UNWRITTEN and use IOMAP_DIO_UNWRITTEN instead in the ->end_io() handler, which tracks whether _this_ particular IO has an underlying unwritten extent.