On Wed, 26 Feb 2014, Dave Chinner wrote: > Date: Wed, 26 Feb 2014 08:50:11 +1100 > From: Dave Chinner <david@xxxxxxxxxxxxx> > To: Lukáš Czerner <lczerner@xxxxxxxxxx> > Cc: linux-ext4@xxxxxxxxxxxxxxx, linux-fsdevel@xxxxxxxxxxxxxxx, xfs@xxxxxxxxxxx > Subject: Re: [PATCH 6/6] ext4/242: Add ext4 specific test for fallocate zero > range > > On Tue, Feb 25, 2014 at 10:01:06PM +0100, Lukáš Czerner wrote: > > On Wed, 26 Feb 2014, Dave Chinner wrote: > > > > > Date: Wed, 26 Feb 2014 07:53:49 +1100 > > > From: Dave Chinner <david@xxxxxxxxxxxxx> > > > To: Lukas Czerner <lczerner@xxxxxxxxxx> > > > Cc: linux-ext4@xxxxxxxxxxxxxxx, linux-fsdevel@xxxxxxxxxxxxxxx, xfs@xxxxxxxxxxx > > > Subject: Re: [PATCH 6/6] ext4/242: Add ext4 specific test for fallocate zero > > > range > > > > > > On Tue, Feb 25, 2014 at 08:15:28PM +0100, Lukas Czerner wrote: > > > > This is copy of xfs/242. However it's better to make it file system > > > > specific because the range can be zeroes either directly by writing > > > > zeroes, or converting to unwritten extent, so the actual result might > > > > differ from file system to file system. > > > > > > You could say the same thing about preallocation using unwritten > > > extents. Yet, funnily enough, we have generic tests for them because > > > all filesystems currently use unwritten extents for preallocation > > > and behave identically.... > > > > > > This test is no different - all filesystems currently use unwritten > > > extents, and so this test should be generic because all existing > > > filesystems *should* behave the same. > > > > > > When we get a filesystem that zeros rather uses unwritten extents, > > > we can add a new *generic* test that tests for zeroed data extents > > > rather than unwritten extents. All that we will need is a method of > > > checking what behaviour the filesystem has and adding that to a > > > _requires directive to ensure the correct generic fallocate tests > > > are run... > > > > Currently xfs/242 fails on xfs for me > > Really? Where's the bug report? I haven't seen a failure on xfs/242 > on any of my test machines for at least a year, even on 1k block > size filesystems... > > $ sudo ./check xfs/242 > FSTYP -- xfs (debug) > PLATFORM -- Linux/x86_64 test2 3.14.0-rc3-dgc+ > MKFS_OPTIONS -- -f -bsize=4096 /dev/vdb > MOUNT_OPTIONS -- /dev/vdb /mnt/scratch > > xfs/242 1s ... 0s > Ran: xfs/242 > Passed all 1 tests > $ Ok so once again. Yesterday was rally too late and I've misinterpreted the diff. It's not that xfs behaves differently, but rather ext4 behaves differently because we in fact have a code that will zero out entire unwritten extent if it's small enough rather than split it into unwritten and written. What we currently do not have is the zeroout of part of initialized extent on zero_range when the range to zero out is small enough. And ext4 behaves the same on the edges of the range as you describe below. Also xfs/242 fails for me on ppc64 but I've already sent that report. > > > > and it does behave differently than ext4. > > In what way? Does FALLOC_FL_ZERO_RANGE on XFS behave identically to > XFS_IOC_ZERO_RANGE, or is that different too? Or you haven't tested > it because you wrote this test as an ext4 specific test and so > haven't run this specific test exercising the FALLOC_FL_ZERO_RANGE > path in XFS? It does behave differently, but not because of zero_range code, but rather when writing into uninitialized extent which is small enough. The extent will not be split but rather converted to initialized and respective parts will be zeroed out. Btw that's actually the reason why we use _filter_hole_fiemap filtes instead of _filter_fiemap I've used in ext4/242. That said I think that both tests fs specific and fs independent have it's value so I'll create generic/242 as well by using _filter_hole_fiemap. And yes, I tested xfs with fzero and it does behave the same as XFS_IOC_ZERO_RANGE. > > IOWs, how do you know that what you are seeing is not a bug in the > ext4 (or XFS for that matter) implementation? > > > Also I had to change to 242.out a bit because ext4 was > > a little different. It seems to me that it was expected that when > > the extent is small enough it would be overwritten by zeroes rather > > than converted to unwritten, but I have not looked into > > implementation. > > The test assumes that sub-block head and tail ranges will be zeroed, > and everything else will be converted to unwritten extents. i.e. a > single block aligned range will get converted to unwritten, but a > single unaligned block range will result in the two overlapping > blocks being zeroed (because they still contain valid data). As I said above the difference in ext4 is that we're trying to avoid extent split when the extent is too small. But that's no different in punch hole testing, that's why we're using _filter_hole_fiemap filter in generic/255. > > This is the same sub-block zeroing behaviour as is done for > hole punching - the only difference between a hole punch and a zero > range on filesystems that use unwritten extents should be that the > range being operated on has unwritten extents rather a hole..... > > > Btw this kind of optimization is actually something I've been > > thinking of as well for ext4. Rather than going though the hassle of > > changing extents around it might be worth in some situation to zero > > out. But that's an optimization I have not implemented yet. > > Exactly my point - until such optimisations are implemented, all the > filesystems should be behaving the same way using unwritten extents, > just like for hole punching. Hence the tests should be checking that > the behaviour is the same across filesystems, just like we do for > hole punching. Using _filter_hole_fiemap filter in such test we would not make a difference between unwritten and written extent. However in the case of zero_range this somewhat make the test much less effective so it'll be worth having fs specific test as well as generic test I said above. Or we could actually directly inspect the data as we do in xfs/290, or generic/290 respectively. Thanks! -Lukas > > Cheers, > > Dave. >