Re: xfs_extent_busy_flush vs. aio

Avi Kivity <avi@xxxxxxxxxxxx> · Mon, 29 Jan 2018 13:44:14 +0200

On 01/29/2018 01:35 PM, Dave Chinner wrote:
On Mon, Jan 29, 2018 at 11:40:27AM +0200, Avi Kivity wrote:

On 01/25/2018 03:08 PM, Brian Foster wrote:
On Thu, Jan 25, 2018 at 10:50:40AM +0200, Avi Kivity wrote:
On 01/23/2018 07:39 PM, Brian Foster wrote:
Yeah, could be.. perhaps the issue is that despite the large amount of
total free space, the free space is too fragmented to satisfy a
particular allocation request..?
     from      to extents  blocks    pct
        1       1    2702    2702   0.00
        2       3     690    1547   0.00
        4       7     115     568   0.00
        8      15      60     634   0.00
       16      31      63    1457   0.00
       32      63     102    4751   0.00
       64     127    7940  895365   0.19
      128     255   49680 12422100   2.67
      256     511    1025  417078   0.09
      512    1023    4170 3660771   0.79
     1024    2047    2168 3503054   0.75
     2048    4095    2567 7729442   1.66
     4096    8191    8688 59394413  12.76
     8192   16383     310 3100186   0.67
    16384   32767     112 2339935   0.50
    32768   65535      35 1381122   0.30
    65536  131071       8  651391   0.14
   131072  262143       2  344196   0.07
   524288 1048575       4 2909925   0.62
1048576 2097151       3 3550680   0.76
4194304 8388607      10 82497658  17.72
8388608 16777215      10 158022653  33.94
16777216 24567552       5 122778062  26.37
total free extents 80469
total free blocks 465609690
average free extent size 5786.2

Looks like plenty of free large extents, with most of the free space
completely, unfragmented.

Indeed..
You need to look at each AG, not the overall summary. You could have
a suboptimal AG hidden in amongst that (e.g. near ENOSPC) and it's
that one AG that is causing all your problems.

There's many reasons this can happen, but the most common is the
working files in a directory (or subset of directories in the same
AG) have a combined space usage of larger than an AG ....

That's certainly possible, even likely (one huge directory with all of 
the files).

This layout is imposed on us by the compatibility gods. Is there a way 
to tell XFS to change its policy of on-ag-per-directory?

If not, then we'll have to find some workaround to the compatibility 
problem.

Lots of 16MB-32MB extents, too. 32MB is our allocation hint size, could have
something to do with it.

Most likely. Based on this, it's hard to say for certain why you'd be
running into allocation latency caused by busy extents.
One of only two reasons:

	1. the AG has a large enough free space, but they are all
	marked busy (i.e. just been freed), or

	2. The extent selected has had the busy range trimmed out of
	it and it's now less than the minimum extent length
	requested.

Both cases imply that we're allocating extents that have been very
recently freed, and that implies there is no other suitable non-busy
free space in the AG. Hence the need to look at the per-AG freespace
pattern rather than the global summary.

Also, it's worth dumping the freespace via xfs_spaceman as it walks
the in memory trees rather than the on-disk trees and so is properly
coherent with operations in progress. (i.e. xfs_spaceman -c "freesp
..." /mntpt)

That system has since been recycled, but we'll keep it in mind the next 
time we see it. Also, we'll have to fix the one-huge-directory problem 
for sure, one way or another.

--
To unsubscribe from this list: send the line "unsubscribe linux-xfs" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at  http://vger.kernel.org/majordomo-info.html