Re: [RFC PATCH 0/14] xfs: Towards thin provisioning aware filesystems

Brian Foster <bfoster@xxxxxxxxxx> · Wed, 1 Nov 2017 10:17:21 -0400

On Wed, Nov 01, 2017 at 11:45:13AM +1100, Dave Chinner wrote:
> On Tue, Oct 31, 2017 at 07:24:32AM -0400, Brian Foster wrote:
> > On Tue, Oct 31, 2017 at 08:09:41AM +1100, Dave Chinner wrote:
> > > On Mon, Oct 30, 2017 at 09:31:17AM -0400, Brian Foster wrote:
> > > > On Thu, Oct 26, 2017 at 07:33:08PM +1100, Dave Chinner wrote:
> > > > > This patchset is aimed at filesystems that are installed on sparse
> > > > > block devices, a.k.a thin provisioned devices. The aim of the
> > > > > patchset is to bring the space management aspect of the storage
> > > > > stack up into the filesystem rather than keeping it below the
> > > > > filesystem where users and the filesystem have no clue they are
> > > > > about to run out of space.
> ....
> > > I get that "total_blocks" sounds better, but to me that's a capacity
> > > measurement, not an indication of the size of the underlying address
> > > space the block device has provided. m_usable_blocks is obviously a
> > > capacity measurement, but I was trying to convey that m_LBA_size is
> > > not a capacity measurement but an externally imposed addressing
> > > limit.
> > > 
> > > <shrug>
> > > 
> > > I guess if I document it well enough m_total_blocks will work.
> > > 
> > 
> > Hmm, yeah I see what you mean. Unfortunately I can't really think of
> > anything aside from m_total_blocks or perhaps m_phys_blocks at the
> > moment.
> 
> m_phys_blocks seems closer to the intent. If that's acceptible I'll
> change the code to that.
> 

That works for me, thanks..

BTW, was there ever any kind of solution to the metadata block
reservation issue in the thin case? We now hide metadata reservation
from the user via the m_usable_blocks account. If m_phys_blocks
represents a thin volume, how exactly do we prevent those metadata
allocations/writes from overrunning what the admin has specified as
"usable" with respect to the thin volume?

> > > > Finally, I tend to agree with Amir's comment with regard to
> > > > shrink/growfs... at least infosar as I understand his concern. If we do
> > > > support physical shrink in the future, what do we expect the interface
> > > > to look like in light of this change?
> > > 
> > > I don't expect it to look any different. It's exactly the same as
> > > growfs - thinspace filesystem will simply do a logical grow/shrink,
> > > fat filesystems will need to do a physical grow/shrink
> > > adding/removing AGs.
> > > 
> > 
> > How would you physically shrink a thin filesystem?
> 
> You wouldn't. There should never be a need to do this because it a
> thinspace shrink doesn't actually free any space - it's just a usage
> limit. fstrim is what actually shrinks the storage space used,
> regardless of the current maximum capcity of the thin filesystem.
> 

In other words, the answer is that we can't physically shrink a thin fs
because of a limitation on the growfs interface due to how we've used it
here. That's fine, I'd just prefer to be explicit about the behavior
rather than attempt to predict that certain operations will never have a
legitimate use case, however likely that might seem right now. More
importantly, then I can at least understand we're on the same page with
regard to the ramifications of this change and we can simply agree to
disagree. ;)

I do agree that the effect is essentially redundant on thin filesystems
from the perspective of administration of a thin pool. I don't really
have a specific use case in mind for why somebody would want to
physically shrink a thin enabled fs. From the thin management
perspective, it seems more likely to me that we'd find users who want to
physically grow a filesystem without logically growing it at the same
time (another limitation of the current interface) before we're in the
situation where physical shrink is a real concern.

My perspective on physical shrink is more that the functionality still
exists via the management of the thin volume itself, so if we someday
support physical shrink, somebody is bound to find a decent enough
reason to want to do it on a thin fs. Maybe they want to convert a thin
volume to a fat one or wanted to shrink the geometry to reduce metadata
overhead or something. Anyways, it seems a bit unfortunate that the only
reason they couldn't is due to too narrow an interface.

> So I don't really see any point in physically shrink a thin
> filesystem because that defeats the entire purpose of having a thin
> filesystem in the first place.
> 
> > > > Given that this already requires on-disk changes and the addition of a
> > > > feature bit, it seems prudent to me to update the growfs API
> > > > accordingly. Isn't a growfs new_usable_blocks field or some such all we
> > > > really need to address that concern?
> > > 
> > > I really do not see any reason for changing the growfs interface
> > > right now. If there's a problem in future that physical shrink
> > > introduces, we can rev the interface when the problem arises.
> > > 
> > 
> > Indeed. I'm not terribly concerned about the growfs interface, but in
> > the interest of trying to make sure the point is clear... does that mean
> > we'd add a new usable_blocks field to the growfs structure if/when
> > physical shrink is supported?
> 
> No. growfs simply says "grow/shrink to size X" and the kernel then
> determines if a physical or thin grow/shrink operation is performed
> and does the work/checks appropriate to make that work.
> 

I understand how the current change is intended to work... that doesn't
really answer the question. You acknowledge below that we could support
physical shrink on thin fs' by revving the growfs interface. I assumed
above that means the new interface would handle physical and usable
blocks separately. If that is not the case, then how would you propose
to define that new interface to handle both physical and logical shrink?

With that new/revved interface, would we not somehow have to maintain
backwards compatibility of the old interface with regard to thin enabled
filesystems?

> The only thing we need to ensure is that growfs has the necessary
> info to correctly set the size of the filesystem when it is doing
> operations that don't want to change the size of the filesystem
> (e.g. changing imaxpct).  i.e. an old growfs should be able to
> grow/shrink a thin filesystem correctly without being aware it is a
> thin filesystem.
> 
> > If so, doesn't that mean we'd have two
> > fields that are used to logically shrink the fs, depending on interface
> > version, or that we'd have some kernels that logically shrink based on
> > 'newblocks' and some that don't?
> 
> No.
> 
> > The larger question I have is what do we save by not just adding
> > usable_blocks to growfs_data now so the interface remains
> > self-explanatory? That seems relatively minor with respect to the
> > on-disk changes.
> 
> We avoid having to modify an API for no real reason. We don't change
> userspace APIs unless we have an actual need, and I'm not seeing
> anyone articulate an actual need to change the growfs ioctl.
> 

Understood. In this case, it seems to me we're changing the meaning of
an existing user interface to avoid changing the interface for a new
feature. IOW, what would the approach be here if we already supported
physical shrink?

> Further, changing the growfs API for thinspace filesystems means
> that old growfs binaries will do bad things on thinspace filesystems
> because growfs doesn't do version/feature checks. Hence we have to
> make thinspace grow operations work sanely for existing
> XFS_IOC_FSGEOMETRY_V4 callers, and that means we can't rely on a new
> XFS_IOC_GROWFSDATA ioctl just to operate on thinspace filesystems.
> 

Changing/revving the interface means the old interface continues to only
imply physical grow/shrink. That means the new interface is required to
manage logical grow/shrink, which is not that far off from other new
features we add that require updated userspace for support. We already
have to add a feature check to make the current interface perform a
logical shrink rather than an error, so I'm not following why that's
such a technical hurdle.

Brian

> *If* we add physical shrink support we can rev the growfs interface
> at that point in time *if we need to*. But I'm simply not convinced
> we will need to change it at all...
> 
> Cheers,
> 
> Dave.
> -- 
> Dave Chinner
> david@xxxxxxxxxxxxx
> --
> To unsubscribe from this list: send the line "unsubscribe linux-xfs" in
> the body of a message to majordomo@xxxxxxxxxxxxxxx
> More majordomo info at  http://vger.kernel.org/majordomo-info.html
--
To unsubscribe from this list: send the line "unsubscribe linux-xfs" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at  http://vger.kernel.org/majordomo-info.html