On Fri, Mar 08, 2024 at 12:20:31PM +1100, Dave Chinner wrote: > On Thu, Mar 07, 2024 at 03:59:07PM -0800, Eric Biggers wrote: > > On Fri, Mar 08, 2024 at 10:38:06AM +1100, Dave Chinner wrote: > > > > This makes all kernels with CONFIG_FS_VERITY enabled start preallocating memory > > > > for these bios, regardless of whether they end up being used or not. When > > > > PAGE_SIZE==4096 it comes out to about 134 KiB of memory total (32 bios at just > > > > over 4 KiB per bio, most of which is used for the BIO_MAX_VECS bvecs), and it > > > > scales up with PAGE_SIZE such that with PAGE_SIZE==65536 it's about 2144 KiB. > > > > > > Honestly: I don't think we care about this. > > > > > > Indeed, if a system is configured with iomap and does not use XFS, > > > GFS2 or zonefs, it's not going to be using the iomap_ioend_bioset at > > > all, either. So by you definition that's just wasted memory, too, on > > > systems that don't use any of these three filesystems. But we > > > aren't going to make that one conditional, because the complexity > > > and overhead of checks that never trigger after the first IO doesn't > > > actually provide any return for the cost of ongoing maintenance. > > > > > > Similarly, once XFS has fsverity enabled, it's going to get used all > > > over the place in the container and VM world. So we are *always* > > > going to want this bioset to be initialised on these production > > > systems, so it falls into the same category as the > > > iomap_ioend_bioset. That is, if you don't want that overhead, turn > > > the functionality off via CONFIG file options. > > > > "We're already wasting memory, therefore it's fine to waste more" isn't a great > > argument. > > Adding complexity just because -you- think the memory is wasted > isn't any better argument. I don't think the memory is wasted, and I > expect that fsverity will end up in -very- wide use across > enterprise production systems as container/vm image build > infrastructure moves towards composefs-like algorithms that have a > hard dependency on fsverity functionality being present in the host > filesytsems. > > > iomap_ioend_bioset is indeed also problematic, though it's also a bit different > > in that it's needed for basic filesystem functionality, not an optional feature. > > Yes, ext4 and f2fs don't actually use iomap for buffered reads, but IIUC at > > least there's a plan to do that. Meanwhile there's no plan for fsverity to be > > used on every system that may be using a filesystem that supports it. > > That's not the case I see coming for XFS - by the time we get this > merged and out of experimental support, the distro and application > support will already be there for endemic use of fsverity on XFS. > That's all being prototyped on ext4+fsverity right now, and you > probably don't see any of that happening. > > > We should take care not to over-estimate how many users use optional features. > > Linux distros turn on most kconfig options just in case, but often the common > > case is that the option is on but the feature is not used. > > I don't think that fsverity is going to be an optional feature in > future distros - we're already building core infrastructure based on > the data integrity guarantees that fsverity provides us with. IOWs, > I think fsverity support will soon become required core OS > functionality by more than one distro, and so we just don't care > about this overhead because the extra read IO bioset will always be > necessary.... That's great that you're finding fsverity to be useful! At the same time, please do keep in mind that there's a huge variety of Linux systems besides the "enterprise" systems using XFS that you may have in mind. The memory allocated for iomap_fsverity_bioset, as proposed, is indeed wasted on any system that isn't using both XFS *and* fsverity (as long as CONFIG_FS_VERITY=y, and either CONFIG_EXT4_FS, CONFIG_F2FS_FS, or CONFIG_BTRFS_FS is enabled too). The amount is also quadrupled on systems that use a 16K page size, which will become increasingly common in the future. It may be that part of your intention for pushing back against this optimization is to encourage filesystems to convert their buffered reads to iomap. I'm not sure it will actually have that effect. First, it's not clear that it will be technically feasible for the filesystems that support compression, btrfs and f2fs, to convert their buffered reads to iomap. Second, this sort of thing reinforces an impression that iomap is targeted at enterprise systems using XFS, and that changes that would be helpful on other types of systems are rejected. - Eric