On Sat, Aug 25, 2018 at 01:47:52PM +0300, Amir Goldstein wrote: > [+cc: Al,linux-unionfs] > > On Sat, Aug 25, 2018 at 2:39 AM Dave Chinner <david@xxxxxxxxxxxxx> wrote: > > > > On Fri, Aug 24, 2018 at 12:02:51PM +0300, Amir Goldstein wrote: > > > Since overlayfs implements stacked file operations, f_inode > > > is no longer euqivalent to f_mapping->host and xfs should use > > > the latter, same as generic_swapfile_activate(). > > > > Since when has file_inode() not pointed to the inode backing the > > struct file? > > > > > Using f_inode results in an attempt to dereference an xfs_inode > > > struct from an ovl_inode pointer: > > > > > > CPU: 0 PID: 2462 Comm: swapon Not tainted > > > 4.18.0-xfstests-12721-g33e17876ea4e #3402 > > > RIP: 0010:xfs_find_bdev_for_inode+0x23/0x2f > > > Call Trace: > > > xfs_iomap_swapfile_activate+0x1f/0x43 > > > __se_sys_swapon+0xb1a/0xee9 > > > > > > Fixes: d1d04ef8572b ("ovl: stack file ops") > > > > Oh, since about 3 days ago. > > > > So now we've got to deal with a vfs interface change that isn't > > documented, hasn't been clearly communicated prior to merging, and > > it subtly breaks a subset of callers. > > > > Well, when you put it this way... ;-) > > First of all - self NACK. > My fix is papering over a bigger issue, that is leaking of overlay > file/inode into xfs f_aops. That means any new operation vector introduced that passes a struct file is always going to need an overlay interposer function to ensure that the correct file is passed to the lower filesystem, yes? That seems like a bit of a landmine to leave for anyone implementing a new generic operation vector. Documentation patch? > I believe the correct fix right now would be to add an overlayfs hack > in swapon(), as well as some other hacks in mm/* syscalls > (e.g. readahead()). > > The virtue of merging stacked file operations was getting rid of many > VFS hacks, but the last chapter has not been written yet, or to put it > in Al's words [1]: > > "Uses of ->vm_file (and rules for those) are too convoluted to untangle > at the moment. I still would love to get that straightened out, but > it's not this cycle fodder, more's the pity..." > > So I expect this cycle will require adding a few temporary mm > syscall hacks, in the hope they will be more short lived than the > departing VFS hacks. Yuck. > > So, please enlighten me with a documentation patch before changing > > any XFS code: What is the new behaviour and the rules we must follow > > for calling file_inode()? > > > > Actually, I believe the intention was that fs developers don't need to worry > about using file_inode() at all, because before the change we had: > > - file passed in to xfs f_op's and a_ops is either overlay file OR xfs file > - file_inode() of either overlay/xfs file in xfs context is always xfs inode > - file->f_path in xfs context, BTW, was overlay path and therefore, > XFS_IOC_OPEN_BY_HANDLE was slightly broken in overlayfs over xfs, > as were several other fs specific ioctls > > After stacked file operations change we should have the rules: > > 1. file passed in to xfs f_op's is always xfs file (*) > 2. file passed in to xfs a_ops is always xfs file (**) > 3. file_inode() of overlay file is an overlay inode > > (*) as explicit file argument or on iocb->ki_filp > (**) as explicit file argument or on ->vm_file > > I believe that swapfile leaking an overlay file into xfs was an oversight, > that is breaking rule #2. Please add documentation explaining how this all works so pepole don't have to ask every time we come across a bug as a result of a missing/incorrect translation in overlay. Cheers, Dave. -- Dave Chinner david@xxxxxxxxxxxxx