On Sat, 2018-05-19 at 19:19 -0400, Theodore Y. Ts'o wrote: > On Sat, May 19, 2018 at 08:27:00AM -0700, Darrick J. Wong wrote: > > From: Darrick J. Wong <darrick.wong@xxxxxxxxxx> > > > > In inode_init_always(), we clear the inode mapping flags, which clears > > any retained error (AS_EIO, AS_ENOSC) bits. Unfortunately, we do not > > also clear wb_err, which means that old mapping errors can leak through > > to new inodes. > > > > This is crucial for the XFS inode allocation path because we recycle old > > in-core inodes and we do not want error state from an old file to leak > > into the new file. This bug was discovered by running generic/036 and > > generic/047 in a loop and noticing that the EIOs generated by the > > collision of direct and buffered writes in generic/036 would survive the > > remount between 036 and 047, and get reported to the fsyncs (on > > different files on a reformatted fs!) in generic/047. > > > > Since we're changing the semantics of inode_init_always, we must also > > change xfs_reinit_inode to retain the writeback error state when we go > > to recover an inode that has been torn down in the vfs but not yet > > disposed of by XFS. > > > > Signed-off-by: Darrick J. Wong <darrick.wong@xxxxxxxxxx> > > This may fix the generic/047 failure, but alas, it does not address > the shared/298 failure. > > Jeff's theory that we may need to clear the errseq_t information after > detaching the loop device makes sense to me; and in the case of the > loop device, we wouldn't be initializing the inode, so your patch > wouldn't do anything about that particular case. > > I poked at this a bit this morning and found that if I ran generic/361 twice in a row that I'd see a failure similar to what you were seeing. This patch seems to fix it for me. Ted, could you test this against your reproducer? If this works for you I'll plan to flesh out the patch description and get this into -next and eventually to Linus before the next release. Thanks, Jeff -------------------------8<-------------------- [PATCH] loop: clear wb_err in bd_inode when detaching backing file Signed-off-by: Jeff Layton <jlayton@xxxxxxxxxx> --- drivers/block/loop.c | 1 + 1 file changed, 1 insertion(+) diff --git a/drivers/block/loop.c b/drivers/block/loop.c index 5d4e31655d96..55cf554bc914 100644 --- a/drivers/block/loop.c +++ b/drivers/block/loop.c @@ -1068,6 +1068,7 @@ static int loop_clr_fd(struct loop_device *lo) if (bdev) { bdput(bdev); invalidate_bdev(bdev); + bdev->bd_inode->i_mapping->wb_err = 0; } set_capacity(lo->lo_disk, 0); loop_sysfs_exit(lo); -- 2.17.0