Am Mi., 28. Aug. 2019 um 16:23 Uhr schrieb Darrick J. Wong <darrick.wong@xxxxxxxxxx>: > On Wed, Aug 21, 2019 at 10:23:49PM +0200, Andreas Grünbacher wrote: > > Hi Darrick, > > > > Am So., 2. Dez. 2018 um 19:13 Uhr schrieb Darrick J. Wong > > <darrick.wong@xxxxxxxxxx>: > > > From: Darrick J. Wong <darrick.wong@xxxxxxxxxx> > > > > > > In commit 4721a601099, we tried to fix a problem wherein directio reads > > > into a splice pipe will bounce EFAULT/EAGAIN all the way out to > > > userspace by simulating a zero-byte short read. This happens because > > > some directio read implementations (xfs) will call > > > bio_iov_iter_get_pages to grab pipe buffer pages and issue asynchronous > > > reads, but as soon as we run out of pipe buffers that _get_pages call > > > returns EFAULT, which the splice code translates to EAGAIN and bounces > > > out to userspace. > > > > > > In that commit, the iomap code catches the EFAULT and simulates a > > > zero-byte read, but that causes assertion errors on regular splice reads > > > because xfs doesn't allow short directio reads. This causes infinite > > > splice() loops and assertion failures on generic/095 on overlayfs > > > because xfs only permit total success or total failure of a directio > > > operation. The underlying issue in the pipe splice code has now been > > > fixed by changing the pipe splice loop to avoid avoid reading more data > > > than there is space in the pipe. > > > > > > Therefore, it's no longer necessary to simulate the short directio, so > > > remove the hack from iomap. > > > > > > Fixes: 4721a601099 ("iomap: dio data corruption and spurious errors when pipes fill") > > > Reported-by: Amir Goldstein <amir73il@xxxxxxxxx> > > > Reviewed-by: Christoph Hellwig <hch@xxxxxx> > > > Signed-off-by: Darrick J. Wong <darrick.wong@xxxxxxxxxx> > > > --- > > > v2: split into two patches per hch request > > > --- > > > fs/iomap.c | 9 --------- > > > 1 file changed, 9 deletions(-) > > > > > > diff --git a/fs/iomap.c b/fs/iomap.c > > > index 3ffb776fbebe..d6bc98ae8d35 100644 > > > --- a/fs/iomap.c > > > +++ b/fs/iomap.c > > > @@ -1877,15 +1877,6 @@ iomap_dio_rw(struct kiocb *iocb, struct iov_iter *iter, > > > dio->wait_for_completion = true; > > > ret = 0; > > > } > > > - > > > - /* > > > - * Splicing to pipes can fail on a full pipe. We have to > > > - * swallow this to make it look like a short IO > > > - * otherwise the higher splice layers will completely > > > - * mishandle the error and stop moving data. > > > - */ > > > - if (ret == -EFAULT) > > > - ret = 0; > > > break; > > > } > > > pos += ret; > > > > I'm afraid this breaks the following test case on xfs and gfs2, the > > two current users of iomap_dio_rw. > > Hmm, I had kinda wondered if regular pipes still needed this help. > Evidently we don't have a lot of splice tests in fstests. :( So what do you suggest as a fix? > > Here, the splice system call fails with errno = EAGAIN when trying to > > "move data" from a file opened with O_DIRECT into a pipe. > > > > The test case can be run with option -d to not use O_DIRECT, which > > makes the test succeed. > > > > The -r option switches from reading from the pipe sequentially to > > reading concurrently with the splice, which doesn't change the > > behavior. > > > > Any thoughts? > > This would be great as an xfstest! :) Or perhaps something generalized from it. > Do you have one ready to go, or should I just make one from the source > code? The bug originally triggered in our internal cluster test system and I've recreated the test case I've included from the strace. That's all I have for now; feel free to take it, of course. It could be that the same condition can be triggered with one of the existing utilities (fio/fsstress/...). Thanks, Andreas