Re: [PATCH] CIFS: Use invalidate_inode_pages2 instead of invalidate_remote_inode (try #4)

Jeff Layton <jlayton@xxxxxxxxxx> · Wed, 23 Mar 2011 13:56:26 -0400

On Wed, 23 Mar 2011 12:32:03 -0500
Steve French <smfrench@xxxxxxxxx> wrote:

> On Wed, Mar 23, 2011 at 11:14 AM, Jeff Layton <jlayton@xxxxxxxxxx> wrote:
> > On Wed, 23 Mar 2011 12:27:31 +0300
> > Pavel Shilovsky <piastry@xxxxxxxxxxx> wrote:
> >
> >> 2011/3/23 Steve French <smfrench@xxxxxxxxx>:
> >> > Pavel,
> >> > Jeff and I talked about invalidate_mapping and its uses this
> >> > afternoon. ÂA few comments:
> >> > 1) Â Take a look at what happens on filemap_fdatawait call in
> >> > invalidate_mapping to see if it makes sense to return errors through
> >> > back to some of the callers of invalidate_mapping, in particular,
> >> > strict_fsync. ÂIf we can't write out the file data (ENOSPC, or host
> >> > down etc.), we want to make sure that the return code gets sent back
> >> > on any calls that can reasonably expect such an error.
> >>
> >> vfs_fsync_range has already done filemap_fdatawait call - so, in this
> >> case there is no need to do it again in cifs_invalidate_mapping. The
> >> only reason for calling this is to invalidate_inode_pages2 but any
> >> error there shouldn't affect fsync behavior, as I think.
> >>
> >> > 2) If invalidate_inode_pages2 fails (e.g. with EBUSY, because one of
> >> > the pages couldn't be freed from the mapping because it just got
> >> > redirtied right after we flushed it the line before) we set the
> >> > mapping to invalid but don't check
> >> > Â Âcifs_i->invalid_mapping
> >> > in many places. ÂShould we add checks for cifs_i->invalid_mapping in
> >> > more places?
> >>
> >> The one place where we should add such a check is read call, but it
> >> needs cifs_revalidate_file instead (that I am going to provide next)
> >> before generic_file_aio_read. In this case cifs_revalidate will check
> >> for invalid_mapping and needs to return a error if we could not
> >> invalidate all inode pages in cifs_invalidate_mapping (because it
> >> returns wrong data to the read call). But as you noticed lseek
> >> shouldn't think about this error and it uses cifs_revalidate_file too.
> >>
> >> So, we may add extra check for -EBUSY error code in callers of
> >> cifs_revalidate_{dentry,file} and cifs_invaliadate_mapping and
> >> separate them into two groups:
> >> 1) that aware about -EBUSY error code and return a error the it's caller.
> >> these are: cifs_d_revalidate, cifs_file_aio_read (future
> >> implementation), cifs_file_strict_mmap.
> >> 2) that doesn't aware about it and return ok in this case.
> >> these are: cifs_getattrs, cifs_lseek, cifs_fsync.
> >>
> >> Thoughts?
> >>
> >
> > The main reason I think we need to reconsider that error is that I
> > spent several months tracking down a rather nasty data corruption bug
> > relating to mmap on NFS in RHEL5 a few years ago:
> >
> > Â Âhttps://bugzilla.redhat.com/show_bug.cgi?id=435291
> >
> > Part of the problem there was that the NFS client ignored the return
> > code from invalidate_inode_pages2. The other part of the problem was a
> > lack of synchronization between mmap calls and the page fault handler.
> >
> > Needless to say, this was not a fun problem to track down. I think you
> > need to be very careful about ignoring errors from
> > invalidate_inode_pages2 and think carefully about what a failure there
> > means for all cases.
> 
> I agree - need to be careful, but IIRC the NFS problem would be that
> it has a launder_page method which is returning an error through
> invalidate_inode_pages2 while in the cifs case the data is forced to
> be written out through filemap_fdatawrite or filemap_write_and_wait.
> 

No. RHEL5 at that time didn't have a launder page method (it does now,
but that's another story).

-- 
Jeff Layton <jlayton@xxxxxxxxxx>
--
To unsubscribe from this list: send the line "unsubscribe linux-cifs" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at  http://vger.kernel.org/majordomo-info.html