On Thu, Jun 04, 2015 at 10:39:53AM +0200, Jan Kara wrote: > Hello, > > we were recently debugging an issue where customer was hitting warnings > in xfs_vm_releasepage() which was complaining that the page it was called > for has delay-allocated buffers. After some debugging we realized that > indeed try_to_release_page() call from shrink_active_list() can happen for > a page in arbitrary state (that call happens only if > buffer_heads_over_limit is set so that is the reason why we normally don't > see that). > > Hence comes my question: What are the rules for when can ->releasepage() be > called? And what is the expected outcome? We are certainly guaranteed to > hold page lock. try_to_release_page() also makes sure the page isn't under > writeback. But what is ->releasepage() supposed to do with a dirty page? According to the comments on try_to_free_page: * The @gfp_mask argument specifies whether I/O may be performed to release * this page (__GFP_IO), and whether the call may block (__GFP_WAIT & __GFP_FS). It seems to me like we're supposed to try to clean it if possible, but it is clear that it can be called from contexts that don't allow IO and/or blocking so filesystems have to be able to reject dirty pages and the callers must be able to handle it. IOWs, calling ->releasepage() with a dirty page must be allowed to fail. IMO, trying to free a page that is dirty indicates poor reclaim selection, especially in the case of shrink_active_list() which is likely to be called in a GFP_NOFS context and so it shouldn't be asking for dirty pages to be reclaimed because we simply won't do the IO (i.e. same conditions where we warn and reject ->writepage calls). > Generally IFAIU we aren't supposed to discard dirty data but I wouldn't bet > on all filesystems getting it right because the common call paths make sure > page is clean. *nod* > I would almost say we should enforce !PageDirty in > try_to_release_page() if it was not for that ext3 nastyness of cleaning > buffers under a dirty page - hum, but maybe the right answer for that is > ripping ext3 out of tree (which would also allow us to get rid of some code > in the blocklayer for bouncing journaled data buffers when stable writes > are required). XFS is behaving correctly in that it rejects the attempt to release the dirty page, but the warning is there to indicate something higher up is doing something unusual. I'm happy for the higher up code to be fixed never to try to release a dirty page, even if that means finally removing the ext3 code base from the tree.. FWIW, dirty pages and therefore pinned bufferheads are limited by dirty thresholds, so really removing bufferheads from clean pages is really all that is necessary to bring it back below thresholds. Cheers, Dave. PS: This is yet another reason why I'm working towards removing bufferheads from XFS.... -- Dave Chinner david@xxxxxxxxxxxxx -- To unsubscribe from this list: send the line "unsubscribe linux-fsdevel" in the body of a message to majordomo@xxxxxxxxxxxxxxx More majordomo info at http://vger.kernel.org/majordomo-info.html