On Tue, Jul 22, 2014 at 5:08 PM, Michal Hocko <mhocko@xxxxxxx> wrote: > On Sat 19-07-14 13:39:11, Johannes Weiner wrote: >> On Fri, Jul 18, 2014 at 05:12:54PM +0200, Miklos Szeredi wrote: >> > On Fri, Jul 18, 2014 at 4:45 PM, Johannes Weiner <hannes@xxxxxxxxxxx> wrote: >> > >> > > I assumed the source page would always be new, according to this part >> > > in fuse_try_move_page(): >> > > >> > > /* >> > > * This is a new and locked page, it shouldn't be mapped or >> > > * have any special flags on it >> > > */ >> > > if (WARN_ON(page_mapped(oldpage))) >> > > goto out_fallback_unlock; >> > > if (WARN_ON(page_has_private(oldpage))) >> > > goto out_fallback_unlock; >> > > if (WARN_ON(PageDirty(oldpage) || PageWriteback(oldpage))) >> > > goto out_fallback_unlock; >> > > if (WARN_ON(PageMlocked(oldpage))) >> > > goto out_fallback_unlock; >> > > >> > > However, it's in the page cache and I can't really convince myself >> > > that it's not also on the LRU. Miklos, I have trouble pinpointing >> > > where oldpage is instantiated exactly and what state it might be in - >> > > can it already be on the LRU? >> > >> > oldpage comes from ->readpages() (*NOT* ->readpage()), i.e. readahead. >> > >> > AFAICS it is added to the LRU in read_cache_pages(), so it looks like >> > it is definitely on the LRU at that point. > > OK, so my understanding of the code was wrong :/ and staring at it for > quite a while didn't help much. The fuse code is so full of indirection > it makes my head spin. Definitely needs a rewrite. But forget the complexities for the moment and just consider this single case: ->readpages() is called to do some readahead, pages are locked, added to the page cache and, AFAICS, charged to a memcg (in add_to_page_cache_lru()). - fuse sends a READ request to userspace and it gets a reply with splice(... SPLICE_F_MOVE). What this means that a bunch of pages of indefinite origin are to replace (if possible) the pages already in the page cache. If not possible, for some reason, it falls back to copying the contents. So, AFAICS, the oldpage and the newpage can be charged to a different memcg. > > How should we test this code path, Miklos? fusexmp_fh -osplice_write,splice_move /mnt/fuse This will mirror / under /mnt/fuse and will use splice to move data from the underlying filesystem to the fuse filesystem, hopefully. It would be useful if it had some instrumentation telling us the actual number of pages successfully moved, but it doesn't have that yet. Thanks, Miklos -- To unsubscribe, send a message with 'unsubscribe linux-mm' in the body to majordomo@xxxxxxxxx. For more info on Linux MM, see: http://www.linux-mm.org/ . Don't email: <a href=mailto:"dont@xxxxxxxxx"> email@xxxxxxxxx </a>