On Tue 22-07-14 17:44:43, Miklos Szeredi wrote: > On Tue, Jul 22, 2014 at 5:08 PM, Michal Hocko <mhocko@xxxxxxx> wrote: > > On Sat 19-07-14 13:39:11, Johannes Weiner wrote: > >> On Fri, Jul 18, 2014 at 05:12:54PM +0200, Miklos Szeredi wrote: > >> > On Fri, Jul 18, 2014 at 4:45 PM, Johannes Weiner <hannes@xxxxxxxxxxx> wrote: > >> > > >> > > I assumed the source page would always be new, according to this part > >> > > in fuse_try_move_page(): > >> > > > >> > > /* > >> > > * This is a new and locked page, it shouldn't be mapped or > >> > > * have any special flags on it > >> > > */ > >> > > if (WARN_ON(page_mapped(oldpage))) > >> > > goto out_fallback_unlock; > >> > > if (WARN_ON(page_has_private(oldpage))) > >> > > goto out_fallback_unlock; > >> > > if (WARN_ON(PageDirty(oldpage) || PageWriteback(oldpage))) > >> > > goto out_fallback_unlock; > >> > > if (WARN_ON(PageMlocked(oldpage))) > >> > > goto out_fallback_unlock; > >> > > > >> > > However, it's in the page cache and I can't really convince myself > >> > > that it's not also on the LRU. Miklos, I have trouble pinpointing > >> > > where oldpage is instantiated exactly and what state it might be in - > >> > > can it already be on the LRU? > >> > > >> > oldpage comes from ->readpages() (*NOT* ->readpage()), i.e. readahead. > >> > > >> > AFAICS it is added to the LRU in read_cache_pages(), so it looks like > >> > it is definitely on the LRU at that point. > > > > OK, so my understanding of the code was wrong :/ and staring at it for > > quite a while didn't help much. The fuse code is so full of indirection > > it makes my head spin. > > Definitely needs a rewrite. But forget the complexities for the > moment and just consider this single case: > > ->readpages() is called to do some readahead, pages are locked, added > to the page cache and, AFAICS, charged to a memcg (in > add_to_page_cache_lru()). > > - fuse sends a READ request to userspace and it gets a reply with > splice(... SPLICE_F_MOVE). What this means that a bunch of pages of > indefinite origin are to replace (if possible) the pages already in > the page cache. If not possible, for some reason, it falls back to > copying the contents. So, AFAICS, the oldpage and the newpage can be > charged to a different memcg. OK, thanks for the clarification. I had this feeling but couldn't wrap my head around the indirection of the code. It seems that checkig PageCgroupUsed(new) and bail out early in mem_cgroup_migrate should just work, no? > > How should we test this code path, Miklos? > > fusexmp_fh -osplice_write,splice_move /mnt/fuse > > This will mirror / under /mnt/fuse and will use splice to move data > from the underlying filesystem to the fuse filesystem, hopefully. > > It would be useful if it had some instrumentation telling us the > actual number of pages successfully moved, but it doesn't have that > yet. Thanks I will try to play with this tomorrow when I have more time. -- Michal Hocko SUSE Labs -- To unsubscribe, send a message with 'unsubscribe linux-mm' in the body to majordomo@xxxxxxxxx. For more info on Linux MM, see: http://www.linux-mm.org/ . Don't email: <a href=mailto:"dont@xxxxxxxxx"> email@xxxxxxxxx </a>