On Thu, May 31, 2018 at 3:08 AM, Jan Kara <jack@xxxxxxx> wrote: [..] >> >> As far as I can see reflink+dax would require teaching kernel code >> >> paths that ->mapping may not be a singular relationship. Something >> >> along the line's of what Jerome was presenting at LSF to create a >> >> special value to indicate, "call back into the filesystem (or the page >> >> owner)" to perform this operation. >> >> >> >> In the meantime the kernel crashes when userspace accesses poisoned >> >> pmem via DAX. I assume that reworking rmap for the dax+reflink case >> >> should not block dax poison handling? Yell if you disagree. >> > >> > The thing is, up until get_user_pages() vs truncate series ("fs, dax: use >> > page->mapping to warn if truncate collides with a busy page" in >> > particular), DAX was perfectly fine with reflinks since we never needed >> > page->mapping. >> >> Sure, but if this rmap series had come first I still would have needed >> to implement ->mapping. So unless we invent a general ->mapping >> replacement and switch all mapping users, it was always going to >> collide with DAX eventually. > > Yes, my comment was more in direction that life would be easier if we could > keep DAX without rmap support but I guess that's just too cumbersome. I'm open to deeper reworks later. As it stands currently just calling madvise(..., MADV_HWPOISON) on a DAX mapping causes a page reference to be leaked because the madvise code has no clue about proper handling of DAX pages, and consuming real poison leads to a fatal condition / reset. I think fixing those bugs with the current rmap dependencies on ->mapping and ->index is step1 and step2 is a longer term solution for dax rmap that does also allow reflink. I.e. it's an rmap > reflink argument for now. > >> > Now this series adds even page->index dependency which makes >> > life for rmap with reflinks even harder. So if nothing else we should at >> > least make sure reflinked filesystems cannot be mounted with dax mount >> > option for now and seriously start looking into how to implement rmap with >> > reflinked files for DAX because this noticeably reduces its usefulness. >> >> This restriction is already in place. In xfs_reflink_remap_range() we have: >> >> /* Don't share DAX file data for now. */ >> if (IS_DAX(inode_in) || IS_DAX(inode_out)) >> goto out_unlock; > > OK, good. > >> All this said, perhaps we don't need to set ->link, it would just mean >> a wider search through the rmap tree to find if the given page is >> mapped. So, I think I can forgo setting ->link if I teach the rmap >> code to search the entire ->mapping. > > I guess you mean ->index in the above. And now when thinking about it I don't > think it is worth the complications to avoid using ->index. Ok, and yes I meant ->index... sorry, too much struct page on the brain presently.