> > > > Looking at this code some more.. How is it even correct? > > > > > > > > 1633 if (!isolate_lru_page(head)) { > > > > 1634 list_add_tail(&head->lru, &cma_page_list); > > > > > > > > Here we are only running under the read side of the mmap sem so multiple > > > > GUPs can be calling that sequence in parallel. I don't see any > > > > obvious exclusion that will prevent corruption of head->lru. The first > > > > GUP thread to do isolate_lru_page() will ClearPageLRU() and the second > > > > GUP thread will be a NOP for isolate_lru_page(). > > > > > > > > They will both race list_add_tail and other list ops. That is not OK. > > > > > > Good question. I studied it, and I do not see how this is OK. Worse, > > > this race is also exposable as a syscall instead of via driver: two > > > move_pages() run simultaneously. Perhaps in other places? > > > > > > move_pages() > > > kernel_move_pages() > > > mmget() > > > do_pages_move() > > > add_page_for_migratio() > > > mmap_read_lock(mm); > > > list_add_tail(&head->lru, pagelist); <- Not protected > > > > When this was CMA only it might have been rarer to trigger, but this > > move stuff sounds like it makes it much more broadly, eg on typical > > servers with RDMA exposed/etc > > > > Seems like it needs fixing as part of this too :\ > > Just to clarify the stack that I showed above is outside of gup, it is > the same issue that you pointed out that happens elsewhere. I suspect > there might be more. All of them should be addressed together. Hi Jason, I studied some more, and I think this is not a race: list_add_tail(&head->lru, &cma_page_list) is called only when isolate_lru_page(head) succeeds. isolate_lru_page(head) succeeds only when PageLRU(head) is true. However, in this function we also clear LRU flag before returning success. This means, that if we race with another thread, the other thread won't get to unprotected list_add_tail(&head->lru, &cma_page_list) until head is is back on LRU list. Please let me know if I am missing anything. Thank you, Pasha