On Mon, 9 Sep 2019 11:56:50 +0530 Anshuman Khandual <anshuman.khandual@xxxxxxx> wrote: [..] > > > > Hmm, I simply used this on my system to make pud_clear_tests() work, not > > sure if it works on all archs: > > > > pud_val(*pudp) |= RANDOM_NZVALUE; > > Which compiles on arm64 but then fails on x86 because of the way pmd_val() > has been defined there. on arm64 and s390 (with many others) pmd_val() is > a macro which still got the variable that can be used as lvalue but that is > not true for some other platforms like x86. > > arch/arm64/include/asm/pgtable-types.h: #define pmd_val(x) ((x).pmd) > arch/s390/include/asm/page.h: #define pmd_val(x) ((x).pmd) > arch/x86/include/asm/pgtable.h: #define pmd_val(x) native_pmd_val(x) > > static inline pmdval_t native_pmd_val(pmd_t pmd) > { > return pmd.pmd; > } > > Unless I am mistaken, the return value from this function can not be used as > lvalue for future assignments. > > mm/arch_pgtable_test.c: In function ‘pud_clear_tests’: > mm/arch_pgtable_test.c:156:17: error: lvalue required as left operand of assignment > pud_val(*pudp) |= RANDOM_ORVALUE; > ^~ > AFAICS pxx_val() were never intended to be used as lvalue and using it that way > might just happen to work on all those platforms which define them as macros. > They meant to just provide values for an entry as being determined by the platform. > > In principle pxx_val() on an entry was not supposed to be modified directly from > generic code without going through (again) platform helpers for any specific state > change (write, old, dirty, special, huge etc). The current use case is a deviation > for that. > > I originally went with memset() just to load up the entries with non-zero value so > that we know pxx_clear() are really doing the clearing. The same is being followed > for all pxx_same() checks. > > Another way for fixing the problem would be to mark them with known attributes > like write/young/huge etc instead which for sure will create non-zero entries. > We can do that for pxx_clear() and pxx_same() tests and drop RANDOM_NZVALUE > completely. Does that sound good ? Umm, not really. Those mkwrite/young/huge etc. helpers do only exist for page table levels where we can also have large mappings, at least on s390. Also, we do (on s390) again check for certain sanity before actually setting the bits. Good news is that at least for the pxx_same() checks the memset() is no problem, because pxx_same() does not do any checks other than the same check. For the pxx_clear_tests(), maybe it could be an option to put them behind the pxx_populate_tests(), and rely on them having properly populated (non-clear) values after that? [...] > > > > Actually, using get_unmapped_area() as suggested by Kirill could also > > solve this issue. We do create a new mm with 3-level page tables on s390, > > and the dynamic upgrade to 4 or 5 levels is then triggered exactly by > > arch_get_unmapped_area(), depending on the addr. But I currently don't > > see how / where arch_get_unmapped_area() is set up for such a dummy mm > > created by mm_alloc(). > > Normally they are set during program loading but we can set it up explicitly > for the test mm_struct if we need to but there are some other challenges. > > load_[aout|elf|flat|..]_binary() > setup_new_exec() > arch_pick_mmap_layout(). > > I did some initial experiments around get_unmapped_area(). Seems bit tricky > to get it working on a pure 'test' mm_struct. It expects a real user context > in the form of current->mm. Yes, that's where I stopped because it looked rather complicated :-) Not sure why Kirill suggested it initially, but if using get_unmapped_area() would only be necessary to get properly initialized page table levels on s390, you could also defer this to a later add-on patch. Regards, Gerald