Re: [PATCH RFC] mm: arm64: advance pte for contpte_ptep_set_access_flags

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



On Thu, Sep 5, 2024 at 3:50 AM Ryan Roberts <ryan.roberts@xxxxxxx> wrote:
>
> On 04/09/2024 16:13, Will Deacon wrote:
> > (Adding Ryan, since you're asking him a question!)
>
> Thanks, Will!
>
> Afraid I don't do a good job of monitoring the list; I'm guessing there are
> automated ways to filter for mentions of my name so I catch this sort of thing
> in future?

It's not your fault. I just realized that, for some unknown reason, I forgot to
CC you.

>
> >
> > On Sat, Aug 31, 2024 at 10:06:40PM +1200, Barry Song wrote:
> >> On Sat, Aug 31, 2024 at 9:54 PM David Hildenbrand <david@xxxxxxxxxx> wrote:
> >>>
> >>> On 31.08.24 10:35, Barry Song wrote:
> >>>> From: Barry Song <v-songbaohua@xxxxxxxx>
> >>>>
> >>>> Hi Ryan, David,
> >>>> it seems contpte_ptep_set_access_flags() has never advanced
> >>>> pte pfn, and it is setting all entries' pfn to the first
> >>>> subpage. But I feel quite strange we never have a bug reported.
> >>>> Am I missing something?
> >>>>
> >>>> Fixes: 4602e5757bcc ("arm64/mm: wire up PTE_CONT for user mappings")
> >>>> Cc: Ard Biesheuvel <ardb@xxxxxxxxxx>
> >>>> Cc: John Hubbard <jhubbard@xxxxxxxxxx>
> >>>> Cc: Mark Rutland <mark.rutland@xxxxxxx>
> >>>> Cc: Catalin Marinas <catalin.marinas@xxxxxxx>
> >>>> Cc: David Hildenbrand <david@xxxxxxxxxx>
> >>>> Cc: Will Deacon <will@xxxxxxxxxx>
> >>>> Signed-off-by: Barry Song <v-songbaohua@xxxxxxxx>
> >>>> ---
> >>>>   arch/arm64/mm/contpte.c | 4 +++-
> >>>>   1 file changed, 3 insertions(+), 1 deletion(-)
> >>>>
> >>>> diff --git a/arch/arm64/mm/contpte.c b/arch/arm64/mm/contpte.c
> >>>> index a3edced29ac1..10dcd2641184 100644
> >>>> --- a/arch/arm64/mm/contpte.c
> >>>> +++ b/arch/arm64/mm/contpte.c
> >>>> @@ -421,8 +421,10 @@ int contpte_ptep_set_access_flags(struct vm_area_struct *vma,
> >>>>               ptep = contpte_align_down(ptep);
> >>>>               start_addr = addr = ALIGN_DOWN(addr, CONT_PTE_SIZE);
> >>>>
> >>>> -             for (i = 0; i < CONT_PTES; i++, ptep++, addr += PAGE_SIZE)
> >>>> +             for (i = 0; i < CONT_PTES; i++, ptep++, addr += PAGE_SIZE) {
> >>>>                       __ptep_set_access_flags(vma, addr, ptep, entry, 0);
> >>>> +                     entry = pte_advance_pfn(entry, 1);
> >>>> +             }
> >>>>
> >>>>               if (dirty)
> >>>>                       __flush_tlb_range(vma, start_addr, addr,
> >>>
> >>> Taking a closer look at __ptep_set_access_flags(), there is:
> >>>
> >>> /* only preserve the access flags and write permission *
> >>> pte_val(entry) &= PTE_RDONLY | PTE_AF | PTE_WRITE | PTE_DIRTY;
> >>>
> >>> So it looks like it doesn't need the PFN?
>
> Correct, I don't believe there is a bug here; __ptep_set_access_flags() only
> consumes the access flags from entry.
>
> >>
> >> right.
> >>
> >>>
> >>>
> >>> OTOH, there is the initial:
> >>>
> >>>
> >>> if (pte_same(pte, entry))
> >>>         return 0;
> >>>
> >>> check that might accelerate things.
>
> There is an equivalent check in contpte_ptep_set_access_flags() which is
> checking for the whole contpte block and returning early if so. So I don't think
> there is a problem here either.
>
> >>>
> >>> So unless I am missing something, this works as expected? (and if the
> >>> pte_same() would frequently be taken with your change would be worthwile
> >>> to optimize)
> >>
> >>
> >> Right. From page 1 to page (nr_pages - 1), we consistently get FALSE
> >> for pte_same().
> >> This seems quite strange. I think we might need to "fix" it, at least
> >> for the sake of code
> >> semantics. on the other hand, if pte_same() is not important, it
> >> should be dropped.
> >>
> >> Hi Ryan,
> >> what is your take on this?
>
> The code is correct and working as intended, AFAICT. But I accept that this is
> not exactly obvious. I'd be happy to Rb your proposed change if you feel it
> clarifies things.

If this is the case, I'd rather add some comments instead in v2?

diff --git a/arch/arm64/mm/contpte.c b/arch/arm64/mm/contpte.c
index a3edced29ac1..55107d27d3f8 100644
--- a/arch/arm64/mm/contpte.c
+++ b/arch/arm64/mm/contpte.c
@@ -421,6 +421,12 @@ int contpte_ptep_set_access_flags(struct
vm_area_struct *vma,
                ptep = contpte_align_down(ptep);
                start_addr = addr = ALIGN_DOWN(addr, CONT_PTE_SIZE);

+               /*
+                * We are not advancing entry because __ptep_set_access_flags()
+                * only consumes access flags from entry. And since we
have checked
+                * for the whole contpte block and returned early, pte_same()
+                * within __ptep_set_access_flags() is likely false.
+                */
                for (i = 0; i < CONT_PTES; i++, ptep++, addr += PAGE_SIZE)
                        __ptep_set_access_flags(vma, addr, ptep, entry, 0);
-- 
2.39.3 (Apple Git-146)

Thanks
Barry





[Index of Archives]     [Linux ARM Kernel]     [Linux ARM]     [Linux Omap]     [Fedora ARM]     [IETF Annouce]     [Bugtraq]     [Linux OMAP]     [Linux MIPS]     [eCos]     [Asterisk Internet PBX]     [Linux API]

  Powered by Linux