Hi, Jagdish, On Thu, 2022-04-14 at 18:53 +0530, Jagdish Gediya wrote: > On Fri, Apr 08, 2022 at 03:12:20PM +0800, Huang Ying wrote: [snip] > > + > > +static int numa_hint_fault_latency(struct page *page) > > +{ > > + int last_time, time; > > + > > + time = jiffies_to_msecs(jiffies); > > + last_time = xchg_page_access_time(page, time); > > + > > + return (time - last_time) & PAGE_ACCESS_TIME_MASK; > > This code can possibly consider cold page as hot, > > Assume, > > LAST_CPUPID_SHIFT = 12 > PAGE_ACCESS_TIME_BUCKETS = 0 > sysctl_numa_balancing_hot_threshold = 1000 > > Assume while changing pte, > jiffies_to_msecs(jiffies) = 0xAABB0100 > > So value saved in page->flags will be lowest 12 bits of 0xAABB0100 > which is 0x100. > > Assume when numa_hint_fault_latency() gets called, > time = jiffies_to_msecs(jiffies) = 0xAACC0100 > > So, time = 0xAACC0100, and last_time = 0x100, > time - last_time = 0xAACC0100 - 0x100 = 0xAACC0000 > 0xAACC0000 & PAGE_ACCESS_TIME_MASK = 0xAACC0000 & ((1 << 12) - 1) = 0 > > so the return value of this function is 0, the code will consider it as > hot page but it is cold page because actual difference is > 0xAACC0100 - 0xAABB0100 = 110000 ms > Yes. This is possible. > There may be more such scenarios. What do you think? The algorithm just works statistically correct. That is, for really hot pages, their hint page fault latency will be short and we can promote it when they are accessed. For cold pages, it's still possible for them to be identified as hot pages. But the possibility is much lower than that of the hot pages. We can try to improve further here. But as the first step, I want to keep the algorithm as simple as possible. Then we can try improve it step by step and show benefit in each step to justify the further optimization. > > +} > > + Best Regards, Huang, Ying