On Fri, Nov 11, 2022 at 10:24:02AM -0800, Yosry Ahmed wrote: > On Fri, Nov 11, 2022 at 2:08 AM Michal Koutný <mkoutny@xxxxxxxx> wrote: > > > > On Thu, Nov 10, 2022 at 11:35:34AM -0800, Yosry Ahmed <yosryahmed@xxxxxxxxxx> wrote: > > > OTOH, it will reduce the page counters, so if userspace is relying on > > > memory.current to gauge how much reclaim they want to do, it will make > > > it "appear" like the usage dropped. > > > > Assuming memory.current is used to drive the proactive reclaim, then > > this patch makes some sense (and is slightly better than draining upon > > every memory.current read(2)). > > I am not sure honestly. This assumes memory.reclaim is used in > response to just memory.current, which is not true in the cases I know > about at least. > > If you are using memory.reclaim merely based on memory.current, to > keep the usage below a specified number, then memory.high might be a > better fit? Unless this goal usage is a moving target maybe and you > don't want to keep changing the limits but I don't know if there are > practical use cases for this. > > For us at Google, we don't really look at the current usage, but > rather on how much of the current usage we consider "cold" based on > page access bit harvesting. I suspect Meta is doing something similar > using different mechanics (PSI). I am not sure if memory.current is a > factor in either of those use cases, but maybe I am missing something > obvious. Yeah, Meta drives proactive reclaim through psi feedback. We do consult memory.current to enforce minimums, just for safety reasons. But that's are very conservative parameter, the percpu fuzz doesn't make much of a difference there; certainly, we haven't had any problems with memory.reclaim not draining stocks. So I would agree that it's not entirely obvious why stocks should be drained as part of memory.reclaim. I'm curious what led to the patch.