Thanks for ccing Oscar, Andrew. After getting this patch into our internal production environment, I recently found a regression bug introduced by my commit a0157a2c735b ("mm/khugepaged: recover from poisoned file-backed memory"). Given it is only in mm-unstable, I wonder should I put out a v7 with the fix, or should I make it a new and separate commit? Sorry for the bug. On Mon, Nov 7, 2022 at 12:53 PM Andrew Morton <akpm@xxxxxxxxxxxxxxxxxxxx> wrote: > > On Sun, 6 Nov 2022 18:53:57 -0800 Jiaqi Yan <jiaqiyan@xxxxxxxxxx> wrote: > > > Memory DIMMs are subject to multi-bit flips, i.e. memory errors. > > As memory size and density increase, the chances of and number of > > memory errors increase. The increasing size and density of server > > RAM in the data center and cloud have shown increased uncorrectable > > memory errors. There are already mechanisms in the kernel to recover > > from uncorrectable memory errors. This series of patches provides > > the recovery mechanism for the particular kernel agent khugepaged > > when it collapses memory pages. > > Thanks, I'll toss v6 into mm-unstable for some testing, pending further review. > > When resending a patchset, please try to also cc the people who have > commented on previous versions. >