On Fri, Jul 19, 2024 at 11:57 AM Pedro Falcato <pedro.falcato@xxxxxxxxx> wrote:
>
> On Mon, Jul 1, 2024 at 2:07 AM Nhat Pham <nphamcs@xxxxxxxxx> wrote:
> >
> > On Sun, Jun 30, 2024 at 10:58 AM Pedro Falcato <pedro.falcato@xxxxxxxxx> wrote:
> > >
> > > Hi everyone,
> >
> > Hi Pedro,
> >
>
> I have a separate theory. I also run the NVIDIA proprietary drivers.
> slabinfo -a shows us:
> [...]
> :0000080 <- zswap_entry Acpi-Parse kernfs_iattrs_cache
> uvm_tools_replay_data_t Acpi-State audit_tree_mark
> [...]
>
> See the uvm_tools_replay_data_t there? Yeah, it's entirely possible
> some random nvidia.ko bug has been corrupting zswap_entry from time to
> time (which explains why e.g the big server people have not seen
> this).

Oh this is fascinating. I would never have guessed this. Thanks for
doing the investigation!

> I'm not sure if Yuxuan is running the same driver, but their kernel is
> also proprietary-tainted.
>
> As such I'll refrain from posting more about this or similar bugs
> until I can get a guarantee it happens with a non-tainted kernel
> (fwiw, I have not seen crashes for 2 weeks or so, hopefully this issue
> is fixed).

Crossing my fingers :)

>
> Again, sorry for not checking the taint before posting this, and thank
> you for your time :)

No worries at all :) Let me know if there are new developments
regarding this issue, or if you are able to confirm one way or another
:)