On Mon, 2019-11-18 at 17:00 -0500, Johannes Weiner wrote: > Hi Qian, > > On Mon, Nov 18, 2019 at 04:39:19PM -0500, Qian Cai wrote: > > Since a few days ago, s390 starts to crash on linux-next while reading some > > sysfs. It is not always reproducible but seems pretty reproducible after running > > the whole MM test suite here, > > https://github.com/cailca/linux-mm/blob/master/test.sh > > > > the config: > > https://raw.githubusercontent.com/cailca/linux-mm/master/s390.config > > > > The stack trace on s390 is not particular helpful as both gdb and faddr2line are > > unable to point out which line causes the issue. > > > > # ./scripts/faddr2line vmlinux collect_percpu_times+0x2d6/0x798 > > bad symbol size: base: 0x00000000002076f8 end: 0x00000000002076f8 > > > > (gdb) list *(collect_percpu_times+0x2d6) > > 0x2079ce is in collect_percpu_times (./include/linux/compiler.h:199). > > 194 }) > > 195 > > 196 static __always_inline > > 197 void __read_once_size(const volatile void *p, void *res, int size) > > 198 { > > 199 __READ_ONCE_SIZE; > > 200 } > > 201 > > 202 #ifdef CONFIG_KASAN > > 203 /* > > > > Could it be some race conditions in PSI? > > psi doesn't do much lifetime management in itself: the psi_group is > embedded in the cgroup and the per-cpu data is freed right before the > cgroup itself is freed. An open file descriptor on the pressure files > will pin the cgroup and prevent it from being deleted. > > As it's reproducible, would you be able to bisect this problem? Yes, it is going to time-consuming though as I have not found a quick reproducer yet.