I bisected a regression on alpha to f2f84b05e02b (bug: consolidate warn_slowpath_fmt() usage) which looks totally innocuous. Reverting it on master confirms that it somehow is the trigger. At or a little after starting userspace, I'll see an oops like this: Unable to handle kernel paging request at virtual address 0000000000000000 CPU 0 kworker/u2:5(98): Oops -1 pc = [<0000000000000000>] ra = [<0000000000000000>] ps = 0000 Not tainted pc is at 0x0 ra is at 0x0 v0 = 0000000000000007 t0 = 0000000000000001 t1 = 0000000000000001 t2 = 0000000000000000 t3 = fffffc00bfe68780 t4 = 0000000000000001 t5 = fffffc00bf8cc780 t6 = 00000000026f8000 t7 = fffffc00bfe70000 s0 = fffffc000250d310 s1 = fffffc000250d310 s2 = fffffc000250d310 s3 = fffffc000250ca40 s4 = fffffc000250caa0 s5 = 0000000000000000 s6 = fffffc000250ca40 a0 = fffffc00024f0488 a1 = fffffc00bfe73d98 a2 = fffffc00bfe68800 a3 = fffffc00bf881400 a4 = 0001000000000000 a5 = 0000000000000002 t8 = 0000000000000000 t9 = 0000000000000000 t10= 0000000001321800 t11= 000000000000ba4e pv = fffffc000189ca00 at = 0000000000000000 gp = fffffc000253e430 sp = 0000000043a83c2e Disabling lock debugging due to kernel taint Trace: [<fffffc000105c8ac>] process_one_work+0x25c/0x5a0 [<fffffc000105cc4c>] worker_thread+0x5c/0x7d0 [<fffffc0001066c88>] kthread+0x188/0x1f0 [<fffffc0001011b48>] ret_from_kernel_thread+0x18/0x20 [<fffffc0001066b00>] kthread+0x0/0x1f0 [<fffffc000105cbf0>] worker_thread+0x0/0x7d0 Code: 00000000 00000000 00063301 000012e2 00001111 0005ffde It seems to cause a hard lock on an SMP system, but not on a system with a single CPU. Similarly, if I boot the SMP system (2 CPUs) with maxcpus=1 the oops doesn't happen. Until I tested on a non-SMP system today I suspected that it was unaffected, but I saw the oops there too. With the revert applied, I don't see a warning or an oops. Any clues how this patch could have triggered the oops? Here's the revert, with a trivial conflict resolved, that I've used in testing: From fdbdd0f606f0f412ee06c1152e33a22ca17102bc Mon Sep 17 00:00:00 2001 From: Matt Turner <mattst88@xxxxxxxxx> Date: Sun, 24 May 2020 20:46:00 -0700 Subject: [PATCH] Revert "bug: consolidate warn_slowpath_fmt() usage" This reverts commit f2f84b05e02b7710a201f0017b3272ad7ef703d1. --- include/asm-generic/bug.h | 3 ++- kernel/panic.c | 15 +++++++-------- 2 files changed, 9 insertions(+), 9 deletions(-) diff --git a/include/asm-generic/bug.h b/include/asm-generic/bug.h index 384b5c835ced..a4a311d4b4b0 100644 --- a/include/asm-generic/bug.h +++ b/include/asm-generic/bug.h @@ -82,7 +82,8 @@ struct bug_entry { extern __printf(4, 5) void warn_slowpath_fmt(const char *file, const int line, unsigned taint, const char *fmt, ...); -#define __WARN() __WARN_printf(TAINT_WARN, NULL) +extern void warn_slowpath_null(const char *file, const int line); +#define __WARN() warn_slowpath_null(__FILE__, __LINE__) #define __WARN_printf(taint, arg...) \ warn_slowpath_fmt(__FILE__, __LINE__, taint, arg) #else diff --git a/kernel/panic.c b/kernel/panic.c index b69ee9e76cb2..c8ed8046b484 100644 --- a/kernel/panic.c +++ b/kernel/panic.c @@ -603,20 +603,19 @@ void warn_slowpath_fmt(const char *file, int line, unsigned taint, { struct warn_args args;- pr_warn(CUT_HERE);
- - if (!fmt) { - __warn(file, line, __builtin_return_address(0), taint, - NULL, NULL); - return; - } - args.fmt = fmt; va_start(args.args, fmt); __warn(file, line, __builtin_return_address(0), taint, NULL, &args); va_end(args.args); } EXPORT_SYMBOL(warn_slowpath_fmt); + +void warn_slowpath_null(const char *file, int line) +{ + pr_warn(CUT_HERE); + __warn(file, line, __builtin_return_address(0), TAINT_WARN, NULL, NULL); +} +EXPORT_SYMBOL(warn_slowpath_null); #else void __warn_printk(const char *fmt, ...) { -- 2.26.2
Attachment:
signature.asc
Description: PGP signature