This is a note to let you know that I've just added the patch titled hwmon: (coretemp) Handle frozen hotplug state correctly to the 4.11-stable tree which can be found at: http://www.kernel.org/git/?p=linux/kernel/git/stable/stable-queue.git;a=summary The filename of the patch is: hwmon-coretemp-handle-frozen-hotplug-state-correctly.patch and it can be found in the queue-4.11 subdirectory. If you, or anyone else, feels it should not be added to the stable tree, please let <stable@xxxxxxxxxxxxxxx> know about it. >From 90b4f30b6d15222a509dacf47f29efef2b22571e Mon Sep 17 00:00:00 2001 From: Thomas Gleixner <tglx@xxxxxxxxxxxxx> Date: Wed, 10 May 2017 16:30:12 +0200 Subject: hwmon: (coretemp) Handle frozen hotplug state correctly From: Thomas Gleixner <tglx@xxxxxxxxxxxxx> commit 90b4f30b6d15222a509dacf47f29efef2b22571e upstream. The recent conversion to the hotplug state machine missed that the original hotplug notifiers did not execute in the frozen state, which is used on suspend on resume. This does not matter on single socket machines, but on multi socket systems this breaks when the device for a non-boot socket is removed when the last CPU of that socket is brought offline. The device removal locks up the machine hard w/o any debug output. Prevent executing the hotplug callbacks when cpuhp_tasks_frozen is true. Thanks to Tommi for providing debug information patiently while I failed to spot the obvious. Fixes: e00ca5df37ad ("hwmon: (coretemp) Convert to hotplug state machine") Reported-by: Tommi Rantala <tt.rantala@xxxxxxxxx> Tested-by: Tommi Rantala <tt.rantala@xxxxxxxxx> Signed-off-by: Thomas Gleixner <tglx@xxxxxxxxxxxxx> Signed-off-by: Guenter Roeck <linux@xxxxxxxxxxxx> Cc: "Chen, Yu C" <yu.c.chen@xxxxxxxxx> Signed-off-by: Greg Kroah-Hartman <gregkh@xxxxxxxxxxxxxxxxxxx> --- drivers/hwmon/coretemp.c | 14 ++++++++++++++ 1 file changed, 14 insertions(+) --- a/drivers/hwmon/coretemp.c +++ b/drivers/hwmon/coretemp.c @@ -605,6 +605,13 @@ static int coretemp_cpu_online(unsigned struct platform_data *pdata; /* + * Don't execute this on resume as the offline callback did + * not get executed on suspend. + */ + if (cpuhp_tasks_frozen) + return 0; + + /* * CPUID.06H.EAX[0] indicates whether the CPU has thermal * sensors. We check this bit only, all the early CPUs * without thermal sensors will be filtered out. @@ -654,6 +661,13 @@ static int coretemp_cpu_offline(unsigned struct temp_data *tdata; int indx, target; + /* + * Don't execute this on suspend as the device remove locks + * up the machine. + */ + if (cpuhp_tasks_frozen) + return 0; + /* If the physical CPU device does not exist, just return */ if (!pdev) return 0; Patches currently in stable-queue which might be from tglx@xxxxxxxxxxxxx are queue-4.11/efi-bgrt-skip-efi_bgrt_init-in-case-of-non-efi-boot.patch queue-4.11/hwmon-coretemp-handle-frozen-hotplug-state-correctly.patch queue-4.11/x86-microcode-intel-clear-patch-pointer-before-jettisoning-the-initrd.patch queue-4.11/efi-don-t-issue-error-message-when-booted-under-xen.patch queue-4.11/cpu-hotplug-drop-the-device-lock-on-error.patch queue-4.11/kthread-fix-use-after-free-if-kthread-fork-fails.patch queue-4.11/perf-core-drop-kernel-samples-even-though-u-is-specified.patch