Re: Many reports of laptops getting hot while suspended with kernels >= 5.16.10 || >= 5.17-rc1

On Thu, Mar 10, 2022 at 10:07 AM Hans de Goede <hdegoede@xxxxxxxxxx> wrote:
>
> Hi,
>
> On 3/9/22 19:27, Rafael J. Wysocki wrote:
> > On Wed, Mar 9, 2022 at 5:34 PM Rafael J. Wysocki <rafael@xxxxxxxxxx> wrote:
> >>
> >> On Wed, Mar 9, 2022 at 5:33 PM Hans de Goede <hdegoede@xxxxxxxxxx> wrote:
> >>>
> >>> Hi,
> >>>
> >>> On 3/9/22 14:57, Rafael J. Wysocki wrote:
> >>>> On Wed, Mar 9, 2022 at 2:44 PM Hans de Goede <hdegoede@xxxxxxxxxx> wrote:
> >>>>>
> >>>>> Hi Rafael,
> >>>>>
> >>>>> We (Fedora) have been receiving a whole bunch of bug reports about
> >>>>> laptops getting hot/toasty while suspended with kernels >= 5.16.10
> >>>>> and this seems to still happen with 5.17-rc7 too.
> >>>>>
> >>>>> The following are all bugzilla.redhat.com bug numbers:
> >>>>>
> >>>>>    1750910 - Laptop failed to suspend and completely drained the battery
> >>>>>    2050036 - Framework laptop: 5.16.5 breaks s2idle sleep
> >>>>>    2053957 - Package c-states never go below C2
> >>>>>    2056729 - No lid events when closing lid / laptop does not suspend
> >>>>>    2057909 - Thinkpad X1C 9th in s2idle suspend still draining battery to zero over night , Ap
> >>>>>    2059668 - HP Envy Laptop deadlocks on entering suspend power state when plugged in. Case ge
> >>>>>    2059688 - Dell G15 5510 s2idle fails in 5.16.11 works in 5.16.10
> >>>>>
> >>>>> And one of the bugs has also been mirrored at bugzilla.kernel.org by
> >>>>> the reporter:
> >>>>>
> >>>>>  bko215641 - Dell G15 5510 s2idle fails in 5.16.11 works in 5.16.10
> >>>>>
> >>>>> The common denominator here (besides the kernel version) seems to
> >>>>> be that these are all Ice or Tiger Lake systems (I did not
> >>>>> check that this applies 100% to all bugs, but it does seem to be a pattern).
> >>>>>
> >>>>> A similar arch-linux report:
> >>>>>
> >>>>> https://bbs.archlinux.org/viewtopic.php?id=274292&p=2
> >>>>>
> >>>>> suggests that reverting
> >>>>> "ACPI: PM: s2idle: Cancel wakeup before dispatching EC GPE"
> >>>>>
> >>>>> which was cherry-picked into 5.16.10 fixes things.
> >>>>
> >>>> Thanks for letting me know!
> >>>>
> >>>>> If you want I can create Fedora kernel test-rpms of a recent
> >>>>> 5.16.y with just that one commit reverted and ask users to
> >>>>> confirm if that helps. Please let me know if doing that would
> >>>>> be useful?
> >>>>
> >>>> Yes, it would.
> >>>>
> >>>> However, it follows from the arch-linux report linked above that
> >>>> 5.17-rc is fine, so it would be good to also check if reverting that
> >>>> commit from 5.17-rc helps.
> >>>
> >>> Ok, I've done Fedora kernel builds of both 5.16.13 and 5.17-rc7 with
> >>> the patch reverted and asked the bug-reporters to test both.
> >>
> >> Thanks!
> >
> > Also, in the cases where people have not tested 5.17-rc7 without any
> > reverts, it would be good to ask them to do so.
>
> Ok, done.
>
> > I have received another report related to this issue where the problem
> > is not present in 5.17-rc7 (see
> > https://lore.kernel.org/linux-pm/CAJZ5v0hKXyTtb1Jk=wqNV9_mZKdf3mmwF4bPOcmADyNnTkpMbQ@xxxxxxxxxxxxxx/).
>
> The first results from the Fedora test kernel builds are in:
>
> "HP Envy Laptop deadlocks on entering suspend power state when plugged in. Case gets very hot and requires a power button hold to restart"
> https://bugzilla.redhat.com/show_bug.cgi?id=2059668
>
> 5.16.9: good
> 5.16.10+: bad
> 5.16.13 with "ACPI: PM: s2idle: Cancel wakeup before dispatching EC GPE" reverted: good
> 5.17-rc7 with "ACPI: PM: s2idle: Cancel wakeup before dispatching EC GPE" reverted: good
> 5.17-rc7 (plain): good
>
> So this seems to match the arch-linux report and the email report
> you linked. There is a problem with the backport in 5.16.10+,
> while 5.17-rc7 is fine.
>
> > It is likely that the commit in question actually depends on some
> > other commits that were not backported into 5.16.y.
> I was thinking the same thing, but I've no idea which commits
> that would be.

I do have an idea, but regardless of this, IMO the least risky way
forward would be to request "stable" to drop "ACPI: PM: s2idle: Cancel
wakeup before dispatching EC GPE", which has been backported because
it carried a Fixes tag and not because it was marked for "stable".

Let me do that.


