On Tue, Jan 23, 2024 at 06:53:46PM +0100, Konrad Dybcio wrote: > On 1/23/24 16:59, Johan Hovold wrote: > > On Wed, Jan 03, 2024 at 01:36:08PM +0100, Konrad Dybcio wrote: > >> Commit b43f7ddc2b7a ("power: supply: qcom_battmgr: Register the power > >> supplies after PDR is up") moved the devm_power_supply_register() calls > >> so that the power supply devices are not registered before we go through > >> the entire initialization sequence (power up the ADSP remote processor, > >> wait for it to come online, coordinate with userspace..). > >> > >> Some firmware versions (e.g. on SM8550) seem to leave battmgr at least > >> partly initialized when exiting the bootloader and loading Linux. Check > >> if the power supply devices are registered before consuming the battmgr > >> notifications. > >> + if (!battmgr->bat_psy) > >> + return; > > > > This is not a proper fix. You register 3-4 class devices and only check > > one. Even if your checked the last one, there's no locking or barriers > > in place to prevent this from breaking. > > > > Deferred registration of the class devices also risks missing > > notifications as you'll be spending time on registration after the > > service has gone live. > > > > I'm sure all of this can be handled but as it is non-trivial and the > > motivation for the offending commit is questionable to begin with, I > > suggest reverting for now. > > > > I'll send a revert for Sebastian to consider. > > What you're saying is valid, but a "battery" device is always expected > to be present. Yes, but that's not the point. battmgr->bat_psy is the first class device pointer to be initialised, but that being set does not mean that the other pointers are not still NULL when you hit this callback. > If devm_power_supply_register fails, things would go very > south very fast anyway. Eh, no. Before the offending commit, if registration fails, we bail out from probe() before registering the PMIC GLINK client (and callbacks) so all is good. That is no longer the case since b43f7ddc2b7a ("power: supply: qcom_battmgr: Register the power supplies after PDR is up") which happily ignores errors and could theoretically result in all but the first class device being registered leading to further NULL derefs on notifications. I could have pointed this out in the commit message for the revert. > I personally don't see this being a terribly bad fix, but I'm open to > different propositions. It's not a correct fix, only a band-aid that papers over the immediate issue, I'm afraid. Let's revert and if you care deeply about this you can possibly propose a complete patch that addresses the above issues, even if I'm more inclined to leave things as they were and not spend more time on this. Johan