On Wed, 6 Oct 2021 at 00:19, Dmitry Osipenko <digetx@xxxxxxxxx> wrote: > > 05.10.2021 16:10, Ulf Hansson пишет: > > On Sat, 2 Oct 2021 at 22:44, Dmitry Osipenko <digetx@xxxxxxxxx> wrote: > >> > >> 01.10.2021 15:32, Ulf Hansson пишет: > >>>> +static __maybe_unused int tegra_clock_pm_suspend(struct device *dev) > >>>> +{ > >>>> + struct tegra_clk_device *clk_dev = dev_get_drvdata(dev); > >>>> + > >>>> + /* > >>>> + * Power management of the clock is entangled with the Tegra PMC > >>>> + * GENPD because PMC driver enables/disables clocks for toggling > >>>> + * of the PD's on/off state. > >>>> + * > >>>> + * The PMC GENPD is resumed in NOIRQ phase, before RPM of the clocks > >>>> + * becomes available, hence PMC can't use clocks at the early resume > >>>> + * phase if RPM is involved. For example when 3d clock is enabled, > >>>> + * it may enable the parent PLL clock that needs to be RPM-resumed. > >>>> + * > >>>> + * Secondly, the PLL clocks may be enabled by the low level suspend > >>>> + * code, so we need to assume that PLL is in enabled state during > >>>> + * suspend. > >>>> + * > >>>> + * We will keep PLLs and system clock resumed during suspend time. > >>>> + * All PLLs on all SoCs are low power and system clock is always-on, > >>>> + * so practically not much is changed here. > >>>> + */ > >>>> + > >>>> + return clk_prepare(clk_dev->hw->clk); > >>> I am trying to understand, more exactly, what you intend to achieve > >>> with the clk_prepare() here. It looks a bit weird, to me. Can you try > >>> to elaborate a bit more on the use case? > >> > >> The Tegra GENPD driver enable/disable clocks when domain is turned on. > > > > Okay. I noticed that in tegra_genpd_power_on(). And the same clocks > > are enabled/disabled also in tegra_genpd_power_off(), when powering > > off the PM domain. > > > > So I guess the problem kind of exists for tegra_genpd_power_off() too? > > Both OFF/ON are affected by the same problem. If domain was already > turned OFF before genpd_suspend_noirq(), then the OFF problem isn't visible. > > I reproduced the OFF problem by removing the clk prepare/unprepare from > the suspend/resume of the clk driver and making some extra changes to > clock tree topology and etc to trigger the problem on Nexus 7. > > tegra-pmc 7000e400.pmc: failed to turn off PM domain heg: -13 > > I happens from genpd_suspend_noirq() -> tegra_genpd_power_off() -> clk > -> GENPD -> I2C -> runtime-pm. > > -13 is EACCES, it comes from the runtime PM of I2C device. RPM is > prohibited/disabled during late (NOIRQ) suspend by the drivers core. Thanks for the clarification! > > >> This can't be done during early system resume, when domains are getting > >> turned on by the drivers core, because when clock is enabled, it's > >> getting prepared (RPM-resumed) and this preparation fails because > >> performance state of the clock goes up and it doesn't work during the > >> early resume time since I2C, which applies the state to hardware, is > >> suspended and can't work at that early time. > > > > This sounds complicated and I still don't quite follow all of it, sorry. > > > > So, tegra_genpd_power_on() gets called from genpd_resume_noirq(), when > > the first device of the attached devices to genpd gets resumed. And > > vice versa for tegra_genpd_power_off() and genpd_suspend_noirq(). > > > > Are you saying that trying to enable/disable clocks from > > tegra_genpd_power_on|off() in these paths doesn't work, because it > > would also require the performance state to be changed, which would > > fail because the I2C bus/driver is suspended? > > Yes, but it's actually not I2C bus/driver that is suspended, it's > runtime PM that is unavailable during NOIRQ. The I2C driver itself is > suspended after domains are turned OFF and resumed before they are > enabled. It's just runtime PM API that is unavailable. I'm wondering if > this could be changed. In principle what you ask for, is if we can avoid calling __pm_runtime_disable() in __device_suspend_late() (and vice versa in device_resume_early()). I think the short answer is no, at least from a generic point of view. Maybe we can figure out a way to allow this on a per device basis, as an opt-in solution. I am not sure what Rafael would think about this, let's see. Another option to address the problem is already available to use for these kinds of cases. This would be to add also a pair of ->suspend|resume() callbacks to I2C driver. Along the lines of the below. diff --git a/drivers/i2c/busses/i2c-tegra.c b/drivers/i2c/busses/i2c-tegra.c index c883044715f3..589bf872ab25 100644 --- a/drivers/i2c/busses/i2c-tegra.c +++ b/drivers/i2c/busses/i2c-tegra.c @@ -1918,6 +1918,7 @@ static int __maybe_unused tegra_i2c_resume(struct device *dev) } static const struct dev_pm_ops tegra_i2c_pm = { + SET_SYSTEM_SLEEP_PM_OPS(pm_runtime_put_noidle, pm_runtime_get_sync) SET_NOIRQ_SYSTEM_SLEEP_PM_OPS(tegra_i2c_suspend, tegra_i2c_resume) SET_RUNTIME_PM_OPS(tegra_i2c_runtime_suspend, tegra_i2c_runtime_resume, NULL) In this way, the device would already be runtime resumed, if there is call to pm_runtime_get_sync() from the clock framework due to the clk_prepare|unprepare() being called. If that also turns out to happen *after* runtime PM has been disabled for the device, the call to pm_runtime_get_sync() would still succeed (returning 1, see rpm_resume()), rather than a negative error code. Yes, we may end up runtime resuming the device during system suspend, even if it turns out not to be needed. Although, that doesn't seem to be the case for the Tegra I2C driver, right? > > I'm also wondering if we could add some 'was_enabled' flag to GENPDs, > setting it by genpd_suspend_noirq() for the enabled domains, and then > powering-on GENPDs from genpd_resume_noirq() only if they were in the > enabled state during genpd_suspend_noirq() time. It actually puzzled me > for a quite long time why GENPD core enables domains unconditionally > during early resume. This should solve a part of the problem and it > makes suspend/resume a bit safer because there is a smaller chance to > crash hardware during suspend, at least it's easier to debug. Just because the PM domain was already off at genpd_suspend_noirq(), doesn't mean that it can stay powered off at genpd_resume_noirq(). At least as is today. The main reason why genpd_resume_noirq() powers on the PM domain, is because it's not possible for the consumer drivers to rely on runtime PM to do it (because runtime PM has been disabled by the PM core). > > >> Secondly, Tegra has arch-specific low level assembly which touches > >> clocks during last phase of system suspend and in the beginning of > >> resume. Hence, clocks should stay prepared during suspend just because > >> technically clock should be prepared before it can be enabled. > > > > So the low level code is gating and ungating the clock behind the back > > of the clock driver then? Why is that done like that, more exactly? > > I revisited that code again, and it shouldn't touch the clocks. > I changed that code to not toggle the clocks [1] sometime ago, but > forgot about it. > > [1] https://git.kernel.org/linus/680ae4452 > > >>> Is this rather about making sure that the clock's corresponding PM > >>> domain stays powered on during system suspend? In that case, I think > >>> there may be an alternative option.... > >>> > >> > >> This is not about domain staying powered on, this is about keeping the > >> performance state of the domain high during suspend. > > > > Right, so the PM domain managed in tegra_genpd_power_on|off() can > > still be powered on/off, as long as the clock remains ungated? > > Not ungated, but prepared. Okay, thanks for clarifying! In summary, it sounds like you should be able to fix this problem in the I2C driver as I suggested above. If that works, that seems much better. Moreover, it would leave the clocks gated/unprepared when the system is fully suspended, which I guess is better from an energy point of view? Kind regards Uffe