On Thu, Dec 19, 2019 at 11:02:40PM +0100, Ulf Hansson wrote:
> On Thu, 19 Dec 2019 at 19:06, Sudeep Holla <sudeep.holla@xxxxxxx> wrote:
> >
> > On Thu, Dec 19, 2019 at 04:48:39PM +0100, Ulf Hansson wrote:
> > > On Thu, 19 Dec 2019 at 15:34, Sudeep Holla <sudeep.holla@xxxxxxx> wrote:
> > > >
> > > > On Wed, Dec 11, 2019 at 04:43:42PM +0100, Ulf Hansson wrote:
> > > > > When the hierarchical CPU topology layout is used in DT and the PSCI OSI
> > > > > mode is supported by the PSCI FW, let's initialize a corresponding PM
> > > > > domain topology by using genpd. This enables a CPU and a group of CPUs,
> > > > > when attached to the topology, to be power-managed accordingly.
> > > > >
> > > > > To trigger the attempt to initialize the genpd data structures let's use a
> > > > > subsys_initcall, which should be early enough to allow CPUs, but also other
> > > > > devices to be attached.
> > > > >
> > > > > The initialization consists of parsing the PSCI OF node for the topology
> > > > > and the "domain idle states" DT bindings. In case the idle states are
> > > > > compatible with "domain-idle-state", the initialized genpd becomes
> > > > > responsible of selecting an idle state for the PM domain, via assigning it
> > > > > a genpd governor.
> > > > >
> > > > > Note that, a successful initialization of the genpd data structures, is
> > > > > followed by a call to psci_set_osi_mode(), as to try to enable the OSI mode
> > > > > in the PSCI FW. In case this fails, we fall back into a degraded mode
> > > > > rather than bailing out and returning an error code.
> > > > >
> > > > > Co-developed-by: Lina Iyer <lina.iyer@xxxxxxxxxx>
> > > > > Signed-off-by: Lina Iyer <lina.iyer@xxxxxxxxxx>
> > > > > Signed-off-by: Ulf Hansson <ulf.hansson@xxxxxxxxxx>
> > > > > ---
> > > > >
> > > > > Changes in v4:
> > > > > - None.
> > > > >
> > > > > ---
> > > > >  drivers/cpuidle/cpuidle-psci-domain.c | 267 ++++++++++++++++++++++++++
> > > > >  drivers/cpuidle/cpuidle-psci.c        |   4 +-
> > > > >  drivers/cpuidle/cpuidle-psci.h        |   5 +
> > > > >  3 files changed, 274 insertions(+), 2 deletions(-)
> > > > >
> > > > > diff --git a/drivers/cpuidle/cpuidle-psci-domain.c b/drivers/cpuidle/cpuidle-psci-domain.c
> > > > > index 656ef3d59149..c2f94ba42222 100644
> > > > > --- a/drivers/cpuidle/cpuidle-psci-domain.c
> > > > > +++ b/drivers/cpuidle/cpuidle-psci-domain.c
> > > > > @@ -7,14 +7,281 @@
> > > > >   *
> > > > >   */
> > > > >
> > > > > +#define pr_fmt(fmt) "CPUidle PSCI: " fmt
> > > > > +
> > > > >  #include <linux/cpu.h>
> > > > >  #include <linux/device.h>
> > > > >  #include <linux/kernel.h>
> > > > >  #include <linux/pm_domain.h>
> > > > >  #include <linux/pm_runtime.h>
> > > > > +#include <linux/psci.h>
> > > > > +#include <linux/slab.h>
> > > > > +#include <linux/string.h>
> > > > >
> > > > >  #include "cpuidle-psci.h"
> > > > >
> > > > > +struct psci_pd_provider {
> > > > > +	struct list_head link;
> > > > > +	struct device_node *node;
> > > > > +};
> > > > > +
> > > > > +static LIST_HEAD(psci_pd_providers);
> > > > > +static bool osi_mode_enabled;
> > > > > +
> > > > > +static int psci_pd_power_off(struct generic_pm_domain *pd)
> > > > > +{
> > > > > +	struct genpd_power_state *state = &pd->states[pd->state_idx];
> > > > > +	u32 *pd_state;
> > > > > +
> > > > > +	/* If we have failed to enable OSI mode, then abort power off. */
> > > > > +	if (!osi_mode_enabled)
> > > > > +		return -EBUSY;
> > > > > +
> > > >
> > > > Why is above check needed ? Shouldn't we have disable/remove pd of
> > > > OSI is not enabled ?
> > >
> > > Well, failing to enable OSI should in practice not happen, while it
> > > theoretically it could.
> > >
> >
> > I won't assume that. Since it's new and not tested yet, I prefer to assume
> > it can fail.
>
> Yes, I agree. Hence the degraded mode.
>
> >
> > > My approach to this has been to fall back to use a "degraded mode",
> > > which seems quite common for these kind of situations. The degraded
> > > mode means, we are preventing domain states from being used.
> > >
> >
> > But why can't we just fail registering or remove if already added.
>
> We can, but there are more problems with that than leaving this in a
> degraded mode, I think. See more below.
>
> > They are useless for "degraded mode" anyways. And it will ensure that
> > data->dev is NULL. Sorry now I see why you said it can be NULL but I
> > would rather not leave those unused genpd in place in case of error.
>
> data->dev would not be NULL in this case, because the
> dev_pm_domain_attach_by_name() which is called when we attach the CPU
> is going to return an error code, not NULL.
>
> That's because the connection is there in the DTB and thus it must
> fail, in this case it would be with -EPROBE_DEFER (waiting for a genpd
> provider to be registered).
>
> That would then lead to that the entire cpuidle-psci driver would fail
> to initiate/probe. In my opinion, I think it's better to fall back
> into a degraded mode, using all the idle states for the CPUs, but just
> preventing the cluster idle states.
>
> Just wanted to make this more clear for you to consider. I am happy to
> change in any way you suggest, but please confirm that you really want
> another behaviour than the degraded mode.
>

Sorry, but if setting OSI mode failed in firmware, it will be operating in the
default/PC mode and I *don't* want to create genpd for that. It's confusing.
Even if you don't create all these genpd domains, it is still a degraded mode
and we are not changing that anyway. Let me know if my understanding is wrong
here.

I am sure the DTB may get copied to a different platform where the firmware
does not support OSI. I know we have logs, but creating those genpd domains and
leaving them unused will just be confusing. Please change that.

--
Regards,
Sudeep
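
To make the requested behaviour concrete, here is a rough userspace sketch of
the "unwind on failure" flow: register the domains, try to enable OSI, and
remove the domains again if the firmware refuses. It is only a model under
stated assumptions, not kernel code. Every name in it (mock_domain,
mock_set_osi_mode(), mock_init_domains(), and so on) is hypothetical and does
not appear in the patch; the real code would use the genpd and PSCI interfaces
instead.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>

/* Hypothetical stand-in for a registered PM domain; not a kernel type. */
struct mock_domain {
	const char *name;
	bool registered;
};

static struct mock_domain mock_domains[] = {
	{ "cpu-pd0", false },
	{ "cpu-pd1", false },
	{ "cluster-pd", false },
};

#define MOCK_NUM_DOMAINS (sizeof(mock_domains) / sizeof(mock_domains[0]))

/* Count how many mock domains are currently registered. */
static size_t count_registered(void)
{
	size_t i, n = 0;

	for (i = 0; i < MOCK_NUM_DOMAINS; i++)
		if (mock_domains[i].registered)
			n++;
	return n;
}

/* Unregister every mock domain, leaving nothing behind. */
static void remove_all_domains(void)
{
	size_t i;

	for (i = 0; i < MOCK_NUM_DOMAINS; i++)
		mock_domains[i].registered = false;
}

/*
 * Hypothetical model of the firmware call: 0 on success, negative on
 * failure. The parameter simulates whether the firmware supports OSI.
 */
static int mock_set_osi_mode(bool fw_supports_osi)
{
	return fw_supports_osi ? 0 : -1;
}

/*
 * Register all domains, then try to enable OSI. If that fails, remove the
 * domains again and report the error instead of keeping them around unused.
 */
static int mock_init_domains(bool fw_supports_osi)
{
	size_t i;
	int ret;

	/* Precondition of this sketch: start from a clean state. */
	assert(count_registered() == 0);

	for (i = 0; i < MOCK_NUM_DOMAINS; i++)
		mock_domains[i].registered = true;

	ret = mock_set_osi_mode(fw_supports_osi);
	if (ret) {
		remove_all_domains();
		return ret;
	}

	return 0;
}
```

With this shape, mock_init_domains(true) leaves all three domains registered,
while mock_init_domains(false) returns an error and leaves none. How the CPU
attach path should then avoid waiting for a provider that never appears is a
separate question that this sketch does not model.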