On Thursday 30 Jan 2020 at 18:26:53 (+0000), Suzuki K Poulose wrote: [..] > > > > +Firmware (code running at higher exception levels, e.g. arm-tf) support is > > > > +needed to: > > > > + - Enable access for lower exception levels (EL2 and EL1) to the AMU > > > > + registers. > > > > + - Enable the counters. If not enabled these will read as 0. > > > > + - Save/restore the counters before/after the CPU is being put/brought up > > > > + from the 'off' power state. > > > > + > > > > +When using kernels that have this configuration enabled but boot with > > > > +broken firmware the user may experience panics or lockups when accessing > > > > +the counter registers. Even if these symptoms are not observed, the > > > > +values returned by the register reads might not correctly reflect reality. > > > > +Most commonly, the counters will read as 0, indicating that they are not > > > > +enabled. If proper support is not provided in firmware it's best to disable > > > > +CONFIG_ARM64_AMU_EXTN. > > > > > > For the sake of one kernel runs everywhere, do we need some other > > > mechanism to disable the AMU. e.g kernel parameter to disable amu > > > at runtime ? > > > > > > > The reason I've not added this is twofold: > > - Even if we add this, it should be in order to disable the use of the > > counters for a certain purpose, in this case frequency invariance. > > On its own AMU provides the counters but it does not mandate their > > use. > > - I could add something to disable the use of the core and cycle > > counters for frequency invariance at runtime, but I doubt that > > anyone would use it. Logically it makes sense to use the counters > > order to have a more accurate view of the performance that the CPUs > > are actually providing. Therefore, until anyone asks for this, I > > thought it's better to keep it simple and not add extra switches, > > until there is a use for them. > > > > Does it make sense? > > The comment is about addressing someone who must run an "AMU" enabled > kernel ("one kernel") on a system with potentially "broken firmware", > where there is no option to use the system as you mention above, > the firmware could panic. How common is the "broken firmware" ? > Right now there is no way to ensure "firmware" is sane and if > someone detects that firmware is broken, there is no way to > disable the AMU if they are running a standard distro kernel. > A kernel parameter could prevent the AMU capability from > being detected on a broken system and thus make it usable > (without the AMU of course). Now, if the "broken firmware" > is extremely rare, we could simply ignore this case and > ignore the suggestion. > > Suzuki > > Sorry Suzuki, I initially interpreted the question independently from the context and only thought about cases where they are working correctly but users might want to disable the use of them. In this case, I don't see any harm in adding a command line parameter to disable the use of the unit, even if it's only to support firmware that does not support AMU at all, rather than the implementation being broken. I'm not really sure how common bad firmware would be. I suppose that firmware as bad as to cause firmware panics and lockups would be quite rare, but scenarios where firmware might not properly support AMU and result in kernel lockups could be more often, and this would handle both. Thank you, Ionela.