Hi Suzuki, On Thursday 30 Jan 2020 at 15:04:27 (+0000), Suzuki Kuruppassery Poulose wrote: > Hi Ionela, > > On 18/12/2019 18:26, Ionela Voinescu wrote: > > The activity monitors extension is an optional extension introduced > > by the ARMv8.4 CPU architecture. > > > > Add initial documentation for the AMUv1 extension: > > - arm64/amu.txt: AMUv1 documentation > > - arm64/booting.txt: system registers initialisation > > - arm64/cpu-feature-registers.txt: visibility to userspace > > We have stopped adding "invisible" fields to the list. So, you > can drop the changes to cpu-feature-registers.txt. > > > > > Signed-off-by: Ionela Voinescu <ionela.voinescu@xxxxxxx> > > Cc: Catalin Marinas <catalin.marinas@xxxxxxx> > > Cc: Will Deacon <will@xxxxxxxxxx> > > Cc: Jonathan Corbet <corbet@xxxxxxx> > > --- > > Documentation/arm64/amu.rst | 107 ++++++++++++++++++ > > Documentation/arm64/booting.rst | 14 +++ > > Documentation/arm64/cpu-feature-registers.rst | 2 + > > Documentation/arm64/index.rst | 1 + > > 4 files changed, 124 insertions(+) > > create mode 100644 Documentation/arm64/amu.rst > > > > diff --git a/Documentation/arm64/amu.rst b/Documentation/arm64/amu.rst > > new file mode 100644 > > index 000000000000..62a6635522e1 > > --- /dev/null > > +++ b/Documentation/arm64/amu.rst > > @@ -0,0 +1,107 @@ > > +======================================================= > > +Activity Monitors Unit (AMU) extension in AArch64 Linux > > +======================================================= > > + > > +Author: Ionela Voinescu <ionela.voinescu@xxxxxxx> > > + > > +Date: 2019-09-10 > > + > > +This document briefly describes the provision of Activity Monitors Unit > > +support in AArch64 Linux. > > + > > + > > +Architecture overview > > +--------------------- > > + > > +The activity monitors extension is an optional extension introduced by the > > +ARMv8.4 CPU architecture. > > + > > +The activity monitors unit, implemented in each CPU, provides performance > > +counters intended for system management use. The AMU extension provides a > > +system register interface to the counter registers and also supports an > > +optional external memory-mapped interface. > > + > > +Version 1 of the Activity Monitors architecture implements a counter group > > +of four fixed and architecturally defined 64-bit event counters. > > + - CPU cycle counter: increments at the frequency of the CPU. > > + - Constant counter: increments at the fixed frequency of the system > > + clock. > > + - Instructions retired: increments with every architecturally executed > > + instruction. > > + - Memory stall cycles: counts instruction dispatch stall cycles caused by > > + misses in the last level cache within the clock domain. > > + > > +When in WFI or WFE these counters do not increment. > > + > > +The Activity Monitors architecture provides space for up to 16 architected > > +event counters. Future versions of the architecture may use this space to > > +implement additional architected event counters. > > + > > +Additionally, version 1 implements a counter group of up to 16 auxiliary > > +64-bit event counters. > > + > > +On cold reset all counters reset to 0. > > + > > + > > +Basic support > > +------------- > > + > > +The kernel can safely run a mix of CPUs with and without support for the > > +activity monitors extension. > > > Therefore, when CONFIG_ARM64_AMU_EXTN is > > +selected we unconditionally enable the capability to allow any late CPU > > +(secondary or hotplugged) to detect and use the feature. > > + > > +When the feature is detected on a CPU, a per-CPU variable (amu_feat) is > > +set, but this does not guarantee the correct functionality of the > > +counters, only the presence of the extension. > > nit: I would rather omit the implementation details (esp variable names) > in the documentation. It may become a pain to keep this in sync with the > code changes. You could simply mention, "we keep track of the availability > of the feature" per CPU. If someone wants to figure out > how, they can always read the code. > > > + > > +Firmware (code running at higher exception levels, e.g. arm-tf) support is > > +needed to: > > + - Enable access for lower exception levels (EL2 and EL1) to the AMU > > + registers. > > + - Enable the counters. If not enabled these will read as 0. > > + - Save/restore the counters before/after the CPU is being put/brought up > > + from the 'off' power state. > > + > > +When using kernels that have this configuration enabled but boot with > > +broken firmware the user may experience panics or lockups when accessing > > +the counter registers. Even if these symptoms are not observed, the > > +values returned by the register reads might not correctly reflect reality. > > +Most commonly, the counters will read as 0, indicating that they are not > > +enabled. If proper support is not provided in firmware it's best to disable > > +CONFIG_ARM64_AMU_EXTN. > > For the sake of one kernel runs everywhere, do we need some other > mechanism to disable the AMU. e.g kernel parameter to disable amu > at runtime ? > The reason I've not added this is twofold: - Even if we add this, it should be in order to disable the use of the counters for a certain purpose, in this case frequency invariance. On its own AMU provides the counters but it does not mandate their use. - I could add something to disable the use of the core and cycle counters for frequency invariance at runtime, but I doubt that anyone would use it. Logically it makes sense to use the counters order to have a more accurate view of the performance that the CPUs are actually providing. Therefore, until anyone asks for this, I thought it's better to keep it simple and not add extra switches, until there is a use for them. Does it make sense? P.S. I'll make all the other changes you've suggested in v3. Thank you, Ionela. > > diff --git a/Documentation/arm64/booting.rst b/Documentation/arm64/booting.rst > > index 5d78a6f5b0ae..a3f1a47b6f1c 100644 > > --- a/Documentation/arm64/booting.rst > > +++ b/Documentation/arm64/booting.rst > > @@ -248,6 +248,20 @@ Before jumping into the kernel, the following conditions must be met: > > - HCR_EL2.APK (bit 40) must be initialised to 0b1 > > - HCR_EL2.API (bit 41) must be initialised to 0b1 > > + For CPUs with Activity Monitors Unit v1 (AMUv1) extension present: > > + - If EL3 is present: > > + CPTR_EL3.TAM (bit 30) must be initialised to 0b0 > > + CPTR_EL2.TAM (bit 30) must be initialised to 0b0 > > + AMCNTENSET0_EL0 must be initialised to 0b1111 > > + AMCNTENSET1_EL0 must be initialised to a platform specific value > > + having 0b1 set for the corresponding bit for each of the auxiliary > > + counters present. > > + - If the kernel is entered at EL1: > > + AMCNTENSET0_EL0 must be initialised to 0b1111 > > + AMCNTENSET1_EL0 must be initialised to a platform specific value > > + having 0b1 set for the corresponding bit for each of the auxiliary > > + counters present. > > + > > The requirements described above for CPU mode, caches, MMUs, architected > > timers, coherency and system registers apply to all CPUs. All CPUs must > > enter the kernel in the same exception level. > > diff --git a/Documentation/arm64/cpu-feature-registers.rst b/Documentation/arm64/cpu-feature-registers.rst > > index b6e44884e3ad..4770ae54032b 100644 > > --- a/Documentation/arm64/cpu-feature-registers.rst > > +++ b/Documentation/arm64/cpu-feature-registers.rst > > @@ -150,6 +150,8 @@ infrastructure: > > +------------------------------+---------+---------+ > > | DIT | [51-48] | y | > > +------------------------------+---------+---------+ > > + | AMU | [47-44] | n | > > + +------------------------------+---------+---------+ > > As mentioned above, please drop it. > > > Suzuki