On 25/04/16 12:50, Huang, Tao wrote: > Hi, Marc: > On 2016?04?25? 18:39, Marc Zyngier wrote: >> On 25/04/16 11:06, Marc Zyngier wrote: >>> On 25/04/16 10:48, Huang, Tao wrote: >>>> Hi, Marc: >>>> On 2016?04?21? 19:30, Marc Zyngier wrote: >>>>> On Thu, 21 Apr 2016 18:47:20 +0800 >>>>> "Huang, Tao" <huangtao at rock-chips.com> wrote: >>>>> >>>>>> Hi, Mark: >>>>>> On 2016?04?21? 18:19, Mark Rutland wrote: >>>>>>> On Thu, Apr 21, 2016 at 11:58:12AM +0800, Jianqun Xu wrote: >>>>>>>> + cpu_l0: cpu at 0 { >>>>>>>> + device_type = "cpu"; >>>>>>>> + compatible = "arm,cortex-a53", "arm,armv8"; >>>>>>>> + reg = <0x0 0x0>; >>>>>>>> + enable-method = "psci"; >>>>>>>> + #cooling-cells = <2>; /* min followed by max */ >>>>>>>> + clocks = <&cru ARMCLKL>; >>>>>>>> + }; >>>>>>>> + cpu_b0: cpu at 100 { >>>>>>>> + device_type = "cpu"; >>>>>>>> + compatible = "arm,cortex-a72", "arm,armv8"; >>>>>>>> + reg = <0x0 0x100>; >>>>>>>> + enable-method = "psci"; >>>>>>>> + #cooling-cells = <2>; /* min followed by max */ >>>>>>>> + clocks = <&cru ARMCLKB>; >>>>>>>> + }; >>>>>>>> + >>>>>>>> + arm-pmu { >>>>>>>> + compatible = "arm,armv8-pmuv3"; >>>>>>>> + interrupts = <GIC_PPI 7 IRQ_TYPE_LEVEL_LOW>; >>>>>>>> + }; >>>>>>> This is wrong, and must go. There should be a separate node for the PMU >>>>>>> of each microarchitecture, with the appropriate compatible string to >>>>>>> represent that (see the juno dts). >>>>>> You are right. The first version we wrote is: >>>>>> pmu_a53 { >>>>>> compatible = "arm,cortex-a53-pmu"; >>>>>> interrupts = <GIC_PPI 7 IRQ_TYPE_LEVEL_LOW>; >>>>>> interrupt-affinity = <&cpu_l0>, >>>>>> <&cpu_l1>, >>>>>> <&cpu_l2>, >>>>>> <&cpu_l3>; >>>>>> }; >>>>>> >>>>>> pmu_a72 { >>>>>> compatible = "arm,cortex-a72-pmu"; >>>>>> interrupts = <GIC_PPI 7 IRQ_TYPE_LEVEL_LOW>; >>>>>> interrupt-affinity = <&cpu_b0>, >>>>>> <&cpu_b1>; >>>>>> }; >>>>>> but unfortunately, the arm pmu driver do not support PPI in two cluster >>>>>> well, >>>>>> so we have to replace with this implementation. >>>>>>> In this case things are messier as the same PPI number is being used >>>>>>> across clusters. Marc (Cc'd) has been working on PPI partitions, which >>>>>>> should allow us to support that. >>>>>> Great! So what we can do right now? Wait this feature, and delete >>>>>> arm-pmu node? >>>>> I'd rather you have a look at the patches, test them with your HW, >>>>> and comment on what doesn't work! >>>>> >>>>> You can find the patches over there: >>>>> >>>>> https://lkml.org/lkml/2016/4/11/182 >>>>> >>>>> and on the following branch: >>>>> >>>>> git://git.kernel.org/pub/scm/linux/kernel/git/maz/arm-platforms.git >>>>> irq/percpu-partition >>>> I tested these patches. Because our kernel is based on v4.4, so I back >>>> port most changes about >>>> include/linux/irqdomain.h >>>> kernel/irq/irqdomain.c >>>> drivers/irqchip/irq-gic-v3.c >>>> and change rk3399.dtsi base on your arm,gic-v3.txt: >>>> >>>> gic: interrupt-controller at fee00000 { >>>> compatible = "arm,gic-v3"; >>>> - #interrupt-cells = <3>; >>>> + #interrupt-cells = <4>; >>>> #address-cells = <2>; >>>> #size-cells = <2>; >>>> ... >>>> + >>>> + ppi-partitions { >>>> + part0: interrupt-partition-0 { >>>> + affinity = <&cpu_l0 &cpu_l1 &cpu_l2 &cpu_l3>; >>>> + }; >>>> + >>>> + part1: interrupt-partition-1 { >>>> + affinity = <&cpu_b0 &cpu_b1>; >>>> + }; >>>> + }; >>>> >>>> and change every interrupts from three cells to four cells, such as >>>> saradc: saradc at ff100000 { >>>> compatible = "rockchip,rk3399-saradc"; >>>> reg = <0x0 0xff100000 0x0 0x100>; >>>> - interrupts = <GIC_SPI 62 IRQ_TYPE_LEVEL_HIGH>; >>>> + interrupts = <GIC_SPI 62 IRQ_TYPE_LEVEL_HIGH 0>; >>>> #io-channel-cells = <1>; >>>> clocks = <&cru SCLK_SARADC>, <&cru PCLK_SARADC>; >>>> clock-names = "saradc", "apb_pclk"; >>>> >>>> and pmu define as: >>>> pmu_a53 { >>>> compatible = "arm,cortex-a53-pmu"; >>>> interrupts = <GIC_PPI 7 IRQ_TYPE_LEVEL_LOW &part0>; >>>> interrupt-affinity = <&cpu_l0>, >>>> <&cpu_l1>, >>>> <&cpu_l2>, >>>> <&cpu_l3>; >>>> }; >>>> >>>> pmu_a72 { >>>> compatible = "arm,cortex-a72-pmu", "arm,cortex-a57-pmu"; >>>> interrupts = <GIC_PPI 7 IRQ_TYPE_LEVEL_LOW &part1>; >>>> interrupt-affinity = <&cpu_b0>, >>>> <&cpu_b1>; >>>> }; >>>> >>>> It can boot. And I test with Android simpleperf stat and perf top, it works! >>>> So these patches work on RK3399. >>> Good, thanks for testing. >>> >>>> But as I mentioned, we must change every interrupt in dts, do you think >>>> this is acceptable? >>> I can't see why not. >>> >>>>> Of course, you'll have to hack a bit in the PMU code to make it >>>>> understand per-PMU affinity together with percpu interrupts, but it >>>>> wouldn't be fun if there was nothing to do... >>>> I don't change drivers/perf/arm_pmu.c, it just work. >>> Having had a look with Mark, it may work, but it is rather unsafe. I may >>> have a go at it, but I'm going to have to rely on you to test it (or you >>> can send me a board ;-). >> I came up with the following (untested) patch. Please let me know if this >> works for you. >> >> Thanks, >> >> M. >> >> >From b88c08bb689d3fe40c46788453a07ba22dae9220 Mon Sep 17 00:00:00 2001 >> From: Marc Zyngier <marc.zyngier at arm.com> >> Date: Mon, 25 Apr 2016 11:23:54 +0100 >> Subject: [PATCH] drivers/perf: arm-pmu: Handle per-interrupt affinity mask >> >> On a big-little system, PMUs can be wired to CPUs using per CPU >> interrups (PPI). In this case, it is important to make sure that >> the enable/disable do happen on the right set of CPUs. >> >> Do this by querying the corresponding cpumask on the corresponding >> paths >> >> Signed-off-by: Marc Zyngier <marc.zyngier at arm.com> >> --- >> drivers/perf/arm_pmu.c | 13 +++++++++++-- >> 1 file changed, 11 insertions(+), 2 deletions(-) >> >> diff --git a/drivers/perf/arm_pmu.c b/drivers/perf/arm_pmu.c >> index f700908..3de5e1c 100644 >> --- a/drivers/perf/arm_pmu.c >> +++ b/drivers/perf/arm_pmu.c >> @@ -603,7 +603,11 @@ static void cpu_pmu_free_irq(struct arm_pmu *cpu_pmu) >> >> irq = platform_get_irq(pmu_device, 0); >> if (irq >= 0 && irq_is_percpu(irq)) { >> - on_each_cpu(cpu_pmu_disable_percpu_irq, &irq, 1); >> + struct cpumask ppi_cpumask; >> + >> + irq_get_percpu_devid_partition(irq, &ppi_cpumask); >> + on_each_cpu_mask(&ppi_cpumask, cpu_pmu_disable_percpu_irq, >> + &irq, 1); >> free_percpu_irq(irq, &hw_events->percpu_pmu); >> } else { >> for (i = 0; i < irqs; ++i) { >> @@ -638,6 +642,8 @@ static int cpu_pmu_request_irq(struct arm_pmu *cpu_pmu, irq_handler_t handler) >> >> irq = platform_get_irq(pmu_device, 0); >> if (irq >= 0 && irq_is_percpu(irq)) { >> + struct cpumask ppi_cpumask; >> + >> err = request_percpu_irq(irq, handler, "arm-pmu", >> &hw_events->percpu_pmu); >> if (err) { >> @@ -645,7 +651,10 @@ static int cpu_pmu_request_irq(struct arm_pmu *cpu_pmu, irq_handler_t handler) >> irq); >> return err; >> } >> - on_each_cpu(cpu_pmu_enable_percpu_irq, &irq, 1); >> + >> + irq_get_percpu_devid_partition(irq, &ppi_cpumask); >> + on_each_cpu_mask(&ppi_cpumask, cpu_pmu_enable_percpu_irq, >> + &irq, 1); >> } else { >> for (i = 0; i < irqs; ++i) { >> int cpu = i; > This patch reduce the count call cpu_pmu_enable/disable_percpu_irq. For > example, if I call > perf.android top --cpu 0 > only cpus 0~3 will enable and disable. > > But the original code is work too because reference count is right too. > We just enable the irq we do not want, but there is not side effects. That's because partition_irq_[un]mask do check that they are called on a CPU that matches the affinity of that IRQ, and bail out if not. I'm tempted to put a big fat WARN_ON() there. If there wasn't that test, you'd end-up enabling the interrupts for the other PMU, and generate unexpected interrupts. > Anyway, this patch work. Thanks for testing. > I believe the really wrong thing is we have to set interrupt-affinity > on device tree, but we also set interrupt-partition too. The information > is duplicated. Indeed, and that's something that should be addressed separately. Thanks, M. -- Jazz is not dead. It just smells funny...