Re: [PATCH v2] cpufreq: intel_pstate: Change the calculation of next pstate

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



On Thursday, May 01, 2014 02:30:42 PM Dirk Brandewie wrote:
> On 05/01/2014 02:00 PM, Stratos Karafotis wrote:
> > Currently the driver calculates the next pstate proportional to
> > core_busy factor, scaled by the ratio max_pstate / current_pstate.
> >
> > Using the scaled load (core_busy) to calculate the next pstate
> > is not always correct, because there are cases that the load is
> > independent from current pstate. For example, a tight 'for' loop
> > through many sampling intervals will cause a load of 100% in
> > every pstate.
> >
> > So, change the above method and calculate the next pstate with
> > the assumption that the next pstate should not depend on the
> > current pstate. The next pstate should only be directly
> > proportional to measured load.
> >
> > Tested on Intel i7-3770 CPU @ 3.40GHz.
> > Phoronix benchmark of Linux Kernel Compilation 3.1 test shows an
> > increase ~1.5% in performance. Below the test results using turbostat
> > (5 iterations):
> >
> > Without patch:
> >
> > Ph. avg Time	Total time	PkgWatt		Total Energy
> > 	79.63	266.416		57.74		15382.85984
> > 	79.63	265.609		57.87		15370.79283
> > 	79.57	266.994		57.54		15362.83476
> > 	79.53	265.304		57.83		15342.53032
> > 	79.71	265.977		57.76		15362.83152
> > avg	79.61	266.06		57.74		15364.36985
> >
> > With patch:
> >
> > Ph. avg Time	Total time	PkgWatt		Total Energy
> > 	78.23	258.826		59.14		15306.96964
> > 	78.41	259.110		59.15		15326.35650
> > 	78.40	258.530		59.26		15320.48780
> > 	78.46	258.673		59.20		15313.44160
> > 	78.19	259.075		59.16		15326.87700
> > avg	78.34	258.842		59.18		15318.82650
> >
> > The total test time reduced by ~2.6%, while the total energy
> > consumption during a test iteration reduced by ~0.35%
> >
> > Signed-off-by: Stratos Karafotis <stratosk@xxxxxxxxxxxx>
> > ---
> >
> > Changes v1 -> v2
> > 	- Enhance change log as Rafael and Viresh suggested
> >
> >
> >   drivers/cpufreq/intel_pstate.c | 15 +++++++--------
> >   1 file changed, 7 insertions(+), 8 deletions(-)
> >
> > diff --git a/drivers/cpufreq/intel_pstate.c b/drivers/cpufreq/intel_pstate.c
> > index 0999673..8e309db 100644
> > --- a/drivers/cpufreq/intel_pstate.c
> > +++ b/drivers/cpufreq/intel_pstate.c
> > @@ -608,28 +608,27 @@ static inline void intel_pstate_set_sample_time(struct cpudata *cpu)
> >   	mod_timer_pinned(&cpu->timer, jiffies + delay);
> >   }
> >
> > -static inline int32_t intel_pstate_get_scaled_busy(struct cpudata *cpu)
> > +static inline int32_t intel_pstate_get_busy(struct cpudata *cpu)
> >   {
> > -	int32_t core_busy, max_pstate, current_pstate;
> > +	int32_t core_busy, max_pstate;
> >
> >   	core_busy = cpu->sample.core_pct_busy;
> >   	max_pstate = int_tofp(cpu->pstate.max_pstate);
> > -	current_pstate = int_tofp(cpu->pstate.current_pstate);
> > -	core_busy = mul_fp(core_busy, div_fp(max_pstate, current_pstate));
> > +	core_busy = mul_fp(core_busy, max_pstate);
> 
> NAK,  The goal of this code is to find out how busy the core is at the current
> P state. This change will return a value WAY too high.
> 
> Assume core_busy is 100 and the max non-turbo P state is 34 (3.4GHz) this code
> would return a busy value of 3400. The PID  is trying to keep the busy value
> at the setpoint any value of ~3% will drive the P state to the highest turbo
> P state in this example.

Well, the problem is that the numbers above indicate an improvement in energy
efficiency as a result of this patch and we need to explain that result.

-- 
I speak only for myself.
Rafael J. Wysocki, Intel Open Source Technology Center.
--
To unsubscribe from this list: send the line "unsubscribe cpufreq" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at  http://vger.kernel.org/majordomo-info.html




[Index of Archives]     [Linux Kernel Devel]     [Linux USB Devel]     [Linux Audio Users]     [Yosemite Forum]     [Linux SCSI]

  Powered by Linux