On Wed, Feb 21, 2024 at 09:16:22PM +0200, Ville Syrjälä wrote: > On Wed, Feb 21, 2024 at 09:05:43PM +0200, Ville Syrjälä wrote: > > On Wed, Feb 21, 2024 at 09:53:21AM +0200, Jouni Högander wrote: > > > Current fast and IO wake lines calculation is assuming fast wake sync > > > length is 18 pulses. Let's improve this by checking the actual length. > > > > > > Also 10 us IO buffer wake time is currently assumed. This is not the case > > > with LunarLake and beyond. Fix this by adding getter for IO wake time and > > > return values there according to Bspec. > > > > > > Bspec: 65450 > > > > > > Signed-off-by: Jouni Högander <jouni.hogander@xxxxxxxxx> > > > --- > > > drivers/gpu/drm/i915/display/intel_psr.c | 40 +++++++++++++++++++----- > > > 1 file changed, 33 insertions(+), 7 deletions(-) > > > > > > diff --git a/drivers/gpu/drm/i915/display/intel_psr.c b/drivers/gpu/drm/i915/display/intel_psr.c > > > index 72cadad09db5..4a1e07411716 100644 > > > --- a/drivers/gpu/drm/i915/display/intel_psr.c > > > +++ b/drivers/gpu/drm/i915/display/intel_psr.c > > > @@ -1150,6 +1150,28 @@ static bool _lnl_compute_alpm_params(struct intel_dp *intel_dp, > > > return true; > > > } > > > > > > +/* > > > + * From Bspec: > > > + * > > > + * For Xe2 and beyond > > > + * RBR 15us, HBR1 11us, higher rates 10us > > > + * > > > + * For pre-Xe2 > > > + * 10 us > > > + */ > > > +static int get_io_wake_time(struct intel_dp *intel_dp, > > > > No point in passing that. You can dig out the i915 from the crtc state. > > > > > + struct intel_crtc_state *crtc_state) > > > > const > > > > > +{ > > > + struct drm_i915_private *i915 = dp_to_i915(intel_dp); > > > + > > > + if (DISPLAY_VER(i915) < 20 || crtc_state->port_clock > 270000) > > > + return 10; > > > + else if (crtc_state->port_clock > 162000) > > > + return 11; > > > + else > > > + return 15; > > > > The new rate dependent stuff should be a separate patch. > > > > And looks like the 10 usec will give us 44 usec io wake time, so > > that should probably be a separate patch as well, to avoid > > any functional changes when we introduce the formula. > > > > > +} > > > + > > > static bool _compute_alpm_params(struct intel_dp *intel_dp, > > > struct intel_crtc_state *crtc_state) > > > { > > > @@ -1157,13 +1179,17 @@ static bool _compute_alpm_params(struct intel_dp *intel_dp, > > > int io_wake_lines, io_wake_time, fast_wake_lines, fast_wake_time; > > > u8 max_wake_lines; > > > > > > - if (DISPLAY_VER(i915) >= 12) { > > > - io_wake_time = 42; > > > - /* > > > - * According to Bspec it's 42us, but based on testing > > > - * it is not enough -> use 45 us. > > > - */ > > > - fast_wake_time = 45; > > > + if (intel_dp->get_aux_fw_sync_len) { > > > + int io_wake_time = get_io_wake_time(intel_dp, crtc_state); > > > > Looks like this will shadow the variable you're trying to change. > > Does the compiler not complain about this? > > > > > + int tfw_exit_latency = 20; /* eDP spec */ > > > + int phy_wake = 4; /* eDP spec */ > > > + int preamble = 8; /* eDP spec */ > > > + int precharge = intel_dp->get_aux_fw_sync_len() - preamble; > > > + > > > + io_wake_time = max(precharge, io_wake_time) + preamble + > > > + phy_wake + tfw_exit_latency; > > > + fast_wake_time = precharge + preamble + phy_wake + > > > + tfw_exit_latency; > > > > > > /* TODO: Check how we can use ALPM_CTL fast wake extended field */ > > > max_wake_lines = 12; > > > > I would also convert the older platforms to use the formula. > > We do need to reverse calculate the io buffer on latency since > > AFAICS it's not directly specified in bspec. But I think > > that's better than not converting it since with the formula we > > can't totally screw things up when eg. changing the precharge > > length. > > Hmm. The older platforms are apparently using fast_wake=32 > which implies zero precharge pulses. That definitely does > not match what we program into the AUX control register... Looks like Windows just uses: pre-tgl: fast_wake=50 io_fast_wake=50 tgl-mtl: fast_wake=42 io_fast_wake=42 Also for pre-tgl they clamp these to 5-8 instead of using the min=7 we have. For tgl+ they do clamp to 7-12. And if the values exceed those limits they just proceed blindly with the clamped values, which is pretty dodgy. -- Ville Syrjälä Intel