Re: [PATCH] PCI: Re-enable downstream port LTR if it was previously enabled

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



[+cc Puranjay]

On Thu, Jan 14, 2021 at 04:47:24PM +0300, Mika Westerberg wrote:
> PCIe r5.0, sec 7.5.3.16 says that the downstream ports must reset the
> LTR enable bit if the link goes down (port goes DL_Down status). Now, if
> we had LTR previously enabled and the PCIe endpoint gets hot-removed and
> then hot-added back the ->ltr_path of the downstream port is still set
> but the port now does not have the LTR enable bit set anymore.

IIRC LTR is only needed for L1.2, and of course the LTR Capability
(Max Snoop/No-Snoop Latency registers) and the L1 PM Substates
Capability (LTR_L1.2_THRESHOLD) must be programmed before enabling
LTR.  For the bridge, I guess we're assuming those were programmed
before the hot-remove, and they remain valid after the hot-add.

But what about the endpoint that we hot-added?  How do we program its
LTR and L1 PM Substates Capabilities?  I know we have
aspm_calc_l1ss_info() for L1 PM Substates, but I really don't trust
it, and I don't think we do anything at all to program the LTR
Capability.

I used to think the LTR _DSM was a way to help us program the LTR
Capability, and Puranjay did a nice job implementing support for it
[1].  But I now think that _DSM doesn't give us enough information
(and of course it doesn't help at all for non-ACPI systems or for
hierarchies not integrated on the system board), so I didn't merge
Puranjay's work.

I tried to have some discussion in the PCI SIG about this, but it
never really went anywhere.  Here's my basic question, just for the
archives:

  I think the LTR capability Max Snoop registers could also use some
  clarification.  The base spec says "Software should set this to the
  platform's maximum supported latency or less."  I assume this
  platform data is supposed to come from the ACPI LTR _DSM.  The
  firmware spec says software should sum the latencies along the path
  between each downstream port (I wonder if this should say "Root
  Port"?) and an endpoint that supports LTR.  Switches not embedded in
  the platform will not have this _DSM, but I assume they contribute
  to this sum.  But I don't know *what* they contribute.

> For this reason check if the bridge upstrea had LTR enabled set
> previously and re-enable it before enabling LTR for the endpoint.

s/upstrea/upstream/
s/enabled set/enabled/

Seems like there could be more things in the upstream bridge that need
to be reprogrammed when the link comes back up (MPS, Common Clock
Configuration, etc?).

I don't see anything in the spec about link status affecting MPS, but
if we hot-removed a device that supported 4KB MPS and hot-added one
that only support 128B, we might need more extensive reconfiguration.
I haven't checked; maybe that's already covered?

I think Common Clock Config also depends on characteristics of the
hot-added device, so we might need to take a look at that, too.

If it turns out that we need to do more to the upstream bridge than
just this LTR thing, I wonder if we should pull it out to some kind of
"reconfig bridge" function so it's not buried in several random
places.

[1] https://lore.kernel.org/r/20201015080311.7811-1-puranjay12@xxxxxxxxx

> Reported-by: Utkarsh H Patel <utkarsh.h.patel@xxxxxxxxx>
> Signed-off-by: Mika Westerberg <mika.westerberg@xxxxxxxxxxxxxxx>
> ---
>  drivers/pci/probe.c | 17 ++++++++++++++++-
>  1 file changed, 16 insertions(+), 1 deletion(-)
> 
> diff --git a/drivers/pci/probe.c b/drivers/pci/probe.c
> index 0eb68b47354f..cd174e06f46f 100644
> --- a/drivers/pci/probe.c
> +++ b/drivers/pci/probe.c
> @@ -2153,7 +2153,7 @@ static void pci_configure_ltr(struct pci_dev *dev)
>  {
>  #ifdef CONFIG_PCIEASPM
>  	struct pci_host_bridge *host = pci_find_host_bridge(dev->bus);
> -	struct pci_dev *bridge;
> +	struct pci_dev *bridge = NULL;
>  	u32 cap, ctl;
>  
>  	if (!pci_is_pcie(dev))
> @@ -2191,6 +2191,21 @@ static void pci_configure_ltr(struct pci_dev *dev)
>  	if (pci_pcie_type(dev) == PCI_EXP_TYPE_ROOT_PORT ||
>  	    ((bridge = pci_upstream_bridge(dev)) &&
>  	      bridge->ltr_path)) {
> +		/*
> +		 * Downstream ports reset the LTR enable bit when the
> +		 * link goes down (e.g on hot-remove) so re-enable the
> +		 * bit here if not set anymore.
> +		 * PCIe r5.0, sec 7.5.3.16.
> +		 */
> +		if (bridge && pcie_downstream_port(bridge)) {

Why test for pcie_downstream_port(bridge) here?  "dev" is a PCIe
device, and "bridge" is a PCI device leading to "dev".  I think the
only possibilities are that "bridge" is a root port, a switch
downstream port, or a PCI-to-PCIe bridge, i.e., exactly what
pcie_downstream_port() tests for.

> +			pcie_capability_read_dword(bridge, PCI_EXP_DEVCTL2, &ctl);
> +			if (!(ctl & PCI_EXP_DEVCTL2_LTR_EN)) {
> +				pci_dbg(bridge, "re-enabling LTR\n");
> +				pcie_capability_set_word(bridge, PCI_EXP_DEVCTL2,
> +							 PCI_EXP_DEVCTL2_LTR_EN);
> +			}
> +		}
> +		pci_dbg(dev, "enabling LTR\n");
>  		pcie_capability_set_word(dev, PCI_EXP_DEVCTL2,
>  					 PCI_EXP_DEVCTL2_LTR_EN);
>  		dev->ltr_path = 1;
> -- 
> 2.29.2
> 



[Index of Archives]     [DMA Engine]     [Linux Coverity]     [Linux USB]     [Video for Linux]     [Linux Audio Users]     [Yosemite News]     [Linux Kernel]     [Linux SCSI]     [Greybus]

  Powered by Linux