On Fri, Feb 23, 2024 at 02:58:50PM -0600, Bjorn Helgaas wrote: > From: "David E. Box" <david.e.box@xxxxxxxxxxxxxxx> > > 4ff116d0d5fd ("PCI/ASPM: Save L1 PM Substates Capability for > suspend/resume") restored the L1 PM Substates Capability after resume, > which reduced power consumption by making the ASPM L1.x states work after > resume. > > a7152be79b62 ("Revert "PCI/ASPM: Save L1 PM Substates Capability for > suspend/resume"") reverted 4ff116d0d5fd because resume failed on some > systems, so power consumption after resume increased again. > > a7152be79b62 mentioned that we restore L1 PM substate configuration even > though ASPM L1 may already be enabled. This is due the fact that the > pci_restore_aspm_l1ss_state() was called before pci_restore_pcie_state(). > > Save and restore the L1 PM Substates Capability, following PCIe r6.1, sec > 5.5.4 more closely by: > > 1) Do not restore ASPM configuration in pci_restore_pcie_state() but > do that after PCIe capability is restored in pci_restore_aspm_state() > following PCIe r6.1, sec 5.5.4. > > 2) If BIOS reenables L1SS, particularly L1.2, we need to clear the > enables in the right order, downstream before upstream. Defer > restoring the L1SS config until we are at the downstream component. > Then update the config for both ends of the link in the prescribed > order. > > 3) Program ASPM L1 PM substate configuration before L1 enables. > > 4) Program ASPM L1 PM substate enables last, after rest of the fields > in the capability are programmed. > @@ -1645,12 +1647,23 @@ static void pci_restore_pcie_state(struct pci_dev *dev) > > cap = (u16 *)&save_state->cap.data[0]; > pcie_capability_write_word(dev, PCI_EXP_DEVCTL, cap[i++]); > - pcie_capability_write_word(dev, PCI_EXP_LNKCTL, cap[i++]); > + > + /* Restore LNKCTL register with ASPM control field clear */ > + lnkctl = cap[i++]; > + pcie_capability_write_word(dev, PCI_EXP_LNKCTL, > + lnkctl & ~PCI_EXP_LNKCTL_ASPMC); > + > pcie_capability_write_word(dev, PCI_EXP_SLTCTL, cap[i++]); > pcie_capability_write_word(dev, PCI_EXP_RTCTL, cap[i++]); > pcie_capability_write_word(dev, PCI_EXP_DEVCTL2, cap[i++]); > pcie_capability_write_word(dev, PCI_EXP_LNKCTL2, cap[i++]); > pcie_capability_write_word(dev, PCI_EXP_SLTCTL2, cap[i++]); > + > + pci_restore_aspm_l1ss_state(dev); > + > + /* Restore ASPM control after restoring L1SS state */ > + pcie_capability_set_word(dev, PCI_EXP_LNKCTL, > + lnkctl & PCI_EXP_LNKCTL_ASPMC); > } This part makes me wonder if the PCI_EXP_LNKCTL restore could be simplified by moving the L0s/L1 disable into pci_restore_aspm_l1ss_state() as in the patch below. It would be nice if the knowledge about L0s/L1 being disabled while restoring L1SS state were encapsulated in pci_restore_aspm_l1ss_state() instead of doing this dance in pci_restore_pcie_state(). I didn't include this in the v7 posting because it changes the sequence and I don't have a way to test it, and this whole thing is awfully tricky to get right. Bjorn diff --git a/drivers/pci/pci.c b/drivers/pci/pci.c index 4ea98665172d..5a4b501a3f41 100644 --- a/drivers/pci/pci.c +++ b/drivers/pci/pci.c @@ -1633,13 +1633,14 @@ static void pci_restore_pcie_state(struct pci_dev *dev) { int i = 0; struct pci_cap_saved_state *save_state; - u16 *cap, lnkctl; + u16 *cap; /* * Restore max latencies (in the LTR capability) before enabling * LTR itself in PCI_EXP_DEVCTL2. */ pci_restore_ltr_state(dev); + pci_restore_aspm_l1ss_state(dev); save_state = pci_find_saved_cap(dev, PCI_CAP_ID_EXP); if (!save_state) @@ -1654,23 +1655,12 @@ static void pci_restore_pcie_state(struct pci_dev *dev) cap = (u16 *)&save_state->cap.data[0]; pcie_capability_write_word(dev, PCI_EXP_DEVCTL, cap[i++]); - - /* Restore LNKCTL register with ASPM control field clear */ - lnkctl = cap[i++]; - pcie_capability_write_word(dev, PCI_EXP_LNKCTL, - lnkctl & ~PCI_EXP_LNKCTL_ASPMC); - + pcie_capability_write_word(dev, PCI_EXP_LNKCTL, cap[i++]); pcie_capability_write_word(dev, PCI_EXP_SLTCTL, cap[i++]); pcie_capability_write_word(dev, PCI_EXP_RTCTL, cap[i++]); pcie_capability_write_word(dev, PCI_EXP_DEVCTL2, cap[i++]); pcie_capability_write_word(dev, PCI_EXP_LNKCTL2, cap[i++]); pcie_capability_write_word(dev, PCI_EXP_SLTCTL2, cap[i++]); - - pci_restore_aspm_l1ss_state(dev); - - /* Restore ASPM control after restoring L1SS state */ - pcie_capability_set_word(dev, PCI_EXP_LNKCTL, - lnkctl & PCI_EXP_LNKCTL_ASPMC); } static int pci_save_pcix_state(struct pci_dev *dev) diff --git a/drivers/pci/pcie/aspm.c b/drivers/pci/pcie/aspm.c index c89ef87ed1c4..46352132bb14 100644 --- a/drivers/pci/pcie/aspm.c +++ b/drivers/pci/pcie/aspm.c @@ -105,6 +105,7 @@ void pci_restore_aspm_l1ss_state(struct pci_dev *pdev) struct pci_dev *parent = pdev->bus->self; u32 *cap, pl_ctl1, pl_ctl2, pl_l1_2_enable; u32 cl_ctl1, cl_ctl2, cl_l1_2_enable; + u16 clnkctl, plnkctl; /* * In case BIOS enabled L1.2 when resuming, we need to disable it first @@ -129,6 +130,17 @@ void pci_restore_aspm_l1ss_state(struct pci_dev *pdev) pl_ctl2 = *cap++; pl_ctl1 = *cap; + /* Make sure L0s/L1 are disabled before updating L1SS config */ + pcie_capability_read_word(pdev, PCI_EXP_LNKCTL, &clnkctl); + pcie_capability_read_word(parent, PCI_EXP_LNKCTL, &plnkctl); + if (FIELD_GET(PCI_EXP_LNKCTL_ASPMC, clnkctl) || + FIELD_GET(PCI_EXP_LNKCTL_ASPMC, plnkctl)) { + pcie_capability_write_word(pdev, PCI_EXP_LNKCTL, + clnkctl & ~PCI_EXP_LNKCTL_ASPMC); + pcie_capability_write_word(parent, PCI_EXP_LNKCTL, + plnkctl & ~PCI_EXP_LNKCTL_ASPMC); + } + /* * Disable L1.2 on this downstream endpoint device first, followed * by the upstream @@ -161,6 +173,13 @@ void pci_restore_aspm_l1ss_state(struct pci_dev *pdev) pci_write_config_dword(pdev, pdev->l1ss + PCI_L1SS_CTL1, cl_ctl1 | cl_l1_2_enable); } + + /* Restore L0s/L1 if they were enabled */ + if (FIELD_GET(PCI_EXP_LNKCTL_ASPMC, clnkctl) || + FIELD_GET(PCI_EXP_LNKCTL_ASPMC, plnkctl)) { + pcie_capability_write_word(parent, PCI_EXP_LNKCTL, clnkctl); + pcie_capability_write_word(pdev, PCI_EXP_LNKCTL, plnkctl); + } } #ifdef CONFIG_PCIEASPM