On Thu, Apr 14, 2022 at 08:06:32PM +0300, Mathias Nyman wrote: > On 14.4.2022 19.30, Evan Green wrote: > > Hi Alan and Mathias, > > > > On Thu, Apr 14, 2022 at 7:21 AM Alan Stern <stern@xxxxxxxxxxxxxxxxxxx> wrote: > >> Evan, this discussion suggests that you rewrite your patch as a series > >> of three: > >> > >> 1. Change choose_wakeup() so that for PM_EVENT_FREEZE, wakeup is > >> always disabled. > > > > If I understand this correctly, this means potentially runtime > > resuming the device so its wakeup setting can be consistently set to > > wakeups disabled across a freeze transition. The kernel already does this for you. All you have to do is change the routine so that it always decides that wakeup should be off for FREEZE. > > Got it I think in terms > > of the "how". > >> 2. Change the xhci-hcd interrupt handler so that port-status > >> changes are ignored if the port's root hub is suspended with > >> wakeup disabled. > > > > This part confuses me. This would be way deep under > > xhci_handle_event(), probably in handle_port_status(), just throwing > > away certain events that come in the ring. How would we know to go > > back and process those events later? We wouldn't. There's no need to process the events later. When a hub (including a root hub) is resumed, the hub driver checks the state of each port and takes whatever actions are needed to handle any changes that occurred while the hub was suspended. In fact, processing these events wouldnn't really accomplish very much in any case. The driver would request that the root hub be resumed, that request would be submitted to a work queue, and then nothing would happen because the work queue, like many other kernel threads, gets frozen during a hibernation transition. All that's really needed is to guarantee that the root hub would be resumed when we leave hibernation. And of course, this always happens regardless of what events were received in the meantime. > > I think we don't need to do this > > if we suspend the controller as in #3 below. The suspended (halted) > > controller wouldn't generate event interrupts (since the spec mentions > > port status change generation is gated on HCHalted). So we're already > > covered against receiving interrupts in this zone by halting the > > controller, and the events stay nicely pending for when we restart it > > in thaw. > > Was thinking the same here. It would be nice to have this to comply with > usb spec, keeping roothub from propagating connect/disconnect events > immediately after suspending it with wake flags cleared. > > But it's a lot of work to implement this, and for this issue, and linux > hibernate point of view I don't think it has any real benefit. > The actual device generating the interrupt is the host (parent of roothub), > and that will stop once freeze() is called for it in #3 The only reason that approach works is because we never disable resume requests during runtime suspend. But okay... > > Is the goal of #1 purely a setup change for #2, or does it stand on > > its own even if we nixed #2? Said differently, is #1 trying to ensure > > that wake signaling doesn't occur at all between freeze and thaw, even > > when the controller is suspended and guaranteed not to generate > > interrupts via its "normal" mechanism? I don't have a crisp mental > > picture of how the wake signaling works, but if the controller wake > > mechanism sidesteps the original problem of sending an MSI to a dead > > CPU (as in, it does not use MSIs), then it might be ok as-is. > > #1 is needed because xHCI can generate wake events even when halted if > device initiated resume signaling is detected on a roothub port. > Just like it can generate wake events on connect/disconnect if wake flags > are set. (xhci spec figure 4-34, see PLS=Resume) > We want to avoid those wakeups between freeze-thaw Think of it this way: All USB hubs, including root hubs, always relay a resume request upstream when one is received on a downstream port, no matter what their wakeup setting is. A hub's wakeup setting only controls whether it generates a resume request on its own in response to a port-status change. > So just #1 and #3 should probably solve this, and be an easier change. Let's try it and see. Alan Stern