On Fri, 23 Jun 2006, Benjamin Herrenschmidt wrote: > > > It's not so simple as just freezing khubd. Devices can be created and > > destroyed in responsing to requests from userspace (e.g., writing to > > /sys/.../bConfigurationValue). It's not at all clear to me how we could > > reliably prevent or delay such requests. Right now we rely on userspace > > and khubd _both_ being frozen. > > You can easily deal with userspace by either error'ing out when in > suspend or by blocking in the write to sysfs until resume. Erroring out is not a satisfactory option. Blocking might be okay; the real question being when. Having a global flag or system_state setting would help. But then how does the code know when to unblock? Adding a waitqueue to every usb_device seems like overkill... In fact, why not plug things at the source? Have device_add() and device_del() block, starting just after prepare() is over and continuing until just before finish() is called. If any drivers are bothered by this... well, they were notified. > > Perhaps the best answer is to require callers to lock the parent device > > when creating or removing a child (USB does this already). Under the > > assumption that you'll never want to create or remove a child of an > > already-suspended parent, things should be okay. The PM core _should_ be > > able to handle a device being added or removed while some parts of > > the system are suspended or frozen, just so long as the actual parent is > > still awake. Uevents can safely be queued until userspace is unfrozen or > > otherwise able to process them. > > But that means that you'll end up with potentially a new device inserted > that will be awake, the driver will not have had prepare() nor suspend() > called and the machine will go to sleep... You're right that the driver will not have seen prepare(), but you're wrong about suspend(). When a new device structure is registered, it is added to the end of the list of all unsuspended devices. On each iteration of the suspend loop, the PM core removes the last entry from the list and calls its suspend method. Thus the new device's suspend method will be called right away. > Then there is the problem of those hotplug events that can't be handled > during the suspend process Why is this a problem? There are other times when hotplug events can't be handled, and we seem to survive them okay. > etc.. > > I think it's sane to just forbid/block insertion of new devices during > suspend. Will make life easier for everybody. It's hard to know what the ramifications are without actually trying it. > > I'm concerned about remote wakeup events arriving at inconvenient times > > during STR or STD. Sometimes you might want them to abort the suspend, > > sometimes you might want to just drop them, and sometimes you might want > > them to wake the system up right after it goes to sleep. It would be nice > > to get this straightened out. > > It's not even clear to me that there is not a race in HW with wakeup > events in that case. I'd put that problem far beyond just getting a > stable suspend/resume process though right now on the priority list. Agreed. Alan Stern