On Tuesday 11 August 2009, Alan Stern wrote: > On Tue, 11 Aug 2009, Rafael J. Wysocki wrote: > > > In fact, we don't need the layers at all. The only thing we have to assure is > > that, during resume, the devices given device depends on will be handled > > before we start to handle this particular device (inversely during suspend). > > > > Please note that we're not even allowed to start executing the device's > > resume callback before the callbacks of the devices it depends on have > > returned (the same applies to the suspend callbacks, but the other way around). > > The general algorithm for maximum parallelism goes as follows: Start by > resuming (in parallel) all the devices which don't depend on anything > else. Each time a resume finishes, you go on to resume (in parallel) > all the devices which depend only on resumed devices and which haven't > yet started to resume. > > As described, this can require a large number of threads. It also > requires detailed knowledge of which devices depend on others, which we > don't have. It's even more complicated than that. Assume we have 7 devices, A-G, such that A is the parent of B and C, B is the parent of D and E, and C is the parent of F and G. Assume in addition that the PM dependencies between the devices are fully reflected by the device tree structure (ie. there are no dependencies that aren't reflected parent-child relationships) and that B and G take 0.5 s to resume while the others take < 1 ms each. So, the total sequential resume time is 2 s + O(1 ms). Now, if we used the above algorithm, we'd first resume DEFG which would take 1 s because of G, then we'd resume BC which would take 1 s because of B and the total resume time is again 2 s + O(1 ms). However, one can observe that B doesn't need to wait for G to resume, because they are independent of each other. So, we can resume BDE in parallel with CFG, while of course DE have to wait for B and so on, but this way we can theoretically reduce the total resume time to 1 s + O(1 ms). The question is how to do that and it seems to me that we can use completions for this purpose. Namely, add a completion to each device with the following rules: 1) all completions are reset before dpm_resume(), 2) before executing the ->resume() callback for device D, we wait for the completion of the D's parent, 3) we complete the D's completion after executing its ->resume() callback. Also, the items executed in parallel are now the "wait for the parent's completion, run our callback and complete our completion" things. At first sight I don't see anything fundamentally wrong with this approach. Thanks, Rafael -- To unsubscribe from this list: send the line "unsubscribe linux-acpi" in the body of a message to majordomo@xxxxxxxxxxxxxxx More majordomo info at http://vger.kernel.org/majordomo-info.html