On 06/27/2013 03:04 AM, Tejun Heo wrote: > Hey, > > On Wed, Jun 26, 2013 at 11:58:48PM +0530, Srivatsa S. Bhat wrote: >> Yes, we were discussing hot-unplug latency for use-cases such as >> suspend/resume. We didn't want to make those operations slower in the >> process of removing stop_machine() from hotplug. > > Can you please explain why tho? Sure. > How much would it help in terms of > power-saving? Or are there other issues in taking longer to shut down > cpus? > Basically, power-management techniques (like suspend/resume as used on smartphones etc) are implemented using a controlling algorithm which controls the aggressiveness of power-management depending on the load on the system. Today, the cpuidle subsystem in Linux handles cpuidle transitions using that technique, so I'll explain using that as an example. Similar strategies are used for suspend/resume also. For every cpuidle state, we have atleast 2 parameters that determine how usable it is - 1. exit latency 2. target residency. The exit latency captures the amount of time it takes to come out of the power-saving state, from the time you asked it to come out. This is an estimate of how "deep" the power-saving state is. The deeper it is, the longer it takes to come out. (And of course, deeper states give higher power-savings). The target residency is a value which represents the break-even for that state. It tells us how long we should stay in that idle state (at a minimum) before we ask it to come out, to actually get some good power-savings out of that state. (Note that entry and exit to an idle state itself consumes some power, so there won't be any power-savings if we keep entering and exiting a deep state without staying in that state long enough). Once the idle states are characterized like this, the cpuidle subsystem uses a cpuidle "governor" to actually decide which of the states to enter, given an idle period of time. The governor keeps track of the load on the system and predicts the length of the next idle period, and based on that prediction, it selects the idle state whose characteristics (exit latency/target residency) match the predicted idle time closely. So as we can see, the "deeper" the idle state, the lower its chance of getting used during regular workload runs, because it is deemed too costly. So entry/exit latency of an idle state is a very important aspect which determines how often you can use that state. Ideally we want states which are "deep" in the sense that they give huge power-savings, but "light" in the sense that their entry/exit latencies are low. Such states give us the best benefits, since we can use them aggressively. Now a similar argument applies for suspend/resume (on smartphones etc). Suspend/resume already gives good power-savings. But if we make it slower, the chances of using it profitably begins to reduce. That's why, IMHO, its important to keep a check on the latency of CPU hotplug and reduce it if possible. And that becomes even more important as systems keep sporting more and more CPUs. Regards, Srivatsa S. Bhat -- To unsubscribe from this list: send the line "unsubscribe linux-arch" in the body of a message to majordomo@xxxxxxxxxxxxxxx More majordomo info at http://vger.kernel.org/majordomo-info.html