* Balbir Singh <balbir@xxxxxxxxxxxxxxxxxx> [2009-10-05 09:02:56]: > * Len Brown <lenb@xxxxxxxxxx> [2009-10-03 01:56:32]: > > > This driver does not use the kernel's CPU hot-plug mechanism > > because after the transient emergency is over, the system must > > be returned to its normal state, and hotplug would permanently > > break both cpusets and binding. > > > > Why does hotplug break cpusets and binding? CPU hotplug will break any cpuset that is setup with that cpu. Userspace needs to register for notifications and redo the cpusets to get desired behaviour. Since these emergencies are short durations, these may require constant re-creation of cpusets leading to an administrative overhead and added complexity. Hence Peter had suggested a transparent method to reduce system capacity and not really knock out a single cpu. The goal is to run something like 6 cpus at a time in an 8 cpu system without starving any single cpu. Also real time jobs should still allowed to be run since they will generally be very short running. In this patch series, the main design issue was about running a forced-idle thread that may affect scheduling fairness. > > So to force idle, the driver creates a power saving thread. > > The scheduler will migrate the thread to the preferred CPU. > > The thread has max priority and has SCHED_RR policy, > > so it can occupy one CPU. To save power, the thread will > > invoke the deep C-state entry instructions. > > > > To avoid starvation, the thread will sleep 5% of the time > > time for every second (current RT scheduler has threshold > > to avoid starvation, but if other CPUs are idle, > > the CPU can borrow CPU timer from other, > > which makes the mechanism not work here) > > > > Vaidyanathan Srinivasan has proposed scheduler enhancements > > to allow injecting idle time into the system. This driver doesn't > > depend on those enhancements, but could cut over to them > > when they are available. > > > > Peter Z. does not favor upstreaming this driver until > > the those scheduler enhancements are in place. However, > > we favor upstreaming this driver now because it is useful > > now, and can be enhanced over time. > > > > Signed-off-by: Shaohua Li <shaohua.li@xxxxxxxxx> > > NACKed-by: Peter Zijlstra <a.p.zijlstra@xxxxxxxxx> > > Cc: Vaidyanathan Srinivasan <svaidy@xxxxxxxxxxxxxxxxxx> > > Signed-off-by: Len Brown <len.brown@xxxxxxxxx> > > This is a first a patch with a NACKed-by, could we please have more > discussion on the proposed design. This is the most recent reference I have: http://marc.info/?l=linux-acpi&m=124650086915649&w=2 --Vaidy -- To unsubscribe from this list: send the line "unsubscribe linux-acpi" in the body of a message to majordomo@xxxxxxxxxxxxxxx More majordomo info at http://vger.kernel.org/majordomo-info.html