On Tue, 1 Oct 2013, Johannes Weiner wrote: > On Tue, Oct 01, 2013 at 04:31:23PM -0700, David Rientjes wrote: > > for_each_online_cpu() needs the protection of {get,put}_online_cpus() so > > cpu_online_mask doesn't change during the iteration. > > There is no problem report here. > > Is there a crash? > No. > If it's just accuracy of the read, why would we care about some > inaccuracies in counters that can change before you even get the > results to userspace? And care to the point where we hold up CPU > hotplugging for this? > cpu_hotplug.lock is held while a cpu is going down, it's a coarse lock that is used kernel-wide to synchronize cpu hotplug activity. Memcg has a cpu hotplug notifier, called while there may not be any cpu hotplug refcounts, which drains per-cpu event counts to memcg->nocpu_base.events to maintain a cumulative event count as cpus disappear. Without get_online_cpus() in mem_cgroup_read_events(), it's possible to account for the event count on a dying cpu twice, and this value may be significantly large. In fact, all memcg->pcp_counter_lock use should be nested by {get,put}_online_cpus(). This fixes that issue and ensures the reported statistics are not vastly over-reported during cpu hotplug. > Also, the fact that you directly sent this to Linus suggests there is > some urgency for this fix. What's going on? > I believe users of cpu hotplug still want event counts that are approximate to the real value and that this is 3.12 material. -- To unsubscribe from this list: send the line "unsubscribe cgroups" in the body of a message to majordomo@xxxxxxxxxxxxxxx More majordomo info at http://vger.kernel.org/majordomo-info.html