On 7/27/2016 9:55 AM, Christoph Lameter wrote:
The critical piece of code is this: /* * Cycle through CPUs to check if the CPUs stay synchronized * to each other. */ next_cpu = cpumask_next(raw_smp_processor_id(), cpu_online_mask); if (next_cpu >= nr_cpu_ids) next_cpu = cpumask_first(cpu_online_mask); watchdog_timer.expires += WATCHDOG_INTERVAL; add_timer_on(&watchdog_timer, next_cpu); Should we just cycle through the cpus that are not isolated? Otherwise we need to have some means to check the clocksources for accuracy remotely (probably impossible for TSC etc).
That sounds like the right idea - use the housekeeping cpu mask instead of the cpu online mask. Should be a straightforward patch; do you want to do that and test it in your configuration, and I'll include it in the next spin of the patch series? Thanks for your testing! -- Chris Metcalf, Mellanox Technologies http://www.mellanox.com -- To unsubscribe from this list: send the line "unsubscribe linux-api" in the body of a message to majordomo@xxxxxxxxxxxxxxx More majordomo info at http://vger.kernel.org/majordomo-info.html