On Fri, Dec 08, 2017 at 11:50:00AM +0100, Vitaly Kuznetsov wrote:
> When we run nested KVM on Hyper-V guests we need to update masterclocks for
> all guests when L1 migrates to a host with different TSC frequency.
> Implement the procedure in the following way:
> - Pause all guests.
> - Tell our host (Hyper-V) to stop emulating TSC accesses.
> - Update our gtod copy, recompute clocks.
> - Unpause all guests.
>
> This is somewhat similar to cpufreq but we have two important differences:
> we can only disable TSC emulation globally (on all CPUs) and we don't know
> the new TSC frequency until we turn the emulation off so we can't
> 'prepare' ourselves to the event.
>
> Signed-off-by: Vitaly Kuznetsov <vkuznets@xxxxxxxxxx>
> ---
>  arch/x86/kvm/x86.c | 45 +++++++++++++++++++++++++++++++++++++++++++++
>  1 file changed, 45 insertions(+)
>
> diff --git a/arch/x86/kvm/x86.c b/arch/x86/kvm/x86.c
> index 96e04a0cb921..04d90712ffd2 100644
> --- a/arch/x86/kvm/x86.c
> +++ b/arch/x86/kvm/x86.c
> @@ -68,6 +68,7 @@
>  #include <asm/div64.h>
>  #include <asm/irq_remapping.h>
>  #include <asm/mshyperv.h>
> +#include <asm/hypervisor.h>
>
>  #define CREATE_TRACE_POINTS
>  #include "trace.h"
> @@ -5946,6 +5947,43 @@ static void tsc_khz_changed(void *data)
>  	__this_cpu_write(cpu_tsc_khz, khz);
>  }
>
> +void kvm_hyperv_tsc_notifier(void)
> +{
> +#ifdef CONFIG_X86_64
> +	struct kvm *kvm;
> +	struct kvm_vcpu *vcpu;
> +	int cpu;
> +
> +	spin_lock(&kvm_lock);
> +	list_for_each_entry(kvm, &vm_list, vm_list)
> +		kvm_make_mclock_inprogress_request(kvm);
> +
> +	hyperv_stop_tsc_emulation();
> +
> +	/* TSC frequency always matches when on Hyper-V */
> +	for_each_present_cpu(cpu)
> +		per_cpu(cpu_tsc_khz, cpu) = tsc_khz;
> +	kvm_max_guest_tsc_khz = tsc_khz;
> +
> +	list_for_each_entry(kvm, &vm_list, vm_list) {
> +		struct kvm_arch *ka = &kvm->arch;
> +
> +		spin_lock(&ka->pvclock_gtod_sync_lock);
> +
> +		pvclock_update_vm_gtod_copy(kvm);
> +
> +		kvm_for_each_vcpu(cpu, vcpu, kvm)
> +			kvm_make_request(KVM_REQ_CLOCK_UPDATE, vcpu);
> +
> +		kvm_for_each_vcpu(cpu, vcpu, kvm)
> +			kvm_clear_request(KVM_REQ_MCLOCK_INPROGRESS, vcpu);
> +
> +		spin_unlock(&ka->pvclock_gtod_sync_lock);
> +	}
> +	spin_unlock(&kvm_lock);

Can't you skip all this if the tsc frequency hasn't changed (which should
probably be the case when the CPU supports tsc frequency scaling)?

Roman.
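
To make the question concrete, here is a rough user-space model of the early exit being asked about. It is only a sketch of the control flow, not kernel code: every model_* name is hypothetical and stands in for the corresponding step of the quoted patch, and it assumes the new frequency can be compared against a cached value once emulation has been stopped.

```c
#include <assert.h>
#include <stdbool.h>

/* Hypothetical stand-ins for kernel state; none of these are real KVM symbols. */
static unsigned int model_host_khz = 2400000;   /* what the "host" TSC runs at */
static unsigned int model_cached_khz = 2400000; /* frequency the clocks were computed for */
static int model_resyncs;                       /* how many full resyncs ran */

/* Stands in for stopping TSC emulation; only afterwards is the real frequency visible. */
static unsigned int model_stop_emulation(void)
{
	return model_host_khz;
}

/* Stands in for the pause / update gtod copy / recompute / unpause sequence. */
static void model_resync_all_guests(unsigned int khz)
{
	model_cached_khz = khz;
	model_resyncs++;
}

/* Returns true when the full resync was performed, false when it was skipped. */
static bool model_tsc_notifier(void)
{
	unsigned int khz = model_stop_emulation();

	if (khz == model_cached_khz)
		return false;	/* frequency unchanged: nothing to recompute in this model */

	model_resync_all_guests(khz);
	return true;
}
```

The model only covers the frequency comparison. Whether the gtod copy still needs refreshing after a migration at an unchanged frequency is a separate question that it does not answer.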