Re: [PATCH 16/24] Implement VMLAUNCH and VMRESUME

Avi Kivity <avi@xxxxxxxxxx> · Sun, 26 Sep 2010 14:56:54 +0200

 On 09/26/2010 01:14 PM, Nadav Har'El wrote:
On Mon, Jun 14, 2010, Avi Kivity wrote about "Re: [PATCH 16/24] Implement VMLAUNCH and VMRESUME":
>  >+	vmx_set_cr0(vcpu,
>  >+		(get_shadow_vmcs(vcpu)->guest_cr0&
>  >+			~get_shadow_vmcs(vcpu)->cr0_guest_host_mask) |
>  >+		(get_shadow_vmcs(vcpu)->cr0_read_shadow&
>  >+			get_shadow_vmcs(vcpu)->cr0_guest_host_mask));
>  >+
>  >+	/* However, vmx_set_cr0 incorrectly enforces KVM's relationship
>  >between
>  >+	 * GUEST_CR0 and CR0_READ_SHADOW, e.g., that the former is the same
>  >as
>  >+	 * the latter with with TS added if !fpu_active. We need to take the
>  >+	 * actual GUEST_CR0 that L1 wanted, just with added TS if !fpu_active
>  >+	 * like KVM wants (for the "lazy fpu" feature, to avoid the costly
>  >+	 * restoration of fpu registers until the FPU is really used).
>  >+	 */
>  >+	vmcs_writel(GUEST_CR0, get_shadow_vmcs(vcpu)->guest_cr0 |
>  >+		(vcpu->fpu_active ? 0 : X86_CR0_TS));
>  >
>
>  Please update vmx_set_cr0() instead.

How would you like that I do that?
I could split vmx_set_cr0(vcpu, cr0) into a __vmx_set_cr0(vcpu, cr0, hw_cr0)
and vmx_set_cr0 that calls it. Is this what you had in mind? Won't it be
a little ugly? I agree, though, that it will avoid the vmwriting GUEST_CR0
twice in the nested case.

Just move the extra calculations into vmx_set_cr0().  Check if you're in 
nested mode, and if so apply cr0_guest_host_mask.

The vmlaunch/vmresume code becomes kvm_set_cr0(vcpu, 
get_shadow_vmcs(vcpu)->guest_cr0).

>  >+	/* we have to set the X86_CR0_PG bit of the cached cr0, because
>  >+	 * kvm_mmu_reset_context enables paging only if X86_CR0_PG is set in
>  >+	 * CR0 (we need the paging so that KVM treat this guest as a paging
>  >+	 * guest so we can easly forward page faults to L1.)
>  >+	 */
>  >+	vcpu->arch.cr0 |= X86_CR0_PG;
>  >
>
>  Since this version doesn't support unrestricted nested guests, cr0.pg
>  will be already set or we will have failed vmentry.

I believe without this "hack", things didn't work properly during boot of
L2, when cr0_read_shadow.pg was not yet set. I think PG is set in guest_cr0,
but not in cr0_read_shadow, which is what vcpu->arch.cr0 caches.

I don't see how vcpu->arch.cr0 can cache cr0_read_shadow.  All the mmu 
calculations depend on vcpu->arch.cr0, which must be what the processor 
uses for translations.  cr0_read_shadow is only use to emulate read 
access to cr0 (note we need to both update the real CR0_READ_SHADOW, and 
to consider the virtual CR0_READ_SHADOW when emulating).

--
error compiling committee.c: too many arguments to function

--
To unsubscribe from this list: send the line "unsubscribe kvm" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at  http://vger.kernel.org/majordomo-info.html