Re: [PATCH] arm64: kvm: Remove redundant KVM_ARM64_FP_HOST flag

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



On Tue, Jul 07, 2020 at 10:33:50PM +0100, Andrew Scull wrote:
> On Tue, Jul 07, 2020 at 05:59:58PM +0100, Dave Martin wrote:
> > On Tue, Jul 07, 2020 at 03:57:13PM +0100, Andrew Scull wrote:
> > > The FPSIMD registers can be in one of three states:
> > >  (a) loaded with the user task's state
> > >  (b) loaded with the vcpu's state
> > >  (c) dirty with transient state
> > > 
> > > KVM_ARM64_FP_HOST identifies the case (a). When loading the vcpu state,
> > > this is used to decide whether to save the current FPSIMD registers to
> > > the user task.
> > > 
> > > However, at the point of loading the vcpu's FPSIMD state, it is known
> > > that we are not in state (b). States (a) and (c) can be distinguished by
> > > by checking the TIF_FOREIGN_FPSTATE bit, as was previously being done to
> > > prepare the KVM_ARM64_FP_HOST flag but without the need for mirroring
> > > the state.
> > 
> > In general there's another case
> > 
> > 	(d) loaded with some unrelated user task's state
> > 
> > I have a vague memory that the hyp trap code is supposed to save state
> > back to whatever task it belonged to -- but functions like
> > kvm_arch_vcpu_run_map_fp() make me suspicious that if this can happen,
> > it doesn't work correctly.
> > 
> > Since you're digging anyway, I'll answer in the form of a question:
> > when we reach __hyp_handle_fpsimd(), can the state in the FPSIMD/SVE
> > regs be unsaved data belonging to another task?  I'd hope not, because
> > fpsimd_thread_switch() should have saved any dirty regs when scheduling
> > that other thread out.
> 
> IIUC, (d) would fall under state (c) as the switch to the vcpu's user
> task will have set the TIF_FOREIGN_FPSTATE bit after saving the previous
> task's state in fpsimd_thread_switch, as you hoped.
> 
> > If the regs can't be owned by another task, then there may be some scope
> > for simplifying the code along the lines you suggest...
> > 
> > (See also below)
> > 
> > > 
> > > Signed-off-by: Andrew Scull <ascull@xxxxxxxxxx>
> > > ---
> > > This is the result of trying to get my head around the FPSIMD handling.
> > > If I've misunderstood something I'll be very happy to have it explained
> > > to me :)
> > 
> > Er, me too.  It's a while since I worked on this ;)
> 
> Thank you for the detailed reply, it's given me some very helpful
> context!
> 
> > > ---
> > >  arch/arm64/include/asm/kvm_host.h       | 11 +++++----
> > >  arch/arm64/kvm/fpsimd.c                 |  1 -
> > >  arch/arm64/kvm/hyp/include/hyp/switch.h | 30 +++++++++++++++++--------
> > >  3 files changed, 26 insertions(+), 16 deletions(-)
> > > 
> > > diff --git a/arch/arm64/include/asm/kvm_host.h b/arch/arm64/include/asm/kvm_host.h
> > > index e0920df1d0c1..d3652745282d 100644
> > > --- a/arch/arm64/include/asm/kvm_host.h
> > > +++ b/arch/arm64/include/asm/kvm_host.h
> > > @@ -370,12 +370,11 @@ struct kvm_vcpu_arch {
> > >  /* vcpu_arch flags field values: */
> > >  #define KVM_ARM64_DEBUG_DIRTY		(1 << 0)
> > >  #define KVM_ARM64_FP_ENABLED		(1 << 1) /* guest FP regs loaded */
> > > -#define KVM_ARM64_FP_HOST		(1 << 2) /* host FP regs loaded */
> > > -#define KVM_ARM64_HOST_SVE_IN_USE	(1 << 3) /* backup for host TIF_SVE */
> > > -#define KVM_ARM64_HOST_SVE_ENABLED	(1 << 4) /* SVE enabled for EL0 */
> > > -#define KVM_ARM64_GUEST_HAS_SVE		(1 << 5) /* SVE exposed to guest */
> > > -#define KVM_ARM64_VCPU_SVE_FINALIZED	(1 << 6) /* SVE config completed */
> > > -#define KVM_ARM64_GUEST_HAS_PTRAUTH	(1 << 7) /* PTRAUTH exposed to guest */
> > > +#define KVM_ARM64_HOST_SVE_IN_USE	(1 << 2) /* backup for host TIF_SVE */
> > > +#define KVM_ARM64_HOST_SVE_ENABLED	(1 << 3) /* SVE enabled for EL0 */
> > > +#define KVM_ARM64_GUEST_HAS_SVE		(1 << 4) /* SVE exposed to guest */
> > > +#define KVM_ARM64_VCPU_SVE_FINALIZED	(1 << 5) /* SVE config completed */
> > > +#define KVM_ARM64_GUEST_HAS_PTRAUTH	(1 << 6) /* PTRAUTH exposed to guest */
> > >  
> > >  #define vcpu_has_sve(vcpu) (system_supports_sve() && \
> > >  			    ((vcpu)->arch.flags & KVM_ARM64_GUEST_HAS_SVE))
> > > diff --git a/arch/arm64/kvm/fpsimd.c b/arch/arm64/kvm/fpsimd.c
> > > index e329a36b2bee..4e9afeb31989 100644
> > > --- a/arch/arm64/kvm/fpsimd.c
> > > +++ b/arch/arm64/kvm/fpsimd.c
> > > @@ -65,7 +65,6 @@ void kvm_arch_vcpu_load_fp(struct kvm_vcpu *vcpu)
> > >  	vcpu->arch.flags &= ~(KVM_ARM64_FP_ENABLED |
> > >  			      KVM_ARM64_HOST_SVE_IN_USE |
> > >  			      KVM_ARM64_HOST_SVE_ENABLED);
> > > -	vcpu->arch.flags |= KVM_ARM64_FP_HOST;
> > 
> > I'm wondering whether the original code is buggy here.
> > 
> > If the FPSIMD/SVE regs contain some other task's data, we'd overwrite
> > current's regs with that data when running __hyp_handle_fpsimd().
> > 
> > Maybe we should have been checking TIF_FOREIGN_FPSTATE here.  If the
> > issue can happen, your version may fix it.
> > 
> > If we wanted to keep the separate flag (see below for some rationale),
> > it might make sense to do:
> > 
> >   	vcpu->arch.flags &= ~(KVM_ARM64_FP_ENABLED |
> >   			      KVM_ARM64_HOST_SVE_IN_USE |
> >   			      KVM_ARM64_HOST_SVE_ENABLED |
> > 			      KVM_ARM64_FP_HOST);
> > 	if (!test_thread_flag(TIF_FOREIGN_FPSTATE))
> > 		vcpu->arch.flags |= KVM_ARM64_FP_HOST;
> 
> This suggestion looks like a nicer way of writing the current behaviour.
> The code at the moment checks the TIF_FOREIGN_FPSTATE bit in
> update_fp_enabled and clears the host flag if it is set. So it doesn't
> appear to be buggy but this would be a nice patch.
> 
> > >  	if (test_thread_flag(TIF_SVE))
> > >  		vcpu->arch.flags |= KVM_ARM64_HOST_SVE_IN_USE;
> > > diff --git a/arch/arm64/kvm/hyp/include/hyp/switch.h b/arch/arm64/kvm/hyp/include/hyp/switch.h
> > > index 8f622688fa64..beadf17f12a6 100644
> > > --- a/arch/arm64/kvm/hyp/include/hyp/switch.h
> > > +++ b/arch/arm64/kvm/hyp/include/hyp/switch.h
> > > @@ -33,16 +33,24 @@ extern const char __hyp_panic_string[];
> > >  static inline bool update_fp_enabled(struct kvm_vcpu *vcpu)
> > >  {
> > >  	/*
> > > -	 * When the system doesn't support FP/SIMD, we cannot rely on
> > > -	 * the _TIF_FOREIGN_FPSTATE flag. However, we always inject an
> > > -	 * abort on the very first access to FP and thus we should never
> > > -	 * see KVM_ARM64_FP_ENABLED. For added safety, make sure we always
> > > +	 * When entering the vcpu during a KVM_VCPU_RUN call before the vcpu
> > > +	 * has used FPSIMD, FPSIMD is disabled for the vcpu and will trap when
> > > +	 * it is first used. The FPSIMD state currently bound to the cpu is
> > > +	 * that of the user task.
> > > +	 *
> > > +	 * After the vcpu has used FPSIMD, on subsequent entries into the vcpu
> > > +	 * for the same KVM_VCPU_RUN call, the vcpu's FPSIMD state is bound to
> > > +	 * the cpu. Therefore, if _TIF_FOREIGN_FPSTATE is set, we know the
> > > +	 * FPSIMD registers no longer contain the vcpu's state. In this case we
> > > +	 * must, once again, disable FPSIMD.
> > > +	 *
> > > +	 * When the system doesn't support FPSIMD, we cannot rely on the
> > > +	 * _TIF_FOREIGN_FPSTATE flag. For added safety, make sure we always
> > >  	 * trap the accesses.
> > >  	 */
> > >  	if (!system_supports_fpsimd() ||
> > >  	    vcpu->arch.host_thread_info->flags & _TIF_FOREIGN_FPSTATE)
> > > -		vcpu->arch.flags &= ~(KVM_ARM64_FP_ENABLED |
> > > -				      KVM_ARM64_FP_HOST);
> > > +		vcpu->arch.flags &= ~KVM_ARM64_FP_ENABLED;
> > >  
> > >  	return !!(vcpu->arch.flags & KVM_ARM64_FP_ENABLED);
> > >  }
> > > @@ -245,7 +253,13 @@ static inline bool __hyp_handle_fpsimd(struct kvm_vcpu *vcpu)
> > >  
> > >  	isb();
> > >  
> > > -	if (vcpu->arch.flags & KVM_ARM64_FP_HOST) {
> > > +	/*
> > > +	 * The trap means that the vcpu's FPSIMD state is not loaded. If
> > > +	 * _TIF_FOREIGN_FPSTATE is set, the current state does not need to be
> > > +	 * saved. Otherwise, the user task's state is currently loaded and
> > > +	 * needs to be saved.
> > > +	 */
> > > +	if (!(vcpu->arch.host_thread_info->flags & _TIF_FOREIGN_FPSTATE)) {
> > >  		/*
> > >  		 * In the SVE case, VHE is assumed: it is enforced by
> > >  		 * Kconfig and kvm_arch_init().
> > > @@ -260,8 +274,6 @@ static inline bool __hyp_handle_fpsimd(struct kvm_vcpu *vcpu)
> > >  		} else {
> > >  			__fpsimd_save_state(vcpu->arch.host_fpsimd_state);
> > >  		}
> > > -
> > > -		vcpu->arch.flags &= ~KVM_ARM64_FP_HOST;
> > 
> > Without this, there is no record that we did anything.  Are you sure we
> > can't save junk data into the host context the next time we take this
> > trap?  Will we re-enable the trap unnecessarily?
> 
> The record keeping is a little strange. In this handler, the FP_ENABLED
> flag gets set and then, on exit to the host, kvm_arch_vcpu_ctxsync_fp
> binds the vcpu FPSIMD state.
> 
> I've assumed that the vcpu state will remain enabled and the trap
> disabled until the next exit to host. Maybe that is too brittle an
> assumption to make?
> 
> > I think previously I was trying to avoid things like
> > test_and_clear_thread_flag() here because there were limitations on what
> > we could call from the hyp code.  Maybe that's not such an issue now.
> > 
> > I'm slightly uneasy about setting TIF_FOREIGN_FPSTATE from the hyp
> > switch code however.  The order in which this flag is touched with
> > respect to other metadata is important in preemptible code, so I
> > preferred to avoid bare writing of this thread flag: rather it should
> > be set and cleared through the fpsimd_bind_*() and fpsimd_flush_*()
> > functions if at all possible, but these didn't seem suitable for calling
> > from hyp at the time.
> > 
> > In the hyp code we are of course not preemptible, so we would probably
> > get away with a bare set_thread_flag() in practice.
> 
> I'm not familiar with the different contexts required for these updates
> so I'll have to do a bit more learning to saw anything meaningful on
> this one!
> 
> > Overall, it felt a bit cleaner to have separate metdata for the hyp
> > code, and sync it with the host's metadata in clearly defined places
> > (the functions in arch/arm64/kvm/fpsimd.c).  KVM_ARM64_FP_HOST plays
> > that role for the hyp code at present.
> 
> This makes sense. At the very least I hope to add some explanation to
> aid future readers if there aren't any simplification that can be made.

This is all starting to come back to me, but I'll need to spend a bit of
time going over it.

>From memory there may be some partial redundancy between some of the
flags, but most of it is there for a reason... in some cases just to
keep host context switch and vcpu context handling as separate as
possible.


Note that the kvm_arch_vcpu_ctxsync_fp() thing is there because a
softirq that uses kernel_neon_begin() may trigger a lazy fp save on the
host side once we enable interrupts, even if we don't get preempted.
This means that host needs to be told where to save the context if that
occurs.

To observe this kind of thing, you can try adding some code to the
timer softirq handler that does

	kernel_neon_begin();
	asm volatile ( /* add code here to splat the fp regs */
			::: "memory" );
	kernel_neon_end();

Cheers
---Dave
_______________________________________________
kvmarm mailing list
kvmarm@xxxxxxxxxxxxxxxxxxxxx
https://lists.cs.columbia.edu/mailman/listinfo/kvmarm



[Index of Archives]     [Linux KVM]     [Spice Development]     [Libvirt]     [Libvirt Users]     [Linux USB Devel]     [Linux Audio Users]     [Yosemite News]     [Linux Kernel]     [Linux SCSI]

  Powered by Linux