Hi Heiher,

On Tue, Apr 03, 2018 at 07:41:02PM +0800, r@xxxxxx wrote:
> From: Heiher <r@xxxxxx>
> 
> On SMP systems, the shared ejtag debug buffer may be overwritten by
> other cores, because every cores can generate ejtag exception at
> same time.
> 
> Unfortunately, in that context, it's difficult to relax more registers
> to access per cpu buffers. so use ll/sc to serialize the access.
> 
> Signed-off-by: Heiher <r@xxxxxx>
> ---
>  arch/mips/kernel/genex.S | 18 ++++++++++++++++++
>  1 file changed, 18 insertions(+)
> 
> diff --git a/arch/mips/kernel/genex.S b/arch/mips/kernel/genex.S
> index 37b9383eacd3..1af8c83835ef 100644
> --- a/arch/mips/kernel/genex.S
> +++ b/arch/mips/kernel/genex.S
> @@ -354,6 +354,16 @@ NESTED(ejtag_debug_handler, PT_SIZE, sp)
>  	sll	k0, k0, 30	# Check for SDBBP.
>  	bgez	k0, ejtag_return
> 
> +#ifdef CONFIG_SMP
> +	PTR_LA	k0, ejtag_debug_buffer
> +1:	ll	k0, LONGSIZE(k0)
> +	bnez	k0, 1b
> +	PTR_LA	k0, ejtag_debug_buffer
> +	sc	k0, LONGSIZE(k0)
> +	beqz	k0, 1b

If this branch is taken then we're in trouble: k0 is now 0x0, so the ll
will try to load from address 0+LONGSIZE. So we'll take a TLBL exception
& panic.

The easiest solution would be to move the 1 label a line earlier, to the
PTR_LA, so that we repeat the PTR_LA.

I like the clever trick that the data returned by the ll should be either
0 or the address of ejtag_debug_buffer, which means the bnez a little
earlier doesn't have this problem, and I see that trivially moving the 1
label would make that case redundantly repeat the PTR_LA too. In practice
though this path should rarely be executed, so the extra couple of
instructions ought not to matter.

I think if we're being really correct this should take R10000_LLSC_WAR
into account too, and use beqzl when that's set (see asm/cmpxchg.h).

> +	sync

This is being used as the equivalent of an smp_llsc_mb(), so it would be
good to add a comment to that effect & clarify that we need an ordering
barrier to ensure that writes to ejtag_debug_buffer can't overtake the
acquisition of this lock.

The sync could also be omitted if either CONFIG_WEAK_REORDERING_BEYOND_LLSC
or CONFIG_SMP isn't enabled, but that might just clutter up this rarely
executed code for little gain, so I won't insist.

> +#endif
> +
>  	PTR_LA	k0, ejtag_debug_buffer
>  	LONG_S	k1, 0(k0)
>  	SAVE_ALL
> @@ -363,7 +373,12 @@ NESTED(ejtag_debug_handler, PT_SIZE, sp)
>  	PTR_LA	k0, ejtag_debug_buffer
>  	LONG_L	k1, 0(k0)
> 
> +#ifdef CONFIG_SMP
> +	sw	zero, LONGSIZE(k0)
> +#endif
> +
>  ejtag_return:
> +	back_to_back_c0_hazard
>  	MFC0	k0, CP0_DESAVE
>  	.set	mips32
>  	deret
> @@ -377,6 +392,9 @@ ejtag_return:
>  	.data
>  EXPORT(ejtag_debug_buffer)
>  	.fill	LONGSIZE
> +#ifdef CONFIG_SMP

I think it would make sense to EXPORT(ejtag_debug_buffer_spinlock) or
something to that effect here, and use that symbol when accessing the
lock above. That would make it easier to see at a glance that this is an
open-coded spinlock.
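
Putting those comments together, and using the ejtag_debug_buffer_spinlock
name suggested above, I'd expect the acquire side to end up looking roughly
like the below. This is only an untested sketch to show the shape of it,
not something I've run:

#ifdef CONFIG_SMP
	/* Open-coded spinlock serialising use of ejtag_debug_buffer */
1:	PTR_LA	k0, ejtag_debug_buffer_spinlock
	ll	k0, 0(k0)
	bnez	k0, 1b			# lock held elsewhere; spin
	PTR_LA	k0, ejtag_debug_buffer_spinlock
	sc	k0, 0(k0)		# k0 is the lock address, ie. non-zero
	beqz	k0, 1b			# sc failed; retry (beqzl if R10000_LLSC_WAR)
	sync				# as smp_llsc_mb(): keep the LONG_S below from
					# overtaking acquisition of the lock
#endif

With the 1 label on the PTR_LA, a failed sc just recomputes the address
rather than dereferencing zero, and the unlock path becomes a sw zero to
ejtag_debug_buffer_spinlock, with a matching EXPORT(ejtag_debug_buffer_spinlock)
and .fill LONGSIZE pair added in the .data section.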

Having said all of that, I'm a little concerned about the approach in
general because it relies on CPUs always returning from
ejtag_exception_handler() in order to release the lock. If a CPU doesn't
return from ejtag_exception_handler() then any other CPUs that take an
EJTAG exception will hang silently. In practice ejtag_exception_handler()
is currently quite simple, so we'll probably always return; I'd consider
this patch an improvement on what we have now once the comments above are
addressed.

One obvious alternative would be to make ejtag_debug_buffer per-CPU, or
perhaps more easily to make it an array of NR_CPUS length. It would be
tricky to access though until MIPSr6, where AUI would help, or nanoMIPS,
where an ADDIU[48] could encode the whole address of ejtag_debug_buffer
to add to the CPU offset. Another option would be to use a KScratch
register where they're present, but pre-r6 we usually can't guarantee
they exist, so that would complicate things again.

Thanks,
    Paul

> +	.fill	LONGSIZE
> +#endif
>  	.previous
> 
>  __INIT
> -- 
> 2.16.3
> 
> 