Re: [QUESTION]stuck in SeaBIOS because of losing a SMI

Paolo Bonzini <pbonzini@xxxxxxxxxx> · Wed, 3 Aug 2016 12:32:56 +0200

On 03/08/2016 11:43, Xulei (Stone) wrote:
> Hi, all:
> Recently I use a shell script to continuously reset a vm to see what may happen.
> After one day, the vm is stuck. Looking from the following seabios log and
> kvm trace log, it seems like losing a SMI or SeaBIOS can not handle a SMI.
> This problem is reproducible on my machine (SeaBIOS 1.9.1, Qemu 2.6.0,
> Kmod 4.4.11).

Thanks for the report!

The more interesting part of the trace would be around the pio_write at
0xb2.  Also please try kernel commit c43203cab1e, which just went into 4.8.

Paolo

> ==================my shell script===
> #!/bin/bash
> while((1))
> do
> virsh reset VMNAME
> sleep 1
> done
> 
> ==================kvm trace log===
> CPU 0/KVM-13843 [020] d... 1025056.813494: kvm_entry: vcpu 0
> CPU 0/KVM-13843 [020] .... 1025056.813495: kvm_exit: reason IO_INSTRUCTION rip 0xef10e info b30048 0
> CPU 0/KVM-13843 [020] .... 1025056.813495: kvm_emulate_insn: 0:ef10e:e4 b3 (prot32)
> CPU 0/KVM-13843 [020] .... 1025056.813496: kvm_userspace_exit: reason KVM_EXIT_IO (2)
> CPU 0/KVM-13843 [020] .... 1025056.813496: kvm_fpu: unload
> CPU 0/KVM-13843 [020] .... 1025056.813497: kvm_pio: pio_read at 0xb3 size 1 count 1 val 0x1
> CPU 0/KVM-13843 [020] .... 1025056.813497: kvm_fpu: load
> CPU 0/KVM-13843 [020] d... 1025056.813497: kvm_entry: vcpu 0
> CPU 0/KVM-13843 [020] .... 1025056.813498: kvm_exit: reason IO_INSTRUCTION rip 0xef10e info b30048 0
> CPU 0/KVM-13843 [020] .... 1025056.813498: kvm_emulate_insn: 0:ef10e:e4 b3 (prot32)
> CPU 0/KVM-13843 [020] .... 1025056.813499: kvm_userspace_exit: reason KVM_EXIT_IO (2)
> CPU 0/KVM-13843 [020] .... 1025056.813499: kvm_fpu: unload
> CPU 0/KVM-13843 [020] .... 1025056.813500: kvm_pio: pio_read at 0xb3 size 1 count 1 val 0x1
> CPU 0/KVM-13843 [020] .... 1025056.813500: kvm_fpu: load
> 
> ==================my seabios log===
> --- a/roms/seabios/src/fw/smm.c
> +++ b/roms/seabios/src/fw/smm.c
> @@ -65,7 +65,8 @@ handle_smi(u16 cs)
>      u8 cmd = inb(PORT_SMI_CMD);
>      struct smm_layout *smm = MAKE_FLATPTR(cs, 0);
>      u32 rev = smm->cpu.i32.smm_rev & SMM_REV_MASK;
> -    dprintf(DEBUG_HDL_smi, "handle_smi cmd=%x smbase=%p\n", cmd, smm);
> +    if(cmd == 0x00) {
> +    	dprintf(1, "handle_smi cmd=%x smbase=%p\n", cmd, smm);
> +    }
>  
>      if (smm == (void*)BUILD_SMM_INIT_ADDR) {
>          // relocate SMBASE to 0xa0000
> @@ -147,14 +148,14 @@ smm_relocate_and_restore(void)
>  {
>      /* init APM status port */
>      outb(0x01, PORT_SMI_STATUS);
> +   dprintf(1,"before SMI====\n");
> 
>      /* raise an SMI interrupt */
>      outb(0x00, PORT_SMI_CMD);
> +    dprintf(1,"after SMI=====\n");
> 
>      /* wait until SMM code executed */
>      while (inb(PORT_SMI_STATUS) != 0x00)
>          ;
> +   dprintf(1,"smm code executes complete====\n");
> 
> And the log output like this:
> 2016-08-03 16:23:15PCI: Using 00:02.0 for primary VGA
> 2016-08-03 16:23:15smm_device_setup start
> 2016-08-03 16:23:15init smm
> 2016-08-03 16:23:15before SMI====
> 2016-08-03 16:23:15after SMI=====     <----always stuck here, unless i destroy it
> 
> As above, it is obviously that if bios doesn't handle_smi, PORT_SMI_STATUS is
> always 0x01. smm_relocate_and_restore() will always in the while loop.
> 
> Why does bios handle_smi at this point, is this a kvm bug? or SeaBIOS bug?
> 
> --------------
> Xulei (Stone)
> 
--
To unsubscribe from this list: send the line "unsubscribe kvm" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at  http://vger.kernel.org/majordomo-info.html