Hi all! I'm using KVM-62 on a host with Ubuntu Hardy Heron server amd64 installed from Ubuntu repositories with a productive VM running an application server. I am observing in the VM a 'swapper tainted' in some of the processors which cause that the application server is spontaneously restarted. Can this be due to some bug of KVM? The VM is running Debian GNU/Linux Lenny 5.0.2 with kernel 2.6.26-2-686-bigmem with 4 GiB of RAM, 4 CPUs and 'pci=noacpi' kernel option in order to avoid transmit timed out from network interface which turn it inaccessible to the rest of the network. The host machine has 2.6.24-19-server. Extracted data from the VM: /var/log/syslog.1: ================== Jul 23 01:47:40 aps2 kernel: [44260.559685] BUG: soft lockup - CPU#3 stuck for 2857s! [swapper:0] Jul 23 01:47:40 aps2 kernel: [44260.559685] Modules linked in: ipv6 loop parport_pc parport i2c_piix4 button i2c_core psmouse pcspkr serio_raw evdev ext3 jbd mbcache ide_disk 8139too floppy 8139cp mii piix ide_pci_generic ide_core ata_generic libata scsi_mod dock thermal processor fan thermal_sys Jul 23 01:47:40 aps2 kernel: [44260.559685] Jul 23 01:47:40 aps2 kernel: [44260.559685] Pid: 0, comm: swapper Not tainted (2.6.26-2-686-bigmem #1) Jul 23 01:47:40 aps2 kernel: [44260.559685] EIP: 0060:[<c01304e0>] EFLAGS: 00000202 CPU: 3 Jul 23 01:47:40 aps2 kernel: [44260.559685] EIP is at run_timer_softirq+0x10b/0x17c Jul 23 01:47:40 aps2 kernel: [44260.559685] EAX: f748bf28 EBX: f752f6e0 ECX: c02c8b32 EDX: f748bf28 Jul 23 01:47:40 aps2 kernel: [44260.559685] ESI: f0d05b5c EDI: f7482000 EBP: c013067e ESP: f748bf1c Jul 23 01:47:40 aps2 kernel: [44260.559685] DS: 007b ES: 007b FS: 00d8 GS: 0000 SS: 0068 Jul 23 01:47:40 aps2 kernel: [44260.559685] CR0: 8005003b CR2: b7d7d050 CR3: 00381000 CR4: 000006f0 Jul 23 01:47:40 aps2 kernel: [44260.559685] DR0: 00000000 DR1: 00000000 DR2: 00000000 DR3: 00000000 Jul 23 01:47:40 aps2 kernel: [44260.559685] DR6: ffff0ff0 DR7: 00000400 Jul 23 01:47:40 aps2 kernel: [44260.559685] [<c013067e>] ? process_timeout+0x0/0x5 Jul 23 01:47:40 aps2 kernel: [44260.559685] [<c012d11d>] ? __do_softirq+0x66/0xd3 Jul 23 01:47:40 aps2 kernel: [44260.559685] [<c012d1cf>] ? do_softirq+0x45/0x53 Jul 23 01:47:40 aps2 kernel: [44260.559685] [<c012d486>] ? irq_exit+0x35/0x67 Jul 23 01:47:40 aps2 kernel: [44260.559685] [<c01152b2>] ? smp_apic_timer_interrupt+0x6b/0x75 Jul 23 01:47:40 aps2 kernel: [44260.559685] [<c0107656>] ? default_idle+0x0/0x53 Jul 23 01:47:40 aps2 kernel: [44260.559685] [<c0109364>] ? apic_timer_interrupt+0x28/0x30 Jul 23 01:47:40 aps2 kernel: [44260.559685] [<c0107656>] ? default_idle+0x0/0x53 Jul 23 01:47:40 aps2 kernel: [44260.559685] [<c011a124>] ? native_safe_halt+0x2/0x3 Jul 23 01:47:40 aps2 kernel: [44260.559685] [<c0107683>] ? default_idle+0x2d/0x53 Jul 23 01:47:40 aps2 kernel: [44260.559685] [<c01075ce>] ? cpu_idle+0xab/0xcb Jul 23 01:47:40 aps2 kernel: [44260.559685] ======================= Jul 23 01:47:40 aps2 kernel: [44260.559685] BUG: soft lockup - CPU#1 stuck for 2857s! [swapper:0] Jul 23 01:47:40 aps2 kernel: [44260.559685] Modules linked in: ipv6 loop parport_pc parport i2c_piix4 button i2c_core psmouse pcspkr serio_raw evdev ext3 jbd mbcache ide_disk 8139too floppy 8139cp mii piix ide_pci_generic ide_core ata_generic libata scsi_mod dock thermal processor fan thermal_sys Jul 23 01:47:40 aps2 kernel: [44260.559685] Jul 23 01:47:40 aps2 kernel: [44260.559685] Pid: 0, comm: swapper Not tainted (2.6.26-2-686-bigmem #1) Jul 23 01:47:40 aps2 kernel: [44260.559685] EIP: 0060:[<c011a124>] EFLAGS: 00000246 CPU: 1 Jul 23 01:47:40 aps2 kernel: [44260.559685] EIP is at native_safe_halt+0x2/0x3 Jul 23 01:47:40 aps2 kernel: [44260.559685] EAX: f7474000 EBX: c0107656 ECX: 0304e000 EDX: 007def2c Jul 23 01:47:40 aps2 kernel: [44260.559685] ESI: 00000001 EDI: 00000000 EBP: 00000000 ESP: f7475fa8 Jul 23 01:47:40 aps2 kernel: [44260.559685] DS: 007b ES: 007b FS: 00d8 GS: 0000 SS: 0068 Jul 23 01:47:40 aps2 kernel: [44260.559685] CR0: 8005003b CR2: 08186bf1 CR3: 35167000 CR4: 000006f0 Jul 23 01:47:40 aps2 kernel: [44260.559685] DR0: 00000000 DR1: 00000000 DR2: 00000000 DR3: 00000000 Jul 23 01:47:40 aps2 kernel: [44260.559685] DR6: ffff0ff0 DR7: 00000400 Jul 23 01:47:40 aps2 kernel: [44260.559685] [<c0107683>] default_idle+0x2d/0x53 Jul 23 01:47:40 aps2 kernel: [44260.559685] [<c01075ce>] cpu_idle+0xab/0xcb Jul 23 01:47:40 aps2 kernel: [44260.559685] ======================= Jul 23 01:47:40 aps2 kernel: [44260.559685] BUG: soft lockup - CPU#2 stuck for 2857s! [swapper:0] Jul 23 01:47:40 aps2 kernel: [44260.565670] Modules linked in: ipv6 loop parport_pc parport i2c_piix4 button i2c_core psmouse pcspkr serio_raw evdev ext3 jbd mbcache ide_disk 8139too floppy 8139cp mii piix ide_pci_generic ide_core ata_generic libata scsi_mod dock thermal processor fan thermal_sys Jul 23 01:47:40 aps2 kernel: [44260.565670] Jul 23 01:47:40 aps2 kernel: [44260.565670] Pid: 0, comm: swapper Not tainted (2.6.26-2-686-bigmem #1) Jul 23 01:47:40 aps2 kernel: [44260.565670] EIP: 0060:[<c011a124>] EFLAGS: 00000246 CPU: 2 Jul 23 01:47:40 aps2 kernel: [44260.565670] EIP is at native_safe_halt+0x2/0x3 Jul 23 01:47:40 aps2 kernel: [44260.565670] EAX: f747e000 EBX: c0107656 ECX: 03059000 EDX: 007def2c Jul 23 01:47:40 aps2 kernel: [44260.565670] ESI: 00000002 EDI: 00000000 EBP: 00000000 ESP: f747ffa8 Jul 23 01:47:40 aps2 kernel: [44260.565670] DS: 007b ES: 007b FS: 00d8 GS: 0000 SS: 0068 Jul 23 01:47:40 aps2 kernel: [44260.565670] CR0: 8005003b CR2: 09e89be7 CR3: 379bf000 CR4: 000006f0 Jul 23 01:47:40 aps2 kernel: [44260.565670] DR0: 00000000 DR1: 00000000 DR2: 00000000 DR3: 00000000 Jul 23 01:47:40 aps2 kernel: [44260.565670] DR6: ffff0ff0 DR7: 00000400 Jul 23 01:47:40 aps2 kernel: [44260.565670] [<c0107683>] default_idle+0x2d/0x53 Jul 23 01:47:40 aps2 kernel: [44260.565670] [<c01075ce>] cpu_idle+0xab/0xcb Jul 23 01:47:40 aps2 kernel: [44260.565670] ======================= /var/log/messages: ================== Jul 23 01:47:40 aps2 kernel: [44260.559685] Modules linked in: ipv6 loop parport_pc parport i2c_piix4 button i2c_core psmouse pcspkr serio_raw evdev ext3 jbd mbcache ide_disk 8139too floppy 8139cp mii piix ide_pci_generic ide_core ata_generic libata scsi_mod dock thermal processor fan thermal_sys Jul 23 01:47:40 aps2 kernel: [44260.559685] Jul 23 01:47:40 aps2 kernel: [44260.559685] Pid: 0, comm: swapper Not tainted (2.6.26-2-686-bigmem #1) Jul 23 01:47:40 aps2 kernel: [44260.559685] EIP: 0060:[<c01304e0>] EFLAGS: 00000202 CPU: 3 Jul 23 01:47:40 aps2 kernel: [44260.559685] EIP is at run_timer_softirq+0x10b/0x17c Jul 23 01:47:40 aps2 kernel: [44260.559685] EAX: f748bf28 EBX: f752f6e0 ECX: c02c8b32 EDX: f748bf28 Jul 23 01:47:40 aps2 kernel: [44260.559685] ESI: f0d05b5c EDI: f7482000 EBP: c013067e ESP: f748bf1c Jul 23 01:47:40 aps2 kernel: [44260.559685] DS: 007b ES: 007b FS: 00d8 GS: 0000 SS: 0068 Jul 23 01:47:40 aps2 kernel: [44260.559685] CR0: 8005003b CR2: b7d7d050 CR3: 00381000 CR4: 000006f0 Jul 23 01:47:40 aps2 kernel: [44260.559685] DR0: 00000000 DR1: 00000000 DR2: 00000000 DR3: 00000000 Jul 23 01:47:40 aps2 kernel: [44260.559685] DR6: ffff0ff0 DR7: 00000400 Jul 23 01:47:40 aps2 kernel: [44260.559685] [<c013067e>] ? process_timeout+0x0/0x5 Jul 23 01:47:40 aps2 kernel: [44260.559685] [<c012d11d>] ? __do_softirq+0x66/0xd3 Jul 23 01:47:40 aps2 kernel: [44260.559685] [<c012d1cf>] ? do_softirq+0x45/0x53 Jul 23 01:47:40 aps2 kernel: [44260.559685] [<c012d486>] ? irq_exit+0x35/0x67 Jul 23 01:47:40 aps2 kernel: [44260.559685] [<c01152b2>] ? smp_apic_timer_interrupt+0x6b/0x75 Jul 23 01:47:40 aps2 kernel: [44260.559685] [<c0107656>] ? default_idle+0x0/0x53 Jul 23 01:47:40 aps2 kernel: [44260.559685] [<c0109364>] ? apic_timer_interrupt+0x28/0x30 Jul 23 01:47:40 aps2 kernel: [44260.559685] [<c0107656>] ? default_idle+0x0/0x53 Jul 23 01:47:40 aps2 kernel: [44260.559685] [<c011a124>] ? native_safe_halt+0x2/0x3 Jul 23 01:47:40 aps2 kernel: [44260.559685] [<c0107683>] ? default_idle+0x2d/0x53 Jul 23 01:47:40 aps2 kernel: [44260.559685] [<c01075ce>] ? cpu_idle+0xab/0xcb Jul 23 01:47:40 aps2 kernel: [44260.559685] ======================= Jul 23 01:47:40 aps2 kernel: [44260.559685] Modules linked in: ipv6 loop parport_pc parport i2c_piix4 button i2c_core psmouse pcspkr serio_raw evdev ext3 jbd mbcache ide_disk 8139too floppy 8139cp mii piix ide_pci_generic ide_core ata_generic libata scsi_mod dock thermal processor fan thermal_sys Jul 23 01:47:40 aps2 kernel: [44260.559685] Jul 23 01:47:40 aps2 kernel: [44260.559685] Pid: 0, comm: swapper Not tainted (2.6.26-2-686-bigmem #1) Jul 23 01:47:40 aps2 kernel: [44260.559685] EIP: 0060:[<c011a124>] EFLAGS: 00000246 CPU: 1 Jul 23 01:47:40 aps2 kernel: [44260.559685] EIP is at native_safe_halt+0x2/0x3 Jul 23 01:47:40 aps2 kernel: [44260.559685] EAX: f7474000 EBX: c0107656 ECX: 0304e000 EDX: 007def2c Jul 23 01:47:40 aps2 kernel: [44260.559685] ESI: 00000001 EDI: 00000000 EBP: 00000000 ESP: f7475fa8 Jul 23 01:47:40 aps2 kernel: [44260.559685] DS: 007b ES: 007b FS: 00d8 GS: 0000 SS: 0068 Jul 23 01:47:40 aps2 kernel: [44260.559685] CR0: 8005003b CR2: 08186bf1 CR3: 35167000 CR4: 000006f0 Jul 23 01:47:40 aps2 kernel: [44260.559685] DR0: 00000000 DR1: 00000000 DR2: 00000000 DR3: 00000000 Jul 23 01:47:40 aps2 kernel: [44260.559685] DR6: ffff0ff0 DR7: 00000400 Jul 23 01:47:40 aps2 kernel: [44260.559685] [<c0107683>] default_idle+0x2d/0x53 Jul 23 01:47:40 aps2 kernel: [44260.559685] [<c01075ce>] cpu_idle+0xab/0xcb Jul 23 01:47:40 aps2 kernel: [44260.559685] ======================= Jul 23 01:47:40 aps2 kernel: [44260.565670] Modules linked in: ipv6 loop parport_pc parport i2c_piix4 button i2c_core psmouse pcspkr serio_raw evdev ext3 jbd mbcache ide_disk 8139too floppy 8139cp mii piix ide_pci_generic ide_core ata_generic libata scsi_mod dock thermal processor fan thermal_sys Jul 23 01:47:40 aps2 kernel: [44260.565670] Jul 23 01:47:40 aps2 kernel: [44260.565670] Pid: 0, comm: swapper Not tainted (2.6.26-2-686-bigmem #1) Jul 23 01:47:40 aps2 kernel: [44260.565670] EIP: 0060:[<c011a124>] EFLAGS: 00000246 CPU: 2 Jul 23 01:47:40 aps2 kernel: [44260.565670] EIP is at native_safe_halt+0x2/0x3 Jul 23 01:47:40 aps2 kernel: [44260.565670] EAX: f747e000 EBX: c0107656 ECX: 03059000 EDX: 007def2c Jul 23 01:47:40 aps2 kernel: [44260.565670] ESI: 00000002 EDI: 00000000 EBP: 00000000 ESP: f747ffa8 Jul 23 01:47:40 aps2 kernel: [44260.565670] DS: 007b ES: 007b FS: 00d8 GS: 0000 SS: 0068 Jul 23 01:47:40 aps2 kernel: [44260.565670] CR0: 8005003b CR2: 09e89be7 CR3: 379bf000 CR4: 000006f0 Jul 23 01:47:40 aps2 kernel: [44260.565670] DR0: 00000000 DR1: 00000000 DR2: 00000000 DR3: 00000000 Jul 23 01:47:40 aps2 kernel: [44260.565670] DR6: ffff0ff0 DR7: 00000400 Jul 23 01:47:40 aps2 kernel: [44260.565670] [<c0107683>] default_idle+0x2d/0x53 Jul 23 01:47:40 aps2 kernel: [44260.565670] [<c01075ce>] cpu_idle+0xab/0xcb Jul 23 01:47:40 aps2 kernel: [44260.565670] ======================= Application server log: ======================= INFO | jvm 1 | 2009/07/23 00:56:31 | : 899860K->64068K(943744K), 0.0864870 secs] 2231330K->1402078K(2516608K), 0.0868390 secs] INFO | jvm 1 | 2009/07/23 00:56:32 | java.lang.NullPointerException INFO | jvm 1 | 2009/07/23 00:56:32 | java.lang.NullPointerException INFO | jvm 1 | 2009/07/23 00:56:32 | java.lang.NullPointerException ERROR | wrapper | 2009/07/23 01:47:40 | JVM appears hung: Timed out waiting for signal from JVM. ERROR | wrapper | 2009/07/23 01:47:40 | JVM did not exit on request, terminated INFO | wrapper | 2009/07/23 01:47:40 | JVM exited on its own while waiting to kill the application. STATUS | wrapper | 2009/07/23 01:47:40 | JVM exited in response to signal SIGKILL (9). STATUS | wrapper | 2009/07/23 01:47:44 | Launching a JVM... INFO | jvm 2 | 2009/07/23 01:47:46 | Wrapper (Version 3.2.3) http://wrapper.tanukisoftware.org INFO | jvm 2 | 2009/07/23 01:47:46 | Copyright 1999-2006 Tanuki Software, Inc. All Rights Reserved. INFO | jvm 2 | 2009/07/23 01:47:46 | INFO | jvm 2 | 2009/07/23 01:47:47 | Enter 's' to shutdown, 'r' to restart... KVM Parameters on host machine: =============================== kvm -hda /dev/vm/aps2-raiz -hdb /dev/vm/aps2-space -hdc \ /dev/vm/aps2-index -hdd /dev/vm/aps2-cache -m 4096 -smp 4 -net \ nic,vlan=0,macaddr=00:16:3E:00:00:27 -net tap -daemonize -vnc :5 \ -k es -localtime -monitor telnet:localhost:4005,server,nowait \ -serial telnet:localhost:4045,server,nowait Thanks in advance for your reply: Regards, Daniel -- Fingerprint: BFB3 08D6 B4D1 31B2 72B9 29CE 6696 BF1B 14E6 1D37 Powered by Debian GNU/Linux Squeeze - Linux user #188.598
Attachment:
signature.asc
Description: Digital signature