On Thu, Oct 05, 2006 at 09:49:34PM -0700, Randy Dunlap wrote: > On Thu, 5 Oct 2006 18:45:55 -0700 Bryce Harrington wrote: > > > On recent -git kernels, I'm getting an Oops when running the > > lhcs_regression test suite. I can replicate the issue with the > > following steps: > 'M' says that the machine has experienced an abnormal "machine check", > like a processor or memory fault. > > Do you have a log that shows that? > > The message log (URL) below isn't for this same kernel oops > (maybe same oops, but it's 2.6.19-rc1 kernel). > Bug reports should use same info. Sorry, you're correct. Here is a URL for the 2.6.18-git22 kernel: http://crucible.osdl.org/runs/2397/sysinfo/ > > RIP: 0010:[<ffffffff80255287>] [<ffffffff80255287>] __drain_pages+0x29/0x5f > > RSP: 0018:ffff81003f1b3dd8 EFLAGS: 00010082 > > RAX: 0000000000000001 RBX: 0000000000000082 RCX: 0000000000000000 > > RDX: ffff81000000c580 RSI: 00000000fffffffe RDI: ffff81000000c000 > > RBP: 0000000000000000 R08: 00000000fffffffe R09: ffff81007f1e63f0 > > R10: 0000000000000000 R11: 0000000000000000 R12: ffff81000000c580 > > R13: 0000000000000001 R14: 0000000000000001 R15: ffff81003f1b3f50 > > FS: 00002ab3e8a136d0(0000) GS:ffffffff806e3000(0000) knlGS:0000000000000000 > > CS: 0010 DS: 0000 ES: 0000 CR0: 000000008005003b > > CR2: 0000000000000000 CR3: 000000007eea1000 CR4: 00000000000006e0 > > Process bash (pid: 7203, threadinfo ffff81003f1b2000, task ffff81003f1c07c0) > > Stack: 0000000000000001 0000000000000001 0000000000000007 0000000000000007 > > 0000000000000003 ffffffff802564b6 ffffffff8060d320 ffffffff805644a7 > > ffffffff8060dbe0 0000000000000001 0000000000000001 ffffffff8023c49d > > Call Trace: > > [<ffffffff802564b6>] page_alloc_cpu_notify+0x12/0x28 > > [<ffffffff805644a7>] notifier_call_chain+0x23/0x32 > > [<ffffffff8023c49d>] blocking_notifier_call_chain+0x22/0x36 > > [<ffffffff80248ff9>] _cpu_down+0x17f/0x23d > > [<ffffffff802490de>] cpu_down+0x27/0x3c > > [<ffffffff804419c7>] store_online+0x0/0x6b > > [<ffffffff804419ec>] store_online+0x25/0x6b > > [<ffffffff802ac4b7>] sysfs_write_file+0xad/0xd7 > > [<ffffffff80273ff7>] vfs_write+0xaf/0x14e > > [<ffffffff80274149>] sys_write+0x45/0x6e > > [<ffffffff8020965e>] system_call+0x7e/0x83 > > > > > > Code: 8b 75 00 48 8d 55 10 31 c9 4c 89 e7 e8 32 fa ff ff c7 45 00 > > RIP [<ffffffff80255287>] __drain_pages+0x29/0x5f > > RSP <ffff81003f1b3dd8> > > CR2: 0000000000000000 > > > > > > I am also seeing this issue on linux-2.6.19-rc1. > > > > The test output that identified the presence of the bug: > > http://crucible.osdl.org/runs/2416/test_output/lhcs_regression.log > > > > Some info about the amd64 system is available here: > > http://crucible.osdl.org/runs/2416/sysinfo/amd01.console > > http://crucible.osdl.org/runs/2416/sysinfo/amd01.messages > > http://crucible.osdl.org/runs/2416/sysinfo/amd01.1/proc/ > > http://crucible.osdl.org/runs/2416/sysinfo/amd01.1/INFO/ > > http://crucible.osdl.org/runs/2416/sysinfo/amd01.1/etc/ > > > > I do not see this issue on the x86, ia64, or x86_64 Xeon systems. > > I can provide additional detail about these systems if needed. > > > --- > ~Randy