On Fri, 16 Sep 2005 14:43:42 -0700
Tom McNeal <tmcneal@xxxxxxxxxx> wrote:

> Hi -
>
> When running the stress tests, after a few hours, a panic occurs
> due to a kernel page fault for address 0x0 while executing one
> of the brctl commands. We don't know which one, yet. Has anyone
> run across this?
>
> do_cpu invoked from kernel context! in traps.c:do_cpu, line 787:
> ...snip...
> Process brctl (pid: 23999, stackpage=813d8000)
> ...snip...
> note: brctl[23999] exited with preempt_count 2
> Unable to handle kernel paging request at virtual address 00000000,
> epc == 801131e8, ra == 8011c5d8
> Oops in fault.c:do_page_fault, line 213:
> ...etc....
>
> This is basically in the 2.4.17 kernel, with some of the security
> fixes.
>
> Tom
>

Some basics:
* 2.4.17 is pretty old. Can you at least try 2.4.30 or later? Better yet, 2.6.
* Why are you running bridge commands during the stress test? Bridge
  configuration is pretty much a set-it-up-and-forget-it thing.
* Are you using SMP? Locking in the bridge code for 2.4 is pretty weak and
  there are probably holes. I ended up reworking the whole locking model of
  the bridge code for 2.6 for speed and correctness.