> Which branch did you pull this from? CVS HEAD is currently highly unstable. > FC4 is recommended. > > It looks like the SM_RETRY is still in sm_barrier.c:add_barrier_callback - > that should have come out on most branches by now. I tested both cvs head and cvs RHEL4, i do not know anythink about FC4. Why so many branches??? Which of them is definitively more stable? Howewer *IT SEEMS* recompiling 2.6.11 without PREEMPTIBLE KERNEL avoids the random oops! Actually my gfs is happy :) this could be a good starting point, due to different messages in the mailing list referencing the same problem. Howewer some opss continues to present when i reboot the servers without umount the filesystem cleanly, But i suppose that this could be while iscsi is died before the system umounts local filesystem during shutdown procedure. May 2 23:04:21 simlistener kernel: Unable to handle kernel NULL pointer dereference at virtual address 00000024 May 2 23:04:21 simlistener kernel: printing eip: May 2 23:04:21 simlistener kernel: dedb978a May 2 23:04:21 simlistener kernel: *pde = 00000000 May 2 23:04:21 simlistener kernel: Oops: 0000 [#1] May 2 23:04:21 simlistener kernel: PREEMPT May 2 23:04:21 simlistener kernel: Modules linked in: gfs lock_dlm dlm lock_harness cman iscsi_tcp iscsi_if May 2 23:04:21 simlistener kernel: CPU: 0 May 2 23:04:21 simlistener kernel: EIP: 0060:[<dedb978a>] Not tainted VLI May 2 23:04:21 simlistener kernel: EFLAGS: 00010212 (2.6.11.7) May 2 23:04:21 simlistener kernel: EIP is at gfs_ail_start_trans+0x4a/0x1f0 [gfs] May 2 23:04:21 simlistener kernel: eax: 00000000 ebx: da609ec0 ecx: da609e80 edx: ded5e420 May 2 23:04:21 simlistener kernel: esi: dacd66d0 edi: da609ebc ebp: da609000 esp: da609e48 May 2 23:04:21 simlistener kernel: ds: 007b es: 007b ss: 0068 May 2 23:04:21 simlistener kernel: Process umount (pid: 3690, threadinfo=da609000 task=dd5a1a80) May 2 23:04:21 simlistener kernel: Stack: da609ec0 da609e60 00000000 da609000 da609ec0 00000000 00000000 da609e60 May 2 23:04:21 simlistener kernel: ded5c000 ded705c0 da609e60 dedd19e5 ded5c000 da609e60 00000000 ded705ac May 2 23:04:21 simlistener kernel: ded5c000 da609000 da609ec0 da1fd100 dedbb041 ded5c000 00000400 ded5c000 May 2 23:04:21 simlistener kernel: Call Trace: May 2 23:04:21 simlistener kernel: [<dedd19e5>] gfs_ail_start+0x75/0xc0 [gfs] May 2 23:04:21 simlistener kernel: [<dedbb041>] gfs_sync_meta+0x31/0x60 [gfs] May 2 23:04:21 simlistener kernel: [<dedeb6c3>] gfs_make_fs_ro+0x53/0xb0 [gfs] May 2 23:04:21 simlistener kernel: [<dede12bd>] gfs_put_super+0x2bd/0x300 [gfs] May 2 23:04:21 simlistener kernel: [<c015bad7>] generic_shutdown_super+0x127/0x140 May 2 23:04:21 simlistener kernel: [<dedde4a2>] gfs_kill_sb+0x32/0x6e [gfs] May 2 23:04:21 simlistener kernel: [<c015b88e>] deactivate_super+0x6e/0xa0 May 2 23:04:21 simlistener kernel: [<c017313f>] sys_umount+0x3f/0xa0 May 2 23:04:21 simlistener kernel: [<c014882d>] do_munmap+0x13d/0x180 May 2 23:04:21 simlistener kernel: [<c01488c0>] sys_munmap+0x50/0x80 May 2 23:04:21 simlistener kernel: [<c01731b5>] sys_oldumount+0x15/0x20 May 2 23:04:21 simlistener kernel: [<c010273f>] syscall_call+0x7/0xb May 2 23:04:21 simlistener kernel: Code: 00 31 c0 89 44 24 14 b8 01 00 00 00 e8 50 99 35 e1 8b 5f 04 39 fb 8b 73 04 74 2b$ May 2 23:04:21 simlistener kernel: <6>note: umount[3690] exited with preempt_count 1 May 2 23:04:21 simlistener kernel: scheduling while atomic: umount/0x10000001/3690 May 2 23:04:21 simlistener kernel: [<c03a66f2>] schedule+0x522/0x530 May 2 23:04:21 simlistener kernel: [<c0143c0e>] unmap_page_range+0x7e/0xa0 May 2 23:04:21 simlistener kernel: [<c0143de0>] unmap_vmas+0x1b0/0x210 May 2 23:04:21 simlistener kernel: [<c0148bec>] exit_mmap+0x7c/0x170 May 2 23:04:21 simlistener kernel: [<c0114677>] mmput+0x37/0xb0 May 2 23:04:21 simlistener kernel: [<c011938b>] do_exit+0x9b/0x3c0 May 2 23:04:21 simlistener kernel: [<c010392b>] die+0x18b/0x190 May 2 23:04:21 simlistener kernel: [<c0116e27>] printk+0x17/0x20 May 2 23:04:21 simlistener kernel: [<c01113da>] do_page_fault+0x2da/0x5d5 May 2 23:04:21 simlistener kernel: [<c03a7261>] __wait_on_bit+0x51/0x70 May 2 23:04:21 simlistener kernel: [<c012c050>] wake_bit_function+0x0/0x60 May 2 23:04:21 simlistener kernel: [<dedd2938>] log_free_buf+0x58/0x60 [gfs] May 2 23:04:21 simlistener kernel: [<c03a68b9>] wait_for_completion+0xc9/0xf0 May 2 23:04:21 simlistener kernel: [<c0111100>] do_page_fault+0x0/0x5d5 May 2 23:04:21 simlistener kernel: [<c0103177>] error_code+0x2b/0x30 May 2 23:04:21 simlistener kernel: [<dedb978a>] gfs_ail_start_trans+0x4a/0x1f0 [gfs] May 2 23:04:21 simlistener kernel: [<dedd19e5>] gfs_ail_start+0x75/0xc0 [gfs] May 2 23:04:21 simlistener kernel: [<dedbb041>] gfs_sync_meta+0x31/0x60 [gfs] May 2 23:04:21 simlistener kernel: [<dedeb6c3>] gfs_make_fs_ro+0x53/0xb0 [gfs] May 2 23:04:21 simlistener kernel: [<dede12bd>] gfs_put_super+0x2bd/0x300 [gfs] May 2 23:04:21 simlistener kernel: [<c015bad7>] generic_shutdown_super+0x127/0x140 May 2 23:04:21 simlistener kernel: [<dedde4a2>] gfs_kill_sb+0x32/0x6e [gfs] May 2 23:04:21 simlistener kernel: [<c015b88e>] deactivate_super+0x6e/0xa0 May 2 23:04:21 simlistener kernel: [<c017313f>] sys_umount+0x3f/0xa0 May 2 23:04:21 simlistener kernel: [<c014882d>] do_munmap+0x13d/0x180 May 2 23:04:21 simlistener kernel: [<c01488c0>] sys_munmap+0x50/0x80 May 2 23:04:21 simlistener kernel: [<c01731b5>] sys_oldumount+0x15/0x20 May 2 23:04:21 simlistener kernel: [<c010273f>] syscall_call+0x7/0xb Best Regards -- Dott. Ranaldo Nicola Sistemi di Elaborazione C.S.I. (Centro di Servizi Informativi di Ateneo) Sede di Monte Sant'Angelo Via Cinthia n.4 80126 Napoli Tel. 081/676638 Fax. 081/676628 Email: ranaldo@xxxxxxxx -- Linux-cluster@xxxxxxxxxx http://www.redhat.com/mailman/listinfo/linux-cluster