RE: Kernel Oops gfs:gfs_glock_dq

Hi,

Have you looked at the gfs_tool man page? I suspect you need to increase the number of glockd processes; the current number does not seem to be enough for your system.

. . .
. . .
# gfs_tool margs "oopses_ok,num_glockd=X"
# mount
. . .

X must be an integer from 1 to 32.
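
To avoid passing an out-of-range value, the argument string could be built with a small check first. This is only a sketch: the function name glockd_margs is made up for illustration, and the 1 to 32 range is simply the one stated above, not something verified against the gfs_tool source.

```shell
#!/bin/sh
# Hypothetical helper (not part of gfs_tool): prints the margs string
# for a given glockd count, or fails if the count is not in 1..32.
glockd_margs() {
    n=$1
    # Reject empty input and anything that is not purely digits.
    case $n in
        ''|*[!0-9]*)
            echo "num_glockd must be an integer" >&2
            return 1
            ;;
    esac
    # Range assumed from the message above.
    if [ "$n" -ge 1 ] 2>/dev/null && [ "$n" -le 32 ] 2>/dev/null; then
        printf 'oopses_ok,num_glockd=%s\n' "$n"
    else
        echo "num_glockd must be between 1 and 32" >&2
        return 1
    fi
}
```

The printed string would then be handed to gfs_tool margs before mounting, as in the commands above.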

Mehmet CELIK
Istanbul/TURKEY

From: Reiner Rottmann <rottmann@xxxxxxx>
Reply-To: linux clustering <linux-cluster@xxxxxxxxxx>
To: linux-cluster@xxxxxxxxxx
Subject:  Kernel Oops gfs:gfs_glock_dq
Date: Wed, 22 Aug 2007 11:57:01 +0200

Hi,

we experienced a kernel oops in gfs:gfs_glock_dq for the first time and we
would like to know if there are others who are affected by this issue.

Although the node was successfully fenced by the cluster and could rejoin
without any troubles we are curious about what happened. There are no
suspicious log entries preceding the oops messages.

Is this perhaps a known bug and maybe already solved?


# cat /var/log/messages
...
Aug 17 17:45:00 node01.mgmt Unable to handle kernel NULL pointer dereference
at 0000000000000000 RIP:
Aug 17 17:45:00 node01.mgmt  <ffffffffa0179aef>{:gfs:gfs_glock_dq+191}
Aug 17 17:45:00 node01.mgmt  PML4 9363d067 PGD d5119067 PMD 0
Aug 17 17:45:00 node01.mgmt  Oops: 0000 [1] SMP
Aug 17 17:45:00 node01.mgmt  CPU 3
Aug 17 17:45:00 node01.mgmt  Modules linked in: parport_pc lp parport
netconsole netdump autofs4 i2c_dev i2c_core sunrpc ds yenta_socket
pcmcia_core ext3 jbd button battery ac ohci_hcd hw_random scsi_transport_fc
md5 ipv6 lock_dlm(U) dlm(U) gfs(U) lock_harness(U) cman(U) 8021q bonding(U)
qla2300(U) qla2xxx_conf(U) qla2xxx(U) sd_mod scsi_mod dm_snapshot dm_mirror
dm_mod tg3
Aug 17 17:45:00 node01.mgmt  Pid: 13218, comm: site_search_dev Not tainted
2.6.9-42.0.3.ELsmp
Aug 17 17:45:00 node01.mgmt  RIP: 0010:[<ffffffffa0179aef>]
<ffffffffa0179aef>{:gfs:gfs_glock_dq+191}
Aug 17 17:45:00 node01.mgmt  RSP: 0018:0000010042b33ce8  EFLAGS: 00010203
Aug 17 17:45:00 node01.mgmt RAX: 000001009cc5c368 RBX: 0000000000000000 RCX:
00000100af058d50
Aug 17 17:45:00 node01.mgmt RDX: 0000000000000000 RSI: 0000000000000000 RDI:
000001009cc5c35c
Aug 17 17:45:00 node01.mgmt RBP: 000001009cc5c330 R08: 0000000000000000 R09:
0000000000000000
Aug 17 17:45:00 node01.mgmt R10: 0000000000000000 R11: 00000100af058ce0 R12:
000001009cc5c35c
Aug 17 17:45:01 node01.mgmt R13: 0000010008fcc5e8 R14: ffffff00005a3000 R15:
00000100234079a8
Aug 17 17:45:01 node01.mgmt FS: 0000002a95561a20(0000) GS:ffffffff804e5300
(0000) knlGS:0000000000000000
Aug 17 17:45:01 node01.mgmt CS: 0010 DS: 0000 ES: 0000 CR0: 000000008005003b
Aug 17 17:45:01 node01.mgmt CR2: 0000000000000000 CR3: 0000000001050000 CR4:
00000000000006e0
Aug 17 17:45:01 node01.mgmt Process site_search_dev (pid: 13218, threadinfo
0000010042b32000, task 000001004886d7f0)
Aug 17 17:45:01 node01.mgmt  Stack: 0000010042b33cf8 00012c5400000000
ffffffffa01b99c0 0000010008fcc5e8
Aug 17 17:45:01 node01.mgmt         0000010008fcc5c0 0000010008fcc5e8
00000100dd2625e8 0000000000000000
Aug 17 17:45:01 node01.mgmt         0000000000000000 ffffffffa0179c08
Aug 17 17:45:01 node01.mgmt  Call
Trace:<ffffffffa0179c08>{:gfs:gfs_glock_dq_uninit+9}
<ffffffffa018ef63>{:gfs:gfs_flock+222}
Aug 17 17:45:01 node01.mgmt
<ffffffffa01e1781>{:lock_dlm:get_resource+84}
<ffffffffa01e23d7>{:lock_dlm:lm_dlm_punlock+642}
Aug 17 17:45:01 node01.mgmt <ffffffff8018e5e4>{locks_remove_flock+97}
<ffffffff8017a21d>{__fput+73}
Aug 17 17:45:01 node01.mgmt         <ffffffff80178e48>{filp_close+103}
<ffffffff80178ed1>{sys_close+130}
Aug 17 17:45:01 node01.mgmt         <ffffffff8011026a>{system_call+126}
Aug 17 17:45:01 node01.mgmt
Aug 17 17:45:01 node01.mgmt Code: 48 8b 02 0f 18 08 48 8d 45 38 48 39 c2 74
0c ff 44 24 0c 49
Aug 17 17:45:01 node01.mgmt RIP <ffffffffa0179aef>{:gfs:gfs_glock_dq+191} RSP
<0000010042b33ce8>
Aug 17 17:45:01 node01.mgmt  CR2: 0000000000000000
...

$ uname -a
Linux node01 2.6.9-42.0.3.ELsmp #1 SMP Fri Oct 6 06:28:26 CDT 2006 x86_64
x86_64 x86_64 GNU/Linux

$ rpm -qa
...
hp_qla2x00src-8.01.06-7
kernel-smp-2.6.9-42.0.3.EL
kernel-smp-devel-2.6.9-42.0.3.EL
ccs-1.0.7-0
cman-1.0.11-0
cman-kernel-smp-2.6.9-45.8
dlm-1.0.1-1
dlm-kernel-smp-2.6.9-44.3
GFS-6.1.6-1
GFS-kernel-smp-2.6.9-60.3
magma-1.0.6-0
magma-plugins-1.0.9-0
...


--
Gruss / Regards,

Dipl.-Ing. (FH) Reiner Rottmann

Phone: +49-89 452 3538-12

http://www.atix.de/
http://open-sharedroot.org/

PGP Key ID: 0xCA67C5A6
PGP Key Fingerprint = BF59FF006360B6E8D48F26B10D9F5A84CA67C5A6

**
ATIX Informationstechnologie und Consulting AG
Einsteinstr. 10
85716 Unterschleissheim
Deutschland/Germany

Phone: +49-89 452 3538-0
Fax:   +49-89 990 1766-0

Registergericht: Amtsgericht Muenchen
Registernummer: HRB 168930
USt.-Id.: DE209485962

Vorstand:
Marc Grimme, Mark Hlawatschek, Thomas Merz (Vors.)

Vorsitzender des Aufsichtsrats:
Dr. Martin Buss


--
Linux-cluster mailing list
Linux-cluster@xxxxxxxxxx
https://www.redhat.com/mailman/listinfo/linux-cluster
