We just had a GFS client node crash (and it took one of my compute clusters with it, but I can deal with that) with the following message in /var/log/messages:
Sep 20 13:50:12 front-0 kernel:
Sep 20 13:50:12 front-0 kernel: GFS: Assertion failed on line 200 of file trans.c
Sep 20 13:50:12 front-0 kernel: GFS: assertion: "!gfs_get_transaction(sdp)"
Sep 20 13:50:12 front-0 kernel: GFS: time = 1127238612
Sep 20 13:50:12 front-0 kernel: GFS: fsid=hopkins:bst.2
Sep 20 13:50:12 front-0 kernel:
Sep 20 13:50:12 front-0 kernel: Kernel panic: GFS: Record message above and reboot.
The other GFS clients and the server are fine.
Any chance someone could give me an idea on why there'd be a failure here, so I can have a better idea on what to tune on the system?
We recently (ie. yesterday) bumped the number of NFS server processes on this machine from 8 to 24, if that will help...
Thanks.
--
Jerry Gilyeat, RHCE
Systems Administrator
Molecular Microbiology and Immunology
Johns Hopkins Bloomberg School of Public Health
-- Linux-cluster@xxxxxxxxxx https://www.redhat.com/mailman/listinfo/linux-cluster