Eric Renfro wrote:
Here's the stack traces I'm getting when it faults:
Jan 13 03:31:27 cweb1 kernel: [1387920.160141] INFO: task
flush-251:1:27497 blocked for more than 120 seconds.
Jan 13 03:31:27 cweb1 kernel: [1387920.160802] "echo 0 >
/proc/sys/kernel/hung_task_timeout_secs" disables this message.
As the error says, this isn't an actual kernel ooops, it means that
something in your stack (likely the iSCSI implementation since that is
the fixed thing throughout what you tested) is blocking somewhere.
What version of RHEL, gfs2 and iscsi are you using? My guess is that
iscsi might be getting into a race somewhere and locks up. Have you
tried connecting both clients to just a single server via iscsi to see
if the problem goes away?
Gordan
--
Linux-cluster mailing list
Linux-cluster@xxxxxxxxxx
https://www.redhat.com/mailman/listinfo/linux-cluster