On Thu, Jul 28, 2011 at 05:47:57PM -0400, Trond Myklebust wrote: > On Thu, 2011-07-28 at 17:24 -0400, Gregory Magoon wrote: > > Thanks for the tips...unfortunately, making the changes you suggest (removing > > timeo, rsize, wsize options) doesn't seem to address the issue with MPICH2 and > > NFSv4. > > Have you turned off delegations on the server? I wouldn't expect them to > help much on an MPI workload. Note, you can do that with "echo 0 >/proc/sys/fs/leases-enable before starting nfsd. > Otherwise, you might want to post a comparison of your results from > 'nfsstat' for your workload on NFSv3 and NFSv4. Yes. Taking a sample of the network traffic once it gets stuck might also be interesting. (Wait for it to get stuck, the run "tcpdump -s0 -wtmp.pcap", let it go for a second (longer if that doesn't get anything), then interrupt it and send us tmp.pcap.) Or if it gets stuck immediately, then you could start the tcpdump before the you start your tests and capture everything. But if it doesn't get stuck immediately that could be a ton of data. --b. -- To unsubscribe from this list: send the line "unsubscribe linux-nfs" in the body of a message to majordomo@xxxxxxxxxxxxxxx More majordomo info at http://vger.kernel.org/majordomo-info.html