Re: Performance/stability problems with nfs shares

Dawid Stawiarski <neeo@xxxxxxxx> · Sun, 04 Aug 2013 09:29:37 +0200

W dniu 02.08.2013 17:47, J. Bruce Fields pisze:
On Fri, Aug 02, 2013 at 04:37:57PM +0200, Dawid Stawiarski wrote:
W dniu 02.08.2013 15:12, Jeff Layton pisze:
Typically, a stack trace like that indicates that the process is
waiting for the server to respond. The first thing I would do would be
to ascertain whether the server is actually responding to these
requests.

The same share is accessible on other nodes, so the problem involves
only one of the nodes (completly random) at a time.

It's still conceivable that a server problem could cause it to stop
responding to calls only from a single client--it'd be useful if
possible to check a trace to see if that's what's happening.  If the
traffic is really huge then capturing and analyzing a good trace may be
difficult.

We have almost one hundred NFS nodes, and they're all using the same 
shares with about the same volume of traffic - and only one of the nodes 
at a time has a problem for only one share (other shares from the SAME 
server work OK on the "failing" node) - so it's hard to belive it's the 
server that causes the problem. And yes - with that amount of traffic 
it's hard to make a meaningfull trace on the server side (as noted 
before server is NexentaStor based, and not Linux).

Dawid

Attachment:
smime.p7s

Description: Kryptograficzna sygnatura S/MIME