Re: Randomly inaccessible files through NFS

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



I would try to tcpdump all NFS traffic starting when the client is in
the "stable" state (including the MOUNT call). Once it's in the
"unstable" state, I would stop the capture then try to figure out
exactly at what point it switched from "stable" to "unstable" (maybe
figure out when exactly the NFS4ERR_EXPIRED start to happen) and track
it down to a specific NFS pattern.

I don't know much about NFS really so I cannot be more specific. Yes,
this probably requires lot of storage to capture all the traffic and
lot of time to analyse the captured data.

On Fri, Aug 17, 2012 at 11:26 AM, Denis V. Nagorny
<dvnagorny@xxxxxxxxxxxxxx> wrote:
> 15.08.2012 11:54, Denis V. Nagorny пишет:
>
>> Hello,
>>
>> Using Scientific Linux 6.1 (I think it's equal to RH EL 6.1) we met the
>> strange issue.  Several last months we have problem. After one or two days
>> of successful work, files on nfs server begins to be randomly unacessible.
>> I doesn't mean that files becames hidden or something like this. It means
>> that attempts to open some random files may be unsuccessful. Usually restart
>> of nfs server makes situation better but for several days only. There are no
>> any messages about errors in logs on server and clients machines. Can
>> anybody point me how can I try to understand what happens at least. Sorry
>> for my english.
>>
>> Denis.
>
>
> Hello again,
>
> I've made some additional experiments. It looks like nfs clients can be in
> one of two states: "quite stable" and "quite unstable". Clients are usually
> stable but after some heavy job with a lot of I/O with NFS server clients
> become "quite unstable" and fails even with single file operations with NFS
> server. In this state I can't unmount NFS shares and so on.  I've tried to
> analyse with wireshark and found that in unstable state there are a lot of
> NFS4ERR_EXPIRED answers from NFS server.  In one of experiments I've changed
> NICs in both machines involved - result the same. So I'm still looking for
> the ways to understand the problem.
> Can anybody give me any advices?
>
> Denis
>
> --
> To unsubscribe from this list: send the line "unsubscribe linux-nfs" in
> the body of a message to majordomo@xxxxxxxxxxxxxxx
> More majordomo info at  http://vger.kernel.org/majordomo-info.html
--
To unsubscribe from this list: send the line "unsubscribe linux-nfs" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at  http://vger.kernel.org/majordomo-info.html


[Index of Archives]     [Linux Filesystem Development]     [Linux USB Development]     [Linux Media Development]     [Video for Linux]     [Linux NILFS]     [Linux Audio Users]     [Yosemite Info]     [Linux SCSI]

  Powered by Linux