Re: NFSv4 memory allocation bug?

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



On Thu, Jan 13, 2011 at 06:25:23PM +0100, Txema Heredia Genestar wrote:
>  El 13/01/11 17:19, J. Bruce Fields escribiÃ:
> >On Thu, Jan 13, 2011 at 04:48:26PM +0100, Txema Heredia Genestar wrote:
> >>  Hi Bruce, thanks for your answer
> >>
> >>
> >>El 12/01/11 19:35, J. Bruce Fields escribiÃ:
> >>>On Wed, Jan 12, 2011 at 06:14:53PM +0100, Txema Heredia Genestar wrote:
> >>>>Additionally, I have checked tcpdump and found, when mounting an
> >>>>NFS4 drive from a working storage-system:
> >>>>...
> >>>>12:38:06.372303 IP client.907>   storage.nfs: . ack 29 win 46
> >>>><nop,nop,timestamp 4063464822 174132214>
> >>>>12:38:06.372429 IP client.2364980656>   storage.nfs: 148 getattr [|nfs]
> >>>>12:38:06.372792 IP storage.nfs>   client.2364980656: reply ok 248
> >>>>getattr [|nfs]
> >>>>12:38:06.372958 IP client.2381757872>   storage.nfs: 172 getattr [|nfs]
> >>>>12:38:06.373132 IP storage.nfs>   client.2381757872: reply ok 88
> >>>>getattr [|nfs]
> >>>>12:38:06.373157 IP client.2398535088>   storage.nfs: 176 getattr [|nfs]
> >>>>12:38:06.373316 IP storage.nfs>   client.2398535088: reply ok 100
> >>>>getattr [|nfs]
> >>>>12:38:06.373339 IP client.2415312304>   storage.nfs: 172 getattr [|nfs]
> >>>>
> >>>>
> >>>>But when I mount from the same client, the NFS4 share from my server
> >>>>gets stuck on the "getattr" call
> >>>>...
> >>>>12:36:37.051840 IP client.926>   server.nfs: . ack 29 win 140
> >>>><nop,nop,timestamp 4063375488 434039929>
> >>>>12:36:37.051903 IP client.1734362088>   server.nfs: 148 getattr [|nfs]
> >>>>12:36:37.090274 IP server.nfs>   client.926: . ack 192 win 4742
> >>>><nop,nop,timestamp 434039939 4063375488>
> >>>>---silence---
> >>>Something like wireshark would give a few more details.
> >>I have wiresharked it and I don't see any differences between the
> >>"getattr" packages in both cases. Do you want me to paste them in a
> >>specific format?
> >I'm curious which attributes were requested.  In particular, is the
> >unreplied-to getattr the *first* time that the client requests the owner
> >or owner_group attributes?
> >
> 
> Yes, the "unreplied-to" getattr call is the very first (and only)
> time it those are requested:

Yeah, so almost certainly some idmapping problem.

> >Might also be worth looking at the nfs4.idtoname cache contents after
> >the hang:
> >
> >	rpcdebug -m rpc -s cache
> >	cat /proc/net/rpc/nfs4.idtoname/content
> >
> >I seem to recall c9b6cbe56d3ac471e6cd72a59ec9e324b3417016 or
> >0a725fc4d3bfc4734164863d6c50208b109ca5c7 being possible causes of hangs.
> >
> >--b.
> 
> Unfortunately, rpcdebug is not present in this server. So my
> /proc/net/rpc/nfs4.idtoname/content file is empty.
> 
> May this command be of any use?
> "echo "65535" > /proc/sys/sunrpc/rpc_debug"

Yeah doing that may get you a little more information.

Note that kernel's pretty out of date with respect to mainstream so your
best bet is your distro support or seeing if you can reproduce the
problem on a more recent kernel.

--b.
--
To unsubscribe from this list: send the line "unsubscribe linux-nfs" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at  http://vger.kernel.org/majordomo-info.html


[Index of Archives]     [Linux Filesystem Development]     [Linux USB Development]     [Linux Media Development]     [Video for Linux]     [Linux NILFS]     [Linux Audio Users]     [Yosemite Info]     [Linux SCSI]

  Powered by Linux