Re: [RFC] SUNRPC connect timeout case network request delay

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



On 03/04/2010 05:12 AM, Mi Jinlong wrote:
Hi,

When I use the nfs at RHEL with kernel-2.6.31 and nfs-utils-1.2.0,
I got a problem, but i don't sure.

The process:

Step1: [22:40:9] Open file /mnt/nfs/file(OK)
Step2: [22:40:9] Lock file
        [22:40:9] Lock file success
Step3: [22:40:9] Reboot the nfs server
        If reboot success, put<enter>  to continue
Step4: [22:42:16] Write data to file
        [22:42:16] Write data success
Step5: [22:42:16] Unlock file
        [22:46:30] Unlock file success.
Step6: [22:46:30] Close file /mnt/nfs/file
        [22:46:30] Close fiel /mnt/nfs/file success

The problem is at step5, unlock file takes 4 min, it's a long time than expected.
When traceing the kernel, I find SUNRPC call call_connect timeout many times,
one timeout is 1min.

The kernel's TCP reconnect logic will retry until it succeeds, without letting the upper level make progress. For some reason, it is having difficulty reconnecting with your server.

I think it's a problem of kernel, but i don't know why, can someone help me ?

# sudo rpcdebug -m rpc -s xprt trans

run your test, and then look in your kernel log. It might tell you what error is occurring. Another good thing to do is capture a network trace with

# sudo tcpdump -s 1540 -w /tmp/raw

while you run your test.

This is similar to the symptoms of other connect problems I've encountered recently.

--
chuck[dot]lever[at]oracle[dot]com
--
To unsubscribe from this list: send the line "unsubscribe linux-nfs" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at  http://vger.kernel.org/majordomo-info.html

[Index of Archives]     [Linux Filesystem Development]     [Linux USB Development]     [Linux Media Development]     [Video for Linux]     [Linux NILFS]     [Linux Audio Users]     [Yosemite Info]     [Linux SCSI]

  Powered by Linux