Re: [NFS] I/O Errors with hard mounts

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



I collected some more information on the problem we are seeing.

Here's what I've got:

1) SuSE 10.1 (2.6.16 kernel): running ls -R, hit Control-C-- often see
an "I/O Error", for example:

/gne/home/aa/barfod.files/mac.backup/Avi's/TNFR-IgG/Mutants/mAbs:
11.15.91
/bin/ls: reading directory
/gne/home/aa/barfod.files/mac.backup/Avi's/TNFR-IgG/Mutants/mAbs/11.15.91:
Input/output error

Here's what I captured from RPC and NFS debugging.  No "disconnect"
message like I saw before, but:

Jun  9 09:50:30 lablnx01 kernel: RPC: 46206 xprt_transmit(136)
Jun  9 09:50:30 lablnx01 kernel: RPC:      xs_tcp_send_request(136) = 136
Jun  9 09:50:30 lablnx01 kernel: RPC: 46206 xmit complete
Jun  9 09:50:30 lablnx01 kernel: RPC: 46206 sleep_on(queue
"xprt_pending" time 4340153030)
Jun  9 09:50:30 lablnx01 kernel: RPC: 46206 added to queue
ffff81046bca5d20 "xprt_pending"
Jun  9 09:50:30 lablnx01 kernel: RPC: 46206 setting alarm for 60000 ms
Jun  9 09:50:30 lablnx01 kernel: RPC:
wake_up_next(ffff81046bca5cd0 "xprt_resend")
Jun  9 09:50:30 lablnx01 kernel: RPC:
wake_up_next(ffff81046bca5c80 "xprt_sending")
Jun  9 09:50:30 lablnx01 kernel: RPC: 46206 sync task going to sleep
Jun  9 09:50:30 lablnx01 kernel: RPC: 46206 got signal
Jun  9 09:50:30 lablnx01 kernel: RPC: 46206 __rpc_wake_up_task (now 4340153035)
Jun  9 09:50:30 lablnx01 kernel: RPC: 46206 disabling timer
Jun  9 09:50:30 lablnx01 kernel: RPC: 46206 removed from queue
ffff81046bca5d20 "xprt_pending"
Jun  9 09:50:30 lablnx01 kernel: RPC:      __rpc_wake_up_task done
Jun  9 09:50:30 lablnx01 kernel: RPC: 46206 sync task resuming
Jun  9 09:50:30 lablnx01 kernel: RPC: 46206 deleting timer
Jun  9 09:50:30 lablnx01 kernel: RPC: 46206, return -512, status -512
Jun  9 09:50:30 lablnx01 kernel: RPC: 46206 release task
Jun  9 09:50:30 lablnx01 kernel: RPC: 46206 release request ffff81046f718000
Jun  9 09:50:30 lablnx01 kernel: RPC:
wake_up_next(ffff81046bca5d70 "xprt_backlog")
Jun  9 09:50:30 lablnx01 kernel: RPC: 46206 releasing UNIX cred ffff810c70dbba40
Jun  9 09:50:30 lablnx01 kernel: RPC:
rpc_release_client(ffff81046c15b800, 1)
Jun  9 09:50:30 lablnx01 kernel: RPC: 46206 freeing task
Jun  9 09:50:30 lablnx01 kernel: NFS reply readdir: -512
Jun  9 09:50:30 lablnx01 kernel: NFS: find_dirent_page() returns -5
Jun  9 09:50:30 lablnx01 kernel: NFS: readdir_search_pagecache() returned -5
Jun  9 09:50:30 lablnx01 kernel: NFS: dentry_delete(mAbs/11.15.91, 8)


Note the "reply readdir: -512", is that consistent with what you guys
are saying?

Noticeably, I cannot get the error message on a host with a newer kernel.

Dave

-------------------------------------------------------------------------
Check out the new SourceForge.net Marketplace.
It's the best place to buy or sell services for
just about anything Open Source.
http://sourceforge.net/services/buy/index.php
_______________________________________________
NFS maillist  -  NFS@xxxxxxxxxxxxxxxxxxxxx
https://lists.sourceforge.net/lists/listinfo/nfs
_______________________________________________
Please note that nfs@xxxxxxxxxxxxxxxxxxxxx is being discontinued.
Please subscribe to linux-nfs@xxxxxxxxxxxxxxx instead.
    http://vger.kernel.org/vger-lists.html#linux-nfs

--
To unsubscribe from this list: send the line "unsubscribe linux-nfs" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at  http://vger.kernel.org/majordomo-info.html

[Index of Archives]     [Linux Filesystem Development]     [Linux USB Development]     [Linux Media Development]     [Video for Linux]     [Linux NILFS]     [Linux Audio Users]     [Yosemite Info]     [Linux SCSI]

  Powered by Linux