Re: kernel oops in generic/013 on an rdma mount (over either soft roce or iwarp)

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 




> On Nov 10, 2020, at 1:18 PM, Olga Kornievskaia <aglo@xxxxxxxxx> wrote:
> 
> On Tue, Nov 10, 2020 at 12:44 PM Chuck Lever <chuck.lever@xxxxxxxxxx> wrote:
>> 
>> 
>>> On Nov 10, 2020, at 11:51 AM, Olga Kornievskaia <aglo@xxxxxxxxx> wrote:
>>> 
>>> On Tue, Nov 10, 2020 at 9:42 AM Chuck Lever <chuck.lever@xxxxxxxxxx> wrote:
>>>> 
>>>>> Which those changes applied, I get the following oops:
>>>> 
>>>> What's your workload? Do you have a reproducer?
>>> 
>>> I ran generic/013 linux-to-linux.
>> 
>> I'm not able to reproduce the problem.
> 
> Are you on hardware? This is over soft roce/iwarp. I will try hardware
> but it'll take me time.

Since it appears to work correctly when a hardware RDMA device is in
use, that approach would be a waste of your time, methinks. Can you try
debugging with your soft RDMA device?

Start by identifying what NFS operation is failing, and what configuration
of chunks it is using.


>> xfstest: mount options: vers=4.2,proto=rdma,sec=sys,rsize=262144,wsize=131072
>> 
>> FSTYP         -- nfs
>> PLATFORM      -- Linux/x86_64 manet 5.10.0-rc1-00015-g6d4bab79ed4f #1297 SMP Sat Oct 31 12:56:30 EDT 2020
>> 
>> generic/001 22s ...  22s
>> generic/002 1s ...  2s
>> generic/003     [not run] this test requires a valid $SCRATCH_DEV
>> generic/004     [not run] O_TMPFILE is not supported
>> generic/005 1s ...  2s
>> generic/006 10s ...  9s
>> generic/007 40s ...  39s
>> generic/008     [not run] xfs_io fzero  failed (old kernel/wrong fs?)
>> generic/009     [not run] xfs_io fzero  failed (old kernel/wrong fs?)
>> generic/010     [not run] /home/cel/src/xfstests/src/dbtest not built
>> generic/011 6s ...  6s
>> generic/012     [not run] xfs_io fiemap  failed (old kernel/wrong fs?)
>> generic/013 9s ...  9s
>> generic/014 10s ...  8s
>> generic/015     [not run] this test requires a valid $SCRATCH_DEV
>> generic/016     [not run] xfs_io fiemap  failed (old kernel/wrong fs?)
>> generic/017     [not run] this test requires a valid $SCRATCH_DEV
>> generic/018     [not run] this test requires a valid $SCRATCH_DEV
>> 
>> I must be missing something that you have in your environment.
>> 
>> 
>> --
>> Chuck Lever

--
Chuck Lever







[Index of Archives]     [Linux Filesystem Development]     [Linux USB Development]     [Linux Media Development]     [Video for Linux]     [Linux NILFS]     [Linux Audio Users]     [Yosemite Info]     [Linux SCSI]

  Powered by Linux