Re: [Open-FCoE] System crashes with increased drive count

On the initiator side, there were also a lot of messages:

May 21 11:01:52 poc1 kernel: [ 3374.393864] host7: fcp: 00061e:
Returning DID_ERROR to scsi-ml due to FC_DATA_UNDRUN (scsi)
May 21 11:01:52 poc1 kernel: [ 3374.396149] host7: xid  fc1: Exchange
timer canceled
May 21 11:01:52 poc1 kernel: [ 3374.396155] host7: fcp: 00061e:
Returning DID_ERROR to scsi-ml due to FC_DATA_UNDRUN (scsi)
May 21 11:01:52 poc1 kernel: [ 3374.397069] host7: xid  602: Exchange
timer canceled
May 21 11:01:52 poc1 kernel: [ 3374.397075] host7: fcp: 00061e:
Returning DID_ERROR to scsi-ml due to FC_DATA_UNDRUN (scsi)
May 21 11:01:52 poc1 kernel: [ 3374.397482] host7: xid  a46: Exchange
timer armed : 8000 msecs
May 21 11:01:52 poc1 kernel: [ 3374.398443] host7: xid  602: Exchange
timer armed : 8000 msecs
May 21 11:01:52 poc1 kernel: [ 3374.398498] host7: xid  6ce: Exchange
timer armed : 8000 msecs
May 21 11:01:52 poc1 kernel: [ 3374.398863] host7: xid  6ce: Exchange
timer canceled
May 21 11:01:52 poc1 kernel: [ 3374.398869] host7: fcp: 00061e:
Returning DID_ERROR to scsi-ml due to FC_DATA_UNDRUN (scsi)
May 21 11:01:52 poc1 kernel: [ 3374.399449] host7: xid  6a2: Exchange
timer armed : 8000 msecs
May 21 11:01:52 poc1 kernel: [ 3374.399476] host7: xid  3e1: Exchange
timer armed : 8000 msecs
May 21 11:01:52 poc1 kernel: [ 3374.399486] host7: xid  dcd: Exchange
timer armed : 8000 msecs
May 21 11:01:52 poc1 kernel: [ 3374.399866] host7: xid  dcd: Exchange
timer canceled
May 21 11:01:52 poc1 kernel: [ 3374.399872] host7: fcp: 00061e:
Returning DID_ERROR to scsi-ml due to FC_DATA_UNDRUN (scsi)
May 21 11:01:52 poc1 kernel: [ 3374.400472] host7: xid  d6c: Exchange
timer armed : 8000 msecs
May 21 11:01:52 poc1 kernel: [ 3374.401442] host7: xid  585: Exchange
timer canceled
May 21 11:01:52 poc1 kernel: [ 3374.401453] host7: fcp: 00061e:
Returning DID_ERROR to scsi-ml due to FC_DATA_UNDRUN (scsi)
May 21 11:01:52 poc1 kernel: [ 3374.402457] host7: xid  585: Exchange
timer armed : 8000 msecs
May 21 11:01:52 poc1 kernel: [ 3374.402480] host7: xid  742: Exchange
timer armed : 8000 msecs
May 21 11:01:52 poc1 kernel: [ 3374.403602] host7: xid  4a7: Exchange
timer canceled
May 21 11:01:52 poc1 kernel: [ 3374.403607] host7: fcp: 00061e:
Returning DID_ERROR to scsi-ml due to FC_DATA_UNDRUN (scsi)
May 21 11:01:52 poc1 kernel: [ 3374.403674] host7: xid  907: Exchange
timer canceled
May 21 11:01:52 poc1 kernel: [ 3374.403678] host7: fcp: 00061e:
Returning DID_ERROR to scsi-ml due to FC_DATA_UNDRUN (scsi)
May 21 11:01:52 poc1 kernel: [ 3374.404276] host7: fcp: 00061e: xid
0084-08c2: DDP I/O in fc_fcp_recv_data set ERROR
May 21 11:01:52 poc1 kernel: [ 3374.404281] host7: xid   84: f_ctl  90000 seq  1
May 21 11:01:52 poc1 kernel: [ 3374.404492] host7: xid  dac: Exchange
timer armed : 8000 msecs
May 21 11:01:52 poc1 kernel: [ 3374.404506] host7: xid  70a: Exchange
timer armed : 8000 msecs
May 21 11:01:52 poc1 kernel: [ 3374.405384] host7: xid  585: Exchange
timer canceled
May 21 11:01:52 poc1 kernel: [ 3374.405390] host7: fcp: 00061e:
Returning DID_ERROR to scsi-ml due to FC_DATA_UNDRUN (scsi)

More and more "fc_fcp_recv_data set ERROR" messages show up in the file later:

May 21 11:01:53 poc1 kernel: [ 3374.614142] host7: fcp: 00061e: xid
01ed-0925: DDP I/O in fc_fcp_recv_data set ERROR
May 21 11:01:53 poc1 kernel: [ 3374.614147] host7: xid  1ed: f_ctl  90000 seq  1
May 21 11:01:53 poc1 kernel: [ 3374.616658] host7: xid  4e7: Exchange
timer armed : 8000 msecs
May 21 11:01:53 poc1 kernel: [ 3374.617657] host7: xid  54a: Exchange
timer armed : 8000 msecs
May 21 11:01:53 poc1 kernel: [ 3374.617958] host7: xid  54a: Exchange
timer canceled
May 21 11:01:53 poc1 kernel: [ 3374.617965] host7: fcp: 00061e:
Returning DID_ERROR to scsi-ml due to FC_DATA_UNDRUN (scsi)
May 21 11:01:53 poc1 kernel: [ 3374.619656] host7: xid  ded: Exchange
timer armed : 8000 msecs
May 21 11:01:53 poc1 kernel: [ 3374.620627] host7: xid  489: Exchange
timer armed : 8000 msecs
May 21 11:01:53 poc1 kernel: [ 3374.620744] host7: fcp: 00061e: xid
01cd-038e: DDP I/O in fc_fcp_recv_data set ERROR
May 21 11:01:53 poc1 kernel: [ 3374.620747] host7: xid  1cd: f_ctl  90000 seq  1
May 21 11:01:53 poc1 kernel: [ 3374.621078] host7: xid  1cd: BLS rctl
85 - BLS reject received
May 21 11:01:53 poc1 kernel: [ 3374.621389] host7: xid  3c1: Exchange
timer canceled
May 21 11:01:53 poc1 kernel: [ 3374.621395] host7: fcp: 00061e:
Returning DID_ERROR to scsi-ml due to FC_DATA_UNDRUN (scsi)
May 21 11:01:53 poc1 kernel: [ 3374.621622] host7: xid  a64: Exchange
timer armed : 8000 msecs
May 21 11:01:53 poc1 kernel: [ 3374.622056] host7: fcp: 00061e: xid
0107-06e4: DDP I/O in fc_fcp_recv_data set ERROR
May 21 11:01:53 poc1 kernel: [ 3374.622060] host7: xid  107: f_ctl  90000 seq  1
May 21 11:01:53 poc1 kernel: [ 3374.622370] host7: xid   cc: exch: BLS
rctl 84 - BLS accept
May 21 11:01:53 poc1 kernel: [ 3374.622381] host7: fcp: 00061e:
Returning DID_ERROR to scsi-ml due to FC_CMD_ABORTED
May 21 11:01:53 poc1 kernel: [ 3374.622491] host7: fcp: 00061e: xid
00a5-0862: DDP I/O in fc_fcp_recv_data set ERROR
May 21 11:01:53 poc1 kernel: [ 3374.622496] host7: xid   a5: f_ctl  90000 seq  1
May 21 11:01:53 poc1 kernel: [ 3374.622866] host7: xid  b86: Exchange
timer canceled
May 21 11:01:53 poc1 kernel: [ 3374.622870] host7: xid  1e6: f_ctl  90000 seq  1
May 21 11:01:53 poc1 kernel: [ 3374.622889] host7: xid  dcd: Exchange
timer canceled
May 21 11:01:53 poc1 kernel: [ 3374.622897] host7: fcp: 00061e:
Returning DID_ERROR to scsi-ml due to FC_DATA_UNDRUN (scsi)
May 21 11:01:53 poc1 kernel: [ 3374.623491] host7: xid  1e6: BLS rctl
85 - BLS reject received
May 21 11:01:53 poc1 kernel: [ 3374.623637] host7: xid  4a0: Exchange
timer canceled
May 21 11:01:53 poc1 kernel: [ 3374.623719] host7: fcp: 00061e: xid
012d-0869: DDP I/O in fc_fcp_recv_data set ERROR
May 21 11:01:53 poc1 kernel: [ 3374.623724] host7: xid  12d: f_ctl  90000 seq  1
May 21 11:01:53 poc1 kernel: [ 3374.624626] host7: xid  fa4: Exchange
timer armed : 8000 msecs
May 21 11:01:53 poc1 kernel: [ 3374.624824] host7: xid  fa4: Exchange
timer canceled
May 21 11:01:53 poc1 kernel: [ 3374.624829] host7: fcp: 00061e:
Returning DID_ERROR to scsi-ml due to FC_DATA_UNDRUN (scsi)
May 21 11:01:53 poc1 kernel: [ 3374.625133] host7: fcp: 00061e: xid
00a4-0e4f: DDP I/O in fc_fcp_recv_data set ERROR
May 21 11:01:53 poc1 kernel: [ 3374.625137] host7: xid   a4: f_ctl  90000 seq  1
May 21 11:01:53 poc1 kernel: [ 3374.625404] host7: xid   a4: BLS rctl
85 - BLS reject received
May 21 11:01:53 poc1 kernel: [ 3374.625536] host7: fcp: 00061e: xid
01e7-05e7: DDP I/O in fc_fcp_recv_data set ERROR
May 21 11:01:53 poc1 kernel: [ 3374.625540] host7: xid  1e7: f_ctl  90000 seq  1
May 21 11:01:53 poc1 kernel: [ 3374.625637] host7: xid  fa4: Exchange
timer armed : 8000 msecs
May 21 11:01:53 poc1 kernel: [ 3374.625842] host7: xid  1e7: BLS rctl
85 - BLS reject received
May 21 11:01:53 poc1 kernel: [ 3374.626210] host7: fcp: 00061e: xid
01a7-0666: DDP I/O in fc_fcp_recv_data set ERROR
May 21 11:01:53 poc1 kernel: [ 3374.626213] host7: xid  1a7: f_ctl  90000 seq  1
May 21 11:01:53 poc1 kernel: [ 3374.626581] host7: xid  102: exch: BLS
rctl 84 - BLS accept
May 21 11:01:53 poc1 kernel: [ 3374.626590] host7: fcp: 00061e:
Returning DID_ERROR to scsi-ml due to FC_CMD_ABORTED
May 21 11:01:53 poc1 kernel: [ 3374.626602] host7: fcp: 00061e: xid
01c7-0862: DDP I/O in fc_fcp_recv_data set ERROR
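
With a log this large, counting each kind of message may be more useful than reading it line by line. Below is a minimal Python sketch that tallies the initiator-side messages shown above. The search strings are copied from the log excerpts; the labels and the function name `tally` are made up for illustration. It assumes each message is on a single line in the actual messages file (the wrapping above comes from the mail client).

```python
from collections import Counter

# Substrings copied from the initiator log excerpts above.
# The labels on the left are illustrative names, not kernel identifiers.
PATTERNS = {
    "data_underrun": "due to FC_DATA_UNDRUN",
    "cmd_aborted": "due to FC_CMD_ABORTED",
    "ddp_error": "DDP I/O in fc_fcp_recv_data set ERROR",
    "bls_reject": "BLS reject received",
    "bls_accept": "BLS accept",
}


def tally(lines):
    """Count how many log lines contain each known message substring."""
    counts = Counter()
    for line in lines:
        for label, needle in PATTERNS.items():
            if needle in line:
                counts[label] += 1
    return counts


# Small demonstration on lines taken from the excerpt above (unwrapped).
demo = [
    "May 21 11:01:53 poc1 kernel: [ 3374.621078] host7: xid  1cd: BLS rctl 85 - BLS reject received",
    "May 21 11:01:53 poc1 kernel: [ 3374.621395] host7: fcp: 00061e: Returning DID_ERROR to scsi-ml due to FC_DATA_UNDRUN (scsi)",
]
for label, count in tally(demo).most_common():
    print(count, label)
```

To run it over a whole file, pass the open file object to `tally()`; it reads line by line, so a 1.5GB file does not have to fit in memory.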

On Wed, May 21, 2014 at 2:03 PM, Jun Wu <jwu@xxxxxxxxxxxx> wrote:
> I enabled Tx offload and set the debug_logging parameters as suggested.
> A run of a few minutes generated a 1.5GB messages file.
>
> The file has repeated patterns of the following:
>
> 18630713 May 21 11:01:53 poc2 kernel: [ 1528.182334] host7: xid  b48:
> f_ctl 800000 seq  1
> 18630714 May 21 11:01:53 poc2 kernel: [ 1528.182345] host7: xid  b48:
> f_ctl 880008 seq  2
> 18630715 May 21 11:01:53 poc2 kernel: [ 1528.182601] host7: xid  38e:
> f_ctl 800000 seq  1
> 18630716 May 21 11:01:53 poc2 kernel: [ 1528.182621] host7: xid  38e:
> f_ctl 880008 seq  2
> 18630717 May 21 11:01:53 poc2 kernel: [ 1528.182771] host7: xid  74a:
> f_ctl 800000 seq  1
> 18630718 May 21 11:01:53 poc2 kernel: [ 1528.182785] host7: xid  74a:
> f_ctl 880008 seq  2
> 18630719 May 21 11:01:53 poc2 kernel: [ 1528.183161] host7: xid  e4f:
> f_ctl 800000 seq  1
> 18630720 May 21 11:01:53 poc2 kernel: [ 1528.183181] host7: xid  e4f:
> f_ctl 880008 seq  2
> 18630721 May 21 11:01:53 poc2 kernel: [ 1528.184285] host7: xid  666:
> f_ctl 800000 seq  1
> 18630722 May 21 11:01:53 poc2 kernel: [ 1528.184301] host7: xid  666:
> f_ctl 880008 seq  2
> 18630723 May 21 11:01:53 poc2 kernel: [ 1528.184550] host7: xid  c20:
> f_ctl 800000 seq  1
> 18630724 May 21 11:01:53 poc2 kernel: [ 1528.184589] host7: xid  c20:
> f_ctl 880008 seq  2
> 18630725 May 21 11:01:53 poc2 kernel: [ 1528.185198] host7: xid  607:
> f_ctl 800000 seq  1
> 18630726 May 21 11:01:53 poc2 kernel: [ 1528.185213] host7: xid  607:
> f_ctl 880008 seq  2
> 18630727 May 21 11:01:53 poc2 kernel: [ 1528.185659] host7: xid  925:
> f_ctl 800000 seq  1
> 18630728 May 21 11:01:53 poc2 kernel: [ 1528.185662] host7: xid  b48:
> f_ctl 800000 seq  1
> 18630729 May 21 11:01:53 poc2 kernel: [ 1528.185672] host7: xid  b48:
> f_ctl 880008 seq  2
> 18630730 May 21 11:01:53 poc2 kernel: [ 1528.185680] host7: xid  925:
> f_ctl 880008 seq  2
> 18630731 May 21 11:01:53 poc2 kernel: [ 1528.185751] host7: xid  b61:
> f_ctl 800000 seq  1
> 18630732 May 21 11:01:53 poc2 kernel: [ 1528.185765] host7: xid  b61:
> f_ctl 880008 seq  2
> 18630733 May 21 11:01:53 poc2 kernel: [ 1528.186413] host7: xid  829:
> f_ctl 800000 seq  1
> 18630734 May 21 11:01:53 poc2 kernel: [ 1528.186425] host7: xid  829:
> f_ctl 880008 seq  2
> 18630735 May 21 11:01:53 poc2 kernel: [ 1528.186785] host7: xid  6e4:
> f_ctl 800000 seq  1
> 18630736 May 21 11:01:53 poc2 kernel: [ 1528.186817] host7: xid  6e4:
> f_ctl 880008 seq  2
> 18630737 May 21 11:01:53 poc2 kernel: [ 1528.186932] host7: xid  dab:
> f_ctl 800000 seq  1
> 18630738 May 21 11:01:53 poc2 kernel: [ 1528.186946] host7: xid  dab:
> f_ctl 880008 seq  2
> 18630739 May 21 11:01:53 poc2 kernel: [ 1528.187907] host7: xid  8ac:
> f_ctl 800000 seq  1
> 18630740 May 21 11:01:53 poc2 kernel: [ 1528.187920] host7: xid  8ac:
> f_ctl 880008 seq  2
> 18630741 May 21 11:01:53 poc2 kernel: [ 1528.188656] host7: xid  38e:
> f_ctl 800000 seq  1
> 18630742 May 21 11:01:53 poc2 kernel: [ 1528.188675] host7: xid  38e:
> f_ctl 880008 seq  2
> 18630743 May 21 11:01:53 poc2 kernel: [ 1528.188889] host7: xid  b61:
> f_ctl 800000 seq  1
> 18630744 May 21 11:01:53 poc2 kernel: [ 1528.188899] host7: xid  b61:
> f_ctl 880008 seq  2
> 18630745 May 21 11:01:53 poc2 kernel: [ 1528.189281] host7: xid  88d:
> f_ctl 800000 seq  1
> 18630746 May 21 11:01:53 poc2 kernel: [ 1528.189301] host7: xid  88d:
> f_ctl 880008 seq  2
> 18630747 May 21 11:01:53 poc2 kernel: [ 1528.189378] host7: xid  c20:
> f_ctl 800000 seq  1
> 18630748 May 21 11:01:53 poc2 kernel: [ 1528.189392] host7: xid  c20:
> f_ctl 880008 seq  2
> 18630749 May 21 11:01:53 poc2 kernel: [ 1528.189836] host7: xid  862:
> f_ctl 800000 seq  1
> 18630750 May 21 11:01:53 poc2 kernel: [ 1528.189850] host7: xid  862:
> f_ctl 880008 seq  2
> 18630751 May 21 11:01:53 poc2 kernel: [ 1528.191740] host7: xid  6e4:
> Exchange timer armed : 0 msecs
> 18630752 May 21 11:01:53 poc2 kernel: [ 1528.191747] host7: xid  6e4:
> f_ctl 800000 seq  1
> 18630753 May 21 11:01:53 poc2 kernel: [ 1528.191756] host7: xid  6e4:
> f_ctl 800000 seq  2
> 18630754 May 21 11:01:53 poc2 kernel: [ 1528.191763] host7: xid  6e4:
> Exchange timed out
> 18630755 May 21 11:01:53 poc2 kernel: [ 1528.191777] ft_queue_data_in:
> Failed to send frame ffff8805bf245e00, xid <0x6e4>, remaining 458752,
> lso_max <0x10000>
> 18630756 May 21 11:01:53 poc2 kernel: [ 1528.191782] ft_queue_data_in:
> Failed to send frame ffff8805bf245e00, xid <0x6e4>, remaining 393216,
> lso_max <0x10000>
> 18630757 May 21 11:01:53 poc2 kernel: [ 1528.191786] ft_queue_data_in:
> Failed to send frame ffff8805bf245e00, xid <0x6e4>, remaining 327680,
> lso_max <0x10000>
> 18630758 May 21 11:01:53 poc2 kernel: [ 1528.191790] ft_queue_data_in:
> Failed to send frame ffff8805bf245e00, xid <0x6e4>, remaining 262144,
> lso_max <0x10000>
> 18630759 May 21 11:01:53 poc2 kernel: [ 1528.191794] ft_queue_data_in:
> Failed to send frame ffff8805bf245e00, xid <0x6e4>, remaining 196608,
> lso_max <0x10000>
> 18630760 May 21 11:01:53 poc2 kernel: [ 1528.191798] ft_queue_data_in:
> Failed to send frame ffff8805bf245e00, xid <0x6e4>, remaining 131072,
> lso_max <0x10000>
> 18630761 May 21 11:01:53 poc2 kernel: [ 1528.191801] ft_queue_data_in:
> Failed to send frame ffff8805bf245e00, xid <0x6e4>, remaining 65536,
> lso_max <0x10000>
> 18630762 May 21 11:01:53 poc2 kernel: [ 1528.191805] ft_queue_data_in:
> Failed to send frame ffff8805bf245e00, xid <0x6e4>, remaining 0,
> lso_max <0x10000>
> 18630763 May 21 11:01:53 poc2 kernel: [ 1528.192163] host7: xid  b48:
> f_ctl 800000 seq  1
> 18630764 May 21 11:01:53 poc2 kernel: [ 1528.192166] host7: xid  607:
> f_ctl 800000 seq  1
> 18630765 May 21 11:01:53 poc2 kernel: [ 1528.192176] host7: xid  607:
> f_ctl 880008 seq  2
> 18630766 May 21 11:01:53 poc2 kernel: [ 1528.192180] host7: xid  b48:
> f_ctl 880008 seq  2
> 18630767 May 21 11:01:53 poc2 kernel: [ 1528.192266] host7: xid  666:
> f_ctl 800000 seq  1
>
> The above is the first time the ft_queue_data_in message shows up in
> the file. Here is another instance:
>
> 18631537 May 21 11:01:53 poc2 kernel: [ 1528.333876] host7: xid  74a:
> f_ctl 800000 seq  1
> 18631538 May 21 11:01:53 poc2 kernel: [ 1528.333893] host7: xid  74a:
> f_ctl 880008 seq  2
> 18631539 May 21 11:01:53 poc2 kernel: [ 1528.334816] host7: xid  c20:
> f_ctl 800000 seq  1
> 18631540 May 21 11:01:53 poc2 kernel: [ 1528.334834] host7: xid  c20:
> f_ctl 880008 seq  2
> 18631541 May 21 11:01:53 poc2 kernel: [ 1528.334847] host7: xid  b81:
> f_ctl 800000 seq  1
> 18631542 May 21 11:01:53 poc2 kernel: [ 1528.334858] host7: xid  983:
> f_ctl 800000 seq  1
> 18631543 May 21 11:01:53 poc2 kernel: [ 1528.334864] host7: xid  b81:
> f_ctl 880008 seq  2
> 18631544 May 21 11:01:53 poc2 kernel: [ 1528.334881] host7: xid  983:
> f_ctl 880008 seq  2
> 18631545 May 21 11:01:53 poc2 kernel: [ 1528.334972] host7: xid  686:
> f_ctl 800000 seq  1
> 18631546 May 21 11:01:53 poc2 kernel: [ 1528.334985] host7: xid  686:
> f_ctl 880008 seq  2
> 18631547 May 21 11:01:53 poc2 kernel: [ 1528.335036] host7: xid  704:
> f_ctl 800000 seq  1
> 18631548 May 21 11:01:53 poc2 kernel: [ 1528.335052] host7: xid  704:
> f_ctl 880008 seq  2
> 18631549 May 21 11:01:53 poc2 kernel: [ 1528.335078] host7: xid  627:
> f_ctl 800000 seq  1
> 18631550 May 21 11:01:53 poc2 kernel: [ 1528.335088] host7: xid  627:
> f_ctl 880008 seq  2
> 18631551 May 21 11:01:53 poc2 kernel: [ 1528.335202] host7: xid  b48:
> f_ctl 800000 seq  1
> 18631552 May 21 11:01:53 poc2 kernel: [ 1528.335214] host7: xid  b48:
> f_ctl 880008 seq  2
> 18631553 May 21 11:01:53 poc2 kernel: [ 1528.335381] host7: xid  74a:
> Exchange timer armed : 0 msecs
> 18631554 May 21 11:01:53 poc2 kernel: [ 1528.335386] host7: xid  74a:
> f_ctl 800000 seq  1
> 18631555 May 21 11:01:53 poc2 kernel: [ 1528.335388] host7: xid  74a:
> Exchange timed out
> 18631556 May 21 11:01:53 poc2 kernel: [ 1528.335672] host7: xid  869:
> f_ctl 800000 seq  1
> 18631557 May 21 11:01:53 poc2 kernel: [ 1528.335688] host7: xid  869:
> f_ctl 880008 seq  2
> 18631558 May 21 11:01:53 poc2 kernel: [ 1528.336213] host7: xid  965:
> f_ctl 800000 seq  1
> 18631559 May 21 11:01:53 poc2 kernel: [ 1528.336232] host7: xid  965:
> f_ctl 880008 seq  2
> 18631560 May 21 11:01:53 poc2 kernel: [ 1528.336477] host7: xid  74a:
> f_ctl 800000 seq  2
> 18631561 May 21 11:01:53 poc2 kernel: [ 1528.336482] ft_queue_data_in:
> Failed to send frame ffff8802e25c1e00, xid <0x74a>, remaining 196608,
> lso_max <0x10000>
> 18631562 May 21 11:01:53 poc2 kernel: [ 1528.336486] ft_queue_data_in:
> Failed to send frame ffff8802e25c1e00, xid <0x74a>, remaining 131072,
> lso_max <0x10000>
> 18631563 May 21 11:01:53 poc2 kernel: [ 1528.336489] host7: xid  74a:
> f_ctl 800000 seq  3
> 18631564 May 21 11:01:53 poc2 kernel: [ 1528.337645] host7: xid  86d:
> f_ctl 800000 seq  1
> 18631565 May 21 11:01:53 poc2 kernel: [ 1528.337759] host7: xid  86d:
> f_ctl 880008 seq  2
> 18631566 May 21 11:01:53 poc2 kernel: [ 1528.337827] host7: xid  44e:
> f_ctl 800000 seq  1
> 18631567 May 21 11:01:53 poc2 kernel: [ 1528.337846] host7: xid  44e:
> f_ctl 880008 seq  2
> 18631568 May 21 11:01:53 poc2 kernel: [ 1528.340521] host7: xid  8c2:
> Exchange timer armed : 0 msecs
> 18631569 May 21 11:01:53 poc2 kernel: [ 1528.340526] host7: xid  8c2:
> f_ctl 800000 seq  1
> 18631570 May 21 11:01:53 poc2 kernel: [ 1528.340667] host7: xid  8c2:
> Exchange timed out
> 18631571 May 21 11:01:53 poc2 kernel: [ 1528.341064] host7: xid  983:
> f_ctl 800000 seq  1
> 18631572 May 21 11:01:53 poc2 kernel: [ 1528.341087] host7: xid  983:
> f_ctl 880008 seq  2
> 18631573 May 21 11:01:53 poc2 kernel: [ 1528.341286] host7: xid  b48:
> f_ctl 800000 seq  1
> 18631574 May 21 11:01:53 poc2 kernel: [ 1528.341306] host7: xid  b48:
> f_ctl 880008 seq  2
> 18631575 May 21 11:01:53 poc2 kernel: [ 1528.341522] host7: xid  869:
> Exchange timer armed : 0 msecs
> 18631576 May 21 11:01:53 poc2 kernel: [ 1528.341528] host7: xid  869:
> f_ctl 800000 seq  1
> 18631577 May 21 11:01:53 poc2 kernel: [ 1528.341539] host7: xid  869:
> Exchange timed out
> 18631578 May 21 11:01:53 poc2 kernel: [ 1528.341966] host7: xid  965:
> f_ctl 800000 seq  1
> 18631579 May 21 11:01:53 poc2 kernel: [ 1528.341979] host7: xid  965:
> f_ctl 880008 seq  2
> 18631580 May 21 11:01:53 poc2 kernel: [ 1528.342450] host7: xid  627:
> f_ctl 800000 seq  1
> 18631581 May 21 11:01:53 poc2 kernel: [ 1528.342467] host7: xid  627:
> f_ctl 880008 seq  2
> 18631582 May 21 11:01:53 poc2 kernel: [ 1528.342945] host7: xid  70a:
> f_ctl 800000 seq  1
> 18631583 May 21 11:01:53 poc2 kernel: [ 1528.342959] host7: xid  70a:
> f_ctl 880008 seq  2
>
> After stripping out the repeated "host7: xid  xxx: f_ctl 800000 seq
> x" messages, we have:
>
> 18630507:May 21 11:01:53 poc2 kernel: [ 1528.148314] host7: xid  d6b:
> Exchange timer armed : 0 msecs
> 18630509:May 21 11:01:53 poc2 kernel: [ 1528.148357] host7: xid  d6b:
> Exchange timed out
> 18630642:May 21 11:01:53 poc2 kernel: [ 1528.169143] host7: xid  88c:
> Exchange timer armed : 0 msecs
> 18630644:May 21 11:01:53 poc2 kernel: [ 1528.169197] host7: xid  88c:
> Exchange timed out
> 18630751:May 21 11:01:53 poc2 kernel: [ 1528.191740] host7: xid  6e4:
> Exchange timer armed : 0 msecs
> 18630754:May 21 11:01:53 poc2 kernel: [ 1528.191763] host7: xid  6e4:
> Exchange timed out
> 18630755:May 21 11:01:53 poc2 kernel: [ 1528.191777] ft_queue_data_in:
> Failed to send frame ffff8805bf245e00, xid <0x6e4>, remaining 458752,
> lso_max <0x10000>
> 18630756:May 21 11:01:53 poc2 kernel: [ 1528.191782] ft_queue_data_in:
> Failed to send frame ffff8805bf245e00, xid <0x6e4>, remaining 393216,
> lso_max <0x10000>
> 18630757:May 21 11:01:53 poc2 kernel: [ 1528.191786] ft_queue_data_in:
> Failed to send frame ffff8805bf245e00, xid <0x6e4>, remaining 327680,
> lso_max <0x10000>
> 18630758:May 21 11:01:53 poc2 kernel: [ 1528.191790] ft_queue_data_in:
> Failed to send frame ffff8805bf245e00, xid <0x6e4>, remaining 262144,
> lso_max <0x10000>
> 18630759:May 21 11:01:53 poc2 kernel: [ 1528.191794] ft_queue_data_in:
> Failed to send frame ffff8805bf245e00, xid <0x6e4>, remaining 196608,
> lso_max <0x10000>
> 18630760:May 21 11:01:53 poc2 kernel: [ 1528.191798] ft_queue_data_in:
> Failed to send frame ffff8805bf245e00, xid <0x6e4>, remaining 131072,
> lso_max <0x10000>
> 18630761:May 21 11:01:53 poc2 kernel: [ 1528.191801] ft_queue_data_in:
> Failed to send frame ffff8805bf245e00, xid <0x6e4>, remaining 65536,
> lso_max <0x10000>
> 18630762:May 21 11:01:53 poc2 kernel: [ 1528.191805] ft_queue_data_in:
> Failed to send frame ffff8805bf245e00, xid <0x6e4>, remaining 0,
> lso_max <0x10000>
> 18630777:May 21 11:01:53 poc2 kernel: [ 1528.195937] host7: xid  666:
> Exchange timer armed : 0 msecs
> 18630779:May 21 11:01:53 poc2 kernel: [ 1528.195983] host7: xid  666:
> Exchange timed out
> 18630780:May 21 11:01:53 poc2 kernel: [ 1528.196336] host7: xid  862:
> Exchange timer armed : 0 msecs
> 18630782:May 21 11:01:53 poc2 kernel: [ 1528.196348] host7: xid  862:
> Exchange timed out
> 18630879:May 21 11:01:53 poc2 kernel: [ 1528.211080] host7: xid  b61:
> Exchange timer armed : 0 msecs
> 18630881:May 21 11:01:53 poc2 kernel: [ 1528.211137] host7: xid  b61:
> Exchange timed out
> 18630898:May 21 11:01:53 poc2 kernel: [ 1528.214253] host7: xid  38e:
> Exchange timer armed : 0 msecs
> 18630900:May 21 11:01:53 poc2 kernel: [ 1528.214284] host7: xid  38e:
> Exchange timed out
> 18631007:May 21 11:01:53 poc2 kernel: [ 1528.236137] host7: xid  88d:
> Exchange timer armed : 0 msecs
> 18631009:May 21 11:01:53 poc2 kernel: [ 1528.236172] host7: xid  88d:
> Exchange timed out
> 18631176:May 21 11:01:53 poc2 kernel: [ 1528.264223] host7: xid  607:
> Exchange timer armed : 0 msecs
> 18631178:May 21 11:01:53 poc2 kernel: [ 1528.264239] host7: xid  607:
> Exchange timed out
> 18631185:May 21 11:01:53 poc2 kernel: [ 1528.266310] host7: xid  82d:
> Exchange timer armed : 0 msecs
> 18631187:May 21 11:01:53 poc2 kernel: [ 1528.266351] host7: xid  82d:
> Exchange timed out
> 18631226:May 21 11:01:53 poc2 kernel: [ 1528.275452] host7: xid  8a2:
> Exchange timer armed : 0 msecs
> 18631228:May 21 11:01:53 poc2 kernel: [ 1528.275463] host7: xid  8a2:
> Exchange timed out
> 18631553:May 21 11:01:53 poc2 kernel: [ 1528.335381] host7: xid  74a:
> Exchange timer armed : 0 msecs
> 18631555:May 21 11:01:53 poc2 kernel: [ 1528.335388] host7: xid  74a:
> Exchange timed out
> 18631561:May 21 11:01:53 poc2 kernel: [ 1528.336482] ft_queue_data_in:
> Failed to send frame ffff8802e25c1e00, xid <0x74a>, remaining 196608,
> lso_max <0x10000>
> 18631562:May 21 11:01:53 poc2 kernel: [ 1528.336486] ft_queue_data_in:
> Failed to send frame ffff8802e25c1e00, xid <0x74a>, remaining 131072,
> lso_max <0x10000>
> 18631568:May 21 11:01:53 poc2 kernel: [ 1528.340521] host7: xid  8c2:
> Exchange timer armed : 0 msecs
> 18631570:May 21 11:01:53 poc2 kernel: [ 1528.340667] host7: xid  8c2:
> Exchange timed out
> 18631575:May 21 11:01:53 poc2 kernel: [ 1528.341522] host7: xid  869:
> Exchange timer armed : 0 msecs
> 18631577:May 21 11:01:53 poc2 kernel: [ 1528.341539] host7: xid  869:
> Exchange timed out
> 18631660:May 21 11:01:53 poc2 kernel: [ 1528.356897] host7: xid  e4f:
> Exchange timer armed : 0 msecs
> 18631662:May 21 11:01:53 poc2 kernel: [ 1528.356975] host7: xid  e4f:
> Exchange timed out
> 18631825:May 21 11:01:53 poc2 kernel: [ 1528.398431] host7: xid  965:
> Exchange timer armed : 0 msecs
> 18631827:May 21 11:01:53 poc2 kernel: [ 1528.398509] host7: xid  965:
> Exchange timed out
> 18631844:May 21 11:01:53 poc2 kernel: [ 1528.401772] host7: xid  44e:
> Exchange timer armed : 0 msecs
> 18631846:May 21 11:01:53 poc2 kernel: [ 1528.401850] host7: xid  44e:
> Exchange timed out
> 18631859:May 21 11:01:53 poc2 kernel: [ 1528.404996] host7: xid  704:
> Exchange timer armed : 0 msecs
> 18631861:May 21 11:01:53 poc2 kernel: [ 1528.405043] host7: xid  704:
> Exchange timed out
> 18631876:May 21 11:01:53 poc2 kernel: [ 1528.412608] host7: xid  86d:
> Exchange timer armed : 0 msecs
> 18631878:May 21 11:01:53 poc2 kernel: [ 1528.412619] host7: xid  86d:
> Exchange timed out
> 18631919:May 21 11:01:53 poc2 kernel: [ 1528.431939] host7: xid  686:
> Exchange timer armed : 0 msecs
> 18631921:May 21 11:01:53 poc2 kernel: [ 1528.432000] host7: xid  686:
> Exchange timed out
> 18631950:May 21 11:01:53 poc2 kernel: [ 1528.441542] host7: xid  def:
> Exchange timer armed : 0 msecs
> 18631952:May 21 11:01:53 poc2 kernel: [ 1528.441573] host7: xid  def:
> Exchange timed out
> 18631967:May 21 11:01:53 poc2 kernel: [ 1528.447848] host7: xid  3ce:
> Exchange timer armed : 0 msecs
> 18631969:May 21 11:01:53 poc2 kernel: [ 1528.447863] host7: xid  3ce:
> Exchange timed out
> 18632008:May 21 11:01:53 poc2 kernel: [ 1528.453868] host7: xid  627:
> Exchange timer armed : 0 msecs
> 18632010:May 21 11:01:53 poc2 kernel: [ 1528.453879] host7: xid  627:
> Exchange timed out
> 18655343:May 21 11:01:56 poc2 kernel: [ 1531.747863] host7: xid  48e:
> Exchange timer armed : 0 msecs
> 18655345:May 21 11:01:56 poc2 kernel: [ 1531.747879] host7: xid  48e:
> Exchange timed out
> 18655414:May 21 11:01:56 poc2 kernel: [ 1531.758175] host7: xid  667:
> Exchange timer armed : 0 msecs
> 18655416:May 21 11:01:56 poc2 kernel: [ 1531.758204] host7: xid  667:
> Exchange timed out
> 18655533:May 21 11:01:56 poc2 kernel: [ 1531.776753] host7: xid  6c6:
> Exchange timer armed : 0 msecs
> 18655535:May 21 11:01:56 poc2 kernel: [ 1531.776765] host7: xid  6c6:
> Exchange timed out
> 18655832:May 21 11:01:56 poc2 kernel: [ 1531.818184] host7: xid  647:
> Exchange timer armed : 0 msecs
> 18655834:May 21 11:01:56 poc2 kernel: [ 1531.818227] host7: xid  647:
> Exchange timed out
> 18655893:May 21 11:01:56 poc2 kernel: [ 1531.825287] host7: xid  46e:
> Exchange timer armed : 0 msecs
> 18655895:May 21 11:01:56 poc2 kernel: [ 1531.825344] host7: xid  46e:
> Exchange timed out
> 18656116:May 21 11:01:56 poc2 kernel: [ 1531.863739] host7: xid  dab:
> Exchange timer armed : 0 msecs
> 18656118:May 21 11:01:56 poc2 kernel: [ 1531.863765] host7: xid  dab:
> Exchange timed out
> 18656124:May 21 11:01:56 poc2 kernel: [ 1531.865740] host7: xid  8ac:
> Exchange timer armed : 0 msecs
> 18656129:May 21 11:01:56 poc2 kernel: [ 1531.865800] host7: xid  8ac:
> Exchange timed out
> 18656130:May 21 11:01:56 poc2 kernel: [ 1531.865813] host7: xid  5e7:
> Exchange timer armed : 2000 msecs
> 18656267:May 21 11:01:56 poc2 kernel: [ 1531.891151] host7: xid  e6f:
> Exchange timer armed : 0 msecs
> 18656269:May 21 11:01:56 poc2 kernel: [ 1531.891177] host7: xid  e6f:
> Exchange timed out
> 18656320:May 21 11:01:56 poc2 kernel: [ 1531.899587] host7: xid  82c:
> Exchange timer armed : 0 msecs
> 18656322:May 21 11:01:56 poc2 kernel: [ 1531.899606] host7: xid  82c:
> Exchange timed out
> 18656355:May 21 11:01:56 poc2 kernel: [ 1531.906904] host7: xid  646:
> Exchange timer armed : 0 msecs
> 18656357:May 21 11:01:56 poc2 kernel: [ 1531.906952] host7: xid  646:
> Exchange timed out
> 18656608:May 21 11:01:56 poc2 kernel: [ 1531.960276] host7: xid  945:
> Exchange timer armed : 0 msecs
> 18656610:May 21 11:01:56 poc2 kernel: [ 1531.960293] host7: xid  945:
> Exchange timed out
> 18656613:May 21 11:01:56 poc2 kernel: [ 1531.960985] host7: xid  724:
> Exchange timer armed : 0 msecs
> 18656615:May 21 11:01:56 poc2 kernel: [ 1531.961002] host7: xid  724:
> Exchange timed out
> 18656634:May 21 11:01:56 poc2 kernel: [ 1531.968643] host7: xid  8ad:
> Exchange timer armed : 0 msecs
> 18656636:May 21 11:01:56 poc2 kernel: [ 1531.968660] host7: xid  8ad:
> Exchange timed out
> 18656641:May 21 11:01:56 poc2 kernel: [ 1531.974843] host7: xid  5e7:
> Exchange timer armed : 10000 msecs
> 18656642:May 21 11:01:56 poc2 kernel: [ 1531.974848] host7: xid  5e7:
> f_ctl  90000 seq  1
> 18656655:May 21 11:01:56 poc2 kernel: [ 1531.983896] host7: xid  687:
> Exchange timer armed : 0 msecs
> 18656657:May 21 11:01:56 poc2 kernel: [ 1531.983912] host7: xid  687:
> Exchange timed out
> 18656746:May 21 11:01:56 poc2 kernel: [ 1532.011935] host7: xid  5e7:
> Exchange timer armed : 10000 msecs
> 18656747:May 21 11:01:56 poc2 kernel: [ 1532.011940] host7: xid  5e7:
> f_ctl  90000 seq  2
> 18669030:May 21 11:01:58 poc2 kernel: [ 1533.870105] host7: xid  5e7:
> Exchange timed out
> 18841592:May 21 11:02:27 poc2 kernel: [ 1562.646991] host7: xid  90d:
> Exchange timer armed : 0 msecs
> 18841595:May 21 11:02:27 poc2 kernel: [ 1562.647066] host7: xid  90d:
> Exchange timed out
> 18842498:May 21 11:02:27 poc2 kernel: [ 1562.795804] host7: xid  8ac:
> Exchange timer armed : 0 msecs
> 18842500:May 21 11:02:27 poc2 kernel: [ 1562.795820] host7: xid  8ac:
> Exchange timed out
> 18842547:May 21 11:02:27 poc2 kernel: [ 1562.805335] host7: xid  94c:
> Exchange timer armed : 0 msecs
> 18842549:May 21 11:02:27 poc2 kernel: [ 1562.805351] host7: xid  94c:
> Exchange timed out
> 18842584:May 21 11:02:27 poc2 kernel: [ 1562.810550] host7: xid  f2f:
> Exchange timer armed : 0 msecs
> 18842586:May 21 11:02:27 poc2 kernel: [ 1562.810605] host7: xid  f2f:
> Exchange timed out
> 18842593:May 21 11:02:27 poc2 kernel: [ 1562.811735] host7: xid  7e6:
> Exchange timer armed : 0 msecs
> 18842595:May 21 11:02:27 poc2 kernel: [ 1562.811795] host7: xid  7e6:
> Exchange timed out
> 18842740:May 21 11:02:27 poc2 kernel: [ 1562.837943] host7: xid  e8b:
> Exchange timer armed : 0 msecs
> 18842742:May 21 11:02:27 poc2 kernel: [ 1562.837966] host7: xid  8cd:
> Exchange timer armed : 0 msecs
> 18842743:May 21 11:02:27 poc2 kernel: [ 1562.837968] host7: xid  e8b:
> Exchange timed out
> 18842745:May 21 11:02:27 poc2 kernel: [ 1562.838038] host7: xid  8cd:
> Exchange timed out
> 18842872:May 21 11:02:27 poc2 kernel: [ 1562.863661] host7: xid  824:
> Exchange timer armed : 0 msecs
> 18842874:May 21 11:02:27 poc2 kernel: [ 1562.863674] host7: xid  824:
> Exchange timed out
> 18842951:May 21 11:02:27 poc2 kernel: [ 1562.876390] host7: xid  766:
> Exchange timer armed : 0 msecs
> 18842953:May 21 11:02:27 poc2 kernel: [ 1562.876405] host7: xid  766:
> Exchange timed out
> 18842960:May 21 11:02:27 poc2 kernel: [ 1562.878444] host7: xid  ca8:
> Exchange timer armed : 0 msecs
> 18842962:May 21 11:02:27 poc2 kernel: [ 1562.878470] host7: xid  ca8:
> Exchange timed out
> 18843025:May 21 11:02:27 poc2 kernel: [ 1562.892335] host7: xid  a25:
> Exchange timer armed : 0 msecs
> 18843027:May 21 11:02:27 poc2 kernel: [ 1562.892400] host7: xid  a25:
> Exchange timed out
> 18843092:May 21 11:02:27 poc2 kernel: [ 1562.905946] host7: xid  be1:
> Exchange timer armed : 0 msecs
> 18843094:May 21 11:02:27 poc2 kernel: [ 1562.905971] host7: xid  be1:
> Exchange timed out
> 18843239:May 21 11:02:27 poc2 kernel: [ 1562.934586] host7: xid  52e:
> Exchange timer armed : 0 msecs
> 18843241:May 21 11:02:27 poc2 kernel: [ 1562.934601] host7: xid  52e:
> Exchange timed out
> 18843264:May 21 11:02:27 poc2 kernel: [ 1562.938141] host7: xid  bc8:
> Exchange timer armed : 0 msecs
> 18843266:May 21 11:02:27 poc2 kernel: [ 1562.938164] host7: xid  bc8:
> Exchange timed out
> 18843353:May 21 11:02:27 poc2 kernel: [ 1562.959150] host7: xid  c21:
> Exchange timer armed : 0 msecs
> 18843356:May 21 11:02:27 poc2 kernel: [ 1562.959194] ft_queue_data_in:
> 10 callbacks suppressed
> 18843357:May 21 11:02:27 poc2 kernel: [ 1562.959196] ft_queue_data_in:
> Failed to send frame ffff8800ba6e6400, xid <0xc21>, remaining 458752,
> lso_max <0x10000>
> 18843358:May 21 11:02:27 poc2 kernel: [ 1562.959199] ft_queue_data_in:
> Failed to send frame ffff8800ba6e6400, xid <0xc21>, remaining 393216,
> lso_max <0x10000>
> 18843359:May 21 11:02:27 poc2 kernel: [ 1562.959202] ft_queue_data_in:
> Failed to send frame ffff8800ba6e6400, xid <0xc21>, remaining 327680,
> lso_max <0x10000>
> 18843360:May 21 11:02:27 poc2 kernel: [ 1562.959204] ft_queue_data_in:
> Failed to send frame ffff8800ba6e6400, xid <0xc21>, remaining 262144,
> lso_max <0x10000>
> 18843361:May 21 11:02:27 poc2 kernel: [ 1562.959208] host7: xid  c21:
> Exchange timed out
> 18843362:May 21 11:02:27 poc2 kernel: [ 1562.959209] ft_queue_data_in:
> Failed to send frame ffff8800ba6e6400, xid <0xc21>, remaining 196608,
> lso_max <0x10000>
> 18843363:May 21 11:02:27 poc2 kernel: [ 1562.959212] ft_queue_data_in:
> Failed to send frame ffff8800ba6e6400, xid <0xc21>, remaining 131072,
> lso_max <0x10000>
> 18843364:May 21 11:02:27 poc2 kernel: [ 1562.959214] ft_queue_data_in:
> Failed to send frame ffff8800ba6e6400, xid <0xc21>, remaining 65536,
> lso_max <0x10000>
> 18843365:May 21 11:02:27 poc2 kernel: [ 1562.959217] ft_queue_data_in:
> Failed to send frame ffff8800ba6e6400, xid <0xc21>, remaining 0,
> lso_max <0x10000>
> 18843416:May 21 11:02:27 poc2 kernel: [ 1562.968554] host7: xid  f4f:
> Exchange timer armed : 0 msecs
> 18843426:May 21 11:02:27 poc2 kernel: [ 1562.968656] host7: xid  f4f:
> Exchange timed out
> 18843581:May 21 11:02:27 poc2 kernel: [ 1562.991761] host7: xid  7e4:
> Exchange timer armed : 0 msecs
> 18843583:May 21 11:02:27 poc2 kernel: [ 1562.991853] host7: xid  7e4:
> Exchange timed out
> 18843690:May 21 11:02:27 poc2 kernel: [ 1563.011282] host7: xid  a05:
> Exchange timer armed : 0 msecs
>
> Thanks
>
>
> On Wed, May 21, 2014 at 9:21 AM, Vasu Dev <vasu.dev@xxxxxxxxxxxxxxx> wrote:
>> On Tue, 2014-05-20 at 22:29 -0700, Jun Wu wrote:
>>> The MTU was 1500 on both the initiator and the target.
>>> I used "ethtool -K p4p1 tso off" to turn off TCP segmentation offload
>>> on all machines. The feature settings after the command are shown below.
>>>
>>> [root@poc3 jkong]# ethtool -k p4p1
>>> Features for p4p1:
>>> rx-checksumming: on
>>> tx-checksumming: on
>>>         tx-checksum-ipv4: on
>>>         tx-checksum-ip-generic: off [fixed]
>>>         tx-checksum-ipv6: on
>>>         tx-checksum-fcoe-crc: on [fixed]
>>>         tx-checksum-sctp: on
>>> scatter-gather: on
>>>         tx-scatter-gather: on
>>>         tx-scatter-gather-fraglist: off [fixed]
>>> tcp-segmentation-offload: off
>>>         tx-tcp-segmentation: off
>>>         tx-tcp-ecn-segmentation: off [fixed]
>>>         tx-tcp6-segmentation: off
>>> udp-fragmentation-offload: off [fixed]
>>> generic-segmentation-offload: on
>>> generic-receive-offload: on
>>> large-receive-offload: off
>>> rx-vlan-offload: on
>>> tx-vlan-offload: on
>>> ntuple-filters: off
>>> receive-hashing: on
>>> highdma: on [fixed]
>>> rx-vlan-filter: on
>>> vlan-challenged: off [fixed]
>>> tx-lockless: off [fixed]
>>> netns-local: off [fixed]
>>> tx-gso-robust: off [fixed]
>>> tx-fcoe-segmentation: on [fixed]
>>> tx-gre-segmentation: off [fixed]
>>> tx-ipip-segmentation: off [fixed]
>>> tx-sit-segmentation: off [fixed]
>>> tx-udp_tnl-segmentation: off [fixed]
>>> tx-mpls-segmentation: off [fixed]
>>> fcoe-mtu: on [fixed]
>>> tx-nocache-copy: on
>>> loopback: off [fixed]
>>> rx-fcs: off [fixed]
>>> rx-all: off
>>> tx-vlan-stag-hw-insert: off [fixed]
>>> rx-vlan-stag-hw-parse: off [fixed]
>>> rx-vlan-stag-filter: off [fixed]
>>> l2-fwd-offload: off
>>>
>>> Info on NIC drivers
>>>
>>> [root@poc3 jkong]# ethtool -i p4p1
>>> driver: ixgbe
>>> version: 3.15.1-k
>>> firmware-version: 0x80000208
>>> bus-info: 0000:08:00.0
>>> supports-statistics: yes
>>> supports-test: yes
>>> supports-eeprom-access: yes
>>> supports-register-dump: yes
>>> supports-priv-flags: no
>>>
>>> After the change, I repeated the same test and got a similar failure on
>>> the target side:
>>>
>>> [12253.032595] ft_queue_data_in: Failed to send frame
>>> ffff88062a638600, xid <0xa0c>, remaining 458752, lso_max <0x10000>
>>
>> It is a send frame failure. To find out what caused it, more debug
>> info from the low-level fcoe Tx path functions would be helpful. It can
>> be enabled with:
>>
>> # echo 0xFF > /sys/module/libfc/parameters/debug_logging
>> # echo 0x1 > /sys/module/fcoe/parameters/debug_logging
>>
>> Disabling Tx offloads may not help here and would instead slow down Tx,
>> so please restore them.
>>
>> Also, are you using a switch between the hosts and the target? In any
>> case, you would need DCB PFC or PAUSE enabled to avoid excessive Tx
>> retries, though that should not cause a send failure.
>>
>>
>> //Vasu
>>
>>
>>> [12253.032605] ft_queue_data_in: Failed to send frame
>>> ffff88062a638600, xid <0xa0c>, remaining 393216, lso_max <0x10000>
>>> [12253.032609] ft_queue_data_in: Failed to send frame
>>> ffff88062a638600, xid <0xa0c>, remaining 327680, lso_max <0x10000>
>>> [12253.032613] ft_queue_data_in: Failed to send frame
>>> ffff88062a638600, xid <0xa0c>, remaining 262144, lso_max <0x10000>
>>> [12284.299877] ft_queue_data_in: Failed to send frame
>>> ffff8803202ec600, xid <0x3a2>, remaining 196608, lso_max <0x10000>
>>> [12284.299885] ft_queue_data_in: Failed to send frame
>>> ffff8803202ec600, xid <0x3a2>, remaining 131072, lso_max <0x10000>
>>> [12284.299889] ft_queue_data_in: Failed to send frame
>>> ffff8803202ec600, xid <0x3a2>, remaining 65536, lso_max <0x10000>
>>> [12284.299892] ft_queue_data_in: Failed to send frame
>>> ffff8803202ec600, xid <0x3a2>, remaining 0, lso_max <0x10000>
>>> [12284.451810] ft_queue_data_in: Failed to send frame
>>> ffff88061deb1400, xid <0xecf>, remaining 458752, lso_max <0x10000>
>>> [12284.451818] ft_queue_data_in: Failed to send frame
>>> ffff88061deb1400, xid <0xecf>, remaining 393216, lso_max <0x10000>
>>> [12284.451824] ft_queue_data_in: Failed to send frame
>>> ffff88061deb1400, xid <0xecf>, remaining 327680, lso_max <0x10000>
>>> [12284.451827] ft_queue_data_in: Failed to send frame
>>> ffff88061deb1400, xid <0xecf>, remaining 262144, lso_max <0x10000>
>>> [12284.451831] ft_queue_data_in: Failed to send frame
>>> ffff88061deb1400, xid <0xecf>, remaining 196608, lso_max <0x10000>
>>> [12284.451834] ft_queue_data_in: Failed to send frame
>>> ffff88061deb1400, xid <0xecf>, remaining 131072, lso_max <0x10000>
>>> [12347.503478] ft_queue_data_in: 2 callbacks suppressed
>>> [12347.503486] ft_queue_data_in: Failed to send frame
>>> ffff8806142bc800, xid <0xb4f>, remaining 458752, lso_max <0x10000>
>>> [12347.503492] ft_queue_data_in: Failed to send frame
>>> ffff8806142bc800, xid <0xb4f>, remaining 393216, lso_max <0x10000>
>>> [12347.503496] ft_queue_data_in: Failed to send frame
>>> ffff8806142bc800, xid <0xb4f>, remaining 327680, lso_max <0x10000>
>>> [12347.503517] ft_queue_data_in: Failed to send frame
>>> ffff8806142bc800, xid <0xb4f>, remaining 262144, lso_max <0x10000>
>>> [12378.402412] ft_queue_data_in: Failed to send frame
>>> ffff88062ddeac00, xid <0x6a5>, remaining 458752, lso_max <0x10000>
>>> [12378.402420] ft_queue_data_in: Failed to send frame
>>> ffff88062ddeac00, xid <0x6a5>, remaining 393216, lso_max <0x10000>
>>> [12378.402425] ft_queue_data_in: Failed to send frame
>>> ffff88062ddeac00, xid <0x6a5>, remaining 327680, lso_max <0x10000>
>>> [12378.402428] ft_queue_data_in: Failed to send frame
>>> ffff88062ddeac00, xid <0x6a5>, remaining 262144, lso_max <0x10000>
>>> [12378.402432] ft_queue_data_in: Failed to send frame
>>> ffff88062ddeac00, xid <0x6a5>, remaining 196608, lso_max <0x10000>
>>> [12378.402436] ft_queue_data_in: Failed to send frame
>>> ffff88062ddeac00, xid <0x6a5>, remaining 131072, lso_max <0x10000>
>>> [12378.402440] ft_queue_data_in: Failed to send frame
>>> ffff88062ddeac00, xid <0x6a5>, remaining 65536, lso_max <0x10000>
>>> [12378.402444] ft_queue_data_in: Failed to send frame
>>> ffff88062ddeac00, xid <0x6a5>, remaining 0, lso_max <0x10000>
>>> [13049.224513] ft_queue_data_in: Failed to send frame
>>> ffff880614588c00, xid <0xd2f>, remaining 196608, lso_max <0x10000>
>>> [13049.224524] ft_queue_data_in: Failed to send frame
>>> ffff880614588c00, xid <0xd2f>, remaining 131072, lso_max <0x10000>
>>> [13049.224528] ft_queue_data_in: Failed to send frame
>>> ffff880614588c00, xid <0xd2f>, remaining 65536, lso_max <0x10000>
>>> [13049.224532] ft_queue_data_in: Failed to send frame
>>> ffff880614588c00, xid <0xd2f>, remaining 0, lso_max <0x10000>
>>> [13052.511306] ft_queue_data_in: Failed to send frame
>>> ffff88062d49f000, xid <0x8ae>, remaining 196608, lso_max <0x10000>
>>> [13052.511313] ft_queue_data_in: Failed to send frame
>>> ffff88062d49f000, xid <0x8ae>, remaining 131072, lso_max <0x10000>
>>> [13052.511317] ft_queue_data_in: Failed to send frame
>>> ffff88062d49f000, xid <0x8ae>, remaining 65536, lso_max <0x10000>
>>> [13052.511321] ft_queue_data_in: Failed to send frame
>>> ffff88062d49f000, xid <0x8ae>, remaining 0, lso_max <0x10000>
>>> [13087.976748] ft_queue_data_in: Failed to send frame
>>> ffff88031afc9c00, xid <0x96b>, remaining 458752, lso_max <0x10000>
>>> [13087.998453] ft_queue_data_in: Failed to send frame
>>> ffff88032c881200, xid <0xb23>, remaining 458752, lso_max <0x10000>
>>> [13087.998459] ft_queue_data_in: Failed to send frame
>>> ffff88032c881200, xid <0xb23>, remaining 393216, lso_max <0x10000>
>>> [13087.998463] ft_queue_data_in: Failed to send frame
>>> ffff88032c881200, xid <0xb23>, remaining 327680, lso_max <0x10000>
>>> [13087.998467] ft_queue_data_in: Failed to send frame
>>> ffff88032c881200, xid <0xb23>, remaining 262144, lso_max <0x10000>
>>> [13087.998470] ft_queue_data_in: Failed to send frame
>>> ffff88032c881200, xid <0xb23>, remaining 196608, lso_max <0x10000>
>>> [13087.998474] ft_queue_data_in: Failed to send frame
>>> ffff88032c881200, xid <0xb23>, remaining 131072, lso_max <0x10000>
>>> [13087.998478] ft_queue_data_in: Failed to send frame
>>> ffff88032c881200, xid <0xb23>, remaining 65536, lso_max <0x10000>
>>> [13087.998482] ft_queue_data_in: Failed to send frame
>>> ffff88032c881200, xid <0xb23>, remaining 0, lso_max <0x10000>
>>> [13119.177286] ft_queue_data_in: Failed to send frame
>>> ffff88062dff7400, xid <0xfcf>, remaining 458752, lso_max <0x10000>
>>> [13119.177297] ft_queue_data_in: Failed to send frame
>>> ffff88062dff7400, xid <0xfcf>, remaining 393216, lso_max <0x10000>
>>> [13119.177302] ft_queue_data_in: Failed to send frame
>>> ffff88062dff7400, xid <0xfcf>, remaining 327680, lso_max <0x10000>
>>> [13119.177307] ft_queue_data_in: Failed to send frame
>>> ffff88062dff7400, xid <0xfcf>, remaining 262144, lso_max <0x10000>
>>> [13119.177311] ft_queue_data_in: Failed to send frame
>>> ffff88062dff7400, xid <0xfcf>, remaining 196608, lso_max <0x10000>
>>> [13119.177316] ft_queue_data_in: Failed to send frame
>>> ffff88062dff7400, xid <0xfcf>, remaining 131072, lso_max <0x10000>
>>> [13119.177321] ft_queue_data_in: Failed to send frame
>>> ffff88062dff7400, xid <0xfcf>, remaining 65536, lso_max <0x10000>
>>> [13119.177325] ft_queue_data_in: Failed to send frame
>>> ffff88062dff7400, xid <0xfcf>, remaining 0, lso_max <0x10000>
>>> [13122.335322] ------------[ cut here ]------------
>>> [13122.335336] WARNING: CPU: 6 PID: 2165 at
>>> include/scsi/fc_frame.h:173 fcoe_percpu_receive_thread+0x507/0x53c
>>> [fcoe]()
>>> [13122.335338] Modules linked in: async_memcpy async_xor xor async_tx
>>> fcoe libfcoe tcm_fc libfc scsi_transport_fc scsi_tgt target_core_pscsi
>>> target_core_file target_core_iblock iscsi_target_mod target_core_mod
>>> 8021q garp mrp bridge stp llc iTCO_wdt gpio_ich iTCO_vendor_support
>>> coretemp kvm_intel kvm crc32c_intel microcode serio_raw i2c_i801
>>> lpc_ich mfd_core ses enclosure i7core_edac ioatdma edac_core shpchp
>>> acpi_cpufreq nfsd auth_rpcgss nfs_acl lockd sunrpc radeon
>>> drm_kms_helper ttm drm ixgbe igb ata_generic mdio pata_acpi ptp
>>> pata_jmicron pps_core i2c_algo_bit aacraid dca i2c_core [last
>>> unloaded: vd]
>>> [13122.335390] CPU: 6 PID: 2165 Comm: fcoethread/6 Tainted: GF
>>>  O 3.13.10-200.zbfcoepatch.fc20.x86_64 #1
>>> [13122.335392] Hardware name: Supermicro X8DTN/X8DTN, BIOS 2.1c       10/28/2011
>>> [13122.335394]  0000000000000009 ffff88062b04bdd0 ffffffff81687eac
>>> 0000000000000000
>>> [13122.335400]  ffff88062b04be08 ffffffff8106d4dd ffffe8ffffc41748
>>> ffff88062a444700
>>> [13122.335404]  ffff8800b7e926e8 0000000000000002 ffff88062b04be88
>>> ffff88062b04be18
>>> [13122.335408] Call Trace:
>>> [13122.335419]  [<ffffffff81687eac>] dump_stack+0x45/0x56
>>> [13122.335426]  [<ffffffff8106d4dd>] warn_slowpath_common+0x7d/0xa0
>>> [13122.335430]  [<ffffffff8106d5ba>] warn_slowpath_null+0x1a/0x20
>>> [13122.335435]  [<ffffffffa0651517>]
>>> fcoe_percpu_receive_thread+0x507/0x53c [fcoe]
>>> [13122.335440]  [<ffffffffa0651010>] ? fcoe_set_port_id+0x50/0x50 [fcoe]
>>> [13122.335446]  [<ffffffff8108f2f2>] kthread+0xd2/0xf0
>>> [13122.335450]  [<ffffffff8108f220>] ? insert_kthread_work+0x40/0x40
>>> [13122.335458]  [<ffffffff81696dbc>] ret_from_fork+0x7c/0xb0
>>> [13122.335461]  [<ffffffff8108f220>] ? insert_kthread_work+0x40/0x40
>>> [13122.335464] ---[ end trace e4509e1053f499ac ]---
>>>
>>> Thanks,
>>>
>>> Jun
>>>
>>> On Tue, May 20, 2014 at 11:03 AM, Nicholas A. Bellinger
>>> <nab@xxxxxxxxxxxxxxx> wrote:
>>> > On Mon, 2014-05-19 at 17:29 -0700, Jun Wu wrote:
>>> >> Hi Nicholas,
>>> >>
>>> >> We downloaded the source of our running kernel (3.13.10-200) and
>>> >> applied your percpu-ida pre-allocation regression fix, then compiled
>>> >> and installed the kernel. I repeated the same test three times,
>>> >> running 10 fio sessions to 10 drives on the target through fcoe vn2vn.
>>> >> In the first two tests, the target machine hung with the following
>>> >> messages:
>>> >>
>>> >> 15231 May 19 11:49:27 poc1 kernel: [ 1073.783229] ft_queue_data_in:
>>> >> Failed to send frame ffff880c0b188200, xid <0x2a5>, remaining 196608,
>>> >> lso_max <0x10000>
>>> >> 15232 May 19 11:49:27 poc1 kernel: [ 1073.783238] ft_queue_data_in:
>>> >> Failed to send frame ffff880c0b188200, xid <0x2a5>, remaining 131072,
>>> >> lso_max <0x10000>
>>> >> 15233 May 19 11:49:27 poc1 kernel: [ 1073.783242] ft_queue_data_in:
>>> >> Failed to send frame ffff880c0b188200, xid <0x2a5>, remaining 65536,
>>> >> lso_max <0x10000>
>>> >> 15234 May 19 11:49:27 poc1 kernel: [ 1073.783245] ft_queue_data_in:
>>> >> Failed to send frame ffff880c0b188200, xid <0x2a5>, remaining 0,
>>> >> lso_max <0x10000>
>>> >> 15235 May 19 11:49:30 poc1 kernel: [ 1076.907061] ft_queue_data_in:
>>> >> Failed to send frame ffff880c1d1df000, xid <0x305>, remaining 196608,
>>> >> lso_max <0x10000>
>>> >> 15236 May 19 11:49:30 poc1 kernel: [ 1076.907068] ft_queue_data_in:
>>> >> Failed to send frame ffff880c1d1df000, xid <0x305>, remaining 131072,
>>> >> lso_max <0x10000>
>>> >> 15237 May 19 11:49:30 poc1 kernel: [ 1076.907073] ft_queue_data_in:
>>> >> Failed to send frame ffff880c1d1df000, xid <0x305>, remaining 65536,
>>> >> lso_max <0x10000>
>>> >> 15238 May 19 11:49:30 poc1 kernel: [ 1076.907077] ft_queue_data_in:
>>> >> Failed to send frame ffff880c1d1df000, xid <0x305>, remaining 0,
>>> >> lso_max <0x10000>
>>> >> 15239 May 19 11:50:01 poc1 kernel: [ 1107.918910] ft_queue_data_in:
>>> >> Failed to send frame ffff88060cd40800, xid <0x3cb>, remaining 458752,
>>> >> lso_max <0x10000>
>>> >> 15240 May 19 11:50:01 poc1 kernel: [ 1107.918918] ft_queue_data_in:
>>> >> Failed to send frame ffff88060cd40800, xid <0x3cb>, remaining 393216,
>>> >> lso_max <0x10000>
>>> >> 15241 May 19 11:50:01 poc1 kernel: [ 1107.918922] ft_queue_data_in:
>>> >> Failed to send frame ffff88060cd40800, xid <0x3cb>, remaining 327680,
>>> >> lso_max <0x10000>
>>> >> 15242 May 19 11:50:01 poc1 kernel: [ 1107.918925] ft_queue_data_in:
>>> >> Failed to send frame ffff88060cd40800, xid <0x3cb>, remaining 262144,
>>> >> lso_max <0x10000>
>>> >> 15243 May 19 11:50:01 poc1 kernel: [ 1107.918929] ft_queue_data_in:
>>> >> Failed to send frame ffff88060cd40800, xid <0x3cb>, remaining 196608,
>>> >> lso_max <0x10000>
>>> >> 15244 May 19 11:50:01 poc1 kernel: [ 1107.918932] ft_queue_data_in:
>>> >> Failed to send frame ffff88060cd40800, xid <0x3cb>, remaining 131072,
>>> >> lso_max <0x10000>
>>> >> 15245 May 19 11:50:01 poc1 kernel: [ 1107.918936] ft_queue_data_in:
>>> >> Failed to send frame ffff88060cd40800, xid <0x3cb>, remaining 65536,
>>> >> lso_max <0x10000>
>>> >> 15246 May 19 11:50:01 poc1 kernel: [ 1107.918939] ft_queue_data_in:
>>> >> Failed to send frame ffff88060cd40800, xid <0x3cb>, remaining 0,
>>> >> lso_max <0x10000>
>>> >> 15247 May 19 11:50:05 poc1 kernel: [ 1111.450900] ft_queue_data_in:
>>> >> Failed to send frame ffff880c0b24ca00, xid <0xea6>, remaining 196608,
>>> >> lso_max <0x10000>
>>> >> 15248 May 19 11:50:05 poc1 kernel: [ 1111.450908] ft_queue_data_in:
>>> >> Failed to send frame ffff880c0b24ca00, xid <0xea6>, remaining 131072,
>>> >> lso_max <0x10000>
>>> >> 15249 May 19 11:51:12 poc1 kernel: [ 1178.698434] ft_queue_data_in: 6
>>> >> callbacks suppressed
>>> >> 15250 May 19 11:51:12 poc1 kernel: [ 1178.698440] ft_queue_data_in:
>>> >> Failed to send frame ffff88060ba97400, xid <0xb8a>, remaining 458752,
>>> >> lso_max <0x10000>
>>> >> 15251 May 19 11:51:12 poc1 kernel: [ 1178.698446] ft_queue_data_in:
>>> >> Failed to send frame ffff88060ba97400, xid <0xb8a>, remaining 393216,
>>> >> lso_max <0x10000>
>>> >> 15252 May 19 11:51:12 poc1 kernel: [ 1178.698449] ft_queue_data_in:
>>> >> Failed to send frame ffff88060ba97400, xid <0xb8a>, remaining 327680,
>>> >> lso_max <0x10000>
>>> >> 15253 May 19 11:51:12 poc1 kernel: [ 1178.698453] ft_queue_data_in:
>>> >> Failed to send frame ffff88060ba97400, xid <0xb8a>, remaining 262144,
>>> >> lso_max <0x10000>
>>> >> 15254 May 19 11:51:12 poc1 kernel: [ 1178.698456] ft_queue_data_in:
>>> >> Failed to send frame ffff88060ba97400, xid <0xb8a>, remaining 196608,
>>> >> lso_max <0x10000>
>>> >> 15255 May 19 11:51:12 poc1 kernel: [ 1178.698460] ft_queue_data_in:
>>> >> Failed to send frame ffff88060ba97400, xid <0xb8a>, remaining 131072,
>>> >> lso_max <0x10000>
>>> >> 15256 May 19 11:51:12 poc1 kernel: [ 1178.698463] ft_queue_data_in:
>>> >> Failed to send frame ffff88060ba97400, xid <0xb8a>, remaining 65536,
>>> >> lso_max <0x10000>
>>> >> 15257 May 19 11:51:12 poc1 kernel: [ 1178.698467] ft_queue_data_in:
>>> >> Failed to send frame ffff88060ba97400, xid <0xb8a>, remaining 0,
>>> >> lso_max <0x10000>
>>> >>
>>> >
>>> > The call into lport->tt.seq_send() libfc code is failing to send
>>> > outgoing solicited data-in.  From the output, note that the LSO (large
>>> > send offload, aka TCP segmentation offload) feature has been enabled by
>>> > the underlying NIC hardware.
>>> >
>>> > So in order to isolate possible issues, I'd recommend:
>>> >
>>> > - Disabling hardware offloads on both initiator and target sides (LRO +
>>> >   LSO) using ethtool -K
>>> > - Disabling any jumbo frames settings on either side
>>> >
>>> > Are there any other non-standard network and/or switch settings in
>>> > place?  Also, please confirm what your NIC + switch setup looks
>>> > like.
>>> >
>>> > Rob & Open-FCoE folks, is there anything else to take into consideration
>>> > here..?
>>> >
>>> >>
>>> >> I didn't see the previous message "unable to handle kernel NULL
>>> >> pointer dereference at 0000000000000048". So it must have been fixed
>>> >> by your change.
>>> >>
>>> >
>>> > Thanks for confirming that bit.
>>> >
>>> > --nab
>>> >
>>
>>
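The ft_queue_data_in failures quoted above are hard to compare between runs
because each message is wrapped over two or three lines and buried under quote
markers. In those logs, "remaining" drops by 65536 (the lso_max value 0x10000)
from one message to the next within an exchange. Below is a minimal Python
sketch for grouping the failures by xid. It assumes the message format is
exactly what appears in the logs above; the names strip_quote and summarize are
hypothetical helpers written for this illustration and are not part of any
kernel or Open-FCoE tooling.

```python
import re
from collections import defaultdict

# Matches one "Failed to send frame" message after wrapped lines are rejoined.
# The field layout is taken from the log lines quoted in this thread.
PATTERN = re.compile(
    r"ft_queue_data_in:\s+Failed to send frame\s+(?P<frame>[0-9a-f]+),\s+"
    r"xid <0x(?P<xid>[0-9a-f]+)>,\s+remaining (?P<remaining>\d+),\s+"
    r"lso_max <0x(?P<lso_max>[0-9a-f]+)>"
)


def strip_quote(line):
    """Drop leading mail quote markers such as '>>> >> ' from one line."""
    return re.sub(r"^\s*(?:>\s?)+", "", line).rstrip()


def summarize(text):
    """Return {xid: [remaining, ...]} for every failed data-in frame."""
    # Rejoin wrapped lines with single spaces so each message is contiguous.
    joined = " ".join(strip_quote(line) for line in text.splitlines())
    per_xid = defaultdict(list)
    for match in PATTERN.finditer(joined):
        per_xid[match.group("xid")].append(int(match.group("remaining")))
    return dict(per_xid)


sample = """\
>>> [12253.032595] ft_queue_data_in: Failed to send frame
>>> ffff88062a638600, xid <0xa0c>, remaining 458752, lso_max <0x10000>
>>> [12253.032605] ft_queue_data_in: Failed to send frame
>>> ffff88062a638600, xid <0xa0c>, remaining 393216, lso_max <0x10000>
"""

for xid, remaining in summarize(sample).items():
    print(xid, remaining)
```

For the two sample messages this prints "a0c [458752, 393216]". The
"callbacks suppressed" lines are ignored, so a gap in the per-xid list may
simply mean the kernel rate-limited some messages rather than that those
frames were sent successfully.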



