Re: [RFC] Clear out stuck ops to prevent iSER from going init D state

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



On Thu, Jan 26, 2017 at 3:52 AM, Sagi Grimberg <sagi@xxxxxxxxxxx> wrote:
>> It is getting stuck in target_wait_for_sess_cmds. ib_drain_qp is
>> returning just fine. I put in a bunch of debug statements and here is
>> a clip of them from a previous test.
>
>
> It won't make a difference, and we need both the rq and the sq drained
> before we continue with session teardown.
>
> Robert,
>
> I'm still missing the information of which commands are pending without
> a completion.
>
> Please activate target_wait_for_sess_cmds() debug logging.
>
> You can turn on the whole file with:
> echo "file target_core_transport.c +p" >
> /sys/kernel/debug/dynamic_debug/control

OK, I'm using commit bb3d5f95f46dfe19554820dbce620d31504dafb2 from the
bvanassche/scsi-target-for-v4.11 tree. I've exported the target with
InitialR2T=No then I ran
echo 4 > /sys/module/ib_isert/parameters/debug_level
echo "file target_core_transport.c +p" > /sys/kernel/debug/dynamic_debug/control

but it seems there is no new information during the period of the hang.

[  806.924011] isert: isert_recv_done: DMA: 0x7eff323000, iSCSI
opcode: 0x01, ITT: 0x000001a3, flags: 0xa1 dlen: 4096
[  806.924011] isert: isert_recv_done: ISER ISCSI_CTRL PDU
[  806.924014] isert: isert_create_send_desc: tx_desc ffff887eff0e9c30
lkey mismatch, fixing
[  806.924015] isert: isert_init_tx_hdrs: Setup tx_sg[0].addr:
0x7eff0e9c30 length: 76 lkey: 0x425a
[  806.924016] isert: isert_put_response: Posting SCSI Response
[  806.924017] isert: isert_create_send_desc: tx_desc ffff887eff0ed278
lkey mismatch, fixing
[  806.924017] isert: isert_init_tx_hdrs: Setup tx_sg[0].addr:
0x7eff0ed278 length: 76 lkey: 0x425a
[  806.924018] isert: isert_put_response: Posting SCSI Response
[  806.924019] isert: isert_create_send_desc: tx_desc ffff887eff0f14d0
lkey mismatch, fixing
[  806.924019] isert: isert_init_tx_hdrs: Setup tx_sg[0].addr:
0x7eff0f14d0 length: 76 lkey: 0x425a
[  806.924020] isert: isert_put_response: Posting SCSI Response
[  806.924021] isert: isert_create_send_desc: tx_desc ffff887eff0f26e8
lkey mismatch, fixing
[  806.924022] isert: isert_init_tx_hdrs: Setup tx_sg[0].addr:
0x7eff0f26e8 length: 76 lkey: 0x425a
[  806.924022] isert: isert_put_response: Posting SCSI Response
[  817.066975] isert: isert_cma_handler: disconnected (10): status 0
id ffff887efe7be400 np ffff887f1c648d80
[  817.077425] isert: isert_wait_conn: Starting conn ffff887f25f84000
[  817.084212] isert: isert_conn_terminate: Terminating conn
ffff887f25f84000 state 3
[  819.202986] isert: isert_cma_handler: timewait exit (15): status 0
id ffff887efe7be400 np ffff887f1c648d80
[  819.607370] isert: isert_cma_handler: connect request (4): status 0
id ffff887efec36400 np ffff887f1c648d80
[  819.617859] isert: isert_connect_request: cma_id: ffff887efec36400,
portal: ffff887f1c648d80
[  819.617873] isert: isert_device_get: Found iser device
ffff887f20df5f40 refcount 2
[  819.626055] isert: isert_set_nego_params: Using initiator_depth: 16
[  819.626056] isert: isert_set_nego_params: Using remote invalidation
[  819.632925] isert: isert_comp_get: conn ffff887f25f86000, using
comp ffff887efe7be818 min_index: 1
[  819.643201] isert: isert_login_post_recv: Setup sge: addr:
7ebdb50000 length: 8268 0x0000425a
[  819.644010] isert: isert_cma_handler: established (9): status 0 id
ffff887efec36400 np ffff887f1c648d80
[  819.653970] isert: isert_connected_handler: conn ffff887f25f86000
[  819.660627] isert: isert_connected_handler: np ffff887f1c648d80:
Allow accept_np to continue
[  819.669680] isert: isert_accept_np: Processing isert_conn:
ffff887f25f86000
[  819.669688] isert: isert_get_login_rx: before login_req comp conn:
ffff887f25f86000
[  819.872665] isert: isert_rx_login_req: conn ffff887f25f86000
[  819.879052] isert: isert_rx_login_req: Using login payload size:
504, rx_buflen: 504 MAX_KEY_VALUE_PAIRS: 8192
[  819.879053] isert: isert_get_login_rx: before login_comp conn:
ffff887f1a457800
[  819.886973] isert: isert_get_login_rx: processing login->req:
ffff887effcb13a4
[  819.894936] isert: isert_get_sup_prot_ops: conn ffff887f25f86000 PI
offload disabled
[  834.639402] isert: isert_cma_handler: disconnected (10): status 0
id ffff887efec36400 np ffff887f1c648d80
[  835.203732] iSCSI Login timeout on Network Portal 0.0.0.0:3260
[  836.893231] isert: isert_cma_handler: connect request (4): status 0
id ffff887efec31800 np ffff887f1c648d80
[  836.903737] isert: isert_connect_request: cma_id: ffff887efec31800,
portal: ffff887f1c648d80
[  836.903752] isert: isert_device_get: Found iser device
ffff887f20df5f40 refcount 3
[  836.911938] isert: isert_set_nego_params: Using initiator_depth: 16
[  836.911951] isert: isert_set_nego_params: Using remote invalidation
[  836.918819] isert: isert_comp_get: conn ffff887f25f82000, using
comp ffff887efe7be830 min_index: 2
[  836.928920] isert: isert_login_post_recv: Setup sge: addr:
7f11f34000 length: 8268 0x0000425a
[  836.929796] isert: isert_cma_handler: established (9): status 0 id
ffff887efec31800 np ffff887f1c648d80
[  836.939815] isert: isert_connected_handler: conn ffff887f25f82000
[  836.946522] isert: isert_connected_handler: np ffff887f1c648d80:
Allow accept_np to continue
[  851.970046] isert: isert_cma_handler: disconnected (10): status 0
id ffff887efec31800 np ffff887f1c648d80
[  851.980368] isert: isert_conn_terminate: Terminating conn
ffff887f25f82000 state 1
[  851.989703] isert: isert_release_work: Starting release conn
ffff887f25f82000
[  851.997449] isert: isert_release_work: Destroying conn
ffff887f25f82000
[  852.004665] isert: isert_release_kref: conn ffff887f25f82000 final
kref kworker/u66:0/80
[  852.013351] isert: isert_connect_release: conn ffff887f25f82000
[  852.014178] isert: isert_device_put: device ffff887f20df5f40
refcount 2
[  854.223515] isert: isert_cma_handler: connect request (4): status 0
id ffff887efec33800 np ffff887f1c648d80
[  854.234033] isert: isert_connect_request: cma_id: ffff887efec33800,
portal: ffff887f1c648d80
[  854.234039] isert: isert_device_get: Found iser device
ffff887f20df5f40 refcount 3
[  854.242271] isert: isert_set_nego_params: Using initiator_depth: 16
[  854.242272] isert: isert_set_nego_params: Using remote invalidation
[  854.249204] isert: isert_comp_get: conn ffff887f25f82000, using
comp ffff887efe7be830 min_index: 2
[  854.259359] isert: isert_login_post_recv: Setup sge: addr:
7f11f34000 length: 8268 0x0000425a
[  854.260222] isert: isert_cma_handler: established (9): status 0 id
ffff887efec33800 np ffff887f1c648d80
[  854.270220] isert: isert_connected_handler: conn ffff887f25f82000
[  854.276901] isert: isert_connected_handler: np ffff887f1c648d80:
Allow accept_np to continue
[  869.295790] isert: isert_cma_handler: disconnected (10): status 0
id ffff887efec33800 np ffff887f1c648d80

I don't think journal is able to log all the messages before the hang
if any of those are important,

Jan 26 10:31:08 prv-0-13-roberttest kernel: isert: isert_get_login_rx:
before login_req comp conn: ffff887f25f84000
Jan 26 10:31:08 prv-0-13-roberttest kernel: isert: isert_rx_login_req:
conn ffff887f25f84000
Jan 26 10:31:08 prv-0-13-roberttest kernel: isert: isert_get_login_rx:
before login_comp conn: ffff887f23c95800
Jan 26 10:31:08 prv-0-13-roberttest kernel: isert: isert_get_login_rx:
processing login->req: ffff883f6614f8e4
Jan 26 10:31:08 prv-0-13-roberttest kernel: isert:
isert_get_sup_prot_ops: conn ffff887f25f84000 PI offload disabled
Jan 26 10:31:08 prv-0-13-roberttest kernel:
iSCSI/iqn.2004-12.com.betterservers:10.91.0.14: Unsupported SCSI
Opcode 0xa3, sending CHECK_CONDITION.
Jan 26 10:31:30 prv-0-13-roberttest journal: Missed 639 kernel messages
Jan 26 10:31:30 prv-0-13-roberttest journal: Missed 101 kernel messages
Jan 26 10:31:30 prv-0-13-roberttest journal: Missed 62 kernel messages
Jan 26 10:31:30 prv-0-13-roberttest journal: Missed 60 kernel messages
Jan 26 10:31:30 prv-0-13-roberttest journal: Missed 62 kernel messages
Jan 26 10:31:30 prv-0-13-roberttest journal: Missed 52 kernel messages
Jan 26 10:31:30 prv-0-13-roberttest journal: Missed 68 kernel messages
Jan 26 10:31:30 prv-0-13-roberttest journal: Missed 56 kernel messages
Jan 26 10:31:30 prv-0-13-roberttest journal: Missed 57 kernel messages
Jan 26 10:31:30 prv-0-13-roberttest journal: Missed 49 kernel messages
Jan 26 10:31:30 prv-0-13-roberttest journal: Missed 63 kernel messages
...
Jan 26 10:31:31 prv-0-13-roberttest journal: Missed 99 kernel messages
Jan 26 10:31:31 prv-0-13-roberttest journal: Missed 123 kernel messages
Jan 26 10:31:31 prv-0-13-roberttest journal: Missed 216 kernel messages
Jan 26 10:31:31 prv-0-13-roberttest journal: Missed 64 kernel messages
Jan 26 10:31:31 prv-0-13-roberttest journal: Missed 7 kernel messages
Jan 26 10:31:34 prv-0-13-roberttest kernel: WRITE_SAME sectors: 4088
exceeds max_write_same_len: 0
Jan 26 10:31:34 prv-0-13-roberttest rsyslogd-2177: imjournal: begin to
drop messages due to rate-limiting
Jan 26 10:40:03 prv-0-13-roberttest rsyslogd-2177: imjournal: 1210115
messages lost due to rate-limiting
Jan 26 10:40:03 prv-0-13-roberttest kernel: isert: isert_cma_handler:
connect request (4): status 0 id ffff887efec32c00 np ffff887f1c648d80
Jan 26 10:40:03 prv-0-13-roberttest kernel: isert: isert_device_get:
Found iser device ffff887f20df5f40 refcount 3
Jan 26 10:40:03 prv-0-13-roberttest kernel: isert:
isert_set_nego_params: Using remote invalidation
Jan 26 10:40:03 prv-0-13-roberttest kernel: isert: isert_comp_get:
conn ffff887f25f82000, using comp ffff887efe7be830 min_index: 2
Jan 26 10:40:03 prv-0-13-roberttest kernel: isert: isert_cma_handler:
established (9): status 0 id ffff887efec32c00 np ffff887f1c648d80
Jan 26 10:40:03 prv-0-13-roberttest kernel: isert:
isert_connected_handler: conn ffff887f25f82000
Jan 26 10:40:03 prv-0-13-roberttest kernel: isert:
isert_connected_handler: np ffff887f1c648d80: Allow accept_np to
continue
Jan 26 10:40:03 prv-0-13-roberttest systemd: Removed slice user-0.slice.
Jan 26 10:40:03 prv-0-13-roberttest systemd: Stopping user-0.slice.
Jan 26 10:40:18 prv-0-13-roberttest kernel: isert: isert_cma_handler:
disconnected (10): status 0 id ffff887efec32c00 np ffff887f1c648d80
Jan 26 10:40:18 prv-0-13-roberttest kernel: isert:
isert_conn_terminate: Terminating conn ffff887f25f82000 state 1
Jan 26 10:40:18 prv-0-13-roberttest kernel: isert: isert_release_work:
Starting release conn ffff887f25f82000
Jan 26 10:40:18 prv-0-13-roberttest kernel: isert: isert_release_work:
Destroying conn ffff887f25f82000
Jan 26 10:40:18 prv-0-13-roberttest kernel: isert: isert_release_kref:
conn ffff887f25f82000 final kref kworker/u66:2/1743
Jan 26 10:40:18 prv-0-13-roberttest kernel: isert: isert_device_put:
device ffff887f20df5f40 refcount 2
Jan 26 10:40:21 prv-0-13-roberttest kernel: isert: isert_cma_handler:
connect request (4): status 0 id ffff887efec31c00 np ffff887f1c648d80
Jan 26 10:40:21 prv-0-13-roberttest kernel: isert: isert_device_get:
Found iser device ffff887f20df5f40 refcount 3
Jan 26 10:40:21 prv-0-13-roberttest kernel: isert:
isert_set_nego_params: Using remote invalidation
Jan 26 10:40:21 prv-0-13-roberttest kernel: isert: isert_comp_get:
conn ffff887f25f82000, using comp ffff887efe7be830 min_index: 2
Jan 26 10:40:21 prv-0-13-roberttest kernel: isert: isert_cma_handler:
established (9): status 0 id ffff887efec31c00 np ffff887f1c648d80
Jan 26 10:40:21 prv-0-13-roberttest kernel: isert:
isert_connected_handler: conn ffff887f25f82000
Jan 26 10:40:21 prv-0-13-roberttest kernel: isert:
isert_connected_handler: np ffff887f1c648d80: Allow accept_np to
continue
Jan 26 10:40:36 prv-0-13-roberttest kernel: isert: isert_cma_handler:
disconnected (10): status 0 id ffff887efec31c00 np ffff887f1c648d80
Jan 26 10:40:36 prv-0-13-roberttest kernel: isert:
isert_conn_terminate: Terminating conn ffff887f25f82000 state 1
Jan 26 10:40:36 prv-0-13-roberttest kernel: isert: isert_release_work:
Starting release conn ffff887f25f82000
Jan 26 10:40:36 prv-0-13-roberttest kernel: isert: isert_release_work:
Destroying conn ffff887f25f82000

I thought I saw some new messages in dmesg in regards to the new debug
command when I logged into the target, but I can't find them in
/var/log/messages now. I assume that I would find the messages in
dmesg, please let me know if I should be looking elsewhere.

Thanks for your help and patience.

----------------
Robert LeBlanc
PGP Fingerprint 79A2 9CA4 6CC4 45DD A904  C70E E654 3BB2 FA62 B9F1
--
To unsubscribe from this list: send the line "unsubscribe linux-rdma" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at  http://vger.kernel.org/majordomo-info.html



[Index of Archives]     [Linux USB Devel]     [Video for Linux]     [Linux Audio Users]     [Photo]     [Yosemite News]     [Yosemite Photos]     [Linux Kernel]     [Linux SCSI]     [XFree86]
  Powered by Linux