On 7/18/22 3:45 AM, Dmitry Bogdanov wrote: > Hi Mike, > > On Thu, Jul 14, 2022 at 11:44:25AM -0500, Mike Christie wrote: >> >> On 7/13/22 3:42 PM, Dmitry Bogdanov wrote: >>> Sometimes an initiator does not send data for WRITE commands and tries >>> to abort it. The abort hangs waiting for frontend driver completion. >>> iSCSI driver waits for for data and that timeout eventually initiates >>> connection reinstatment. The connection closing releases the commands in >>> the connection, but those aborted commands still did not handle the >>> abort and did not decrease a command ref counter. Because of that the >>> connection reinstatement hangs indefinitely and prevents re-login for >>> that initiator. >>> >>> This patch adds a handling in TCM of the abort for the WRITE_PENDING >>> commands at connection closing moment to make it possible to release >>> them. >>> >>> Signed-off-by: Dmitry Bogdanov <d.bogdanov@xxxxxxxxx> >>> --- >>> drivers/target/iscsi/iscsi_target.c | 13 ++++++++++--- >>> 1 file changed, 10 insertions(+), 3 deletions(-) >>> >>> diff --git a/drivers/target/iscsi/iscsi_target.c b/drivers/target/iscsi/iscsi_target.c >>> index e368f038ff5c..27eca5e72f52 100644 >>> --- a/drivers/target/iscsi/iscsi_target.c >>> +++ b/drivers/target/iscsi/iscsi_target.c >>> @@ -26,6 +26,7 @@ >>> #include <target/target_core_base.h> >>> #include <target/target_core_fabric.h> >>> >>> +#include <target/target_core_backend.h> >>> #include <target/iscsi/iscsi_target_core.h> >>> #include "iscsi_target_parameters.h" >>> #include "iscsi_target_seq_pdu_list.h" >>> @@ -4171,7 +4172,8 @@ static void iscsit_release_commands_from_conn(struct iscsit_conn *conn) >>> >>> if (se_cmd->se_tfo != NULL) { >>> spin_lock_irq(&se_cmd->t_state_lock); >>> - if (se_cmd->transport_state & CMD_T_ABORTED) { >>> + if (se_cmd->t_state != TRANSPORT_WRITE_PENDING && >>> + se_cmd->transport_state & CMD_T_ABORTED) { >>> /* >>> * LIO's abort path owns the cleanup for this, >>> * so put it back on the list and let >>> @@ -4191,8 +4193,13 @@ static void iscsit_release_commands_from_conn(struct iscsit_conn *conn) >>> list_del_init(&cmd->i_conn_node); >>> >>> iscsit_increment_maxcmdsn(cmd, sess); >>> - iscsit_free_cmd(cmd, true); >>> - >>> + if (cmd->se_cmd.t_state == TRANSPORT_WRITE_PENDING && >>> + cmd->se_cmd.transport_state & CMD_T_ABORTED) { >>> + /* handle an abort in TCM */ >>> + target_complete_cmd(&cmd->se_cmd, SAM_STAT_TASK_ABORTED); >>> >> >> Will we have an extra ref left on the se_cmd if TAS is used so the se_cmd >> does not get freed? >> >> For TAS, it looks like we would do: >> >> - target_handle_abort -> queue_status. This would not do anything because >> before calling iscsit_release_commands_from_conn we have killed the iscsi tx >> thread. >> >> - target_handle_abort -> transport_cmd_check_stop_to_fabric -> check_stop_free -> >> target_put_sess_cmd. >> >> iscsi creates the se_cmd with TARGET_SCF_ACK_KREF set so do we have one ref >> left? > Yes, you are right. TAS case is not covered by my patch. But that is > actually another bug (that iSCSI does not complete responses in case of > connection closed). What do you mean this is a bug already? I mean is there a leak or spec violation? Spec wise we don't need to send a response to the initiator when the connection is closed for a single connection session and ERL=0. We just can't because the connection is down. And the initiator knows it will not be getting a response because the connection is gone and cleans up on it's side. If TRANSPORT_WRITE_PENDING is not set then we will drive the cleanup of commands internally and not leak memory right? Is there a bug in this path where we also leak memory? If that path is ok, can't we handle the TRANSPORT_WRITE_PENDING is set case in a similar way? This was the patch I had proposed when we discussed this last time. It's completely untested, but just to show what I mean. I think it should probably check the t_state like how you did it instead of adding a transport_state bit. The idea is that the command is never going to get completed and we can't send a response because the connection is down. The iscsi layer knows all this and that it hasn't sent the cmd to LIO core for backend processing, so it forces the cleanup. diff --git a/drivers/target/iscsi/iscsi_target.c b/drivers/target/iscsi/iscsi_target.c index 2c54c5d8412d..d0e80a2b653b 100644 --- a/drivers/target/iscsi/iscsi_target.c +++ b/drivers/target/iscsi/iscsi_target.c @@ -4088,7 +4088,8 @@ static void iscsit_release_commands_from_conn(struct iscsi_conn *conn) if (se_cmd->se_tfo != NULL) { spin_lock_irq(&se_cmd->t_state_lock); - if (se_cmd->transport_state & CMD_T_ABORTED) { + if (se_cmd->transport_state & CMD_T_ABORTED && + se_cmd->transport_state & CMD_T_SUBMITTED) { /* * LIO's abort path owns the cleanup for this, * so put it back on the list and let @@ -4096,7 +4097,7 @@ static void iscsit_release_commands_from_conn(struct iscsi_conn *conn) */ list_move_tail(&cmd->i_conn_node, &conn->conn_cmd_list); - } else { + } else if (!(se_cmd->transport_state & CMD_T_ABORTED)) { se_cmd->transport_state |= CMD_T_FABRIC_STOP; } spin_unlock_irq(&se_cmd->t_state_lock); @@ -4108,8 +4109,12 @@ static void iscsit_release_commands_from_conn(struct iscsi_conn *conn) list_del_init(&cmd->i_conn_node); iscsit_increment_maxcmdsn(cmd, sess); - iscsit_free_cmd(cmd, true); + if (se_cmd->transport_state & CMD_T_ABORTED && + !(se_cmd->transport_state & CMD_T_SUBMITTED)) + iscsit_free_cmd(cmd, false, true); + else + iscsit_free_cmd(cmd, true, false); } } diff --git a/drivers/target/iscsi/iscsi_target_util.c b/drivers/target/iscsi/iscsi_target_util.c index 6dd5810e2af1..931586595044 100644 --- a/drivers/target/iscsi/iscsi_target_util.c +++ b/drivers/target/iscsi/iscsi_target_util.c @@ -742,7 +742,7 @@ void __iscsit_free_cmd(struct iscsi_cmd *cmd, bool check_queues) conn->conn_transport->iscsit_unmap_cmd(conn, cmd); } -void iscsit_free_cmd(struct iscsi_cmd *cmd, bool shutdown) +void iscsit_free_cmd(struct iscsi_cmd *cmd, bool shutdown, bool force_cleanup) { struct se_cmd *se_cmd = cmd->se_cmd.se_tfo ? &cmd->se_cmd : NULL; int rc; @@ -751,10 +751,14 @@ void iscsit_free_cmd(struct iscsi_cmd *cmd, bool shutdown) __iscsit_free_cmd(cmd, shutdown); if (se_cmd) { - rc = transport_generic_free_cmd(se_cmd, shutdown); + rc = transport_generic_free_cmd(se_cmd, + force_cleanup ? false : shutdown); if (!rc && shutdown && se_cmd->se_sess) { __iscsit_free_cmd(cmd, shutdown); target_put_sess_cmd(se_cmd); + } else if (se_cmd->sess && force_cleanup) { + __iscsit_free_cmd(cmd, true); + target_put_sess_cmd(se_cmd); } } else { iscsit_release_cmd(cmd); diff --git a/drivers/target/target_core_transport.c b/drivers/target/target_core_transport.c index 14c6f2bb1b01..eb233ea8db65 100644 --- a/drivers/target/target_core_transport.c +++ b/drivers/target/target_core_transport.c @@ -1554,7 +1554,7 @@ int transport_handle_cdb_direct( * this to be called for initial descriptor submission. */ cmd->t_state = TRANSPORT_NEW_CMD; - cmd->transport_state |= CMD_T_ACTIVE; + cmd->transport_state |= (CMD_T_ACTIVE | CMD_T_SUBMITTED); /* * transport_generic_new_cmd() is already handling QUEUE_FULL, @@ -2221,7 +2221,7 @@ void target_execute_cmd(struct se_cmd *cmd) return; spin_lock_irq(&cmd->t_state_lock); - cmd->t_state = TRANSPORT_PROCESSING; + cmd->t_state = (TRANSPORT_PROCESSING | CMD_T_SUBMITTED); cmd->transport_state |= CMD_T_ACTIVE | CMD_T_SENT; spin_unlock_irq(&cmd->t_state_lock); diff --git a/include/target/target_core_base.h b/include/target/target_core_base.h index fb11c7693b25..b759ec810fa9 100644 --- a/include/target/target_core_base.h +++ b/include/target/target_core_base.h @@ -511,6 +511,7 @@ struct se_cmd { #define CMD_T_COMPLETE (1 << 2) #define CMD_T_SENT (1 << 4) #define CMD_T_STOP (1 << 5) +#define CMD_T_SUBMITTED (1 << 6) #define CMD_T_TAS (1 << 10) #define CMD_T_FABRIC_STOP (1 << 11) spinlock_t t_state_lock;