Hi, On Fri, May 29, 2020 at 3:52 AM Maulik Shah <mkshah@xxxxxxxxxxxxxx> wrote: > > From: Lina Iyer <ilina@xxxxxxxxxxxxxx> > > Requests sent to RPMH can be sent as fire-n-forget or response required, > with the latter ensuring the command has been completed by the hardware > accelerator. Commands in a request with tcs_cmd::wait set, would ensure > that those select commands are sent as response required, even though > the actual TCS request may be fire-n-forget. > > Also, commands with .wait flag were also guaranteed to be complete > before the following command in the TCS is sent. This means that the > next command of the same request blocked until the current request is > completed. This could mean waiting for a voltage to settle or series of > NOCs be configured before the next command is sent. But drivers using > this feature have never cared about the serialization aspect. By not > enforcing the serialization we can allow the hardware to run in parallel > improving the performance. > > Let's clarify the usage of this member in the tcs_cmd structure to mean > only completion and not serialization. This should also improve the > performance of bus requests where changes could happen in parallel. > Also, CPU resume from deep idle may see benefits from certain wake > requests. > > Signed-off-by: Lina Iyer <ilina@xxxxxxxxxxxxxx> You are posting the patch and so you need your SoB too. > --- > drivers/soc/qcom/rpmh-rsc.c | 19 ++++++++----------- > include/soc/qcom/tcs.h | 5 +++-- > 2 files changed, 11 insertions(+), 13 deletions(-) > > diff --git a/drivers/soc/qcom/rpmh-rsc.c b/drivers/soc/qcom/rpmh-rsc.c > index 076fd27..d99e639 100644 > --- a/drivers/soc/qcom/rpmh-rsc.c > +++ b/drivers/soc/qcom/rpmh-rsc.c > @@ -413,8 +413,7 @@ static irqreturn_t tcs_tx_done(int irq, void *p) > cmd = &req->cmds[j]; > sts = read_tcs_cmd(drv, RSC_DRV_CMD_STATUS, i, j); > if (!(sts & CMD_STATUS_ISSUED) || > - ((req->wait_for_compl || cmd->wait) && > - !(sts & CMD_STATUS_COMPL))) { > + (cmd->wait && !(sts & CMD_STATUS_COMPL))) { I don't quite understand this part of the change. Why don't you need to check "req->wait_for_compl" anymore? You are still setting "CMD_MSGID_RESP_REQ" if "req->wait_for_compl" is set and none of the code in your patch actually sets "cmd->wait" (unlike what's implied in your change to the header file). Maybe some previous version of the patch _was_ actually setting "cmd->wait" and when you stopped doing that you forgot to restore this part of the change? > pr_err("Incomplete request: %s: addr=%#x data=%#x", > drv->name, cmd->addr, cmd->data); > err = -EIO; > @@ -433,7 +432,6 @@ static irqreturn_t tcs_tx_done(int irq, void *p) > skip: > /* Reclaim the TCS */ > write_tcs_reg(drv, RSC_DRV_CMD_ENABLE, i, 0); > - write_tcs_reg(drv, RSC_DRV_CMD_WAIT_FOR_CMPL, i, 0); Should you also be removing the write to RSC_DRV_CMD_WAIT_FOR_CMPL in both tcs_write() and tcs_invalidate()? Those are the only two places left that set this register and they both always set it to 0. > writel_relaxed(BIT(i), drv->tcs_base + RSC_DRV_IRQ_CLEAR); > spin_lock(&drv->lock); > clear_bit(i, drv->tcs_in_use); > @@ -465,23 +463,23 @@ static irqreturn_t tcs_tx_done(int irq, void *p) > static void __tcs_buffer_write(struct rsc_drv *drv, int tcs_id, int cmd_id, > const struct tcs_request *msg) > { > - u32 msgid, cmd_msgid; > + u32 msgid; > + u32 cmd_msgid = CMD_MSGID_LEN | CMD_MSGID_WRITE; > u32 cmd_enable = 0; > - u32 cmd_complete; > struct tcs_cmd *cmd; > int i, j; > > - cmd_msgid = CMD_MSGID_LEN; > + /* Convert all commands to RR when the request has wait_for_compl set */ > cmd_msgid |= msg->wait_for_compl ? CMD_MSGID_RESP_REQ : 0; > - cmd_msgid |= CMD_MSGID_WRITE; > - > - cmd_complete = read_tcs_reg(drv, RSC_DRV_CMD_WAIT_FOR_CMPL, tcs_id); > > for (i = 0, j = cmd_id; i < msg->num_cmds; i++, j++) { > cmd = &msg->cmds[i]; > cmd_enable |= BIT(j); > - cmd_complete |= cmd->wait << j; > msgid = cmd_msgid; > + /* > + * Additionally, if the cmd->wait is set, make the command > + * response reqd even if the overall request was fire-n-forget. > + */ > msgid |= cmd->wait ? CMD_MSGID_RESP_REQ : 0; > > write_tcs_cmd(drv, RSC_DRV_CMD_MSGID, tcs_id, j, msgid); > @@ -490,7 +488,6 @@ static void __tcs_buffer_write(struct rsc_drv *drv, int tcs_id, int cmd_id, > trace_rpmh_send_msg_rcuidle(drv, tcs_id, j, msgid, cmd); > } > > - write_tcs_reg(drv, RSC_DRV_CMD_WAIT_FOR_CMPL, tcs_id, cmd_complete); > cmd_enable |= read_tcs_reg(drv, RSC_DRV_CMD_ENABLE, tcs_id); > write_tcs_reg(drv, RSC_DRV_CMD_ENABLE, tcs_id, cmd_enable); > } > diff --git a/include/soc/qcom/tcs.h b/include/soc/qcom/tcs.h > index 7a2a055..d1c87fd 100644 > --- a/include/soc/qcom/tcs.h > +++ b/include/soc/qcom/tcs.h > @@ -1,6 +1,6 @@ > /* SPDX-License-Identifier: GPL-2.0 */ > /* > - * Copyright (c) 2016-2019, The Linux Foundation. All rights reserved. > + * Copyright (c) 2016-2020, The Linux Foundation. All rights reserved. > */ > > #ifndef __SOC_QCOM_TCS_H__ > @@ -30,7 +30,7 @@ enum rpmh_state { > * > * @addr: the address of the resource slv_id:18:16 | offset:0:15 > * @data: the resource state request > - * @wait: wait for this request to be complete before sending the next > + * @wait: Ensure that this command is complete before returning I have a hard time understanding what you're trying to convey here. Do we even need the "wait" in this structure? When you call rpmh_write() we use DEFINE_RPMH_MSG_ONSTACK which sets "wait_for_compl", right? ...so I guess you're expecting this to be used for rpmh_write_async()? ...but we never wait in that case, do we? ...or are you expecting rpmh_write_batch() to take advantage of this somehow? > */ > struct tcs_cmd { > u32 addr; > @@ -43,6 +43,7 @@ struct tcs_cmd { > * > * @state: state for the request. > * @wait_for_compl: wait until we get a response from the h/w accelerator > + * (sets the cmd->wait for all commmands in the request) s/commmands/scommands Also: I don't think this actually goes and modifies "cmd->wait" for all the commands in the request, does it? Maybe say "same as setting cmd->wait for all the commands in the request"? -Doug