Hi Prashanth, On Wed, Apr 24, 2024, Prashanth K wrote: > On 24-04-24 07:03 am, Thinh Nguyen wrote: > > On Mon, Apr 22, 2024, Prashanth K wrote: > > > + * the DWC3 controller (This is enabled by setting GUCTL2[14]) > > > > Did we check to know that we set GUCTL2.Rst_actbitlater to start > > polling for CMDACT? > GUCTL2[14] is set only for DWC3_usb3 controllers prior to version 310a, > because the register is not available for other versions/revisions. > > dwc3/core.c - dwc3_core_init() > /* > * ENDXFER polling is available on version 3.10a and later of > * the DWC_usb3 controller. It is NOT available in the > * DWC_usb31 controller. > */ > if (DWC3_VER_IS_WITHIN(DWC3, 310A, ANY)) { > reg = dwc3_readl(dwc->regs, DWC3_GUCTL2); > reg |= DWC3_GUCTL2_RST_ACTBITLATER; > dwc3_writel(dwc->regs, DWC3_GUCTL2, reg); > } > > Hence the we have added the CMDACT polling in else case of the following > check in __dwc3_stop_active_transfer(). > > if (!DWC3_IP_IS(DWC3) || DWC3_VER_IS_PRIOR(DWC3, 310A)) > > > > > > + */ > > > + do { > > > + reg = dwc3_readl(dep->regs, DWC3_DEPCMD); > > > + if (!(reg & DWC3_DEPCMD_CMDACT)) > > > + break; > > > + udelay(2); > > > > udelay of 2 is really small. Try at least 200us. > Agreed, Will make it 200us with 5 retries. I'm not really sure how many > retries we need here. I just made the aggregate to 1ms since we use 1ms > delay for other revisions. > > > > > + } while (--retries); > > > + > > > + if (!retries && (dwc->ep0state != EP0_SETUP_PHASE)) { > > > + dep->flags |= DWC3_EP_DELAY_STOP; > > > + return -ETIMEDOUT; > > > + } > > > + } > > > + > > > dep->flags &= ~DWC3_EP_TRANSFER_STARTED; > > > } else if (!ret) { > > > dep->flags |= DWC3_EP_END_TRANSFER_PENDING; > > > -- > > > 2.25.1 > > > > > > > Did you observe issues with DWC_usb31? How much longer did your setup > > need to complete End Transfer command? > > > The problem here is that we are immediately unmapping the request after > issuing the ENDXER, we aren't waiting for the completion unlike other > commands. > > 90.872628: dbg_send_ep_cmd: ep1in: cmd 'End Transfer' [30c08] params > 00000000 00000000 00000000 --> status: UNKNOWN > 90.872652: dbg_ep_queue: ep1in: req 937bc75c length 0/1069 zsI ==> -108 > > From the above traces, we can we see that the time gap between send_ep_cmd > and dequeue/unmap is 24us, which is pretty small. > And immediately we got an smmu fault (since ENDXFER didn't complete in this > 24us time-frame). I dumped the DWC3 regs and saw that cmdact was not cleared > for ep1in which means the cmd never got completed. > > 90.873360: [DEPCMD(3): 0xc83c 0x00030C08] > --> CMDACT (bit10) still set. > > > I would prefer a solution that applies for all IPs. Do you observe any > > impact should we increase the mdelay()? I don't expect much impact since > > this should only happen during endpoint disbling, which is not a common > > operation. > > > I checked the following comments in dwc3_stop_active_transfer(), since we > don't use IOC for ENDXFER, we are use CMDACT polling on revisions which > supports it, other revisions uses delay of 1ms instead. > > But currently for DWC3_usb3 >= 310a, we don't poll cmdact (even though it > supports it) nor we wait 1ms for command completion. For me waiting 1ms was > also helping, but that doesn't guarantee the command completion though (we > just assume that it gets completed by that time). > > * As of IP version 3.10a of the DWC_usb3 IP, the controller > * supports a mode to work around the above limitation. The > * software can poll the CMDACT bit in the DEPCMD register > * after issuing a EndTransfer command. This mode is enabled > * by writing GUCTL2[14]. This polling is already done in the > * dwc3_send_gadget_ep_cmd() function so if the mode is > * enabled, the EndTransfer command will have completed upon > * returning from this function. > * > * This mode is NOT available on the DWC_usb31 IP. In this > * case, if the IOC bit is not set, then delay by 1ms > * after issuing the EndTransfer command. This allows for the > * controller to handle the command completely before DWC3 > * remove requests attempts to unmap USB request buffers. > > But anyways I conducted a small experiment to calculate the ENDXFER cmd > completion, added traces before writing to DEPCMD and immediately polled for > the CMDACT to get cleared. Observed that it takes 10-15us on average > (checked on DWC3_REVISION_310A), but these are the best case scenarios. > > 191.623016: dwc3_writel: addr xx offset 000c value 00040c08 - write to > DEPCMD > 191.623027: dwc3_readl: addr xx offset 000c value 00040808 -- CMACT cleared > 191.623076: dwc3_gadget_ep_cmd: ep2out: cmd 'End Transfer' [40c08] params > 00000000 00000000 00000000 --> status: Successful > . > 191.623425: dwc3_writel: addr xx offset 000c value 00070c08 > 191.623436: dwc3_readl: addr xx offset 000c value 00070808 > 191.623483: dwc3_gadget_ep_cmd: ep3in: cmd 'End Transfer' [70c08] params > 00000000 00000000 00000000 --> status: Successful > Thanks for the data. Ok, I remember now why we did what we did. I just notice the Fixes commit you tag: b353eb6dc285 ("usb: dwc3: gadget: Skip waiting for CMDACT cleared during endxfer") I forgot that at one point we skip CMDACT for End Transfer command. Let's not poll for CMDACT for End Transfer command and unconditionally wait 1ms. Otherwise we may run into the issue being stuck with CMDACT again while SETUP packet is not DMA out again. 1ms should be plenty of time for the End Transfer command to complete. It should look like this: diff --git a/drivers/usb/dwc3/gadget.c b/drivers/usb/dwc3/gadget.c index f94f68f1e7d2..dad30c6ab19d 100644 --- a/drivers/usb/dwc3/gadget.c +++ b/drivers/usb/dwc3/gadget.c @@ -1724,8 +1724,7 @@ static int __dwc3_stop_active_transfer(struct dwc3_ep *dep, bool force, bool int dep->resource_index = 0; if (!interrupt) { - if (!DWC3_IP_IS(DWC3) || DWC3_VER_IS_PRIOR(DWC3, 310A)) - mdelay(1); + mdelay(1); dep->flags &= ~DWC3_EP_TRANSFER_STARTED; } else if (!ret) { dep->flags |= DWC3_EP_END_TRANSFER_PENDING; Thanks, Thinh Ps. also please Cc stable.