Re: USB network gadget / DWC3 issue

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



Hi

Op 13-04-2021 om 04:17 schreef Thinh Nguyen:
Ferry Toth wrote:
Op 11-04-2021 om 02:04 schreef Thinh Nguyen:
Ferry Toth wrote:
Hi, some corrections below.

Op 10-04-2021 om 15:29 schreef Ferry Toth:
Op 09-04-2021 om 15:26 schreef Ferry Toth:
Hi,

Op 08-04-2021 om 23:12 schreef Thinh Nguyen:
Ferry Toth wrote:
Op 07-04-2021 om 15:34 schreef Andy Shevchenko:
On Wed, Apr 7, 2021 at 3:24 AM Thinh Nguyen
<Thinh.Nguyen@xxxxxxxxxxxx>  wrote:
Thinh Nguyen wrote:
...

I took a look at the "bad" and "normal" tracepoints. There are a
few
1-second delays where the host tried to bring the device back and
resume from low power:

         ksoftirqd/0-10      [000] d.s.   231.501808:
dwc3_gadget_ep_cmd: ep3in: cmd 'Update Transfer' [60007] params
00000000 00000000 00000000 --> status: Successful
         ksoftirqd/0-10      [000] d.s.   231.501809: dwc3_readl:
addr
00000000d68ecd36 value 0000a610
         ksoftirqd/0-10      [000] d.s.   231.501810: dwc3_writel:
addr
00000000d68ecd36 value 0000a710
              <idle>-0       [000] d.h. 232.499418: dwc3_readl:
addr
00000000a15e0e35 value 00000034
              <idle>-0       [000] d.h. 232.499423: dwc3_readl:
addr
00000000bb67b585 value 00001000
              <idle>-0       [000] d.h. 232.499425:
dwc3_writel: addr
00000000bb67b585 value 80001000
              <idle>-0       [000] d.h. 232.499427:
dwc3_writel: addr
00000000a15e0e35 value 00000034
         irq/15-dwc3-476     [000] d...   232.499480: dwc3_event:
event
(00000401): WakeUp [U0]
         irq/15-dwc3-476     [000] d...   232.499492: dwc3_event:
event
(00000401): WakeUp [U0]
         irq/15-dwc3-476     [000] d...   232.499496: dwc3_event:
event
(00006088): ep2out: Transfer In Progress [0] (SIm)
         irq/15-dwc3-476     [000] d...   232.499501:
dwc3_complete_trb: ep2out: trb 00000000c7ce524e (E179:D170) buf
0000000008273540 size 1463 ctrl 00000818 (hlcS:sC:normal)
         irq/15-dwc3-476     [000] d...   232.499518:
dwc3_gadget_giveback: ep2out: req 0000000012e296cf length 73/1536
zsI ==> 0
         irq/15-dwc3-476     [000] d...   232.499562:
dwc3_ep_queue:
ep2out: req 0000000012e296cf length 0/1536 zsI ==> -115
         irq/15-dwc3-476     [000] d...   232.499601:
dwc3_prepare_trb:
ep2out: trb 000000008c083777 (E180:D170) buf 0000000002a7e9c0
size
1536 ctrl 00000819 (HlcS:sC:normal)


Your device is operating in highspeed right? Try to turn off LPM
from
host and see if that helps with the speed throttling issue. (If
you're
using xHCI host, then set XHCI_HW_LPM_DISABLE). It may also help
with
the connection issue you saw.

It seems to be an issue from host, but I can't tell for sure
unless we
have some USB traffic analyzer that shows what's going on.
Have you
tried different hosts?

You can also disable LPM from the gadget side by setting
dwc->dis_enblslpm_quirk.
Ferry, it can be done by adding a corresponding property to the
dwc3-pci.c for Intel Merrifield platform. I'll check also for my
case
and perhaps I can collect some traces in my case later on when I
have
more time for that.

Ok thanks all. Here is what I tried:

Another computer (Acer 720P brainwashed chromebook), I tried both
full
speed and high speed. Still throttling but less bad.

Then on desktop, with Edison kernel 5.12-rc5 as above + this patch:

diff --git a/drivers/usb/dwc3/dwc3-pci.c
b/drivers/usb/dwc3/dwc3-pci.c

index 4c5c6972124a..a9268c085840 100644

--- a/drivers/usb/dwc3/dwc3-pci.c

+++ b/drivers/usb/dwc3/dwc3-pci.c

@@ -122,6 +122,7 @@ static const struct property_entry
dwc3_pci_mrfld_properties[] = {

PROPERTY_ENTRY_STRING("linux,extcon-name", "mrfld_bcove_pwrsrc"),

PROPERTY_ENTRY_BOOL("snps,dis_u3_susphy_quirk"),

PROPERTY_ENTRY_BOOL("snps,dis_u2_susphy_quirk"),

+ PROPERTY_ENTRY_BOOL("snps,dis_enblslpm_quirk"),

PROPERTY_ENTRY_BOOL("linux,sysdev_is_parent"),

{}

};

This fixes the throttling but reveals I had actually at least 2
bugs:

1) throttling due to LPM, this seems solved now, thanks to much!
Now that we can confirm the speed throttling is related to LPM. We
can
try to experiment further. (IMO, LPM is an important feature and
totally disabling LPM seems like using a sledgehammer to crack a nut)

I suspect that your phy/HW has a higher low power exit latency. I
don't
think you provided any HIRD threshold property in your setup
right? So
by default, dwc3 sets the base line BESL value to 1 (or 150us).
Unless
you know what your phy/HW is capable of, try to test and increase the
recommended BESL value. The range can be from 0 to 15 where 0 is
150us
and 15 is 10ms. Maybe try 6 (i.e. 1ms).

diff --git a/drivers/usb/dwc3/gadget.c b/drivers/usb/dwc3/gadget.c
index 60e850a395a2..423533df8927 100644
--- a/drivers/usb/dwc3/gadget.c
+++ b/drivers/usb/dwc3/gadget.c
@@ -2895,7 +2895,7 @@ static void dwc3_gadget_config_params(struct
usb_gadget *g,
                    * recommended BESL baseline to 1 and clamp the
BESL deep to be
                    * within 2 to 15.
                    */
-               params->besl_baseline = 1;
+               params->besl_baseline = 6;
                   if (dwc->is_utmi_l1_suspend)
                          params->besl_deep =
                                   clamp_t(u8, dwc->hird_threshold, 2,
15);

I will try and report back, hopefully this evening.
I tried this and it seems to have the same effect as dis_enblslpm_quirk
2) a problem with usb plug detection

When I unplug/replug the gadget cable I need to do that at least
another
time before gadget is detected. So unplug/replug/unplug/replug
seems to
work.

Also this platform has a HW switch to select host/device mode, with
separate connectors for host and device.

When I flip the switch to host it immediately changes to host.

Flipping to device leaves the LEDs on my connected usb hub on, so
it's
still powered (but not operational).

Flipping fast host/device (within 1/2 sec) hub LEDs turns off. But I
still need to additionally unplug/replug the gadget cable to get
that to
work.

The connection issue can come from different things. Please narrow
it down
and make sure that you don't use any defective cable or bad hub.
Even then,
it's difficult to determine whose fault it is from just the dmesg
and driver
logs alone without looking at the USB traffic at the packet level.

Btw, is your setup DRD? If you're switching mode, then I know that
dwc3 right
now doesn't implement mode switching correctly.
Yes, we use Extcon driver to support DRD.
You can see the discussion we have here:
https://urldefense.com/v3/__https://lore.kernel.org/linux-usb/20210108015115.27920-1-john.stultz@xxxxxxxxxx/T/*t__;Iw!!A4F2R9G_pg!MXee1rloMlVeQuXlR60t94lr_6imLoVLTEFXzYWhS27dZFAFtH5AWssCZxlDLGcaKy2f$


I see, that might indeed be related. I will try the patches to see if
that works and report back.
I applied both patches:

usb: dwc3: Trigger a GCTL soft reset when switching modes in DRD

usb: dwc3: Fix DRD mode change sequence following programming guide

It doesn't have an effect on the need to unplug/replug neither on the
problems switch from host/device mode.
When I test the correct kernel it does have an effect :-)

In most cases the need to unplug/replug is removed, but not always. In
the cases when I need to retry the host journal shows "can't set config
#1, error -110"
It's most likely because the driver didn't provide time for the clocks
synchronization before clearing the GCTL soft reset. I noted that issue
in the patch in the discussion thread. I can send out a patch next week.

The switch from host->device and device->host mode seems to be resolved.

Strangely, iperf3 now reports 130 Mbits/sec (down from 200 Mbits/sec).

Did this happen with disabling LPM or with increasing BESL baseline?
Note that increasing the recommended BESL is not the same as disabling
LPM. With the recommended BESL provided, the host can decide when it
should put the device in low power so that the device has enough time to
wake up. With LPM enabled, there maybe some minor speed degradation but
not that much. Anyway, you can experiment with the BESL value to have
the acceptable speed while still have power saving capability (or
completely disable LPM if power saving is not an issue for you).
I tried both, the result was exactly the same.
That's strange... Also, enabling LPM should not impact the performance that
much at all. What's changed to your setup?

Strange indeed. Only the combination of both patches had that effect.

And stranger, I backported everything to 5.10.27 and that effect is not present. But then I still need to plug/replug the cable 2x.

Anyway, can you try this patch instead of John Stult's. There are a couple
of issues from his patches.

diff --git a/drivers/usb/dwc3/core.c b/drivers/usb/dwc3/core.c
index 05e2e54cbbdc..675e861fda1a 100644
--- a/drivers/usb/dwc3/core.c
+++ b/drivers/usb/dwc3/core.c
@@ -14,6 +14,7 @@
  #include <linux/kernel.h>
  #include <linux/slab.h>
  #include <linux/spinlock.h>
+#include <linux/mutex.h>
  #include <linux/platform_device.h>
  #include <linux/pm_runtime.h>
  #include <linux/interrupt.h>
@@ -40,6 +41,8 @@
#define DWC3_DEFAULT_AUTOSUSPEND_DELAY 5000 /* ms */ +static DEFINE_MUTEX(mode_switch_lock);
+
  /**
   * dwc3_get_dr_mode - Validates and sets dr_mode
   * @dwc: pointer to our context structure
@@ -114,13 +117,20 @@ void dwc3_set_prtcap(struct dwc3 *dwc, u32 mode)
         dwc->current_dr_role = mode;
  }
+static int dwc3_core_soft_reset(struct dwc3 *dwc);
+
  static void __dwc3_set_mode(struct work_struct *work)
  {
         struct dwc3 *dwc = work_to_dwc(work);
         unsigned long flags;
+       unsigned int hw_mode;
         int ret;
         u32 reg;
+ mutex_lock(&mode_switch_lock);
+
+       hw_mode = DWC3_GHWPARAMS0_MODE(dwc->hwparams.hwparams0);
+
         pm_runtime_get_sync(dwc->dev);
if (dwc->current_dr_role == DWC3_GCTL_PRTCAP_OTG)
@@ -154,6 +164,24 @@ static void __dwc3_set_mode(struct work_struct *work)
                 break;
         }
+ if (hw_mode == DWC3_GHWPARAMS0_MODE_DRD) {
+               reg = dwc3_readl(dwc->regs, DWC3_GCTL);
+               reg |= DWC3_GCTL_CORESOFTRESET;
+               dwc3_writel(dwc->regs, DWC3_GCTL, reg);
+
+               /*
+                * Wait for internal clocks to synchronized. DWC_usb31 and
+                * DWC_usb32 may need at least 50ms (less for DWC_usb3). To
+                * keep it consistent across different IPs, let's wait up to
+                * 100ms before clearing GCTL.CORESOFTRESET.
+                */
+               msleep(100);
+
+               reg = dwc3_readl(dwc->regs, DWC3_GCTL);
+               reg &= ~DWC3_GCTL_CORESOFTRESET;
+               dwc3_writel(dwc->regs, DWC3_GCTL, reg);
+       }
+
         spin_lock_irqsave(&dwc->lock, flags);
dwc3_set_prtcap(dwc, dwc->desired_dr_role);
@@ -178,6 +206,9 @@ static void __dwc3_set_mode(struct work_struct *work)
                 }
                 break;
         case DWC3_GCTL_PRTCAP_DEVICE:
+               if (hw_mode == DWC3_GHWPARAMS0_MODE_DRD)
+                       dwc3_core_soft_reset(dwc);
+
                 dwc3_event_buffers_setup(dwc);
if (dwc->usb2_phy)
@@ -200,6 +231,7 @@ static void __dwc3_set_mode(struct work_struct *work)
  out:
         pm_runtime_mark_last_busy(dwc->dev);
         pm_runtime_put_autosuspend(dwc->dev);
+       mutex_unlock(&mode_switch_lock);
  }
void dwc3_set_mode(struct dwc3 *dwc, u32 mode)

I will try this.
Thanks,
Thinh



[Index of Archives]     [Linux Media]     [Linux Input]     [Linux Audio Users]     [Yosemite News]     [Linux Kernel]     [Linux SCSI]     [Old Linux USB Devel Archive]

  Powered by Linux