Re: DM3730 + MUSB host mode + DMA + USB Ethernet dongle = Fail

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



On Thu, May 7, 2015 at 2:17 PM, Tim Nordell <tim.nordell@xxxxxxxxxxx> wrote:
> Hi -
>
> On 05/07/15 11:16, Felipe Balbi wrote:
>>
>> Hi,
>>
>> On Thu, May 07, 2015 at 09:06:06AM -0700, Tony Lindgren wrote:
>>>
>>> * Tim Nordell <tim.nordell@xxxxxxxxxxx> [150506 16:19]:
>>>>
>>>> Hi all -
>>>>
>
> *snip*
>
>>>>
>>>> Something about this feels race conditiony too.  Depending on the kernel
>>>> configuration options in the kernel debug menu (which slows down other
>>>> parts
>>>> of the kernel), or if I enable verbose debug message output from the
>>>> MUSB
>>>> driver, it lasts much, much longer before keeling over as a system.  You
>>>> can
>>>> restore normal behavior by unplugging/replugging in the device.
>>>
>>>
>>> This sounds like some issue dealing with certain size transfers where
>>> there's data remaining after a DMA transfer. You might be able to make
>>> it easily reproducable with a variable size ping from your PC even with
>>> debugging enabled. Something like the script below.
>>>
>>> Regards,
>>>
>>> Tony
>>>
>>> 8< --------------
>>> #!/bin/bash
>>>
>>> device=$1
>>> size=$2
>>>
>>> while [ 1 ]; do
>>>         #echo "Pinging with size $size"
>>>         if ! ping -w0 -c1 $device -s$size > /dev/null 2>&1; then
>>>                 break;
>>>         fi
>>>         size=$(expr $SIZE + 1)
>
>
> (This line should have referred to $size instead of $SIZE.)
>
>>>
>>>         if [ $size -gt 8192 ]; then
>>>                 size=1
>>>         fi
>>> done
>>>
>>> echo "Test ran up to $size"
>>
>>
>> I'd be very much interested in getting such logs :-)
>>
>
> Unfortunately, the driver works with the script above (after fixing the one
> line) so it's not a "certain size" style scenario.
>
> Using iperf I can get traffic to stop flowing within a second of usage so
> fortunately it's highly reproducible.  One thing I tried this morning was to
> simply "ignore" the fact the RXCSR register didn't have the expected bit and
> continue in the driver.  While the throughput was abysmal (and DMA "errors"
> were reported), at least the driver didn't die.  I could still ping after
> stopping iperf.  (I don't think this is the appropriate thing to do, but it
> was just a quick and dirty test.)
>
> However, iperf does have the ability to adjust the packet size while still
> spewing out a high-rate (configurable bandwidth) of packets. Armed with
> this, I modified the ping test above to utilize iperf instead at various
> packet sizes (granted, this would get fragmented above the MTU size by the
> IP stack), leaving it at 1000 packets per second.
>
> Doing that as an exercise with the Ethernet dongle, it first keeled over at
> 1059 byte UDP datagrams (1101 bytes on Ethernet - I don't know how many USB
> transactions this split into exactly), at 1000 packets per second.  However,
> it doesn't consistently keel over at the same bytes per datagram.  Another
> attempt keeled over at 1052 byte UDP datagrams.
>
> Upping the ante to 1500 packets per second, and jumping at 16-byte intervals
> for the size, it lasted from 1000 to 1352 byte size packets and keeled over
> at 1368 byte sized packets.
>
> Some of the configurations of iperf against MUSB in DMA mode causes the ASIX
> driver (the Ethernet dongle driver) to complain too, even to the point of
> doing a NULL dereference (I will dig into this dereference). Not seen with
> the MUSB in PIO mode.
>
> Note: Same system/kernel, configured as an Ethernet gadget does not exhibit
> the issues, although, I'm not convinced it's actually using DMA for this.
> INTC 93 does not have any activity as a gadget (INTC 92 gets all the
> traffic).  I can improve the performance (~35% improvement) in PIO mode of
> an Ethernet gadget by configuring the dynamic FIFO in the MUSB to utilize
> double buffering, but DMA mode still falls over with a USB ethernet dongle
> attached with double buffering.  I haven't tried the series of iperf
> bandwidth tests against double buffering yet.  I also have the idle loops in
> the system disabled to improve interrupt latency ("nohlt cpuidle.off=1").
>
> - Tim
>

I'm sorry if you are already aware of this information.
Just wanted to share my experience.

AFAIK, there is a hardware bug with MUSB Host controller.
See ARM_MPU.KERNEL.39 at
http://processors.wiki.ti.com/index.php/Sitara_SDK_6.00.00_Release_Notes
I assume it is common across AM335x and DM3730.

TI has this pre-mature TX complete interrupt issue on host controller side.
Freescale has a similar pre-mature TX interrupt issue on device controller side.

It looks like TI has a patch to work around that issue.
https://gitorious.org/opensuse/kernel/commit/a655f481d83d6d37bec0a2ddfdd24c30ff8f541f

Maybe, in your case would it be due to the 25us wait loop which is too
short (I guess)?

To me TI's approach looks complicated.

Freescale's approach looks a lot simpler.
http://git.freescale.com/git/cgit.cgi/imx/linux-2.6-imx.git/commit/drivers/usb/gadget/arcotg_udc.c?h=imx_2.6.28&id=f34bd00b7b0295e723158db1f55f28416026ad94


-chan
--
To unsubscribe from this list: send the line "unsubscribe linux-omap" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at  http://vger.kernel.org/majordomo-info.html




[Index of Archives]     [Linux Arm (vger)]     [ARM Kernel]     [ARM MSM]     [Linux Tegra]     [Linux WPAN Networking]     [Linux Wireless Networking]     [Maemo Users]     [Linux USB Devel]     [Video for Linux]     [Linux Audio Users]     [Yosemite Trails]     [Linux Kernel]     [Linux SCSI]

  Powered by Linux