On Fri, Jun 14, 2019 at 09:25:51AM -0400, Sven Van Asbroeck wrote:
> On Fri, Jun 14, 2019 at 6:49 AM Fabio Estevam <festevam@xxxxxxxxx> wrote:
> >
> > According to the original report from Sven the issue started to happen
> > on 5.0, so it would be good to add a Fixes tag and Cc stable so that
> > this fix could be backported to 5.0/5.1 stable trees.
>
> Good catch !
>
> However, the issue is highly timing-dependent. It will come and go depending
> on the kernel version, devicetree and defconfig. If it works for me on 4.19,
> that doesn't mean the bug is gone on 4.19.
>
> Looking at the commit history, I think the commit below possibly introduced the
> issue. Until this commit, sdma_run_channel() would wait on the interrupt
> before proceeding. It has been there since 4.8:
>
> Fixes: 1d069bfa3c78 ("dmaengine: imx-sdma: ack channel 0 IRQ in the interrupt handler")

I think this is correct. Starting with this commit, the interrupt status for
channel 0 is no longer cleared in sdma_run_channel0(), and sdma_int_handler()
is always called for channel 0. During firmware loading the interrupts are
enabled again just before the clocks are disabled. The interrupt is pending at
this moment, so on a single-core system I think this will always work as
expected. If the firmware loading and the interrupt handler run on different
cores, then this is racy. Maybe something else changed to make this more
likely?

With this new change sdma_int_handler() is no longer called for channel 0,
right? If so, you should also remove the special handling there.

Michael

--
Pengutronix e.K.                           |                             |
Industrial Linux Solutions                 | http://www.pengutronix.de/  |
Peiner Str. 6-8, 31137 Hildesheim, Germany | Phone: +49-5121-206917-0    |
Amtsgericht Hildesheim, HRA 2686           | Fax:   +49-5121-206917-5555 |
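
To make the ordering argument above concrete, here is a minimal userspace sketch in C. It is only a model, not driver code: struct sdma_model, model_int_handler() and model_load_firmware() are hypothetical names that do not exist in imx-sdma.c. The failure mode it counts (the handler touching registers after the clocks are off) is an assumption made for illustration. The handler_delayed flag stands in for the handler running on a different core, and ch0_raises_irq = false stands in for a change after which sdma_int_handler() is no longer called for channel 0.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>

/* Hypothetical userspace model; none of these names exist in imx-sdma.c. */
struct sdma_model {
	bool clk_on;             /* SDMA clocks enabled */
	bool ch0_irq_pending;    /* channel 0 interrupt status bit */
	bool ch0_raises_irq;     /* false models "no interrupt for channel 0" */
	int  unclocked_accesses; /* handler touched registers with clocks off */
};

/* Handler: acks channel 0, which needs a register access. */
static void model_int_handler(struct sdma_model *s)
{
	assert(s != NULL);

	if (!s->ch0_irq_pending)
		return;                   /* nothing pending: handler is not invoked */
	if (!s->clk_on)
		s->unclocked_accesses++;  /* assumed failure mode */
	s->ch0_irq_pending = false;
}

/*
 * Firmware load: run channel 0, re-enable interrupts, then disable clocks.
 * handler_delayed = true models the handler running on another core only
 * after the clocks are already off.
 * Returns the number of register accesses made with the clocks disabled.
 */
static int model_load_firmware(bool ch0_raises_irq, bool handler_delayed)
{
	struct sdma_model s = { .clk_on = true, .ch0_raises_irq = ch0_raises_irq };

	/* Channel 0 completes; the status bit is left set for the handler. */
	s.ch0_irq_pending = s.ch0_raises_irq;

	/* Interrupts are re-enabled here. Single core: handler runs at once. */
	if (!handler_delayed)
		model_int_handler(&s);

	s.clk_on = false;

	/* Different core: the handler arrives after the clocks are gone. */
	if (handler_delayed)
		model_int_handler(&s);

	return s.unclocked_accesses;
}
```

In this model, model_load_firmware(true, false) is the single-core case and reports no unclocked access, while model_load_firmware(true, true) is the racy case and reports one. With ch0_raises_irq set to false, neither ordering reaches the handler body, which is why the special handling for channel 0 in the handler would become dead code.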