On 17/09/2019 13:33, Russell King - ARM Linux admin wrote: [...]
Further debug shows: coherent=0 - sdhci device is not cache coherent swapper pgtable: 4k pages, 39-bit VAs, pgdp=0000000081cac000 [ffffff8010fd5200] pgd=000000237ffff003, pud=000000237ffff003, pmd=000000237fffb003, pte=00e800236d62270f The mapping for the ADMA table seems to be using MAIR index 3, which is MT_MEMORY_NC, so should be non-cacheable. vmallocinfo: 0xffffff8010fd5000-0xffffff8010fd7000 8192 dma_direct_alloc+0x4c/0x54 user So this memory has been remapped. Could there be an alias that has cache lines still in the cache for the physical address, and could we be hitting those cache lines while accessing through a non-cacheable mapping? (On 32-bit ARM, this is "unpredictable" and this problem definitely _feels_ like it has unpredictable attributes!) Also, given that this memory is mapped NC, then surely __dma_flush_area() should have no effect? However, it _does_ have the effect of reliably solving the problem, which to me implies that there _are_ cache lines in this NC mapping.
The non-cacheable mapping of the descriptor table will still have its cacheable linear map alias, so it's quite likely that the invalidate aspect of __dma_flush_area(), rather than the clean, is what's making the difference - if using __dma_clean_area() instead doesn't help, it would more or less confirm that.
One possibility in that case is that you might actually have the rare backwards coherency problem - if the device *is* actually snooping the cache, then it could hit lines which were speculatively prefetched via the cacheable alias before the descriptors were fully written, rather than the up-to-date data which went straight to RAM via the NC mapping. I'd try declaring the device as "dma-coherent" to see if that's actually true (and it should become pretty obvious if it isn't).
Robin.