On Wed, 24 Jul 2024, James Bottomley wrote: > On Wed, 2024-07-24 at 20:17 +0200, Mikulas Patocka wrote: > > Hi > > > > Thanks for fixing the cache aliasing issues on PA-RISC in the commit > > 72d95924ee35c8cd16ef52f912483ee938a34d49. > > > > I think there is still one problem left - and that is > > ARCH_DMA_MINALIGN. Currently, it is 16, which is obviously wrong. > > I don't think that's obvious, why is it wrong? Suppose that two unrelated kernel subsystems allocate a 16-byte region and the regions happen to be placed next to each other - in the same cacheline. The first subsystem does DMA into its 16-byte region and the second subsystem writes to its region using cacheable write. Because the regions share a cache line, the cacheable write may corrupt the data that were loaded using DMA. For example 1. load a cache line into CPU cache 2. DMA writes to the region that was loaded 3. the cache line is modified 4. the cache line is written back and it overwrites the result of the DMA operation ARCH_DMA_MINALIGN prevents this situation from hapenning. See the explanation of ARCH_DMA_MINALIGN in Documentation/core-api/dma-api-howto.rst > > Some comments n the kernel say that PA8900 has L2 cache with 128-byte > > line size, so I think that ARCH_DMA_MINALIGN should be 128 as well. > > The L2+ caches on PA88 and 89 systems are PIPT and fully coherent with > the PCI bus, so the L2+ line size doesn't matter that much (well, > except we could possibly get better performance with more judicious DMA > alignment). I didn't know that. > All the parisc coherency protocols rely on the CPU L1 cache, which is > still VIPT. Additionally, the CPU architects kept the minimum line > size for the L1 at 16, so even in the later CPUs which have larger > actual VIPT cache line sizes there's a splitting mechanism which means > they can operate coherency protocols at a line size of 16. This was > done so the only spinlock primitive parisc has (LDCW) can still operate > correctly with only 16 bytes of alignment. But you still need to flush L1 cache when doing DMA. > > The question is - can the CPU speculatively mark a cache line as > > dirty and write it back? > > No, the CPU may only mark a line as dirty if something actually wrote > to it; it may not do it speculatively. The L1 cache can speculatively > move in clean lines if a TLB exists for them and once a line is marked > dirty it's within the gift of the CPU to decide when to write it back > absent a flush. That's good. > > If yes, we have a big problem - Linux assumes that a part of the > > page may be used for DMA transfer and another part of that page may > > be used for normal cacheable structures. If the PA-RISC CPU > > speculatively prefetched and wrote back a cache line, it could > > corrupt the DMA transfer. > > The L2 PIPT PCI coherence protocol ensures that DMA can't corrupt > memory adjacent objects on PA88 and 89. Earlier CPUs, which were fully > VIPT, do suffer from this problem because they have no PCI coherence, > but they all operate at a line size of 16 anyway and so > ARCH_DMA_MINALIGN works for them. I read somewhere on internet that PA7300LC has 32-byte cache line size. So, ARCH_DMA_MINALIGN 16 is too small for it. > > If the CPU doesn't speculatively mark cache lines as dirty, then > > increasing ARCH_DMA_MINALIGN would be sufficient solution. > > Well, it's relatively safe to try without exploding all our hashed > spinlocks because the LDCW alignment isn't tied to this (it's a > separate #define in ldcw.h) if you want to benchmark it. > > James Mikulas