On Thu, 25 Jul 2024, John David Anglin wrote: > On 2024-07-24 3:25 p.m., James Bottomley wrote: > > On Wed, 2024-07-24 at 20:17 +0200, Mikulas Patocka wrote: > >> Hi > >> > >> Thanks for fixing the cache aliasing issues on PA-RISC in the commit > >> 72d95924ee35c8cd16ef52f912483ee938a34d49. > >> > >> I think there is still one problem left - and that is > >> ARCH_DMA_MINALIGN. Currently, it is 16, which is obviously wrong. > > I don't think that's obvious, why is it wrong? > I see this comment in arch/arm64/include/asm/cache.h: > > /* > * Memory returned by kmalloc() may be used for DMA, so we must make > * sure that all such allocations are cache aligned. Otherwise, > * unrelated code may cause parts of the buffer to be read into the > * cache before the transfer is done, causing old data to be seen by > * the CPU. > */ > #define ARCH_DMA_MINALIGN (128) > #define ARCH_KMALLOC_MINALIGN (8) > > L1_CACHE_BYTES is 64 on arm64. > > Possibly, the same can occur on parisc. > > Dave L1_CACHE_BYTES is a performance hint that is used to avoid cache line ping-pong when multiple CPUs modify nearby data. ARCH_DMA_MINALIGN is the biggest possible cache line size to avoid DMA data corruption. As there are some arm64 machines with 128-byte cache line, arm64 has to define it to 128. James said that the L2 cache on PA8800/8900 is coherent with PCI. So, I think that ARCH_DMA_MINALIGN should be 64 (is that the L1 cache line size on PA8800/8900?). L1_CACHE_BYTES could be 128 to avoid ping-pong between sockets. For ARCH_KMALLOC_MINALIGN, there is an arm64 commit 9382bc44b5f58ccee375f08f518e53c0280051dc, it is an optimization, so that they can use cache line size probed at startup instead of 128. I think we don't have to do this optimization on PA-RISC, the PA-RISC machines with 16-byte or 32-byte cache line size are old and slow - so that there is no point in trying to optimize kmalloc pools for them. Mikulas