Hi Leon- I created a different patch; see attachment. It aligns the start _and_ end of the DMA-mapped region, places large arrays so they encounter a page boundary, and leaves slack space around each array so there is no possibility of a shared DMA cacheline or other activity in that memory. I am able to reproduce the Local Protection Errors with this patch applied and SLUB debugging disabled.
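For readers following along, here is a rough sketch of what "aligning the start and end" of a mapped region means in terms of cachelines. It is not the attached patch; it is a standalone user-space illustration. The 64-byte CACHELINE value and the helper names (align_down, align_up, cacheline_span, spans_overlap) are assumptions made up for this example, not kernel APIs.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/* Assumed cacheline size for illustration only; real hardware may differ. */
#define CACHELINE ((uintptr_t)64)

/* Round addr down to a multiple of a (a must be a power of two). */
static uintptr_t align_down(uintptr_t addr, uintptr_t a)
{
    assert(a != 0 && (a & (a - 1)) == 0);
    return addr & ~(a - 1);
}

/* Round addr up to a multiple of a (a must be a power of two). */
static uintptr_t align_up(uintptr_t addr, uintptr_t a)
{
    assert(a != 0 && (a & (a - 1)) == 0);
    return (addr + a - 1) & ~(a - 1);
}

/* Half-open range [start, end) of bytes. */
struct span {
    uintptr_t start;
    uintptr_t end;
};

/* Full set of cachelines touched by a buffer at addr of length len. */
static struct span cacheline_span(uintptr_t addr, size_t len)
{
    struct span s;

    s.start = align_down(addr, CACHELINE);
    s.end = align_up(addr + len, CACHELINE);
    return s;
}

/* Nonzero if two buffers touch at least one common cacheline. */
static int spans_overlap(struct span a, struct span b)
{
    return a.start < b.end && b.start < a.end;
}
```

If a buffer's start and end are both cacheline-aligned and it has slack on either side, spans_overlap() against any neighboring allocation returns 0, which is the property the test patch is meant to guarantee before ruling alignment out as the cause.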
Thanks, Chuck, for proving that DMA alignment is not the issue here. I suggest that we go with my DMA coherent patch for now, until Leon and the Mellanox team can debug this one with the HW/FW folks and find out what is going on. Leon, I have done my share of debugging in this area on mlx4/mlx5. If you want, I can help with debugging this one.