On 2013-08-12 7:08 PM, David Daney wrote:
> On 08/12/2013 04:22 AM, Felix Fietkau wrote:
>> Several DMA ops are no-op on many platforms, and the indirection through
>> the mips_dma_map_ops function table is causing the compiler to emit
>> unnecessary code.
>>
>> Inlining visibly improves network performance in my tests (on a 24Kc
>> based system), and also slightly reduces code size of a few drivers.
>>
>> Signed-off-by: Felix Fietkau <nbd@xxxxxxxxxxx>
>> ---
>>  arch/mips/Kconfig                   |   4 +
>>  arch/mips/include/asm/dma-mapping.h | 360 +++++++++++++++++++++++++++++++++++-
>>  arch/mips/mm/dma-default.c          | 161 ++--------------
>>  3 files changed, 372 insertions(+), 153 deletions(-)
> That is not a very pleasing diffstat.
I know. But altering the generic include (whose inlined code I duplicated
here) would make things even worse. I believe the improvement in the
generated code is worth it, though.

I just did some fresh performance tests with a 400 MHz AR7242 system
(MIPS 24Kc), bridging packets from one Ethernet port to another. I'm
running TCP iperf through this device.
Without this patch, I get 710-760 MBit/s with heavy fluctuation.
With this patch, I get 780-790 MBit/s with little fluctuation.

Most other MIPS systems will probably see similar improvements in
DMA-heavy drivers. For Octeon, I don't expect any visible performance
change, and the change shouldn't make it any worse either.

- Felix
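
For readers following the thread, the trade-off described in the quoted patch description can be sketched roughly as below. This is only an illustration, not code from the patch: every name prefixed with demo_ or DEMO_ is hypothetical, and the struct is a simplified stand-in rather than the kernel's actual DMA ops table. It contrasts calling a no-op through a function table, where the compiler generally still has to load the pointer and emit an indirect call, with an inline wrapper that a compile-time constant lets the compiler reduce to nothing.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical, simplified ops table (not the kernel's struct dma_map_ops). */
struct demo_dma_ops {
    void (*sync_for_device)(void *addr, size_t size);
};

/* Counts how often real cache maintenance would have been performed. */
static unsigned long demo_sync_calls;

/* The op as it looks on a platform where no work is needed. */
static void demo_sync_noop(void *addr, size_t size)
{
    (void)addr;
    (void)size;
}

/* The op as it looks on a platform that really has to do work. */
static void demo_sync_real(void *addr, size_t size)
{
    (void)addr;
    (void)size;
    demo_sync_calls++;
}

static struct demo_dma_ops demo_ops = { .sync_for_device = demo_sync_noop };

/* Table-based variant: the target is only known at run time, so the
 * load and indirect call are typically emitted even for a no-op. */
static void demo_sync_indirect(const struct demo_dma_ops *ops,
                               void *addr, size_t size)
{
    ops->sync_for_device(addr, size);
}

/* Assumed build-time switch standing in for a Kconfig-style option. */
#define DEMO_SYNC_IS_NOOP 1

/* Inline variant: with the constant visible at the call site, the
 * compiler can drop the whole call when the op is a no-op. */
static inline void demo_sync_inline(void *addr, size_t size)
{
    if (DEMO_SYNC_IS_NOOP)
        return;
    demo_sync_real(addr, size);
}

/* Exercises both variants; neither performs real work in this setup. */
static unsigned long demo_run(void)
{
    char buf[64];

    demo_sync_indirect(&demo_ops, buf, sizeof(buf));
    demo_sync_inline(buf, sizeof(buf));
    assert(demo_sync_calls == 0);
    return demo_sync_calls;
}
```

Both variants behave identically here; the difference is only in the code the compiler can generate for each call site, which is the effect the patch description attributes the performance gain to.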