Re: [PATCH v12 1/5] asm-generic: Improve csum_fold

"Arnd Bergmann" <arnd@xxxxxxxx> · Wed, 13 Dec 2023 08:52:01 +0100



On Wed, Dec 13, 2023, at 02:18, Charlie Jenkins wrote:
> This csum_fold implementation introduced into arch/arc by Vineet Gupta
> is better than the default implementation on at least arc, x86, and
> riscv. Using GCC trunk at -O3 and compiling a non-inlined version, this
> implementation has 41.6667% fewer instructions on riscv64 and 25% fewer
> on x86-64. Most implementations override this
> default in asm, but this should be more performant than all of those
> other implementations except for arm, which has barrel shifting, and
> sparc32, which has a carry flag.
>
> Signed-off-by: Charlie Jenkins <charlie@xxxxxxxxxxxx>

Acked-by: Arnd Bergmann <arnd@xxxxxxxx>
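
For anyone reading along without the diff at hand, here is a rough
userspace sketch of the two folding strategies being compared. It assumes
the arch/arc variant is the rotate-and-subtract form and that the previous
generic version folds the carry in twice; the names fold_classic,
fold_rotate, swap_halves and self_check are made up for illustration and
plain uint32_t/uint16_t stand in for __wsum/__sum16. It is not the kernel
code itself.

```c
#include <assert.h>
#include <stdint.h>

/* Hypothetical stand-in for a 16-bit rotate of a 32-bit word. */
static inline uint32_t swap_halves(uint32_t w)
{
	return (w >> 16) | (w << 16);
}

/* Assumed shape of the older default: add the halves, fold the carry. */
static inline uint16_t fold_classic(uint32_t sum)
{
	sum = (sum & 0xffff) + (sum >> 16);
	sum = (sum & 0xffff) + (sum >> 16);
	return (uint16_t)~sum;
}

/*
 * Assumed shape of the rotate-based variant: subtracting the rotated
 * word from the complement leaves the folded, inverted sum in the
 * upper 16 bits, with the carry absorbed by 32-bit wraparound.
 */
static inline uint16_t fold_rotate(uint32_t sum)
{
	return (uint16_t)((~sum - swap_halves(sum)) >> 16);
}

/* Spot-check that both variants agree on a few inputs. */
static void self_check(void)
{
	static const uint32_t samples[] = {
		0x00000000u, 0x0000ffffu, 0xffff0000u,
		0xffffffffu, 0x12345678u, 0x8000ffffu,
	};
	unsigned int i;

	for (i = 0; i < sizeof(samples) / sizeof(samples[0]); i++)
		assert(fold_classic(samples[i]) == fold_rotate(samples[i]));
}
```

The rotate form needs no masking and no second carry step, which is
presumably where the instruction savings on targets without a carry flag
come from.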