Re: [PATCH nf-next 8/8] nft_set_pipapo: Introduce AVX2-based lookup implementation

Florian Westphal <fw@xxxxxxxxx> · Wed, 20 Nov 2019 16:16:53 +0100

Stefano Brivio <sbrivio@xxxxxxxxxx> wrote:
> If the AVX2 set is available, we can exploit the repetitive
> characteristic of this algorithm to provide a fast, vectorised
> version by using 256-bit wide AVX2 operands for bucket loads and
> bitwise intersections.
> 
> In most cases, this implementation consistently outperforms rbtree
> set instances despite the fact they are configured to use a given,
> single, ranged data type out of the ones used for performance
> measurements by the nft_concat_range.sh kselftest.

I think in that case it makes sense to remove rbtree once this new
set type has had some upstream exposure and let pipapo handle the
range sets.

Stefano, if I understand this right then we could figure out which
implementation (C or AVX) is used via "grep avx2 /proc/cpuinfo".

If not, I think we might want to expose some additional debug info
on set dumps.