On Sun, 4 Feb 2024 at 08:52, John Sanpe <sanpeqf@xxxxxxxxx> wrote:
>
> Optimized the performance of the three functions (load_ecc8 store_ecc8
> and bch_encode) using a larger calculation length.

Honestly, with any optimization, you should quote performance numbers.

Also, it's questionable how meaningful this is, considering that most
architectures do the bit swap with a byte lookup, and the 32-bit bit
swap is just four byte lookups. For all we know, this only makes
things worse.

Finally, if you actually want to do things in bigger chunks, that
->swap_bits conditional should probably be moved out of the loops, not
just have that questionable 8->32 bit expansion.

              Linus
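
To make the two technical points concrete, here is a minimal userspace
sketch. It is not the kernel code under review: every name in it
(rev8, rev8_x4, copy_cond_in_loop, copy_cond_hoisted) is hypothetical,
and rev8() is a plain shift-and-mask stand-in for the byte-table lookup
mentioned above. It shows that a per-byte bit swap of a 32-bit word
amounts to four byte swaps, and what moving a swap conditional out of
the loop could look like.

```c
#include <stddef.h>
#include <stdint.h>

/* Hypothetical helper: reverse the bit order of one byte.
 * Stands in for a 256-entry table lookup. */
static uint8_t rev8(uint8_t b)
{
	b = (uint8_t)((b & 0xF0) >> 4 | (b & 0x0F) << 4);
	b = (uint8_t)((b & 0xCC) >> 2 | (b & 0x33) << 2);
	b = (uint8_t)((b & 0xAA) >> 1 | (b & 0x55) << 1);
	return b;
}

/* Reverse the bits inside each byte of a 32-bit word.
 * This is still four independent byte operations. */
static uint32_t rev8_x4(uint32_t w)
{
	return (uint32_t)rev8((uint8_t)w)
	     | (uint32_t)rev8((uint8_t)(w >> 8)) << 8
	     | (uint32_t)rev8((uint8_t)(w >> 16)) << 16
	     | (uint32_t)rev8((uint8_t)(w >> 24)) << 24;
}

/* Conditional evaluated on every iteration. */
static void copy_cond_in_loop(uint8_t *dst, const uint8_t *src,
			      size_t n, int swap)
{
	for (size_t i = 0; i < n; i++)
		dst[i] = swap ? rev8(src[i]) : src[i];
}

/* Conditional hoisted: tested once, then one of two tight loops runs. */
static void copy_cond_hoisted(uint8_t *dst, const uint8_t *src,
			      size_t n, int swap)
{
	size_t i;

	if (swap) {
		for (i = 0; i < n; i++)
			dst[i] = rev8(src[i]);
	} else {
		for (i = 0; i < n; i++)
			dst[i] = src[i];
	}
}
```

Both copy variants produce identical output. Whether the hoisted form
(or a wider chunk size) is actually faster depends on the compiler and
the architecture, which is why measured numbers are needed.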