Artur Skawina wrote: > > and my atom seems to like the compact loops too: no, that was wrong, i forgot to turn off the ondemand governor... the unrolled loops are in fact much faster and the numbers look more reasonable, after a few tweaks even on a P4. Now i just need to check how well it does compared to the asm implementations... artur # TIME[s] SPEED[MB/s] # ATOM rfc3174 2.199 27.75 linus 0.8642 70.62 linusas 1.606 38.01 linusas2 0.8763 69.65 mozilla 2.813 21.7 mozillaas 2.539 24.04 # P4 rfc3174 1.402 43.53 linus 0.5835 104.6 linusas 0.4625 132 linusas2 0.4456 137 mozilla 1.529 39.91 mozillaas 1.131 53.96 # P3 rfc3174 5.019 12.16 linus 1.86 32.81 linusas 3.108 19.64 linusas2 1.812 33.68 mozilla 6.431 9.49 mozillaas 5.868 10.4 -- To unsubscribe from this list: send the line "unsubscribe git" in the body of a message to majordomo@xxxxxxxxxxxxxxx More majordomo info at http://vger.kernel.org/majordomo-info.html