On Tue, Apr 24, 2012 at 9:10 AM, Peter Zijlstra <a.p.zijlstra@xxxxxxxxx> wrote: > Grow rudimentary u128 support without relying on gcc/libgcc. > > +#ifndef add_u128 > +static inline u128 add_u128(u128 a, u128 b) > +{ > + u128 res; > + > + res.hi = a.hi + b.hi; > + res.lo = a.lo + b.lo; > + > + if (res.lo < a.lo || res.lo < b.lo) > + res.hi++; This is wrong. Or at least stupid. Just do one of the comparisons, not both. If overflow occurs, the result will be smaller than *either* of the added numbers, so comparing both is just silly and confused. So just pick one. Also, it might be worth looking at code generation, to see if it's better to just do a.hi += b.hi; a.low += b.low; if (a.low < b.low) a.hi++; return a; because that might make it clear that there are fewer actual values live at any particular time. But gcc may not care. Try it. Also, for the multiply, please make sure gcc knows to do a "32x32->64" multiplication, rather than thinking it needs to do full 64x64 multiplies.. I'm not sure gcc understands that as you wrote it. You are probably better off actually using 32-bit values, and then an explicit cast, ie u32 a32_0 = .. low 32 bits of a .. u32 b32_0 = .. low 32 bits of b .. u64 res64_0 = (u64) a32_0 * (u64) b32_0; but if gcc understands it from the shifts and masks, I guess it doesn't matter. Linus -- To unsubscribe from this list: send the line "unsubscribe linux-arch" in the body of a message to majordomo@xxxxxxxxxxxxxxx More majordomo info at http://vger.kernel.org/majordomo-info.html