Meissner, Michael wrote:
Note all of the Linux 64-bit distributions only ship one set of binaries, so the code generated by the compilers is the same. There is at least one instruction (cmpxcg16) that Intel has that AMD doesn't, but the compiler doesn't generate it.
However, using GCC 4.x and optimizing for 'k8' vs. 'nocona' can make a noticeable performance difference. The resulting code may run on both sorts of CPUs, but the instruction scheduling is apparently quite different.