On 1/9/2018 3:41 AM, Paolo Bonzini wrote:
> The above ("IBRS simply disables the indirect branch predictor") was my take-away message from private discussion with Intel. My guess is that the vendors are just handwaving a spec that doesn't match what they have implemented, because honestly a microcode update is unlikely to do much more than an old-fashioned chicken bit. Maybe on Skylake it does though, since the performance characteristics of IBRS are so different from previous processors. Let's ask Arjan who might have more information about it, and hope he actually can disclose it...
IBRS will ensure that, when set after the ring transition, no earlier branch prediction data is used for indirect branches while IBRS is set.

(This is an English summary of two pages of technical spec, so it lacks the language-lawyer precision.)

Because of this promise, the implementation tends to have a significant performance impact, and it is very strongly recommended that retpoline is used instead of IBRS (with all the caveats already on lkml).

The IBPB is different: it is a convenient thing for switching between VM guests etc.
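
To make the IBRS/IBPB distinction above concrete, here is a minimal userspace sketch. It is not taken from the spec or from any kernel patch. The MSR numbers and bit positions (IA32_SPEC_CTRL at 0x48 with IBRS in bit 0, IA32_PRED_CMD at 0x49 with IBPB in bit 0) are the architecturally documented ones; everything else is an assumption made for illustration: mock_wrmsr() is a hypothetical stand-in that only records writes, the hook names kernel_entry(), kernel_exit() and switch_guest() are invented, and clearing IBRS on the way back to user mode is one possible policy, not something stated in this thread.

```c
#include <assert.h>
#include <stdint.h>

/* Architecturally documented MSR numbers and bit positions. */
#define MSR_IA32_SPEC_CTRL 0x48u
#define MSR_IA32_PRED_CMD  0x49u
#define SPEC_CTRL_IBRS     (1ull << 0)  /* mode bit: stays in effect while set */
#define PRED_CMD_IBPB      (1ull << 0)  /* command bit: one-shot barrier */

/* Hypothetical stand-in for the privileged WRMSR instruction.
 * It only records each write so the sequence can be inspected. */
#define MSR_LOG_MAX 16
struct msr_write {
    uint32_t msr;
    uint64_t val;
};
static struct msr_write msr_log[MSR_LOG_MAX];
static int msr_log_len;

static void mock_wrmsr(uint32_t msr, uint64_t val)
{
    assert(msr_log_len < MSR_LOG_MAX);
    msr_log[msr_log_len].msr = msr;
    msr_log[msr_log_len].val = val;
    msr_log_len++;
}

/* Hypothetical hook run right after the ring transition into the kernel:
 * IBRS is written here, after the transition, so that earlier branch
 * prediction data is not used for indirect branches while it is set. */
static void kernel_entry(void)
{
    mock_wrmsr(MSR_IA32_SPEC_CTRL, SPEC_CTRL_IBRS);
}

/* Hypothetical hook run before returning to user mode.
 * Clearing IBRS here is an assumption of this sketch. */
static void kernel_exit(void)
{
    mock_wrmsr(MSR_IA32_SPEC_CTRL, 0);
}

/* Hypothetical hook run when switching between VM guests:
 * IBPB is a single barrier command, not a mode that stays enabled. */
static void switch_guest(void)
{
    mock_wrmsr(MSR_IA32_PRED_CMD, PRED_CMD_IBPB);
}
```

Calling kernel_entry(), switch_guest() and kernel_exit() in that order leaves three entries in msr_log: a write of 1 to 0x48, a write of 1 to 0x49, and a write of 0 to 0x48. The point of the sketch is only the shape of the two interfaces: IBRS is a setting that has to be re-established on each entry, which is where the cost comes from, while IBPB is issued once at a context boundary.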