> cpus execute what they see. speculative execution does the same > except results are not committed to visible registers and stay > in renanmed/shadow set. There is no 'undo' of the speculative execution. > The whole issue is that cache and branch predictor don't have > a shadow unlike registers. Can I suggest you read something like "Exploitig Value Locaity to Exceed The Dataflow Limit" by Lipasti and Shen 1996. In other words there are at least two problems with Linus proposal 1. The ffff/0000 mask has to be generated and that has to involve speculative flows. 2. There are processors on the planet that may speculate not just what instruction to execute but faced with a stall on an input continue by using an educated guess at the value that will appear at the input in future. > > > I think "lets sprinkle lfence everywhere" approach is going to > > > cause serious performance degradation. Yet people pushing for lfence > > > didn't present any numbers. > > > > Normally people who propose security fixes don't have to argue about the > > fact they added 30 clocks to avoid your box being 0wned. > > > > If you look at the patches very very few are in remotely hot paths, which > > is good news. The ones in hot paths definitely need careful scrutiny but > > the priority ought to be fixing and then optimizing unless its obvious > > how to tackle it. > > fdtable, access_ok and networking are clearly hot, Those points are not when compared to a lot of paths executed far far more (like packet processing). The ipv4/6 ones are mixing lfences and a lot of stores so probably hurt with lfence. fdtable worries me a lot less and is anyway clearly required. I have reviewing the packet building ones on my TODO list for a reason. Alan