Hi Borislav, Peter, On Tue, Jul 05, 2022 at 02:50:34PM -0700, H. Peter Anvin wrote: > On July 5, 2022 12:57:04 PM PDT, Borislav Petkov <bp@xxxxxxxxx> wrote: > >On Tue, Jul 05, 2022 at 09:44:17PM +0200, Jason A. Donenfeld wrote: > >> Oh, huh. Maybe in that case I should adjust the message to say "consider > >> using `random.trust_cpu=0`," which is the thing that would actually make > >> a security difference. > > > >Why isn't that option documented in > >Documentation/admin-guide/kernel-parameters.txt? Maybe you're not grepping the right tree? zx2c4@thinkpad ~/Projects/random-linux $ grep trust_cpu Documentation/admin-guide/kernel-parameters.txt random.trust_cpu={on,off} https://git.kernel.org/pub/scm/linux/kernel/git/crng/random.git/tree/Documentation/admin-guide/kernel-parameters.txt#n4506 > >> But actually, one thing that wasn't clear to me was: does `nordrand` > >> affect what userspace sees? While random.c is okay in lots of > >> circumstances, I could imagine `nordrand` playing a role in preventing > >> userspace from using it, which might be desirable. Is this the case? If > >> so, I can remove the nordrand chunk from this patch for v2. If not, I'll > >> adjust the text to mention `random.trust_cpu=0`. > > > >Unfortunately, it doesn't disable the instruction. It would be lovely if > >we had a switch like that... > > > >That's why this message is supposed to be noisy so that people can pay > >attention at least. I was wondering if it somehow removed it from cpuid. But I guess that's not possible. So okay, no real userspace effect. I think I agree with you then: > >> In the sense that random.c can handle mostly any input without making > >> the quality worse. So, you can't accidentally taint it. The only risk is > >> if it thinks RDRAND is good and trustable when it isn't, but that's what > >> `random.trust_cpu=0` is for. > > > >And that's why I'm saying that if you detect RDRAND returning the > >same thing over and over again, you should simply stop using it. > >Automatically. Not rely on the user to do anything. > > > > It's just math. The only variable is your confidence level, i.e. at > what level do you decide that the likelihood of pure chance is way > smaller than the likelihood of hardware failure. For example, the > likelihood of m n-bit samples in a row being identical is > 2^-(n*(m-3/2)), and the likelihood of the CPU being destroyed by a > meterorite in the same microsecond is about 2^-100. I think I'm on board with that general plan of adding a little online selftest that's better than what's there now and using that to get rid of nordrand. I don't want to instrument every invocation like you suggested, because this has effects on forward secrecy (e.g. it's nice to burn previous results from memory). But doing a little test at boot up better than what we have now seems like a good idea. So let's do this - I'll send a v2 changing this patch to be a bit more boring and just get rid of CONFIG_ARCH_RANDOM. That'll be straight forward. And then Peter - do you want to take a stab at doing the selftest in order to get rid of nordrand? Or would you prefer I try? It sounds like you have a specific idea of what you'd like there, so maybe that's best? For now, v2 of this patch sans nordrand is incoming shortly. Jason