On Sun, Jul 28, 2019 at 10:00:09PM -0400, Theodore Y. Ts'o wrote: > On Fri, Jul 26, 2019 at 03:41:41PM -0700, Eric Biggers wrote: > > +- The kernel cannot magically wipe copies of the master key(s) that > > + userspace might have as well. Therefore, userspace must wipe all > > + copies of the master key(s) it makes as well. Naturally, the same > > + also applies to all higher levels in the key hierarchy. Userspace > > + should also follow other security precautions such as mlock()ing > > + memory containing keys to prevent it from being swapped out. > > Normally, shouldn't userspace have wiped all copies of the master key > after they have called ADD_KEY? Why should they be left hanging > around? Waiting until REMOVE_KEY to remove other copies of the master > key seems.... late. Correct, normally userspace should wipe its copy of the key immediately after adding it to the kernel. I'll clarify that here. > > > +- In general, decrypted contents and filenames in the kernel VFS > > + caches are freed but not wiped. Therefore, portions thereof may be > > + recoverable from freed memory, even after the corresponding key(s) > > + were wiped. To partially solve this, you can set > > + CONFIG_PAGE_POISONING=y in your kernel config and add page_poison=1 > > + to your kernel command line. However, this has a performance cost. > > ... and even this won't help if you have swap configured.... Yes, but that's a larger issue. Unencrypted data can be written to swap and then be recovered from disk offline. This has nothing to do with whether the key is ever removed on-line or not. So swap really could use its own mention somewhere else, maybe in the "Offline attacks" section. > > > +v1 encryption policies have some weaknesses with respect to online > > +attacks: > > + > > +- There is no verification that the provided master key is correct. > > + Consequently, malicious users can associate the wrong key with > > + encrypted files, even files to which they have only read-only > > + access. > > Yes, but they won't be able to trick other users into using that > incorrect key. With the old interface, it gets written into the > user's session keyring, which won't get used by another user. And > with the newer interface, only root is allowed to set v1 key. > As mentioned in a previous reply, they *can* trick other users into using that incorrect key, by opening files using that incorrect key. The incorrect key is then cached for everyone. (This assumes the other users have at least read access to the file. If it's mode 0700, this won't work.) > > +Master keys should be pseudorandom, i.e. indistinguishable from random > > +bytestrings of the same length. This implies that users **must not** > > +directly use a password as a master key, zero-pad a shorter key, or > > +repeat a shorter key. > > These paragraphs starts a bit funny, since we first say "should" in > the first sentence, and then it's followed up by "**must not**" in the > second sentence. Basically, they *could* do this, but it would just > weaken the security of the system significantly. > > At the very least, we should explain the basis of the recommendation. I think we should go with "must" instead of "should". Basically the point of this paragraph is to explain that the API takes a real cryptographic key of the full given length. Otherwise the security guarantees for the algorithms the master key may be used in (AES-128-ECB KDF, HKDF-SHA512, or Adiantum) aren't guaranteed to hold. One can argue about how much of a problem this actually is, like how unsalted HKDF on a key with unevenly distributed entropy is *probably* fine in practice (and much better than the AES-128-ECB KDF). But the security proof for unsalted HKDF actually still assumes a pseudorandom key. It's only randomly salted HKDF that doesn't. I'd strongly prefer to go with *must* for things that are necessary for the security proofs or cryptanalysis to apply, even if they *might* still be "good enough" in practice. I'll try to find a better way to word this paragraph. > > > +The KDF used for a particular master key differs depending on whether > > +the key is used for v1 encryption policies or for v2 encryption > > +policies. Users **must not** use the same key for both v1 and v2 > > +encryption policies. > > "Must not" seems a bit strong. If they do, and a v1 per-file key and > nonce leaks out, then the encryption key will be compromised. So the > strength of the key will be limited by the weaknesses of the v1 > scheme. But it's not like using a that was originally meant for v1, > and then using it for v2, causes any additional weakness. Right? > Probably, but we don't know for sure. It's theoretically possible that cryptanalysis of two cryptographic primitives A and B, where they are each given the same key, could be much easier than attacking A or B individually. So again, I'd prefer to go with *must not* for things where there is no theory of cryptography that says it is okay, even if *probably* someone could get away with doing it in practice. - Eric