Re: ghash

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



On Fri, Jul 19, 2019 at 08:49:02PM +0000, Pascal Van Leeuwen wrote:
> Hi Eric,
> 
> > -----Original Message-----
> > From: linux-crypto-owner@xxxxxxxxxxxxxxx <linux-crypto-owner@xxxxxxxxxxxxxxx> On Behalf Of Eric Biggers
> > Sent: Friday, July 19, 2019 9:57 PM
> > To: Pascal Van Leeuwen <pvanleeuwen@xxxxxxxxxxxxxx>
> > Cc: linux-crypto@xxxxxxxxxxxxxxx; Herbert Xu <herbert@xxxxxxxxxxxxxxxxxxx>; davem@xxxxxxxxxxxxx
> > Subject: Re: ghash
> > 
> > Hi Pascal,
> > 
> > On Fri, Jul 19, 2019 at 07:26:02PM +0000, Pascal Van Leeuwen wrote:
> > > > -----Original Message-----
> > > > From: linux-crypto-owner@xxxxxxxxxxxxxxx <linux-crypto-owner@xxxxxxxxxxxxxxx> On Behalf Of Eric Biggers
> > > > Sent: Friday, July 19, 2019 6:16 PM
> > > > To: Pascal Van Leeuwen <pvanleeuwen@xxxxxxxxxxxxxx>
> > > > Cc: linux-crypto@xxxxxxxxxxxxxxx; Herbert Xu <herbert@xxxxxxxxxxxxxxxxxxx>; davem@xxxxxxxxxxxxx
> > > > Subject: Re: ghash
> > > >
> > > > On Fri, Jul 19, 2019 at 02:05:01PM +0000, Pascal Van Leeuwen wrote:
> > > > > Hi,
> > > > >
> > > > > While implementing GHASH support for the inside-secure driver and wondering why I couldn't get
> > > > > the test vectors to pass I have come to the conclusion that ghash-generic.c actually does *not*
> > > > > implement GHASH at all. It merely implements the underlying chained GF multiplication, which,
> > > > > I understand, is convenient as a building block for e.g. aes-gcm but is is NOT the full GHASH.
> > > > > Most importantly, it does NOT actually close the hash, so you can trivially add more data to the
> > > > > authenticated block (i.e. the resulting output cannot be used directly without external closing)
> > > > >
> > > > > GHASH is defined as GHASH(H,A,C) whereby you do this chained GF multiply on a block of AAD
> > > > > data padded to 16 byte alignment with zeroes, followed by a block of ciphertext padded to 16
> > > > > byte alignment with zeroes, followed by a block that contains both AAD and cipher length.
> > > > >
> > > > > See also https://en.wikipedia.org/wiki/Galois/Counter_Mode
> > > > >
> > > > > Regards,
> > > > > Pascal van Leeuwen
> > > > > Silicon IP Architect, Multi-Protocol Engines @ Verimatrix
> > > > > www.insidesecure.com
> > > > >
> > > >
> > > > Yes that's correct.  The hash APIs don't support multi-argument hashes, so
> > > > there's no natural way for it to be "full GHASH".  So it relies on the caller to
> > > > format the AAD and ciphertext into a single stream.  IMO it really should be
> > > > called something like "ghash_core".
> > > >
> > > > Do you have some question or suggestion, or was this just an observation?
> > > >
> > > Well, considering it's pretending to be GHASH I was more less considering this a bug report ...
> > >
> > > There's the inherent danger that someone not aware of the actual implementation tries to
> > > use it as some efficient (e.g. due to instruction set support) secure authentication function.
> > > Which, without proper external data formatting, it's surely not in its current form. This is
> > > not something you will actually notice when just using it locally for something (until
> > > someone actually breaks it).
> > 
> > You do understand that GHASH is not a MAC, right?  It's only a universal
> > function.  
> >
> It's a universal keyed hash. Which you could use as a MAC, although, admittedly,
> it would be rather weak, which is why the tag is usually additionally encrypted.
> (which you could do externally, knowing that that's needed with GHASH)
> In any case, the crypto API's ghash does not do what you would expect of a GHASH.

Well you could also use CRC-32 "as a MAC".  That doesn't actually make it a MAC.

> 
> > Specifically, "almost-epsilon-XOR-universal".  So even if there was a
> > more natural API to access GHASH, it's still incorrect to use it outside of a
> > properly reviewed crypto mode of operation.  IOW, anyone using GHASH directly as
> > a MAC is screwed anyway no matter which API they are using, or misusing.
> > 
> "Anyone" may actually know what they're doing. But rely on ghash being GHASH ...
> So this is a rather lame argument.
> 
> > >
> > > And then there was the issue of wanting the offload it to hardware, but that's kind of hard
> > > if the software implementation does not follow the spec where the hardware does ...
> > >
> > > I think more care should be taken with the algorithm naming - if it has a certain name,
> > > you expect it to follow the matching specification (fully). I have already identified 2 cases
> > > now (xts and ghash) where that is not actually the case.
> > >
> > 
> > If you take an (AD, Ctext) pair and format it into a single data stream the way
> > the API requires, then you will get the result defined by the specification.  So
> > it does follow the specification as best it can given the API which takes a
> > single data stream.  As I said though, I think "ghash_core" would be a better
> > name.  Note that it was added 10 years ago; I'm not sure it can actually be
> > renamed, but there may be a chance since no one should be using it directly.
> > 
> Ok, I understand that it's legacy so maybe we should keep it as is.
> It's terribly confusing though - I spent days trying to get my acceleration to
> work (i.e. pass testmgr) only to realise after reverse engineering the generic 
> implementation that it's not really ghash at all.
> 
> I guess the real problem is that this information can currently only be 
> obtained by fully reverse engineering the implementation. And I'm a firm
> believer in the natural order of things: programmers write code, compilers 
> read code, not the other way around ...
> 
> > So are you proposing that it be renamed?  Or are you proposing that a multi
> > argument hashing API be added?  Or are you proposing that universal functions
> > not be exposed through the crypto API?  What specifically is your constructive
> > suggestion to improve things?
> > 
> I guess my constructive suggestion *for the future* would be to be more careful
> with the naming. Don't give something a "known" name if it does not comply with
> the matching specification. Renaming stuff now is probably counter-productive,
> but putting some remarks somewhere (near the actual test vectors may work?)
> about implementation x not actually being known entity X would be nice.
> (Or even just some reference on where the test vectors came from!)
> 

I think a comment at the top of ghash-generic.c would be helpful, similar to the
one I wrote in nhpoly1305.c explaining that particular algorithm.

I'm surprised that you spent "days" debugging this, though.  Since the API gives
you a single data stream, surely you would have had to check at some point how
the two formal arguments (AAD, ciphertext) were encoded into it?  Were you just
passing the whole thing as the AAD or something?  Also to reiterate, it actually
does implement the GHASH algorithm correctly; the two formal parameters just
have to be encoded into the single data stream in a particular way.

- Eric



[Index of Archives]     [Kernel]     [Gnu Classpath]     [Gnu Crypto]     [DM Crypt]     [Netfilter]     [Bugtraq]

  Powered by Linux