On 14/04/10 03:44, ext Herbert Xu wrote:
On Tue, Apr 13, 2010 at 06:21:44PM +0300, Dmitry Kasatkin wrote:
On 13/04/10 18:16, ext Uri Simchoni wrote:
Doing step 3 using sw is probably faster than by hw (because it's short and avoid all the hw setup), so the suggested approach is probably faster than generic async hmac.
Yes. that is exactly what happens in hw - it is much slower.
And I do not see any problems as well with finishing it with sw.
I never said that you can't do it in software. I just don't
want every driver to write its own copy of hmac in software.
I understand that it is better to have more "generic" approach and not
duplicate that.
I am not objecting that.
The key point is that we use HW accelerators to get speedup, release CPU
for something else and reduce power consumption (if possible and
specific hw does it more energy efficiently).
Base on above facts hw drivers need to make certain optimization.
And that optimization often does not fit well to the "generic" way.
Like just with import/export.
Problems for hw:
1. To have a good performance with DMA we need to have large buffer.
Not just 64 bytes block. state becomes large
2. supporting concurrent requests means switching HW state and it takes
a time.
All that burden significantly slowdown hw acceleration and increase CPU
load.
I guess my points are understandable.
thanks
If you're going to do it in software, then let's make an ahash
version of hmac.
Cheers,
--
To unsubscribe from this list: send the line "unsubscribe linux-crypto" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html