Re: Adding compression/checksum support for bluestore.

Willem Jan Withagen <wjw@xxxxxxxxxxx> · Thu, 7 Apr 2016 11:51:56 +0200

On 7-4-2016 04:59, Chris Dunlop wrote:
On Thu, Apr 07, 2016 at 12:52:48AM +0000, Allen Samuels wrote:
So, what started this entire thread was Sage's suggestion that for HDD we
would want to increase the size of the block under management. So if we
assume something like a 32-bit checksum on a 128Kbyte block being read
from 5ZB Then the odds become:

1 - (2^-32 * (1-(10^-15))^(128 * 8 * 1024) - 2^-32 + 1) ^ ((5 * 8 * 10^21) / (4 * 8 * 1024))

Which is

0.257715899051042299960931575773635333355380139960141052927

Which is 25%. A big jump ---> That's my point :)

Oops, you missed adjusting the second checksum term, it should be:

1 - (2^-32 * (1-(10^-15))^(128 * 8 * 1024) - 2^-32 + 1) ^ ((5 * 8 * 10^21) / (128 * 8 * 1024))
= 0.009269991973796787500153031469968391191560327904558440721

...which is different to the 4K block case starting at the 12th digit. I.e. not very different.

Which is my point! :)

Sorry for posting something this vague, but my memory (and Google) is 
playing games with me.

I have not so recently read some articles about this when I was studying 
ZFS which has a
similar problem. Since it also aims for ZettaByte storage, and what I 
took from that discussion
is that most of the CRC32 checksumtypes are susceptible to bit-error 
clustering. Which means that
there is a bigger chance for a faulty block or set of error bits to go 
undetected.

Like I said, sorry for not being able to be more specific atm.

There is an article on Wikipedia that discusses Fletcher algorithms, 
strength and weakness:
https://en.wikipedia.org/wiki/Fletcher's_checksum

--WjW

--
To unsubscribe from this list: send the line "unsubscribe ceph-devel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at  http://vger.kernel.org/majordomo-info.html