Re: Failure probability with largish deployments

Hi Kyle,

It would be great if you could share how you invoked the tool. I'm tempted to play with it, and an example would help a great deal :-)

Cheers

On 20/12/2013 22:37, Kyle Bader wrote:
> Using your data as inputs to the Ceph reliability calculator [1]
> results in the following:
> 
> Disk Modeling Parameters
>     size:           3TiB
>     FIT rate:        826 (MTBF = 138.1 years)
>     NRE rate:    1.0E-16
> RAID parameters
>     replace:           6 hours
>     recovery rate:  500MiB/s (100 minutes)
>     NRE model:              fail
>     object size:            4MiB
> 
> Column legends
> 1 storage unit/configuration being modeled
> 2 probability of object survival (per 1 years)
> 3 probability of loss due to site failures (per 1 years)
> 4 probability of loss due to drive failures (per 1 years)
> 5 probability of loss due to NREs during recovery (per 1 years)
> 6 probability of loss due to replication failure (per 1 years)
> 7 expected data loss per Petabyte (per 1 years)
> 
>     storage               durability    PL(site)  PL(copies)     PL(NRE)     PL(rep)    loss/PiB
>     ----------            ----------  ----------  ----------  ----------  ----------  ----------
>     RAID-6: 9+2              6-nines   0.000e+00   2.763e-10   0.000011%   0.000e+00   9.317e+07
> 
> 
> Disk Modeling Parameters
>     size:           3TiB
>     FIT rate:        826 (MTBF = 138.1 years)
>     NRE rate:    1.0E-16
> RADOS parameters
>     auto mark-out:     10 minutes
>     recovery rate:    50MiB/s (40 seconds/drive)
>     osd fullness:      75%
>     declustering:    1100 PG/OSD
>     NRE model:              fail
>     object size:      4MB
>     stripe length:   1100
> 
> Column legends
> 1 storage unit/configuration being modeled
> 2 probability of object survival (per 1 years)
> 3 probability of loss due to site failures (per 1 years)
> 4 probability of loss due to drive failures (per 1 years)
> 5 probability of loss due to NREs during recovery (per 1 years)
> 6 probability of loss due to replication failure (per 1 years)
> 7 expected data loss per Petabyte (per 1 years)
> 
>     storage               durability    PL(site)  PL(copies)     PL(NRE)     PL(rep)    loss/PiB
>     ----------            ----------  ----------  ----------  ----------  ----------  ----------
>     RADOS: 3 cp             10-nines   0.000e+00   5.232e-08   0.000116%   0.000e+00   6.486e+03
> 
> [1] https://github.com/ceph/ceph-tools/tree/master/models/reliability
> 
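
For anyone wanting to sanity-check the derived figures in the quoted report (the MTBF and the two recovery times), here is a minimal Python sketch of the arithmetic. It is not taken from the tool's source. The function names are hypothetical, and the formulas are only my reading of the numbers: FIT is taken as failures per 10^9 device-hours, a year as 365.25 days, and sizes and rates as decimal units (the report labels them TiB and MiB, but decimal values are what reproduce the printed figures). The RADOS case assumes each drive's used data is spread over its placement groups and that the quoted time is for one such share.

```python
# Assumption: one year = 365.25 days; this reproduces "138.1 years" for FIT 826.
HOURS_PER_YEAR = 24 * 365.25

TB = 1e12  # assumption: decimal units, despite the "TiB" label in the report
MB = 1e6   # assumption: decimal units, despite the "MiB" label in the report


def fit_to_mtbf_years(fit):
    """Convert a FIT rate (failures per 10^9 device-hours) to MTBF in years."""
    return 1e9 / fit / HOURS_PER_YEAR


def full_drive_recovery_minutes(size_bytes, rate_bytes_per_s):
    """Time to rebuild an entire drive at a fixed recovery rate (RAID case)."""
    return size_bytes / rate_bytes_per_s / 60


def declustered_recovery_seconds(size_bytes, fullness, pgs_per_osd, rate_bytes_per_s):
    """Time to re-replicate one placement group's share of a drive (RADOS case).

    Assumes the used portion of the drive is split evenly across its PGs.
    """
    return size_bytes * fullness / pgs_per_osd / rate_bytes_per_s


if __name__ == "__main__":
    print("MTBF (years):", round(fit_to_mtbf_years(826), 1))
    print("RAID rebuild (minutes):", full_drive_recovery_minutes(3 * TB, 500 * MB))
    print("RADOS recovery (seconds):",
          round(declustered_recovery_seconds(3 * TB, 0.75, 1100, 50 * MB), 1))
```

With the inputs from the report, this gives an MTBF that rounds to 138.1 years and a RAID rebuild time of 100 minutes. The RADOS figure comes out just under 41 seconds, close to the 40 seconds/drive shown, so the tool may truncate or use slightly different inputs.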

-- 
Loïc Dachary, Artisan Logiciel Libre
