Re: Sharing SSD journals and SSD drive choice

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



On 02-05-17 23:53, David Turner wrote:
> I was only interjecting on the comment "So that is 5 . Which is real
> easy to obtain" and commenting on what the sustained writes into a
> cluster of 2,000 OSDs would require to actually sustain that 5 MBps on
> each SSD journal.

Reading your calculation below I understand where the 2000 comes from.
I meant that hardware of the previous millennium could easily write 5
Mbyte/sec sustained. :)

This does NOT otherwise invalidate your interesting math below.
And your conclusion is an important one.

Go back to 200 OSDs and 25 SSDs and you end up with 40MB/s sustained
writes to wear out your SSDs exactly at then end of the warranty.
Higher sustained writes will linearly shorten your SSD lifetime.

--WjW

> My calculation was off because I forgot replica size, but my corrected
> math is this...
> 
> 5 MBps per journal device
> 8 OSDs per journal (overestimated number as most do 4)
> 2,000 OSDs based on what you said "Which is real easy to obtain, even
> with hardware 0f 2000."
> 3 replicas
> 
> 2,000 OSDs / 8 OSDs per journal = 250 journal SSDs
> 250 SSDs * 5 MBps = 1,250 MBps / 3 replicas = 416.67 MBps required
> sustained cluster write speed to cause each SSD to average 5 MBps on
> each journal device.
> 
> Swap out any variable you want to match your environment.  For example,
> if you only have 4 OSDs per journal device, that number would be double
> for a cluster this size to require a cluster write speed of
> 833.33 MBps to average 5 MBps on each journal.  Also if you have less
> than 2,000 OSDs, then everything shrinks fast.
> 
> 
> On Tue, May 2, 2017 at 5:39 PM Willem Jan Withagen <wjw@xxxxxxxxxxx
> <mailto:wjw@xxxxxxxxxxx>> wrote:
> 
>     On 02-05-17 19:54, David Turner wrote:
>     > Are you guys talking about 5Mbytes/sec to each journal device? 
>     Even if
>     > you had 8 OSDs per journal and had 2000 osds... you would need a
>     > sustained 1.25 Gbytes/sec to average 5Mbytes/sec per journal device.
> 
>     I'm not sure I'm following this...
>     But I'm rather curious.
>     Are you saying that the required journal bandwidth versus OSD write
>     bandwidth has an approx 1:200 ratio??
> 
>     Note that I took it the other way.
>     Given the Intel specs
>      - What sustained bandwidth is allowed to have the device last its
>     lifetime.
>      - How much more usage would a 3710 give in regards to a 3520 SSD per
>        dollar spent.
> 
>     --WjW
> 
>     > On Tue, May 2, 2017 at 1:47 PM Willem Jan Withagen
>     <wjw@xxxxxxxxxxx <mailto:wjw@xxxxxxxxxxx>
>     > <mailto:wjw@xxxxxxxxxxx <mailto:wjw@xxxxxxxxxxx>>> wrote:
>     >
>     >     On 02-05-17 19:16, Дробышевский, Владимир wrote:
>     >     > Willem,
>     >     >
>     >     >   please note that you use 1.6TB Intel S3520 endurance
>     rating in your
>     >     > calculations but then compare prices with 480GB model, which
>     has only
>     >     > 945TBW or 1.1DWPD (
>     >     >
>     >   
>      https://ark.intel.com/products/93026/Intel-SSD-DC-S3520-Series-480GB-2_5in-SATA-6Gbs-3D1-MLC
>     >     > ). It also worth to notice that S3710 has tremendously
>     higher write
>     >     > speed\IOPS and especially SYNC writes. Haven't seen S3520
>     real sync
>     >     > write tests yet but don't think they differ much from S3510
>     ones.
>     >
>     >     Arrgh, you are right. I guess I had too many pages open, and
>     copied the
>     >     wrong one.
>     >
>     >     But the good news is that the stats were already in favour of
>     the 3710
>     >     so this only increases that conclusion.
>     >
>     >     The bad news is that the sustained write speed goes down with a
>     >     factor 4.
>     >     So that is 5Mbyte/sec. Which is real easy to obtain, even with
>     hardware
>     >     0f 2000.
>     >
>     >     --WjW
>     >
>     >
>     >     > Best regards,
>     >     > Vladimir
>     >     >
>     >     > 2017-05-02 21:05 GMT+05:00 Willem Jan Withagen
>     <wjw@xxxxxxxxxxx <mailto:wjw@xxxxxxxxxxx>
>     >     <mailto:wjw@xxxxxxxxxxx <mailto:wjw@xxxxxxxxxxx>>
>     >     > <mailto:wjw@xxxxxxxxxxx <mailto:wjw@xxxxxxxxxxx>
>     <mailto:wjw@xxxxxxxxxxx <mailto:wjw@xxxxxxxxxxx>>>>:
>     >     >
>     >     >     On 27-4-2017 20:46, Alexandre DERUMIER wrote:
>     >     >     > Hi,
>     >     >     >
>     >     >     >>> What I'm trying to get from the list is /why/ the
>     >     "enterprise" drives
>     >     >     >>> are important. Performance? Reliability? Something else?
>     >     >     >
>     >     >     > performance, for sure (for SYNC write,
>     >   
>      https://www.sebastien-han.fr/blog/2014/10/10/ceph-how-to-test-if-your-ssd-is-suitable-as-a-journal-device/
>     >     >
>     >     
>     <https://www.sebastien-han.fr/blog/2014/10/10/ceph-how-to-test-if-your-ssd-is-suitable-as-a-journal-device/>)
>     >     >     >
>     >     >     > Reliabity : yes, enteprise drive have supercapacitor
>     in case
>     >     of powerfailure, and endurance (1 DWPD for 3520, 3 DWPD for 3610)
>     >     >     >
>     >     >     >
>     >     >     >>> Also, 4 x Intel DC S3520 costs as much as 1 x Intel DC
>     >     S3610. Obviously
>     >     >     >>> the single drive leaves more bays free for OSD
>     disks, but
>     >     is there any
>     >     >     >>> other reason a single S3610 is preferable to 4 S3520s?
>     >     Wouldn't 4xS3520s
>     >     >     >>> mean:
>     >     >     >
>     >     >     > where do you see this price difference ?
>     >     >     >
>     >     >     > for me , S3520 are around 25-30% cheaper than S3610
>     >     >
>     >     >     I just checked for the DCS3520 on
>     >     >
>     >     
>     https://ark.intel.com/nl/products/93005/Intel-SSD-DC-S3520-Series-1_6TB-2_5in-SATA-6Gbs-3D1-MLC
>     >     >
>     >     
>     <https://ark.intel.com/nl/products/93005/Intel-SSD-DC-S3520-Series-1_6TB-2_5in-SATA-6Gbs-3D1-MLC>
>     >     >
>     >     >     And is has a TBW of 2925 (Terrabytes Write over life time) =
>     >     2,9 PB
>     >     >     the warranty is 5 years.
>     >     >
>     >     >     Now if I do the math:
>     >     >       2925 * 104 /5 /365 /24 /60 = 1,14 Gbyte/min to be written.
>     >     >       which is approx 20Mbyte /sec
>     >     >       or approx 10Gbit/min = 0,15 Gbit/sec
>     >     >
>     >     >     And that is only 20% of the capacity of that SATA link.
>     >     >     Also writing 20Mbyte/sec sustained is not really that
>     hard for
>     >     modern
>     >     >     systems.
>     >     >
>     >     >     Now a 400Gb 3710 takes 8.3 PB, which is ruffly 3 times
>     as much.
>     >     >     so it will last 3 times longer.
>     >     >
>     >     >     Checking Amazone, I get
>     >     >             $520 for the DC S3710-400G
>     >     >             $300 for the DC S3520-480G
>     >     >
>     >     >     So that is less than a factor of 2 for using the S3710's
>     and a
>     >     3 times
>     >     >     longer lifetime. To be exact (8.3/520) / (2,9/300) =
>     1.65 more
>     >     bang for
>     >     >     your buck.
>     >     >
>     >     >     But still do not expect your SSDs to last very long if the
>     >     write rate is
>     >     >     much over that 20Mbyte/sec
>     >     >
>     >     >     --WjW
>     >     >
>     >     >
>     >     >
>     >     >     _______________________________________________
>     >     >     ceph-users mailing list
>     >     >     ceph-users@xxxxxxxxxxxxxx
>     <mailto:ceph-users@xxxxxxxxxxxxxx> <mailto:ceph-users@xxxxxxxxxxxxxx
>     <mailto:ceph-users@xxxxxxxxxxxxxx>>
>     >     <mailto:ceph-users@xxxxxxxxxxxxxx
>     <mailto:ceph-users@xxxxxxxxxxxxxx> <mailto:ceph-users@xxxxxxxxxxxxxx
>     <mailto:ceph-users@xxxxxxxxxxxxxx>>>
>     >     >     http://lists.ceph.com/listinfo.cgi/ceph-users-ceph.com
>     >     >     <http://lists.ceph.com/listinfo.cgi/ceph-users-ceph.com>
>     >     >
>     >     >
>     >     >
>     >     >
>     >     > --
>     >     >
>     >     > С уважением,
>     >     > Дробышевский Владимир
>     >     > Компания "АйТи Город"
>     >     > +7 343 2222192 <tel:+7%20343%20222-21-92>
>     <tel:+7%20343%20222-21-92>
>     >     >
>     >     > ИТ-консалтинг
>     >     > Поставка проектов "под ключ"
>     >     > Аутсорсинг ИТ-услуг
>     >     > Аутсорсинг ИТ-инфраструктуры
>     >
>     >     _______________________________________________
>     >     ceph-users mailing list
>     >     ceph-users@xxxxxxxxxxxxxx <mailto:ceph-users@xxxxxxxxxxxxxx>
>     <mailto:ceph-users@xxxxxxxxxxxxxx <mailto:ceph-users@xxxxxxxxxxxxxx>>
>     >     http://lists.ceph.com/listinfo.cgi/ceph-users-ceph.com
>     >
> 

_______________________________________________
ceph-users mailing list
ceph-users@xxxxxxxxxxxxxx
http://lists.ceph.com/listinfo.cgi/ceph-users-ceph.com




[Index of Archives]     [Information on CEPH]     [Linux Filesystem Development]     [Ceph Development]     [Ceph Large]     [Linux USB Development]     [Video for Linux]     [Linux Audio Users]     [Yosemite News]     [Linux Kernel]     [Linux SCSI]     [xfs]


  Powered by Linux