Re: IO scheduler & osd_disk_thread_ioprio_class

Jan Schermer <jan@xxxxxxxxxxx> · Tue, 23 Jun 2015 14:24:05 +0200

The drive in my test machine at the moment is a Kingston KC300 - not the most “DC” class but with good DWPD rating and worked well for us (beware of newer revisions though!).

Throttling is not the problem, I am either saturating my SATA link capacity or the drive (or both), either way CFQ is what completely corks it up...

> On 23 Jun 2015, at 14:21, Eneko Lacunza <elacunza@xxxxxxxxx> wrote:
> 
> Hi Jan,
> 
> What SSD model?
> 
> I've seen SSDs work quite well usually but suddenly give a totally awful performance for some time (not those 8K you see though).
> 
> I think there was some kind of firmware process involved, I had to replace the drive with a serious DC one.
> 
> El 23/06/15 a las 14:07, Jan Schermer escribió:
>> Yes, but that’s a separate issue :-)
>> Some drives are just slow (100 IOPS) for synchronous writes with no other load.
>> The drives I’m testing have ~8K IOPS when not under load - having them drop to 10 IOPS is a huge problem. If it’s indeed a CFQ problem (as I suspect) then no matter what drive you have you will have problems.
>> 
>> Jan
>> 
>>> On 23 Jun 2015, at 14:03, Dan van der Ster <dan@xxxxxxxxxxxxxx> wrote:
>>> 
>>> Oh sorry, I had missed that. Indeed that is surprising. Did you read
>>> the recent thread ("SSD IO performance") discussing the relevance of
>>> O_DSYNC performance for the journal?
>>> 
>>> Cheers, Dan
>>> 
>>> On Tue, Jun 23, 2015 at 1:54 PM, Jan Schermer <jan@xxxxxxxxxxx> wrote:
>>>> I only use SSDs, which is why I’m so surprised at the CFQ behaviour - the drive can sustain tens of thousand of reads per second, thousands of writes - yet saturating it with reads drops the writes to 10 IOPS - that’s mind boggling to me.
>>>> 
>>>> Jan
>>>> 
>>>>> On 23 Jun 2015, at 13:43, Dan van der Ster <dan@xxxxxxxxxxxxxx> wrote:
>>>>> 
>>>>> On Tue, Jun 23, 2015 at 1:37 PM, Jan Schermer <jan@xxxxxxxxxxx> wrote:
>>>>>> Yes, I use the same drive
>>>>>> 
>>>>>> one partition for journal
>>>>>> other for xfs with filestore
>>>>>> 
>>>>>> I am seeing slow requests when backfills are occuring - backfills hit the filestore but slow requests are (most probably) writes going to the journal - 10 IOPS is just to few for anything.
>>>>>> 
>>>>>> 
>>>>>> My Ceph version is dumpling - that explains the integers.
>>>>>> So it’s possible it doesn’t work at all?
>>>>> I thought that bug was fixed. You can check if it worked by using
>>>>> "iotop -b -n1" and looking for threads with the idle priority.
>>>>> 
>>>>>> Bad news about the backfills no being in the disk thread, I might have to use deadline after all.
>>>>> If your experience follows the same paths of most users, eventually
>>>>> deep scrubs will cause latency issues and you'll switch back to cfq
>>>>> plus ionicing the disk thread.
>>>>> 
>>>>> Are you using Ceph RBD or object storage? If RBD, eventually you'll
>>>>> find that you need to put the journals on an SSD.
>>>>> 
>>>>> Cheers, Dan
>> _______________________________________________
>> ceph-users mailing list
>> ceph-users@xxxxxxxxxxxxxx
>> http://lists.ceph.com/listinfo.cgi/ceph-users-ceph.com
> 
> 
> -- 
> Zuzendari Teknikoa / Director Técnico
> Binovo IT Human Project, S.L.
> Telf. 943575997
>      943493611
> Astigarraga bidea 2, planta 6 dcha., ofi. 3-2; 20180 Oiartzun (Gipuzkoa)
> www.binovo.es
> 
> _______________________________________________
> ceph-users mailing list
> ceph-users@xxxxxxxxxxxxxx
> http://lists.ceph.com/listinfo.cgi/ceph-users-ceph.com

_______________________________________________
ceph-users mailing list
ceph-users@xxxxxxxxxxxxxx
http://lists.ceph.com/listinfo.cgi/ceph-users-ceph.com