Re: Ceph and EnhanceIO cache

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



I assume this was caching RBD’s?

 

I have found you have to adjust the cache flushing parameters to allow flashcache to take full advantage of the high queue depths Ceph excels at. I guess it depends on workload to an extent as well. It’s probably also quite easily to max out the SDD used for the caching, something like the hsgt SSD800 range would be ideal.

 

From: Andrei Mikhailovsky [mailto:andrei@xxxxxxxxxx]
Sent: 26 June 2015 12:35
To: Nick Fisk
Cc: Dominik Zalewski; ceph-users@xxxxxxxxxxxxxx
Subject: Re: Ceph and EnhanceIO cache

 

Hi Nick,

 

I've played with Flashcache and EnhanceIO, but I've decided not to use it for production in the end. The reason was that using both has increased the amount of slow requests that I had on the cluster and I have also noticed somewhat higher level of iowait on the vms. At that time, I didn't have much time to investigate the slow requests issue and I wasn't sure what exactly is causing them. All I can say is that after disabling the caching the slow requests have stopped.

 

Perhaps others could share if they had any issues.

 

THanks

 


From: "Nick Fisk" <nick@xxxxxxxxxx>
To: "Dominik Zalewski" <dzalewski@xxxxxxxxxxxxx>
Cc: ceph-users@xxxxxxxxxxxxxx
Sent: Friday, 26 June, 2015 11:12:25 AM
Subject: Re: Ceph and EnhanceIO cache

 

I think flashcache bombs out, I must admit I have tested that yet, but as I would only be running it in writecache mode, there is no requirement I can think of for it to keep on running gracefully.

 

From: ceph-users [mailto:ceph-users-bounces@xxxxxxxxxxxxxx] On Behalf Of Dominik Zalewski
Sent: 26 June 2015 10:54
To: Nick Fisk
Cc: ceph-users@xxxxxxxxxxxxxx
Subject: Re: Ceph and EnhanceIO cache

 

Thanks for your reply. 

 

Do you know by any chance how flashcache handles SSD going offline?

 

Here is an snip from enhanceio wiki page:

 

Failure of an SSD device in read-only and write-through modes is

              handled gracefully by allowing I/O to continue to/from the
              source volume. An application may notice a drop in performance but it
              will not receive any I/O errors.
 
              Failure of an SSD device in write-back mode obviously results in the
              loss of dirty blocks in the cache. To guard against this data loss, two
              SSD devices can be mirrored via RAID 1.
 
              EnhanceIO identifies device failures based on error codes. Depending on
              whether the failure is likely to be intermittent or permanent, it takes
              the best suited action.
 
Looking at mailing list and github commits, both flashcache and enhanceio had not much going on since last  year.
 
Dominik

 

 

On Fri, Jun 26, 2015 at 10:28 AM, Nick Fisk <nick@xxxxxxxxxx> wrote:

> -----Original Message-----
> From: ceph-users [mailto:ceph-users-bounces@xxxxxxxxxxxxxx] On Behalf Of
> Dominik Zalewski
> Sent: 26 June 2015 09:59
> To: ceph-users@xxxxxxxxxxxxxx
> Subject: Ceph and EnhanceIO cache
>
> Hi,
>
> I came across this blog post mentioning using EnhanceIO (fork of flashcache)
> as cache for OSDs.
>
> http://xo4t.mj.am/link/xo4t/jg64otv/1/IlU5Wz3kOGc_MZisWxiHFQ/aHR0cDovL3hvNHQubWouYW0vbGluay94bzR0L2pndTg5NXYvMS9EbkVDQ25pdS1IV2ZUb2pwTE4xSXFBL2FIUjBjRG92TDNkM2R5NXpaV0poYzNScFpXNHRhR0Z1TG1aeUwySnNiMmN2TWpBeE5DOHhNQzh3Tmk5alpYQm9MV0Z1WkMxbGJtaGhibU5sYVc4dg
>
> http://xo4t.mj.am/link/xo4t/jg64otv/2/Ucpx5tHbl5BqHJn8VSGzgw/aHR0cDovL3hvNHQubWouYW0vbGluay94bzR0L2pndTg5NXYvMi9GVHhzMjlTaFJJcU5PZWtUQW8yd0t3L2FIUjBjSE02THk5bmFYUm9kV0l1WTI5dEwzTjBaV010YVc1akwwVnVhR0Z1WTJWSlR3
>
> I'm planning to test it with 5x 1TB HGST Travelstar 7k1000 2.5inch OSDs and
> using 256GB Transcend SSD as enhanceio cache.
>
> I'm wondering if anyone is using EnhanceIO or others like bcache, dm-cache
> with Ceph in production and what is your experience/results.

Not using in production, but have been testing all of the above both caching the OSD and RBD's.

 

If your RBD's are being used in scenarios where small sync writes are important (iscsi,database's) then caching the RBD's is almost essential. My findings:-

 

FlashCache - Probably the best of the bunch. Has sequential override and hopefully facebook will continue to maintain it
EnhanceIO - Nice that you can hot add the cache, but is likely to no longer be maintained, so risky for production
DMCache - Well maintained, but biggest problem is that it only caches writes for blocks that are already in cache
Bcache - Didn't really spend much time looking at this. Looks as if development activity is dying down and potential stability issues
DM-WriteBoost - Great performance, really suits RBD requirements. Unfortunately the ram buffering part seems to not play safe with iSCSI use.

 

With something like flashcache on a RBD I can max out the SSD with small sequential write IO's and it then passes them to the RBD in nice large IO's. Bursty random writes also benefit.

 

Caching the OSD's, or more specifically a small section where the levelDB lives can greatly improve small block write performance. Flashcache is best for this as you can limit the sequential cutoff to the levelDB transaction size. DMcache is potentially an option as well. You can probably halve OSD latency by doing this.

 

>
> Thanks
>
> Dominik

 

 

 

 


Image removed by sender.
_______________________________________________
ceph-users mailing list
ceph-users@xxxxxxxxxxxxxx
http://lists.ceph.com/listinfo.cgi/ceph-users-ceph.com

 


_______________________________________________
ceph-users mailing list
ceph-users@xxxxxxxxxxxxxx
http://lists.ceph.com/listinfo.cgi/ceph-users-ceph.com

[Index of Archives]     [Information on CEPH]     [Linux Filesystem Development]     [Ceph Development]     [Ceph Large]     [Ceph Dev]     [Linux USB Development]     [Video for Linux]     [Linux Audio Users]     [Yosemite News]     [Linux Kernel]     [Linux SCSI]     [xfs]


  Powered by Linux