Hi Brian I'm not sure if it applies to your application, and I'm not an expert. However, we have been running our solution for about a year now, and we have one of our MDS's in standby-replay. Sadly we have found a bug with extensive memory usage, and when we needed to replay, it took up to a minute, even with standby-replay. However, our solution is an online site, so a minute is the same as ages. I have also read that you are now allowed to run multiple MDS's on a file system. It will, of course, add more memory usage, and it could also lead to less performance. But the feature should be fully supported in the Pacific release. I'm curious what the correct solution is here. Best regards Daniel On Wed, Oct 6, 2021 at 12:07 AM Brian Kim <bkimstunnaboss@xxxxxxxxx> wrote: > Dear ceph-users, > > We have a ceph cluster with 3 MDS's and recently had to replay our cache > which is taking an extremely long time to complete. Is there some way to > speed up this process as well as apply some checkpoint so it doesn't have > to start all the way from the beginning? > > -- > Best Wishes, > Brian Kim > _______________________________________________ > ceph-users mailing list -- ceph-users@xxxxxxx > To unsubscribe send an email to ceph-users-leave@xxxxxxx > _______________________________________________ ceph-users mailing list -- ceph-users@xxxxxxx To unsubscribe send an email to ceph-users-leave@xxxxxxx