Hello Iain, Does anyone have any ideas of what could be the issue here or anywhere we > can check what is going on?? > > You could be hitting the slow backfill/recovery issue with mclock_scheduler. Could you please provide the output of the following commands? 1. ceph versions 2. ceph config get osd.<id> osd_op_queue 3. ceph config show osd.<id> | grep osd_max_backfills 4. ceph config show osd.<id> | grep osd_recovery_max_active 5. ceph config show-with-defaults osd.<id> | grep osd_mclock where 'id' can be any valid osd id With the mclock_scheduler enabled and with 17.2.5, it is not possible to override recovery settings like 'osd_max_backfills' and other recovery related config options. To improve the recovery rate, you can temporarily switch the mClock profile to 'high_recovery_ops' on all the OSDs by issuing: ceph config set osd osd_mclock_profile high_recovery_ops During recovery with this profile, you may notice a dip in the client ops performance which is expected. Once the recovery is done, you can switch the mClock profile back to the default 'high_client_ops' profile. Please note that the upcoming Quincy release will address the slow backfill issues along with other usability improvements. -Sridhar _______________________________________________ ceph-users mailing list -- ceph-users@xxxxxxx To unsubscribe send an email to ceph-users-leave@xxxxxxx