Snap operation throttling (again)

Andrey Korolyov <andrey@xxxxxxx> · Tue, 19 May 2015 18:06:08 +0300

Hello,

this question was brought many times before, and also solved in a
various ways - snap trimmer, scheduler` priorities and persistent fix
(for a ReplicatedPG issue), but it seems that the current Ceph
versions may suffer as well during the rollback operations on large
images and on large scale. Given CFQ scheduler for rotating media and
~10 percentage of an utilization as an initial preconditions, the
rollback operation of an one-fourth terabyte image over 100 OSDs may
result in a significant latency impact and, for such configuration,
breakage of a 30s request completion barrier. Although recent
improvements did very well in means of the congestion control for a
slow media for many kind of non-client ops, this exact issue remains.
I think it can be solved by another sleeper knob but unsure where its
proper place should be.

Thanks for suggestions!
_______________________________________________
ceph-users mailing list
ceph-users@xxxxxxxxxxxxxx
http://lists.ceph.com/listinfo.cgi/ceph-users-ceph.com