Hey folks,
recently I realized that OSD's fast shutdown (which is a default
behavior) results our new feature - dynamic allocator's map restoration
- in being working in a suboptimal mode. Due to nongracefull shutdown it
has to recover allocator's map through onode enumeration on each OSD
startup. Which might apparently take some time. Moreover RocksDB
apparently performs a sort of recovery in this case too - may be not
that long but still visible.
Please also note that one might miss the above issues when using
vstart.sh - it has got osd_fast_shutdown set to false.
I created the following ticket to track the issue:
https://tracker.ceph.com/issues/53266
Additionally we've already made some additional tricks in the code for
this fast shutdown mode, e.g. osd_fast_shutdown_notify_mon_option.
Hence given the above shouldn't we revise the need for this fast
shutdown feature? IIUC the presense of various bugs along the regular
shutdown path was one of the primary rationales for new mode
introduction. But IMO properly running graceful shutdown is a sort of
code's quality mark. And aren't we just moving the complexity/burden
from shutdown procedure to the startup one this way? So may be we better
invest in making shutdown clean enough?
Thanks,
--
Igor Fedotov
Ceph Lead Developer
Looking for help with your Ceph cluster? Contact us at https://croit.io
croit GmbH, Freseniusstr. 31h, 81247 Munich
CEO: Martin Verges - VAT-ID: DE310638492
Com. register: Amtsgericht Munich HRB 231263
Web: https://croit.io | YouTube: https://goo.gl/PGE1Bx
_______________________________________________
Dev mailing list -- dev@xxxxxxx
To unsubscribe send an email to dev-leave@xxxxxxx