http://tracker.ceph.com/issues/38724 ^seems this bug is related. I’ve added notes to it. Triggers seem to be a node reboot or remove or add a new OSD. There seem to be pack port duplicates for Mimic and Luminous
This may have an impact to production when multiple OSDs fail to start repeatedly after hitting the BUG. Linux stops the start due to too many attempts. Our production VM becomes unresponsive for about 10 minuets and then the OSD try to start again and typically starts. Sometimes it does not and we go another 10 minuets. I have had this happen and the Prod VM crashes. |
_______________________________________________ ceph-users mailing list ceph-users@xxxxxxxxxxxxxx http://lists.ceph.com/listinfo.cgi/ceph-users-ceph.com