Hi Folks,
it looks like we've got pretty severe issue in Pacific/Quincy code base
which causes data corruption when upgrading to Pacific, see:
https://tracker.ceph.com/issues/53062
The fix is available at https://github.com/ceph/ceph/pull/43687
IMO some desired further action items would be:
1) Review and backport to P ASAP
2) Inform users to refrain from using quick-fix/repair on upgraded(!)
Pacific clusters until the next(hopefully) minor release.
3) Revise QA suite which permitted such an error to sneak in.
The above mentioned patch doesn't provide data recovery for already
broken OSDs (which is actually doable) - do you think we need something
like that at the moment? There're just a couple of complains from users
so far and hopefully they've managed to recover their clusters...
Thanks,
Igor
--
Igor Fedotov
Ceph Lead Developer
Looking for help with your Ceph cluster? Contact us at https://croit.io
croit GmbH, Freseniusstr. 31h, 81247 Munich
CEO: Martin Verges - VAT-ID: DE310638492
Com. register: Amtsgericht Munich HRB 231263
Web: https://croit.io | YouTube: https://goo.gl/PGE1Bx
_______________________________________________
Dev mailing list -- dev@xxxxxxx
To unsubscribe send an email to dev-leave@xxxxxxx