Hi!
originally your issue looked like the ones from https://tracker.ceph.com/issues/42223
And it looks like lack of some key information for
FreeListManager in RocksDB.
Once you have it present we can check the content of the RocksDB
to prove this hypothesis, please let me know if you want the
guideline for that.
The last log is different, the key record is probably:
-2> 2019-10-09 23:03:47.011 7fb4295a7700 -1 rocksdb:
submit_common error: Corruption: block checksum mismatch: expected
2181709173, got 2130853119 in db/204514.sst offset 0 size 61648
code = 2 Rocksdb transaction:
which most probably denotes data corruption in DB. Unfortunately
for now I can't say if this is related to the original issue or
not.
This time it reminds the issue shared in this mailing list a
while ago by Stefan Priebe. The post caption is "Bluestore OSDs
keep crashing in BlueStore.cc: 8808: FAILED assert(r == 0)"
So first of all I'd suggest to distinguish these issues for now
and try to troubleshoot them separately.
As for the first case I'm wondering if you have any OSDs still
failing this way, i.e. asserting in allocator and showing 0
extents loaded: "_open_alloc loaded 0 B in 0 extents"
If so lets check DB content first.
For the second case I'm wondering the most if the issue is
permanent for a specific OSD or it disappears after OSD/node
restart as it occurred in Stefan's case?