if possible, could you share the mds logs at debug level 20 you'll need to set debug_mds = 20 in the conf file until the crash and revert the level to the default after mds crash On Tue, Jul 18, 2023 at 9:12 PM <dxodnd@xxxxxxxxx> wrote: > hello. > I am using ROK CEPH and have 20 MDSs in use. 10 are in rank 0-9 and 10 are > in standby. > I have one ceph filesystem, and 2 mds are trimming. > Under one FILESYSTEM, there are 6 MDSs in RESOLVE, 1 MDS in REPLAY, and 3 > in ACTIVE. > For some reason, since 36 hours ago, RESOLVE is stuck in TRIMMING, and so > are the MDSs in REPLAY. > I've also tried FAILing each MDS, but to no avail. > I think something should change when the MDS in REPLAY goes to RESOLVE, > but I don't know what. > Even looking at the logs of the REPLAY MDS, it's hard to see any messages > other than it is TERMINATED every 11 minutes. > I'm desperate for someone's help. > _______________________________________________ > ceph-users mailing list -- ceph-users@xxxxxxx > To unsubscribe send an email to ceph-users-leave@xxxxxxx > > -- Milind _______________________________________________ ceph-users mailing list -- ceph-users@xxxxxxx To unsubscribe send an email to ceph-users-leave@xxxxxxx