On Tue, Jul 11, 2017 at 3:23 PM, Webert de Souza Lima <webert.boss@xxxxxxxxx> wrote: > Hello, > > today I got a MDS respawn with the following message: > > 2017-07-11 07:07:55.397645 7ffb7a1d7700 1 mds.b handle_mds_map i > (10.0.1.2:6822/28190) dne in the mdsmap, respawning myself "dne in the mdsmap" is what an MDS says when the monitors have concluded that the MDS is dead, but the MDS is really alive. "dne" stands for "does not exist", so the MDS is complaining that it has been removed from the mdsmap. The message could definitely be better worded! You can see this happen in certain buggy cases where the MDS is failing to send beacon messages to the mons, even though it is really alive -- if you're stuck in rejoin, then that is probably related: try increasing the log verbosity to work out where the MDS is stuck while it's sitting in the rejoin state. John > > it happened 3 times within 5 minutes. After so, the MDS took 50 minutes to > recover. > I can't find what exactly that message means and how to avoid it. > > I'll be glad to provide any further information. Thanks! > > > Regards, > > Webert Lima > DevOps Engineer at MAV Tecnologia > Belo Horizonte - Brasil > > _______________________________________________ > ceph-users mailing list > ceph-users@xxxxxxxxxxxxxx > http://lists.ceph.com/listinfo.cgi/ceph-users-ceph.com > _______________________________________________ ceph-users mailing list ceph-users@xxxxxxxxxxxxxx http://lists.ceph.com/listinfo.cgi/ceph-users-ceph.com