Hi Nat, Can you try the below patch? It looks like the journal replay isn't handling the 'session close' event in the journal properly because the client has already reconnected to the MDS. (If so, shutting down the client would also allow the MDS to restart by avoiding the bug.) Please let me know if this does the trick! Thanks- sage diff --git a/src/mds/journal.cc b/src/mds/journal.cc index ec2013d..64fc6a3 100644 --- a/src/mds/journal.cc +++ b/src/mds/journal.cc @@ -725,6 +725,8 @@ void ESession::replay(MDS *mds) Session *session = mds->sessionmap.get_session(client_inst.name); if (session->is_closed()) mds->sessionmap.remove_session(session); + else + session->clear(); // the client has reconnected; keep the Session, but reset } } -- To unsubscribe from this list: send the line "unsubscribe ceph-devel" in the body of a message to majordomo@xxxxxxxxxxxxxxx More majordomo info at http://vger.kernel.org/majordomo-info.html