On Sun, Oct 20, 2019 at 1:53 PM Stefan Kooman <stefan@xxxxxx> wrote: > > Dear list, > > Quoting Stefan Kooman (stefan@xxxxxx): > > > I wonder if this situation is more likely to be hit on Mimic 13.2.6 than > > on any other system. > > > > Any hints / help to prevent this from happening? > > We have had this happening another two times now. In both cases the MDS > recovers, becomes active (for a few seconds), and crashes again. It won't > come out of this loop by itself. When put in deug mode "debug_mds = > 10/10) we won't hit the bug and it stays active. After a few minutes we > disable debug (live, ceph tell mds.* config set debug_mds 0/0) and it > keeps running (Heisenbug)... until hours later when it crashes again and the story > repeats itself. > > So unfortunately no more debug information available, but at least a > workaround to get it active again. > delete 'mdsX_openfiles.0' object from cephfs metadata pool. (X is rank of the crashed mds) > Gr. Stefan > > -- > | BIT BV https://www.bit.nl/ Kamer van Koophandel 09090351 > | GPG: 0xD14839C6 +31 318 648 688 / info@xxxxxx > _______________________________________________ > ceph-users mailing list > ceph-users@xxxxxxxxxxxxxx > http://lists.ceph.com/listinfo.cgi/ceph-users-ceph.com _______________________________________________ ceph-users mailing list ceph-users@xxxxxxxxxxxxxx http://lists.ceph.com/listinfo.cgi/ceph-users-ceph.com