Forgot the attachments. Besides, is there any way to get the cluster running again without restarting all client nodes? On Tue, May 19, 2015 at 10:45 AM, Yan, Zheng <ukernel@xxxxxxxxx> wrote: > On Tue, May 19, 2015 at 4:31 PM, Markus Blank-Burian <burian@xxxxxxxxxxx> wrote: >> I am afraid, I hit the same bug. Giant worked fine, but after upgrading to >> hammer (0.94.1) and putting some load on it, the MDSs eventually crashed and >> now I am stuck in clientreplay most of the time. I am also using the cephfs >> kernel client (3.18.y). As I didn't find a corresponding tracker entry .. is >> there already a patch available? >> > > Please send mds log and /sys/kernel/debug/ceph/*/mdsc on client > machine to us. Besides, Is there warnings like "cluster [WRN] slow > request [several thousands or more ] seconds old, received at ...: > client_request(client.734537:23 ...) " in your ceph cluster log. > > Regards > Yan, Zheng
Attachment:
ceph-mds-bagheera.log.gz
Description: GNU Zip compressed data
Attachment:
ceph-mds-bagheera2.log.gz
Description: GNU Zip compressed data
Attachment:
ceph-mon-bagheera.log.gz
Description: GNU Zip compressed data
Attachment:
ceph-mon-bagheera2.log.gz
Description: GNU Zip compressed data
Attachment:
mdsc.gz
Description: GNU Zip compressed data
_______________________________________________ ceph-users mailing list ceph-users@xxxxxxxxxxxxxx http://lists.ceph.com/listinfo.cgi/ceph-users-ceph.com