Mon hangs when started after Emperor upgrade

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



Hi guys,

I upgraded a working cluster from Dumpling to Emperor which went OK. All mons, osds and mds running 0.72.2 on Fedora 18 now.

I then installed the ceph-extras repo and let it update curl, libcurl and leveldb.

Next I tried restarting a mon - but it won't start up again. It just just hangs:


# service ceph start mon.e
=== mon.e ===
Starting Ceph mon.e on stor5...


The only thing logged is this:


2014-03-30 20:31:31.829959 7f3c989877c0 0 ceph version 0.72.2 (a913ded2ff138aefb8cb84d347d72164099cfd60), process ceph-mon, pid 21310


I have debugging cranked all the way up (debug ms=20, mon=20, paxos=20 and auth=20).

When looking at ps, I have a ceph-mon process and nothing else related to this (i.e. not create keys process or similar hanging).

An strace on the process shows:


 futex(0x7fffed5128d8, FUTEX_WAIT_PRIVATE, 2, NULL


"ceph mon stat" shows the following:


e3: 5 mons at {a=10.0.0.1:6789/0,b=10.0.0.2:6789/0,c=10.0.0.3:6789/0,d=10.0.0.7:6789/0,e=10.0.0.8:6789/0}, election epoch 26362, quorum 0,1,2,3 a,b,c,d


So the rest of the cluster is working (it is mon.e I tried to restart).

Not really much to go on. Anyone seen something similar, or have an idea?


Thanks in advance,
--
Jens Kristian Søgaard, Mermaid Consulting ApS,
jens@xxxxxxxxxxxxxxxxxxxx,
http://www.mermaidconsulting.com/
_______________________________________________
ceph-users mailing list
ceph-users@xxxxxxxxxxxxxx
http://lists.ceph.com/listinfo.cgi/ceph-users-ceph.com





[Index of Archives]     [Information on CEPH]     [Linux Filesystem Development]     [Ceph Development]     [Ceph Large]     [Linux USB Development]     [Video for Linux]     [Linux Audio Users]     [Yosemite News]     [Linux Kernel]     [Linux SCSI]     [xfs]


  Powered by Linux