Emergency! Production cluster is down


 



Hi Guys,

I need help. I had 3 monitor nodes and 2 of them went down (their disks got corrupted). After some time the 3rd monitor also became unresponsive, so I rebooted the 3rd node. It came back up, but Ceph is not working.
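For context, here is a minimal sketch of the quorum arithmetic as I understand it. It assumes Ceph monitors need a strict majority of the monitors listed in the monmap to form a quorum. The function names are hypothetical and not part of any Ceph API.

```python
def quorum_size(total_monitors: int) -> int:
    """Smallest number of monitors that is a strict majority."""
    return total_monitors // 2 + 1


def has_quorum(total_monitors: int, alive_monitors: int) -> bool:
    """True if the surviving monitors can still form a majority."""
    return alive_monitors >= quorum_size(total_monitors)


if __name__ == "__main__":
    # My situation: 3 monitors in the monmap, only 1 still running.
    print("needed:", quorum_size(3))
    print("quorum with 1 of 3 alive:", has_quorum(3, 1))
```

If that assumption holds, one surviving monitor out of three cannot form a quorum by itself, which may be why the clients time out.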

So I removed the 2 failed monitors from the ceph.conf file and restarted the mon and OSD daemons, but Ceph is still not up.

Please find the log files and outputs at the links below.

1. Log file ceph-mon.openstack01-vm001.log (monitor node)

http://paste.openstack.org/show/530944/

2. ceph.conf

http://paste.openstack.org/show/530945/

3. ceph -w output

http://paste.openstack.org/show/530947/

4. ceph mon dump

http://paste.openstack.org/show/530950/

The errors I see are:

monclient(hunting): authenticate timed out after 300

librados: client.admin authentication error (110) Connection timed out

Any suggestions? Please help.

Thanks 
Chandra

_______________________________________________
ceph-users mailing list
ceph-users@xxxxxxxxxxxxxx
http://lists.ceph.com/listinfo.cgi/ceph-users-ceph.com
