Re: About single monitor recovery

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



On Sat, 3 Aug 2013, Yu Changyuan wrote:
> I run a tiny ceph cluster with only one monitor. After a reboot the system,
> the monitor refuse to start.
> I try to start ceph-mon manually with command 'ceph -f -i a',  below is
> first few lines of the output:
> 
> starting mon.a rank 0 at 192.168.1.10:6789/0 mon_data
> /var/lib/ceph/mon/ceph-a fsid 554bee60-9602-4017-a6e1-ceb6907a218c
> mon/AuthMonitor.cc: In function 'virtual void
> AuthMonitor::update_from_paxos()' thread 7f9e3b0db780 time 2013-08-03
> 20:27:29.208156
> mon/AuthMonitor.cc: 147: FAILED assert(ret == 0)
> 
> The full log is at: https://gist.github.com/yuchangyuan/0a0a56a14fa4649ec2c8

This is 0.61.1.  Can you try again with 0.61.7 to rule out anything there?

> So, are there any way to make the monitor work again?
> 
> I have a backup of /var/lib/ceph/mon/ceph-a  in 2013-08-01, and success
> start the monitor with these files,
> but rados and other command not work because osd keep saying the monitor is
> the wrong node(that's right, it's actually the node 2 days ago).

In general that is not going to work well as the cluster does not like to 
warp back in time.  If it does not start with .7 (I suspect it won't), can 
you send us a tarball of the mon data directory so we can see what is 
awry?

sage
_______________________________________________
ceph-users mailing list
ceph-users@xxxxxxxxxxxxxx
http://lists.ceph.com/listinfo.cgi/ceph-users-ceph.com

[Index of Archives]     [Information on CEPH]     [Linux Filesystem Development]     [Ceph Development]     [Ceph Large]     [Ceph Dev]     [Linux USB Development]     [Video for Linux]     [Linux Audio Users]     [Yosemite News]     [Linux Kernel]     [Linux SCSI]     [xfs]


  Powered by Linux