2 of 3 monitors down and to recover

Hey,

I tried to simulate the failure of 2 monitors, including the loss of their monmap, and then to bring them back up in my test cluster.

The Ceph version is 10.2.5, the OS is RHEL 7.2, and the test cluster has 3 nodes with 3 monitors and 24 OSDs; each node runs 1 monitor and 8 OSDs.

So I stopped 2 of the 3 ceph-mon daemons (mon.a and mon.c) and deleted all files in the directory /var/lib/ceph/mon/ceph-a/store.db. Then I tried to recover the ceph-mon daemon by following the Ceph documentation, but something unexpected happened. The recovery procedure was as follows:

To grab the monmap from another monitor (stop the monitor daemon before extracting):
# ceph-mon -i b --extract-monmap /tmp/monmap

Two of the ceph-mon daemons are down, so I do need to stop the monitor.

To inject the monmap:

# ceph-mon -i a --inject-monmap /tmp/monmap
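
For reference, the sequence I followed can be written down as a dry-run sketch that only prints the commands and does not run them. The monitor IDs and the monmap path are the ones from my cluster; the function name is made up for illustration, and the `ceph-mon@<id>` systemd unit name is an assumption about how the daemons are managed on this install.

```shell
#!/bin/sh
# Dry-run sketch: prints the intended commands, does not execute them.
# plan_monmap_transfer is a hypothetical helper name.
# Assumption: monitors are managed by systemd units named ceph-mon@<id>.
plan_monmap_transfer() {
    src="$1"   # monitor that still has a good store, e.g. b
    dst="$2"   # monitor to receive the monmap, e.g. a
    map="$3"   # path of the temporary monmap file
    echo "systemctl stop ceph-mon@${src}"
    echo "ceph-mon -i ${src} --extract-monmap ${map}"
    echo "systemctl stop ceph-mon@${dst}"
    echo "ceph-mon -i ${dst} --inject-monmap ${map}"
}

plan_monmap_transfer b a /tmp/monmap
```

Printing the plan first makes it easy to check which daemon is the source and which is the target before touching anything.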

There is an error after that:
Invalid argument: /var/lib/ceph/mon/ceph-a/store.db: does not exist (create_if_missing is false)……

Error opening mon data directory at ‘/var/lib/ceph/mon/ceph-a’: (22) Invalid argument.
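
The message points at store.db rather than at the parent directory, so a quick check of what is left inside the monitor data directory may help. The following is only a sketch: the helper name is hypothetical, and it assumes the store is a LevelDB-style directory that contains a CURRENT file once it has been created, which I have not verified for every backend.

```shell
#!/bin/sh
# Hypothetical helper: succeeds only if the monitor data directory
# still contains a store that looks initialised.
# Assumption: a LevelDB-style store keeps a CURRENT file in store.db.
mon_store_present() {
    mon_data="$1"
    [ -d "${mon_data}/store.db" ] && [ -f "${mon_data}/store.db/CURRENT" ]
}

if mon_store_present /var/lib/ceph/mon/ceph-a; then
    echo "store.db looks present"
else
    echo "store.db is missing or empty"
fi
```

In my case the parent directory exists, but everything under store.db was deleted, so a check like this would report the store as missing or empty.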

 

But the directory /var/lib/ceph/mon/ceph-a does exist and its owner is ceph:ceph, so why does this happen?

Also, are my simulation and recovery procedure correct?

 

Best regards,

He taotao

EMAILHETAOTAO818@xxxxxxxxxxxxx

 



_______________________________________________
ceph-users mailing list
ceph-users@xxxxxxxxxxxxxx
http://lists.ceph.com/listinfo.cgi/ceph-users-ceph.com
