1/3 mons down! mon do not rejoin

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



hi folks

I have a cluster running ceph 14.2.22 on ubuntu 18.04 and some hours
ago one of the mons stopped working and the on-call team rebooted the
node; not the mon is is not joining the ceph-cluster.

TCP ports of mons are open and reachable!

ceph health detail
HEALTH_WARN 1/3 mons down, quorum osd02,osd03
MON_DOWN 1/3 mons down, quorum osd02,osd03
    mon.osd01 (rank 0) addr
[v2:10.152.28.171:3300/0,v1:10.152.28.171:6789/0] is down (out of
quorum)

I like to add a new 3rd mon to the cluster on osd04 but I'm a bit
scared as it can result in 50% of the mons are not in reach!?

Question: should I remove the mon on osd01 first and recreate the
demon before starting a new mon on osd04?


Thanks for your input!
Ansgar
_______________________________________________
ceph-users mailing list -- ceph-users@xxxxxxx
To unsubscribe send an email to ceph-users-leave@xxxxxxx



[Index of Archives]     [Information on CEPH]     [Linux Filesystem Development]     [Ceph Development]     [Ceph Large]     [Ceph Dev]     [Linux USB Development]     [Video for Linux]     [Linux Audio Users]     [Yosemite News]     [Linux Kernel]     [Linux SCSI]     [xfs]


  Powered by Linux