On 29.10.2012 18:59, Wido den Hollander wrote:
On 10/29/2012 03:48 PM, Roman Alekseev wrote:
Hello,
I have 3 monitors on different nodes and when 'mon.a' was stopped whole
cluster stopped work too.
My conf: http://pastebin.com/hT3qEhUF
Could someone explain how to fix such kind of failure?
Could you explain a bit more about the setup?
Which version are you running?
What do you mean with failure? Is the ceph -s command still working?
How sure are you that you didn't catch a bug that killed all three
monitors? Are those processes actually up and running?
Did you check the logs of the monitors?
Could you let us know?
Thanks!
Wido
Hi Wido,
I'm running ceph version 0.48.1argonaut.
The "ceph -s" command doesn't work until I start that monitor again.
Under failure I mean that ceph commands (such as ceph -s , -w, ceph mon
dump etc) don't respond. I've re-added all three mons and found the
following situations:
Situation A:
1) mon.a is disabled:
health HEALTH_WARN 1 mons down, quorum 1,2 b,c (cluster works)
2) mon.b is disabled:
health HEALTH_WARN 1 mons down, quorum 0,1 a,c (cluster works)
3) mon.c is disabled:
health HEALTH_WARN 1 mons down, quorum 0,2 a,b (cluster works)
Situation B:
If 2 mons are disabled all cluster stop working.
So cluster works only when 2 monitors are running.
Is it correct ?
--
Kind regards,
R. Alekseev
--
To unsubscribe from this list: send the line "unsubscribe ceph-devel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html