On Fri, Nov 6, 2015 at 10:06 AM, Nathan Cutler <ncutler@xxxxxxx> wrote: > Hi Ceph: > > Recently I encountered some a "clock skew" issue with 0.94.3. I have > some small demo clusters in AWS. When I boot them up, in most cases the > cluster will start in HEALTH_WARN due to clock skew on some of the MONs. > > I surmise that this is due to a race condition between the ceph-mon and > ntpd systemd services. Sometimes ntpd.service starts *after* ceph-mon - > in this case the MON sees a wrong/unsynchronized time value. > > Now, even though ntpd.service starts (and fixes the time value) very > soon afterwards, the cluster remains in clock skew for a long time - but > that is a separate issue. What I would like to ask is this: > > Is there any reasonable Ceph cluster node configuration that does not > include running the NTP daemon? Only if there is some other time service replacing it. I don't really know of anyone using alternative ntp daemons, but it's a possibility to consider before introducing a hard dependency on ntpd. > If the answer is "no", would it make sense to make NTP a runtime > dependency and tell the ceph-mon systemd service to wait for > ntpd.service before it starts? Just waiting for the service is quick, but it doesn't achieve any effect on the clock other than promising that it will be synced at some point in the future. Wouldn't we have to wait for time sync rather than just waiting for the service? That could take a while. My hunch is that users wouldn't appreciate the mon blocking until times were in sync, they'd probably prefer to go ahead and start up, but raise a warning (like we currently do). Given all that, maybe the question is actually: why do the mons stay in the skew state for so long after the clocks are corrected? John -- To unsubscribe from this list: send the line "unsubscribe ceph-devel" in the body of a message to majordomo@xxxxxxxxxxxxxxx More majordomo info at http://vger.kernel.org/majordomo-info.html