I think I solved the problem. The clock skew only happens when restarting a node to simulate hardware failure. The virtual comes up with a skewed clock and ceph services start before ntp has time to adjust it, then there's a delay before ceph rechecks the clock skew.
Lee
On Thu, Mar 26, 2015 at 11:21 AM, Sage Weil <sage@xxxxxxxxxxxx> wrote:
IIRC the mons re-check sync every 5 minutes. Does the warning persist?On Thu, 26 Mar 2015, Gregory Farnum wrote:
> On Thu, Mar 26, 2015 at 7:44 AM, Lee Revell <rlrevell@xxxxxxxxx> wrote:
> > I have a virtual test environment of an admin node and 3 mon + osd nodes,
> > built by just following the quick start guide. It seems to work OK but ceph
> > is constantly complaining about clock skew much greater than reality.
> > Clocksource on the virtuals is kvm-clock and they also run ntpd.
> >
> > ceph-admin-node
> > 26 Mar 10:35:29 ntpdate[2647]: adjust time server 91.189.94.4 offset
> > 0.000802 sec
> >
> > ceph-node-1
> > 26 Mar 10:35:35 ntpdate[4250]: adjust time server 91.189.94.4 offset
> > 0.002537 sec
> >
> > ceph-node-2
> > 26 Mar 10:35:42 ntpdate[1708]: adjust time server 91.189.94.4 offset
> > -0.000214 sec
> >
> > ceph-node-3
> > 26 Mar 10:35:49 ntpdate[1964]: adjust time server 91.189.94.4 offset
> > 0.001490 sec
> >
> > ceph@ceph-admin-node:~/my-cluster$ ceph -w
> > cluster db460aa2-5129-4aaa-8b2e-43eac727124e
> > health HEALTH_WARN clock skew detected on mon.ceph-node-2
> > monmap e3: 3 mons at
> > {ceph-node-1=192.168.122.121:6789/0,ceph-node-2=192.168.122.131:6789/0,ceph-node-3=192.168.122.141:6789/0},
> > election epoch 140, quorum 0,1,2 ceph-node-1,ceph-node-2,ceph-node-3
> > mdsmap e54: 1/1/1 up {0=ceph-node-1=up:active}
> > osdmap e182: 3 osds: 3 up, 3 in
> > pgmap v3594: 840 pgs, 8 pools, 7163 MB data, 958 objects
> > 29850 MB used, 27118 MB / 60088 MB avail
> > 840 active+clean
>
> What clock skews is it reporting? I don't remember the defaults, but
> if ntp is consistently adjusting your clocks by a couple of
> milliseconds then I don't think Ceph is going to be very happy about
> it.
Does it go away if you restart the mons?
sage
_______________________________________________ ceph-users mailing list ceph-users@xxxxxxxxxxxxxx http://lists.ceph.com/listinfo.cgi/ceph-users-ceph.com