Re: ceph falsely reports clock skew?

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



I think I solved the problem. The clock skew only happens when restarting a node to simulate hardware failure. The virtual comes up with a skewed clock and ceph services start before ntp has time to adjust it, then there's a delay before ceph rechecks the clock skew.

Lee

On Thu, Mar 26, 2015 at 11:21 AM, Sage Weil <sage@xxxxxxxxxxxx> wrote:
On Thu, 26 Mar 2015, Gregory Farnum wrote:
> On Thu, Mar 26, 2015 at 7:44 AM, Lee Revell <rlrevell@xxxxxxxxx> wrote:
> > I have a virtual test environment of an admin node and 3 mon + osd nodes,
> > built by just following the quick start guide.  It seems to work OK but ceph
> > is constantly complaining about clock skew much greater than reality.
> > Clocksource on the virtuals is kvm-clock and they also run ntpd.
> >
> > ceph-admin-node
> > 26 Mar 10:35:29 ntpdate[2647]: adjust time server 91.189.94.4 offset
> > 0.000802 sec
> >
> > ceph-node-1
> > 26 Mar 10:35:35 ntpdate[4250]: adjust time server 91.189.94.4 offset
> > 0.002537 sec
> >
> > ceph-node-2
> > 26 Mar 10:35:42 ntpdate[1708]: adjust time server 91.189.94.4 offset
> > -0.000214 sec
> >
> > ceph-node-3
> > 26 Mar 10:35:49 ntpdate[1964]: adjust time server 91.189.94.4 offset
> > 0.001490 sec
> >
> > ceph@ceph-admin-node:~/my-cluster$ ceph -w
> >     cluster db460aa2-5129-4aaa-8b2e-43eac727124e
> >      health HEALTH_WARN clock skew detected on mon.ceph-node-2
> >      monmap e3: 3 mons at
> > {ceph-node-1=192.168.122.121:6789/0,ceph-node-2=192.168.122.131:6789/0,ceph-node-3=192.168.122.141:6789/0},
> > election epoch 140, quorum 0,1,2 ceph-node-1,ceph-node-2,ceph-node-3
> >      mdsmap e54: 1/1/1 up {0=ceph-node-1=up:active}
> >      osdmap e182: 3 osds: 3 up, 3 in
> >       pgmap v3594: 840 pgs, 8 pools, 7163 MB data, 958 objects
> >             29850 MB used, 27118 MB / 60088 MB avail
> >                  840 active+clean
>
> What clock skews is it reporting? I don't remember the defaults, but
> if ntp is consistently adjusting your clocks by a couple of
> milliseconds then I don't think Ceph is going to be very happy about
> it.

IIRC the mons re-check sync every 5 minutes.  Does the warning persist?
Does it go away if you restart the mons?

sage

_______________________________________________
ceph-users mailing list
ceph-users@xxxxxxxxxxxxxx
http://lists.ceph.com/listinfo.cgi/ceph-users-ceph.com

[Index of Archives]     [Information on CEPH]     [Linux Filesystem Development]     [Ceph Development]     [Ceph Large]     [Ceph Dev]     [Linux USB Development]     [Video for Linux]     [Linux Audio Users]     [Yosemite News]     [Linux Kernel]     [Linux SCSI]     [xfs]


  Powered by Linux