Re: [Last-Call] [Ntp] Secdir last call review of draft-ietf-ntp-chronos-16

Michael Schapira <schapiram@xxxxxxxxxxxxx> · Wed, 12 Jul 2023 09:35:05 +0300

Hi Miroslav,
You are referring to a part of the paper that motivates the Khronos approach by examining the drawbacks of a strawman solution in which Khronos watchdog queries are as frequent as NTPv4's (under the title "A Failed Simple Server-Assignment Scheme"). The relevant discussion is from Section C onwards.

Khronos can certainly be used with normal pool zones without introducing meaningful overhead. No fast servers are required. The reason is that (1) Khronos queries are much rarer than NTPv4's (see analysis), and (2) even in the highly unlikely event that panic mode is triggered multiple times, the overhead is not significant because every two triggers must be separated by multiple "normal" (and infrequent) querying rounds (see my previous email).

(The paper I referred to does indeed propose using a *subet of the current* NTP pool for *further enhancing security*, but it does not impose any lower bounds on their speed, since the induced overhead is low, and, while this could make security even better, I believe should be the topic of a future RFC. :)

Best,
Michael

On Wed, Jul 12, 2023 at 9:13 AM Miroslav Lichvar <mlichvar@xxxxxxxxxx> wrote:
On Tue, Jul 11, 2023 at 10:03:19PM +0300, Michael Schapira wrote:

> Following up on the above discussion, I wanted to point out that the load

> on NTP servers induced by Khronos is something that we have extensively

> analyzed, both theoretically and empirically. Please see

> https://www.michaelschapira.com/_files/ugd/3b1e1e_278d283d14bf4dc994128eb4be88484a.pdf

It seems your own analysis in the paper agrees that this indeed is

a serious problem. Quoting from page 11:

  Overloading timeservers. We show that, if deployed at scale,

  the above discussed simple heuristic for assigning servers to

  clients, runs into the risk of inducing signiﬁcant deviations

  from today’s load distribution, in which a large fraction of the

  timeservers experiences over 200x increase in load. Intuitively,

  this is because today’s practice of assigning timeservers to NTP

  clients with probability that is proportional to their netspeed

  values (see Section III-B) is at odds with Chronos’ uniform

  distribution of load across large sets of servers, irrespective

  of their netspeed values (which is needed for establishing its

  security guarantees [6]). Consequently, if the sets of servers

  with which Chronos clients synchronize, and the frequency

  of synchronization, are not chosen with care, low-netspeed

  servers will experience a huge surge in load.

This definitely needs to be explained in the Khronos draft and maybe

spell it out that Khronos must not be used with the normal zones in

pool.ntp.org. The paper proposes a solution using a special zone where

only selected fast servers are included, but that doesn't seem to

exist yet. I don't remember seeing it proposed on the project's

discourse (https://community.ntppool.org).

-- 

Miroslav Lichvar

-- 
last-call mailing list
last-call@xxxxxxxx
https://www.ietf.org/mailman/listinfo/last-call