Re: Last Call: <draft-ietf-dane-openpgpkey-05.txt> (Using DANE to Associate OpenPGP public keys with email addresses) to Proposed Standard

John C Klensin <john-ietf@xxxxxxx> · Fri, 11 Sep 2015 14:45:22 -0400

Paul,

I am trying very hard, as I believe John Levine is trying, to
identify some substantive and significant issues that we see
with this draft.  I don't think the discussion is moved forward
by the your apparently taking sufficient offense at those
comments and explanations to respond in the tone that you have
adopted.  In the hope of keeping temperatures down, I won't
quote or respond further on those points.

However, because I'm fairly well acquainted with the authors of
RFC 5890/5891 and 6530, I do feel a need to try to correct
misunderstandings that your reading of those documents and their
predecessors seem to have produced.

--On Friday, September 11, 2015 13:38 -0400 Paul Wouters
<paul@xxxxxxxxx> wrote:

>> I note in that regard that the first sentence of Section 3 of
>> the I-D is simply false.
> 
> The first sentence reads:
> 
>     The DNS does not allow the use of all characters that are
> supported
>     in the "local-part" of email addresses as defined in
> [RFC5322] and
>     [RFC6530].
> 
> RFC 6530 Section 7.1:
> 
>  	Permits the use of UTF-8 strings in email addresses, both
> local
>  	parts and domain names.

SMTP has a syntax rule -- going back at least to RFC 821 and
transposed into RFC 822 or vice versa (I don't remember and it
makes no difference at this point) -- that prohibits non-ASCII
characters in local parts and imposes that and additional
restrictions on the domain part of mailboxes.  Those
restrictions are specific to email although the SMTP domain
rules became the basis for the "preferred syntax" in RFC
1034/1035.  Those mailbox syntax rules are fairly widely
enforced in email applications.  What RFC 6530 "permits" is the
use of non-ASCII strings in mailbox names as an extension to
SMTP that eliminates that strict ASCII-only syntax rule.  It has
almost nothing to do with the DNS.

RFC 6055 essentially argues that applications should do the
encodings required by IDNA as late (and close to actual DNS
lookups) as possible.  RFC 6530ff are consistent with that
recommendation.

> So you are claiming that DNS can take any UTF-8 character? I
> guess you  should file an errata for IDNA RFC-3492.

RFC 3492 ("Punycode") simply specifies an ASCII-compatible
encoding ("ACE") that can be used when non-ASCII characters are
problematic.  RFC 3490 and its successors 5890, 5801, and the
explanations in 5894 provide, I thought very clearly,
explanations of why such an encoding was (and is) desirable for
IDN use in applications.  Key issues included maintaining a
reasonable level of compatibility with already-deployed
applications that had ASCII-only restrictions and working around
some canonicalization and encoding issues that the installed
base of DNS servers did not handle well.  But there was never a
claim that the DNS couldn't store characters coded in UTF-8 or,
for that matter, any bit sequence in octets.   The latter
capability is made quite clear in the DNS base documents
(1034/1035) and RFC 2181.

Now, for your application, you are defining a new RR Type.
There are no deployed applications that are likely to look that
RR Type up that have ASCII (or "preferred syntax") restrictions
so the IDNA arguments for sticking with an ACE form (whether
using the Punycode algorithm or something you invent) do not
apply.  It may well be that simply using UTF-8 strings is
inappropriate.  However, especially if you are trying to
establish linkages to email mailbox addresses that _must_ be in
UTF-8 or its ASCII subset (not coincidentally, largely because
of the same "no one but the delivery server interprets" rules
that John and I have cited several times), using it directly is
the most obvious solution, obvious to the point that, IMO, you
need to justify anything else.  Whatever that justification
might be, it cannot be that the DNS cannot represent non-ASCII
characters (UTF-8 or otherwise), or even arbitrary octets,
because it definitely can handle such strings in labels and has
been able do so since its inception.

best regards,
    john