[Last-Call] Re: Last Call: <draft-bray-unichars-10.txt> (Unicode Character Repertoire Subsets) to Proposed Standard

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 




--On Sunday, February 9, 2025 16:58 -0800 Rob Sayre
<sayrer@xxxxxxxxx> wrote:

> On Sun, Feb 9, 2025 at 3:50 PM John C Klensin <john-ietf@xxxxxxx>
> wrote:
> 
>> "use of private code points is
>> ok if you have some sort of reason".
>> 
> 
> It seems like that is the reason they exist. This is  (U+F8FF).*
> It might be a square on your computer, but I can still send it.
> 
> This one is a bit of an artifact, since people mostly just use
> stickers/images when they want to use non-standard emoji these days.
> 
> I'll send it to the IETF list here, but not to be a dork. It shows
> that these must be dealt with, if you want to interoperate with
> HTTP and SMTP etc in the real world.
> 
> The same cannot be said for control characters, surrogates, and
> other things eliminated here. One can imagine that a JSON protocol
> that needed to interoperate with XML might choose "XML Characters".

I see what you are getting at and think that we may have slightly
different definitions of "interoperate".  Mine includes at least a
high expectation that what a recipient or reader sees will be at
least a good approximation to what the sender intended to transmit or
write.   If you send/write U+F8FF anticipating that I will be using
that Harvard definition (but without asking first), I think my seeing
a little box or some sort of "?"-like rendering is a failure,
although not as serious a failure as if I saw a frowning face and
construed it as U+2639 or, for that matter, U+1F615.  In those first
two cases, I would at least have a good clue that my system didn't
understand that particular private use; in the second two, no clue.
In yours, as I understand it, it is sufficient that, if U+F8FF went
into the message/ onto the web page and was visible to the MUA or
browser at the other end, that is ok -- and, at least nothing in
transit messed it up and turned the code point into something else
entirely.

I'm not going to disagree with that... just different definitions/
perspectives.  However, I think it gets me back to the point I tried
to make in my earlier note: because the recipient/reader may not see
what was intended -- whether they see an indication of an
unrecognized code point and some symbol from an entirely separate
private agreement -- I think a recommendation that it is ok to use
private code points must be accompanied by text that explains the
possible problems and risks.  Or the document needs a good and very
clear explanation that these three subsets are safe to transmit (or
use on web pages) because it is unlikely that anything in transit
will mess them up in an attempt to fix Unicode errors but that
binding of code points to any sort of rendering or interpretation
considerations is Someone Else's Problem.

   john


> 
> thanks,
> Rob
> 
> * https://hea-www.harvard.edu/~fine/OSX/unicode_apple_logo.html


-- 
last-call mailing list -- last-call@xxxxxxxx
To unsubscribe send an email to last-call-leave@xxxxxxxx




[Index of Archives]     [IETF Annoucements]     [IETF]     [IP Storage]     [Yosemite News]     [Linux SCTP]     [Linux Newbies]     [Mhonarc]     [Fedora Users]

  Powered by Linux