[Last-Call] Re: Last Call: <draft-bray-unichars-10.txt> (Unicode Character Repertoire Subsets): W3C I18N Review

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



Yeah, here's the link:

<https://docs.oracle.com/en/java/javase/21/docs/api/java.base/java/nio/charset/CharsetEncoder.html#:~:text=How%20an%20encoding%20error%20is%20handled%20depends>

That's from Java 1.4, and I doubt they can change it. I'm sure the newer APIs use �.

Whether that needs to be covered is not important to me.

thanks,
Rob



On Wed, Feb 12, 2025 at 9:13 AM Tim Bray <tbray@xxxxxxxxxxxxxx> wrote:
Eh, I guess that language should be changed to say Java in some cases uses “?”.  Or maybe just lose the whole “although some popular software platforms…” phrase. That � exists and is designed for this purpose is just a fact and worth stating. -T

On Feb 12, 2025 at 9:06:59 AM, Rob Sayre <sayrer@xxxxxxxxx> wrote:


On Mon, Feb 10, 2025 at 1:09 PM Addison Phillips <addisoni18n@xxxxxxxxx> wrote:

#1983: Replacement character examples

replacing problematic code points, ideally with "�" (U+FFFD,
REPLACEMENT CHARACTER), although some popular software platforms,
notably Java, use "?".

This is probably incorrect. Java replaces with U+FFFD in most Unicode processing (including decoding from legacy encodings). (Encoding to legacy encodings in Java use "?"). There are other places, such as certain browsers, where "?" is used in a Unicode context.


Apologies if I am misinterpreting, but I was surprised to learn that Java does use "?" sometimes. See this example:


thanks,
Rob

--
last-call mailing list -- last-call@xxxxxxxx
To unsubscribe send an email to last-call-leave@xxxxxxxx
-- 
last-call mailing list -- last-call@xxxxxxxx
To unsubscribe send an email to last-call-leave@xxxxxxxx

[Index of Archives]     [IETF Annoucements]     [IETF]     [IP Storage]     [Yosemite News]     [Linux SCTP]     [Linux Newbies]     [Mhonarc]     [Fedora Users]

  Powered by Linux