UTF-8, UTF-16 and UTF-32
Hello GCC,
Now that I have had time to pull myself off the ceiling, I realise the
problem is that Unix/GCC supports UTF-8 and UTF-32, while Windows
supports UTF-8 and UTF-16. The solution is for both Unix and Windows
to support all three Unicode encoding forms.
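The platform difference comes down to the width of wchar_t. As a rough
sketch (the helper names wchar_unit_bits and wchar_encoding_guess are
made up for illustration, and the mapping from width to encoding is an
assumption, since the C standard leaves the wchar_t encoding to the
implementation), the width can be checked like this:

```c
#include <limits.h>
#include <stddef.h>
#include <wchar.h>

/* Width of one wchar_t code unit, in bits, on the current target. */
static unsigned wchar_unit_bits(void)
{
    return (unsigned)(sizeof(wchar_t) * CHAR_BIT);
}

/* ASSUMPTION: a 16-bit wchar_t holds UTF-16 and a 32-bit wchar_t holds
 * UTF-32. This matches common Windows and GNU/Linux practice, but the
 * C standard does not guarantee it. */
static const char *wchar_encoding_guess(void)
{
    switch (wchar_unit_bits()) {
    case 16: return "UTF-16";
    case 32: return "UTF-32";
    default: return "unknown";
    }
}
```

On a typical GNU/Linux target this should report a 32-bit unit. GCC also
has a -fshort-wchar option that makes wchar_t a 16-bit type, although it
changes the ABI, so code built with it may not mix safely with libraries
built without it.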
I have had to spend the last several days writing the UTF-16 string
functions entirely from scratch, and I realise that with a bit of common
sense everything can work out okay. With quick action to move wchar_t to
2 bytes and create another type for 4-byte strings, we could see a lot of
problems solved. Perhaps UTF-16 strings could be written as L"My String"
and UTF-32 strings as LL"My String".
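To give an idea of what a hand-written UTF-16 string function involves,
here is a minimal sketch of encoding one code point into UTF-16 code
units. It is not the code mentioned above; the function name utf16_encode
and the use of uint16_t in place of a 2-byte wchar_t are assumptions made
so that the example compiles the same way on any platform.

```c
#include <stddef.h>
#include <stdint.h>

/* Encode one Unicode code point as UTF-16.
 * Writes 1 or 2 code units to out and returns how many were written.
 * Returns 0 if cp is not a valid scalar value (a surrogate, or a value
 * above U+10FFFF). utf16_encode is a hypothetical name for this sketch. */
static size_t utf16_encode(uint32_t cp, uint16_t out[2])
{
    if (cp > 0x10FFFF || (cp >= 0xD800 && cp <= 0xDFFF))
        return 0;                       /* not encodable */

    if (cp < 0x10000) {                 /* Basic Multilingual Plane */
        out[0] = (uint16_t)cp;
        return 1;
    }

    cp -= 0x10000;                      /* 20 bits remain */
    out[0] = (uint16_t)(0xD800 + (cp >> 10));    /* high surrogate */
    out[1] = (uint16_t)(0xDC00 + (cp & 0x3FF));  /* low surrogate */
    return 2;
}
```

For example, U+0041 comes out as the single unit 0x0041, while U+1F600
needs the surrogate pair 0xD83D 0xDE00. That second case is why UTF-16
string functions cannot simply treat every 2-byte unit as one character.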
I hope your steering committee can see that there will be lots of UTF-16
text files out there, and that a lot of code will have to be written to
process them. While UTF-8 will not support many non-Latin-based
languages, UTF-32 will not support many non-human-based languages, i.e.
no signal system is fault free.
Thanks,
Dallas
http://www.ekkySoftware.com/