> What byte string are you really entering here? What's coming through in
> your email is \344 ... which is not valid UTF8. But I suspect something
> may have translated it before it got to my inbox.

Damn charsets :-)

The character was indeed \344, aka "ä", but my mailer sends a Latin
charset, not Unicode. To rule out interference from gcc, cat, and other
tools, I've written a new program that reads its input from a file.

    gcc -o unicode unicode.c
    LC_CTYPE=de_DE.utf8 ./unicode uni.data

should yield uni.out (tested in xterm -u8; LC_CTYPE=en_US.utf8 works as
well).

Mit freundlichem Gruß / With kind regards
Holger Klawitter
-- 
lists <at> klawitter <dot> de
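For readers following the thread, here is a minimal sketch of why a lone \344 (0xE4) byte is rejected as UTF-8, and what the same character looks like once encoded. It is not the unicode.c from the attachment; the function names is_valid_utf8 and latin1_to_utf8 are hypothetical and chosen only for illustration. It assumes the input byte is meant as ISO 8859-1, where 0xE4 is "ä".

```c
#include <stddef.h>

/* Hypothetical helper: returns 1 if buf[0..len) is well-formed UTF-8,
 * 0 otherwise. */
static int is_valid_utf8(const unsigned char *buf, size_t len)
{
    size_t i = 0;
    while (i < len) {
        unsigned char c = buf[i];
        size_t need;       /* continuation bytes expected after the lead byte */
        unsigned long cp;  /* code point being decoded */
        unsigned long min; /* smallest code point legal for this length */

        if (c < 0x80)                { need = 0; cp = c;        min = 0; }
        else if ((c & 0xE0) == 0xC0) { need = 1; cp = c & 0x1F; min = 0x80; }
        else if ((c & 0xF0) == 0xE0) { need = 2; cp = c & 0x0F; min = 0x800; }
        else if ((c & 0xF8) == 0xF0) { need = 3; cp = c & 0x07; min = 0x10000; }
        else return 0;     /* stray continuation byte or invalid lead byte */

        if (len - i - 1 < need)
            return 0;      /* sequence is cut off; this is the lone 0xE4 case */

        for (size_t k = 1; k <= need; k++) {
            unsigned char cc = buf[i + k];
            if ((cc & 0xC0) != 0x80)
                return 0;  /* not a 10xxxxxx continuation byte */
            cp = (cp << 6) | (cc & 0x3F);
        }

        if (cp < min) return 0;                      /* overlong encoding */
        if (cp > 0x10FFFF) return 0;                 /* beyond Unicode range */
        if (cp >= 0xD800 && cp <= 0xDFFF) return 0;  /* UTF-16 surrogate */

        i += need + 1;
    }
    return 1;
}

/* Hypothetical helper: encodes one ISO 8859-1 byte as UTF-8.
 * Writes 1 or 2 bytes to out and returns how many were written. */
static size_t latin1_to_utf8(unsigned char c, unsigned char out[2])
{
    if (c < 0x80) {
        out[0] = c;
        return 1;
    }
    out[0] = (unsigned char)(0xC0 | (c >> 6));
    out[1] = (unsigned char)(0x80 | (c & 0x3F));
    return 2;
}
```

In this sketch, 0xE4 has the bit pattern of a three-byte lead byte, so on its own it fails the check, while latin1_to_utf8 turns it into the two bytes 0xC3 0xA4 (\303\244), which pass.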
Attachment:
unicode-testcase.tar.gz
Description: application/tgz