John C Klensin wrote: > Sandeep's question raises another interesting issue. I just > went back and reread RFC 2640. It does not seem to address the > "TYPE A" issue at all. It does say (Section 2, paragraph 1) > "Clients and servers are, however, under no obligation to > perform any conversion on the contents of a file for operations > such as STOR or RETR", which I would take to imply that it > anticipates I18N FTP operations to be entirely binary ("TYPE I") > although that is not explicit. As for Japanese processing, UTF-8 is not visible by users and on the network, because UTF-8 is not only useless but also harmful. Instead, ISO-2022-JP, ShiftJIS and EUC are the major character sets. Some ftp implementations does assume (sometimes depending on environment variables) network character code ShiftJIS or EUC and perform appropriate conversions, which garbles UTF-8. On the other hand, if you use ISO-2022-JP, which is 7 bit pure and ASCII compatible (in a sense, it is pure ASCII), we can safely use ASCII mode of vanilla ftp and there is no confusion as long as we are in ASCII environment. Similar encoding can be profiled using ISO 2022 to obtain a fully internationalized, 7 bit pure, ASII compatible character encoding. The only problem for RFC2460 was that it does not need MIME for charset and 8bit extension that it makes it clear that MIME is useless. Note that long term state maintainance of full ISO 2022 is not more complex than that of UTF-8. Note also that, carefully profiled ISO 2022, such as ISO-2022-JP, requires state maintainance a lot simpler than that of UTF-8. > Whether the characters in use are UTF-8 or not, we've still got > that issue with line-endings. Line-ending issues of any ISO 2022 based encoding are just as simple as those of ASCII. Masataka Ohta _______________________________________________ Ietf@xxxxxxxx https://www1.ietf.org/mailman/listinfo/ietf