
[Q] assume charset in raw 8bit headers



Hello,

I have a question about CharsetConverters.  I am planning to
use a UTF-8 filter like the following.

<CharsetConverters override>
plain;          mhonarc::htmlize;
us-ascii;       mhonarc::htmlize;
default;        MHonArc::UTF8::str2sgml;     MHonArc/UTF8.pm
</CharsetConverters>

However, I sometimes receive malformed mail with raw 8-bit
characters in headers (such as Subject: and From:).  I have heard
that some mail software used in Russia generates such invalid
headers.

Thus, I'd like to assume that all raw 8-bit characters are KOI8-R
and convert them into one of the following:
 - SGML entity expressions,
 - &#xxx; expressions, where xxx is the decimal Unicode code point, or
 - UTF-8 characters.
How can I configure MHonArc to achieve this?
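
To make the desired result concrete, here is a minimal Python sketch
of the conversion itself.  It is not MHonArc code and says nothing
about how MHonArc should be configured; the function names are
hypothetical.  It covers only the &#xxx; and UTF-8 forms, since named
SGML entities would need a mapping table that is not shown here.

```python
def koi8r_to_text(raw: bytes) -> str:
    """Decode raw header bytes, assuming every 8-bit byte is KOI8-R."""
    # ASCII bytes (0x00-0x7F) decode to themselves in KOI8-R.
    return raw.decode("koi8_r")


def koi8r_to_numeric_refs(raw: bytes) -> str:
    """Return ASCII text with non-ASCII characters as &#xxx; references,
    where xxx is the decimal Unicode code point."""
    text = koi8r_to_text(raw)
    return "".join(ch if ord(ch) < 128 else f"&#{ord(ch)};" for ch in text)


def koi8r_to_utf8(raw: bytes) -> bytes:
    """Re-encode the header bytes as UTF-8."""
    return koi8r_to_text(raw).encode("utf-8")


if __name__ == "__main__":
    # A made-up Subject: value containing raw KOI8-R bytes.
    subject = b"Re: \xf0\xd2\xc9\xd7\xc5\xd4"
    print(koi8r_to_numeric_refs(subject))
    print(koi8r_to_utf8(subject).decode("utf-8"))
```

Either output form would be fine for my archive; I mainly need the
assumption "raw 8-bit means KOI8-R" to be applied to the headers.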

---
Tomohiro KUBOTA <kubota@debian.or.jp>



