[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
[Q] assume charset in raw 8bit headers
Hello,
I have a question on CharsetConverters. I am planning to
use UTF-8 filter like following.
<CharsetConverters override>
plain; mhonarc::htmlize;
us-ascii; mhonarc::htmlize;
default; MHonArc::UTF8::str2sgml; MHonArc/UTF8.pm
</CharsetConverters>
However, I sometimes receive illegal mails which have raw 8bit
characters in headers (like Subject: and From:). I heard that
Russian people sometimes use mail softwares which generate such
illegal headers.
This, I'd like to assume that raw 8bit characters are all KOI8-R
and convert these 8bit characters into either
- SGML entity expressions,
- &#xxx; expressions where xxx mean decimal Unicode codepoints, or
- UTF-8 characters.
How can I configure MHonArc to achieve this?
---
Tomohiro KUBOTA <kubota@debian.or.jp>
---------------------------------------------------------------------
To sign-off this list, send email to majordomo@mhonarc.org with the
message text UNSUBSCRIBE MHONARC-USERS
[Index of Archives]
[Bugtraq]
[Yosemite News]
[Mhonarc Home]