On Wed, 08 Nov 2006 00:48:39 -0500 mickg <mickg@xxxxxxxxx> wrote: > Just to put my money where my mouth is, I have implemented a (stupid) > prototype that does: If no known charset is native to libxml2 > detected , a recompiled version of mod_proxy_html now uses iconv > (eventually via the xmlFindCharEncodingHandler function) to convert > from the source encoding to UTF-8. Interesting. You've gone one up on my aliasing proposal, for what looks like rather less work than I thought that would take. I might snarf the basic idea for Version 3. > If no encoding info is specified, it assumes windows-1251 (yes, > stupid, but still). But not stupid if we make it a configurable default! > It does work on my _own_ website, where it quite happily converts > win-1251 to utf-8. Once I fix the memory leak (any help appreciated), > I'll be happy. See http://www.apachetutor.org/dev/pools for an easy way to deal with the memory. > And a great many thanks to Nick Kew for getting me off my lazy ... to > start coding (which, honestly, I am better at than administering > systems). :-) > BTW, I still have no clue why I cannot do this with mod_charset_lite. Neither am I. But a closer look at mod_charset_lite has been on my TODO list for so long it's probably on a permanent back-burner. Did you also look at the full mod_charset? AIUI it was written by Russian developers, so cyrillic was presumably important to them. -- Nick Kew Application Development with Apache - the Apache Modules Book http://www.apachetutor.org/ --------------------------------------------------------------------- The official User-To-User support forum of the Apache HTTP Server Project. See <URL:http://httpd.apache.org/userslist.html> for more info. To unsubscribe, e-mail: users-unsubscribe@xxxxxxxxxxxxxxxx " from the digest: users-digest-unsubscribe@xxxxxxxxxxxxxxxx For additional commands, e-mail: users-help@xxxxxxxxxxxxxxxx