Bad charset conversion with mod_proxy

Hello everybody,

While reverse proxying a page with mod_proxy_html (Apache 2.4.4), I get the
wrong output charset in some parts of the page. When I comment out
Include conf/extra/proxy-html.conf, the page is fine, but of course the
internal links are no longer rewritten. When I uncomment it, link rewriting
works, but the charset is wrong in one button and in the posted form data.

Config:
Apache 2.4.4, self-compiled
Solaris 10

LoadFile   /opt/csw/lib/libxml2.so
LoadModule proxy_html_module modules/mod_proxy_html.so
LoadModule xml2enc_module modules/mod_xml2enc.so

...

  ProxyHTMLDoctype XHTML
  ProxyRequests off
  # tried both with the next line active and with it commented out
  xml2EncDefault UTF-8

  ProxyPass /bohdalec/ http://10.16.133.38:20001/obj/ max=320

  <Location /bohdalec>
    ProxyPassReverse /
    ProxyHTMLEnable On
    ProxyHTMLURLMap /  /bohdalec/
    ProxyPassReverseCookiePath /  /bohdalec/
  </Location>

I have found some clues in the error log that I suspect point to the cause.

In the default setup:

[Thu Jul 18 14:14:39.753939 2013] [xml2enc:debug] [pid 13753:tid 26] 
mod_xml2enc.c(178): [client 10.16.8.40:62379] AH01430: Content-Type is 
text/html, referer: http://registrace.praha.eu/bohdalec/wc-objini.php
[Thu Jul 18 14:14:39.753979 2013] [xml2enc:debug] [pid 13753:tid 26] 
mod_xml2enc.c(254): [client 10.16.8.40:62379] AH01434: Charset 
ISO-8859-1 not supported by libxml2; trying apr_xlate, referer: 
http://registrace.praha.eu/bohdalec/wc-objini.php
[Thu Jul 18 14:14:39.754001 2013] [xml2enc:error] [pid 13753:tid 26] 
[client 10.16.8.40:62379] AH01435: Charset ISO-8859-1 not supported.  
Consider aliasing it?, referer: 
http://registrace.praha.eu/bohdalec/wc-objini.php
[Thu Jul 18 14:14:39.754021 2013] [xml2enc:warn] [pid 13753:tid 26] 
[client 10.16.8.40:62379] AH01436: No usable charset information; using 
configuration default, referer: 
http://registrace.praha.eu/bohdalec/wc-objini.php

With the xml2EncDefault UTF-8 option:

[Thu Jul 18 14:12:17.563736 2013] [xml2enc:debug] [pid 13739:tid 25] 
mod_xml2enc.c(178): [client 10.16.8.40:62324] AH01430: Content-Type is 
text/html, referer: http://registrace.praha.eu/bohdalec/wc-objini.php
[Thu Jul 18 14:12:17.563774 2013] [xml2enc:debug] [pid 13739:tid 25] 
mod_xml2enc.c(254): [client 10.16.8.40:62324] AH01434: Charset UTF-8 not 
supported by libxml2; trying apr_xlate, referer: 
http://registrace.praha.eu/bohdalec/wc-objini.php
[Thu Jul 18 14:12:17.563796 2013] [xml2enc:error] [pid 13739:tid 25] 
[client 10.16.8.40:62324] AH01435: Charset UTF-8 not supported. Consider 
aliasing it?, referer: http://registrace.praha.eu/bohdalec/wc-objini.php
[Thu Jul 18 14:12:17.563815 2013] [xml2enc:warn] [pid 13739:tid 25] 
[client 10.16.8.40:62324] AH01436: No usable charset information; using 
configuration default, referer: 
http://registrace.praha.eu/bohdalec/wc-objini.php
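
To compare the two runs side by side, the xml2enc entries can be reduced to
their level, AH code and message. The following is only a minimal sketch,
assuming the log is pasted as wrapped by the mail client (as above); the
function name summarize_xml2enc is made up for illustration.

```python
import re

# A new log entry starts with a timestamp such as "[Thu Jul 18 ...".
ENTRY_START = re.compile(r"^\[[A-Z][a-z]{2} [A-Z][a-z]{2} +\d{1,2} ")
# Capture the xml2enc log level, the AH code and the message text,
# dropping the trailing ", referer: ..." part when it is present.
MESSAGE = re.compile(
    r"\[xml2enc:(?P<level>\w+)\].*?(?P<code>AH\d{5}): "
    r"(?P<text>.*?)(?:, referer: \S+)?$"
)


def summarize_xml2enc(log_text):
    """Rejoin mail-wrapped log lines and return (level, code, text) tuples."""
    entries, current = [], []
    for line in log_text.splitlines():
        line = line.strip()
        if not line:
            continue
        if ENTRY_START.match(line) and current:
            entries.append(" ".join(current))
            current = []
        current.append(line)
    if current:
        entries.append(" ".join(current))

    summary = []
    for entry in entries:
        match = MESSAGE.search(entry)
        if match:
            summary.append((match["level"], match["code"], match["text"]))
    return summary
```

Run over both excerpts, this makes it easy to see that the same sequence
AH01430, AH01434, AH01435, AH01436 appears in each run and only the charset
name differs.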


If I read this correctly, the problem is that libxml2 supports neither
ISO-8859-1 nor UTF-8. But according to the documentation, both are always
supported, and UTF-8 is the default encoding. I have tried three versions of
libxml2: the default Solaris version, the CSW version, and a self-compiled
one. I doubt that all three could have such a basic problem as not
supporting the default charset. Am I missing something important?
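
One way to test a libxml2 build outside Apache would be to ask the library
directly whether it recognises the two charset names. The sketch below is
only an illustration: it assumes Python with ctypes is available on the
machine, the helper name probe_libxml2 is made up, and it calls libxml2's
xmlParseCharEncoding, which may or may not be the exact lookup that
mod_xml2enc performs.

```python
import ctypes
import ctypes.util

# Values of the xmlCharEncoding enum from libxml2's encoding.h.
XML_CHAR_ENCODING_ERROR = -1
XML_CHAR_ENCODING_UTF8 = 1
XML_CHAR_ENCODING_8859_1 = 10


def probe_libxml2(names=("UTF-8", "ISO-8859-1"), path=None):
    """Return {charset name: xmlCharEncoding value}.

    Returns None when no libxml2 shared library can be found or loaded.
    """
    lib_path = path or ctypes.util.find_library("xml2")
    if lib_path is None:
        return None
    try:
        lib = ctypes.CDLL(lib_path)
    except OSError:
        return None
    # xmlCharEncoding xmlParseCharEncoding(const char *name)
    lib.xmlParseCharEncoding.argtypes = [ctypes.c_char_p]
    lib.xmlParseCharEncoding.restype = ctypes.c_int
    return {name: lib.xmlParseCharEncoding(name.encode("ascii")) for name in names}
```

Calling probe_libxml2(path="/opt/csw/lib/libxml2.so") would test the same
file that the LoadFile line loads. A value of XML_CHAR_ENCODING_ERROR for
either name would point at the library itself; the values 1 and 10 would
suggest the library recognises both names and the problem lies elsewhere.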

Thanks for any help

Toni






