On Tue, Dec 08, 2009 at 04:11:56PM +0100, Daniel Veillard wrote: > On Tue, Dec 08, 2009 at 02:40:49PM +0000, Daniel P. Berrange wrote: > > ACK, this is a better approach because it works when a browser is pointed > > to the HTML files locally too. > > Okay, applied, this should propagate to the web site within an hour or > so, Well this didn't work, and after some debugging I found the problem. Out html.in are processed with xsltproc --html option, i.e. using the HTML parser, with default to ISO latin if out of ascii range and no meta information is provided, and the resulting .html file was correct XML (XHTML) in UTF-8 but the initil UTF-8 2 bytes were misinterpreted in the HTML parsing. The solution is to add a meta header to the news.html.in file indicating the content is UTF-8, then the proper UTF-8 HTML is generated, and I checked the Web server doesn't add any HTTP header for the encoding, so now we use UTF-8 from source to destination, and the problem should be fixed. Daniel -- Daniel Veillard | libxml Gnome XML XSLT toolkit http://xmlsoft.org/ daniel@xxxxxxxxxxxx | Rpmfind RPM search engine http://rpmfind.net/ http://veillard.com/ | virtualization library http://libvirt.org/
commit 604c70fd5da43c4105fa92162fd7cb4e9de840e1 Author: Daniel Veillard <veillard@xxxxxxxxxx> Date: Wed Dec 9 11:16:48 2009 +0100 Fix the news file non-ascii characters by adding an explicit HTML encoding meta description diff --git a/docs/news.html.in b/docs/news.html.in index 1635a5a..50ad247 100644 --- a/docs/news.html.in +++ b/docs/news.html.in @@ -1,5 +1,8 @@ <?xml version="1.0"?> <html> + <head> + <meta http-equiv="Content-Type" content="text/html; charset=utf-8" /> + </head> <body> <h1 >Releases</h1> <p>Here is the list of official releases, however since it is early on in the
-- Libvir-list mailing list Libvir-list@xxxxxxxxxx https://www.redhat.com/mailman/listinfo/libvir-list