Hello Simon, On Mon, Feb 17, 2014 at 10:34 PM, Simon Paillard <spaillard@xxxxxxxxxx> wrote: > On Mon, Feb 17, 2014 at 04:44:19PM +0100, Michael Kerrisk (man-pages) wrote: >> On Mon, Feb 17, 2014 at 4:33 PM, Simon Paillard <spaillard@xxxxxxxxxx> wrote: >> > On Mon, Feb 17, 2014 at 02:15:51PM +0100, Michael Kerrisk (man-pages) wrote: >> >> On 02/16/2014 02:48 PM, Simon Paillard wrote: >> >> > Introduced in 33972b0b231b9b0ee708f27fe42461dbca6e2968 >> >> >> >> Bother! I still didn't get it right. >> >> >> >> Thanks for the patch! Applied. >> > >> > BTW, I check the rendering of the iso 8859 -11 and -13 manpages before/after >> > the change and I think there is something to be fixed too. >> >> Oh -- that was automated with some scripts by Peter Schiffer, and the >> results looked okay to me, but maybe I did not look closely enough. >> What problems do you see? > > Using the following method: > for file in $(git diff --stat --name-only man-pages-3.58..man-pages-3.59 man* ) ; \ > do diff -u \ > <( man -l <( git show man-pages-3.58:${file} ) ) \ > <( man -l <( git show man-pages-3.59:${file} ) ) >> all-diff ; \ > done > > On my laptop, it shows differences for : > > man -l man7/iso_8859-11.7 > 3.58: 241 161 A1 ก THAI CHARACTER KO KAI > 3.59: 241 161 A1 ¡ THAI CHARACTER KO KAI > > man -l man7/iso_8859-13.7 > 3.58: 241 161 A1 ‘ RIGHT DOUBLE QUOTATION MARK > 3.59: 241 161 A1 ¡ RIGHT DOUBLE QUOTATION MARK > > Given the intro of these manpages: > The following table displays the characters in ISO 8859-13, which are > printable and unlisted in the ascii(7) manual page. The fourth column will > only show the proper glyphs in an environment configured for ISO 8859-13. > > I wonder the point of coding it in unicode, but I'm no locale expert. The main point is, I think, that the pages will now display correctly even if your environment is not configured for the character set encoding. For example, many of those pages didn't display correctly on my systems. (And it looks like this was also at least sometimes the case for you, since RIGHT DOUBLE QUOTATION MARK was not rendering correctly for you in iso_8859-13.7.) Also, UTF-8 is pretty much the web standard now, AFAICS, and this change makes HTML renderings of the pages easier (and more likely to be correct). Cheers, Michael -- Michael Kerrisk Linux man-pages maintainer; http://www.kernel.org/doc/man-pages/ Linux/UNIX System Programming Training: http://man7.org/training/ -- To unsubscribe from this list: send the line "unsubscribe linux-man" in the body of a message to majordomo@xxxxxxxxxxxxxxx More majordomo info at http://vger.kernel.org/majordomo-info.html