Re: iso_8859-{11,13}.7 fix encoding issues ?

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



Hello Simon,

On Mon, Feb 17, 2014 at 10:34 PM, Simon Paillard <spaillard@xxxxxxxxxx> wrote:
> On Mon, Feb 17, 2014 at 04:44:19PM +0100, Michael Kerrisk (man-pages) wrote:
>> On Mon, Feb 17, 2014 at 4:33 PM, Simon Paillard <spaillard@xxxxxxxxxx> wrote:
>> > On Mon, Feb 17, 2014 at 02:15:51PM +0100, Michael Kerrisk (man-pages) wrote:
>> >> On 02/16/2014 02:48 PM, Simon Paillard wrote:
>> >> > Introduced in 33972b0b231b9b0ee708f27fe42461dbca6e2968
>> >>
>> >> Bother! I still didn't get it right.
>> >>
>> >> Thanks for the patch! Applied.
>> >
>> > BTW, I check the rendering of the iso 8859 -11 and -13 manpages before/after
>> > the change and I think there is something to be fixed too.
>>
>> Oh -- that was automated with some scripts by Peter Schiffer, and the
>> results looked okay to me, but maybe I did not look closely enough.
>> What problems do you see?
>
> Using the following method:
> for file in $(git diff --stat --name-only man-pages-3.58..man-pages-3.59 man* ) ; \
>         do diff -u      \
>                 <( man -l <( git show man-pages-3.58:${file} ) )        \
>                 <( man -l <( git show man-pages-3.59:${file} ) ) >> all-diff ; \
> done
>
> On my laptop, it shows differences for :
>
> man -l man7/iso_8859-11.7
> 3.58:   241   161   A1     ก     THAI CHARACTER KO KAI
> 3.59:   241   161   A1     ¡     THAI CHARACTER KO KAI
>
> man -l man7/iso_8859-13.7
> 3.58:   241   161   A1     ‘     RIGHT DOUBLE QUOTATION MARK
> 3.59:   241   161   A1     ¡     RIGHT DOUBLE QUOTATION MARK
>
> Given the intro of these manpages:
>        The  following  table displays the characters in ISO 8859-13, which are
>        printable and unlisted in the ascii(7) manual page.  The fourth column will
>        only show the proper glyphs in an environment configured for ISO 8859-13.
>
> I wonder the point of coding it in unicode, but I'm no locale expert.

The main point is, I think, that the pages will now display correctly
even if your environment is not configured for the character set
encoding. For example, many of those pages didn't display correctly on
my systems. (And it looks like this was also at least sometimes the
case for you, since RIGHT DOUBLE QUOTATION MARK was not rendering
correctly for you in iso_8859-13.7.) Also, UTF-8 is pretty much the
web standard now, AFAICS, and this change makes HTML renderings of the
pages easier (and more likely to be correct).

Cheers,

Michael



-- 
Michael Kerrisk
Linux man-pages maintainer; http://www.kernel.org/doc/man-pages/
Linux/UNIX System Programming Training: http://man7.org/training/
--
To unsubscribe from this list: send the line "unsubscribe linux-man" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at  http://vger.kernel.org/majordomo-info.html




[Index of Archives]     [Kernel Documentation]     [Netdev]     [Linux Ethernet Bridging]     [Linux Wireless]     [Kernel Newbies]     [Security]     [Linux for Hams]     [Netfilter]     [Bugtraq]     [Yosemite News]     [MIPS Linux]     [ARM Linux]     [Linux RAID]     [Linux Admin]     [Samba]

  Powered by Linux