Re: [PATCH] unicode.7: update to reflect past developments

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



On 06/10/2014 10:39 AM, Marko Myllynen wrote:
> Hi,
> 
> the unicode(7) page will look more modern with few small changes, please see below.

Thanks, Marko. Applied.

Cheers,

Michael


>>From a3e9003950b6226b83ec319639bd8ecb9932275b Mon Sep 17 00:00:00 2001
> From: Marko Myllynen <myllynen@xxxxxxxxxx>
> Date: Mon, 9 Jun 2014 17:03:38 +0300
> Subject: [PATCH] unicode.7: update to reflect past developments
> 
> - drop old BUGS section, editors cope with UTF-8 ok these days,
>   and perhaps the state-of-the-art is better described elsewhere
>   anyway than in a man page
> - drop old suggestion about avoiding combined characters
> - refer to LANANA for Linux zone, add registry file reference
> - drop a reference to an inactive/dead mailing list
> - update some reference URLs
> ---
>  man7/unicode.7 |   43 ++++++++-----------------------------------
>  1 files changed, 8 insertions(+), 35 deletions(-)
> 
> diff --git a/man7/unicode.7 b/man7/unicode.7
> index 3eb1054..2fd8407 100644
> --- a/man7/unicode.7
> +++ b/man7/unicode.7
> @@ -213,14 +213,6 @@ and
>  tells, how many positions (0\(en2) the cursor is advanced by the
>  output of a character.
>  .PP
> -Under Linux, in general only the BMP at implementation level 1 should
> -be used at the moment.
> -Up to two combining characters per base
> -character for certain scripts (in particular Thai) are also supported
> -by some UTF-8 terminal emulators and ISO 10646 fonts (level 2), but in
> -general precomposed characters should be preferred where available
> -(Unicode calls this
> -.BR "Normalization Form C" ).
>  .SS Private area
>  In the
>  .BR BMP ,
> @@ -232,8 +224,10 @@ range 0xe000 to 0xefff which can be used individually by any end-user
>  and the Linux zone in the range 0xf000 to 0xf8ff where extensions are
>  coordinated among all Linux users.
>  The registry of the characters
> -assigned to the Linux zone is currently maintained by H. Peter Anvin
> -<Peter.Anvin@xxxxxxxxx>.
> +assigned to the Linux zone is maintained by LANANA and the registry
> +itself is
> +.I Documentation/unicode.txt
> +in the Linux kernel sources.
>  .SS Literature
>  .TP 0.2i
>  *
> @@ -244,7 +238,7 @@ for Standardization, Geneva, 2000.
>  
>  This is the official specification of
>  .BR UCS .
> -Available as a PDF file on CD-ROM from
> +Available from
>  .UR http://www.iso.ch/
>  .UE .
>  .TP
> @@ -267,7 +261,7 @@ which improved wide and multibyte character support even further.
>  *
>  Unicode Technical Reports.
>  .RS
> -.UR http://www.unicode.org\:/unicode\:/reports/
> +.UR http://www.unicode.org\:/reports/
>  .UE
>  .RE
>  .TP
> @@ -276,39 +270,18 @@ Markus Kuhn: UTF-8 and Unicode FAQ for UNIX/Linux.
>  .RS
>  .UR http://www.cl.cam.ac.uk\:/~mgk25\:/unicode.html
>  .UE
> -
> -Provides subscription information for the
> -.I linux-utf8
> -mailing list, which is the best place to look for advice on using
> -Unicode under Linux.
>  .RE
>  .TP
>  *
>  Bruno Haible: Unicode HOWTO.
>  .RS
> -.UR ftp://ftp.ilog.fr\:/pub\:/Users\:/haible\:/utf8\:/Unicode-HOWTO.html
> +.UR http://www.tldp.org\:/HOWTO\:/Unicode-HOWTO.html
>  .UE
>  .RE
> -.SH BUGS
> -When this man page was last revised, the GNU C Library support for
> -.B UTF-8
> -locales was mature and XFree86 support was in an advanced state, but
> -work on making applications (most notably editors) suitable for use in
> -.B UTF-8
> -locales was still fully in progress.
> -Current general
> -.B UCS
> -support under Linux usually provides for CJK double-width characters
> -and sometimes even simple overstriking combining characters, but
> -usually does not include support for scripts with right-to-left
> -writing direction or ligature substitution requirements such as
> -Hebrew, Arabic, or the Indic scripts.
> -These scripts are currently
> -supported only in certain GUI applications (HTML viewers, word processors)
> -with sophisticated text rendering engines.
>  .\" .SH AUTHOR
>  .\" Markus Kuhn <mgk25@xxxxxxxxxxxx>
>  .SH SEE ALSO
> +.BR locale (1),
>  .BR setlocale (3),
>  .BR charsets (7),
>  .BR utf-8 (7)
> 


-- 
Michael Kerrisk
Linux man-pages maintainer; http://www.kernel.org/doc/man-pages/
Linux/UNIX System Programming Training: http://man7.org/training/
--
To unsubscribe from this list: send the line "unsubscribe linux-man" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at  http://vger.kernel.org/majordomo-info.html




[Index of Archives]     [Kernel Documentation]     [Netdev]     [Linux Ethernet Bridging]     [Linux Wireless]     [Kernel Newbies]     [Security]     [Linux for Hams]     [Netfilter]     [Bugtraq]     [Yosemite News]     [MIPS Linux]     [ARM Linux]     [Linux RAID]     [Linux Admin]     [Samba]

  Powered by Linux