Unicode related query

Greetings,

I am able to get an English word list from <file> by using the following command:

cat <file> | tr -sc A-Za-z '\012'
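As a quick check of what this does, here is a small sketch on a made-up sample line (the sample text and the helper name ascii_words are only for illustration). Every run of characters other than A-Z and a-z becomes a single newline, leaving one word per line.

```shell
ascii_words() {
    # -c: complement the set (everything that is not an ASCII letter)
    # -s: squeeze each run of replaced characters into one newline (\012)
    tr -sc 'A-Za-z' '\012'
}

printf 'Hello, world! 123 foo\n' | ascii_words
```

This should print Hello, world and foo on separate lines.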

My question is how to specify Unicode characters as well as ASCII in
the tr command. Specifically, the text file contains 3-byte sequences
starting with \xe0.

I am able to see the character using:

echo -e '\xe0\xa5\xbf'
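For reference, the same three bytes can be dumped in hex and in octal. The octal form is the kind of escape tr accepts, like the '\012' in the command above. This is only a sketch, assuming od from coreutils is available; the helper name sample_char is made up for illustration.

```shell
# The character from the echo example, written with portable octal escapes
# (\340\245\277 is the same byte sequence as \xe0\xa5\xbf).
sample_char() {
    printf '\340\245\277'
}

sample_char | od -An -tx1   # hex view of the three bytes
sample_char | od -An -to1   # octal view of the same bytes
```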

What regex incantation would make tr give the results I want?
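One possible direction, offered as a sketch rather than a confirmed answer: it assumes GNU tr, which works on bytes rather than characters, and so the set is given as byte ranges in octal. The helper name words is hypothetical. The ranges cover the lead bytes of 3-byte UTF-8 sequences (0xE0-0xEF) and the continuation bytes (0x80-0xBF).

```shell
words() {
    # Keep ASCII letters plus the bytes of 3-byte UTF-8 sequences:
    #   \340-\357 = lead bytes 0xE0-0xEF
    #   \200-\277 = continuation bytes 0x80-0xBF
    # Everything else is turned into a newline, and runs are squeezed.
    LC_ALL=C tr -sc 'A-Za-z\340-\357\200-\277' '\012'
}

printf 'foo \340\245\277 bar\n' | words
```

Because this matches bytes and not characters, continuation bytes that belong to 2-byte or 4-byte sequences would also be kept while their lead bytes are dropped, so it is only suitable when the non-ASCII text is known to consist of 3-byte sequences.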

I am new to Unicode.

Regards,

Rajagopal
_______________________________________________
CentOS mailing list
CentOS@xxxxxxxxxx
http://lists.centos.org/mailman/listinfo/centos