Re: Questions about Unicode Normalization Form

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



在 2024/4/6 21:26, James Bottomley 写道:
On Sat, 2024-04-06 at 17:54 +0800, HAN Yuwei wrote:
Hi, all.

I have encountered someone else's Unicode Normalization Form(NF)
problem today. And I wonder how Linux process filenames in Unicode.

After some search I found that everybody seems like processed it on
user input level, and nothing is mentioned about how vfs or specific
filesystem treated this problem. ZFS treated it with a option
"normalization" explicitly. Windows (or NTFS?) says "There is no need
to perform any Unicode normalization on path and file name strings".

Unicode have a dedicated FAQ about this:
https://unicode.org/faq/normalization.html

Is there any conclusion or discussion I missed?
This question is way to broad to answer.  Why don't you look in

fs/unicode

Sorry, I am not very familiar with Unicode nor kernel. Correct me if wrong.

As to what I have read, kernel seems like using NFD when processing all UTF-8 related string. If fs is using these helper function, then I can be sure kernel is applying NFD to every UTF-8 filenames. But I can't find any references to these helper function on Github mirror, how are they used by fs code?

and see where the helpers are used and then ask a more specific
question.

James

Attachment: OpenPGP_signature.asc
Description: OpenPGP digital signature


[Index of Archives]     [Linux Ext4 Filesystem]     [Union Filesystem]     [Filesystem Testing]     [Ceph Users]     [Ecryptfs]     [NTFS 3]     [AutoFS]     [Kernel Newbies]     [Share Photos]     [Security]     [Netfilter]     [Bugtraq]     [Yosemite News]     [MIPS Linux]     [ARM Linux]     [Linux Security]     [Linux Cachefs]     [Reiser Filesystem]     [Linux RAID]     [NTFS 3]     [Samba]     [Device Mapper]     [CEPH Development]

  Powered by Linux