On 2024/4/6 21:26, James Bottomley wrote:
> On Sat, 2024-04-06 at 17:54 +0800, HAN Yuwei wrote:
>> Hi, all.
>>
>> I have encountered someone else's Unicode Normalization Form (NF) problem today, and I wonder how Linux processes filenames in Unicode. After some searching, I found that everybody seems to handle it at the user-input level, and nothing is mentioned about how the VFS or specific filesystems treat this problem.
>>
>> ZFS handles it explicitly with a "normalization" option. Windows (or NTFS?) says "There is no need to perform any Unicode normalization on path and file name strings".
>>
>> Unicode has a dedicated FAQ about this: https://unicode.org/faq/normalization.html
>>
>> Is there any conclusion or discussion I missed?
>
> This question is way too broad to answer. Why don't you look in fs/unicode

Sorry, I am not very familiar with Unicode or the kernel. Correct me if I am wrong.

From what I have read, the kernel seems to use NFD when processing all UTF-8 related strings. If filesystems are using these helper functions, then I can be sure the kernel applies NFD to every UTF-8 filename. But I can't find any references to these helper functions on the GitHub mirror; how are they used by fs code?

> and see where the helpers are used and then ask a more specific question.
>
> James
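
To make the underlying issue concrete, here is a minimal user-space sketch in Python. It is only an illustration of NFC versus NFD using the standard unicodedata module; it is not kernel code and says nothing about what fs/unicode or any particular filesystem actually does. The helper names utf8_forms and same_name are made up for this example.

```python
import unicodedata


def utf8_forms(name: str) -> dict[str, bytes]:
    """Return the UTF-8 bytes of `name` under NFC and NFD."""
    return {
        form: unicodedata.normalize(form, name).encode("utf-8")
        for form in ("NFC", "NFD")
    }


def same_name(a: str, b: str, form: str = "NFD") -> bool:
    """Compare two names after normalizing both to the same form.

    Hypothetical helper: this is how a normalization-aware lookup
    could compare names, not how any real filesystem is implemented.
    """
    return unicodedata.normalize(form, a) == unicodedata.normalize(form, b)


composed = "caf\u00e9"     # 'é' as one code point (U+00E9)
decomposed = "cafe\u0301"  # 'e' followed by combining acute (U+0301)

print(utf8_forms(composed))
# The raw byte sequences differ, so a byte-wise comparison sees two names.
print(composed.encode("utf-8") == decomposed.encode("utf-8"))  # False
# After normalizing both sides, they compare equal.
print(same_name(composed, decomposed))  # True
```

A filesystem that compares filenames byte for byte would treat these two strings as different names, which is the kind of problem I ran into.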