> Sorry, a better example would've been "ñ" (U+00F1). You can also > represent it as "n" (U+006E) followed by "◌̃" (U+0303 -- "combining > tilde"). Both forms are defined by Unicode to be canonically equivalent > so it would be incorrect to treat the two Unicode strings differently > (that isn't quite the case for "Å"). So utf8-normalize will convert "ñ" (U+00F1) and "n" (U+006E) followed by "◌̃" to a utf8 code, and both are the same, right? Then compare it byte by byte.