Sergei Haller wrote: > On Wed, 20 Jul 2005, Ludwig Nussel (LN) wrote: > > >>Klaus Schmidinger wrote: >> >>>[...] >>>To me, a character is an entity that's always the same size (preferably >>>one byte). UTF-8 breaks with this, so if you have a string that has, >>>e.g. a strlen() of 10, you can't be sure that this will be really 10 >>>printing >>>characters because there might be some "escaped" characters. > > > I think the confusion comes from the assumption that a character is > exactly one byte long. > > strlen counts bytes not characters. > > in utf-8 a character can be up to 4 (or was it 8) bytes long. > > IIRC, there are new functions to count characters (wstrlen, wstrcmp, > etc.) Aren't you confusing this with "wide character" functions? Klaus