Re: Re: Detecting The Encoding Of A Text File

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



On Thu, 2009-11-26 at 15:39 +0200, דניאל דנון wrote:

> If windows notepad can detect encoding there must be a way to do it
> yourself.
> 
> Maybe try to get the file's headers, I think it should also contain the
> encoding of the file...
> 
> 2009/11/26 Nitsan Bin-Nun <nitsanbn@xxxxxxxxx>
> 
> > Someone have already suggested it but I haven't tried it yet.
> >
> > The thing is that right now it contains Hebrew, but tommorrow this file
> > will
> > be in German or any other accented language.
> > I'm trying to create a function which would detect the encoding and convert
> > it into UTF8.
> >
> > (I don't have much experience in encoding.. :( )
> >
> > 2009/11/26 Nisse Engström <news.NOSPAM.0ixbtqKe@xxxxxxxx>
> >
> > > On Thu, 26 Nov 2009 06:55:31 +0200, Nitsan Bin-Nun wrote:
> > >
> > > > Hi,
> > > >
> > > > I have been trying for the last couple of hours to determine the
> > > > encoding of a text file (.txt in windowz).
> > > >
> > > > I have this code:
> > > >
> > > >         $contents = file_get_contents($config['
> > > > txt_dir'] . $file);
> > > >         $encoding = mb_detect_encoding($contents,
> > > > "UTF-8,ISO-8859-1,WINDOWS-1252"); //,Windows-1255
> > > >
> > > >         echo "||encoding:".$encoding."||";
> > > >
> > > >         if ($encoding == 'UTF-8')
> > > >         {
> > > >             $utfcontents = $contents;
> > > >         }
> > > >         else if ($encoding == 'ISO-8859-1')
> > > >         {
> > > >             $utfcontents = utf8_encode($contents);
> > > >         }
> > > >
> > > >         var_dump($utfcontents);
> > > >
> > > > The $encoding is ISO-8859-1, the text file contains Hebrew characters,
> > > then
> > > > I'm converting it to utf8.
> > > >
> > > > The above code is outputing gibbrish, it seems that it has converted it
> > > in
> > > > some way but not in the
> > > > proper way that it should have converted it.
> > >
> > > If you know that the file contains Hebrew, maybe you should
> > > try converting from ISO-8859-8?
> > >
> > >
> > > /Nisse
> > >
> > > --
> > > PHP General Mailing List (http://www.php.net/)
> > > To unsubscribe, visit: http://www.php.net/unsub.php
> > >
> > >
> >
> 
> 
> 


A plain text file wouldn't have headers like that would it? At least,
not in the sense that an image file has a header, or an office document
file has a header.

Thanks,
Ash
http://www.ashleysheridan.co.uk



[Index of Archives]     [PHP Home]     [Apache Users]     [PHP on Windows]     [Kernel Newbies]     [PHP Install]     [PHP Classes]     [Pear]     [Postgresql]     [Postgresql PHP]     [PHP on Windows]     [PHP Database Programming]     [PHP SOAP]

  Powered by Linux