Re: opening utf-8 files - chinese mb characters

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 





Andrew Ballard wrote:
On Wed, Apr 8, 2009 at 11:38 AM, Per Jessen <per@xxxxxxxxxxxx> wrote:
Merlin Morgenstern wrote:

Hello everybody,

I am having some trouble with utf-8 encoding. The html file containes
chinese characters and looks ok, when opened in a browser.

Now I want to extract some text from the file. In order to do this I
do:

$handle = fopen($file, "r");
$contents = fread($handle, filesize($file));

echo $contents;

The chinese characters are gone by then. They show up as questinomarks
or wired characters. To fix it I tried to add:

$contents = utf8_decode($contents);
header("Content-Type: text/html; charset=utf-8");

But still... no luck :-(

Has somebody an idea why??
Check that the page really is displayed with the right encoding - in FF,
Ctrl-I.

/Per


--
Per Jessen, Zürich (16.6°C)

A bit off topic, but Ctrl+I no longer brings up the Page Info in
Firefox like it used to -- at least on my Windows computers. (It opens
the bookmark list in the sidebar.) Does it do differently under Linux?

Andrew

I use windows right click for this.

Actually I isolated the problem, but still I can't figure out how to solve it. The page is not utf-8, but ugb2312.

I am already sending the header through php:
header("Content-Type: text/html; charset=ugb2312");

And with apache http.conf:
AddDefaultCharset ugb2312

However firefox still claims that it is utf-8 which is causing the strange characters.

What a night mare! Does anybody have an idea on how to get closer to solving this?

Regards, Merlin

--
PHP General Mailing List (http://www.php.net/)
To unsubscribe, visit: http://www.php.net/unsub.php


[Index of Archives]     [PHP Home]     [Apache Users]     [PHP on Windows]     [Kernel Newbies]     [PHP Install]     [PHP Classes]     [Pear]     [Postgresql]     [Postgresql PHP]     [PHP on Windows]     [PHP Database Programming]     [PHP SOAP]

  Powered by Linux