RE: strange character

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 




On 27 February 2015 14:36:50 GMT+00:00, hadi <almarzuki2011@xxxxxxxxxxx> wrote:
>Sorry for the miss understanding. I understand know what you're trying
>to do.
>
>Here is the web site im getting error while trying to download rss feed
>from it.
>http://www.sahafah.net/rss.php
>
>
>> -----Original Message-----
>> From: Peter West [mailto:peter.b.west@xxxxxxxxx] On Behalf Of Peter
>> West
>> Sent: Friday, February 27, 2015 5:26 AM
>> To: hadi; PHP General
>> Subject: Re:  strange character
>> 
>> Here are the contents of test.txt
>> iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii
>> 
>> 
>> Obviously, this is not the data you are receiving from the internet.
>> 
>> As you say, you do not understand.
>> 
>> What http:// address are you accessing when you get the error?
>> 
>> Peter West
>> "...and behold, something greater than Jonah is here."
>> 
>> > On 27 Feb 2015, at 1:46 am, hadi <almarzuki2011@xxxxxxxxxxx> wrote:
>> >
>> > Here its. I'm attaching it.
>> > But I'm not understanding how this can solve my problem..
>> >
>> >> -----Original Message-----
>> >> From: Peter West [mailto:lists@xxxxxxxxx]
>> >> Sent: Thursday, February 26, 2015 4:28 PM
>> >> To: hadi
>> >> Subject: Re:  strange character
>> >>
>> >> Hi Hadi,
>> >>
>> >> Can you send me the original test.txt as well, please.
>> >>
>> >> Peter West
>> >> "...and behold, something greater than Jonah is here."
>> >>
>> >>> On 26 Feb 2015, at 8:35 pm, hadi <almarzuki2011@xxxxxxxxxxx>
>wrote:
>> >>>
>> >>> I did this the second time im sending the attach file.
>> >>>
>> >>> -----Original Message-----
>> >>> From: hadi [mailto:almarzuki2011@xxxxxxxxxxx]
>> >>> Sent: Thursday, February 26, 2015 9:59 AM
>> >>> To: 'Peter West'
>> >>> Cc: 'PHP General'
>> >>> Subject: RE:  strange character
>> >>>
>> >>> Here is the attach file.
>> >>>
>> >>>> -----Original Message-----
>> >>>> From: Peter West [mailto:lists@xxxxxxxxx]
>> >>>> Sent: Thursday, February 26, 2015 1:07 AM
>> >>>> To: hadi
>> >>>> Cc: PHP General
>> >>>> Subject: Re:  strange character
>> >>>>
>> >>>> That doesn't make sense to me.  Send me the test file.
>> >>>>
>> >>>> Peter West
>> >>>> "...and behold, something greater than Jonah is here."
>> >>>>
>> >>>>> On 26 Feb 2015, at 1:49 am, hadi <almarzuki2011@xxxxxxxxxxx>
>> wrote:
>> >>>>>
>> >>>>> I did what you asked and here is the result.
>> >>>>> Frist cmp, cmp test.txt test.utf8 give nothing output.
>> >>>>> Second cmp, cmp test.txt test.1256 getting this result cmp: EOF
>on
>> >> test.txt.
>> >>>>> But im not understanding how this related to my php error what
>im
>> >> getting.
>> >>>>>
>> >>>>> And thanks peter for standing with me in my issue.
>> >>>>> What im trying to achieve here to remove the strange character
>by
>> >>>>> translating utf-8 to CP1256, but unfortunately getting error
>from
>> >>>>> php
>> >>>>> (Notice: iconv(): Detected an illegal character in input string
>in
>> >>>>> )
>> >>>>>
>> >>>>> I did test on the strange character and its working prefect,
>but
>> >>>>> when it's
>> >>>> come to downloading rss feed, it give error.
>> >>>>>
>> >>>>> <?php
>> >>>>>
>> >>>>>  $str = '(–)';
>> >>>>>
>> >>>>>  echo iconv("UTF-8", 'CP1256//TRANSLIT', $str); ?>
>> >>>>>
>> >>>>>> -----Original Message-----
>> >>>>>> From: Peter West [mailto:lists@xxxxxxxxx]
>> >>>>>> Sent: Wednesday, February 25, 2015 3:49 PM
>> >>>>>> To: hadi
>> >>>>>> Cc: PHP General
>> >>>>>> Subject: Re:  strange character
>> >>>>>>
>> >>>>>> If the file is really us-ascii, then there are no characters
>in
>> >>>>>> that file with the 8th bit set, and you can do this:
>> >>>>>>
>> >>>>>> iconv -f ASCII -t UTF8 test.txt >test.utf8 iconv -f ASCII -t
>> >>>>>> CP1256 test.txt
>> >>>>>>> test.1256 cmp test.txt test.utf8
>> >>>>>> (should be no messages about differences.) cmp test.txt
>test.1256
>> >>>>>> (should be no messages about differences.)
>> >>>>>>
>> >>>>>> If that all works, then the test text you have is not the same
>as
>> >>>>>> the text you have the problem with.  If you have any problems
>> >>>>>> with this sequence, tell us what they are.
>> >>>>>>
>> >>>>>> Peter West
>> >>>>>> "...and behold, something greater than Jonah is here."
>> >>>>>>
>> >>>>>>> On 25 Feb 2015, at 10:35 pm, hadi <almarzuki2011@xxxxxxxxxxx>
>> >> wrote:
>> >>>>>>>
>> >>>>>>> Im using linux centos.  I have access to terminal.
>> >>>>>>>
>> >>>>>>> I did what you asked converting the test.txt to CP1256 And to
>> >>>>>>> verify that I did,
>> >>>>>>>
>> >>>>>>> file -bi test.txt
>> >>>>>>>
>> >>>>>>> text/plain; charset=us-ascii
>> >>>>>>>
>> >>>>>>> not showing cp1256 encoding. But when I do,
>> >>>>>>>
>> >>>>>>> iconv -l | grep CP1256
>> >>>>>>> CP1256//
>> >>>>>>> I get CP1256 encoding.
>> >>>>>>>
>> >>>>>>>> -----Original Message-----
>> >>>>>>>> From: Peter West [mailto:lists@xxxxxxxxx]
>> >>>>>>>> Sent: Wednesday, February 25, 2015 3:11 PM
>> >>>>>>>> To: hadi; PHP General
>> >>>>>>>> Subject: Re:  strange character
>> >>>>>>>>
>> >>>>>>>>> On 25 Feb 2015, at 9:06 pm, hadi
><almarzuki2011@xxxxxxxxxxx>
>> >>>> wrote:
>> >>>>>>>>>
>> >>>>>>>>> Hi Peter,
>> >>>>>>>>>
>> >>>>>>>>> I want to convert from utf-8 to CP1256 so the strange
>> >>>>>>>>> character can be
>> >>>>>>>> fixed.
>> >>>>>>>>
>> >>>>>>>> I think it's either a) already in CP1256, or it's in some
>other
>> >>>>>>>> character set altogether.  The error message is telling you
>> >>>>>>>> that it's not
>> >>>>>> recognised as UTF-8.
>> >>>>>>>>
>> >>>>>>>> Do you have access to the iconv program in a terminal?  If
>you
>> >>>>>>>> are on a linux or OS X system, just open a terminal and type
>> >>>>>>>>
>> >>>>>>>> iconv --help
>> >>>>>>>>
>> >>>>>>>> If you have iconv installed you will get a help message.
>> >>>>>>>>
>> >>>>>>>> Get the text you are trying to convert into a text file, and
>> >>>>>>>> just try various conversions using iconv, until it looks
>right.
>> >>>>>>>> Let's say your text is in the file unknown.txt.
>> >>>>>>>>
>> >>>>>>>> iconv -l
>> >>>>>>>>
>> >>>>>>>> will list all of the character sets that iconv knows about.
>> >>>>>>>> Find the likely candidates and just try
>> >>>>>>>>
>> >>>>>>>> iconv -t UTF8 -f CP1252 unknown.txt iconv -t UTF8 -f CP1256
>> >>>>>>>> unknown.txt iconv -t UTF8 -f CP1254 unknown.txt
>> >>>>>>>>
>> >>>>>>>> etc, until it looks right.
>> >>>>>>>>
>> >>>>>>>> Peter West
>> >>>>>>>> "...and behold, something greater than Jonah is here."
>> >>>>>>>>
>> >>>>>>>>>
>> >>>>>>>>>> -----Original Message-----
>> >>>>>>>>>> From: Peter West [mailto:lists@xxxxxxxxx]
>> >>>>>>>>>> Sent: Wednesday, February 25, 2015 1:46 PM
>> >>>>>>>>>> To: hadi
>> >>>>>>>>>> Subject: Re:  strange character
>> >>>>>>>>>>
>> >>>>>>>>>> Aren't you going the wrong way?  It looks as though the
>text
>> >>>>>>>>>> you are trying to convert is one of the 8-bit character
>sets.
>> >>>>>>>>>> From these sets you get bad characters because a character
>> >>>>>>>>>> with the MSBit set will be interpreted by a
>> >>>>>>>>>> UTF-8 system as a multi-byte character.
>> >>>>>>>>>>
>> >>>>>>>>>> Which way do you want to go: from CP1256 to UTF-8 or vice
>> >> versa?
>> >>>>>>>>>>
>> >>>>>>>>>> Peter West
>> >>>>>>>>>> "...and behold, something greater than Jonah is here."
>> >>>>>>>>>>
>> >>>>>>>>>>> On 25 Feb 2015, at 7:15 pm, hadi
>> <almarzuki2011@xxxxxxxxxxx>
>> >>>>>> wrote:
>> >>>>>>>>>>>
>> >>>>>>>>>>> Hi,
>> >>>>>>>>>>>
>> >>>>>>>>>>>
>> >>>>>>>>>>> Im trying to use (iconv("UTF-8", 'CP1256//TRANSLIT',
>$rss);)
>> >>>>>>>>>>> to convert strange character like (–) to proper
>character,
>> >>>>>>>>>>> but im getting error
>> >>>>>>>>>>>
>> >>>>>>>>>>> I googled about it but never found anything about it.
>> >>>>>>>>>>>
>> >>>>>>>>>>> Here is the error
>> >>>>>>>>>>>
>> >>>>>>>>>>> Notice: iconv(): Detected an illegal character in input
>> >>>>>>>>>>> string in /var/www/html/rssfeed/sahafah.php on line 35
>> >>>>>>>>>>>
>> >>>>>>>>>>>
>> >>>>>>>>>>> --
>> >>>>>>>>>>> PHP General Mailing List (http://www.php.net/) To
>> >>>>>>>>>>> unsubscribe,
>> >>>> visit:
>> >>>>>>>>>>> http://www.php.net/unsub.php
>> >>>>>>>>>>>
>> >>>>>>>>>
>> >>>>>>>>>
>> >>>>>>>>
>> >>>>>>>>
>> >>>>>>>> --
>> >>>>>>>> PHP General Mailing List (http://www.php.net/) To
>unsubscribe,
>> >> visit:
>> >>>>>>>> http://www.php.net/unsub.php
>> >>>>>>>
>> >>>>>>>
>> >>>>>
>> >>>>>
>> >>>>>
>> >>>>> --
>> >>>>> PHP General Mailing List (http://www.php.net/) To unsubscribe,
>visit:
>> >>>>> http://www.php.net/unsub.php
>> >>>>>
>> >>>>
>> >>>>
>> >>>> --
>> >>>> PHP General Mailing List (http://www.php.net/) To unsubscribe,
>visit:
>> >>>> http://www.php.net/unsub.php
>> >>>
>> >>> <test.1256.gz><test.utf8.gz>
>> >
>> > <test.gz>
>
>
>
>--
>PHP General Mailing List (http://www.php.net/)
>To unsubscribe, visit: http://www.php.net/unsub.php

Hi Hadi,

Please don't top post.

I've had a look at the RSS feed and it is set to utf-8, both in the xml declaration and the http headers.

I have to assume the broken characters are because of something in your script. Are you sure your PHP is saved into a utf-8 format? 
Thanks,
Ash

-- 
PHP General Mailing List (http://www.php.net/)
To unsubscribe, visit: http://www.php.net/unsub.php





[Index of Archives]     [PHP Home]     [Apache Users]     [PHP on Windows]     [Kernel Newbies]     [PHP Install]     [PHP Classes]     [Pear]     [Postgresql]     [Postgresql PHP]     [PHP on Windows]     [PHP Database Programming]     [PHP SOAP]

  Powered by Linux