On 27 February 2015 14:36:50 GMT+00:00, hadi <almarzuki2011@xxxxxxxxxxx> wrote:
>Sorry for the misunderstanding. I understand now what you're trying
>to do.
>
>Here is the web site I'm getting an error from when trying to download its RSS feed:
>http://www.sahafah.net/rss.php
>
>
>> -----Original Message-----
>> From: Peter West [mailto:peter.b.west@xxxxxxxxx] On Behalf Of Peter
>> West
>> Sent: Friday, February 27, 2015 5:26 AM
>> To: hadi; PHP General
>> Subject: Re: strange character
>>
>> Here are the contents of test.txt
>> iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii
>>
>>
>> Obviously, this is not the data you are receiving from the internet.
>>
>> As you say, you do not understand.
>>
>> What http:// address are you accessing when you get the error?
>>
>> Peter West
>> "...and behold, something greater than Jonah is here."
>>
>> > On 27 Feb 2015, at 1:46 am, hadi <almarzuki2011@xxxxxxxxxxx> wrote:
>> >
>> > Here it is. I'm attaching it.
>> > But I don't understand how this can solve my problem.
>> >
>> >> -----Original Message-----
>> >> From: Peter West [mailto:lists@xxxxxxxxx]
>> >> Sent: Thursday, February 26, 2015 4:28 PM
>> >> To: hadi
>> >> Subject: Re: strange character
>> >>
>> >> Hi Hadi,
>> >>
>> >> Can you send me the original test.txt as well, please.
>> >>
>> >> Peter West
>> >> "...and behold, something greater than Jonah is here."
>> >>
>> >>> On 26 Feb 2015, at 8:35 pm, hadi <almarzuki2011@xxxxxxxxxxx> wrote:
>> >>>
>> >>> This is the second time I'm sending the attached file.
>> >>>
>> >>> -----Original Message-----
>> >>> From: hadi [mailto:almarzuki2011@xxxxxxxxxxx]
>> >>> Sent: Thursday, February 26, 2015 9:59 AM
>> >>> To: 'Peter West'
>> >>> Cc: 'PHP General'
>> >>> Subject: RE: strange character
>> >>>
>> >>> Here is the attached file.
>> >>>
>> >>>> -----Original Message-----
>> >>>> From: Peter West [mailto:lists@xxxxxxxxx]
>> >>>> Sent: Thursday, February 26, 2015 1:07 AM
>> >>>> To: hadi
>> >>>> Cc: PHP General
>> >>>> Subject: Re: strange character
>> >>>>
>> >>>> That doesn't make sense to me. Send me the test file.
>> >>>>
>> >>>> Peter West
>> >>>> "...and behold, something greater than Jonah is here."
>> >>>>
>> >>>>> On 26 Feb 2015, at 1:49 am, hadi <almarzuki2011@xxxxxxxxxxx> wrote:
>> >>>>>
>> >>>>> I did what you asked and here is the result.
>> >>>>> First cmp: "cmp test.txt test.utf8" gives no output.
>> >>>>> Second cmp: "cmp test.txt test.1256" gives this result: "cmp: EOF on test.txt".
>> >>>>> But I don't understand how this relates to the PHP error I'm getting.
>> >>>>>
>> >>>>> And thanks, Peter, for standing with me on my issue.
>> >>>>> What I'm trying to achieve here is to remove the strange character by
>> >>>>> translating UTF-8 to CP1256, but unfortunately I'm getting an error from
>> >>>>> PHP
>> >>>>> (Notice: iconv(): Detected an illegal character in input string in
>> >>>>> )
>> >>>>>
>> >>>>> I did a test on the strange character and it works perfectly, but
>> >>>>> when it comes to downloading the RSS feed, it gives the error.
>> >>>>>
>> >>>>> <?php
>> >>>>>
>> >>>>> $str = '(–)';
>> >>>>>
>> >>>>> echo iconv("UTF-8", 'CP1256//TRANSLIT', $str); ?>
>> >>>>>
>> >>>>>> -----Original Message-----
>> >>>>>> From: Peter West [mailto:lists@xxxxxxxxx]
>> >>>>>> Sent: Wednesday, February 25, 2015 3:49 PM
>> >>>>>> To: hadi
>> >>>>>> Cc: PHP General
>> >>>>>> Subject: Re: strange character
>> >>>>>>
>> >>>>>> If the file is really us-ascii, then there are no characters in
>> >>>>>> that file with the 8th bit set, and you can do this:
>> >>>>>>
>> >>>>>> iconv -f ASCII -t UTF8 test.txt >test.utf8
>> >>>>>> iconv -f ASCII -t CP1256 test.txt >test.1256
>> >>>>>> cmp test.txt test.utf8
>> >>>>>> (should be no messages about differences.)
>> >>>>>> cmp test.txt test.1256
>> >>>>>> (should be no messages about differences.)
>> >>>>>>
>> >>>>>> If that all works, then the test text you have is not the same as
>> >>>>>> the text you have the problem with. If you have any problems
>> >>>>>> with this sequence, tell us what they are.
>> >>>>>>
>> >>>>>> Peter West
>> >>>>>> "...and behold, something greater than Jonah is here."
>> >>>>>>
>> >>>>>>> On 25 Feb 2015, at 10:35 pm, hadi <almarzuki2011@xxxxxxxxxxx> wrote:
>> >>>>>>>
>> >>>>>>> I'm using Linux (CentOS). I have access to a terminal.
>> >>>>>>>
>> >>>>>>> I did what you asked, converting test.txt to CP1256. To
>> >>>>>>> verify that, I ran:
>> >>>>>>>
>> >>>>>>> file -bi test.txt
>> >>>>>>>
>> >>>>>>> text/plain; charset=us-ascii
>> >>>>>>>
>> >>>>>>> It does not show CP1256 encoding. But when I run
>> >>>>>>>
>> >>>>>>> iconv -l | grep CP1256
>> >>>>>>> CP1256//
>> >>>>>>> I get CP1256 encoding.
>> >>>>>>>
>> >>>>>>>> -----Original Message-----
>> >>>>>>>> From: Peter West [mailto:lists@xxxxxxxxx]
>> >>>>>>>> Sent: Wednesday, February 25, 2015 3:11 PM
>> >>>>>>>> To: hadi; PHP General
>> >>>>>>>> Subject: Re: strange character
>> >>>>>>>>
>> >>>>>>>>> On 25 Feb 2015, at 9:06 pm, hadi <almarzuki2011@xxxxxxxxxxx> wrote:
>> >>>>>>>>>
>> >>>>>>>>> Hi Peter,
>> >>>>>>>>>
>> >>>>>>>>> I want to convert from utf-8 to CP1256 so the strange
>> >>>>>>>>> character can be fixed.
>> >>>>>>>>
>> >>>>>>>> I think it's either a) already in CP1256, or b) in some other
>> >>>>>>>> character set altogether. The error message is telling you
>> >>>>>>>> that it's not recognised as UTF-8.
>> >>>>>>>>
>> >>>>>>>> Do you have access to the iconv program in a terminal? If you
>> >>>>>>>> are on a linux or OS X system, just open a terminal and type
>> >>>>>>>>
>> >>>>>>>> iconv --help
>> >>>>>>>>
>> >>>>>>>> If you have iconv installed you will get a help message.
>> >>>>>>>>
>> >>>>>>>> Get the text you are trying to convert into a text file, and
>> >>>>>>>> just try various conversions using iconv, until it looks right.
>> >>>>>>>> Let's say your text is in the file unknown.txt.
>> >>>>>>>>
>> >>>>>>>> iconv -l
>> >>>>>>>>
>> >>>>>>>> will list all of the character sets that iconv knows about.
>> >>>>>>>> Find the likely candidates and just try
>> >>>>>>>>
>> >>>>>>>> iconv -t UTF8 -f CP1252 unknown.txt
>> >>>>>>>> iconv -t UTF8 -f CP1256 unknown.txt
>> >>>>>>>> iconv -t UTF8 -f CP1254 unknown.txt
>> >>>>>>>>
>> >>>>>>>> etc, until it looks right.
>> >>>>>>>>
>> >>>>>>>> Peter West
>> >>>>>>>> "...and behold, something greater than Jonah is here."
>> >>>>>>>>
>> >>>>>>>>>
>> >>>>>>>>>> -----Original Message-----
>> >>>>>>>>>> From: Peter West [mailto:lists@xxxxxxxxx]
>> >>>>>>>>>> Sent: Wednesday, February 25, 2015 1:46 PM
>> >>>>>>>>>> To: hadi
>> >>>>>>>>>> Subject: Re: strange character
>> >>>>>>>>>>
>> >>>>>>>>>> Aren't you going the wrong way? It looks as though the text
>> >>>>>>>>>> you are trying to convert is in one of the 8-bit character sets.
>> >>>>>>>>>> From these sets you get bad characters because a character
>> >>>>>>>>>> with the MSBit set will be interpreted by a
>> >>>>>>>>>> UTF-8 system as a multi-byte character.
>> >>>>>>>>>>
>> >>>>>>>>>> Which way do you want to go: from CP1256 to UTF-8 or vice versa?
>> >>>>>>>>>>
>> >>>>>>>>>> Peter West
>> >>>>>>>>>> "...and behold, something greater than Jonah is here."
>> >>>>>>>>>>
>> >>>>>>>>>>> On 25 Feb 2015, at 7:15 pm, hadi <almarzuki2011@xxxxxxxxxxx> wrote:
>> >>>>>>>>>>>
>> >>>>>>>>>>> Hi,
>> >>>>>>>>>>>
>> >>>>>>>>>>>
>> >>>>>>>>>>> I'm trying to use (iconv("UTF-8", 'CP1256//TRANSLIT', $rss);)
>> >>>>>>>>>>> to convert strange characters like (–) to proper characters,
>> >>>>>>>>>>> but I'm getting an error.
>> >>>>>>>>>>>
>> >>>>>>>>>>> I googled it but never found anything about it.
>> >>>>>>>>>>>
>> >>>>>>>>>>> Here is the error:
>> >>>>>>>>>>>
>> >>>>>>>>>>> Notice: iconv(): Detected an illegal character in input
>> >>>>>>>>>>> string in /var/www/html/rssfeed/sahafah.php on line 35
>> >>>>>>>>>>>
>> >>>>>>>>>>>
>> >>>>>>>>>>> --
>> >>>>>>>>>>> PHP General Mailing List (http://www.php.net/)
>> >>>>>>>>>>> To unsubscribe, visit: http://www.php.net/unsub.php
>> >>>>>>>>>>>
>> >>>
>> >>> <test.1256.gz><test.utf8.gz>
>> >
>> > <test.gz>

Hi Hadi,

Please don't top post.

I've had a look at the RSS feed and it is set to UTF-8, both in the XML declaration and in the HTTP headers. I have to assume the broken characters are caused by something in your script. Are you sure your PHP file is saved in UTF-8?

Thanks,
Ash

--
PHP General Mailing List (http://www.php.net/)
To unsubscribe, visit: http://www.php.net/unsub.php
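
One way to check the question raised above (is a given file really UTF-8?) from the terminal is to ask iconv to convert the file from UTF-8 to UTF-8 and look at its exit status, since iconv stops with an error on an invalid input sequence. The following is only a rough sketch: the function name is_valid_utf8 and the two demo files are made up for illustration and are not part of the thread, and the sample bytes are not taken from the feed.

```shell
#!/bin/sh
# Hypothetical helper (name is an assumption): succeeds if the file is valid UTF-8.
# It relies on iconv exiting non-zero when it meets an invalid input sequence.
is_valid_utf8() {
    iconv -f UTF-8 -t UTF-8 "$1" >/dev/null 2>&1
}

# Demo data, invented for this sketch.
tmpdir=$(mktemp -d)
printf 'caf\303\251\n' >"$tmpdir/good.txt"  # e-acute as the two UTF-8 bytes C3 A9
printf 'caf\351\n'     >"$tmpdir/bad.txt"   # lone byte E9, as an 8-bit code page would store it

for f in "$tmpdir/good.txt" "$tmpdir/bad.txt"; do
    if is_valid_utf8 "$f"; then
        echo "$(basename "$f"): valid UTF-8"
    else
        echo "$(basename "$f"): NOT valid UTF-8"
    fi
done

rm -rf "$tmpdir"
```

The same check could be pointed at a saved copy of the feed and at the PHP script itself (sahafah.php in the error message) to narrow down which of the two contains bytes that are not UTF-8.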