> -----Original Message----- > From: Ashley Sheridan [mailto:ash@xxxxxxxxxxxxxxxxxxxx] > Sent: Friday, February 27, 2015 6:37 PM > To: hadi; 'Peter West'; 'PHP General' > Subject: RE: strange character > > > > On 27 February 2015 14:36:50 GMT+00:00, hadi > <almarzuki2011@xxxxxxxxxxx> wrote: > >Sorry for the miss understanding. I understand know what you're trying > >to do. > > > >Here is the web site im getting error while trying to download rss feed > >from it. > >http://www.sahafah.net/rss.php > > > > > >> -----Original Message----- > >> From: Peter West [mailto:peter.b.west@xxxxxxxxx] On Behalf Of Peter > >> West > >> Sent: Friday, February 27, 2015 5:26 AM > >> To: hadi; PHP General > >> Subject: Re: strange character > >> > >> Here are the contents of test.txt > >> iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiii > >> > >> > >> Obviously, this is not the data you are receiving from the internet. > >> > >> As you say, you do not understand. > >> > >> What http:// address are you accessing when you get the error? > >> > >> Peter West > >> "...and behold, something greater than Jonah is here." > >> > >> > On 27 Feb 2015, at 1:46 am, hadi <almarzuki2011@xxxxxxxxxxx> wrote: > >> > > >> > Here its. I'm attaching it. > >> > But I'm not understanding how this can solve my problem.. > >> > > >> >> -----Original Message----- > >> >> From: Peter West [mailto:lists@xxxxxxxxx] > >> >> Sent: Thursday, February 26, 2015 4:28 PM > >> >> To: hadi > >> >> Subject: Re: strange character > >> >> > >> >> Hi Hadi, > >> >> > >> >> Can you send me the original test.txt as well, please. > >> >> > >> >> Peter West > >> >> "...and behold, something greater than Jonah is here." > >> >> > >> >>> On 26 Feb 2015, at 8:35 pm, hadi <almarzuki2011@xxxxxxxxxxx> > >wrote: > >> >>> > >> >>> I did this the second time im sending the attach file. > >> >>> > >> >>> -----Original Message----- > >> >>> From: hadi [mailto:almarzuki2011@xxxxxxxxxxx] > >> >>> Sent: Thursday, February 26, 2015 9:59 AM > >> >>> To: 'Peter West' > >> >>> Cc: 'PHP General' > >> >>> Subject: RE: strange character > >> >>> > >> >>> Here is the attach file. > >> >>> > >> >>>> -----Original Message----- > >> >>>> From: Peter West [mailto:lists@xxxxxxxxx] > >> >>>> Sent: Thursday, February 26, 2015 1:07 AM > >> >>>> To: hadi > >> >>>> Cc: PHP General > >> >>>> Subject: Re: strange character > >> >>>> > >> >>>> That doesn't make sense to me. Send me the test file. > >> >>>> > >> >>>> Peter West > >> >>>> "...and behold, something greater than Jonah is here." > >> >>>> > >> >>>>> On 26 Feb 2015, at 1:49 am, hadi <almarzuki2011@xxxxxxxxxxx> > >> wrote: > >> >>>>> > >> >>>>> I did what you asked and here is the result. > >> >>>>> Frist cmp, cmp test.txt test.utf8 give nothing output. > >> >>>>> Second cmp, cmp test.txt test.1256 getting this result cmp: EOF > >on > >> >> test.txt. > >> >>>>> But im not understanding how this related to my php error what > >im > >> >> getting. > >> >>>>> > >> >>>>> And thanks peter for standing with me in my issue. > >> >>>>> What im trying to achieve here to remove the strange character > >by > >> >>>>> translating utf-8 to CP1256, but unfortunately getting error > >from > >> >>>>> php > >> >>>>> (Notice: iconv(): Detected an illegal character in input string > >in > >> >>>>> ) > >> >>>>> > >> >>>>> I did test on the strange character and its working prefect, > >but > >> >>>>> when it's > >> >>>> come to downloading rss feed, it give error. > >> >>>>> > >> >>>>> <?php > >> >>>>> > >> >>>>> $str = '(–)'; > >> >>>>> > >> >>>>> echo iconv("UTF-8", 'CP1256//TRANSLIT', $str); ?> > >> >>>>> > >> >>>>>> -----Original Message----- > >> >>>>>> From: Peter West [mailto:lists@xxxxxxxxx] > >> >>>>>> Sent: Wednesday, February 25, 2015 3:49 PM > >> >>>>>> To: hadi > >> >>>>>> Cc: PHP General > >> >>>>>> Subject: Re: strange character > >> >>>>>> > >> >>>>>> If the file is really us-ascii, then there are no characters > >in > >> >>>>>> that file with the 8th bit set, and you can do this: > >> >>>>>> > >> >>>>>> iconv -f ASCII -t UTF8 test.txt >test.utf8 iconv -f ASCII -t > >> >>>>>> CP1256 test.txt > >> >>>>>>> test.1256 cmp test.txt test.utf8 > >> >>>>>> (should be no messages about differences.) cmp test.txt > >test.1256 > >> >>>>>> (should be no messages about differences.) > >> >>>>>> > >> >>>>>> If that all works, then the test text you have is not the same > >as > >> >>>>>> the text you have the problem with. If you have any problems > >> >>>>>> with this sequence, tell us what they are. > >> >>>>>> > >> >>>>>> Peter West > >> >>>>>> "...and behold, something greater than Jonah is here." > >> >>>>>> > >> >>>>>>> On 25 Feb 2015, at 10:35 pm, hadi > <almarzuki2011@xxxxxxxxxxx> > >> >> wrote: > >> >>>>>>> > >> >>>>>>> Im using linux centos. I have access to terminal. > >> >>>>>>> > >> >>>>>>> I did what you asked converting the test.txt to CP1256 And to > >> >>>>>>> verify that I did, > >> >>>>>>> > >> >>>>>>> file -bi test.txt > >> >>>>>>> > >> >>>>>>> text/plain; charset=us-ascii > >> >>>>>>> > >> >>>>>>> not showing cp1256 encoding. But when I do, > >> >>>>>>> > >> >>>>>>> iconv -l | grep CP1256 > >> >>>>>>> CP1256// > >> >>>>>>> I get CP1256 encoding. > >> >>>>>>> > >> >>>>>>>> -----Original Message----- > >> >>>>>>>> From: Peter West [mailto:lists@xxxxxxxxx] > >> >>>>>>>> Sent: Wednesday, February 25, 2015 3:11 PM > >> >>>>>>>> To: hadi; PHP General > >> >>>>>>>> Subject: Re: strange character > >> >>>>>>>> > >> >>>>>>>>> On 25 Feb 2015, at 9:06 pm, hadi > ><almarzuki2011@xxxxxxxxxxx> > >> >>>> wrote: > >> >>>>>>>>> > >> >>>>>>>>> Hi Peter, > >> >>>>>>>>> > >> >>>>>>>>> I want to convert from utf-8 to CP1256 so the strange > >> >>>>>>>>> character can be > >> >>>>>>>> fixed. > >> >>>>>>>> > >> >>>>>>>> I think it's either a) already in CP1256, or it's in some > >other > >> >>>>>>>> character set altogether. The error message is telling you > >> >>>>>>>> that it's not > >> >>>>>> recognised as UTF-8. > >> >>>>>>>> > >> >>>>>>>> Do you have access to the iconv program in a terminal? If > >you > >> >>>>>>>> are on a linux or OS X system, just open a terminal and type > >> >>>>>>>> > >> >>>>>>>> iconv --help > >> >>>>>>>> > >> >>>>>>>> If you have iconv installed you will get a help message. > >> >>>>>>>> > >> >>>>>>>> Get the text you are trying to convert into a text file, and > >> >>>>>>>> just try various conversions using iconv, until it looks > >right. > >> >>>>>>>> Let's say your text is in the file unknown.txt. > >> >>>>>>>> > >> >>>>>>>> iconv -l > >> >>>>>>>> > >> >>>>>>>> will list all of the character sets that iconv knows about. > >> >>>>>>>> Find the likely candidates and just try > >> >>>>>>>> > >> >>>>>>>> iconv -t UTF8 -f CP1252 unknown.txt iconv -t UTF8 -f CP1256 > >> >>>>>>>> unknown.txt iconv -t UTF8 -f CP1254 unknown.txt > >> >>>>>>>> > >> >>>>>>>> etc, until it looks right. > >> >>>>>>>> > >> >>>>>>>> Peter West > >> >>>>>>>> "...and behold, something greater than Jonah is here." > >> >>>>>>>> > >> >>>>>>>>> > >> >>>>>>>>>> -----Original Message----- > >> >>>>>>>>>> From: Peter West [mailto:lists@xxxxxxxxx] > >> >>>>>>>>>> Sent: Wednesday, February 25, 2015 1:46 PM > >> >>>>>>>>>> To: hadi > >> >>>>>>>>>> Subject: Re: strange character > >> >>>>>>>>>> > >> >>>>>>>>>> Aren't you going the wrong way? It looks as though the > >text > >> >>>>>>>>>> you are trying to convert is one of the 8-bit character > >sets. > >> >>>>>>>>>> From these sets you get bad characters because a character > >> >>>>>>>>>> with the MSBit set will be interpreted by a > >> >>>>>>>>>> UTF-8 system as a multi-byte character. > >> >>>>>>>>>> > >> >>>>>>>>>> Which way do you want to go: from CP1256 to UTF-8 or vice > >> >> versa? > >> >>>>>>>>>> > >> >>>>>>>>>> Peter West > >> >>>>>>>>>> "...and behold, something greater than Jonah is here." > >> >>>>>>>>>> > >> >>>>>>>>>>> On 25 Feb 2015, at 7:15 pm, hadi > >> <almarzuki2011@xxxxxxxxxxx> > >> >>>>>> wrote: > >> >>>>>>>>>>> > >> >>>>>>>>>>> Hi, > >> >>>>>>>>>>> > >> >>>>>>>>>>> > >> >>>>>>>>>>> Im trying to use (iconv("UTF-8", 'CP1256//TRANSLIT', > >$rss);) > >> >>>>>>>>>>> to convert strange character like (–) to proper > >character, > >> >>>>>>>>>>> but im getting error > >> >>>>>>>>>>> > >> >>>>>>>>>>> I googled about it but never found anything about it. > >> >>>>>>>>>>> > >> >>>>>>>>>>> Here is the error > >> >>>>>>>>>>> > >> >>>>>>>>>>> Notice: iconv(): Detected an illegal character in input > >> >>>>>>>>>>> string in /var/www/html/rssfeed/sahafah.php on line 35 > >> >>>>>>>>>>> > >> >>>>>>>>>>> > >> >>>>>>>>>>> -- > >> >>>>>>>>>>> PHP General Mailing List (http://www.php.net/) To > >> >>>>>>>>>>> unsubscribe, > >> >>>> visit: > >> >>>>>>>>>>> http://www.php.net/unsub.php > >> >>>>>>>>>>> > >> >>>>>>>>> > >> >>>>>>>>> > >> >>>>>>>> > >> >>>>>>>> > >> >>>>>>>> -- > >> >>>>>>>> PHP General Mailing List (http://www.php.net/) To > >unsubscribe, > >> >> visit: > >> >>>>>>>> http://www.php.net/unsub.php > >> >>>>>>> > >> >>>>>>> > >> >>>>> > >> >>>>> > >> >>>>> > >> >>>>> -- > >> >>>>> PHP General Mailing List (http://www.php.net/) To unsubscribe, > >visit: > >> >>>>> http://www.php.net/unsub.php > >> >>>>> > >> >>>> > >> >>>> > >> >>>> -- > >> >>>> PHP General Mailing List (http://www.php.net/) To unsubscribe, > >visit: > >> >>>> http://www.php.net/unsub.php > >> >>> > >> >>> <test.1256.gz><test.utf8.gz> > >> > > >> > <test.gz> > > > > > > > >-- > >PHP General Mailing List (http://www.php.net/) To unsubscribe, visit: > >http://www.php.net/unsub.php > > Hi Hadi, > > Please don't top post. > > I've had a look at the RSS feed and it is set to utf-8, both in the xml > declaration and the http headers. > > I have to assume the broken characters are because of something in your > script. Are you sure your PHP is saved into a utf-8 format? Yes Ash, my php script saved in UTF-8 format. -- PHP General Mailing List (http://www.php.net/) To unsubscribe, visit: http://www.php.net/unsub.php