Re: Re: Convert .docx /.pdf file to .txt

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



On Sat, 2008-12-06 at 09:17 +0000, Colin Guthrie wrote:
> 'Twas brillig, and Jagdeep Singh at 06/12/08 06:46 did gyre and gimble:
> > Hi!
> > 
> > I need a function to fetch text from docx file, but it is showing formated
> > characters in output. I was using fopen, fgets etc function .
> > 
> > I want to fetch text from .docx and save it to .txt file Without special
> > characters (Microsoft formated characters)
> > 
> > Is there any function or an example??
> 
> Nothing that I know of built in to PHP (although as docx is just XML 
> AFAIK, you could just write an XSLT to extract the content).
> 
> You could also shell out to an application that would do it for you.
> 
> After about 5 seconds on google I found:
> 
> http://sourceforge.net/project/showfiles.php?group_id=235455
> 
> If it works or not I have no idea!
> 
> Col
> 
> -- 
> 
> Colin Guthrie
> gmane(at)colin.guthr.ie
> http://colin.guthr.ie/
> 
> Day Job:
>    Tribalogic Limited [http://www.tribalogic.net/]
> Open Source:
>    Mandriva Linux Contributor [http://www.mandriva.com/]
>    PulseAudio Hacker [http://www.pulseaudio.org/]
>    Trac Hacker [http://trac.edgewall.org/]
> 
> 
Also, as far as I believe, docx isn't just an XML document, but rather a
zip file of xml docs and other media.


Ash
www.ashleysheridan.co.uk


-- 
PHP General Mailing List (http://www.php.net/)
To unsubscribe, visit: http://www.php.net/unsub.php


[Index of Archives]     [PHP Home]     [Apache Users]     [PHP on Windows]     [Kernel Newbies]     [PHP Install]     [PHP Classes]     [Pear]     [Postgresql]     [Postgresql PHP]     [PHP on Windows]     [PHP Database Programming]     [PHP SOAP]

  Powered by Linux