Ashley Sheridan wrote: > Hi All, > > I recently asked a question regarding reading a PDF with PHP. I've tried > Zend_pdf, but all this is able to give me is the number of pages in a > PDF, and cannot extract the text from the PDF files I have. I thought > I'd try a different method, and try to extract the text straight from > the Word document which is used to generate the PDF. Does anyone have > any experience with this sort of thing, or enough to suggest a library > which is capable of this? > > > Ash > www.ashleysheridan.co.uk > It might be worth lokking into apps like catdoc http://freshmeat.net/projects/catdoc/ its able to extract texts from doc/ppt/xls files (similar to "strings" command but takes care of files internal encoding for you). Reading README in this package might give you some tips -- PHP General Mailing List (http://www.php.net/) To unsubscribe, visit: http://www.php.net/unsub.php