RE: extract text from pdf

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



If this is on a *nix box, I would suggest using the pdf2text command
within shell_exec.  It should work as long as the PDF isn't a scanned
image.  Obviously it won't get text off the images, and you'd want to
make sure that any input to filenames (if they're dynamic) are verified
and scrubbed first so people can't access different files on the
filesystem.

HTH,
Ray

> -------- Original Message --------
> Subject:  extract text from pdf
> From: cajbecu <cajbecu@xxxxxxxxx>
> Date: Thu, May 11, 2006 1:45 am
> To: "'PHP General (E-mail)'" <php-general@xxxxxxxxxxxxx>
> 
> Hello,
> 
> Is there any posibility to extract all text from a PDF file? (I have
> read all the documentation about PHP PDF-Lib but no answer...)
> 
> Thanks in advance,
> 	cajbecu
> 
> -- 
> PHP General Mailing List (http://www.php.net/)
> To unsubscribe, visit: http://www.php.net/unsub.php

-- 
PHP General Mailing List (http://www.php.net/)
To unsubscribe, visit: http://www.php.net/unsub.php


[Index of Archives]     [PHP Home]     [Apache Users]     [PHP on Windows]     [Kernel Newbies]     [PHP Install]     [PHP Classes]     [Pear]     [Postgresql]     [Postgresql PHP]     [PHP on Windows]     [PHP Database Programming]     [PHP SOAP]

  Powered by Linux