PDF data extract

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



i want to extract all the xml data that is at the bottom of all pdf's (it
will be only ones i create so i know that information will be there)
the xml is like this
         <dc:title>
            <rdf:Alt>
               <rdf:li xml:lang="x-default">{*TITLE*}</rdf:li>
            </rdf:Alt>
         </dc:title>
         <dc:creator>
            <rdf:Seq>
               <rdf:li>{*AUTHOR*}</rdf:li>
            </rdf:Seq>
         </dc:creator>
thats an extract of it, the text in the {**} is the information i want to
extract i belive this can be done with regexp's, how ever if theres a fast
way i would like to know
this information on the pdf can be found in the last couple of lines of a
.pdf document, to see this open it up in notepad and scroll to the bottom
any help would be nice thanks
--
~Philip Pryce

[Index of Archives]     [PHP Home]     [PHP Users]     [Postgresql Discussion]     [Kernel Newbies]     [Postgresql]     [Yosemite News]

  Powered by Linux