PDF data extract

"Philip Pryce" <philip.waters.pryce@xxxxxxxxx> · Fri, 24 Feb 2006 19:28:40 +0000



I want to extract the XML data that sits at the bottom of all my PDFs (these
will only be ones I create, so I know the information will be there).
The XML looks like this:
         <dc:title>
            <rdf:Alt>
               <rdf:li xml:lang="x-default">{*TITLE*}</rdf:li>
            </rdf:Alt>
         </dc:title>
         <dc:creator>
            <rdf:Seq>
               <rdf:li>{*AUTHOR*}</rdf:li>
            </rdf:Seq>
         </dc:creator>
That's an extract of it; the text in the {**} is the information I want to
extract. I believe this can be done with regexps; however, if there's a faster
way I would like to know.
This information can be found in the last couple of lines of a
.pdf document. To see it, open the file in Notepad and scroll to the bottom.
Any help would be nice, thanks.
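
For reference, here is a rough sketch of the regexp approach in Python. It assumes the XML block is stored uncompressed in the file (which is what makes it readable in Notepad) and that the fields look like the extract above. The names `extract_xmp_fields` and `extract_from_file` are made up for illustration, not part of any library.

```python
import re

# Patterns for the two fields shown in the extract above.
# re.DOTALL lets ".*?" span the line breaks inside each element.
TITLE_RE = re.compile(
    rb"<dc:title>.*?<rdf:li[^>]*>(.*?)</rdf:li>.*?</dc:title>", re.DOTALL
)
CREATOR_RE = re.compile(
    rb"<dc:creator>.*?<rdf:li[^>]*>(.*?)</rdf:li>.*?</dc:creator>", re.DOTALL
)


def extract_xmp_fields(data: bytes) -> dict:
    """Return the title and first author found in raw PDF bytes.

    Hypothetical helper: assumes the XML is uncompressed and UTF-8 encoded.
    A field that is not found is returned as None.
    """
    result = {}
    for key, pattern in (("title", TITLE_RE), ("author", CREATOR_RE)):
        match = pattern.search(data)
        result[key] = match.group(1).decode("utf-8").strip() if match else None
    return result


def extract_from_file(path: str) -> dict:
    # Read in binary mode: a PDF is not a plain text file.
    with open(path, "rb") as f:
        return extract_xmp_fields(f.read())
```

Calling `extract_from_file("some.pdf")` (the path is only a placeholder) would return a dict with `title` and `author` keys. Note that this only picks up the first `rdf:li` in each element and does not decode XML entities such as `&amp;`, so a proper XML parser run over the extracted block may be the safer route if the values can contain special characters.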
--
~Philip Pryce