Re: PDF to text?

On 12Aug2011 12:09, Bob Goodwin <bobgoodwin@xxxxxxxxxxxx> wrote:
| On 12/08/11 12:04, Genes MailLists wrote:
| > On 08/12/2011 11:58 AM, Bob Goodwin wrote:
| >> On 12/08/11 11:22, Genes MailLists wrote:
| >>> On 08/12/2011 11:16 AM, Madhav Ancha wrote:
| >>>     You could try this fedora app:  pdftotext
| >>>
| >>          As can be seen I tried several combinations, thought perhaps it
| >>          couldn't handle the file name in quotes "Couier  etc" but nothing
| >>          seems to do it?
| >>
| >    Is it possible the PDF contains an image of the text rather than text
| > itself ?
| 
|         I'm not sure, how would I tell? It's an attachment to an HTML
|         cover letter. The Fedora default app displays it with no
|         complaints.

Is it ridiculously large for the amount of text? Does it seem to have
scanner artifacts in the text - "graininess" if you peer closely, fuzzy
text instead of perfectly formed letters (i.e. a picture of text instead
of text rendered by your computer from a font)?
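
If you want something less subjective than squinting, the "large for the
amount of text" check can be roughed out in a few lines of Python. This is
only a sketch: it assumes pdftotext is on your PATH, the function names are
mine, and the 2000-bytes-per-character cutoff is a number I made up for
illustration, not a known threshold.

```python
import os
import subprocess


def extract_text(pdf_path):
    """Run pdftotext and return whatever text it finds (possibly none)."""
    result = subprocess.run(
        ["pdftotext", pdf_path, "-"],  # "-" sends the text to stdout
        capture_output=True,
        text=True,
        check=True,
    )
    return result.stdout


def looks_like_scan(size_bytes, text, max_bytes_per_char=2000):
    """Guess whether a PDF is a picture of text.

    True if no visible characters were extracted, or if the file is
    very large relative to the text recovered. The cutoff is an
    arbitrary illustrative value; tune it on your own files.
    """
    chars = sum(1 for c in text if not c.isspace())
    if chars == 0:
        return True
    return size_bytes / chars > max_bytes_per_char


def check(pdf_path):
    """Combine the two steps for a file on disk."""
    return looks_like_scan(os.path.getsize(pdf_path), extract_text(pdf_path))
```

Calling check("letter.pdf") (the file name is a placeholder) returns True
when the file looks like a scan. A text PDF with lots of embedded images
will also trip it, so treat the answer as a hint and no more.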

Personally I use pdftohtml to convert PDFs (then an HTML-to-text
pipeline on the end of that). Possibly pdftotext does exactly that
anyway. Of course it achieves nothing for me if the PDF is a scan.
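
For the curious, that kind of pipeline could be sketched like this in
Python. It is not my actual script: the function and class names are
invented for the example, the tag handling is deliberately crude, and the
pdftohtml options (-stdout, -noframes, -i) are the ones I'd expect from the
poppler version, so check your man page before relying on them.

```python
import subprocess
from html.parser import HTMLParser


class _TextOnly(HTMLParser):
    """Collect text content, skipping <script> and <style> bodies."""

    def __init__(self):
        super().__init__()
        self.parts = []
        self._skip = 0

    def handle_starttag(self, tag, attrs):
        if tag in ("script", "style"):
            self._skip += 1
        elif tag in ("br", "p", "div"):
            self.parts.append("\n")  # treat these as line breaks

    def handle_endtag(self, tag):
        if tag in ("script", "style") and self._skip:
            self._skip -= 1
        elif tag in ("p", "div"):
            self.parts.append("\n")

    def handle_data(self, data):
        if not self._skip:
            self.parts.append(data)


def html_to_text(html):
    """Reduce HTML to plain text, one non-empty line per output line."""
    parser = _TextOnly()
    parser.feed(html)
    parser.close()
    raw = "".join(parser.parts)
    lines = (" ".join(line.split()) for line in raw.splitlines())
    return "\n".join(line for line in lines if line)


def pdf_to_text_via_html(pdf_path):
    """Convert a PDF to HTML with pdftohtml, then strip the markup."""
    html = subprocess.run(
        ["pdftohtml", "-stdout", "-noframes", "-i", pdf_path],
        capture_output=True,
        text=True,
        check=True,
    ).stdout
    return html_to_text(html)
```

The HTML-to-text half works on any string, so you can try it without a PDF
to hand. As with pdftotext, a scanned PDF will yield little or no text here.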

Cheers,
-- 
Cameron Simpson <cs@xxxxxxxxxx> DoD#743
http://www.cskk.ezoshosting.com/cs/

Many companies are just now realizing that testing for the year 2000
problems will likely be more time-consuming and expensive than the
fix-it phase.   - Bob Evans, Information Week, September 1997