Re: reading pictures of text in pdf

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



On Thu, Nov 12, 2015 at 4:10 PM, Brian Tew <montanalag@xxxxxxxxx> wrote:
> Is there anything in linux that can convert a pdf file that is a picture of text
> into real actual plain text?

Assuming there's no DRM involved, tesseract-OCR is probably your best
bet. <https://code.google.com/p/tesseract-ocr/>. The source code has
moved to <https://github.com/tesseract-ocr> but the documentation
seems to still be on code.google.com.

Best regards,

Paul

-- 
[Notice not included in the above original message:  The U.S. National
Security Agency neither confirms nor denies that it intercepted this
message.]

_______________________________________________
Blinux-list mailing list
Blinux-list@xxxxxxxxxx
https://www.redhat.com/mailman/listinfo/blinux-list



[Index of Archives]     [Linux Speakup]     [Fedora]     [Linux Kernel]     [Yosemite News]     [Big List of Linux Books]