command line scanned pdf to text

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



Am I the last to find this?
 command line ocr tesseract 
won't directly support .pdf but
pdftocairo
produces .jpg among others which tesseract will read.

May not do well with collumns but not too bad.

Is there anything better?

Thanks
tom Fowle
_______________________________________________
Speakup mailing list
Speakup@xxxxxxxxxxxxxxxxx
http://linux-speakup.org/cgi-bin/mailman/listinfo/speakup




[Index of Archives]     [Linux for the Blind]     [Fedora Discussioin]     [Linux Kernel]     [Yosemite News]     [Big List of Linux Books]
  Powered by Linux