On 27.01.2014 15:01, Tom Horsley wrote: > I remember seeing an OCR program once that would accept pdf files, > so it didn't need to recognize characters, but it still applied > all the OCR layout recognition algorithms to try and detect > the "proper" way to treat the document. I suspect calibre is > doing very limited layout analysis (perhaps none). OCR app that doesn't OCR!? :) > I seem to remember seeing libreoffice can import PDFs these > days. I wonder if it is any better at layout? If so, you > could import PDF and export HTML from office. LibreOffice - File\Export as PDF…\General\General - Embed OpenDocument file Makes this PDF easily editable in LibreOffice LibreOffice Help: Embed OpenDocument file This setting enables you to export the document as a .pdf file containing two file formats: PDF and ODF. In PDF viewers it behaves like a normal .pdf file and it remains fully editable in LibreOffice. poma -- users mailing list users@xxxxxxxxxxxxxxxxxxxxxxx To unsubscribe or change subscription options: https://admin.fedoraproject.org/mailman/listinfo/users Fedora Code of Conduct: http://fedoraproject.org/code-of-conduct Guidelines: http://fedoraproject.org/wiki/Mailing_list_guidelines Have a question? Ask away: http://ask.fedoraproject.org