Re: pdftohtml encoding question

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

 



On 3/10/08, François Patte <francois.patte@xxxxxxxxxxxxxxxxxxxxxxxx> wrote:
> -----BEGIN PGP SIGNED MESSAGE-----
> Hash: SHA1
>
> bonsoir,
>
> I am trying to convert a pdf file into html using pdftohtml provided by f8.
>
> I get an html file with "nice" characters like: ’ insead of apostroph,
> or Ã(c) instead of é...
>
> so i think that there is some coding problem.
>
> Using man pdftohtml, I got this info:
> - -enc <string>
> ~ output text encoding name
>
>
> but, I am unable to guess what is the syntax to use in order to have a
> correct output in utf8 for:
>
> Error: Couldn't find unicodeMap file for the 'utf8' encoding
>
> is the only answer I get if I try:
>
> pdftohtml -enc utf8 myfile.pdf
>
>
> i tried utf-8, latin1, latin-1, ISO_8859-1, .... without any success.
>
>
> If somebody knows... many thnaks in advance.

I don't, but

man pdftohtml

 ->  Pdftohtml was developed by Gueorgui Ovtcharov and Rainer Dorsch. It  is
     based and benefits a lot from Derek Noonburg?s xpdf package.

man xpdf

 ->  -enc encoding-name
          Sets the encoding to use for  text  output.   The  encoding-name
          must  be  defined  with  the unicodeMap command (see xpdfrc(5)).
          This defaults to "Latin1" (which is a built-in encoding).  [con-
          fig file: textEncoding]

man xpdfrc

 ->  unicodeMap encoding-name map-file
          [...]
          The Latin1, ASCII7, Symbol, ZapfDingbats,  UTF-8,  and
          UCS-2 encodings are predefined.

I'm afraid you'll have to read the elided part if you need an encoding
other than these six.

Hope this helps,

Andras

-- 
fedora-list mailing list
fedora-list@xxxxxxxxxx
To unsubscribe: https://www.redhat.com/mailman/listinfo/fedora-list
[Index of Archives]     [Older Fedora Users]     [Fedora Announce]     [Fedora Package Announce]     [EPEL Announce]     [Fedora Magazine]     [Fedora News]     [Fedora Summer Coding]     [Fedora Laptop]     [Fedora Cloud]     [Fedora Advisory Board]     [Fedora Education]     [Fedora Security]     [Fedora Scitech]     [Fedora Robotics]     [Fedora Maintainers]     [Fedora Infrastructure]     [Fedora Websites]     [Anaconda Devel]     [Fedora Devel Java]     [Fedora Legacy]     [Fedora Desktop]     [Fedora Fonts]     [ATA RAID]     [Fedora Marketing]     [Fedora Management Tools]     [Fedora Mentors]     [SSH]     [Fedora Package Review]     [Fedora R Devel]     [Fedora PHP Devel]     [Kickstart]     [Fedora Music]     [Fedora Packaging]     [Centos]     [Fedora SELinux]     [Fedora Legal]     [Fedora Kernel]     [Fedora OCaml]     [Coolkey]     [Virtualization Tools]     [ET Management Tools]     [Yum Users]     [Tux]     [Yosemite News]     [Gnome Users]     [KDE Users]     [Fedora Art]     [Fedora Docs]     [Asterisk PBX]     [Fedora Sparc]     [Fedora Universal Network Connector]     [Libvirt Users]     [Fedora ARM]

  Powered by Linux