Re: returning to bookshare.org topic

When you unzip a Bookshare book, you get a pile of files.  One is
the XML file, which holds the content of the book.  Another is
named daisyTransform.xsl.
You can use that XSL file to turn the XML into HTML with xsltproc:

xsltproc daisyTransform.xsl Book.xml > Book.html

The xsltproc command usually comes in a package named libxslt (on
Fedora, for example) or xsltproc (on Debian and Ubuntu), or something
similar.
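
If you have several books to convert, the same command can be wrapped in
a small shell function.  This is only a sketch: the name book_to_html is
made up, and it assumes the unzipped directory holds daisyTransform.xsl
and that every *.xml file in it is book content.

```shell
# Hypothetical helper: run xsltproc on each XML file in an unzipped
# Bookshare directory, writing Book.html next to Book.xml.
book_to_html() {
    dir="$1"
    xsl="$dir/daisyTransform.xsl"
    if [ ! -f "$xsl" ]; then
        echo "no daisyTransform.xsl in $dir" >&2
        return 1
    fi
    for xml in "$dir"/*.xml; do
        # Skip the unexpanded pattern when no XML files exist.
        [ -f "$xml" ] || continue
        xsltproc "$xsl" "$xml" > "${xml%.xml}.html" || return 1
    done
}

# Example: book_to_html ./MyBook
```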

I wrote a little script to convert HTML to text with elinks, using all
the options and settings I prefer.  It also produces output in UTF-8, so
Unicode characters are faithfully preserved.
http://the-brannons.com/useful-scripts/cmb-html2txt
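
If you just want the general idea without fetching the script, something
like the following should be close.  It is not the script linked above,
only a guess at a reasonable set of elinks dump options, and the function
name html_to_text is made up.

```shell
# Hypothetical helper, not the linked script: dump an HTML file as
# UTF-8 plain text, without link numbers or a trailing reference list.
html_to_text() {
    elinks -dump -dump-charset utf-8 -no-numbering -no-references "$1"
}

# Example: html_to_text Book.html > Book.txt
```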
By default, elinks indents everything by 3 spaces, which is kind of
annoying.  So you'll want the line

    set document.browse.margin_width = 0

in your config file, either /etc/elinks/elinks.conf or
$HOME/.elinks/elinks.conf.
If there's an existing document.browse.margin_width line, just modify
that one.
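
If you would rather not edit the file by hand, here is a rough sketch
that does the same thing: it rewrites an existing margin_width line, or
appends one if there is none.  The function name set_elinks_margin is
made up, and you pass it whichever config file you actually use.

```shell
# Hypothetical helper: make sure the given elinks config file sets
# document.browse.margin_width to 0.
set_elinks_margin() {
    conf="$1"
    setting='set document.browse.margin_width = 0'
    pat='^[[:space:]]*set[[:space:]][[:space:]]*document\.browse\.margin_width[[:space:]]*=.*'
    mkdir -p "$(dirname "$conf")"
    touch "$conf"
    if grep -q "$pat" "$conf"; then
        # An existing line is present: rewrite it in place.
        tmp=$(mktemp)
        sed "s/$pat/$setting/" "$conf" > "$tmp" && cat "$tmp" > "$conf"
        rm -f "$tmp"
    else
        # No such line yet: append one.
        printf '%s\n' "$setting" >> "$conf"
    fi
}

# Example: set_elinks_margin "$HOME/.elinks/elinks.conf"
```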

Good luck,
-- Chris

_______________________________________________
Blinux-list mailing list
Blinux-list@xxxxxxxxxx
https://www.redhat.com/mailman/listinfo/blinux-list


