quick unrtf question?

Jason White jason at jasonjgw.net
Wed Dec 11 00:25:41 UTC 2013

Geoff Shang <blinux-list at redhat.com> wrote:
>I'm not a Bookshare member as I'm not in the USA.  

To be clear, you need not be in the USA to become a Bookshare member. However,
the range of titles available to you will be restricted to those which
Bookshare has permission to distribute internationally - mostly books received
directly from publishers, and there are thousands of those.

>But if what I've seen 
>is a typical representation of Bookshare books, it's trivial to convert 
>these to HTML.
>When I first saw this thread, I was wondering if you were wanting to 
>convert Word 2007/2010 docx files.  These files are really zip files with 
>an XML document and a bunch of related files.

For those:
loffice --headless --convert-to html filename.docx

This requires LibreOffice to be installed, but it is likely to give you the
most accurate conversion available. The result is written to the current
directory as filename.html. Of course, the quality of the result is limited by
the quality of the original file: many word processor users don't know how to
use styles correctly, if at all.
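
If you have a whole directory of .docx files, the same command can be wrapped
in a loop. What follows is only a rough sketch under a few assumptions: the
function name convert_docx_dir and the LOFFICE variable are my own inventions
for illustration, and on some systems the launcher is called soffice or
libreoffice rather than loffice. The --outdir option tells LibreOffice where
to write the converted files.

```shell
#!/bin/sh
# Sketch: convert every .docx file in one directory to HTML in another.
# LOFFICE and convert_docx_dir are illustrative names, not part of LibreOffice.
# Override LOFFICE if your launcher is named soffice or libreoffice.
LOFFICE="${LOFFICE:-loffice}"

convert_docx_dir() {
    src_dir="$1"    # directory containing the .docx files
    out_dir="$2"    # directory that receives the .html files

    mkdir -p "$out_dir" || return 1

    for f in "$src_dir"/*.docx; do
        # If the glob matched nothing, the pattern is left unexpanded; skip it.
        [ -e "$f" ] || continue
        "$LOFFICE" --headless --convert-to html --outdir "$out_dir" "$f"
    done
}
```

For example, after loading the function into your shell, running
convert_docx_dir ~/books ~/books/html should leave an .html file for each
document in ~/books/html.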

