Christopher Chaltain <blinux-list at redhat.com> wrote:
> There's also pdftohtml and pdftotext.

If you can get the original files (much better for accessibility
purposes), they can be converted to HTML with LibreOffice:

    loffice --headless --convert-to html filenames
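
For a directory with a mix of PDFs and original documents, the choice between the two approaches can be scripted. The following is only a rough sketch: the function name convert_one, the DRY_RUN variable, and the list of extensions are my own assumptions, and on some systems the LibreOffice launcher is called soffice or libreoffice rather than loffice.

```shell
#!/bin/sh
# Hypothetical helper: choose a converter from the file extension.
# Set DRY_RUN to any non-empty value to print the command instead of running it.
convert_one() {
    case "$1" in
        *.pdf)
            # PDF only: pdftotext (poppler) writes NAME.txt next to the input;
            # -layout tries to preserve the physical layout of the page.
            set -- pdftotext -layout "$1"
            ;;
        *.odt|*.doc|*.docx|*.rtf)
            # Original document: LibreOffice writes NAME.html to the current directory.
            # The extension list is an assumption; extend it as needed.
            set -- loffice --headless --convert-to html "$1"
            ;;
        *)
            echo "skipping $1: unrecognized extension" >&2
            return 1
            ;;
    esac

    if [ -n "${DRY_RUN:-}" ]; then
        # Show what would be run (file names with spaces are not re-quoted here).
        echo "$@"
    else
        "$@"
    fi
}
```

To preview without converting anything, something like `DRY_RUN=1; for f in *; do convert_one "$f"; done` prints one command per recognized file. Unset DRY_RUN to actually run them.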