Extracting ASCII text from a PDF Document

Kirk Reiser kirk at braille.uwo.ca
Thu Aug 12 13:14:21 UTC 2010


pdftotext is a different program, mine with the -v argument returns:

pdftotext version 3.02
Copyright 1996-2007 Glyph & Cog, LLC


It also outputs to a file with the basename but containing a .txt
extension.  I believe it is part of the xpdf utilities.


On Thu, 12 Aug 2010, Martin McCormick wrote:

> Kirk Reiser writes:
>> What happens when you run pdftotext on the file?
>
> $ pstotext  BCD996XT_v1.04.00_Protocol.pdf
>
> < BCD996XT Operation Specification >
> 200
> GPL Ghostscript GPL Ghostscript 8.628.62: : Unrecoverable error, exit code 1
> Unrecoverable error, exit code 1
> 7.13. REMOTE
>
> I think that 7.13.remote is some stray text that got outputted
> from the file before pstotext exploded.
>
> 	The output would have gone to standard output had it
> worked.
>
> Martin
>
> _______________________________________________
> Blinux-list mailing list
> Blinux-list at redhat.com
> https://www.redhat.com/mailman/listinfo/blinux-list
>

--
Kirk Reiser				The Computer Braille Facility
e-mail: kirk at braille.uwo.ca		University of Western Ontario
phone: (519) 661-3061




More information about the Blinux-list mailing list