OCR on linux
Willem van der Walt
wvdwalt at csir.co.za
Thu Apr 24 10:12:07 UTC 2008
You can try tesseract or another ocr engine without having sane working if
you have an image file that you know has text in like a .tif file or so.
Sane is needed to get the stuff from the paper in the scanner into an
Note: the current latest svn version of ocropus does not run, try a
version from about two weeks ago.
On Thu, 24 Apr 2008, Daniel Dalton wrote:
> On Fri, 18 Apr 2008, John J. Boyer wrote:
> > Try tesseract, which is available from code.google. It seems to be
> > pretty good. I don't have the URL handy but try googling for tesseract
> > code.google.
> Good, I'll try it.
> I guess I should get sane working first though.
> Daniel Dalton
> <d.dalton at iinet.net.au>
> Blinux-list mailing list
> Blinux-list at redhat.com
This message is subject to the CSIR's copyright terms and conditions, e-mail legal notice, and implemented Open Document Format (ODF) standard.
The full disclaimer details can be found at http://www.csir.co.za/disclaimer.html.
This message has been scanned for viruses and dangerous content by MailScanner,
and is believed to be clean. MailScanner thanks Transtec Computers for their support.
More information about the Blinux-list