OCR on linux

Willem van der Walt wvdwalt at csir.co.za
Thu Apr 24 10:12:07 UTC 2008


You can try tesseract or another ocr engine without having sane working if 
you have an image file that you know has text in like a .tif file or so.
Sane is needed to get the stuff from the paper in the scanner into an 
image file.
Note: the current latest svn version of ocropus does not run, try a 
version from about two weeks ago.


On Thu, 24 Apr 2008, Daniel Dalton wrote:

> On Fri, 18 Apr 2008, John J. Boyer wrote:
> 
> > Try tesseract, which is available from code.google. It seems to be
> > pretty good. I don't have the URL handy but try googling for tesseract
> > code.google.
> 
> Good, I'll try it.
> I guess I should get sane working first though.
> 
> Thanks,
> 
> -- 
> Daniel Dalton
> 
> http://members.iinet.net.au/~ddalton/
> <d.dalton at iinet.net.au>
> 
> _______________________________________________
> Blinux-list mailing list
> Blinux-list at redhat.com
> https://www.redhat.com/mailman/listinfo/blinux-list
> 

-- 
This message is subject to the CSIR's copyright terms and conditions, e-mail legal notice, and implemented Open Document Format (ODF) standard. 
The full disclaimer details can be found at http://www.csir.co.za/disclaimer.html.

This message has been scanned for viruses and dangerous content by MailScanner, 
and is believed to be clean.  MailScanner thanks Transtec Computers for their support.




More information about the Blinux-list mailing list