Book People Archive

HP's open-source Tesseract OCR, any experience?



Tom Breuel pointed out to me a new project up at sourceforge, called
"tesseract-ocr", with "lvincent" listed as admin -- presumably Luc
Vincent (a document image processing expert now at Google).  There are
no files there, but they do seem to be at the University of Nevada -
Las Vegas ISRI site, at
http://www.isri.unlv.edu/downloads/ocr-prerelease-20051201.tar.bz2,
under the Apache 2 license.  According to the report at
http://www.isri.unlv.edu/downloads/AT-1995.pdf, it does a nice job, as
far as accuracy goes.

I was wondering if any adventurous explorer had tried it out yet, and
if so, what the results were like?

Bill