HP's open-source Tesseract OCR, any experience?
- From: Bill Janssen <bill@[redacted]>
- Subject: HP's open-source Tesseract OCR, any experience?
- Date: Fri, 17 Mar 2006 21:07:31 PST
Tom Breuel pointed out to me a new project up at sourceforge, called
"tesseract-ocr", with "lvincent" listed as admin -- presumably Luc
Vincent (a document image processing expert now at Google). There are
no files there, but they do seem to be at the University of Nevada -
Las Vegas ISRI site, at
http://www.isri.unlv.edu/downloads/ocr-prerelease-20051201.tar.bz2,
under the Apache 2 license. According to the report at
http://www.isri.unlv.edu/downloads/AT-1995.pdf, it does a nice job, as
far as accuracy goes.
I was wondering if any adventurous explorer had tried it out yet, and
if so, what the results were like?
Bill