
An Overview of the Tesseract OCR Engine


Abstract:

The Tesseract OCR engine, as was the HP Research Prototype in the UNLV Fourth Annual Test of OCR Accuracy, is described in a comprehensive overview. Emphasis is placed on aspects that are novel or at least unusual in an OCR engine, including in particular the line finding, features/classification methods, and the adaptive classifier.

Published in: Ninth International Conference on Document Analysis and Recognition (ICDAR 2007)

Date of Conference: 23-26 September 2007

Date Added to IEEE Xplore: 05 November 2007

DOI: 10.1109/ICDAR.2007.4376991

Conference Location: Curitiba, Brazil

1. Introduction – Motivation and History

Tesseract is an open-source OCR engine that was developed at HP between 1984 and 1994. Like a supernova, it appeared from nowhere for the 1995 UNLV Annual Test of OCR Accuracy [1], shone brightly with its results, and then vanished back under the same cloak of secrecy under which it had been developed. Now for the first time, details of the architecture and algorithms can be revealed.
