Abstract:
The reading process has been widely studied and there is a general agreement among researchers that knowledge in different forms and at different levels plays a vital rol...Show MoreMetadata
Abstract:
The reading process has been widely studied and there is a general agreement among researchers that knowledge in different forms and at different levels plays a vital role. This is the underlying philosophy of the Devanagari document recognition system described in this work. The knowledge sources we use are mostly statistical in nature or in the form of a word dictionary tailored specifically for optical character recognition (OCR). We do not perform any reasoning on these. However, we explore their relative importance and role in the hierarchy. Some of the knowledge sources are acquired a priori by an automated training process while others are extracted from the text as it is processed. A complete Devanagari OCR system has been designed and tested with real-life printed documents of varying size and font. Most of the documents used were photocopies of the original. A performance of approximately 90% correct recognition is achieved.
Published in: IEEE Transactions on Systems, Man, and Cybernetics - Part A: Systems and Humans ( Volume: 30, Issue: 4, July 2000)
DOI: 10.1109/3468.852443
Keywords assist with retrieval of results and provide a means to discovering other relevant content. Learn more.
- IEEE Keywords
- Index Terms
- Source Of Knowledge ,
- Optical Character Recognition ,
- Forms Of Knowledge ,
- Correct Recognition ,
- Words In The Lexicon ,
- Classification Process ,
- System Architecture ,
- Position Of Point ,
- Contextual Knowledge ,
- Vertical Bars ,
- Text Lines ,
- Hierarchical Manner ,
- Number Of Boxes ,
- True Quality ,
- Indian Languages ,
- Width Analysis
Keywords assist with retrieval of results and provide a means to discovering other relevant content. Learn more.
- IEEE Keywords
- Index Terms
- Source Of Knowledge ,
- Optical Character Recognition ,
- Forms Of Knowledge ,
- Correct Recognition ,
- Words In The Lexicon ,
- Classification Process ,
- System Architecture ,
- Position Of Point ,
- Contextual Knowledge ,
- Vertical Bars ,
- Text Lines ,
- Hierarchical Manner ,
- Number Of Boxes ,
- True Quality ,
- Indian Languages ,
- Width Analysis