Abstract:
The reading process has been widely studied and there is a general agreement among researchers that knowledge in different forms and at different levels plays a vital rol...Show MoreMetadata
Abstract:
The reading process has been widely studied and there is a general agreement among researchers that knowledge in different forms and at different levels plays a vital role. This is the underlying philosophy of the Devanagari document recognition system described in this work. The knowledge sources we use are mostly statistical in nature or in the form of a word dictionary tailored specifically for optical character recognition (OCR). We do not perform any reasoning on these. However, we explore their relative importance and role in the hierarchy. Some of the knowledge sources are acquired a priori by an automated training process while others are extracted from the text as it is processed. A complete Devanagari OCR system has been designed and tested with real-life printed documents of varying size and font. Most of the documents used were photocopies of the original. A performance of approximately 90% correct recognition is achieved.
Published in: IEEE Transactions on Systems, Man, and Cybernetics - Part A: Systems and Humans ( Volume: 30, Issue: 4, July 2000)
DOI: 10.1109/3468.852443
Citations are not available for this document.
Cites in Papers - |
Cites in Papers - IEEE (25)
Select All
1.
Sandhya Arora, Latesh Malik, Sonakshi Goyal, Debotosh Bhattacharjee, Mita Nasipuri, Ondrej Krejcar, "Devanagari Character Recognition: A Comprehensive Literature Review", IEEE Access, vol.13, pp.1249-1284, 2025.
2.
Aditi Moudgil, Saravjeet Singh, "Devanagari manuscript labeling using feature extraction: A survey and future directions", 2021 9th International Conference on Reliability, Infocom Technologies and Optimization (Trends and Future Directions) (ICRITO), pp.1-4, 2021.
3.
Shilpa Mangesh Pande, Bineet Kumar Jha, "Character Recognition System for Devanagari Script Using Machine Learning Approach", 2021 5th International Conference on Computing Methodologies and Communication (ICCMC), pp.899-903, 2021.
4.
Agam Dwivedi, Rohit Saluja, Ravi Kiran Sarvadevabhatla, "An OCR for Classical Indic Documents Containing Arbitrarily Long Words", 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), pp.2386-2393, 2020.
5.
Sobia Habib, Manoj Kumar Shukla, Rajiv Kapoor, "Degraded Script Identification of Urdu and Devanagari Document-A Survey", 2019 4th International Conference on Information Systems and Computer Networks (ISCON), pp.79-83, 2019.
6.
Brijeshwar Dessai, Amit Patil, "A Deep Learning Approach for Optical Character Recognition of Handwritten Devanagari Script", 2019 2nd International Conference on Intelligent Computing, Instrumentation and Control Technologies (ICICICT), vol.1, pp.1160-1165, 2019.
7.
S. T. Deokate, N. J. Uke, "Hybrid methods for Segmenting and Identifying the Marathi Text", 2019 IEEE 5th International Conference for Convergence in Technology (I2CT), pp.1-5, 2019.
8.
Meduri Avadesh, Navneet Goyal, "Optical Character Recognition for Sanskrit Using Convolution Neural Networks", 2018 13th IAPR International Workshop on Document Analysis Systems (DAS), pp.447-452, 2018.
9.
Deepika Gupta, Soumen Bag, "An Efficient Character Segmentation Approach for Handwritten Hindi Text", 2018 5th International Conference on Signal Processing and Integrated Networks (SPIN), pp.730-734, 2018.
10.
Vishal Chavan, Abhijit Malage, Kapil Mehrotra, Manish Kumar Gupta, "Printed text recognition using BLSTM and MDLSTM for Indian languages", 2017 Fourth International Conference on Image Information Processing (ICIIP), pp.1-6, 2017.
11.
Deepa Berchmans, S S Kumar, "Optical character recognition: An overview and an insight", 2014 International Conference on Control, Instrumentation, Communication and Computational Technologies (ICCICCT), pp.1361-1365, 2014.
12.
Latesh Malik, "A Graph Based Approach for Handwritten Devanagri Word Recogntion", 2012 Fifth International Conference on Emerging Trends in Engineering and Technology, pp.309-313, 2012.
13.
R. Jayadevan, Satish R. Kolhe, Pradeep M. Patil, Umapada Pal, "Offline Recognition of Devanagari Script: A Survey", IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews), vol.41, no.6, pp.782-796, 2011.
14.
Vandana M. Ladwani, Latesh Malik, "Novel Approach to Segmentation of Handwritten Devnagari Word", 2010 3rd International Conference on Emerging Trends in Engineering and Technology, pp.219-224, 2010.
15.
Anilkumar N. Holambe, Sushilkumar N. Holambe, Ravinder.C. Thool, "Comparative study of Devanagari handwritten and printed character & numerals recognition using Nearest-Neighbor classifiers", 2010 3rd International Conference on Computer Science and Information Technology, vol.1, pp.426-430, 2010.
16.
Tummala Ranga Babu, N. Venkata Rao, L. Pratap Reddy, T. S. Krishna Prabhu, B. Raveendra Babu, "Syllable segmentation of Telugu document images", 2010 2nd International Conference on Signal Processing Systems, vol.2, pp.V2-78-V2-82, 2010.
17.
Ujjwal Bhattacharya, B.B. Chaudhuri, "Handwritten Numeral Databases of Indian Scripts and Multistage Recognition of Mixed Numerals", IEEE Transactions on Pattern Analysis and Machine Intelligence, vol.31, no.3, pp.444-457, 2009.
18.
A.S. Chandrasekhara Sastry, Satyaprasad Lanka, P.Paul Clee, L. Pratap Reddy, "Combining spatial and transform features for the recognition of middle zone components of Telugu", TENCON 2008 - 2008 IEEE Region 10 Conference, pp.1-5, 2008.
19.
Pratap Reddy L., Satyaprasad L., A.S.C.S. Sastry, "Middle Zone Component Extraction and Recognition of Telugu Document Image", Ninth International Conference on Document Analysis and Recognition (ICDAR 2007), vol.2, pp.584-588, 2007.
20.
B.V. Dhandra, Mallikarjun Hangarge, Ravindra Hegadi, V.S. Malemath, "Word Level Script Identification in Bilingual Documents through Discriminating Features", 2007 International Conference on Signal Processing, Communications and Networking, pp.630-635, 2007.
21.
S. Kompalli, S. Setlur, V. Govindaraju, "Design and comparison of segmentation driven and recognition driven Devanagari OCR", Second International Conference on Document Image Analysis for Libraries (DIAL'06), pp.7 pp.-102, 2006.
22.
U. Bhattacharya, B.B. Chaudhuri, "Databases for research on recognition of handwritten characters of Indian scripts", Eighth International Conference on Document Analysis and Recognition (ICDAR'05), pp.789-793 Vol. 2, 2005.
23.
S.W.K. Chan, "Extraction of salient textual patterns: synergy between lexical cohesion and contextual coherence", IEEE Transactions on Systems, Man, and Cybernetics - Part A: Systems and Humans, vol.34, no.2, pp.205-218, 2004.
24.
Venu Govindaraju, Swapnil Khedekar, Suryaprakash Kompalli, F. Farooq, Srirangaraj Setlur, Ramanaprasad Vemulapati, "Tools for enabling digital access to multi-lingual Indic documents", First International Workshop on Document Image Analysis for Libraries, 2004. Proceedings., pp.122-133, 2004.
25.
S. Setlur, S. Kompalli, V. Ramanaprasad, V. Govindaraju, "Creation of data resources and design of an evaluation test bed for Devanagari script recognition", Proceedings. Seventeenth Workshop on Parallel and Distributed Simulation, pp.55-61, 2003.
Cites in Papers - Other Publishers (32)
1.
Snehal Gaikwad, Sanjay Nalbalwar, Anil Nandgaonkar, "Contextual information based segmentation and recognition of upper modifiers from Devanagari script", International Journal of Information Technology, 2023.
2.
Arti Khaparde, Vaidehi Deshmukh, Manisha Kowdiki, "Enhanced Nature-Inspired Algorithm-based Hybrid Deep Learning for Character Recognition in Sanskrit Language", Sensing and Imaging, vol.24, no.1, 2023.
3.
Sanasam Inunganbi, "A systematic review on handwritten document analysis and recognition", Multimedia Tools and Applications, 2023.
4.
Shaheera Saba Mohd Naseem Akhter, Priti P. Rege, "Hyper Parameter Optimization of CRNN for Printed Devanagari Script Recognition using Taguchis Method", ACM Transactions on Asian and Low-Resource Language Information Processing, vol.22, no.4, pp.1, 2023.
5.
H. R. Shiva Kumar, A. G. Ramakrishnan, "Lipi Gnani", ACM Transactions on Asian and Low-Resource Language Information Processing, vol.19, no.4, pp.1, 2020.
6.
Ankit K. Sharma, Dipak M. Adhyaru, Tanish H. Zaveri, "A Survey on Devanagari Character Recognition", Smart Systems and IoT: Innovations in Computing, vol.141, pp.429, 2020.
7.
Bhavesh Kataria, Harikrishna B. Jethva, "CNN-Bidirectional LSTM Based Optical Character Recognition of Sanskrit Manuscripts : A Comprehensive Systematic Literature Review", International Journal of Scientific Research in Computer Science, Engineering and Information Technology, pp.1362, 2019.
8.
Munish Kumar, M. K. Jindal, R. K. Sharma, Simpel Rani Jindal, "Character and numeral recognition for non-Indic and Indic scripts: a survey", Artificial Intelligence Review, vol.52, no.4, pp.2235, 2019.
9.
Narasimha Reddy Soora, Parag S. Deshpande, "Review of Feature Extraction Techniques for Character Recognition", IETE Journal of Research, vol.64, no.2, pp.280, 2018.
10.
Arindam Chaudhuri, Krupa Mandaviya, Pratixa Badelia, Soumya K. Ghosh, Optical Character Recognition Systems for Different Languages with Soft Computing, vol.352, pp.217, 2017.
11.
Khushneet Jindal, Rajiv Kumar, "A Novel Shape-Based Character Segmentation Method for Devanagari Script", Arabian Journal for Science and Engineering, 2017.
12.
Arindam Chaudhuri, Krupa Mandaviya, Pratixa Badelia, Soumya K. Ghosh, Optical Character Recognition Systems for Different Languages with Soft Computing, vol.352, pp.193, 2017.
13.
Munish Kumar, M. K. Jindal, R. K. Sharma, "Offline Handwritten Gurmukhi Character Recognition: Analytical Study of Different Transformations", Proceedings of the National Academy of Sciences, India Section A: Physical Sciences, 2016.
14.
Munish Kumar, M. K. Jindal, R. K. Sharma, Proceedings of the Second International Conference on Soft Computing for Problem Solving (SocProS 2012), December 28-30, 2012, vol.236, pp.711, 2014.
15.
Soumen Bag, Gaurav Harit, Partha Bhowmick, "Recognition of Bangla compound characters using structural decomposition", Pattern Recognition, vol.47, no.3, pp.1187, 2014.
16.
Munish Kumar, R. K. Sharma, M. K. Jindal, "Efficient Feature Extraction Techniques for Offline Handwritten Gurmukhi Character Recognition", National Academy Science Letters, vol.37, no.4, pp.381, 2014.
17.
O. V. Ramana Murthy, Sujoy Roy, Vipin Narang, M. Hanmandlu, Shorya Gupta, "An approach to divide pre-detected Devanagari words from the scene images into characters", Signal, Image and Video Processing, vol.7, no.6, pp.1071, 2013.
18.
Munish Kumar, R. K. Sharma, M. K. Jindal, "Size of Training Set Vis-à-vis Recognition Accuracy of Handwritten Character Recognition System", Journal of Emerging Technologies in Web Intelligence, vol.5, no.4, 2013.
19.
SOUMEN BAG, GAURAV HARIT, "A survey on optical character recognition for Bangla and Devanagari scripts", Sadhana, vol.38, no.1, pp.133, 2013.
20.
Vikas J. Dongre, Vijay H. Mankar, "Development of Comprehensive Devnagari Numeral and Character Database for Offline Handwritten Character Recognition", Applied Computational Intelligence and Soft Computing, vol.2012, pp.1, 2012.
21.
Yan Min Chen, Xi Zhong Lou, Jing Wen Zhan, Shan Qiang Wu, "Research on Coherence Structure Construction for Chinese Text", Key Engineering Materials, vol.474-476, pp.1648, 2011.
22.
Soumen Bag, Gaurav Harit, Advances in Computer Science and Information Technology, vol.131, pp.358, 2011.
23.
Jasbir Singh, Gurpreet Singh Lehal, Information Systems for Indian Languages, vol.139, pp.144, 2011.
24.
Munish Kumar, M. K. Jindal, R. K. Sharma, Advances in Digital Image Processing and Information Technology, vol.205, pp.268, 2011.
25.
Akshata Doiphode, Leena Ragha, Computational Intelligence and Information Technology, vol.250, pp.621, 2011.
26.
LATESH MALIK, P. S. DESHPANDE, "RECOGNITION OF HANDWRITTEN DEVANAGARI SCRIPT", International Journal of Pattern Recognition and Artificial Intelligence, vol.24, no.05, pp.809, 2010.
27.
Suryaprakash Kompalli, Srirangaraj Setlur, Venu Govindaraju, "Devanagari OCR using a recognition driven segmentation framework and stochastic language models", International Journal on Document Analysis and Recognition (IJDAR), vol.12, no.2, pp.123, 2009.
28.
B. B. Chaudhuri, Guide to OCR for Indic Scripts, pp.27, 2009.
29.
Henry S. Baird, Venugopal Govindaraju, Daniel P. Lopresti, Document Analysis Systems VI, vol.3163, pp.1, 2004.
30.
Suryaprakash Kompalli, Srirangaraj Setlur, Venugopal Govindaraju, Document Analysis Systems VI, vol.3163, pp.28, 2004.