Abstract
The performance various academic and commercial text recognition solutions for many languages world-wide has been satisfactory. Many projects now use the ocr as a reliable module. As of now, Indian languages are far away from this state, which is unfortunate. Beyond many challenges due to script and language, this space is adversely affected by the scattered nature of research, lack of systematic evaluation, and poor resource dissemination. In this work, we aim to design and implement a web-based system that could indirectly address some of these aspects that hinder the development of ocr for Indian languages. We hope that such an attempt will help in (i) providing and establishing a consolidated view of state-of-the-art performances for character and word recognition at one place (ii) sharing resources and practices (iii) establishing standard benchmarks that clearly explain the capabilities and limitations of the recognition methods (iv) bringing research attempts from a wide variety of languages, scripts, and modalities into a common forum. We believe the proposed system will play a critical role in further promoting the research in the Indian language text recognition domain.
Keywords
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Notes
- 1.
- 2.
- 3.
- 4.
- 5.
- 6.
Overfitting is a modeling error that occurs when a function is too closely fit a limited set of data points.
- 7.
References
Achanta, R., Hastie, T.J.: Telugu OCR framework using deep learning. ArXiv (2015)
Ashwin, T.V., Sastry, P.S.: A font and size-independent OCR system for printed Kannada documents using support vector machines. Sadhana 27, 35–38 (2002)
Bansal, V., Sinha, R.: A complete OCR for printed Hindi text in Devanagari script. In: ICDAR (2001)
Bansal, V., Sinha, R.M.K.: A complete OCR for printed Hindi text in Devanagari script. In: ICDAR (2001)
Basu, S., Das, N., Sarkar, R., Kundu, M., Nasipuri, M., Basu, D.K.: Handwritten Bangla alphabet recognition using an MLP based classifier. CoRR (2012)
Breuel, T.M.: High performance text recognition using a hybrid convolutional-LSTM implementation. In: ICDAR (2017)
Chandramouli, C., General, R.: Census of India 2011. Government of India, Provisional Population Totals, New Delhi (2011)
Chaudhuri, B.B.: A complete handwritten numeral database of Bangla – a major Indic script. In: IWFHR (2006)
Cordts, M., et al.: The Cityscapes dataset for semantic urban scene understanding. In: CVPR (2016)
Das, N., Das, B., Sarkar, R., Basu, S., Kundu, M., Nasipuri, M.: Handwritten Bangla basic and compound character recognition using MLP and SVM classifier. ArXiv (2010)
Datta, A.K.: A generalized formal approach for description and analysis of major Indian scripts. IETE J. Res. (1984)
Dutta, K., Mathew, M., Krishnan, P., Jawahar, C.V.: Localizing and recognizing text in lecture videos. In: ICFHR (2018)
Everingham, M., Van Gool, L., Williams, C.K., Winn, J., Zisserman, A.: The Pascal visual object classes (VOC) challenge. IJCV 88, 303–338 (2010)
Gaur, A., Yadav, S.: Handwritten Hindi character recognition using k-means clustering and SVM. ISETTLIS (2015)
Gupta, V., Rathna, G.N., Ramakrishnan, K.: Automatic Kannada text extraction from camera captured images. In: MCDES, IISc Centenary Conference (2008)
Jain, M., Mathew, M., Jawahar, C.V.: Unconstrained OCR for Urdu using deep CNN-RNN hybrid networks. In: ACPR (2017)
Jomy, J., Pramod, K.V., Kannan, B.: Handwritten character recognition of south Indian scripts: a review. CoRR (2011)
Jordan, M.I., Mitchell, T.M.: Machine learning: trends, perspectives, and prospects. Science 349, 255–260 (2015)
Karatzas, D., Gómez, L., Nicolaou, A., Rusiñol, M.: The robust reading competition annotation and evaluation platform. In: DAS (2018)
Karatzas, D., et al.: ICDAR 2015 competition on robust reading. In: ICDAR (2015)
Karatzas, D., et al.: ICDAR 2013 robust reading competition. In: ICDAR (2013)
Klakow, D., Peters, J.: Testing the correlation of word error rate and perplexity. Speech Commun. 38, 19–28 (2002)
Krishnan, P., Sankaran, N., Singh, A.K., Jawahar, C.: Towards a robust OCR system for Indic scripts. In: DAS (2014)
Kumar, A., Jawahar, C.V.: Content-level annotation of large collection of printed document images. In: ICDAR (2007)
Levenshtein, V.: Binary codes capable of correcting deletions, insertions and reversals. Soviet Physics Doklady 10, 707–710 (1966)
Lucas, S.M., Panaretos, A., Sosa, L., Tang, A., Wong, S., Young, R.: ICDAR 2003 robust reading competitions. In: ICDAR (2003)
Mathew, M., Jain, M., Jawahar, C.V.: Benchmarking scene text recognition in Devanagari, Telugu and Malayalam (2017)
Mathew, M., Singh, A.K., Jawahar, C.V.: Multilingual OCR for Indic scripts. In: DAS (2016)
Nag, S., et al.: Offline extraction of Indic regional language from natural scene image using text segmentation and deep convolutional sequence. ArXiv (2018)
Negi, A., Bhagvati, C., Krishna, B.: An OCR system for Telugu. In: ICDAR (2001)
Omee, F.Y., Himel, S.S., Bikas, M.A.N.: A complete workflow for development of Bangla OCR. CoRR (2012)
Pal, U., Chaudhuri, B.: Indian script character recognition: a survey. Pattern Recogn. 37, 1887–1899 (2004)
Sankaran, N., Jawahar, C.V.: Recognition of printed Devanagari text using BLSTM neural network (2012)
Sarkar, R., Das, N., Basu, S., Kundu, M., Nasipuri, M., Basu, D.K.: Word level script identification from Bangla and Devanagri handwritten texts mixed with Roman script. CoRR (2010)
Setlur, S., Kompalli, S., Ramanaprasad, V., Govindaraju, V.: Creation of data resources and design of an evaluation test bed for Devanagari script recognition. In: WPDS (2003)
Shahab, A., Shafait, F., Dengel, A.: ICDAR 2011 robust reading competition challenge 2: reading text in scene images. In: ICDAR (2011)
Sheshadri, K., Ambekar, P.K.T., Prasad, D.P., Kumar, R.P.: An OCR system for printed Kannada using k-means clustering. In: ICIT (2010)
Sinha, R.M.K.: A journey from Indian scripts processing to Indian language processing. IEEE Ann. Hist. Comput. 31, 8–31 (2009)
Smith, R.: An overview of the Tesseract OCR engine. In: ICDAR (2007)
Stiehl, U.: Sanskrit-kompendium. Economica Verlag (2002)
Ye, Q., Doermann, D.S.: Text detection and recognition in imagery: a survey. IEEE Trans. Pattern Anal. Mach. Intell. 37, 1480–1500 (2015)
Zhu, Y., Yao, C., Bai, X.: Scene text detection and recognition: recent advances and future trends. Front. Comput. Sci. (2015)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2020 Springer Nature Switzerland AG
About this paper
Cite this paper
Tulsyan, K., Srivastava, N., Mondal, A., Jawahar, C.V. (2020). A Benchmark System for Indian Language Text Recognition. In: Bai, X., Karatzas, D., Lopresti, D. (eds) Document Analysis Systems. DAS 2020. Lecture Notes in Computer Science(), vol 12116. Springer, Cham. https://doi.org/10.1007/978-3-030-57058-3_6
Download citation
DOI: https://doi.org/10.1007/978-3-030-57058-3_6
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-57057-6
Online ISBN: 978-3-030-57058-3
eBook Packages: Computer ScienceComputer Science (R0)