Abstract
Handwriting recognition is an important application of pattern recognition subject. Although some research studies of handwriting recognition of a few major Indian scripts can be found in the literature, the same is not true for many of the Indian scripts. Malayalam is one such script, and automatic recognition issues of this script remain largely unexplored till date. On the other hand, there are nearly 40 million people mainly living in the southern part of India whose native language is Malayalam. In the present article, we present our recent study of Malayalam offline handwritten word recognition. The main contributions of the present study are (a) pioneering development of a database for offline handwritten word samples of Malayalam script and (b) its benchmark recognition results based on a transfer learning strategy which involves a deep convolutional neural network (CNN) architecture for feature extraction and a support vector machine (SVM) for classification. Recognition result of the proposed architecture on the writer independent test set of Malayalam handwritten word sample database is quite satisfactory. Moreover, the same architecture has been found to improve the existing state of the art of offline handwriting recognition of several major Indian scripts.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Arica, N., Yarman-Vural, F.T.: An overview of character recognition focused on off-line handwriting. IEEE Trans. Syst. Man Cybern. Part C (Applications and Reviews) 31(2), 216–233 (2001)
Bharati, A., Rao, K.P., Sangal, R., Bendre, S.: Basic statistical analysis of corpus and cross comparison among corpora. Technical Report of the Indian Institute of Information Technology, pp. 1–11 (2000)
Bhattacharya, U., Chaudhuri, B.B.: Handwritten numeral databases of Indian scripts and multistage recognition of mixed numerals. IEEE Trans. Pattern Anal. Mach. Intell. 31(3), 444–457 (2009)
Bhattacharya, U., Ghosh, S.K., Parui, S.: A two stage recognition scheme for handwritten Tamil characters. In: 9th International Conference on Document Analysis and Recognition (ICDAR 2007), vol. 1, pp. 511–515 (2007)
Bhattacharya, U., Shridhar, M., Parui, S.K., Sen, P.K., Chaudhuri, B.B.: Offline recognition of handwritten Bangla characters: an efficient two-stage approach. Pattern Anal. Appl. 15(4), 445–458 (2012)
Bhowmik, T.K., Parui, S.K., Bhattacharya, U., Shaw, B.: An HMM based recognition scheme for handwritten Oriya numerals. In: 9th International Conference on Information Technology (ICIT ’06), pp. 105–110 (2006)
Bishop, C.M.: Pattern Recognition and Machine Learning. Springer, Berlin (2006)
Bottou, L.: Stochastic Gradient Descent Tricks, pp. 421–436. Springer, Berlin, Heidelberg (2012)
Burges, C.J.C.: A tutorial on support vector machines for pattern recognition. Data Min. Knowl. Discov. 2(2), 121–167 (1998)
Glorot, X., Bengio, Y.: Understanding the difficulty of training deep feedforward neural networks. In: International Conference on Artificial Intelligence and Statistics (2010)
Jayadevan, R., Kolhe, S.R., Patil, P.M., Pal, U.: Database development and recognition of handwritten Devanagari legal amount words. In: International Conference on Document Analysis and Recognition, pp. 304–308 (2011)
John, J., Pramod, K.V., Balakrishnan, K., Chaudhuri, B.B.: A two stage approach for handwritten Malayalam character recognition. In: 14th International Conference on Frontiers in Handwriting Recognition, pp. 199–204 (2014)
Kang, L., Kumar, J., Ye, P., Li, Y., Doermann, D.: Convolutional neural networks for document image classification. In: 22nd International Conference on Pattern Recognition (2014)
Krizhevsky, A., Sutskever, I., Hinton, G.E.: Imagenet classification with deep convolutional neural networks. In: Advances in Neural Information Processing Systems, pp. 1097–1105 (2012)
Krizhevsky, A., Sutskever, I., Hinton, G.E.: Imagenet classification with deep convolutional neural networks. In: Pereira, F., Burges, C.J.C., Bottou, L., Weinberger, K.Q. (eds.) Advances in Neural Information Processing Systems 25, pp. 1097–1105 (2012)
LeCun, Y., Bottou, L., Bengio, Y., Haffner, P.: Gradient-based learning applied to document recognition. Proc. IEEE 86(11), 2278–2324 (1998)
Madhvanath, S., Govindaraju, V.: The role of holistic paradigms in handwritten word recognition. IEEE Trans. Pattern Anal. Mach. Intell. 23(2), 149–164 (2001)
Maitra, D.S., Bhattacharya, U., Parui, S.K.: CNN based common approach to handwritten character recognition of multiple scripts. In: 13th International Conference on Document Analysis and Recognition (ICDAR), pp. 1021–1025 (2015)
Neeba, N.: Large Scale Character Classification. Ph.D. thesis, International Institute of Information Technology Hyderabad, India (2010)
Neeba, N., Namboodiri, A., Jawahar, C., Narayanan, P.: Recognition of Malayalam documents. In: Guide to OCR for Indic Scripts, pp. 125–146. Springer, Berlin (2009)
Pal, U., Chaudhuri, B.: Indian script character recognition: a survey. Pattern Recognit. 37(9), 1887–1899 (2004)
Pal, U., Roy, K., Kimura, F.: A lexicon driven method for unconstrained Bangla handwritten word recognition. In: 10th International Workshop on Frontiers in Handwriting Recognition (2006)
Pan, S.J., Yang, Q.: A survey on transfer learning. IEEE Trans. Knowl. Data Eng. 22(10), 1345–1359 (2010)
Plamondon, R., Srihari, S.N.: On-line and off-line handwriting recognition: a comprehensive survey. IEEE Trans. Pattern Anal. Mach. Intell. 22(1), 63–84 (2000)
Prema, S., Joseph, M.: Malayalam Frequency Count Study Report. Technical Report, Department of Linguistics, University of Kerala (2001)
Shaw, B., Bhattacharya, U., Parui, S.K.: Combination of features for efficient recognition of offline handwritten Devanagari words. In: 14th International Conference on Frontiers in Handwriting Recognition, pp. 240–245 (2014)
Shaw, B., Bhattacharya, U., Parui, S.K.: Offline handwritten Devanagari word recognition: information fusion at feature and classifier levels. In: 3rd IAPR Asian Conference on Pattern Recognition (ACPR), pp. 720–724 (2015)
Szegedy, C., Liu, W., Jia, Y., Sermanet, P., Reed, S., Anguelov, D., Erhan, D., Vanhoucke, V., Rabinovich, A.: Going deeper with convolutions. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 1–9 (2015)
Trier, Ø., Jain, A.K., Taxt, T.: Feature extraction methods for character recognition–a survey. Pattern Recogn 29, 641–662 (1996)
Varma, A.R.R.R.: Kerala Panineeyam. Kozhikode Poorna, 3rd ed (1997)
Zeiler, M.D.: Adadelta: An Adaptive Learning Rate Method. arXiv:1212.5701 (2012)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2019 Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Jino, P.J., Balakrishnan, K., Bhattacharya, U. (2019). Offline Handwritten Malayalam Word Recognition Using a Deep Architecture. In: Bansal, J., Das, K., Nagar, A., Deep, K., Ojha, A. (eds) Soft Computing for Problem Solving. Advances in Intelligent Systems and Computing, vol 816. Springer, Singapore. https://doi.org/10.1007/978-981-13-1592-3_73
Download citation
DOI: https://doi.org/10.1007/978-981-13-1592-3_73
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-13-1591-6
Online ISBN: 978-981-13-1592-3
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)