Abstract
It is important to recognize the destination city name correctly for a postal document to reach its desired address. In India people often mix up scripts while writing the address. Often the script of the destination city name is different from the other part of the postal document. This is common in India due to the multilingual and multi script nature of the country. In this paper, a Convolutional Neural Network (CNN) based approach towards the recognition of handwritten multilingual multiscript Indian city names is presented. Experiments were performed not only in a single script scenario but also in multi script, considering English, Bangla and Devanagari scripts. An accuracy of 91.72% was obtained on 106 city names in mixed script scenario from the proposed scheme and the data set will be made available to the researcher on request. Further experiments were also performed with different script combinations and obtained results up to 98.01%. The system also produced a mean performance difference of approximately ± 1% for successive changes in the data set size, thereby pointing to the robustness of the proposed architecture.
Similar content being viewed by others
References
Acharyya A, Rakshit S, Sarkar R, Basu S, Nasipuri M (2013) Handwritten word recognition using mlp based classifier: a holistic approach. International Journal of Computer Science Issues (IJCSI) 10(2 Part 2):422
Basu S, Seth SS, Sarkar P, Das B, Dey S, Ghosh S (2005) Recognition of pincodes from indian postal documents. Soft Comput, 239–245
Bera S, Shrivastava VK (2020) Analysis of various optimizers on deep convolutional neural network model in the application of hyperspectral remote sensing image classification. Int J Remote Sens 41(7):2664–2683
Chaudhuri BB (1995) Relational studies between phoneme and grapheme statistics in current bangla. J Acoust Soc India 23:67–77
Chevtchenko SF, Vale R F, Macario V, Cordeiro F R (2018) A convolutional neural network with feature fusion for real-time hand posture recognition. Appl Soft Comput 73:748–766
Gao X, Jin L (2012) A vision-based fast chinese postal envelope identification system. J Inform Sci Eng 28(1):31–49
Ghosh D, Dube T, Shivaprasad A (2010) Script recognition—a review. IEEE Trans Pattern Anal Machine Intell 32(12):2142–2161
Ghosh M, Mukherjee H, Obaidullah SM, Santosh KC, Das N, Roy K (2019) Identifying the presence of graphical texts in scene images using cnn. In: 2019 international conference on document analysis and recognition workshops (ICDARW), vol 1. IEEE, pp 86–91
Ghosh M, Roy S S, Mukherjee H, Obaidullah SM, Santosh KC, Roy K (2021) Understanding movie poster: transfer-deep learning approach for graphic-rich text recognition. Vis Comput, 1–20
Halder C, Obaidullah S M, Santosh K C, Roy K (2018) Content independent writer identification on bangla script: A document level approach. Int J Pattern Recognit Artif Intell 32(9):1856011:1–1856011:24
Hijam D, Saharia S, Nirmal Y (2018) Towards a complete character set meitei mayek handwritten character recognition. In: 2018 fourth international conference on computing communication control and automation (ICCUBEA). IEEE, pp 1–5
Hou Y, Zhao H (2017) Handwritten digit recognition based on depth neural network. In: international conference on intelligent informatics and biomedical sciences 2017, Track2: Artificial Intelligence, Robotics and Human-Computer Interaction, Okinawa, Japan, pp 35–38
Howard A G, Zhu M, Chen B, Kalenichenko D, Wang W, Weyand T, Andreetto M, Adam H (2017) Mobilenets: Efficient convolutional neural networks for mobile vision applications. arXiv:1704.04861
Kaur H, Kumar M (2018) Benchmark dataset: offline handwritten gurmukhi city names for postal automation. In: Workshop on document analysis and recognition. Springer, pp 152–159
Kaur H, Kumar M (2021) Offline handwritten gurumukhi word recognition using extreme gradient boosting methodology. Soft Comput 25(6):4451–4464
LeCun Y, Bengio Y, Hinton G (2015) Deep learning. nature 521(7553):436–444
Liu L, Lu S, Lu Y, Suen CY (2014) Application of pr techniques to mail sorting in china. In: Proceedings of the 2014 international conference on computer science & software engineering, pp 1–7
Mukherjee H, Dhar A, Obaidullah S M, Phadikar S, Roy K (2020) Image-based features for speech signal classification. Multimed Tools Appl 79(47):34913–34929
Nagabhushan P, Angadi SA, Anami BS (2005) Symbolic data structure for postal address representation and address validation through symbolic knowledge base. In: International conference on pattern recognition and machine intelligence. Springer, pp 388–394
Nagabhushan P, Angadi SA, Anami BS (2006) A fuzzy symbolic inference system for postal address component extraction and labelling. In: International conference on fuzzy systems and knowledge discovery. Springer, pp 937–946
Nagabhushan P, Angadi SA, Anami BS (2009) A soft computing model for mapping incomplete/approximate postal addresses to mail delivery points. Appl Soft Comput 9(2):806–816
Obaidullah S M, Halder C, Santosh KC, Das N, Roy K (2018) Phdindic_11: page-level handwritten document image dataset of 11 official indic scripts for script identification. Multimed Tools Appl 77(2):1643–1678
Olivas-Padilla BE, Chacon-Murguia MI (2019) Classification of multiple motor imagery using deep convolutional neural networks and spatial filters. Appl Soft Comput 75:461–472
Pal U, Jayadevan R, Sharma N (2012) Handwriting recognition in indian regional scripts: a survey of offline techniques. ACM Trans on Asian Language Inform Process (TALIP) 11(1):1–35
Pal U, Roy K, Kimura F (2009) A lexicon-driven handwritten city-name recognition scheme for indian postal automation. IEICE Trans Inform Syst 92(5):1146–1158
Pal U, Roy R K, Kimura F (2010) Bangla and english city name recognition for indian postal automation. In: 2010 20th international conference on pattern recognition. IEEE, pp 1985–1988
Pal U, Roy RK, Kimura F (2012) Multi-lingual city name recognition for indian postal automation. In: 2012 international conference on frontiers in handwriting recognition. IEEE, pp 169–173
Pal U, Roy R K, Roy K, Kimura F (2009) Indian multi-script full pin-code string recognition for postal automation. In: 2009 10th international conference on document analysis and recognition. IEEE, pp 456–460
Patel MS, Reddy S L, Naik A J (2015) An efficient way of handwritten english word recognition. In: Proceedings of the 3rd international conference on frontiers of intelligent computing: Theory and Applications (FICTA) 2014. Springer, pp 563–571
Rakshit P, Halder C, Ghosh S, Roy K (2018) Line, word, and character segmentation from bangla handwritten texta precursor toward bangla hocr. In: Advanced computing and systems for security. Springer, pp 109–120
Roy K (2008) On the development of an optical character recognition system for indian postal automation. Ph.D. Thesis, PhD Thesis, Jadavpur University
Roy K, Vajda S, Pal U, Chaudhuri BB, Belaïd A (2005) A system for indian postal automation. In: Eighth international conference on document analysis and recognition (ICDAR’05). IEEE, pp 1060–1064
Roy R K, Pal U, Roy K, Kimura F (2020) A system for recognition of destination address in postal documents of india. Malaysian J Comput Sci 33(3):202–216
Sang J, Yu J, Jain R, Lienhart R, Cui P, Feng J (2018) Deep learning for multimedia: Science or technology?. In: Proceedings of the 26th ACM international conference on multimedia, pp 1354–1355
Sharma N, Sengupta A, Sharma R, Pal U, Blumenstein M (2017) Pincode detection using deep cnn for postal automation. In: 2017 international conference on image and vision computing New Zealand (IVCNZ). IEEE, pp 1–6
Shaw B, Parui S K, Shridhar M (2008) Offline handwritten devanagari word recognition: A segmentation based approach. In: 2008 19th international conference on pattern recognition. IEEE, pp 1–4
Thadchanamoorthy S, Kodikara ND, Premaretne HL, Pal U, Kimura F (2013) Tamil handwritten city name database development and recognition for postal automation. In: 2013 12th international conference on document analysis and recognition. IEEE, pp 793–797
Vajda S, Roy K, Pal U, Chaudhuri B B, Belaid A (2009) Automation of indian postal documents written in bangla and english. Int J Pattern Recognit Artif Intell 23(08):1599–1632
Wanchoo SA, Yadav P, Anuse A (2016) A survey on devanagari character recognition for indian postal system automation. Int J Appl Eng Res 11 (6):4529–4536
Wang Y, Chen Y, Yang N, Zheng L, Dey N, Ashour A S, Rajinikanth V, Tavares JMRS, Shi F (2019) Classification of mice hepatic granuloma microscopic images based on a deep convolutional neural network. Appl Soft Comput 74:40–50
Zhang D, Han X, Deng C (2018) Review on the research and practice of deep learning and reinforcement learning in smart grids. CSEE J Power Energ Syst 4(3):362–370
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Roy, R.K., Mukherjee, H., Roy, K. et al. CNN based recognition of handwritten multilingual city names. Multimed Tools Appl 81, 11501–11517 (2022). https://doi.org/10.1007/s11042-022-12193-8
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11042-022-12193-8