Skip to main content
Log in

CNN based recognition of handwritten multilingual city names

  • Published:
Multimedia Tools and Applications Aims and scope Submit manuscript

Abstract

It is important to recognize the destination city name correctly for a postal document to reach its desired address. In India people often mix up scripts while writing the address. Often the script of the destination city name is different from the other part of the postal document. This is common in India due to the multilingual and multi script nature of the country. In this paper, a Convolutional Neural Network (CNN) based approach towards the recognition of handwritten multilingual multiscript Indian city names is presented. Experiments were performed not only in a single script scenario but also in multi script, considering English, Bangla and Devanagari scripts. An accuracy of 91.72% was obtained on 106 city names in mixed script scenario from the proposed scheme and the data set will be made available to the researcher on request. Further experiments were also performed with different script combinations and obtained results up to 98.01%. The system also produced a mean performance difference of approximately ± 1% for successive changes in the data set size, thereby pointing to the robustness of the proposed architecture.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8

Similar content being viewed by others

References

  1. Acharyya A, Rakshit S, Sarkar R, Basu S, Nasipuri M (2013) Handwritten word recognition using mlp based classifier: a holistic approach. International Journal of Computer Science Issues (IJCSI) 10(2 Part 2):422

    Google Scholar 

  2. Basu S, Seth SS, Sarkar P, Das B, Dey S, Ghosh S (2005) Recognition of pincodes from indian postal documents. Soft Comput, 239–245

  3. Bera S, Shrivastava VK (2020) Analysis of various optimizers on deep convolutional neural network model in the application of hyperspectral remote sensing image classification. Int J Remote Sens 41(7):2664–2683

    Article  Google Scholar 

  4. Chaudhuri BB (1995) Relational studies between phoneme and grapheme statistics in current bangla. J Acoust Soc India 23:67–77

    Google Scholar 

  5. Chevtchenko SF, Vale R F, Macario V, Cordeiro F R (2018) A convolutional neural network with feature fusion for real-time hand posture recognition. Appl Soft Comput 73:748–766

    Article  Google Scholar 

  6. Gao X, Jin L (2012) A vision-based fast chinese postal envelope identification system. J Inform Sci Eng 28(1):31–49

    MathSciNet  Google Scholar 

  7. Ghosh D, Dube T, Shivaprasad A (2010) Script recognition—a review. IEEE Trans Pattern Anal Machine Intell 32(12):2142–2161

    Article  Google Scholar 

  8. Ghosh M, Mukherjee H, Obaidullah SM, Santosh KC, Das N, Roy K (2019) Identifying the presence of graphical texts in scene images using cnn. In: 2019 international conference on document analysis and recognition workshops (ICDARW), vol 1. IEEE, pp 86–91

  9. Ghosh M, Roy S S, Mukherjee H, Obaidullah SM, Santosh KC, Roy K (2021) Understanding movie poster: transfer-deep learning approach for graphic-rich text recognition. Vis Comput, 1–20

  10. Halder C, Obaidullah S M, Santosh K C, Roy K (2018) Content independent writer identification on bangla script: A document level approach. Int J Pattern Recognit Artif Intell 32(9):1856011:1–1856011:24

    Article  Google Scholar 

  11. Hijam D, Saharia S, Nirmal Y (2018) Towards a complete character set meitei mayek handwritten character recognition. In: 2018 fourth international conference on computing communication control and automation (ICCUBEA). IEEE, pp 1–5

  12. Hou Y, Zhao H (2017) Handwritten digit recognition based on depth neural network. In: international conference on intelligent informatics and biomedical sciences 2017, Track2: Artificial Intelligence, Robotics and Human-Computer Interaction, Okinawa, Japan, pp 35–38

  13. Howard A G, Zhu M, Chen B, Kalenichenko D, Wang W, Weyand T, Andreetto M, Adam H (2017) Mobilenets: Efficient convolutional neural networks for mobile vision applications. arXiv:1704.04861

  14. Kaur H, Kumar M (2018) Benchmark dataset: offline handwritten gurmukhi city names for postal automation. In: Workshop on document analysis and recognition. Springer, pp 152–159

  15. Kaur H, Kumar M (2021) Offline handwritten gurumukhi word recognition using extreme gradient boosting methodology. Soft Comput 25(6):4451–4464

    Article  Google Scholar 

  16. LeCun Y, Bengio Y, Hinton G (2015) Deep learning. nature 521(7553):436–444

    Article  Google Scholar 

  17. Liu L, Lu S, Lu Y, Suen CY (2014) Application of pr techniques to mail sorting in china. In: Proceedings of the 2014 international conference on computer science & software engineering, pp 1–7

  18. Mukherjee H, Dhar A, Obaidullah S M, Phadikar S, Roy K (2020) Image-based features for speech signal classification. Multimed Tools Appl 79(47):34913–34929

    Article  Google Scholar 

  19. Nagabhushan P, Angadi SA, Anami BS (2005) Symbolic data structure for postal address representation and address validation through symbolic knowledge base. In: International conference on pattern recognition and machine intelligence. Springer, pp 388–394

  20. Nagabhushan P, Angadi SA, Anami BS (2006) A fuzzy symbolic inference system for postal address component extraction and labelling. In: International conference on fuzzy systems and knowledge discovery. Springer, pp 937–946

  21. Nagabhushan P, Angadi SA, Anami BS (2009) A soft computing model for mapping incomplete/approximate postal addresses to mail delivery points. Appl Soft Comput 9(2):806–816

    Article  Google Scholar 

  22. Obaidullah S M, Halder C, Santosh KC, Das N, Roy K (2018) Phdindic_11: page-level handwritten document image dataset of 11 official indic scripts for script identification. Multimed Tools Appl 77(2):1643–1678

    Article  Google Scholar 

  23. Olivas-Padilla BE, Chacon-Murguia MI (2019) Classification of multiple motor imagery using deep convolutional neural networks and spatial filters. Appl Soft Comput 75:461–472

    Article  Google Scholar 

  24. Pal U, Jayadevan R, Sharma N (2012) Handwriting recognition in indian regional scripts: a survey of offline techniques. ACM Trans on Asian Language Inform Process (TALIP) 11(1):1–35

    Article  Google Scholar 

  25. Pal U, Roy K, Kimura F (2009) A lexicon-driven handwritten city-name recognition scheme for indian postal automation. IEICE Trans Inform Syst 92(5):1146–1158

    Article  Google Scholar 

  26. Pal U, Roy R K, Kimura F (2010) Bangla and english city name recognition for indian postal automation. In: 2010 20th international conference on pattern recognition. IEEE, pp 1985–1988

  27. Pal U, Roy RK, Kimura F (2012) Multi-lingual city name recognition for indian postal automation. In: 2012 international conference on frontiers in handwriting recognition. IEEE, pp 169–173

  28. Pal U, Roy R K, Roy K, Kimura F (2009) Indian multi-script full pin-code string recognition for postal automation. In: 2009 10th international conference on document analysis and recognition. IEEE, pp 456–460

  29. Patel MS, Reddy S L, Naik A J (2015) An efficient way of handwritten english word recognition. In: Proceedings of the 3rd international conference on frontiers of intelligent computing: Theory and Applications (FICTA) 2014. Springer, pp 563–571

  30. Rakshit P, Halder C, Ghosh S, Roy K (2018) Line, word, and character segmentation from bangla handwritten texta precursor toward bangla hocr. In: Advanced computing and systems for security. Springer, pp 109–120

  31. Roy K (2008) On the development of an optical character recognition system for indian postal automation. Ph.D. Thesis, PhD Thesis, Jadavpur University

  32. Roy K, Vajda S, Pal U, Chaudhuri BB, Belaïd A (2005) A system for indian postal automation. In: Eighth international conference on document analysis and recognition (ICDAR’05). IEEE, pp 1060–1064

  33. Roy R K, Pal U, Roy K, Kimura F (2020) A system for recognition of destination address in postal documents of india. Malaysian J Comput Sci 33(3):202–216

    Google Scholar 

  34. Sang J, Yu J, Jain R, Lienhart R, Cui P, Feng J (2018) Deep learning for multimedia: Science or technology?. In: Proceedings of the 26th ACM international conference on multimedia, pp 1354–1355

  35. Sharma N, Sengupta A, Sharma R, Pal U, Blumenstein M (2017) Pincode detection using deep cnn for postal automation. In: 2017 international conference on image and vision computing New Zealand (IVCNZ). IEEE, pp 1–6

  36. Shaw B, Parui S K, Shridhar M (2008) Offline handwritten devanagari word recognition: A segmentation based approach. In: 2008 19th international conference on pattern recognition. IEEE, pp 1–4

  37. Thadchanamoorthy S, Kodikara ND, Premaretne HL, Pal U, Kimura F (2013) Tamil handwritten city name database development and recognition for postal automation. In: 2013 12th international conference on document analysis and recognition. IEEE, pp 793–797

  38. Vajda S, Roy K, Pal U, Chaudhuri B B, Belaid A (2009) Automation of indian postal documents written in bangla and english. Int J Pattern Recognit Artif Intell 23(08):1599–1632

    Article  Google Scholar 

  39. Wanchoo SA, Yadav P, Anuse A (2016) A survey on devanagari character recognition for indian postal system automation. Int J Appl Eng Res 11 (6):4529–4536

    Google Scholar 

  40. Wang Y, Chen Y, Yang N, Zheng L, Dey N, Ashour A S, Rajinikanth V, Tavares JMRS, Shi F (2019) Classification of mice hepatic granuloma microscopic images based on a deep convolutional neural network. Appl Soft Comput 74:40–50

    Article  Google Scholar 

  41. Zhang D, Han X, Deng C (2018) Review on the research and practice of deep learning and reinforcement learning in smart grids. CSEE J Power Energ Syst 4(3):362–370

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Kaushik Roy.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Roy, R.K., Mukherjee, H., Roy, K. et al. CNN based recognition of handwritten multilingual city names. Multimed Tools Appl 81, 11501–11517 (2022). https://doi.org/10.1007/s11042-022-12193-8

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11042-022-12193-8

Keywords

Navigation