Skip to main content
Log in

Handwritten Bangla character recognition using convolutional neural networks: a comparative study and new lightweight model

  • Original Article
  • Published:
Neural Computing and Applications Aims and scope Submit manuscript

Abstract

Handwriting is a crucial way to enhance character recognition and learn new words. However, the Bangla characters consist of very complex shapes and similar patterns. Deep learning (DL) techniques have become a prominent solution for handwritten Bangla character recognition (HBCR) due to their ability to extract high-level features from complex data. Several DL techniques have been proposed for HBCR, but they are computationally expensive and large in model size and thus not suitable for use in resource-constrained devices such as smartphones. In this study, we have evaluated the state-of-the-art DL models for HBCR. For this, we have used four existing datasets and created a merged dataset (by combining the four) for cross-dataset evaluation. We have provided a comparative performance analysis of the state-of-the-art DL models for HBCR. Additionally, we have proposed a new lightweight DL model for HBCR and evaluated its performance. The proposed DL model consists of 74 layers, including sub-layers, and its architecture is divided into five similar blocks. It includes the convolutional layers of (3, 3) and (5, 5) kernels, (1,1) stride, and the maximum pool layer of the (2, 2) pool size. The proposed model achieved accuracy, model size, loading and testing times of 96.87%, 13 MB, 9.11 s, and 7.95 s, respectively. The experimental results show that our model outperformed state-of-the-art models in terms of efficiency (loading and testing time) and model size with competitive accuracy.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5

Similar content being viewed by others

Data availability statements

The datasets generated during and/or analysed during the current study are available from the corresponding author upon reasonable request.

References

  1. Du S, Ibrahim M, Shehata M, Badawy W (2012) Automatic license plate recognition (ALPR): a state-of-the-art review. IEEE Trans Circuits Syst Video Technol 23(2):311–325

    Article  Google Scholar 

  2. Srikantan JFG, Srihari S (1994) Handprinted character/digit recognition using a multiple feature/resolution philosophy. In: Proceeding of the fourth int’l workshop frontiers in handwriting recognition

  3. Weng Y, Xia C (2020) A new deep learning-based handwritten character recognition system on mobile computing devices. Mobile Netw Appl 25(2):402–411

    Article  Google Scholar 

  4. Simons GF, Fennig CD (2017) Summary by language size. Languages of World, Ethnologue

    Google Scholar 

  5. Rabby ASA, Haque S, Islam S, Abujar S, Hossain SA (2018) Bornonet: Bangla handwritten characters recognition using convolutional neural network. Procedia Comput Sci 143:528–535

    Article  Google Scholar 

  6. Opu MNI, Islam MR, Kabir MA, Hossain MS, Islam MM (2022) Learn2write: augmented reality and machine learning-based mobile app to learn writing. Computers. https://doi.org/10.3390/computers11010004

    Article  Google Scholar 

  7. Rakshit S, Ghosal D, Das T, Dutta S, Basu S (2010) Development of a multi-user recognition engine for handwritten Bangla basic characters and digits. arXiv preprint arXiv:1003.5897

  8. Chaudhuri B (2006) A complete handwritten numeral database of Bangla—a major indic script. In Proceedings of tenth international workshop on frontiers in handwriting recognition, Suvisoft, Baule, France

  9. Liu C-L, Suen CY (2009) A new benchmark on the recognition of handwritten Bangla and Farsi numeral characters. Pattern Recogn 42(12):3287–3295

    Article  Google Scholar 

  10. Surinta O, Schomaker L, Wiering M (2013) A comparison of feature and pixel-based methods for recognizing handwritten Bangla digits. In: 2013 12th international conference on document analysis and recognition. IEEE, pp 165–169

  11. Bhowmik TK, Ghanty P, Roy A, Parui SK (2009) SVM-based hierarchical architectures for handwritten Bangla character recognition. Int J Doc Anal Recognit (IJDAR) 12(2):97–108

    Article  Google Scholar 

  12. Rabby ASA, Abujar S, Haque S, Hossain SA (2019) Bangla handwritten digit recognition using convolutional neural network. In: Emerging technologies in data mining and information security. Springer, Berlin, pp 111–122

  13. Hakim SA, et al. (2019) Handwritten Bangla numeral and basic character recognition using deep convolutional neural network. In: 2019 international conference on electrical, computer and communication engineering (ECCE). IEEE, pp 1–6

  14. Simonyan K, Zisserman A (2014) Very deep convolutional networks for large-scale image recognition. arXiv preprint arXiv:1409.1556

  15. He K, Zhang X, Ren S, Sun J (2016) Deep residual learning for image recognition. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 770–778

  16. Krizhevsky A, Sutskever I, Hinton GE (2012) Imagenet classification with deep convolutional neural networks. In: Advances in neural information processing systems, pp 1097–1105

  17. Hasnat MA, Habib SM, Khan M (2008) A high performance domain specific OCR for Bangla script. In: Novel algorithms and techniques in telecommunications, automation and industrial electronics. Springer, Berlin, pp 174–178

  18. Pal U, Chaudhuri B (1994) OCR in Bangla: an indo-Bangladeshi language. In: Proceedings of the 12th IAPR international conference on pattern recognition, vol 3-conference C: signal processing (Cat. No. 94CH3440-5). IEEE, vol 2, pp 269–273

  19. Chowdhury A, Foysal A, Islam S (2016) Bangla character recognition for android devices. Int J Comput Appl 136(11):13–19

    Google Scholar 

  20. Pal U, Chaudhuri B (2000) Automatic recognition of unconstrained off-line Bangla handwritten numerals. In: International conference on multimodal interfaces. Springer, Berlin, pp 371–378

  21. Bhattacharya U, Chaudhuri B (2003) A majority voting scheme for multiresolution recognition of handprinted numerals. In: Seventh international conference on document analysis and recognition, 2003. Proceedings. IEEE, pp 16–20

  22. Basu S, Das N, Sarkar, R., Kundu M, Nasipuri M, Basu DK (2012) An MLP based approach for recognition of handwritten bangla’numerals. arXiv preprint arXiv:1203.0876

  23. Xu J-w, Xu J, Lu Y (2008) Handwritten Bangla digit recognition using hierarchical Bayesian network. In: 2008 3rd international conference on intelligent system and knowledge engineering. IEEE, vol 1, pp 1096–1099

  24. Cireşan D, Meier U (2015) Multi-column deep neural networks for offline handwritten Chinese character classification. In: 2015 international joint conference on neural networks (IJCNN). IEEE, pp 1–6

  25. Roy S, Das N, Kundu M, Nasipuri M (2017) Handwritten isolated Bangla compound character recognition: a new benchmark using a novel deep learning approach. Pattern Recogn Lett 90:15–21

    Article  Google Scholar 

  26. Maitra DS, Bhattacharya U, Parui SK (2015) CNN based common approach to handwritten character recognition of multiple scripts. In: 2015 13th International Conference on Document Analysis and Recognition (ICDAR). IEEE, pp 1021–1025

  27. Chowdhury RR, Hossain MS, Ul Islam R, Andersson K, Hossain S (2019) Bangla handwritten character recognition using convolutional neural network with data augmentation. In: 2019 Joint 8th international conference on informatics, electronics and vision (ICIEV) and 2019 3rd international conference on imaging, vision and pattern recognition (icIVPR). IEEE, pp 318–323

  28. Sethy A, Patra PK, Nayak SR (2020) Offline handwritten numeral recognition using convolution neural network. Mach Vis Insp Syst 1:197–212. https://doi.org/10.1002/9781119682042.ch9

    Article  Google Scholar 

  29. Alom MZ, Sidike P, Hasan M, Taha TM, Asari VK (2018) Handwritten Bangla character recognition using the state-of-the-art deep convolutional neural networks. Comput Intell Neurosci. https://doi.org/10.1155/2018/6747098

    Article  Google Scholar 

  30. Shawon A, Rahman MJ-U, Mahmud F, Zaman MA (2018) Bangla handwritten digit recognition using deep CNN for large and unbiased dataset. In: 2018 international conference on Bangla speech and language processing (ICBSLP). IEEE, pp 1–6

  31. Saha C, Faisal RH, Rahman MM (2019) Bangla handwritten basic character recognition using deep convolutional neural network. In: 2019 joint 8th international conference on informatics, electronics and vision (ICIEV) and 2019 3rd international conference on imaging, vision and pattern recognition (icIVPR). IEEE, pp 190–195

  32. Rabby ASA, Haque S, Abujar S, Hossain SA (2018) Ekushnet: using convolutional neural network for Bangla handwritten recognition. Procedia Comput Sci 143:603–610

    Article  Google Scholar 

  33. Sandler M, Howard A, Zhu M, Zhmoginov A, Chen L-C (2018) Mobilenetv2: inverted residuals and linear bottlenecks. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 4510–4520

  34. Wang Z, Li C, Wang X (2021) Convolutional neural network pruning with structural redundancy reduction. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp 14913–14922

  35. Wang Z, Li C, Wang X, Wang D (2019) Towards efficient convolutional neural networks through low-error filter saliency estimation. In: PRICAI 2019: trends in Artificial Intelligence: 16th Pacific rim international conference on artificial intelligence, Cuvu, Yanuca Island, Fiji, August 26–30, 2019, Proceedings, Part II 16, pp 255–267. Springer, Berlin

  36. Wang Z, Li C (2022) Channel pruning via lookahead search guided reinforcement learning. In: Proceedings of the IEEE/CVF winter conference on applications of computer vision, pp 2029–2040

  37. Hinton G, Vinyals O, Dean J (2015) Distilling the knowledge in a neural network. arXiv preprint arXiv:1503.02531

  38. Wang Z (2021) Zero-shot knowledge distillation from a decision-based black-box model. In: International conference on machine learning, pp 10675–10685. PMLR

  39. Li C, Wang Z, Qi H (2022) Online knowledge distillation by temporal–spatial boosting. In: Proceedings of the IEEE/CVF winter conference on applications of computer vision, pp 197–206

  40. Wu G, Gong S (2021) Peer collaborative learning for online knowledge distillation. In: Proceedings of the AAAI conference on artificial intelligence, vol 35, pp 10302–10310

  41. Wang Z (2021) Data-free knowledge distillation with soft targeted transfer set synthesis. In: Proceedings of the AAAI conference on artificial intelligence, vol 35, pp 10245–10253

  42. Sarkar R, Das N, Basu S, Kundu M, Nasipuri M, Basu DK (2012) Cmaterdb1: a database of unconstrained handwritten Bangla and Bangla–English mixed script document image. Int J Doc Anal Recognit (IJDAR) 15(1):71–83

    Article  Google Scholar 

  43. Biswas M, Islam R, Shom GK, Shopon M, Mohammed N, Momen S, Abedin A (2017) Banglalekha-isolated: a multi-purpose comprehensive dataset of handwritten Bangla isolated characters. Data Brief 12:103–107

    Article  Google Scholar 

  44. Bhattacharya U, Chaudhuri BB (2008) Handwritten numeral databases of Indian scripts and multistage recognition of mixed numerals. IEEE Trans Pattern Anal Mach Intell 31(3):444–457

    Article  Google Scholar 

  45. Rabby ASA, Haque S, Islam MS, Abujar S, Hossain SA (2018) EKUSH: a multipurpose and multitype comprehensive database for online off-line Bangla handwritten characters. In: International conference on recent trends in image processing and pattern recognition. Springer, Berlin, pp 149–158

  46. Kingma DP, Ba J (2014) ADAM: a method for stochastic optimization. arXiv preprint arXiv:1412.6980

  47. Schaul T, Zhang S, LeCun Y (2013) No more pesky learning rates. In: International conference on machine learning, pp 343–351

  48. Perez L, Wang J (2017) The effectiveness of data augmentation in image classification using deep learning. arXiv preprint arXiv:1712.04621

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Md. Ekramul Hossain.

Ethics declarations

Conflicts of interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Opu, M.N.I., Hossain, M.E. & Kabir, M.A. Handwritten Bangla character recognition using convolutional neural networks: a comparative study and new lightweight model. Neural Comput & Applic 36, 337–348 (2024). https://doi.org/10.1007/s00521-023-09008-8

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s00521-023-09008-8

Keywords

Navigation