Skip to main content
Log in

Handwritten Arabic and Roman word recognition using holistic approach

  • Original article
  • Published:
The Visual Computer Aims and scope Submit manuscript

Abstract

The research community considers handwritten word recognition (HWR) as an open research problem to date. The reasons behind this are variations in intra-/interpersonal writing style, overlapping and/or touching characters in a word, degraded scanned document images, etc. Two major approaches, namely holistic and analytical, are followed by the researchers while designing an HWR system. In this work, we have followed the holistic approach as it works well on limited and pre-defined lexicon as compared to the analytical approach. As observed in the literature related to handwritten word recognition, irrespective of the approaches, researchers generally extract various local features from hypothetically partitioned segments of a word image while dealing with the said problem. However, no such work has been found which has considered inter-segment similarity that might carry some distinct information about different patterns (here, word segments). To this end, in the present work, we have used Hausdorff and Fréchet distances to quantize the similarity among all possible word segments taking two at a time. Along with this, conventional chain code histogram (a shape-based feature descriptor) and modified negative refraction-based shape transformation features have been used. Finally, a majority voting schema is used to combine outputs from six different classifiers. The model has been evaluated on two standard databases, namely IAM and IFN/ENIT, and the results obtained are promising in comparison with state-of-the-art holistic word recognition methods. Moreover, a performance comparison of the present method with some deep learning models confirms the usefulness of the proposed method.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11
Fig. 12
Fig. 13
Fig. 14
Fig. 15
Fig. 16
Fig. 17
Fig. 18
Fig. 19
Fig. 20
Fig. 21
Fig. 22
Fig. 23
Fig. 24
Fig. 25
Fig. 26

Similar content being viewed by others

Explore related subjects

Discover the latest articles, news and stories from top researchers in related subjects.

References

  1. Pal, U., Roy, R.K., Kimura, F.: Multi-lingual city name recognition for Indian postal automation. In: Proceedings of the International Workshop on Frontiers in Handwriting Recognition. pp. 169–173. IEEE (2012)

  2. Pal, U., Roy, K., Kimura, F.: A lexicon-driven handwritten city-name recognition scheme for Indian postal automation. IEICE Trans. Inf. Syst. E92-D, 1146–1158 (2009). https://doi.org/10.1587/transinf.E92.D.1146

    Article  Google Scholar 

  3. Singh, S., Kariveda, T., Gupta, J. Das, Bhattacharya, K.: Handwritten words recognition for legal amounts of bank cheques in English script. In: Proceeding of the 2015 8th International Conference on Advances in Pattern Recognition. pp. 1–5. IEEE (2015)

  4. Jayadevan, R., Kolhe, S.R., Patil, P.M., Pal, U.: Database development and recognition of handwritten Devanagari legal amount words. In: Proceedings of the International Conference on Document Analysis and Recognition (ICDAR). pp. 304–308. IEEE (2011)

  5. Malakar, S., Ghosh, M., Sarkar, R., Nasipuri, M.: Development of a two-stage segmentation-based word searching method for handwritten document images. J. Intell. Syst. 29, 719–735 (2020). https://doi.org/10.1515/jisys-2017-0384

    Article  Google Scholar 

  6. Majumder, S., Ghosh, S., Malakar, S., Sarkar, R., Nasipuri, M.: A voting-based technique for word spotting in handwritten document images. Multimed. Tools Appl. (2021). https://doi.org/10.1007/s11042-020-10363-0

    Article  Google Scholar 

  7. Kundu, S., Malakar, S., Geem, Z.W., Moon, Y.Y., Singh, P.K., Sarkar, R.: Hough transform-based angular features for learning-free handwritten keyword spotting. Sensors 21, 4648 (2021)

    Article  Google Scholar 

  8. Mondal, R., Malakar, S., Barney Smith, E.H., Sarkar, R.: Handwritten English word recognition using a deep learning based object detection architecture. Multimed. Tools Appl. 81, 975–1000 (2022)

    Article  Google Scholar 

  9. Bhattacharya, R., Malakar, S., Schwenker, F., Sarkar, R.: Fuzzy-based pseudo segmentation approach for handwritten word recognition using a sequence to sequence model with attention. In: Pattern recognition. ICPR international workshops and challenges: virtual event, January 10–15, 2021, Proceedings, Part II, pp. 582–596. Springer (2021)

  10. Majid, N., Smith, E.H.B.: Segmentation-free bangla offline handwriting recognition using sequential detection of characters and diacritics with a Faster R-CNN. In: 2019 international conference on document analysis and recognition (ICDAR), pp. 228–233. IEEE (2019)

  11. Amrouch, M., Rabi, M., Mammass, D.: An improved Arabic handwritten recognition system using embedded training based on HMMs. In: 2016 IEEE/ACS 13th international conference of computer systems and applications (AICCSA), pp 1–5. IEEE (2016)

  12. Mahjoub, M.A., Ghanmy, N., Miled, I.: Multiple models of Bayesian networks applied to offline recognition of Arabic handwritten city names. arXiv Prepr. http://arxiv.org/abs/1301.4377. (2013)

  13. Dasgupta, J., Bhattacharya, K., Chanda, B.: A holistic approach for Off-line handwritten cursive word recognition using directional feature based on Arnold transform. Pattern Recognit. Lett. 79, 73–79 (2016). https://doi.org/10.1016/j.patrec.2016.05.017

    Article  Google Scholar 

  14. Malakar, S., Sarkar, R., Basu, S., Kundu, M., Nasipuri, M.: An image database of handwritten Bangla words with automatic benchmarking facilities for character segmentation algorithms. Neural Comput. Appl. 33, 449–468 (2021). https://doi.org/10.1007/s00521-020-04981-w

    Article  Google Scholar 

  15. Inunganbi, S., Choudhary, P., Manglem, K.: Meitei Mayek handwritten dataset: compilation, segmentation, and character recognition. Vis. Comput. 37, 291–305 (2021)

    Article  Google Scholar 

  16. Madhvanath, S., Govindaraju, V., Member, S.: The role of holistic paradigms in handwritten word recognition. IEEE Trans. Pattern Anal. Mach. Intell. 23, 149–164 (2001)

    Article  Google Scholar 

  17. Hijam, D., Saharia, S.: On developing complete character set meitei mayek handwritten character database. Vis. Comput. 1–15 (2021)

  18. Guo, H., Liu, Y., Yang, D., Zhao, J.: Offline handwritten Tai Le character recognition using ensemble deep learning. Vis. Comput. 1–14 (2021)

  19. Ghosh, S., Chatterjee, A., Singh, P.K., Bhowmik, S., Sarkar, R.: Language-invariant novel feature descriptors for handwritten numeral recognition. Vis. Comput. 37, 1781–1803 (2021)

    Article  Google Scholar 

  20. Chatterjee, A., Malakar, S., Sarkar, R., Nasipuri, M.: Handwritten digit recognition using DAISY descriptor: a study. In: Proceedings of 5th international conference on emerging applications of information technology (EAIT 2018), pp. 1–4. (2018)

  21. Warrington, E.K., Shallice, T.: Word-form dyslexia. Brain 103, 99–112 (1980)

    Article  Google Scholar 

  22. Madhvanath, S.: Holistic verification of handwritten phrases. IEEE Trans. Pattern Anal. Mach. Intell. 21, 1344–1356 (1999). https://doi.org/10.1109/34.817412

    Article  Google Scholar 

  23. Kacem, A., Aouïti, N., Belaïd, A.: Structural features extraction for handwritten Arabic personal names recognition. In: 2012 international conference on frontiers in handwriting recognition, pp. 268–273. IEEE (2012)

  24. Khémiri, A., Kacem, A., Belaïd, A.: Towards arabic handwritten word recognition via probabilistic graphical models. In: 2014 14th international conference on frontiers in handwriting recognition, pp. 678–683. IEEE (2014)

  25. Khémiri, A., Echi, A.K., Belaïd, A., Elloumi, M.: A system for off-line Arabic handwritten word recognition based on Bayesian approach. In: 2016 15th international conference on frontiers in handwriting recognition (ICFHR), pp. 560–565. IEEE (2016)

  26. AlKhateeb, J.H., Khelifi, F., Jiang, J., Ipson, S.S.: A new approach for off-line handwritten Arabic word recognition using KNN classifier. In: 2009 IEEE International Conference on Signal and Image Processing Applications. pp. 191–194. IEEE (2009)

  27. Tamen, Z., Drias, H., Boughaci, D.: An efficient multiple classifier system for Arabic handwritten words recognition. Pattern Recognit. Lett. 93, 123–132 (2017). https://doi.org/10.1016/j.patrec.2017.01.020

    Article  Google Scholar 

  28. Al-Nuzaili, Q.A., Hashim, S.Z.M., Saeed, F., Khalil, M.S., Mohamad, D.: Bin: Pixel distribution-based features for offline Arabic handwritten word recognition. Int. J. Comput. Vis. Robot. 7, 99–122 (2017)

    Article  Google Scholar 

  29. Hassen, H., Al-Maadeed, S.: Arabic handwriting recognition using sequential minimal optimization. In: 2017 1st international workshop on arabic script analysis and recognition (ASAR), pp 79–84. IEEE (2017)

  30. Cajote, R.D., Guevara, R.C.L.: Global word shape processing using polar-radii graphs for offline handwriting recognition. In: 2004 IEEE Region 10 Conference TENCON 2004, pp. 315–318. IEEE (2004)

  31. Gatos, B., Pratikakis, I., Perantonis, S.J.: Hybrid off-line cursive handwriting word recognition. In: 18th International Conference on Pattern Recognition (ICPR’06), pp. 998–1002. IEEE (2006)

  32. Bianne-Bernard, A.-L., Menasri, F., Mohamad, R.A.-H., Mokbel, C., Kermorvant, C., Likforman-Sulem, L.: Dynamic and contextual information in HMM modeling for handwritten word recognition. IEEE Trans. Pattern Anal. Mach. Intell. 33, 2066–2080 (2011)

    Article  Google Scholar 

  33. Sahoo, S., Nandi, S.K., Barua, S., Priyam, P., Bhowmik, S., Malakar, S., Sarkar, R.: Handwritten Bangla word recognition using negative refraction based shape transformation. J. Intell. Fuzzy Syst. 35, 1765–1777 (2018)

    Article  Google Scholar 

  34. Malakar, S., Ghosh, M., Bhowmik, S., Sarkar, R., Nasipuri, M.: A GA based hierarchical feature selection approach for handwritten word recognition. Neural Comput. Appl. 32, 2533–2552 (2020)

    Article  Google Scholar 

  35. Bhowmik, S., Malakar, S., Sarkar, R., Nasipuri, M.: Handwritten bangla word recognition using elliptical features. In: Proceedings - 2014 6th International Conference on Computational Intelligence and Communication Networks, CICN 2014, pp. 257–261. IEEE (2014)

  36. Barua, S., Malakar, S., Bhowmik, S., Sarkar, R., Nasipuri, M.: Bangla handwritten city name recognition using gradient-based feature. In: 5th international conference on frontiers in intelligent computing: theory and applications, pp. 343–352. Springer, Singapore (2017)

  37. Bhowmik, S., Malakar, S., Sarkar, R., Basu, S., Kundu, M., Nasipuri, M.: Off-line Bangla handwritten word recognition: a holistic approach. Neural Comput. Appl. 31, 5783–5798 (2019)

    Article  Google Scholar 

  38. Malakar, S., Sharma, P., Singh, P.K., Das, M., Sarkar, R., Nasipuri, M.: A holistic approach for handwritten hindi word recognition. Int. J. Comput. Vis. Image Process. 7, 59–78 (2017). https://doi.org/10.4018/IJCVIP.2017010104

    Article  Google Scholar 

  39. Jino, P.J., Balakrishnan, K.: Offline handwritten recognition of malayalam district name-a holistic approach. Int. J. Eng. Technol. 9, 987–994 (2017)

    Article  Google Scholar 

  40. Kaur, H., Kumar, M.: Offline handwritten Gurumukhi word recognition using eXtreme Gradient Boosting methodology. Soft Comput. 1–14 (2020)

  41. Ruiz-Pinales, J., Jaime-Rivas, R., Castro-Bleda, M.J.: Holistic cursive word recognition based on perceptual features. Pattern Recognit. Lett. 28, 1600–1609 (2007). https://doi.org/10.1016/j.patrec.2007.03.017

    Article  Google Scholar 

  42. Awni, M., Khalil, M.I., Abbas, H.M.: Deep-learning ensemble for offline Arabic handwritten words recognition. In: 2019 14th International Conference on Computer Engineering and Systems (ICCES), pp. 40–45. IEEE (2019)

  43. Essa, N., El-Daydamony, E., Mohamed, A.A.: Enhanced technique for Arabic handwriting recognition using deep belief network and a morphological algorithm for solving ligature segmentation. ETRI J. 40, 774–787 (2018)

    Article  Google Scholar 

  44. Malakar, S., Paul, S., Kundu, S., Bhowmik, S., Sarkar, R., Nasipuri, M.: Handwritten word recognition using lottery ticket hypothesis based pruned CNN model: a new benchmark on CMATERdb2. 1.2. Neural Comput. Appl. 32, 15209–15220 (2020)

    Google Scholar 

  45. Pramanik, R., Bag, S.: Handwritten Bangla city name word recognition using CNN-based transfer learning and FCN. Neural Comput. Appl. 33, 9329–9341 (2021)

    Article  Google Scholar 

  46. Das, D., Nayak, D.R., Dash, R., Majhi, B., Zhang, Y.-D.: H-WordNet: a holistic convolutional neural network approach for handwritten word recognition. IET Image Process. 14, 1794–1805 (2020)

    Article  Google Scholar 

  47. Malakar, S., Mohanta, D., Sarkar, R., Das, N., Nasipuri, M., Basu, D.K.: Binarization of the noisy document images: a new approach. In: International Conference on Information Processing, pp. 511–520. (2011)

  48. Pechwitz, M., Maddouri, S.S., Märgner, V.: IFN/ENIT-database of handwritten Arabic words. In: Francophone International Conference on writing and Document, pp. 127–136. Citeseer (2002)

  49. IAM On-Line Handwriting Database, http://www.fki.inf.unibe.ch/databases/iam-on-line-handwriting-database

  50. Zimmermann, M., Bunke, H.: Automatic segmentation of the IAM off-line database for handwritten English text. In: Object Recognition Supported by User Interaction for Service Robots. pp. 35–39. IEEE (2002)

  51. Sahoo, S., Nandi, S.K., Barua, S., Priyam, P., Malakar, S., Sarkar, R.: Handwritten Bangla city name recognition using shape-context feature. In: 6th International Conference on Frontiers in Intelligent Computing: Theory and Applications, pp. 451–460. Springer, Singapore (2018)

  52. Mohamed, I.: The application of negative refractive index metamaterials to mm and sub-mm wave-length instrumentation (2013)

  53. Rucklidge, W.: The hausdorff distance. In: Rucklidge, W. (ed.) Efficient Visual Recognition Using the Hausdorff Distance, pp. 27–42. Springer, Berlin, Heidelberg (1996)

    Chapter  MATH  Google Scholar 

  54. Zhu, L., Zhu, C.: Application of Hausdorff distance in image matching. In: 2014 IEEE Workshop on Electronics, Computer and Applications, pp. 97–100. IEEE (2014)

  55. Eiter, T., Mannila, H.: Computing discrete Fréchet distance. Citeseer (1994)

  56. Kundu, S., Paul, S., Singh, P.K., Sarkar, R., Nasipuri, M.: Understanding NFC-Net: a deep learning approach to word-level handwritten Indic script recognition. Neural Comput. Appl. 32, 1–17 (2020)

    Article  Google Scholar 

  57. Fan, Y., Rui, X., Poslad, S., Zhang, G., Yu, T., Xu, X., Song, X.: A better way to monitor haze through image based upon the adjusted LeNet-5 CNN model. Signal Image Video Process. 14, 455–463 (2020)

    Article  Google Scholar 

  58. Krizhevsky, A., Sutskever, I., Hinton, G.E.: Imagenet classification with deep convolutional neural networks. In: Advances in Neural Information Processing Systems, pp. 1097–1105 (2012)

Download references

Acknowledgements

We would like to thank the Center for Microprocessor Applications for Training Education and Research (CMATER) research laboratory of the Computer Science and Engineering Department, Jadavpur University, Kolkata, India, for providing us the infrastructural support.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Samir Malakar.

Ethics declarations

Conflict of interest

We declare that we have no conflict of interest.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Malakar, S., Sahoo, S., Chakraborty, A. et al. Handwritten Arabic and Roman word recognition using holistic approach. Vis Comput 39, 2909–2932 (2023). https://doi.org/10.1007/s00371-022-02500-7

Download citation

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s00371-022-02500-7

Keywords