Skip to main content
Log in

Segmentation of handwritten words into characters

  • Published:
Multimedia Tools and Applications Aims and scope Submit manuscript

Abstract

In this paper, SFF (Segmentation Facilitate Feature) technique is proposed to find the junction path to segment touched components based on the seed pixel selected among candidate pixels. Handwritten Recognition system has number of applications like reading postal address, filling forms, reading bank cheques, offering several challenges. In practice, constitute of the word images get touched in handwritten data due to variability in stroke, shortage of space which make the individual character extraction from the word image more complicated. Segmentation of individual in a word image requires a technique that takes care of the variability of writing. This paper proposed the SFF (Segmentation Facilitate Feature) technique to find seed pixel among candidate pixels based on 3-neighbouring pixels. It is used to find junction pixels which form a junction path to segregate the touched component. The junction path is selected to avoid the issues arising due to artifacts or deletion of components features. For experimentation, 1840 legal amount words containing touching components are used. The above number includes 250 words from benchmark database (ICDAR) and 1590 words are gathered from 15 different writers. On implementing, SFF (Segmentation Facilitate Feature) technique on the above mentioned database, 89.9% accuracy is achieved and a higher accuracy level 96.2% is achieved when performed on 1000 words containing two touching consonants.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9

Similar content being viewed by others

References

  1. Aneja N, Aneja S (2019) Transfer learning using CNN for handwritten devanagari character recognition. 1st IEEE Int. Conf. Adv. Inf. Technol. ICAIT 2019 - Proc., pp. 293–296.

  2. Avola D, Caschera MC, Ferri F, Grifoni P (2010) Classifying and resolving ambiguities in sketch-based interaction. Int J Virtual Technol Multimed 1(2):104

    Article  Google Scholar 

  3. Bansal V, Sinha RMK (2002) Segmentation of touching and fused Devanagari characters. Pattern Recogn 35(4):875–893

    Article  Google Scholar 

  4. Chen M-Y (1994) Off-line handwritten word recognition using a hidden Markov model type stochastic network. IEEE Trans Pattern Anal Mach Intell 16(5):481–496

    Article  Google Scholar 

  5. Choudhary A, Rishi R, Ahlawat S (2013) New character segmentation approach for off-line cursive handwritten words. Procedia Comput Sci 17:88–95

    Article  Google Scholar 

  6. Dhaka VP, Sharma MK (2015) An efficient segmentation technique for Devanagari offline handwritten scripts using the feedforward neural network. Neural Comput Appl 26(8):1881–1893

    Article  Google Scholar 

  7. Gaurav DD, Ramesh R (2012) A feature extraction technique based on character geometry for character recognition, no. eprint 1202.3884, pp. 1–4

  8. Gupta D, Bag S (2019) Handwritten multilingual word segmentation using polygonal approximation of digital curves for Indian languages. Multimed Tools Appl 78(14):19361–19386

    Article  Google Scholar 

  9. Kamble SN, Kamble PM (2011) Morphological approach for segmentation of scanned handwritten devnagari text. 3:99–108

  10. Kapoor S, Verma V (2014) Fragmentation of handwritten touching characters in devanagari script. Int J Inf Technol Model Comput 2(1):11–21

    Google Scholar 

  11. Kaur A, Singh P, Rani S (2015) Segmentation of broken and isolated characters in handwritten Gurumukhi word using neighboring pixel technique. Trans Networks Commun 3(2):37–42

    Article  Google Scholar 

  12. Kumar M, Jindal MK, Sharma RK (2014) Segmentation of isolated and touching characters in offline handwritten Gurmukhi script recognition. Int J Inf Technol Comput Sci 6(2):58–63

    Google Scholar 

  13. Kurniawan F, Rahim MSM, Sholihah N, Rakhmadi A, Mohamad D (2011) Characters segmentation of cursive hand written words based on contour analysis and neural network validation. ITB J Inf Commun Technol 5(1):1–16

    Google Scholar 

  14. Ladwani VM, Malik L (2010) Novel approach to segmentation of handwritten Devnagari word. Proc. - 3rd Int. Conf. Emerg. Trends Eng. Technol. ICETET 2010, pp. 219–224

  15. Lemaitre A, Camillerapp J, Coüasnon B (2011) A perceptive method for handwritten text segmentation. Doc Recognit Retr XVIII 7874:78740C

    Google Scholar 

  16. Louloudis G, Gatos B, Pratikakis I, Halatsis C (2009) Text line and word segmentation of handwritten documents. Pattern Recogn 42(12):3169–3183

    Article  Google Scholar 

  17. Lu Y, Shridhar M (1996) Character segmentation in handwritten words - an overview. Pattern Recogn 29(1):77–96

    Article  Google Scholar 

  18. Mamatha HR, Srikantamurthy K (2012) Morphological operations and projection profiles based segmentation of handwritten Kannada document. Int J Appl Inf Syst 4(5):13–19

    Google Scholar 

  19. Modi N, Jindal K (2013) Text line detection and segmentation in handwritten Gurumukhi scripts. IJ Adv Res Comput Sci Softw Engeenring (2013) 3(5):1075–1080

    Google Scholar 

  20. Naveena C, Manjunath Aradhya VN (2012) Handwritten character segmentation for Kannada scripts. Proc. 2012 World Congr. Inf. Commun. Technol. WICT 2012, pp. 144–149

  21. Pal U, Datta S (2003) Segmentation of Bangla unconstrained handwritten text. in Proceedings of the International Conference on Document Analysis and Recognition, ICDAR, vol. 2003–Janua, no. ICDAR, pp. 1128–1132.

  22. Pal U, Behaid A, Choisy C (2003) Touching numeral segmentation using water reservoir concept. Pattern Recognit Lett 24(jan):261–272

    Article  Google Scholar 

  23. Sarkar R (2010) A script independent technique for extraction of characters from handwritten word images. 1(23):83–88

  24. Thakral B, Kumar M (2014) Devanagari handwritten text segmentation for overlapping and conjunct characters- A proficient technique. Proc. - 2014 3rd Int Conf Reliab Infocom Technol Optim Trends Futur Dir ICRITO 2014, 2015.

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Monika Kohli.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Kohli, M., Kumar, S. Segmentation of handwritten words into characters. Multimed Tools Appl 80, 22121–22133 (2021). https://doi.org/10.1007/s11042-021-10638-0

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11042-021-10638-0

Keywords

Navigation