Abstract
In this paper, we propose different approaches for the segmentation of handwritten Devanagari word documents into constituent characters (or pseudo-characters). For accurate identification and segmentation of shiroreakha we exploited ShiroreakhaNet which is encoder-decoder based convolutional neural network. After, segmenting the shiroreakha structural patterns/properties are exploited for the segmentation of upper and lower modifiers. For the corroboration of the efficacy of the results, we collected dataset from different domains. Comparison is also performed with the state-of-the-art methods, and it was revealed that proposed approaches significantly perform better.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Bhunia, A.K., Roy, P.P., Sain, A., Pal, U.: Zone-based keyword spotting in Bangla and Devanagari documents. Multimedia Tools and Applications 79(37–38), 27365–27389 (2020). https://doi.org/10.1007/s11042-019-08442-y
Bhunia, A.K., Mukherjee, S., Sain, A., Bhunia, A.K., Roy, P.P., Pal, U.: Indic handwritten script identification using offline-online multi-modal deep network. Information Fusion. 57, 1–14 (2020)
Bhat, M.I., Sharada, B.: Automatic recognition of legal amounts on indian bank cheques: a fusion based approach at feature and decision level. Int. J. Comp. Vision and Image Procesing 10(4), 57–73
Casey, R.G., Casey, R.G., Lecolinet, E., Lecolinet, E.: A survey of methods and strategies in character segmentation. Analysis. IEEE Transactions on Pattern Analysis and Machine Intelligence 18, 690–706 (1996)
Bortolozzi, F., Souza, A. De B., Jr, Oliveira, L.S., Morita, M.: Recent advances in handwriting recognition. IEEE Trans. Pattern Anal. Mach. Intell. 22, 38–62 (2004)
Murthy Ramana, O.V.: An approach to offline handwritten Devanagari word segmentation. Int. J. Comput. Appl. Technol. 44, 284–292 (2012)
Bag, S., Krishna, A.: Character segmentation of Hindi unconstrained handwritten words. Lect. Notes Comput. Sci. 9448, 247–260 (2015)
Bhujade, M.V.G., Meshram, M.C.M.: A technique for segmentation of handwritten Hindi text. Int. J. Eng. Res. technoogy. 3, 1491–1495 (2014)
Kohli, M., Kumar, S.: Pre-segmentation in offline handwritten words. Infocomp J. Comput. Sci. 18, 48–53 (2019)
Ramteke, A.S., Rane, M.E.: Offline handwritten Devanagari script segmentation. Int. J. Sci. Technol. Res. 1, 142–145 (2012)
Pal, U., Chaudhuri, B.B.: Indian script character recognition: a survey. Pattern Recognit. 37, 1887–1899 (2004)
Jayadevan, R., Kolhe, S.R., Patil, P.M., Pal, U.: Offline recognition of Devanagari script: a survey. IEEE Trans. Syst. Man Cybern. Part C Appl. Rev. 41, 782–796 (2011)
Bag, S., Harit, G.: A survey on optical character recognition for Bangla and Devanagari scripts. Sadhana - Acad. Proc. Eng. Sci. 38, 133–168 (2013)
Bhat, M.I., Sharada, B., Obaidullah, S.M., and Imran, M.: Towards accurate identification and removal of shirorekha from off-line handwritten devanagari word documents. Proc. ICFHR 2020-September, pp. 234–239 (2020)
Kaya, A., Keceli, A.S., Catal, C., Yalic, H.Y., Temucin, H., Tekinerdogan, B.: Analysis of transfer learning for deep neural network based plant classification models. Comput. Electron. Agric. 158, 20–29 (2019)
Gonzalez, R.C., Woods, R.E.: Digital image processing, https://books.google.com/books?id=lDojQwAACAAJ&pgis=1 (2008)
Jayadevan, R., Kolhe, S.R., Patil, P.M., Pal, U.: Database development and recognition of handwritten Devanagari legal amount words, pp. 304–308. Int. Conf. Doc. Anal. Recognition, Beijing (2011)
Malik, L.: A graph based approach for handwritten Devanagari word recognition. Fifth Int. Conf. Emerg. Trends Eng. Technol. 1, 2012–42 (2012)
Bansal, V., Sinha, R.M.K.: Segmentation of touching and fused Devanagari characters. Pattern Recogn. 35, 875–893 (2002)
Bag, S., Krishna, A.: Character segmentation of Hindi unconstrained handwritten words. Proc. 17th Int. Work. Comb. image Anal. 9448, 247–260 (2015)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2022 Springer Nature Switzerland AG
About this paper
Cite this paper
Bhat, M.I., Sharada, B., Imran, M., Obaidullah, S. (2022). Automatic Segmentation of Handwritten Devanagari Word Documents Enabling Accurate Recognition. In: Chbeir, R., Manolopoulos, Y., Prasath, R. (eds) Mining Intelligence and Knowledge Exploration. MIKE 2021. Lecture Notes in Computer Science(), vol 13119. Springer, Cham. https://doi.org/10.1007/978-3-031-21517-9_8
Download citation
DOI: https://doi.org/10.1007/978-3-031-21517-9_8
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-21516-2
Online ISBN: 978-3-031-21517-9
eBook Packages: Computer ScienceComputer Science (R0)