Abstract
The presence of struck-out texts in handwritten manuscripts adversely affects the performance of state-of-the-art automatic handwritten document processing systems. The information of struck-out words (STW) are often important for real-time applications like handwritten character recognition, writer identification, digital transcription, forensic applications, historical document analysis etc. Hence, the detection of STW and localisation of struck-out strokes (SS) are crucial tasks. In this paper, we introduce a system for simultaneous detection of STWs and localisation of the SS using a single network architecture based on Generative Adversarial Network (GAN). The system requires no prior information about the type of SS stroke and it is also able to robustly handle variant of strokes like straight, slanted, cris-cross, multiple-lines, underlines and partial STW as well. However, we also present a methodology to generate STW with high variability of SS for network learning. We have evaluated the proposed pipeline on publicly available IAM dataset and also on struck-out words collected from real-world writers with high variability factors like age, gender, stroke-width, stroke-type etc. The evaluation metrics show robustness and applicability in real-world scenario.
The work is partially supported by the project entitled as “Information Access from Document Images of Indian Languages” sponsored by IMPRINT, MHRD, Govt. of INDIA.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Bibliothèque de Rouen (Rouen Library), Rouen Cedex-76043, France. http://www.bovary.fr. Accessed 19 July 2008
Ray, S.: https://en.wikipedia.org/wiki/Satyajit_Ray. Accessed 22 Feb 2021
The Morgan Library & Museum, New York, USA-10016. http://www.themorgan.org. Accessed 19 July 2008
A brief about the movie Goopi Gyne Bagha Byne. https://en.wikipedia.org/wiki/Goopy_Gyne_Bagha_Byne. Accessed 22 Feb 2021
A brief about the movie series Apu Triology. https://en.wikipedia.org/wiki/The_Apu_Trilogy. Accessed 22 Feb 2021
A brief description about ‘National Digital Library of India’ on Wikipedia. https://en.wikipedia.org/wiki/National_Digital_Library_of_India. Accessed 22 Feb 2021
A brief description of ‘Satyajit Ray’ by Satyajit Ray Organisation. https://satyajitray.org/. Accessed 22 Feb 2021
George Washington Papers, The Library of Congress, USA. http://memory.loc.gov/ammem/gwhtml/gwhome.html. Accessed 19 July 2008
National Digital Library of India. https://www.ndl.gov.in/. Accessed 22 Feb 2021
Queensland State Archive, Australia-4113. http://www.archivessearch.qld.gov.au. Accessed 19 July 2008
Adak, C., Chaudhuri, B.B.: An approach of strike-through text identification from handwritten documents. In: 2014 14th International Conference on Frontiers in Handwriting Recognition, pp. 643–648. IEEE (2014)
Adak, C., Chaudhuri, B.B., Blumenstein, M.: Impact of struck-out text on writer identification. In: 2017 International Joint Conference on Neural Networks (IJCNN), pp. 1465–1471. IEEE (2017)
Brink, A., Schomaker, L., Bulacu, M.: Towards explainable writer verification and identification using vantage writers. In: Ninth International Conference on Document Analysis and Recognition (ICDAR 2007), vol. 2, pp. 824–828. IEEE (2007)
Caligiuri, M.P., Mohammed, L.A.: The Neuroscience of Handwriting: Applications for Forensic Document Examination. CRC Press, Boca Raton (2012)
Chaudhuri, B.B., Adak, C.: An approach for detecting and cleaning of struck-out handwritten text. Pattern Recogn. 61, 282–294 (2017)
Gatys, L.A., Ecker, A.S., Bethge, M.: Image style transfer using convolutional neural networks. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 2414–2423 (2016)
Isola, P., Zhu, J.Y., Zhou, T., Efros, A.A.: Image-to-image translation with conditional adversarial networks. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 1125–1134 (2017)
Johnson, J., Alahi, A., Fei-Fei, L.: Perceptual losses for real-time style transfer and super-resolution. In: Leibe, B., Matas, J., Sebe, N., Welling, M. (eds.) ECCV 2016. LNCS, vol. 9906, pp. 694–711. Springer, Cham (2016). https://doi.org/10.1007/978-3-319-46475-6_43
Likforman-Sulem, L., Vinciarelli, A.: Hmm-based offline recognition of handwritten words crossed out with different kinds of strokes (2008)
Liu, C.L., Yin, F., Wang, D.H., Wang, Q.F.: Online and offline handwritten Chinese character recognition: benchmarking on new databases. Pattern Recogn. 46(1), 155–162 (2013)
Marti, U.V., Bunke, H.: The IAM-database: an English sentence database for offline handwriting recognition. Int. J. Doc. Anal. Recogn. 5(1), 39–46 (2002)
Mirza, M., Osindero, S.: Conditional generative adversarial nets. arXiv preprint arXiv:1411.1784 (2014)
Pal, U., Jayadevan, R., Sharma, N.: Handwriting recognition in Indian regional scripts: a survey of offline techniques. ACM Trans. Asian Lang. Inf. Process. (TALIP) 11(1), 1 (2012)
Plamondon, R., Srihari, S.N.: Online and off-line handwriting recognition: a comprehensive survey. IEEE Trans. Pattern Anal. Mach. Intell. 22(1), 63–84 (2000)
Poddar, A., Mukherjee, R., Mukhopadhyay, J., Biswas, P.K.: MultiDIAS: a hierarchical multi-layered document image annotation system. In: Sundaram, S., Harit, G. (eds.) DAR 2018. CCIS, vol. 1020, pp. 3–14. Springer, Singapore (2019). https://doi.org/10.1007/978-981-13-9361-7_1
Tuganbaev, D., Deriaguine, D.: Method of stricken-out character recognition in handwritten text, 25 June 2013. uS Patent 8,472,719
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2021 Springer Nature Switzerland AG
About this paper
Cite this paper
Poddar, A., Chakraborty, A., Mukhopadhyay, J., Biswas, P.K. (2021). Detection and Localisation of Struck-Out-Strokes in Handwritten Manuscripts. In: Barney Smith, E.H., Pal, U. (eds) Document Analysis and Recognition – ICDAR 2021 Workshops. ICDAR 2021. Lecture Notes in Computer Science(), vol 12917. Springer, Cham. https://doi.org/10.1007/978-3-030-86159-9_7
Download citation
DOI: https://doi.org/10.1007/978-3-030-86159-9_7
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-86158-2
Online ISBN: 978-3-030-86159-9
eBook Packages: Computer ScienceComputer Science (R0)