Abstract
Word spotting in handwritten document images is a field of immense interest due to its widespread applications. Recognition-free and recognition-based approaches are the two comprehensively studied regimes for the said problem out of which the first one is more realistic for practical applications. In literature, several works have been found that have used contour and distance-based measures for matching of the profiles of two word images. Although this is a prudent choice for printed words, the same often faces bottlenecks for unconstrained handwriting. To this end, this work applies dynamic time warping algorithm on logarithmic profiles of handwritten word images to lessen the uncontrolled profile variation that occurs due to elongation while writing some characters. We have considered both global and local interpretations of a word image by dividing it vertically into a number of sub-parts. This multi-view analysis provides close-up views of different approximations for the same word image. Finally, a voting scheme is evoked to produce the final decision. Besides, we have adopted a pruning method to pre-filter the target word images prior to applying the voting-based word matching scheme. The method has been tested on word images, taken from Qatar University Writer Identification database. We have obtained satisfactory results as compared to many state-of-the-art methods that also include deep learning-based feature extraction models.









Similar content being viewed by others
References
Al Aghbari Z, Brook S (2009) HAH manuscripts: a holistic paradigm for classifying and retrieving historical Arabic handwritten documents. Expert Syst Appl 36:10942–10951. https://doi.org/10.1016/j.eswa.2009.02.024
Al Maadeed S, Ayouby W, Hassaïne A, Aljaam JM (2012) Quwi: an Arabic and English handwriting dataset for offline writer identification. In: 2012 international conference on Frontiers in handwriting recognition. IEEE, pp 746–751
Athitsos V, Alon J, Sclaroff S, Kollios G (2004) Boostmap: A method for efficient approximate similarity rankings. In: Proceedings of the 2004 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2004. CVPR 2004. IEEE, pp II–II
Basu S, Das N, Sarkar R, Kundu M, Nasipuri M, Basu DK (2009) A hierarchical approach to recognition of handwritten Bangla characters. Pattern Recogn 42:1467–1484. https://doi.org/10.1016/j.patcog.2009.01.008
Belongie S, Malik J, Puzicha J (2002) Shape matching and object recognition using shape contexts. IEEE Trans Pattern Anal Mach Intell 24:509–522
Bhowmik S, Malakar S, Sarkar R, Basu S, Kundu M, Nasipuri M (2018) Off-line Bangla handwritten word recognition: a holistic approach. Neural Comput & Applic 31:5783–5798. https://doi.org/10.1007/s00521-018-3389-1
Cao H, Govindaraju V (2007) Template-free word spotting in low-quality manuscripts. In: International Conference on Advances in Pattern Recognition. World Scientific, pp. 1–5
Chao P, Kao C-Y, Ruan Y-S, et al (2019) Hardnet: a low memory traffic network. In: Proceedings of the IEEE International Conference on Computer Vision. pp. 3552–3561
Deng J, Dong W, Socher R, et al (2010) ImageNet: a large-scale hierarchical image database. In: IEEE conference on computer vision and pattern recognition. pp. 248–255
Djeddi C, Al-Maadeed S, Gattal A, et al (2015) ICDAR2015 competition on multi-script writer identification and gender classification using ‘QUWI’database. In: 2015 13th international conference on document analysis and recognition (ICDAR). IEEE, pp 1191–1195
Fischer A, Keller A, Frinken V, Bunke H (2012) Lexicon free handwritten word spotting using character HMMs. Pattern Recogn Lett 33:934–942
Frinken V, Fischer A, Manmatha R, Bunke H (2011) A novel word spotting method based on recurrent neural networks. IEEE Trans Pattern Anal Mach Intell 34:211–224
Giotis AP, Sfikas G, Gatos B, Nikou C (2017) A survey of document image word spotting techniques. Pattern Recogn 68:310–332. https://doi.org/10.1016/j.patcog.2017.02.023
Jegou H, Douze M, Schmid C (2010) Product quantization for nearest neighbor search. IEEE Trans Pattern Anal Mach Intell 33:117–128
Khayyat M, Lam L, Suen CY (2014) Learning-based word spotting system for Arabic handwritten documents. Pattern Recogn 47:1021–1030. https://doi.org/10.1016/j.patcog.2013.08.014
Khurshid K, Faure C, Vincent N (2009) A novel approach for word spotting using merge-split edit distance. In: Lecture notes in computer science (including subseries lecture notes in artificial intelligence and lecture notes in bioinformatics). Springer, pp 213–220
Kovalchuk A, Wolf L, Dershowitz N (2014) A simple and fast word spotting method. In: 2014 14th international conference on Frontiers in handwriting recognition. IEEE, pp 3–8
Leydier Y, Lebourgeois F, Emptoz H (2007) Text search for medieval manuscript images. Pattern Recogn 40:3552–3567
Liang Y, Fairhurst MC, Guest RM (2012) A synthesised word approach to word retrieval in handwritten documents. Pattern Recogn 45:4225–4236. https://doi.org/10.1016/j.patcog.2012.05.024
Malakar S, Mohanta D, Sarkar R et al (2011) A new global Thresholding approach for document image Binarization. Int J Inf Process 6:48–59
Malakar S, Ghosh M, Sarkar R, Nasipuri M (2019) Development of a two-stage segmentation-based word searching method for handwritten document images. J Intell Syst 29:719–735
Manmatha R, Han C, Riseman EM (1996) Word spotting: a new approach to indexing handwriting. In: Proceedings CVPR IEEE Computer Society Conference on Computer Vision and Pattern Recognition. IEEE, pp. 631–637
Mondal T, Ragot N, Ramel J yves, Pal U (2018) Comparative study of conventional time series matching techniques for word spotting. Pattern Recogn 73:47–64. https://doi.org/10.1016/j.patcog.2017.07.011
Mondal T, Ragot N, Ramel JY, Pal U (2016) Flexible sequence matching technique: an effective learning-free approach for word spotting. Pattern Recogn 60:596–612. https://doi.org/10.1016/j.patcog.2016.05.011
Mukherjee P (2019) A two-stage approach for word searching in handwritten document images. MTech Thesis under Jadavpur University, Kolkata
Pantke W, Dennhardt M, Fecker D, et al (2014) An historical handwritten Arabic dataset for segmentation-free word spotting - HADARA80P. In: Proceedings of international conference on Frontiers in handwriting recognition, ICFHR. IEEE, pp. 15–20
Rath TM, Manmatha R (2003) Features for word spotting in historical manuscripts. In: Seventh international conference on document analysis and recognition, 2003. Proceedings. IEEE, pp. 218–222
Rath TM, Manmatha R (2007) Word spotting for historical documents. Int J Doc Anal Recognit 9:139–152
Retsinas G, Louloudis G, Stamatopoulos N, Gatos B (2016) Keyword spotting in handwritten documents using projections of oriented gradients. In: 2016 12th IAPR workshop on document analysis systems (DAS). IEEE, pp 411–416
Retsinas G, Louloudis G, Stamatopoulos N, Gatos B (2019) Efficient learning-free keyword spotting. IEEE Trans Pattern Anal Mach Intell 41:1587–1600
Riba P, Llados J, Fornes A (2015) Handwritten word spotting by inexact matching of grapheme graphs. In: Proceedings of the international conference on document analysis and recognition, ICDAR. IEEE, pp. 781–785
Riesen K, Bunke H (2009) Approximate graph edit distance computation by means of bipartite graph matching. Image Vis Comput 27:950–959
Rodríguez-Serrano JA, Perronnin F (2009) Handwritten word-spotting using hidden Markov models and universal vocabularies. Pattern Recogn 42:2106–2116. https://doi.org/10.1016/j.patcog.2009.02.005
Rothacker L, Fink GA (2015) Segmentation-free query-by-string word spotting with bag-of-features HMMs. In: Proceedings of the international conference on document analysis and recognition, ICDAR. IEEE, pp. 661–665
Rothacker L, Rusinol M, Fink GA (2013) Bag-of-features HMMs for segmentation-free word spotting in handwritten documents. In: Proceedings of the international conference on document analysis and recognition, ICDAR. IEEE, pp. 1305–1309
Roy PP, Ramel J, Ragot N (2011) Word retrieval in historical document using character-primitives. In: 2011 international conference on document analysis and recognition. Pp 678–682
Rusiñol M, Aldavert D, Toledo R, Lladós J (2015) Efficient segmentation-free keyword spotting in historical document collections. Pattern Recogn 48:545–555. https://doi.org/10.1016/j.patcog.2014.08.021
Saabni R (2013) The multi angular descriptor (mad): a binary and gray images descriptor for shape recognition. In: proceedings of the 2nd international workshop on historical document imaging and processing. ACM, pp 53–58
Saabni R, Bronstein A (2012) Fast keyword searching using “boostmap” based embedding. In: Frontiers in handwriting recognition (ICFHR), 2012 international conference on. IEEE, pp 734–739
Scot G, Loguet-Higgins H (1991) An algorithm for associating the features of two patterns. Proc R Soc Lond B 224:21–26
Sfikas G, Retsinas G, Gatos B (2016) Zoning aggregated hypercolumns for keyword spotting. In: 2016 15th international conference on Frontiers in handwriting recognition (ICFHR). IEEE, pp 283–288
Simonyan K, Zisserman A (2015) Very deep convolutional networks for large-scale image recognition. In: 3rd international conference on learning representations, ICLR 2015 - conference track proceedings
Singh PK, Chowdhury SP, Sinha S, et al (2017) Page-to-word extraction from unconstrained handwritten document images. In: Proceedings of the First International Conference on Intelligent Computing and Communication. Springer, pp. 517–525
Sudholt S, Fink GA (2016) PHOCNet : a deep convolutional neural network for word spotting in handwritten documents. In: international conference on Frontiers in handwriting recognition (ICFHR). IEEE, pp 277–282
Terasawa K, Tanaka Y (2009) Slit style HOG feature for document image word spotting. In: 2009 10th international conference on document analysis and recognition. IEEE, pp 116–120
Toselli AH, Vidal E (2013) Fast HMM-filler approach for key word spotting in handwritten documents. In: 2013 12th international conference on document analysis and recognition. IEEE, pp 501–505
Wang P, Eglin V, Garcia C, et al (2014) A coarse-to-fine word spotting approach for historical handwritten documents based on graph embedding and graph edit distance. In: Proceedings - International Conference on Pattern Recognition. IEEE, pp. 3074–3079
Wang P, Eglin V, Garcia C, et al (2014) A novel learning-free word spotting approach based on graph representation. In: proceedings - 11th IAPR international workshop on document analysis systems, DAS 2014. IEEE, pp 207–211
Acknowledgements
We would like to thank CMATER research laboratory of the Computer Science and Engineering Department, Jadavpur University, India for providing us the infrastructural support. This work is partially supported by the PURSE-II and UPE-II, Jadavpur University projects. Ram Sarkar is thankful to DST, Govt. of India, for the grant (EMR/2016/007213) to carry out this research.
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Majumder, S., Ghosh, S., Malakar, S. et al. A voting-based technique for word spotting in handwritten document images. Multimed Tools Appl 80, 12411–12434 (2021). https://doi.org/10.1007/s11042-020-10363-0
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11042-020-10363-0