Speeding-Up Graph-Based Keyword Spotting by Quadtree Segmentations

Stauffer, Michael; Fischer, Andreas; Riesen, Kaspar

doi:10.1007/978-3-319-64689-3_25

Michael Stauffer^16,19,
Andreas Fischer^17,18 &
Kaspar Riesen¹⁶

Part of the book series: Lecture Notes in Computer Science ((LNIP,volume 10424))

Included in the following conference series:

International Conference on Computer Analysis of Images and Patterns

1360 Accesses
2 Citations

Abstract

Keyword Spotting (KWS) improves the accessibility to handwritten historical documents by unconstrained retrievals of keywords. The proposed KWS framework operates on segmented words that are in turn represented as graphs. The actual KWS process is based on matching graphs by means of a cubic-time graph matching algorithm. Although this matching algorithm is quite efficient, the polynomial time complexity might still be a limiting factor (especially in case of large documents). The present paper introduces a novel approach that aims at speeding up the retrieval process. The basic idea is to first segment individual graphs into smaller subgraphs by means of a quadtree procedure. Eventually, the graph matching procedure can be conducted on the resulting pairs of smaller subgraphs. In an experimental evaluation on two benchmark datasets we empirically confirm substantial speed-ups while the KWS accuracy is nearly not affected.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 59.99; Price excludes VAT (USA)

Softcover Book: USD 79.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

1.
George Washington Papers at the Library of Congress, 1741–1799: Series 2, Letterbook 1, pp. 270–279 & pp. 300–309, http://memory.loc.gov/ammem/gwhtml/gwseries2.html.
2.
Parzival at IAM historical document database, http://www.fki.inf.unibe.ch/databases/iam-historical-document-database/parzival-database.
3.
BP stand for bipartite (LSAPs are also termed bipartite matching problem).
4.
We carry out our experiments on a high performance computing cluster with dozens of 2.2 GHz CPU nodes. Hence, these readings refer to the average matching time per keyword measured in a sequential scenario.

References

Fernandez-Mota, D., Almazan, J., Cirera, N., Fornes, A., Llados, J.: BH2M: The Barcelona historical, handwritten marriages database. In: International Conference on Pattern Recognition, pp. 256–261 (2014)
Google Scholar
Fischer, A., Frinken, V., Fornés, A., Bunke, H.: Transcription alignment of Latin manuscripts using hidden Markov models. In: Workshop on Historical Document Imaging and Processing, New York, p. 29 (2011)
Google Scholar
Fischer, A., Keller, A., Frinken, V., Bunke, H.: Lexicon-free handwritten word spotting using character HMMs. Pattern Recognit. Lett. 33(7), 934–942 (2012)
Article Google Scholar
Manmatha, R., Han, C., Riseman, E.: Word spotting: a new approach to indexing handwriting. In: Computer Vision and Pattern Recognition, pp. 631–637 (1996)
Google Scholar
Rath, T., Manmatha, R.: Word image matching using dynamic time warping. In: Computer Vision and Pattern Recognition, vol. 2, pp. II-521–II-527 (2003)
Google Scholar
Rodríguez-Serrano, J.A., Perronnin, F.: Handwritten word-spotting using hidden Markov models and universal vocabularies. Pattern Recognit. 42(9), 2106–2116 (2009)
Article MATH Google Scholar
Rodriguez, J.A., Perronnin, F.: Local gradient histogram features for word spotting in unconstrained handwritten documents. In: International Conference on Frontiers in Handwriting Recognition, pp. 7–12 (2008)
Google Scholar
Rodríguez-Serrano, J.A., Perronnin, F.: A model-based sequence similarity with application to handwritten word spotting. IEEE Trans. Pattern Anal. Mach. Intell. 34(11), 2108–2120 (2012)
Article Google Scholar
Perronnin, F., Rodriguez-Serrano, J.A.: Fisher kernels for handwritten word-spotting. In: International Conference on Document Analysis and Recognition, pp. 106–110 (2009)
Google Scholar
Conte, D., Foggia, P., Sansone, C., Vento, M.: Thirty years of graph matching in pattern recognition. Int. J. Pattern Recognit. Artif. Intell. 18(03), 265–298 (2004)
Article Google Scholar
Riesen, K.: Structural Pattern Recognition with Graph Edit Distance. Advances in Computer Vision and Pattern Recognition. Springer, Cham (2015). doi:10.1007/978-3-319-27252-8
Book MATH Google Scholar
Stauffer, M., Tschachtli, T., Fischer, A., Riesen, K.: A survey on applications of bipartite graph edit distance. In: Foggia, P., Liu, C.-L., Vento, M. (eds.) GbRPR 2017. LNCS, vol. 10310, pp. 242–252. Springer, Cham (2017). doi:10.1007/978-3-319-58961-9_22
Chapter Google Scholar
Wang, P., Eglin, V., Garcia, C., Largeron, C., Llados, J., Fornes, A.: A novel learning-free word spotting approach based on graph representation. In: International Workshop on Document Analysis Systems, pp. 207–211 (2014)
Google Scholar
Bui, Q.A., Visani, M., Mullot, R.: Unsupervised word spotting using a graph representation based on invariants. In: International Conference on Document Analysis and Recognition, pp. 616–620 (2015)
Google Scholar
Riba, P., Llados, J., Fornes, A.: Handwritten word spotting by inexact matching of grapheme graphs. In: International Conference on Document Analysis and Recognition, pp. 781–785 (2015)
Google Scholar
Stauffer, M., Fischer, A., Riesen, K.: Graph-based keyword spotting in historical handwritten documents. In: International Workshop on Structural, Syntactic, and Statistical Pattern Recognition (2016)
Google Scholar
Stauffer, M., Fischer, A., Riesen, K.: A novel graph database for handwritten word images. In: International Workshop on Structural, Syntactic, and Statistical Pattern Recognition (2016)
Google Scholar
Stauffer, M., Fischer, A., Riesen, K.: Speeding-up graph-based keyword spotting in historical handwritten documents. In: Foggia, P., Liu, C.-L., Vento, M. (eds.) GbRPR 2017. LNCS, vol. 10310, pp. 83–93. Springer, Cham (2017). doi:10.1007/978-3-319-58961-9_8
Chapter Google Scholar
Bunke, H., Allermann, G.: Inexact graph matching for structural pattern recognition. Pattern Recognit. Lett. 1(4), 245–253 (1983)
Article MATH Google Scholar
Berretti, S., Del Bimbo, A., Vicario, E.: Efficient matching and indexing of graph models in content-based retrieval. IEEE Trans. Pattern Anal. Mach. Intell. 23(10), 1089–1105 (2001)
Article Google Scholar
Fankhauser, S., Riesen, K., Bunke, H.: Speeding up graph edit distance computation through fast bipartite matching. In: Jiang, X., Ferrer, M., Torsello, A. (eds.) GbRPR 2011. LNCS, vol. 6658, pp. 102–111. Springer, Heidelberg (2011). doi:10.1007/978-3-642-20844-7_11
Chapter Google Scholar
Koopmans, T.C., Beckmann, M.: Assignment problems and the location of economic activities. Econometrica 25(1), 53 (1957)
Article MathSciNet MATH Google Scholar
Riesen, K., Bunke, H.: Approximate graph edit distance computation by means of bipartite graph matching. Image Vis. Comput. 27(7), 950–959 (2009)
Article Google Scholar
Burkard, R., Dell’Amico, M., Martello, S.: Assignment Problems (2009)
Google Scholar

Download references

Acknowledgments

This work has been supported by the Hasler Foundation Switzerland.

Author information

Authors and Affiliations

Institute for Information Systems, University of Applied Sciences and Arts Northwestern Switzerland, Riggenbachstr. 16, 4600, Olten, Switzerland
Michael Stauffer & Kaspar Riesen
Department of Informatics, University of Fribourg, 1700, Fribourg, Switzerland
Andreas Fischer
Institute for Complex Systems, University of Applied Sciences and Arts Western Switzerland, 1705, Fribourg, Switzerland
Andreas Fischer
Department of Informatics, University of Pretoria, Pretoria, South Africa
Michael Stauffer

Authors

Michael Stauffer
View author publications
You can also search for this author in PubMed Google Scholar
Andreas Fischer
View author publications
You can also search for this author in PubMed Google Scholar
Kaspar Riesen
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Michael Stauffer .

Editor information

Editors and Affiliations

Linköping University, Linköping, Sweden
Michael Felsberg
Lund University, Lund, Sweden
Anders Heyden
University of Southern Denmark, Odense, Denmark
Norbert Krüger

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Stauffer, M., Fischer, A., Riesen, K. (2017). Speeding-Up Graph-Based Keyword Spotting by Quadtree Segmentations. In: Felsberg, M., Heyden, A., Krüger, N. (eds) Computer Analysis of Images and Patterns. CAIP 2017. Lecture Notes in Computer Science(), vol 10424. Springer, Cham. https://doi.org/10.1007/978-3-319-64689-3_25

Download citation

DOI: https://doi.org/10.1007/978-3-319-64689-3_25
Published: 28 July 2017
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-64688-6
Online ISBN: 978-3-319-64689-3
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics