skip to main content
10.1145/1815330.1815377acmotherconferencesArticle/Chapter ViewAbstractPublication PagesdasConference Proceedingsconference-collections
research-article

Towards more effective distance functions for word image matching

Published: 09 June 2010 Publication History

Abstract

Matching word images has many applications in document recognition and retrieval systems. Dynamic Time Warping (DTW) is popularly used to estimate the similarity between word images. Word images are represented as sequences of feature vectors, and the cost associated with dynamic programming based alignment is considered as the dissimilarity between them. However, such approaches are computationally costly when compared to fixed length matching schemes. In this paper, we explore systematic methods for identifying appropriate distance metrics for a given database or language. This is achieved by learning query specific distance functions which can be computed online efficiently. We show that a weighted Euclidean distance can outperform DTW for matching word images. This class of distance functions are also ideal for scalability and large scale matching. Our results are validated with mean Average Precision (mAP) on a fully annotated data set of 160K word images. We then show that the learnt distance functions can even be extended to a new database to obtain accurate retrieval.

References

[1]
Digital library of india. http://dli.iiit.ac.in/.
[2]
S. Aksoy, R. M. Haralick, F. A. Cheikh, and M. Gabbouj. A weighted distance approach to relevance feedback. In ICPR, pages 4812--4815, 2000.
[3]
A. Balasubramanian, M. Meshesha, and C. V. Jawahar. Retrieval from document image collections. In Document Analysis Systems, pages 1--12, 2006.
[4]
A. Bar-Hillel, T. Hertz, N. Shental, and D. Weinshall. Learning a mahalanobis metric from equivalence constraints. Journal of Machine Learning Research, 6:937--965, 2005.
[5]
C. Böhm, S. Berchtold, and D. A. Keim. Searching in high-dimensional spaces: Index structures for improving the performance of multimedia databases. ACM Computing Surveys., 33(3):322--373, 2001.
[6]
T. E. de Campos, B. R. Babu, and M. Varma. Character recognition in natural images. In VISSAPP (2), pages 273--280, 2009.
[7]
T. Deselaers, R. Paredes, E. Vidal, and H. Ney. Learning weighted distances for relevance feedback in image retrieval. In ICPR, pages 1--4, 2008.
[8]
D. S. Doermann. The indexing and retrieval of document images: A survey. Computer Vision and Image Understanding, 70(3):287--298, 1998.
[9]
A. Franco, A. Lumini, and D. Maio. A new approach for relevance feedback through positive and negative samples. In ICPR (4), pages 905--908, 2004.
[10]
A. Frome, Y. Singer, and J. Malik. Image retrieval and classification using local distance functions. In NIPS, pages 417--424, 2006.
[11]
T. Hong, T., and J. J. Hull. Algorithms for postprocessing ocr results with visual inter-word constraints. In ICIP, 1995.
[12]
T. Konidaris, B. Gatos, K. Ntzios, I. Pratikakis, S. Theodoridis, and S. J. Perantonis. Keyword-guided word spotting in historical printed documents using synthetic data and user feedback. IJDAR, 2007.
[13]
A. Kumar, C. V. Jawahar, and R. Manmatha. Efficient search in document image collections. In ACCV (1), pages 586--595, 2007.
[14]
V. Lavrenko, T. M. Rath, and R. Manmatha. Holistic word recognition for handwritten historical documents. In DIAL, pages 278--287, 2004.
[15]
D. Lowe. Object recognition from local scale-invariant features. In ICCV, pages 1150--1157, 1999.
[16]
Y. Lu, C. L. Tan, W. Huang, and L. Fan. An approach to word image matching based on weighted hausdorff distance. ICDAR, 2001.
[17]
R. Manmatha, C. Han, and E. M. Riseman. Word spotting: A new approach to indexing handwriting. In CVPR, pages 631--637, 1996.
[18]
M. Meshesha and C. V. Jawahar. Matching word images for content-based retrieval from printed document images. IJDAR, 11(1):29--38, 2008.
[19]
V. Rasagna, A. Kumar, C. V. Jawahar, and R. Manmatha. Robust recognition of documents by fusing results of word clusters. In ICDAR, 2009.
[20]
T. M. Rath and R. Manmatha. Word image matching using dynamic time warping. In CVPR (2), pages 521--527, 2003.
[21]
T. M. Rath and R. Manmatha. Word spotting for historical documents. IJDAR, 9(2--4):139--152, 2007.
[22]
J. Rodriguez-Serrano, F. Perronnin, J. Llados, and G. Sanchez. A similarity measure between vector sequences with application to handwritten word image retrieval. CVPR, 2009.
[23]
Y. Seiji and O. Takashi. Document retrieval using relevance feedback. CICSJ Bull, 21(1):32--36, 2004.
[24]
J. B. Tenenbaum and W. T. Freeman. Separating style and content with bilinear models. Neural Computing, 12(6):1247--1283, 2000.
[25]
C. J. van Rijsbergen. Information Retrieval. Butterworth, 1979.
[26]
K. Q. Weinberger, J. Blitzer, and L. K. Saul. Distance metric learning for large margin nearest neighbor classification. In NIPS, 2005.
[27]
Q. Zheng and T. Kanungo. Morphological degradation models and their use in document image restoration. In ICIP (1), pages 193--196, 2001.
[28]
X. S. Zhou and T. S. Huang. Relevance feedback in image retrieval: A comprehensive review. Multimedia System, 8(6):536--544, 2003.

Cited By

View all
  • (2022) FSOA‐DNFNet : Incremental indexing and image classification using hybrid optimization‐based deep neuro Fuzzy network Concurrency and Computation: Practice and Experience10.1002/cpe.704934:19Online publication date: 10-May-2022
  • (2021)A Review of Deep Learning Techniques in Document Image Word SpottingArchives of Computational Methods in Engineering10.1007/s11831-021-09605-7Online publication date: 17-May-2021
  • (2015)High performance Query-by-Example keyword spotting using Query-by-String techniquesProceedings of the 2015 13th International Conference on Document Analysis and Recognition (ICDAR)10.1109/ICDAR.2015.7333860(741-745)Online publication date: 23-Aug-2015
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Other conferences
DAS '10: Proceedings of the 9th IAPR International Workshop on Document Analysis Systems
June 2010
490 pages
ISBN:9781605587738
DOI:10.1145/1815330
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 09 June 2010

Permissions

Request permissions for this article.

Check for updates

Qualifiers

  • Research-article

Conference

DAS '10

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)0
  • Downloads (Last 6 weeks)0
Reflects downloads up to 02 Mar 2025

Other Metrics

Citations

Cited By

View all
  • (2022) FSOA‐DNFNet : Incremental indexing and image classification using hybrid optimization‐based deep neuro Fuzzy network Concurrency and Computation: Practice and Experience10.1002/cpe.704934:19Online publication date: 10-May-2022
  • (2021)A Review of Deep Learning Techniques in Document Image Word SpottingArchives of Computational Methods in Engineering10.1007/s11831-021-09605-7Online publication date: 17-May-2021
  • (2015)High performance Query-by-Example keyword spotting using Query-by-String techniquesProceedings of the 2015 13th International Conference on Document Analysis and Recognition (ICDAR)10.1109/ICDAR.2015.7333860(741-745)Online publication date: 23-Aug-2015
  • (2015)Label EmbeddingInternational Journal of Computer Vision10.1007/s11263-014-0793-6113:3(193-207)Online publication date: 1-Jul-2015
  • (2014)Enhancing Word Image Retrieval in Presence of Font Variations2014 22nd International Conference on Pattern Recognition10.1109/ICPR.2014.468(2709-2714)Online publication date: Aug-2014

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media