Logistic Regression of Generic Codebooks for Semantic Image Retrieval

Magalhães, João; Rüger, Stefan

doi:10.1007/11788034_5

João Magalhães²⁰ &
Stefan Rüger²⁰

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 4071))

Included in the following conference series:

International Conference on Image and Video Retrieval

793 Accesses
5 Citations

Abstract

This paper is about automatically annotating images with keywords in order to be able to retrieve images with text searches. Our approach is to model keywords such as ’mountain’ and ’city’ in terms of visual features that were extracted from images. In contrast to other algorithms, each specific keyword-model considers not only its own training data but also the whole training set by utilizing correlations of visual features to refine its own model. Initially, the algorithm clusters all visual features extracted from the full imageset, captures its salient structure (e.g. mixture of clusters or patterns) and represents this as a generic codebook. Then keywords that were associated with images in the training set are encoded as a linear combination of patterns from the generic codebook. We evaluate the validity of our approach in an image retrieval scenario with two distinct large datasets of real-world photos and corresponding manual annotations.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Yavlinsky, A., Schofield, E., Rüger, S.: Automated image annotation using global features and robust nonparametric density estimation. In: Leow, W.-K., Lew, M., Chua, T.-S., Ma, W.-Y., Chaisorn, L., Bakker, E.M. (eds.) CIVR 2005. LNCS, vol. 3568, pp. 507–517. Springer, Heidelberg (2005)
Chapter Google Scholar
Carneiro, G., Vasconcelos, N.: Formulating semantic image annotation as a supervised learning problem. In: IEEE Conference on Computer Vision and Pattern Recognition, San Diego, CA, USA (2005)
Google Scholar
Westerveld, T., de Vries, A.P.: Experimental result analysis for a generative probabilistic image retrieval model. In: ACM SIGIR Conference on research and development in information retrieval, Toronto, Canada (2003)
Google Scholar
Mori, Y., Takahashi, H., Oka, R.: Image-to-word transformation based on dividing and vector quantizing images with words. In: Int’l Workshop on Multimedia Intelligent Storage and Retrieval Management, Orlando, FL, USA (1999)
Google Scholar
Vailaya, A., Figueiredo, M., Jain, A.K., Zhang, H.J.: Image classification for content-based indexing. IEEE Transactions on Image Processing 10, 117–130 (2001)
Article MATH Google Scholar
Duygulu, P., Barnard, K., de Freitas, N., Forsyth, D.: Object recognition as machine translation: Learning a lexicon for a fixed image vocabulary. In: Heyden, A., Sparr, G., Nielsen, M., Johansen, P. (eds.) ECCV 2002. LNCS, vol. 2353, pp. 97–112. Springer, Heidelberg (2002)
Chapter Google Scholar
Jeon, J., Lavrenko, V., Manmatha, R.: Automatic image annotation and retrieval using cross-media relevance models. In: ACM SIGIR Conference on research and development in information retrieval, Toronto, Canada (2003)
Google Scholar
Lavrenko, V., Manmatha, R., Jeon, J.: A model for learning the semantics of pictures. In: Neural Information Processing System Conference, Vancouver, Canada (2003)
Google Scholar
Feng, S.L., Lavrenko, V., Manmatha, R.: Multiple Bernoulli relevance models for image and video annotation. In: IEEE Conference on Computer Vision and Pattern Recognition, Cambridge, UK (2004)
Google Scholar
Barnard, K., Forsyth, D.A.: Learning the semantics of words and pictures. In: Int’l Conference on Computer Vision (2001)
Google Scholar
Blei, D., Jordan, M.: Modeling annotated data. In: ACM SIGIR Conference on research and development in information retrieval, Toronto, Canada (2003)
Google Scholar
Figueiredo, M., Jain, A.K.: Unsupervised learning of finite mixture models. IEEE Transactions on Pattern Analysis and Machine Intelligence 24, 381–396 (2002)
Article Google Scholar
Liu, D.C., Nocedal, J.: On the limited memory method for large scale optimization. Mathematical Programming B 45, 503–528 (1989)
Article MATH MathSciNet Google Scholar
Hastie, T., Tibshirani, R., Friedman, J.: The elements of statistical learning: Data mining, inference and prediction. Springer, Heidelberg (2001)
MATH Google Scholar

Download references

Author information

Authors and Affiliations

Department of Computing, Imperial College London, South Kensington Campus, London, SW7 2AZ, UK
João Magalhães & Stefan Rüger

Authors

João Magalhães
View author publications
You can also search for this author in PubMed Google Scholar
Stefan Rüger
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Arts, Media and Engineering Program, Arizona State University, 85281, Tempe, AZ,
Hari Sundaram
Intelligent Information Management Department, IBM T.J. Watson Research Center, 19 Skyline Drive, 10532, Hawthorne, NY, USA
Milind Naphade
Intelligent Information Management Department, IBM T. J. Watson Research Center, 19 Skyline Drive, 10532, Hawthorne, NY, USA
John R. Smith
Microsoft Corporation, Microsoft China R&D Group, 49 Zhichun Road, 100080, Beijing, China
Yong Rui

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Magalhães, J., Rüger, S. (2006). Logistic Regression of Generic Codebooks for Semantic Image Retrieval. In: Sundaram, H., Naphade, M., Smith, J.R., Rui, Y. (eds) Image and Video Retrieval. CIVR 2006. Lecture Notes in Computer Science, vol 4071. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11788034_5

Download citation

DOI: https://doi.org/10.1007/11788034_5
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-36018-6
Online ISBN: 978-3-540-36019-3
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics