Skip to main content

Automated Image Annotation Using Global Features and Robust Nonparametric Density Estimation

  • Conference paper
Image and Video Retrieval (CIVR 2005)

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 3568))

Included in the following conference series:

Abstract

This paper describes a simple framework for automatically annotating images using non-parametric models of distributions of image features. We show that under this framework quite simple image properties such as global colour and texture distributions provide a strong basis for reliably annotating images. We report results on subsets of two photographic libraries, the Corel Photo Archive and the Getty Image Archive. We also show how the popular Earth Mover’s Distance measure can be effectively incorporated within this framework.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Mori, Y., Takahashi, H., Oka, R.: Image-to-word transformation based on dividing and vector quantizing images with words. In: Proceedings of the International Workshop on Multimedia Intelligent Storage and Retrieval Management (1999)

    Google Scholar 

  2. Duygulu, P., Barnard, K., de Fretias, N., Forsyth, D.: Object recognition as machine translation: Learning a lexicon for a fixed image vocabulary. In: Proceedings of the European Conference on Computer Vision, pp. 97–112 (2002)

    Google Scholar 

  3. Jeon, J., Lavrenko, V., Manmatha, R.: Automatic image annotation and retrieval using cross-media relevance models. In: Proceedings of the ACM SIGIR Conference on Research and Development in Infrmation Retrieval, pp. 119–126 (2003)

    Google Scholar 

  4. Lavrenko, V., Manmatha, R., Jeon, J.: A model for learning the semantics of pictures. In: Proceedings of the 16th Conference on Advances in Neural Information Processing Systems NIPS (2003)

    Google Scholar 

  5. Metzler, D., Manmatha, R.: An inference network approach to image retrieval. In: Proceedings of the International Conference on Image and Video Retrieval, pp. 42–50 (2004)

    Google Scholar 

  6. Feng, S., Manmatha, R., Lavrenko, V.: Multiple Bernoulli relevance models for image and video annotation. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 1002–1009 (2004)

    Google Scholar 

  7. Blei, D., Jordan, M.: Modeling annotated data. In: Proceedings of the ACM SIGIR Conference on Research and Development in Informaion Retrieval, pp. 127–134 (2003)

    Google Scholar 

  8. Blei, D., Ng, A., Jordan, M.: Latent Dirichlet allocation. Journal of Machine Learning Research 3, 993–1022 (2003)

    Article  MATH  Google Scholar 

  9. Oliva, A., Torralba, A.: Modeling the shape of the scene: a holistic representation of the spatial envelope. International Journal of Computer Vision 42, 145–175 (2001)

    Article  MATH  Google Scholar 

  10. Oliva, A., Torralba, A.: Scene-centered representation from spatial envelope descriptors. In: Proceedings of Biologically Motivated Computer Vision (2002)

    Google Scholar 

  11. Torralba, A., Oliva, A.: Statistics of natural image categories. Network: Computation in Neural Systems 14, 391–412 (2003)

    Article  Google Scholar 

  12. Reiss, R.: Nonparametric estimation of smooth distribution functions. Scandinavian Journal of Statistics 8, 116–119 (1981)

    MATH  MathSciNet  Google Scholar 

  13. Parzen, E.: On estimation of a probability density and mode. Annals of Mathematical Statistics 35, 1065–1076 (1962)

    Article  MathSciNet  Google Scholar 

  14. Härdle, W.: Applied Nonparametric Regression. Cambridge University Press, Cambridge (1992)

    Google Scholar 

  15. Friedman, J., Stuetzle, W., Schroeder, A.: Projection pursuit density estimation. Journal of the American Statistical Association 79, 599–608 (1984)

    Article  MathSciNet  Google Scholar 

  16. Rubner, Y.: The earth-mover’s distance as a metric for image retrieval. Technical Report STAN-CS-TN-98-86, Stanford University (1998)

    Google Scholar 

  17. Rubner, Y., Puzicha, J., Tomasi, C., Buhmann, J.: Empirical evaluation of dissimilarity measures for color and texture. Computer Vision and Image Understanding 84, 25–43 (2001)

    Article  MATH  Google Scholar 

  18. Levina, E., Bickel, P.: The earth mover’s distance is the Mallows distance: Some insights from statistics. In: Proceedings of the IEEE International Conference on Computer Vision, pp. 251–256 (2001)

    Google Scholar 

  19. Jones, M., Marron, J., Sheather, S.: A brief survey of bandwidth selection for density estimation. Journal of American Statistics Association 91, 401–407 (1996)

    Article  MATH  MathSciNet  Google Scholar 

  20. Loader, R.: Bandwidth selection: classical or plug-in? The Annals of Statistics 27(2), 415–438 (1999)

    Article  MATH  MathSciNet  Google Scholar 

  21. Tamura, H.: Texture features corresponding to visual perception. IEEE Transactions. Systems, Man and Cybernetics 8(6), 460–473 (1978)

    Article  Google Scholar 

  22. Howarth, P., Rüger, S.: Evaluation of texture features for content-based image retrieval. In: Proceedings of the International Conference on Image and Video Retrieval, pp. 326–334 (2004)

    Google Scholar 

  23. Müller, H., Marchand-Maillet, S., Pun, T.: The truth about Corel - evaluation in image retrieval. In: Proceedings of the International Conference on Image and Video Retrieval, pp. 38–49 (2002)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2005 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Yavlinsky, A., Schofield, E., Rüger, S. (2005). Automated Image Annotation Using Global Features and Robust Nonparametric Density Estimation. In: Leow, WK., Lew, M.S., Chua, TS., Ma, WY., Chaisorn, L., Bakker, E.M. (eds) Image and Video Retrieval. CIVR 2005. Lecture Notes in Computer Science, vol 3568. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11526346_54

Download citation

  • DOI: https://doi.org/10.1007/11526346_54

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-27858-0

  • Online ISBN: 978-3-540-31678-7

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics