Abstract
This paper describes a simple framework for automatically annotating images using non-parametric models of distributions of image features. We show that under this framework quite simple image properties such as global colour and texture distributions provide a strong basis for reliably annotating images. We report results on subsets of two photographic libraries, the Corel Photo Archive and the Getty Image Archive. We also show how the popular Earth Mover’s Distance measure can be effectively incorporated within this framework.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Mori, Y., Takahashi, H., Oka, R.: Image-to-word transformation based on dividing and vector quantizing images with words. In: Proceedings of the International Workshop on Multimedia Intelligent Storage and Retrieval Management (1999)
Duygulu, P., Barnard, K., de Fretias, N., Forsyth, D.: Object recognition as machine translation: Learning a lexicon for a fixed image vocabulary. In: Proceedings of the European Conference on Computer Vision, pp. 97–112 (2002)
Jeon, J., Lavrenko, V., Manmatha, R.: Automatic image annotation and retrieval using cross-media relevance models. In: Proceedings of the ACM SIGIR Conference on Research and Development in Infrmation Retrieval, pp. 119–126 (2003)
Lavrenko, V., Manmatha, R., Jeon, J.: A model for learning the semantics of pictures. In: Proceedings of the 16th Conference on Advances in Neural Information Processing Systems NIPS (2003)
Metzler, D., Manmatha, R.: An inference network approach to image retrieval. In: Proceedings of the International Conference on Image and Video Retrieval, pp. 42–50 (2004)
Feng, S., Manmatha, R., Lavrenko, V.: Multiple Bernoulli relevance models for image and video annotation. In: IEEE Conference on Computer Vision and Pattern Recognition, pp. 1002–1009 (2004)
Blei, D., Jordan, M.: Modeling annotated data. In: Proceedings of the ACM SIGIR Conference on Research and Development in Informaion Retrieval, pp. 127–134 (2003)
Blei, D., Ng, A., Jordan, M.: Latent Dirichlet allocation. Journal of Machine Learning Research 3, 993–1022 (2003)
Oliva, A., Torralba, A.: Modeling the shape of the scene: a holistic representation of the spatial envelope. International Journal of Computer Vision 42, 145–175 (2001)
Oliva, A., Torralba, A.: Scene-centered representation from spatial envelope descriptors. In: Proceedings of Biologically Motivated Computer Vision (2002)
Torralba, A., Oliva, A.: Statistics of natural image categories. Network: Computation in Neural Systems 14, 391–412 (2003)
Reiss, R.: Nonparametric estimation of smooth distribution functions. Scandinavian Journal of Statistics 8, 116–119 (1981)
Parzen, E.: On estimation of a probability density and mode. Annals of Mathematical Statistics 35, 1065–1076 (1962)
Härdle, W.: Applied Nonparametric Regression. Cambridge University Press, Cambridge (1992)
Friedman, J., Stuetzle, W., Schroeder, A.: Projection pursuit density estimation. Journal of the American Statistical Association 79, 599–608 (1984)
Rubner, Y.: The earth-mover’s distance as a metric for image retrieval. Technical Report STAN-CS-TN-98-86, Stanford University (1998)
Rubner, Y., Puzicha, J., Tomasi, C., Buhmann, J.: Empirical evaluation of dissimilarity measures for color and texture. Computer Vision and Image Understanding 84, 25–43 (2001)
Levina, E., Bickel, P.: The earth mover’s distance is the Mallows distance: Some insights from statistics. In: Proceedings of the IEEE International Conference on Computer Vision, pp. 251–256 (2001)
Jones, M., Marron, J., Sheather, S.: A brief survey of bandwidth selection for density estimation. Journal of American Statistics Association 91, 401–407 (1996)
Loader, R.: Bandwidth selection: classical or plug-in? The Annals of Statistics 27(2), 415–438 (1999)
Tamura, H.: Texture features corresponding to visual perception. IEEE Transactions. Systems, Man and Cybernetics 8(6), 460–473 (1978)
Howarth, P., Rüger, S.: Evaluation of texture features for content-based image retrieval. In: Proceedings of the International Conference on Image and Video Retrieval, pp. 326–334 (2004)
Müller, H., Marchand-Maillet, S., Pun, T.: The truth about Corel - evaluation in image retrieval. In: Proceedings of the International Conference on Image and Video Retrieval, pp. 38–49 (2002)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2005 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Yavlinsky, A., Schofield, E., Rüger, S. (2005). Automated Image Annotation Using Global Features and Robust Nonparametric Density Estimation. In: Leow, WK., Lew, M.S., Chua, TS., Ma, WY., Chaisorn, L., Bakker, E.M. (eds) Image and Video Retrieval. CIVR 2005. Lecture Notes in Computer Science, vol 3568. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11526346_54
Download citation
DOI: https://doi.org/10.1007/11526346_54
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-27858-0
Online ISBN: 978-3-540-31678-7
eBook Packages: Computer ScienceComputer Science (R0)