Exploring statistical correlations for image retrieval

Wang, Xin-Jing; Ma, Wei-Ying; Li, Xing

doi:10.1007/s00530-006-0013-5

Regular Paper
Published: 25 February 2006

Volume 11, pages 340–351, (2006)
Multimedia Systems

Xin-Jing Wang¹,
Wei-Ying Ma² &
Xing Li³

Abstract

Bridging the cognitive gap in image retrieval has been an active research direction in recent years, and a key challenge is obtaining enough training data to learn the mapping functions from low-level feature spaces to high-level semantics. In this paper, image regions are classified into two types: key regions, which represent the main semantic content, and environmental regions, which represent the context. We attempt to leverage the correlations between these types of regions to improve the performance of image retrieval. A Context Expansion approach is explored to take advantage of such correlations by expanding the key regions of the queries with highly correlated environmental regions according to an image thesaurus. The thesaurus serves both as a mapping function between low-level image features and concepts and as a store of the statistical correlations between different concepts. It is constructed through a data-driven approach that uses Web data (images and their surrounding textual annotations) as the training data source to learn the region concepts and to explore the statistical correlations. Experimental results on a database of 10,000 general-purpose images show the effectiveness of our proposed approach in improving both search precision (i.e., filtering out irrelevant images) and recall (i.e., retrieving relevant images whose contexts may vary). Several major factors that affect the performance of our approach are also studied.
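The abstract does not give the thesaurus construction or the correlation measure, so the following is only a minimal sketch of the general idea of context expansion, not the authors' method. It assumes that each training image has already been reduced to a set of concept labels, and it uses a Jaccard-style co-occurrence score as a stand-in correlation. All names (`build_cooccurrence`, `correlation`, `expand_query`, the toy concept labels) are hypothetical.

```python
from collections import Counter
from itertools import combinations


def build_cooccurrence(annotated_images):
    """Count concept occurrences and pairwise co-occurrences per image.

    `annotated_images` is a list of concept-label lists, one per image.
    This toy "thesaurus" stores only co-occurrence statistics.
    """
    concept_counts = Counter()
    pair_counts = Counter()
    for concepts in annotated_images:
        unique = set(concepts)
        concept_counts.update(unique)
        # Store each unordered pair once, under a sorted key.
        for a, b in combinations(sorted(unique), 2):
            pair_counts[(a, b)] += 1
    return concept_counts, pair_counts


def correlation(a, b, concept_counts, pair_counts):
    """Jaccard-style correlation (an assumption, not the paper's measure)."""
    key = (a, b) if a < b else (b, a)
    both = pair_counts[key]
    either = concept_counts[a] + concept_counts[b] - both
    return both / either if either else 0.0


def expand_query(key_concept, concept_counts, pair_counts, top_k=2):
    """Return up to `top_k` concepts most correlated with the key concept."""
    scores = {
        c: correlation(key_concept, c, concept_counts, pair_counts)
        for c in concept_counts
        if c != key_concept
    }
    # Sort by descending score, breaking ties alphabetically.
    ranked = sorted(scores.items(), key=lambda kv: (-kv[1], kv[0]))
    return [c for c, s in ranked[:top_k] if s > 0]


# Toy annotations, invented purely for illustration.
images = [
    ["tiger", "grass"],
    ["tiger", "grass", "tree"],
    ["tiger", "water"],
    ["boat", "water"],
    ["boat", "water", "sky"],
]
counts, pairs = build_cooccurrence(images)
expanded = expand_query("tiger", counts, pairs)
print(expanded)
```

In this toy setting, a query whose key region is labelled "tiger" would be expanded with the environmental concepts that most often accompany it, and the expanded query could then be matched against database images. How the paper maps region features to concepts and weights the expanded regions is not stated in the abstract and is left out here.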

Author information

Authors and Affiliations

CERNET Center, Room 305, Tsinghua University, Beijing, 100084, China
Xin-Jing Wang
Microsoft Research Asia, 49 Zhichun Road, Beijing, 100080, China
Wei-Ying Ma
CERNET Center, Room 224, Tsinghua University, Beijing, 100084, China
Xing Li

Corresponding author

Correspondence to Xin-Jing Wang.

About this article

Cite this article

Wang, XJ., Ma, WY. & Li, X. Exploring statistical correlations for image retrieval. Multimedia Systems 11, 340–351 (2006). https://doi.org/10.1007/s00530-006-0013-5

Issue Date: April 2006
DOI: https://doi.org/10.1007/s00530-006-0013-5
