Semantically Relevant Image Retrieval by Combining Image and Linguistic Analysis

Lam, Tony; Singh, Rahul

doi:10.1007/11919629_77

Tony Lam²⁸ &
Rahul Singh²⁸

Part of the book series: Lecture Notes in Computer Science ((LNIP,volume 4292))

Included in the following conference series:

International Symposium on Visual Computing

1934 Accesses
6 Citations

Abstract

In this paper, we introduce a novel approach to image-based information retrieval by combining image analysis with linguistic analysis of associated annotation information. While numerous Content Based Image Retrieval (CBIR) systems exist, most of them are constrained to use images as the only source of information. In contrast, recent research, especially in the area of web-search has also used techniques that rely purely on textual information associated with an image. The proposed research adopts a conceptually different philosophy. It utilizes the information at both the image and annotation level, if it detects a strong semantic coherence between them. Otherwise, depending on the quality of information available, either of the media is selected to execute the search. Semantic similarity is defined through the use of linguistic relationships in WordNet as well as through shape, texture, and color. Our investigations lead to results that are of significance in designing multimedia information retrieval systems. These include technical details on designing cross-media retrieval strategies as well as the conclusion that combining information modalities during retrieval not only leads to more semantically relevant performance but can also help capture highly complex issues such as the emergent semantics associated with images.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Aslandogan, Y., Their, C., Yu, C., Zou, J., Rishe, N.: Using Semantic Contents and WordNet in Image Retrieval. In: Proceedings of ACM SIGIR Conference, Philadelphia (July 1997)
Google Scholar
Barnard, K., Forsyth, D.: Learning the Semantics of Words and Pictures. In: International Conference on Computer Vision, vol. 2, pp. 408–415 (2001)
Google Scholar
Carson, C., Belonge, S., Greenspan, H., Malik, J.: Blobworld: Image segmentation using Expectation-Maximization and its application to image querying. IEEE Transactions on Pattern Analysis and Machine Intelligence, SUB
Google Scholar
Chen, F., Gargi, U., Niles, L., Schütze, H.: Multi-modal browsing of images in web documents. In: Proc. SPIE Document Recognition and Retrieval (1999)
Google Scholar
La Cascia, M., Sethi, S., Sclaroff, S.: Combining Textual and Visual Cues for Content-based Image Retrieval on the World Wide Web. In: IEEE Workshop on Content-based Access of Image and Video Libraries
Google Scholar
Deng, C., He, X., Li, Z., Ma, W., Wen, J.: Hierarchical Clustering of WWW Image Search Results Using Visual, Textual and Link Information. In: Proceedings of the 12th annual ACM international conference on Multimedia, pp. 952–959 (2004)
Google Scholar
Deng, Y., Manjunath, B., Kenney, C., Moore, M., Shin, H.: An Efficient Color Representation for Image Retrieval. IEEE Transactions on Image Processing 10(1), 140–147 (2001)
Article MATH Google Scholar
Flickr, http://www.flickr.com/
Google search engine, http://www.google.com/
Jacobs, C., Finkelstein, A., Salesin, D.: Fast Multiresolution Image Querying. In: Proceedings of Computer Graphics, Annual Conference Series, pp. 277–286 (1995)
Google Scholar
Miller, G., Beckwith, R., Fellbaum, C., Gross, D., Miller, K.: Introduction to WordNet: An on-line lexical database. International Journal of Lexicography 3(4), 235–312 (1990)
Article Google Scholar
Paek, S., Sable, C.L., Hatzivassiloglou, V., Jaimes, A., Schiffman, B.H., Chang, S.-F., McKeown, K.R.: Integration of Visual and Text based Approaches for the Content Labeling and 21 Classification of Photographs. In: ACM SIGIR 1999 Workshop on Multimedia Indexing and Retrieval (1999)
Google Scholar
Rodden, K., Basalaj, W., Sinclair, D., Wood, K.R.: Does organisation by similarity assist image browsing? In: Proceedings of Human Factors in Computing Systems (2001)
Google Scholar
Sable, C., Hatzivassiloglou, V.: Text-based approaches for the categorization of images. In: Abiteboul, S., Vercoustre, A.-M. (eds.) ECDL 1999. LNCS, vol. 1696, pp. 19–38. Springer, Heidelberg (1999)
Chapter Google Scholar
Santini, S., Gupta, A., Jain, R.: Emergent Semantics Through Interaction in Image Databases. Knowledge and Data Engineering 13(3), 337–351 (2001)
Article Google Scholar
Sclaroff, S., Taycher, L., La Cascia, M.: ImageRover: A Content-Based Image Browser for the World Wide Web. In: IEEE Workshop on Content-based Access of Image and Video Libraries, TR97-005 06/97
Google Scholar
Wang, J., Li, J., Wiederhold, G.: Semantics-Sensitive Integrated Matching for Picture Libraries. IEEE Transactions on Pattern Analysis and Machine Intelligence 23(9), 947–963 (2001)
Article Google Scholar
Wang, J., Wiederhold, G., Firschein, O., Wei, S.: Content-based image indexing and searching using daubechies’ wavelets. International Journal of Digital Libraries 1(4), 311–328 (1998)
Article Google Scholar
Yee, K., Swearingen, K., Li, K., Heart, M.: Faceted Metadata for Image Search and Browsing. In: Proceedings of the Conference on Human Factors in Computing Systems, pp. 401–408 (2003)
Google Scholar
Zambrano, B., Singh, R., Bhattarai, B.: Using Linguistic Models for Image Retrieval. In: Bebis, G., Boyle, R., Koracin, D., Parvin, B. (eds.) ISVC 2005. LNCS, vol. 3804, pp. 494–501. Springer, Heidelberg (2005)
Chapter Google Scholar

Download references

Author information

Authors and Affiliations

Department of Computer Science, San Francisco State University, San Francisco, CA, 94132
Tony Lam & Rahul Singh

Authors

Tony Lam
View author publications
You can also search for this author in PubMed Google Scholar
Rahul Singh
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Department of Computer Science and Engineering, University of Nevada, Reno, USA
George Bebis
NASA Ames Research Center, Moffett Field, CA, USA
Richard Boyle
Lawrence Berkeley National Laboratory, Berkeley, CA, USA
Bahram Parvin
Desert Research Institute, Reno, NV, USA
Darko Koracin
Digital Image Research Center, Kingston University, London, UK
Paolo Remagnino
Intel, 95052, Santa Clara, CA, USA
Ara Nefian
University of California, 430, Computer Science Building, 92697-3425, Irvine, CA, USA
Gopi Meenakshisundaram
Institute for Data Analysis and Visualization, P.O. Box
Valerio Pascucci
Department of Computer Science and Engineering, Czech Technical University in Prague, Czech
Jiri Zara
Rockwell Scientific, 1049 Camino Dos Rios, 91360, Thousand Oaks, CA, USA
Jose Molineros
Computer Graphics Group, Bielefeld University, D-33501, Bielefeld, Germany
Holger Theisel
Hewlett Packard Labs, Palo Alto, CA, USA
Tom Malzbender

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Lam, T., Singh, R. (2006). Semantically Relevant Image Retrieval by Combining Image and Linguistic Analysis. In: Bebis, G., et al. Advances in Visual Computing. ISVC 2006. Lecture Notes in Computer Science, vol 4292. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11919629_77

Download citation

DOI: https://doi.org/10.1007/11919629_77
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-48626-8
Online ISBN: 978-3-540-48627-5
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics