Semantic interactive image retrieval combining visual and conceptual content description

Ferecatu, Marin; Boujemaa, Nozha; Crucianu, Michel

doi:10.1007/s00530-007-0094-9

Semantic interactive image retrieval combining visual and conceptual content description

Regular Paper
Published: 22 August 2007

Volume 13, pages 309–322, (2008)
Cite this article

Multimedia Systems Aims and scope Submit manuscript

Marin Ferecatu¹,
Nozha Boujemaa¹ &
Michel Crucianu²

207 Accesses
31 Citations
Explore all metrics

Abstract

We address the challenge of semantic gap reduction for image retrieval through an improved support vector machines (SVM)-based active relevance feedback framework, together with a hybrid visual and conceptual content representation and retrieval. We introduce a new feature vector based on projecting the keywords associated to an image on a set of “key concepts” with the help of an external lexical database. We then put forward two improvements of SVM-based relevance feedback method. First, to optimize the transfer of information between the user and the system, we introduce a new active learning selection criterion that minimizes redundancy between the candidate images shown to the user. Second, as most image classes span a wide range of scales in the description space, we argue that the insensitivity of the SVM to the scale of the data is desirable in this context and we show how to obtain it by using specific kernel functions. Experimental evaluations show that the joint use of the new concept-based feature vector and the visual features with our relevance feedback scheme can significantly improve the quality of the results.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

On the coupled use of signal and semantic concepts to bridge the semantic and user intention gaps for visual content retrieval

Article 14 July 2016

Automatic content based image retrieval using semantic analysis

Article 04 June 2014

Multimodal Image Retrieval Based on Keywords and Low-Level Image Features

References

Adams W.H., Iyengar G., Lin C.Y., Naphade M.R., Neti C., Nock H.J. and Smith J.R. (2003). Semantic indexing of multimedia content using visual, audio and text cues. EURASIP J. Appl. Signal Process. 3(2): 170–185
Article Google Scholar
Berg C., Christensen J.P.R. and Ressel P. (1984). Harmonic Analysis on Semigroups. Springer, Heidelberg
MATH Google Scholar
del Bimbo, A.: Visual Information Retrieval. Morgan Kaufmann (1999)
Boujemaa, N., Fauqueur, J., Ferecatu, M., Fleuret, F., Gouet, V., Saux, B.L., Sahbi, H.: Ikona: Interactive generic and specific image retrieval. In: Proceedings of the International Workshop on Multimedia Content-Based Indexing and Retrieval (MMCBIR’2001) (2001)
Brinker, K.: Incorporating diversity in active learning with support vector machines. In: Proceedings of ICML-04, International Conference on Machine Learning, pp. 59–66 (2003)
Budanitsky, A., Hirst, G.: Semantic distance in wordnet: An experimental, application-oriented evaluation of five measures. In: Proceedings of the Workshop on WordNet and Other Lexical Resources NAACL 2001 (2001)
Campbell, C., Cristianini, N., Smola, A.: Query learning with large margin classifiers. In: Proceedings of ICML-00, 17th International Conference on Machine Learning, pp. 111–118. Morgan Kaufmann (2000)
Chang, E.Y., Li, B., Wu, G., Goh, K.: Statistical learning for effective visual image retrieval. In: Proceedings of the IEEE International Conference on Image Processing (ICIP’03), pp. 609–612 (2003)
Chapelle O., Haffner P. and Vapnik V.N. (1999). Support-vector machines for histogram-based image classification. IEEE Trans. Neural Netw. 10(5): 1055–1064
Article Google Scholar
Cohn D.A., Ghahramani Z. and Jordan M.I. (1996). Active learning with statistical models. J. Artif. Intell. Res. 4: 129–145
MATH Google Scholar
Cox I.J., Miller M.L., Minka T.P., Papathomas T. and Yianilos P.N. (2000). The Bayesian image retrieval system, PicHunter: theory, implementation and psychophysical experiments. IEEE Trans. Image Process. 9(1): 20–37
Article Google Scholar
Cox, I.J., Miller, M.L., Omohundro, S.M., Yianilos, P.N.: An optimized interaction strategy for Bayesian relevance feedback. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 553–558. IEEE Computer Society (1998)
Crucianu, M., Tarel, J.P., Ferecatu, M.: A comparison of user strategies in image retrieval with relevance feedback. In: Proceedings of the 7th International Workshop on Audio–Visual Content and Information Visualization in Digital Libraries (AVIVDiLib’05), pp. 121–130 (2005)
Duygulu, P., Barnard, K., de Freitas, J.F.G., Forsyth, D.A.: Object recognition as machine translation: learning a lexicon for a fixed image vocabulary. In: Proceedings of the 7th European Conference on Computer Vision-Part IV, pp. 97–112. Springer, Heidelberg (2002)
Fellbaum, C., Miller, G (eds.).: WordNet: an Electronic Lexical Database. The MIT Press (1998)
Ferecatu, M.: Image retrieval with active relevance feedback using both visual and keyword-based descriptors. Ph.D. thesis, INRIA—Université de Versailles Saint Quentin en Yvelines, France (2005)
Ferecatu, M., Crucianu, M., Boujemaa, N.: Retrieval of difficult image classes using svm-based relevance feedback. In: Proceedings of the 6th ACM SIGMM International Workshop on Multimedia Information Retrieval, pp. 23–30 (2004)
Fleuret, F., Sahbi, H.: Scale-invariance of support vector machines based on the triangular kernel. In: 3rd International Workshop on Statistical and Computational Theories of Vision (2003)
Goh, K., Chang, E., Lai, W.: Multimodal concept-dependent active learning for image retrieval. In: ACM International Conference on Multimedia 2004, pp. 564–571 (2004)
Gonzalo, J., Verdejo, F., Chugur, I., Cigarran, J.: Indexing with wordnet synsets can improve text retrieval. In: Proceedings of the COLING/ACL 1998 Workshop on Usage of WordNet for Natural Language Processing, pp. 38–44 (1998)
Herbrich R., Graepel T. and Campbell C. (2001). Bayes point machines. J. Mach. Learning Res. 1: 245–279
Article MATH MathSciNet Google Scholar
Hofmann, T.: Probabilistic latent semantic indexing. In: Proceedings of the 22th International Conference on Research and Development in Information Retrieval (SIGIR’99), pp. 50–57 (1999)
Kherfi, M., Brahmi, D., Ziou, D.: Combining visual features with semantics for a more effective image retrieval. In: Proceedings of the 17th International Conference on Pattern Recognition (2004)
La Cascia, M., Sethi, S., Sclaroff, S.: Combining textual and visual cues for content-based image retrieval on the world wide web. In: IEEE Workshop on Content-Based Access of Image and Video Libraries, pp. 24–28 (1998)
Leacock C., Chodorow M. and Miller G.A. (1998). Using corpus statistics and WordNet relations for sense identification. Comput. Linguist. 24(1): 147–165
Google Scholar
Lenat D. (1995). Cyc: a large-scale investment in knowledge infrastructure. Commun. ACM 38(11): 33–38
Article Google Scholar
Lin, D.: An information-theoretic definition of similarity. In: Proceedings of the 15th International Conference on Machine Learning, pp. 296–304 (1998)
Liu H. and Singh P. (2004). Conceptnet: a practical commonsense reasoning tool-kit. BT Technol. J. 22(4): 211–226
Article Google Scholar
Lu, Y., Hu, C., Zhu, X., Zhang, H.J., Yang, Q.: A unified framework for semantics and feature based relevance feedback in image retrieval systems. In: Proceedings of the 8th ACM International Conference on Multimedia, pp. 31–37. ACM Press (2000)
Mihalcea, R., Moldovan, D.: Semantic indexing using wordnet senses. In: Proceedings of ACL Workshop on IR and NLP (2000)
Resnik P. (1995). Using information content to evaluate semantic similarity in a taxonomy. In: Mellish, C.S. (eds) Proceedings of the Fourteenth International Joint Conference on Artificial Intelligence., pp 448–453. Morgan Kaufmann, San Mateo
Google Scholar
Schölkopf B. (2000). The kernel trick for distances. Adv. Neural Inf. Process. Systems 12: 301–307
Google Scholar
Schölkopf, B., Smola, A.: Learning with Kernels. MIT Press (2002)
Seydoux, F., Chappelier, J.C.: Semantic indexing using minimum redundancy cut in ontologies. In: Proceedings of International Conference on Recent Advances in Natural Language Processing (RANLP 2005), pp. 486–492 (2005)
Singh, P., Lin, T., Mueller, E., Lim, G., Perkins, T., Zhu, W.: Open mind commonsense: knowledge acquisition from the general public. In: Proceedings of the First International Conference on Ontologies, Databases, and Applications of Semantics for Large Scale Information Systems (2002)
Smeulders A., Worring M., Santini S., Gupta A. and Jain R. (2000). Content-based image retrieval at the end of the early years. IEEE Trans. Pattern Anal. Mach. Intell. 22(12): 1349–1380
Article Google Scholar
Smith, J.R., Basu, S., Lin, C.Y., Naphade, M.R., Tseng, B.: Integrating features, models and semantics for content-based retrieval. In: Proceedings of the International Workshop on MultiMedia Content-Based Indexing and Retrieval (MMCBIR’01), pp. 95–98 (2001)
Tong, S., Chang, E.: Support vector machine active learning for image retrieval. In: Proceedings of the 9th ACM International Conference on Multimedia, pp. 107–118. ACM Press (2001)
Tong, S., Koller, D.: Support vector machine active learning with applications to text classification. In: Proceedings of ICML-00, 17th International Conference on Machine Learning, pp. 999–1006. Morgan Kaufmann (2000)
Wu, Z., Palmer, M.: Verb semantics and lexical selection. In: 32nd Annual Meeting of the Association for Computational Linguistics, pp. 133–138. New Mexico State University, Las Cruces, New Mexico (1994)
Zhang C. and Chen T. (2002). An active learning framework for content-based information retrieval. IEEE Trans. Multimedia 4(2): 260–268
Article Google Scholar
Zhang, H.J., Su, Z.: Improving CBIR by semantic propagation and cross-mode query expansion. In: Proceedings of the International Workshop on MultiMedia Content-Based Indexing and Retrieval (MMCBIR’01), pp. 83–86 (2001)
Zhang, R., Zhang, Z.M., Li, M., Ma, W.Y., Zhang, H.J.: A probabilistic semantic model for image annotation and multi-modal image retrieval. In: Proceedings of the 2005 IEEE International Conference on Computer Vision (ICCV’05) (2005)
Zhao R. and Grosky W.I. (2002). Narrowing the semantic gap—improved text based web document retrieval using visual features. IEEE Trans. Multimedia 4(2): 189–200
Article Google Scholar
Zhou X.S. and Huang T.S. (2002). Unifying keywords and visual contents in image retrieval. IEEE Multimedia 9(2): 23–33
Article MathSciNet Google Scholar
Zhou X.S. and Huang T.S. (2003). Relevance feedback for image retrieval: a comprehensive review. Multimedia Systems 8(6): 536–544
Article Google Scholar
Zhou, Z.H., Chen, K.J., Jiang, Y.: Exploiting unlabeled data in content-based image retrieval. In: Proceedings of the 15th European Conference on Machine Learning (ECML’04), pp. 525–536 (2004)

Download references

Author information

Authors and Affiliations

INRIA Rocquencourt, IMEDIA Team, BP 105 Rocquencourt, 78153, Le Chesnay Cedex, France
Marin Ferecatu & Nozha Boujemaa
CNAM Paris, Vertigo Team 292 rue St Martin, 75141, Paris Cedex 03, France
Michel Crucianu

Authors

Marin Ferecatu
View author publications
You can also search for this author in PubMed Google Scholar
Nozha Boujemaa
View author publications
You can also search for this author in PubMed Google Scholar
Michel Crucianu
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Marin Ferecatu.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Ferecatu, M., Boujemaa, N. & Crucianu, M. Semantic interactive image retrieval combining visual and conceptual content description. Multimedia Systems 13, 309–322 (2008). https://doi.org/10.1007/s00530-007-0094-9

Download citation

Received: 25 July 2007
Published: 22 August 2007
Issue Date: February 2008
DOI: https://doi.org/10.1007/s00530-007-0094-9

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Semantic interactive image retrieval combining visual and conceptual content description

Abstract

Access this article

Similar content being viewed by others

On the coupled use of signal and semantic concepts to bridge the semantic and user intention gaps for visual content retrieval

Automatic content based image retrieval using semantic analysis

Multimodal Image Retrieval Based on Keywords and Low-Level Image Features

References

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Semantic interactive image retrieval combining visual and conceptual content description

Abstract

Access this article

Similar content being viewed by others

On the coupled use of signal and semantic concepts to bridge the semantic and user intention gaps for visual content retrieval

Automatic content based image retrieval using semantic analysis

Multimodal Image Retrieval Based on Keywords and Low-Level Image Features

References

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation