skip to main content
10.1145/1101826.1101860acmconferencesArticle/Chapter ViewAbstractPublication PagesmmConference Proceedingsconference-collections
Article

Hybrid visual and conceptual image representation within active relevance feedback context

Published:10 November 2005Publication History

ABSTRACT

Many of the available image databases have keyword annotations associated with the images. In spite of the availability of good quality low-level visual features that reflect well the physical content, image retrieval based on visual features alone is subject to semantic gap. Text annotations are related to image context or semantic interpretation of the visual content and are not necessarely directly linked to the visual appearance of the images. Keywords and visual features thus provide complementary information. Using both sources of information is an advantage in many applications and recent work in this area reflects this interest. In this paper, we address the challenge of semantic gap reduction using a hybrid visual and conceptual representation of the content within an active relevance feedback context. We introduce a new feature vector, based on the keyword annotations available for the images, which makes use of conceptual information extracted from an external lexical database, information represented by a set of "core concepts". Our experiments show that the use of the proposed hybrid conceptual and visual feature vector dramatically improves the quality of the relevance feedback results.

References

  1. W. H. Adams, G. Iyengar, C.-Y. Lin, M. R. Naphade, C. Neti, H. J. Nock, and J. R. Smith. Semantic indexing of multimedia content using visual, audio and text cues. EURASIP Journal on Applied Signal Processing, 3(2):170--185, 2003.Google ScholarGoogle Scholar
  2. K. Barnard, P. Duygulu, D. Forsyth, N. de Freitas, D. M. Blei, and M. I. Jordan. Matching words and pictures. Journal of Machine Learning Research, 3:1107--1135, March 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. R. Beckwith, C. Fellbaum, D. Gross, and G. Miller. WordNet: A lexical database organized on psycholinguistic principles. In U. Zernik, editor, Lexical Acquisition: Exploiting On-Line Resources to Build a Lexicon, pages 211--232. Erlbaum, 1991.Google ScholarGoogle Scholar
  4. N. Boujemaa, J. Fauqueur, M. Ferecatu, F. Fleuret, V. Gouet, B. L. Saux, and H. Sahbi. Ikona: Interactive generic and specific image retrieval. In Proceedings of the International workshop on Multimedia Content-Based Indexing and Retrieval (MMCBIR'2001), 2001.Google ScholarGoogle Scholar
  5. A. Budanitsky and G. Hirst. Semantic distance in WordNet: An experimental, application-oriented evaluation of five measures. In Proceedings of the Workshop on WordNet and Other Lexical Resources NAACL 2001, 2001.Google ScholarGoogle Scholar
  6. I. J. Cox, M. L. Miller, S. M. Omohundro, and P. N. Yianilos. An optimized interaction strategy for Bayesian relevance feedback. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pages 553--558. IEEE Computer Society, 1998. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. A. del Bimbo. Visual Information Retrieval. Morgan Kaufmann, 1999. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. P. Duygulu, K. Barnard, J. F. G. de Freitas, and D. A. Forsyth. Object recognition as machine translation: Learning a lexicon for a fixed image vocabulary. In Proceedings of the 7th European Conference on Computer Vision-Part IV, pages 97--112. Springer-Verlag, 2002. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. C. Fellbaum and G. Miller, editors. WordNet: An Electronic Lexical Database. The MIT Press, 1998.Google ScholarGoogle Scholar
  10. M. Ferecatu, M. Crucianu, and N. Boujemaa. Retrieval of difficult image classes using SVM-based relevance feedback. In Proceedings of the 6th ACM SIGMM International Workshop on Multimedia Information Retrieval, pages 23--30, October 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. F. Fleuret and H. Sahbi. Scale-invariance of support vector machines based on the triangular kernel. In 3rd International Workshop on Statistical and Computational Theories of Vision, October 2003.Google ScholarGoogle Scholar
  12. T. Gevers and A. W. M. Smeulders. Content-based image retrieval: An overview. In G. Medioni and S. B. Kang, editors, Emerging Topics in Computer Vision. Prentice Hall, 2004.Google ScholarGoogle Scholar
  13. M. La Cascia, S. Sethi, and S. Sclaroff. Combining textual and visual cues for content-based image retrieval on the world wide web. In IEEE Workshop on Content-Based Access of Image and Video Libraries, pages 24--28, 1998. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. C. Leacock, M. Chodorow, and G. A. Miller. Using corpus statistics and WordNet relations for sense identification. Computational Linguistics, 24(1):147--165, 1998. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. M. S. Lew. Principles of Visual Information Retrieval. Springer-Verlag, 2001. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. D. Lin. An information-theoretic definition of similarity. In Proc. 15th International Conf. on Machine Learning, pages 296--304, 1998. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. V. Mezaris, I. Kompatsiaris, and M. G. Strintzis. Region-based image retrieval using an object ontology and relevance feedback. EURASIP Journal on Applied Signal Processing, 2004(6):886--901, June 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. P. Mitra and S. K. Pal. A probabilistic active support vector learning algorithm. IEEE Transactions on Pattern Analysis and Machine Intelligence, 26(3):413--418, 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. P. Resnik. Using information content to evaluate semantic similarity in a taxonomy. In C. S. Mellish, editor, Proceedings of the Fourteenth International Joint Conference on Artificial Intelligence, pages 448--453, San Mateo, Aug. 20-25 1995. Morgan Kaufmann. Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. B. Schölkopf. The kernel trick for distances. In Advances in Neural Information Processing Systems, volume 12, pages 301--307. MIT Press, 2000.Google ScholarGoogle Scholar
  21. F. Seydoux and J.-C. Chappelier. Indexation sémantique au moyen de coupes de redondance minimale dans une ontologie. In Proceedings of Traitement Automatique du Langage Naturel (TALN'05), pages 33--42, June 2005.Google ScholarGoogle Scholar
  22. A. Smeulders, M. Worring, S. Santini, A. Gupta, and R. Jain. Content-based image retrieval at the end of the early years. IEEE Trans. Pattern Analysis and Machine Intelligence, 22(12):1349--1380, 2000. Google ScholarGoogle ScholarDigital LibraryDigital Library
  23. J. R. Smith, S. Basu, C.-Y. Lin, M. R. Naphade, and B. Tseng. Integrating features, models and semantics for content-based retrieval. In Proceedings of the international workshop on MultiMedia Content-Based Indexing and Retrieval (MMCBIR'01), pages 95--98, September 2001.Google ScholarGoogle Scholar
  24. S. Tong and E. Chang. Support vector machine active learning for image retrieval. In Proceedings of the 9th ACM international conference on Multimedia, pages 107--118. ACM Press, 2001. Google ScholarGoogle ScholarDigital LibraryDigital Library
  25. Z. Wu and M. Palmer. Verb semantics and lexical selection. In 32nd. Annual Meeting of the Association for Computational Linguistics, pages 133--138, New Mexico State University, Las Cruces, New Mexico, 1994. Google ScholarGoogle ScholarDigital LibraryDigital Library
  26. H.-J. Zhang and Z. Su. Improving CBIR by semantic propagation and cross-mode query expansion. In Proceedings of the international workshop on MultiMedia Content-Based Indexing and Retrieval (MMCBIR'01), pages 83--86, September 2001.Google ScholarGoogle Scholar
  27. X. S. Zhou and T. S. Huang. Unifying keywords and visual contents in image retrieval. IEEE Multimedia, 9(2):23--33, 2002. Google ScholarGoogle ScholarDigital LibraryDigital Library
  28. X. S. Zhou and T. S. Huang. Relevance feedback for image retrieval: a comprehensive review. Multimedia Systems, 8(6):536--544, 2003.Google ScholarGoogle ScholarCross RefCross Ref

Index Terms

  1. Hybrid visual and conceptual image representation within active relevance feedback context

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in
    • Published in

      cover image ACM Conferences
      MIR '05: Proceedings of the 7th ACM SIGMM international workshop on Multimedia information retrieval
      November 2005
      274 pages
      ISBN:1595932445
      DOI:10.1145/1101826
      • General Chairs:
      • Hongjiang Zhang,
      • John Smith,
      • Qi Tian

      Copyright © 2005 ACM

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 10 November 2005

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • Article

      Upcoming Conference

      MM '24
      MM '24: The 32nd ACM International Conference on Multimedia
      October 28 - November 1, 2024
      Melbourne , VIC , Australia

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader