skip to main content
10.1145/1291233.1291307acmconferencesArticle/Chapter ViewAbstractPublication PagesmmConference Proceedingsconference-collections
Article

Enhancing image annotation by integrating concept ontology and text-based bayesian learning model

Published:29 September 2007Publication History

ABSTRACT

Automatic image annotation (AIA) has been a hot research topic in recent years since it can be used to support concept-based image retrieval. However, most existing AIA models depend heavily on the availability of a large number of labeled training samples, which require significant human labeling efforts. In this paper, we propose a novel learning framework which integrates text-based Bayesian model (TBM) and concept ontology to effectively expand the training set of each concept class without the need of additional human labeling efforts or collecting additional training images from other data sources. The basic idea lies in exploiting the text information from training set to provide additional effective annotations for training images so that training data for each concept class can be augmented. In this study we employ Bayesian Hierarchical Multinomial Mixture Models (BHMMMs) as our baseline AIA model. By combining additional annotations obtained from TBM into each concept class in the training phase, the performance of BHMMMs can be significantly improved on Corel image dataset with 263 testing concepts as compared to the state-of-the-art AIA models under the same experimental configurations.

References

  1. K. Barnard, P. Duygulu and D. Forsyth, "Clustering Art", In Proc. Of IEEE Computer Vision and Pattern Recognition, 2001.Google ScholarGoogle ScholarCross RefCross Ref
  2. G. Carneiro and N. Vasconcelos, "Formulating Semantic Image Annotation as a Supervised Learning Problem", In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. J. P. Fan, H. Z. Luo and Y. L. Gao, "Learning the Semantics of Images by Using Unlabeled Samples", Proceedings CVPR, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. H. M. Feng. R. Shi and T. S. Chua, "A Bootstrapping Framework for Annotating and Retrieving WWW Images", In ACM Multimedia'04, pp. 960--967, New York, 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. S. L. Feng, R. Manmatha and V. Lavrenko, "Multiple Bernoulli Relevance Models for Image and Video Annotation", Proceedings of the 2004 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, CVPR'04. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. S. Gao, D.-H. Wang and C.-H. Lee, "Automatic Image Annotation through Multi-Topic Text Categorization", Proc. ICASSP, Toulouse, France, May 2006.Google ScholarGoogle Scholar
  7. J. Jeon, V. Lavrenko, and R. Manmatha, "Automatic Image Annotation and Retrieval Using Cross-Media Relevance Models", Proc. of the 26th ACM SIGIR, 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. V. Lavrenko, R. Manmatha and J. Jeon, "A Model for Learning the Semantics of Pictures", NIPS, 2003.Google ScholarGoogle Scholar
  9. G. A. Miller, R. Beckwith, C. Fellbaum, D. Gross and K. J. Miller, "Introduction to WordNet: an on-line lexical database", Intl. Jour. Of Lexicography, pp. 235--244, 1990.Google ScholarGoogle ScholarCross RefCross Ref
  10. J. Novovicova and A. Malik, "Application of Multinomial Mixture Model to Text Classification", Pattern Recognition and Image Analysis, pp. 646--653, 2003.Google ScholarGoogle ScholarCross RefCross Ref
  11. M. Srikanth, J. Varner, M. Bowden and D. Moldovan, "Exploiting Ontologies for Automatic Image Annotation", Proceedings of the 28th ACM SIGIR, 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. R. Shi, T. S. Chua, C. H. Lee and S. Gao, "Bayesian Learning of Hierarchical Multinomial Mixture Models of Concepts for Automatic Image Annotation", In Proc. of CIVR'06, pp. 102--112, Arizona, United States, 2006. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. S. Tong and E. Chang, "Support Vector Machine Active Learning for Image Retrieval", In ACM Multimedia'01, pp.107--118, Ottawa, Canada, 2001. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. R. Yan, and A. G. Hauptmann, "Multi-class Active Learning for Video Semantic Feature Extraction", In Proc. of ICME'04, pp. 69--72, 2004.Google ScholarGoogle Scholar
  15. C. X. Zhai and J. Lafferty, "A Study of Smoothing Methods for Language Models Applied to Ad Hoc Information Retrieval", SIGIR'01, 2001. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Enhancing image annotation by integrating concept ontology and text-based bayesian learning model

        Recommendations

        Comments

        Login options

        Check if you have access through your login credentials or your institution to get full access on this article.

        Sign in
        • Published in

          cover image ACM Conferences
          MM '07: Proceedings of the 15th ACM international conference on Multimedia
          September 2007
          1115 pages
          ISBN:9781595937025
          DOI:10.1145/1291233

          Copyright © 2007 ACM

          Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

          Publisher

          Association for Computing Machinery

          New York, NY, United States

          Publication History

          • Published: 29 September 2007

          Permissions

          Request permissions about this article.

          Request Permissions

          Check for updates

          Qualifiers

          • Article

          Acceptance Rates

          Overall Acceptance Rate995of4,171submissions,24%

          Upcoming Conference

          MM '24
          MM '24: The 32nd ACM International Conference on Multimedia
          October 28 - November 1, 2024
          Melbourne , VIC , Australia

        PDF Format

        View or Download as a PDF file.

        PDF

        eReader

        View online with eReader.

        eReader