skip to main content
10.1145/1160939.1160945acmotherconferencesArticle/Chapter ViewAbstractPublication PagescvdbConference Proceedingsconference-collections
Article

EXTENT: fusing context, content, and semantic ontology for photo annotation

Published:17 June 2005Publication History

ABSTRACT

This architecture paper presents EXTENT, a probabilistic framework that uses influence diagrams to fuse metadata of multiple modalities for photo annotation. EXTENT fuses contextual information (location, time, and camera parameters), photo content (perceptual features), and semantic ontology in a synergistic way. It uses causal strengths to encode causalities between variables, and between variables and semantic labels. Through a landmark-recognition case study, we show that EXTENT can provide high-quality annotation, substantially better than any traditional unimodal methods.

References

  1. K. Barnard and D. Forsyth. Learning the semantics of words and pictures. In International Conference on Computer Vision, volume 2, pages 408--415, 2000.Google ScholarGoogle Scholar
  2. E. Y. Chang. Extent: Combining context, content, and semantic ontology for photo annotation. US Provisional Patent, 2005.Google ScholarGoogle Scholar
  3. E. Y. Chang, K. Goh, G. Sychay, and G. Wu. Content-based soft annotation for multimodal image retrieval using bayes point machines. IEEE Trans. on Circuits and Systems for Video Technology Special Issue on Conceptual and Dynamical Aspects of Multimedia Content Description, 13(1):26--38, 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. M. Davis, S. King, N. Good, and R. Sarvas. From context to content: Leveraging context to infer media metadata. ACM International Conference on Multimedia, 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. A. Deshpande, C. Guestrin, S. Madden, and W. Hong. Beyong pixels: Exploiting camera metadata for photo classification. IEEE CVPR, 2004.Google ScholarGoogle Scholar
  6. A. K. Dey. Understanding and using context. Personal and Ubiquitous Computing Journal, 5(1), 2001. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. D. S. Diomidis. Position-annotated photographs: a geotemporal web. IEEE Pervasive Computing, 2(2), 2003. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. N. Friedman and D. Koller. Learning bayesian networks from data (tutorial). NIPS, 2000.Google ScholarGoogle Scholar
  9. K.-S. Goh and E. Y. Chang. One, two class svms for multi-class image annotation. IEEE Transactions on Knowledge and Data Engineering (TKDE) (accepted), 2005. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. K.-S. Goh, E. Y. Chang, and K.-T. Cheng. Svm binary classifier ensembles for multi-class image classification. ACM International Conference on Information and Knowledge Management (CIKM), pages 395--402, 2001. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. D. Heckerman. A bayesian approach to learning causal networks. Conference on Uncertainty in Artificial Intelligence, pages 107--118, 1995.Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. D. Heckerman and R. Shachter. Decision-theoretic foundations for causal reasoning. MSR-TR-94-11, 1994.Google ScholarGoogle Scholar
  13. D. G. Lowe. Object recognition from local scale-invariant features. International Conference on Computer Vision, 1999. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. D. G. Lowe. Distinctive image features from scale-invariant keypoints. International Journal of Computer Vision, 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. M. Naaman, S. Harada, Q. Wang, H. Garcia-Molina, and A. Paepcke. Context data in geo-referenced digital photo collections. ACM International Conference on Multimedia, 2004. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. M. Naaman, A. Paepcke, and H. Garcia-Molina. From where to what: Metadata sharing for digital photographs with geographic coordinates. International Conference on Cooperative Information Systems (CoopIS), 2003.Google ScholarGoogle ScholarCross RefCross Ref
  17. L. R. Novick and P. W. Cheng. Assessing interactive causal influence. Psychological Review, 111(2):455--485, 2004.Google ScholarGoogle ScholarCross RefCross Ref
  18. J. B. Tenenbaum and T. L. Griffiths. Generalization, similarity, and bayesian inference. Behavioral and Brain Sciences, 24:629--641, 2001.Google ScholarGoogle Scholar
  19. S. Tong and E. Chang. Support vector machine active learning for image retrieval. Proceedings of ACM International Conference on Multimedia, pages 107--118, October 2001. Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. J. Williamson. Causality, in Dov Gabbay & F. Guenthner (eds.): Handbook of Philosophical Logic. Kluwer (to appear), 2005.Google ScholarGoogle Scholar
  1. EXTENT: fusing context, content, and semantic ontology for photo annotation

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in
    • Published in

      cover image ACM Other conferences
      CVDB '05: Proceedings of the 2nd international workshop on Computer vision meets databases
      June 2005
      75 pages
      ISBN:1595931511
      DOI:10.1145/1160939

      Copyright © 2005 ACM

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 17 June 2005

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • Article

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader