ABSTRACT
This architecture paper presents EXTENT, a probabilistic framework that uses influence diagrams to fuse multimodal metadata for photo annotation. EXTENT synergistically fuses contextual information (location, time, and camera parameters), photo content (perceptual features), and a semantic ontology. It uses causal strengths to encode the causal relationships between variables, and between variables and semantic labels. Through a landmark-recognition case study, we show that EXTENT provides high-quality annotation, substantially better than traditional unimodal methods.
EXTENT: Fusing Context, Content, and Semantic Ontology for Photo Annotation