short-paper

What is a complete set of keywords for image description & annotation on the web

Authors:

Xiaoshuai SunAuthors Info & Claims

MM '09: Proceedings of the 17th ACM international conference on Multimedia

Pages 613 - 616

https://doi.org/10.1145/1631272.1631369

Published: 19 October 2009 Publication History

Get Access

Abstract

Does there exist a compact set of keywords that can completely and effectively cover the image annotation problem by expanding from it? In this paper, we answer this question by presenting a complete set framework for image annotation, which is motivated by the existence of semantic ontology. To generate this set, we propose a cross model optimization strategy from both textual and visual information for topic decomposition, based on a so-called Bipartite LSA model, which minimize multimodal error energy functions in a probabilistic Latent Semantic Analysis model. To achieve complete set based annotation, we present a Gaussian-Kernel-Generative process based keyword generation procedure, which analogizes keyword annotation in a probabilistic generative manner. A group of experiments is performed on Washington University image database and 80,000 Flickr images with comparisons to the state-of-the-arts. Finally, potential advantages and future improvements of our framework are discussed outside the scope of topic modeling.

References

[1]

S. Zinger, C. Millet, B. Mathieu, G. Grefenstette, P. Hède, and P.-A. Moëllic. 2005. Extracting an Ontology of Portrayable Objects from WordNet. In Proceedings of the MUSCLE/ImageCLEF Workshop on Image and Video Retrieval Evaluation, 2005.

Google Scholar

[2]

T.K. Landauer, P.W. Foltz and D. Laham. 1998. An Introduction to Latent Semantic Analysis. Discourse Processes, 1998.

Google Scholar

[3]

T. Hofmann. 1999. Probabilistic Latent Semantic Indexing. In Proceedings of the 22nd annual international ACM SIGIR, 1999.

Digital Library

Google Scholar

[4]

D. Blei, A. Ng and M. Jordan. 2003. Latent Dirichlet allocation. Journal of Machine Learning Research, 2003, 3:993--1022.

Digital Library

Google Scholar

[5]

K. Barnard, P. Duygulu, N. de Freitas, D. Forsyth, D. Blei and M. Jordan. 2003. Matching words and pictures. Journal of Machine Learning Research, 2003.

Digital Library

Google Scholar

[6]

D. Blei and M. I. Jordan. 2003. Modeling annotated data. In Proceedings of the 26th Intl. ACM SIGIR, 2003.

Digital Library

Google Scholar

[7]

X. Rui, M. Li, Z. Li, W.Y. Ma and N. Yu. 2007. Bipartite graph reinforcement model for web image annotation. In Proceedings of the ACM International Conference on Multimedia, 2007.

Digital Library

Google Scholar

[8]

Y. Lu, L. Zhang, Q. Tian and W.Y. Ma. 2008. What are the High-Level Concepts with Small Semantic Gaps?. CVPR, 2008.

Google Scholar

[9]

Xianming Liu, Rongrong Ji, Hongxun Yao, Pengfei Xu, Xiaoshuai Sun, Tianqiang Liu. "Cross-Media Manifold Learning for Image Retrieval&Annotation". ACM MIR 2008, pp: 141--148, 2008.

Digital Library

Google Scholar

[10]

Rongrong Ji, Hongxun Yao, "Visual&Textual Fusion for Region Retrieval: From Both Fuzzy Matching and Bayesian Reasoning Aspects," ACM Conference on Multimedia Information Retrieval (MIR), pp.159--168, 2007.

Digital Library

Google Scholar

Cited By

View all

Liu XYao HJi RXu PSun X(2018)Bidirectional-isomorphic manifold learning at image semantic understanding & representationMultimedia Tools and Applications10.1007/s11042-011-0947-264:1(53-76)Online publication date: 30-Dec-2018
https://dl.acm.org/doi/10.1007/s11042-011-0947-2
Liu XYao HJi RXu PSun XTian QRui YNahrstedt KXu XYao HJiang SCheng J(2010)Visual topic model for web image annotationProceedings of the Second International Conference on Internet Multimedia Computing and Service10.1145/1937728.1937758(126-130)Online publication date: 30-Dec-2010
https://dl.acm.org/doi/10.1145/1937728.1937758
Liu XYao HJi R(2010)Exploring statistical properties for semantic annotation: sparse distributed and convergent assumptions for keywords2010 IEEE International Conference on Acoustics, Speech and Signal Processing10.1109/ICASSP.2010.5494954(802-805)Online publication date: Mar-2010
https://doi.org/10.1109/ICASSP.2010.5494954

Index Terms

What is a complete set of keywords for image description & annotation on the web
1. Information systems
  1. Information retrieval

Recommendations

Visual topic model for web image annotation
ICIMCS '10: Proceedings of the Second International Conference on Internet Multimedia Computing and Service

In this paper, we focus on image semantic understanding under large scale of image set, in which traditional approaches suffer from the limitations of scalability, tag correlation and noisy items. To solve these problems, a novel Visual Topic Model ...
A survey of methods for image annotation

In order to evaluate automated image annotation and object recognition algorithms, ground truth in the form of a set of images correctly annotated with text describing each image is required. In this paper, three image annotation approaches are reviewed:...
The conflict detection and resolution in knowledge merging for image annotation

Semantic annotation of images is an important step to support semantic information extraction and retrieval. However, in a multi-annotator environment, various types of conflicts such as converting, merging, and inference conflicts could arise during ...

Comments

Information & Contributors

Information

Published In

MM '09: Proceedings of the 17th ACM international conference on Multimedia

October 2009

1202 pages

ISBN:9781605586083

DOI:10.1145/1631272

General Chairs:
Wen Gao
Peking University, China
,
Yong Rui
Microsoft, China
,
Alan Hanjalic
Delft University of Technology, The Netherlands
,
Program Chairs:
Changsheng Xu
Institute of Automation, Chinese Academy of Sciences, China
,
Eckehard Steinbach
Technical University of Munich, Germany
,
Abdulmotaleb El Saddik
University of Ottawa, Canada
,
Michelle Zhou
IBM T. J. Watson Research Center, USA

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 19 October 2009

Permissions

Request permissions for this article.

Request Permissions

Check for updates

Author Tags

Qualifiers

Short-paper

Conference

MM09

Sponsor:

SIGMM

MM09: ACM Multimedia Conference

October 19 - 24, 2009

Beijing, China

Acceptance Rates

Overall Acceptance Rate 2,145 of 8,556 submissions, 25%

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

3
Total Citations
View Citations
286
Total Downloads

Downloads (Last 12 months)1
Downloads (Last 6 weeks)0

Reflects downloads up to 27 Feb 2025

Other Metrics

View Author Metrics

Citations

Cited By

View all

Liu XYao HJi RXu PSun X(2018)Bidirectional-isomorphic manifold learning at image semantic understanding & representationMultimedia Tools and Applications10.1007/s11042-011-0947-264:1(53-76)Online publication date: 30-Dec-2018
https://dl.acm.org/doi/10.1007/s11042-011-0947-2
Liu XYao HJi RXu PSun XTian QRui YNahrstedt KXu XYao HJiang SCheng J(2010)Visual topic model for web image annotationProceedings of the Second International Conference on Internet Multimedia Computing and Service10.1145/1937728.1937758(126-130)Online publication date: 30-Dec-2010
https://dl.acm.org/doi/10.1145/1937728.1937758
Liu XYao HJi R(2010)Exploring statistical properties for semantic annotation: sparse distributed and convergent assumptions for keywords2010 IEEE International Conference on Acoustics, Speech and Signal Processing10.1109/ICASSP.2010.5494954(802-805)Online publication date: Mar-2010
https://doi.org/10.1109/ICASSP.2010.5494954

View Options

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Abstract

References

Cited By

Index Terms

Recommendations

Visual topic model for web image annotation

A survey of methods for image annotation

The conflict detection and resolution in knowledge merging for image annotation

Comments

Information

Published In

Sponsors

Publisher

Publication History

Permissions

Check for updates

Author Tags

Qualifiers

Conference

Acceptance Rates

Contributors

Other Metrics

Bibliometrics

Article Metrics

Other Metrics

Citations

Cited By

Login options

Full Access

View options

PDF

eReader

Share

Share this Publication link

Share on social media

Affiliations