short-paper

Semi-supervised topic modeling for image annotation

Authors:

Hujun BaoAuthors Info & Claims

MM '09: Proceedings of the 17th ACM international conference on Multimedia

Pages 521 - 524

https://doi.org/10.1145/1631272.1631346

Published: 19 October 2009 Publication History

Get Access

Abstract

We propose a novel technique for semi-supervised image annotation which introduces a harmonic regularizer based on the graph Laplacian of the data into the probabilistic semantic model for learning latent topics of the images. By using a probabilistic semantic model, we connect visual features and textual annotations of images by their latent topics. Meanwhile, we incorporate the manifold assumption into the model to say that the probabilities of latent topics of images are drawn from a manifold, so that for images sharing similar visual features or the same annotations, their probability distribution of latent topics should also be similar. We create a nearest neighbor graph to model the manifold and propose a regularized EM algorithm to simultaneously learn a generative model and assign probability density of latent topics to images discriminatively. In this way, databases with very few labeled images can be annotated better than previous works.

References

[1]

K. Barnard, P. Duygulu, D. Forsyth, N. de Freitas, D. M. Blei, and M. I. Jordan. Matching words and pictures. Journal of Machine Learning Research, 3:1107--1135, 2003.

Digital Library

Google Scholar

[2]

M. Belkin, P. Niyogi, and V. Sindhwani. Manifold regularization: A geometric framework for learning from labeled and unlabeled examples. Journal of Machine Learning Research, 7:2399--2434, 2006.

Digital Library

Google Scholar

[3]

D. M. Blei and M. I. Jordan. Modeling annotated data. In Proc. ACM Int. Conf. on Research and Development in Informaion Retrieval(ACM SIGIR), pages 127--134, 2003.

Digital Library

Google Scholar

[4]

D. Cai, Q. Mei, J. Han, and C. Zhai. Modeling hidden topics on document manifold. In Proc. ACM Conf. on Information and knowledge management(CIKM'08), pages 911--920, 2008.

Digital Library

Google Scholar

[5]

G. Csurka, C. R. Dance, L. Fan, J. Willamowski, and C. Bray. Visual categorization with bags of keypoints. In Workshop on Statistical Learning in Computer Vision, ECCV, pages 1--22, 2004.

Google Scholar

[6]

X. He, D. Cai, Y. Shao, H. Bao, and J. Han. Laplacian regularized gaussian mixture model for data clustering. Preprint.

Google Scholar

[7]

Q. Mei, D. Cai, D. Zhang, and C. Zhai. Topic modeling with network regularization. In Proc. ACM Int. Conf. on World Wide Web (WWW'08), pages 101--110, 2008.

Digital Library

Google Scholar

[8]

F. Monay and D. Gatica-Perez. On image auto-annotation with latent space models. In Proc. ACM Int. Conf. on Multimedia (SIGMM'03), pages 275--278, 2003.

Digital Library

Google Scholar

[9]

F. Monay and D. Gatica-Perez. Plsa-based image auto-annotation: constraining the latent space. In Proc. ACM Int. Conf. on Multimedia (SIGMM'04), pages 348--351, 2004.

Digital Library

Google Scholar

[10]

R. M. Neal and G. E. Hinton. A view of the em algorithm that justifies incremental, sparse, and other variants. In Learning in graphical models, pages 355--368. 1999.

Digital Library

Google Scholar

[11]

R. Zhang, Z. M. Zhang, M. Li, W.-Y. Ma, and H.-J. Zhang. A probabilistic semantic model for image annotation and multi-modal image retrieval. In Proc. IEEE Int. Conf. on Computer Vision (ICCV'05), pages 846--851, 2005.

Digital Library

Google Scholar

[12]

X. Zhu, J. Lafferty, and Z. Ghahramani. Semi-supervised learning using gaussian fields and harmonic functions. In Proc. Int. Conf. Machine Learning(ICML'05), 2005.

Google Scholar

Cited By

View all

Tao DTao DLi XGao X(2017)Large Sparse Cone Non-negative Matrix Factorization for Image AnnotationACM Transactions on Intelligent Systems and Technology10.1145/29873798:3(1-21)Online publication date: 20-Apr-2017
https://dl.acm.org/doi/10.1145/2987379
Tao DCheng JGao XLi XDeng C(2017)Robust Sparse Coding for Mobile Image Labeling on the CloudIEEE Transactions on Circuits and Systems for Video Technology10.1109/TCSVT.2016.253977827:1(62-72)Online publication date: 1-Jan-2017
https://dl.acm.org/doi/10.1109/TCSVT.2016.2539778
Tao DYang XLiu WSun SGuo YYu YPang J(2017)Cauchy Estimator Discriminant Learning for RGB-D Sensor-based Scene ClassificationMultimedia Tools and Applications10.1007/s11042-016-3370-x76:3(4471-4489)Online publication date: 1-Feb-2017
https://dl.acm.org/doi/10.1007/s11042-016-3370-x
Show More Cited By

Index Terms

Semi-supervised topic modeling for image annotation
1. Information systems
  1. Information retrieval
    1. Document representation
    2. Search engine architectures and scalability
      1. Search engine indexing

Recommendations

Opinion integration through semi-supervised topic modeling
WWW '08: Proceedings of the 17th international conference on World Wide Web

Web 2.0 technology has enabled more and more people to freely express their opinions on the Web, making the Web an extremely valuable source for mining user opinions about all kinds of topics. In this paper we study how to automatically integrate ...
Automatic image annotation using semi-supervised generative modeling

Image annotation approaches need an annotated dataset to learn a model for the relation between images and words. Unfortunately, preparing a labeled dataset is highly time consuming and expensive. In this work, we describe the development of an ...
A Novel Region-based Image Annotation Using Multi-instance Learning
WKDD '09: Proceedings of the 2009 Second International Workshop on Knowledge Discovery and Data Mining

In this paper, we formulate image annotation as a semi-supervised learning problem under multi-instance learning framework. A novel graph based semi-supervised learning approach to image annotation using multiple instances is presented, which extends ...

Comments

Information & Contributors

Information

Published In

MM '09: Proceedings of the 17th ACM international conference on Multimedia

October 2009

1202 pages

ISBN:9781605586083

DOI:10.1145/1631272

General Chairs:
Wen Gao
Peking University, China
,
Yong Rui
Microsoft, China
,
Alan Hanjalic
Delft University of Technology, The Netherlands
,
Program Chairs:
Changsheng Xu
Institute of Automation, Chinese Academy of Sciences, China
,
Eckehard Steinbach
Technical University of Munich, Germany
,
Abdulmotaleb El Saddik
University of Ottawa, Canada
,
Michelle Zhou
IBM T. J. Watson Research Center, USA

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 19 October 2009

Permissions

Request permissions for this article.

Request Permissions

Check for updates

Author Tags

Qualifiers

Short-paper

Conference

MM09

Sponsor:

SIGMM

MM09: ACM Multimedia Conference

October 19 - 24, 2009

Beijing, China

Acceptance Rates

Overall Acceptance Rate 2,145 of 8,556 submissions, 25%

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

11
Total Citations
View Citations
414
Total Downloads

Downloads (Last 12 months)1
Downloads (Last 6 weeks)0

Reflects downloads up to 28 Feb 2025

Other Metrics

View Author Metrics

Citations

Cited By

View all

Tao DTao DLi XGao X(2017)Large Sparse Cone Non-negative Matrix Factorization for Image AnnotationACM Transactions on Intelligent Systems and Technology10.1145/29873798:3(1-21)Online publication date: 20-Apr-2017
https://dl.acm.org/doi/10.1145/2987379
Tao DCheng JGao XLi XDeng C(2017)Robust Sparse Coding for Mobile Image Labeling on the CloudIEEE Transactions on Circuits and Systems for Video Technology10.1109/TCSVT.2016.253977827:1(62-72)Online publication date: 1-Jan-2017
https://dl.acm.org/doi/10.1109/TCSVT.2016.2539778
Tao DYang XLiu WSun SGuo YYu YPang J(2017)Cauchy Estimator Discriminant Learning for RGB-D Sensor-based Scene ClassificationMultimedia Tools and Applications10.1007/s11042-016-3370-x76:3(4471-4489)Online publication date: 1-Feb-2017
https://dl.acm.org/doi/10.1007/s11042-016-3370-x
Hamid Amiri SJamzad M(2015)Automatic image annotation using semi-supervised generative modelingPattern Recognition10.1016/j.patcog.2014.07.01248:1(174-188)Online publication date: 1-Jan-2015
https://dl.acm.org/doi/10.1016/j.patcog.2014.07.012
Zhang YWei W(2014)A jointly distributed semi-supervised topic modelNeurocomputing10.1016/j.neucom.2012.12.077134(38-45)Online publication date: 1-Jun-2014
https://dl.acm.org/doi/10.1016/j.neucom.2012.12.077
Tian D(2014)Semi-supervised learning for refining image annotation based on random walk modelKnowledge-Based Systems10.1016/j.knosys.2014.08.02372:1(72-80)Online publication date: 1-Dec-2014
https://dl.acm.org/doi/10.1016/j.knosys.2014.08.023
Ji PZhao NHao SJiang J(2014)Automatic image annotation by semi-supervised manifold kernel density estimationInformation Sciences: an International Journal10.1016/j.ins.2013.09.016281(648-660)Online publication date: 1-Oct-2014
https://dl.acm.org/doi/10.1016/j.ins.2013.09.016
Tao DJin LLiu WLi X(2013)Hessian Regularized Support Vector Machines for Mobile Image Annotation on the CloudIEEE Transactions on Multimedia10.1109/TMM.2013.223890915:4(833-844)Online publication date: 1-Jun-2013
https://dl.acm.org/doi/10.1109/TMM.2013.2238909
Dapeng Tao Lianwen Jin Zhao Yang Xuelong Li (2013)Rank Preserving Sparse Learning for Kinect Based Scene ClassificationIEEE Transactions on Cybernetics10.1109/TCYB.2013.226428543:5(1406-1417)Online publication date: Oct-2013
https://doi.org/10.1109/TCYB.2013.2264285
Zhuang LGao HLuo JLin Z(2013)Regularized Semi-Supervised Latent Dirichlet Allocation for visual concept learningNeurocomputing10.1016/j.neucom.2012.04.043119(26-32)Online publication date: Nov-2013
https://doi.org/10.1016/j.neucom.2012.04.043
Show More Cited By

View Options

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Abstract

References

Cited By

Index Terms

Recommendations

Opinion integration through semi-supervised topic modeling

Automatic image annotation using semi-supervised generative modeling

A Novel Region-based Image Annotation Using Multi-instance Learning

Comments

Information

Published In

Sponsors

Publisher

Publication History

Permissions

Check for updates

Author Tags

Qualifiers

Conference

Acceptance Rates

Contributors

Other Metrics

Bibliometrics

Article Metrics

Other Metrics

Citations

Cited By

Login options

Full Access

View options

PDF

eReader

Share

Share this Publication link

Share on social media

Affiliations