short-paper

Image annotation using multi-correlation probabilistic matrix factorization

Authors:
Zechao Li, Institute of Automation, Chinese Academy of Sciences, Beijing, China
Jing Liu, Institute of Automation, Chinese Academy of Sciences, Beijing, China
Xiaobin Zhu, Institute of Automation, Chinese Academy of Sciences, Beijing, China
Tinglin Liu, Institute of Automation, Chinese Academy of Sciences, Beijing, China
Hanqing Lu, Institute of Automation, Chinese Academy of Sciences, Beijing, China

MM '10: Proceedings of the 18th ACM international conference on Multimedia, October 2010, Pages 1187–1190, https://doi.org/10.1145/1873951.1874183

Published: 25 October 2010

ABSTRACT

Estimating the image-word correlation is a central problem in image annotation. In this paper, we propose a multi-correlation probabilistic matrix factorization (MPMF) algorithm for this estimation. Unlike traditional solutions, which treat the image-word correlation, image similarity, and word relation independently or sequentially, MPMF integrates the three simultaneously and seamlessly. Specifically, we conduct a joint factorization of the word-to-image relation matrix, the image similarity matrix, and the word relation matrix to derive two low-dimensional sets of latent word factors and latent image factors. Finally, the annotation words of each untagged or noisily tagged image are predicted by reconstructing the image-word correlations from both sets of derived latent factors. Experimental results on the Corel dataset and a Flickr image dataset show that the proposed algorithm outperforms state-of-the-art methods.
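
The joint factorization described in the abstract can be illustrated with a small sketch. The abstract does not state the objective function or the optimizer, so everything below is an assumption: a masked squared-error fit of the word-to-image matrix `R` by `W @ V.T`, penalties that tie `V @ V.T` to the image similarity matrix `S` and `W @ W.T` to the word relation matrix `C`, and plain gradient descent with step halving. The names `joint_loss`, `joint_factorize`, `alpha`, `beta`, `lam`, and the toy data are hypothetical and are not taken from the paper.

```python
import numpy as np


def joint_loss(R, M, S, C, W, V, alpha, beta, lam):
    """Assumed objective: masked fit of R plus similarity penalties and L2 terms."""
    fit_r = np.sum((M * (R - W @ V.T)) ** 2)   # observed word-image entries only
    fit_s = np.sum((S - V @ V.T) ** 2)         # image factors should explain S
    fit_c = np.sum((C - W @ W.T) ** 2)         # word factors should explain C
    reg = np.sum(W ** 2) + np.sum(V ** 2)
    return 0.5 * (fit_r + alpha * fit_s + beta * fit_c + lam * reg)


def joint_factorize(R, M, S, C, k=3, alpha=0.1, beta=0.1, lam=0.01,
                    lr=0.01, n_iter=300, seed=0):
    """Gradient descent on joint_loss; halves the step whenever the loss rises."""
    rng = np.random.default_rng(seed)
    W = 0.1 * rng.standard_normal((R.shape[0], k))   # latent word factors
    V = 0.1 * rng.standard_normal((R.shape[1], k))   # latent image factors
    losses = [joint_loss(R, M, S, C, W, V, alpha, beta, lam)]
    for _ in range(n_iter):
        E_r = M * (W @ V.T - R)
        E_s = V @ V.T - S
        E_c = W @ W.T - C
        grad_W = E_r @ V + beta * (E_c + E_c.T) @ W + lam * W
        grad_V = E_r.T @ W + alpha * (E_s + E_s.T) @ V + lam * V
        W_new, V_new = W - lr * grad_W, V - lr * grad_V
        new_loss = joint_loss(R, M, S, C, W_new, V_new, alpha, beta, lam)
        if new_loss <= losses[-1]:
            W, V = W_new, V_new          # accept the step
            losses.append(new_loss)
        else:
            lr *= 0.5                    # reject the step and shrink the rate
    return W, V, losses


# Toy data (synthetic, for illustration only).
rng = np.random.default_rng(42)
n_words, n_images = 6, 8
R = (rng.random((n_words, n_images)) > 0.6).astype(float)   # word-to-image tags
M = (rng.random((n_words, n_images)) > 0.3).astype(float)   # 1 = entry observed
feats = rng.random((n_images, 5))
feats /= np.linalg.norm(feats, axis=1, keepdims=True)
S = feats @ feats.T                                         # cosine image similarity
co = R @ R.T
C = co / max(co.max(), 1.0)                                 # scaled word co-occurrence

W, V, losses = joint_factorize(R, M, S, C)
scores = W @ V.T                              # reconstructed image-word correlations
top_words = np.argsort(-scores, axis=0)[:2].T  # two best word indices per image
```

Here `scores[i, j]` plays the role of the reconstructed correlation between word `i` and image `j`, so ranking each column gives candidate annotation words for that image, including entries that were masked out in `M`. The weights `alpha` and `beta` control how strongly the two side matrices influence the factors; their values here are arbitrary.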

Index Terms

Information systems > Information retrieval > Retrieval models and ranking

Published in
MM '10: Proceedings of the 18th ACM international conference on Multimedia
October 2010
1836 pages
ISBN: 9781605589336
DOI: 10.1145/1873951
General Chairs: Alberto del Bimbo (University of Florence, Italy), Shih-Fu Chang (Columbia University, USA)
Program Chair: Arnold Smeulders (University of Amsterdam, NL)
Copyright © 2010 ACM
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from permissions@acm.org.
Publisher
Association for Computing Machinery
New York, NY, United States
Author Tags
image annotation
image similarity
matrix factorization
word correlation
Acceptance Rates
Overall Acceptance Rate: 995 of 4,171 submissions, 24%