research-article

Leveraging loosely-tagged images and inter-object correlations for tag recommendation

Authors:

Jianping Fan

MM '10: Proceedings of the 18th ACM international conference on Multimedia

Pages 5 - 14

https://doi.org/10.1145/1873951.1873956

Published: 25 October 2010

Abstract

Large-scale collections of loosely-tagged images (i.e., images whose multiple object tags are given only at the image level) are available on the Internet, and they are an attractive resource for automatic image annotation. In this paper, a multi-task structured SVM algorithm is developed that leverages both inter-object correlations and loosely-tagged images to train a large number of inter-related object classifiers more effectively. To use loosely-tagged images for object classifier training, each image is partitioned into a set of image instances (image regions), and a multiple instance learning algorithm identifies instance labels by automatically determining the correspondences between the image-level tags and the image instances. An object correlation network is constructed to characterize the inter-object correlations explicitly and to identify the inter-related learning tasks automatically. To enhance the discrimination power of the inter-related object classifiers, the multi-task structured SVM algorithm models the inter-task relatedness more precisely and exploits the inter-object correlations during classifier training. Our experiments on a large number of inter-related object classes have provided very positive results.
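The object correlation network described in the abstract can be illustrated with a minimal sketch. The abstract does not state how the correlations are measured, so the code below assumes, purely for illustration, that two object tags are linked when their image-level co-occurrence has positive pointwise mutual information. The function names `build_correlation_network` and `related_tasks`, the `min_weight` parameter, and the toy tag sets are all hypothetical and are not taken from the paper.

```python
import math
from collections import Counter
from itertools import combinations


def build_correlation_network(tagged_images, min_weight=0.0):
    """Build {(tag_a, tag_b): weight} from image-level tag lists.

    The weight is the pointwise mutual information (PMI) of the two tags
    co-occurring on the same image. PMI is an illustrative assumption,
    not the correlation measure used in the paper.
    """
    n_images = len(tagged_images)
    tag_count = Counter()
    pair_count = Counter()
    for tags in tagged_images:
        unique = sorted(set(tags))  # sorted so each pair has one canonical order
        tag_count.update(unique)
        pair_count.update(combinations(unique, 2))

    edges = {}
    for (a, b), both in pair_count.items():
        pmi = math.log((both * n_images) / (tag_count[a] * tag_count[b]))
        if pmi > min_weight:  # keep only positively correlated pairs
            edges[(a, b)] = pmi
    return edges


def related_tasks(edges, tag):
    """Return the tags linked to `tag`, i.e. its inter-related learning tasks."""
    return sorted({b if a == tag else a for (a, b) in edges if tag in (a, b)})


# Toy loosely-tagged images (hypothetical data).
images = [
    ["sky", "sea", "boat"],
    ["sky", "sea"],
    ["sky", "grass", "cow"],
    ["grass", "cow"],
    ["sea", "boat"],
]
network = build_correlation_network(images)
print(related_tasks(network, "sea"))
```

In a multi-task setting, the neighbours returned by `related_tasks` would be the classifiers trained jointly with the classifier for the given tag. How the paper's structured SVM actually uses these links is not described in the abstract.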

Index Terms

Leveraging loosely-tagged images and inter-object correlations for tag recommendation
1. Computing methodologies
  1. Artificial intelligence
    1. Computer vision
      1. Computer vision problems
        Object recognition

Published In

MM '10: Proceedings of the 18th ACM international conference on Multimedia

October 2010

1836 pages

ISBN:9781605589336

DOI:10.1145/1873951

General Chairs: Alberto del Bimbo (University of Florence, Italy), Shih-Fu Chang (Columbia University, USA)
Program Chair: Arnold Smeulders (University of Amsterdam, NL)

Copyright © 2010 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

SIGMM: ACM Special Interest Group on Multimedia

Publisher

Association for Computing Machinery

New York, NY, United States

Qualifiers

Research-article

Conference

MM '10

Sponsor:

SIGMM

MM '10: ACM Multimedia Conference

October 25 - 29, 2010

Firenze, Italy

Acceptance Rates

Overall Acceptance Rate 554 of 2,551 submissions, 22%

Bibliometrics & Citations

Total Citations: 33
Total Downloads: 444
Downloads (Last 12 months): 2
Downloads (Last 6 weeks): 1

Reflects downloads up to 24 Jan 2025

Citations

Cited By

Yu J, Zhang B, Kuang Z, Lin D, Fan J (2017). iPrivacy: Image Privacy Protection by Identifying Sensitive Objects via Deep Multi-Task Learning. IEEE Transactions on Information Forensics and Security, 12:5, pp. 1005-1016. Online publication date: May-2017.
https://doi.org/10.1109/TIFS.2016.2636090
Cui C, Shen J, Ma J, Lian T (2017). Social tag relevance learning via ranking-oriented neighbor voting. Multimedia Tools and Applications, 76:6, pp. 8831-8857. Online publication date: 1-Mar-2017.
https://dl.acm.org/doi/10.1007/s11042-016-3512-1
Li Z, Li Z (2017). Personalized Tag Recommendation. Understanding-Oriented Multimedia Content Analysis, pp. 75-99. Online publication date: 27-May-2017.
https://doi.org/10.1007/978-981-10-3689-7_4
Huo H, Liu X, Zheng D, Wu Z, Yu S, Liu L (2017). Collaborative Filtering Fusing Label Features Based on SDAE. Advances in Data Mining. Applications and Theoretical Aspects, pp. 223-236. Online publication date: 1-Jul-2017.
https://doi.org/10.1007/978-3-319-62701-4_17
Lee S, Masoud M, Balaji J, Belkasim S, Sunderraman R, Moon S (2016). A survey of tag-based information retrieval. International Journal of Multimedia Information Retrieval, 6:2, pp. 99-113. Online publication date: 9-Dec-2016.
https://doi.org/10.1007/s13735-016-0115-6
Zhang X, Li Z, Lv X, Chen X (2016). Integrating multiple types of features for event identification in social images. Multimedia Tools and Applications, 75:6, pp. 3301-3322. Online publication date: 1-Mar-2016.
https://dl.acm.org/doi/10.1007/s11042-014-2436-x
Gupta S, Phung D, Venkatesh S (2016). Modelling multilevel data in multimedia. Multimedia Tools and Applications, 75:9, pp. 4933-4955. Online publication date: 1-May-2016.
https://dl.acm.org/doi/10.1007/s11042-014-2394-3
Qu Y, Zhang B, Fan J, Hauptmann A, Ngo C, Xue X, Jiang Y, Snoek C, Vasconcelos N (2015). Parallel AP Clustering and Re-ranking for Automatic Image-Text Alignment and Large-Scale Web Image Search. Proceedings of the 5th ACM on International Conference on Multimedia Retrieval, pp. 451-454. Online publication date: 22-Jun-2015.
https://dl.acm.org/doi/10.1145/2671188.2749294
Xia Z, Feng X, Peng J, Fan J (2015). Content-Irrelevant Tag Cleansing via Bi-Layer Clustering and Peer Cooperation. Journal of Signal Processing Systems, 81:1, pp. 29-44. Online publication date: 1-Oct-2015.
https://dl.acm.org/doi/10.1007/s11265-014-0895-y
Wu L, Huang X, Zhang C, Shepherd J, Wang Y (2015). An efficient framework of Bregman divergence optimization for co-ranking images and tags in a heterogeneous network. Multimedia Tools and Applications, 74:15, pp. 5635-5660. Online publication date: 1-Jul-2015.
https://dl.acm.org/doi/10.1007/s11042-014-1873-x
