Automatic image annotation with real-world community contributed data set

Tian, Feng; Shen, Xukun; Shang, Fuhua

doi:10.1007/s00530-017-0548-7

Special Issue Paper
Published: 25 March 2017

Multimedia Systems, volume 25, pages 463–474 (2019)

Abstract

With the explosive growth of social multimedia communities, social images have become a familiar part of daily life. The labels attached to these images are a valuable resource for automatic image annotation, but they tend to be unreliable. In this paper, we address the problem of image annotation using real-world community-contributed images and their associated incorrect, insufficient, and personalized labels. We present SNTag, a novel semantic neighborhood learning method with which the image annotation task can be carried out efficiently in real-world scenarios. First, we use image-associated labels as supervising information to guide the replenishment of training images, which makes the labels of the training images both more sufficient and more correct. Then, a “semantic balanced neighborhood” is generated for each image, allowing more rare labels to appear in the image's label list. Furthermore, we generate a “semantic consistent neighborhood” within the corresponding “semantic balanced neighborhood”, so that the retrieved neighbor images are not only visually alike but also semantically related. In contrast to earlier work, these neighbors are retrieved from the same subspace by integrating metric learning embedded in multiple labels with sparse reconstruction. Based on the neighbor set, we propose a novel algorithm that assigns the optimal labels to the image and is more robust to noise. We conduct extensive experiments on several standard real-world benchmark data sets downloaded from community websites. The experimental results demonstrate that SNTag outperforms the current state-of-the-art methods.
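
The abstract does not give the details of SNTag, so the following is only a rough sketch of the general idea behind a label-balanced neighborhood, not the authors' algorithm. It assumes images are represented as plain feature vectors compared by Euclidean distance, picks the few nearest training images for each label separately (so that rare labels still contribute neighbors), and then scores labels by distance-weighted voting. The function names `balanced_neighborhood` and `score_labels`, the parameters `k_per_label` and `bandwidth`, and the toy data are all hypothetical; the metric learning and sparse reconstruction steps mentioned in the abstract are not modelled here.

```python
import numpy as np

def balanced_neighborhood(query, feats, labels, k_per_label=2):
    """Pick up to k_per_label nearest training images for every label.

    feats:  (n, d) array of training image features (assumed representation)
    labels: (n, m) binary array, labels[i, j] == 1 if image i carries label j
    Returns the sorted indices of the selected images and all distances.
    """
    dists = np.linalg.norm(feats - query, axis=1)
    chosen = set()
    for j in range(labels.shape[1]):
        members = np.flatnonzero(labels[:, j])          # images carrying label j
        nearest = members[np.argsort(dists[members])[:k_per_label]]
        chosen.update(nearest.tolist())
    return np.array(sorted(chosen)), dists

def score_labels(neighbors, dists, labels, bandwidth=1.0):
    """Distance-weighted label voting over the selected neighbors."""
    weights = np.exp(-dists[neighbors] / bandwidth)
    return weights @ labels[neighbors]

# Toy data: label 0 is frequent, labels 1 and 2 are rare.
feats = np.array([[0.0, 0.0], [0.1, 0.0], [5.0, 5.0], [5.1, 5.0], [0.2, 0.1]])
labels = np.array([[1, 0, 0], [1, 0, 0], [0, 1, 0], [0, 0, 1], [1, 0, 0]])
query = np.array([0.0, 0.0])

neighbors, dists = balanced_neighborhood(query, feats, labels, k_per_label=1)
scores = score_labels(neighbors, dists, labels)
```

On this toy set, a plain 3-nearest-neighbor search would return only images carrying the frequent label, whereas the per-label selection keeps one representative of each rare label in the neighbor set, so every label receives a nonzero score.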

Acknowledgements

Special thanks go to the collaborators in the Lab for Media Search of the National University of Singapore for their instructive advice and useful suggestions on this work. I am deeply grateful for their help in completing it. This work is supported by the Natural Science Foundation of China (Nos. 61502094, 61402099) and the Natural Science Foundation of Heilongjiang Province of China (Nos. F2016002, F2015020).

Author information

Authors and Affiliations

School of Computer and Information Technology, Northeast Petroleum University, DaQing, 163318, China
Feng Tian & Fuhua Shang
School of Computing, National University of Singapore, Singapore, 119077, Singapore
Feng Tian
State Key Laboratory of Virtual Reality Technology and Systems, BeiHang University, Beijing, 100191, China
Xukun Shen

Corresponding author

Correspondence to Feng Tian.

About this article

Cite this article

Tian, F., Shen, X. & Shang, F. Automatic image annotation with real-world community contributed data set. Multimedia Systems 25, 463–474 (2019). https://doi.org/10.1007/s00530-017-0548-7

Issue Date: October 2019