Geo-location driven image tagging via cross-domain learning

Nie, Weizhi; Liu, Anan; Wang, Zhongyang; Su, Yuting

doi:10.1007/s00530-014-0396-7

Geo-location driven image tagging via cross-domain learning

Special Issue Paper
Published: 17 June 2014

Volume 22, pages 395–404, (2016)
Cite this article

Multimedia Systems Aims and scope Submit manuscript

Weizhi Nie¹,
Anan Liu¹,
Zhongyang Wang¹ &
…
Yuting Su¹

360 Accesses
1 Citation
Explore all metrics

Abstract

With the rapid development of location-based social network, more and more multimedia data are uploaded by users. These data always include large-scale of independent information with both textual and visual contents. To bridge the semantic gap in between, we propose a novel cross-domain learning method for automatic image annotation with geo-location information. First, we propose the topic model-based method for popular concept extraction to adaptively construct cross-domain datasets. Then these concepts are utilized to collect the visual correlation information from Flickr. Finally, we leverage cross-domain learning method for model learning. The comparison experiments on cross-domain datasets are conducted to demonstrate the superiority of the proposed method.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Cross-domain semantic transfer from large-scale social media

Article 12 June 2014

Learning Tag Relevance by Context Analysis for Social Image Retrieval

Relevant Tag Extraction Based on Image Visual Content

References

Gao, Y., Tang, J., Hong, R., Dai, Q., Chua, T.-S., Jain, R.: W2go: a travel guidance system by automatic landmark ranking. In: ACM Multimedia, pp. 123–132 (2010)
Ji, R., Duan, L.-Y., Chen, J., Yao, H., Yuan, J., Rui, Y., Gao, W.: Location discriminative vocabulary coding for mobile landmark search. Int. J. Comput. Vis. 96(3), 290–314 (2012)
Article MATH Google Scholar
Gao, Y., Wang, F., Luan, H., Chua, T.: Brand data gathering from live social media streams. In: ACM Conference on Multimedia Retrieval (2014)
Wang, H., Huang, H., Ding, C.H.Q.: Image annotation using bi-relational graph of images and semantic labels. In: CVPR, pp. 793–800 (2011)
Sang, J., Xu, C., Liu, J.: User-aware image tag refinement via ternary semantic analysis. IEEE Trans. Multimed. 14(3–2), 883–895 (2012)
Article Google Scholar
Gao, Y., Wang, M., Luan, H., Shen, J., Yan, S., Tao, D.: Tag-based social image search with visual-text joint hypergraph learning. In: ACM Multimedia, pp. 1517–1520 (2011)
Belani, A.: Vandalism detection in wikipedia: a bag-of-words classifier approach. In: CoRR, vol. abs/1001.0700 (2010)
Yang, J., Yan, R., Hauptmann, A.: Cross-domain video concept detection using adaptive svms. In: ACM Multimedia (2007)
Jiang, Y., Wang, J., Chang, S., Ngo, C.: Domain adaptive semantic diffusion for large scale context-based video annotation. In: ICCV (2009)
III, H.D.: Frustratingly easy domain adaptation. In: CoRR, vol. abs/0907.1815 (2009)
Wu, P., Dietterich, T.: Improving svm accuracy by training on auxiliary data sources. In: ICML (2004)
Jiang, W., Zavesky, E., Chang, S., Loui, A.: Cross-domain learning methods for high-level visual concept classification. In: ICIP (2008)
Roy, S.D., Mei, T., Zeng, W., Li, S.: Towards cross-domain learning for social video popularity prediction. IEEE Trans. Multimed. 15(6):1 (2013)
Huang, J., Smola, A., Gretton, A., Borgwardt, K., Schölkopf, B.: Correcting sample selection bias by unlabeled data. In: NIPS (2006)
Storkey, A., Sugiyama, M.: Mixture regression for covariate shift. In: NIPS (2006)
Fang, Z., Zhang, Z.M.: Discriminative feature selection for multi-view cross-domain learning. In: CIKM, pp. 1321–1330 (2013)
Chen, L., Duan, L., Tsang, I.W., Xu, D.: Efficient discriminative learning of class hierarchy for many class prediction. In: ACCV, vol. 1, pp. 274–288 (2012)
Bruzzone, L., Marconcini, M.: Domain adaptation problems: a dasvm classification technique and a circular validation strategy. IEEE Trans. Pattern Anal. Mach. Intell. 32(5), 770–787 (2010)
Article Google Scholar
Bruzzone, L., Chi, M., Marconcini, M.: Transductive svms for semisupervised classification of hyperspectral data. In: IGARSS, p. 4 (2005)
Yuan, Y., Wu, F., Shao, J., Zhuang, Y.: Image annotation by semi-supervised cross-domain learning with group sparsity. J. Visual Commun. Image Rep. 24(2), 95–102 (2013)
Article Google Scholar
Si, S., Tao, D., Wang, M., Chan, K.: Social image annotation via cross-domain subspace learning. Multimed. Tools Appl. 56(1), 91–108 (2012)
Article Google Scholar
Federico, L., Nestor, D., Oscar, C.: Smitag: a social network for semantic annotation of medical images. In: CLEI (2012)
Si, S., Tao, D., Chan, K.: Cross-domain web image annotation. In: ICDM Workshops, pp. 184–189 (2009)
Denoyer, L., Gallinari, P.: A ranking based model for automatic image annotation in a social network. In: ICWSM (2010)
Han, Y., Wu, F., Zhuang, Y.: Multi-label image annotation by structural grouping sparsity. In: Social Media Modeling and Computing, pp. 97–118 (2011)
Joshi, D., Luo, J., Yu, J., Lei, P., Gallagher, A.C.: Using geotags to derive rich tag-clouds for image annotation. In: Social Media Modeling and Computing, pp. 239–256 (2011)
Gao, Y., Wang, M., Zha, Z., Shen, J., Li, X., Wu, X.: Visual-textual joint relevance learning for tag-based social image search. IEEE Trans. Image Process. 22(1), 363–373 (2013)
Article MathSciNet Google Scholar
Blei, D., Ng, A., Jordan, M.: Latent dirichlet allocation. In: NIPS (2001)
Balamurali, A., Mukherjee, S., Malu, A., Bhattacharyya, P.: Leveraging sentiment to compute word similarity. In: CoRR (2012)
Felzenszwalb, P.F., Girshick, R.B., McAllester, D.A., Ramanan, D.: Object detection with discriminatively trained part-based models. IEEE Trans. PAMI 32(9), 1627–1645 (2010)
Article Google Scholar
Viola, P.A., Jones, M.J.: Robust real-time face detection. Int. J. Comput. Vis. 57(2), 137–154 (2004)
Article Google Scholar
Sang, J., Xu, C.: Right buddy makes the difference: an early exploration of social relation analysis in multimedia applications. In: ACM Multimedia, pp. 19–28 (2012)
Ji, R., Yao, H., Liu, W., Sun, X., Tian, Q.: Task-dependent visual-codebook compression. IEEE Trans. Image Process. 21(4), 2282–2293 (2012)
Article MathSciNet Google Scholar
Yang, J., Yu, K., Gong, Y., Huang, T.: Linear spatial pyramid matching using sparse coding for image classification. In: CVPR (2009)
Sanromà, G., Alquézar, R., Serratosa, F.: A new graph matching method for point-set correspondence using the em algorithm and softassign. Comput. Vis. Image Underst. 116(2), 292–304 (2012)
Article Google Scholar
Pan, S.J., Kwok, J., Yang, Q.: Transfer learning via dimensionality reduction. In: AAAI (2008)
Taylor, M.E., Stone, P.: Cross-domain transfer for reinforcement learning. In: ICML, pp. 879–886 (2007)
Duan, L., Tsang, I., Xu, D.: Domain transfer multiple kernel learning. IEEE Trans. Pattern Anal. Mach. Intell. 34(3), 465–479 (2012)
Article Google Scholar
Lanckriet, G., Cristianini, N., Bartlett, P., Ghaoui, L., Jordan, M.: Learning the kernel matrix with semidefinite programming. J. Mach. Learn. Res. 5, 27–72 (2004)
MathSciNet MATH Google Scholar
Liu, X., Wang, L., Yin, J., Liu, L.: Incorporation of radius-info can be simple with simplemkl. Neurocomputing 89, 30–38 (2012)
Article Google Scholar
Sonnenburg, S., Rätsch, G., Schäfer, C., Schölkopf, B.: Large scale multiple kernel learning. J. Mach. Learn. Res 7, 1531–1565 (2006)
MathSciNet MATH Google Scholar
Ji, R., Gao, Y., Zhong, B., Yao, H., Tian, Q.: Mining flickr landmarks by modeling reconstruction sparsity. TOMCCAP 7, 31 (2011)
Google Scholar
Dalal, N., Triggs, B.: Histograms of oriented gradients for human detection. In: CVPR, vol. 1, pp. 886–893 (2005)
Mäenpää, T., Ojala, T., Pietikäinen, M., Soriano, M.: Robust texture classification by subsets of local binary patterns. In: ICPR (2000)
Zhang, W., Lu, Y., Xue, X., Fan, J.: Automatic image annotation with weakly labeled dataset. In: ACM Multimedia, pp. 1185–1188 (2011)
Liu, X., Gao, Y., Ji, R., Chang, S., Huang, T. S.: Localizing web videos from heterogeneous images. In: AAAI (late-breaking developments) (2013)
Gao, Y., Dai, Q.: Clip based video summarization and ranking. In: CIVR, pp. 135–140 (2008)

Download references

Acknowledgments

I would like to express my deep gratitude to Prof. Tat-Seng Chua and the NeXT group in National University of Singapore for helpful discussion. This work was supported in part by the National Natural Science Foundation of China (61100124, 21106095, 61170239, and 61202168), the Grant of Elite Scholar Program of Tianjin University, the Grant of Introducing Talents to Tianjin Normal University (5RL123), the Grant of Introduction of One Thousand High-level Talents in Three Years in Tianjin.

Author information

Authors and Affiliations

The department of Electronics Information Engineering, Tianjin University, Tianjin, China
Weizhi Nie, Anan Liu, Zhongyang Wang & Yuting Su

Authors

Weizhi Nie
View author publications
You can also search for this author in PubMed Google Scholar
Anan Liu
View author publications
You can also search for this author in PubMed Google Scholar
Zhongyang Wang
View author publications
You can also search for this author in PubMed Google Scholar
Yuting Su
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Anan Liu.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Nie, W., Liu, A., Wang, Z. et al. Geo-location driven image tagging via cross-domain learning. Multimedia Systems 22, 395–404 (2016). https://doi.org/10.1007/s00530-014-0396-7

Download citation

Published: 17 June 2014
Issue Date: July 2016
DOI: https://doi.org/10.1007/s00530-014-0396-7

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Geo-location driven image tagging via cross-domain learning

Abstract

Access this article

Similar content being viewed by others

Cross-domain semantic transfer from large-scale social media

Learning Tag Relevance by Context Analysis for Social Image Retrieval

Relevant Tag Extraction Based on Image Visual Content

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Geo-location driven image tagging via cross-domain learning

Abstract

Access this article

Similar content being viewed by others

Cross-domain semantic transfer from large-scale social media

Learning Tag Relevance by Context Analysis for Social Image Retrieval

Relevant Tag Extraction Based on Image Visual Content

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation