skip to main content
research-article

Location-Based Parallel Tag Completion for Geo-Tagged Social Image Retrieval

Published: 20 April 2017 Publication History

Abstract

Having benefited from tremendous growth of user-generated content, social annotated tags get higher importance in the organization and retrieval of large-scale image databases on Online Sharing Websites (OSW). To obtain high-quality tags from existing community contributed tags with missing information and noise, tag-based annotation or recommendation methods have been proposed for performance promotion of tag prediction. While images from OSW contain rich social attributes, they have not taken full advantage of rich social attributes and auxiliary information associated with social images to construct global information completion models. In this article, beyond the image-tag relation, we take full advantage of the ubiquitous GPS locations and image-user relationship to enhance the accuracy of tag prediction and improve the computational efficiency. For GPS locations, we define the popular geo-locations where people tend to take more images as Points of Interests (POI), which are discovered by mean shift approach. For image-user relationship, we integrate a localized prior constraint, expecting the completed tag sub-matrix in each POI to maintain consistency with users’ tagging behaviors. Based on these two key issues, we propose a unified tag matrix completion framework, which learns the image-tag relation within each POI. To solve the optimization problem, an efficient proximal sub-gradient descent algorithm is designed. The model optimization can be easily parallelized and distributed to learn the tag sub-matrix for each POI. Extensive experimental results reveal that the learned tag sub-matrix of each POI reflects the major trend of users’ tagging results with respect to different POIs and users, and the parallel learning process provides strong support for processing large-scale online image databases. To fit the response time requirement and storage limitations of Tag-based Image Retrieval (TBIR) on mobile devices, we introduce Asymmetric Locality Sensitive Hashing (ALSH) to reduce the time cost and meanwhile improve the efficiency of retrieval.

References

[1]
Kobus Barnard, Pinar Duygulu, David Forsyth, Nando De Freitas, David M. Blei, and Michael I. Jordan. 2003. Matching words and pictures. Journal of Machine Learning Research 3 (2003), 1107--1135.
[2]
Gustavo Carneiro, Antoni B. Chan, Pedro J. Moreno, and Nuno Vasconcelos. 2007. Supervised learning of semantic classes for image annotation and retrieval. IEEE Transactions on Pattern Analysis and Machine Intelligence 29, 3 (2007), 394--410.
[3]
Coralia Cartis, Nicholas I. M. Gould, and Philippe L. Toint. 2011. On the evaluation complexity of composite function minimization with applications to nonconvex nonlinear programming. SIAM Journal on Optimization 21, 4 (2011), 1721--1739.
[4]
Lin Chen, Dong Xu, Ivor W. Tsang, and Jiebo Luo. 2010. Tag-based web photo retrieval improved by batch mode re-tagging. In Proceedings of the 2010 IEEE Conference on Computer Vision and Pattern Recognition (CVPR’10). IEEE, 3440--3446.
[5]
Lin Chen, Dong Xu, Ivor W. Tsang, and Jiebo Luo. 2012. Tag-based image retrieval improved by augmented features and group-based refinement. IEEE Transactions on Multimedia 14, 4 (2012), 1057--1067.
[6]
David J. Crandall, Lars Backstrom, Daniel Huttenlocher, and Jon Kleinberg. 2009. Mapping the world’s photos. In Proceedings of the 18th International Conference on World Wide Web. ACM, 761--770.
[7]
Paolo Cremonesi, Yehuda Koren, and Roberto Turrin. 2010. Performance of recommender algorithms on top-n recommendation tasks. In Proceedings of the 4th ACM Conference on Recommender Systems. ACM, 39--46.
[8]
Mayur Datar, Nicole Immorlica, Piotr Indyk, and Vahab S Mirrokni. 2004. Locality-sensitive hashing scheme based on p-stable distributions. In Proceedings of the Twentieth Annual Symposium on Computational Geometry. ACM, 253--262.
[9]
Chaitanya Desai, Deva Ramanan, and Charless C. Fowlkes. 2011. Discriminative models for multi-class object layout. International Journal of Computer Vision 95, 1 (2011), 1--12.
[10]
Yue Gao, Meng Wang, Huanbo Luan, Jialie Shen, Shuicheng Yan, and Dacheng Tao. 2011. Tag-based social image search with visual-text joint hypergraph learning. In Proceedings of the 19th ACM International Conference on Multimedia. ACM, 1517--1520.
[11]
K.-S. Goh, Edward Y. Chang, and Beitao Li. 2005. Using one-class and two-class SVMs for multiclass image annotation. IEEE Transactions on Knowledge and Data Engineering 17, 10 (2005), 1333--1346.
[12]
Andrew Goldberg, Ben Recht, Junming Xu, Robert Nowak, and Xiaojin Zhu. 2010. Transduction with matrix completion: Three birds with one stone. In Advances in Neural Information Processing Systems. 757--765.
[13]
Matthieu Guillaumin, Thomas Mensink, Jakob Verbeek, and Cordelia Schmid. 2009. Tagprop: Discriminative metric learning in nearest neighbor models for image auto-annotation. In Proceedings of the 2009 IEEE 12th International Conference on Computer Vision. IEEE, 309--316.
[14]
Harry Halpin, Valentin Robu, and Hana Shepherd. 2007. The complex dynamics of collaborative tagging. In Proceedings of the 16th International Conference on World Wide Web. ACM, 211--220.
[15]
Bharath Hariharan, Lihi Zelnik-Manor, Manik Varma, and Svn Vishwanathan. 2010. Large scale max-margin multi-label classification with priors. In Proceedings of the 27th International Conference on Machine Learning (ICML’10). 423--430.
[16]
Choochart Haruechaiyasak and Chaianun Damrongrat. 2010. Improving Social Tag-based Image Retrieval with CBIR Technique. Springer.
[17]
Yang Hu, Mingjing Li, and Nenghai Yu. 2008. Multiple-instance ranking: Learning to rank images for image retrieval. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR’08). IEEE, 1--8.
[18]
Rongrong Ji, Ling-Yu Duan, Jie Chen, Tiejun Huang, and Wen Gao. 2014. Mining compact bag-of-patterns for low bit rate mobile visual search. IEEE Transactions on Image Processing 23, 7 (2014), 3099--3113.
[19]
Rongrong Ji, Yue Gao, Wei Liu, Xing Xie, Qi Tian, and Xuelong Li. 2015. When location meets social multimedia: A survey on vision-based recognition and mining for geo-social multimedia analytics. ACM Transactions on Intelligent Systems and Technology (TIST) 6, 1 (2015), 1.
[20]
Yu-Gang Jiang, Chong-Wah Ngo, and Jun Yang. 2007. Towards optimal bag-of-features for object categorization and semantic video retrieval. In Proceedings of the 6th ACM International Conference on Image and Video Retrieval. ACM, 494--501.
[21]
Margaret EI Kipp and D. Grant Campbell. 2006. Patterns and inconsistencies in collaborative tagging systems: An examination of tagging practices. Proceedings of the American Society for Information Science and Technology 43, 1 (2006), 1--18.
[22]
Noam Koenigstein, Parikshit Ram, and Yuval Shavitt. 2012. Efficient retrieval of recommendations in a matrix factorization framework. In Proceedings of the 21st ACM International Conference on Information and Knowledge Management. ACM, 535--544.
[23]
Sihyoung Lee, Wesley De Neve, and Yong Man Ro. 2014. Visually weighted neighbor voting for image tag relevance learning. Multimedia Tools and Applications 72, 2 (2014), 1363--1386.
[24]
Xirong Li, Cees G. M. Snoek, and Marcel Worring. 2009. Learning social tag relevance by neighbor voting. IEEE Transactions on Multimedia 11, 7 (2009), 1310--1322.
[25]
Xirong Li, Tiberio Uricchio, Lamberto Ballan, Marco Bertini, Cees GM Snoek, and Alberto Del Bimbo. 2016. Socializing the semantic gap: A comparative survey on image tag assignment, refinement, and retrieval. ACM Computing Surveys (CSUR) 49, 1 (2016), 14.
[26]
Zijia Lin, Guiguang Ding, Mingqing Hu, Jianmin Wang, and Xiaojun Ye. 2013. Image tag completion via image-specific and tag-specific linear sparse reconstructions. In Proceedings of the 2013 IEEE Conference on Computer Vision and Pattern Recognition (CVPR). IEEE, 1618--1625.
[27]
Dong Liu, Xian-Sheng Hua, Meng Wang, and HongJiang Zhang. 2009a. Boost search relevance for tag-based social image retrieval. In IEEE International Conference on Multimedia and Expo. IEEE, 1636--1639.
[28]
Dong Liu, Xian-Sheng Hua, Linjun Yang, Meng Wang, and Hong-Jiang Zhang. 2009b. Tag ranking. In Proceedings of the 18th International Conference on World Wide Web. ACM, 351--360.
[29]
Dong Liu, Shuicheng Yan, Xian-Sheng Hua, and Hong-Jiang Zhang. 2011. Image retagging using collaborative tag propagation. IEEE Transactions on Multimedia 13, 4 (2011), 702--712.
[30]
Jing Liu, Zechao Li, Jinhui Tang, Yu Jiang, and Hanqing Lu. 2014. Personalized geo-specific tag recommendation for photos on social websites. IEEE Transactions on Multimedia 16, 3 (2014), 588--600.
[31]
Siyuan Liu, Yunhuai Liu, Lionel M. Ni, Jianping Fan, and Minglu Li. 2010. Towards mobility-based clustering. In Proceedings of the 16th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. ACM, 919--928.
[32]
Julien Mairal, Francis Bach, Jean Ponce, and Guillermo Sapiro. 2010. Online learning for matrix factorization and sparse coding. Journal of Machine Learning Research 11 (2010), 19--60.
[33]
Deepshikha Mishra, Uday Pratap Singh, and Vineet Richhariya. 2014. Tag relevance for social image retrieval in accordance with neighbor voting algorithm. International Journal of Computer Science and Network Security (IJCSNS) 14, 7 (2014), 50.
[34]
Emily Moxley, Jim Kleban, and B. S. Manjunath. 2008. Spirittagger: A geo-aware tag suggestion tool mined from Flickr. In Proceedings of the 1st ACM International Conference on Multimedia Information Retrieval. ACM, 24--30.
[35]
X. Qian, X. S. Hua, Y. Y. Tang, and T. Mei. 2014. Social image tagging with diverse semantics. IEEE Transactions on Cybernetics 44, 12 (2014), 2493--2508.
[36]
Bryan C. Russell, Antonio Torralba, Kevin P. Murphy, and William T. Freeman. 2008. LabelMe: A database and web-based tool for image annotation. International Journal of Computer Vision 77, 1--3 (2008), 157--173.
[37]
Jitao Sang, Changsheng Xu, and Jing Liu. 2012. User-aware image tag refinement via ternary semantic analysis. IEEE Transactions on Multimedia 14, 3 (2012), 883--895.
[38]
Anshumali Shrivastava and Ping Li. 2014. Asymmetric LSH (ALSH) for sublinear time maximum inner product search (MIPS). In Advances in Neural Information Processing Systems. 2321--2329.
[39]
Andrea Vedaldi and Brian Fulkerson. 2010. VLFeat: An open and portable library of computer vision algorithms. In Proceedings of the 18th ACM International Conference on Multimedia. ACM, 1469--1472.
[40]
Zhi Wang, Wenwu Zhu, Peng Cui, Lifeng Sun, and Shiqiang Yang. 2013. Social media recommendation. In Social Media Retrieval. Springer, 23--42.
[41]
Shikui Wei, Dong Xu, Xuelong Li, and Yao Zhao. 2013. Joint optimization toward effective and efficient image search. IEEE Transactions on Cybernetics 43, 6 (2013), 2216--2227.
[42]
Shikui Wei, Yao Zhao, Ce Zhu, Changsheng Xu, and Zhenfeng Zhu. 2011. Frame fusion for video copy detection. IEEE Transactions on Circuits and Systems for Video Technology 21, 1 (2011), 15--28.
[43]
Shikui Wei, Yao Zhao, Zhenfeng Zhu, and Nan Liu. 2010. Multimodal fusion for video search reranking. IEEE Transactions on Knowledge and Data Engineering 22, 8 (2010), 1191--1199.
[44]
Di Wu, Jun Wu, Ming-Yu Lu, and Chun-Li Wang. 2014. A two-step similarity ranking scheme for image retrieval. In Proceedings of the 2014 6th International Symposium on Parallel Architectures, Algorithms and Programming (PAAP’14). IEEE, 191--196.
[45]
Lei Wu, Xian-Sheng Hua, Nenghai Yu, Wei-Ying Ma, and Shipeng Li. 2008. Flickr distance. In Proceedings of the 16th ACM International Conference on Multimedia. ACM, 31--40.
[46]
Lei Wu, Rong Jin, and Anil K. Jain. 2013. Tag completion for image retrieval. IEEE Transactions on Pattern Analysis and Machine Intelligence 35, 3 (2013), 716--727.
[47]
Lei Wu, Linjun Yang, Nenghai Yu, and Xian-Sheng Hua. 2009. Learning to tag. In Proceedings of the 18th International Conference on World Wide Web. ACM, 361--370.
[48]
Hao Xu, Jingdong Wang, Xian-Sheng Hua, and Shipeng Li. 2009. Tag refinement by regularized LDA. In Proceedings of the 17th ACM International Conference on Multimedia. ACM, 573--576.
[49]
Yang Yang, Yue Gao, Hanwang Zhang, Jie Shao, and Tat-Seng Chua. 2014. Image tagging with social assistance. In Proceedings of International Conference on Multimedia Retrieval. ACM, 81.
[50]
J. Yu, D. Tao, M. Wang, and Y. Rui. 2015. Learning to rank using user clicks and visual features for image retrieval. IEEE Transactions on Cybernetics 45, 4 (2015), 767--779.
[51]
Zheng-Jun Zha, Tao Mei, Jingdong Wang, Zengfu Wang, and Xian-Sheng Hua. 2009. Graph-based semi-supervised learning with multiple labels. Journal of Visual Communication and Image Representation 20, 2 (2009), 97--103.
[52]
Jiangchuan Zheng, Siyuan Liu, and Lionel M. Ni. 2014. User characterization from geographic topic analysis in online social media. In Proceedings of the 2014 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining (ASONAM’14). IEEE, 464--471.
[53]
Ning Zhou, William K. Cheung, Guoping Qiu, and Xiangyang Xue. 2011. A hybrid probabilistic model for unified collaborative and content-based image tagging. IEEE Transactions on Pattern Analysis and Machine Intelligence 33, 7 (2011), 1281--1294.
[54]
Guangyu Zhu, Shuicheng Yan, and Yi Ma. 2010. Image tag refinement towards low-rank, content-tag prior and error sparsity. In Proceedings of the 18th ACM International Conference on Multimedia. ACM, 461--470.

Cited By

View all
  • (2023)M3PT: A Multi-Modal Model for POI TaggingProceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining10.1145/3580305.3599862(5382-5392)Online publication date: 6-Aug-2023
  • (2021)Performance Assessment of Certain Machine Learning Models for Predicting the Major Depressive Disorder among IT Professionals during Pandemic timesComputational Intelligence and Neuroscience10.1155/2021/99503322021Online publication date: 1-Jan-2021
  • (2021)Analog Circuit Soft Fault Diagnosis Based on Sparse Random Projections and K-Nearest NeighborScientific Programming10.1155/2021/80401402021Online publication date: 1-Jan-2021
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Transactions on Intelligent Systems and Technology
ACM Transactions on Intelligent Systems and Technology  Volume 8, Issue 3
Special Issue: Mobile Social Multimedia Analytics in the Big Data Era and Regular Papers
May 2017
320 pages
ISSN:2157-6904
EISSN:2157-6912
DOI:10.1145/3040485
  • Editor:
  • Yu Zheng
Issue’s Table of Contents
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 20 April 2017
Accepted: 01 September 2016
Revised: 01 February 2016
Received: 01 July 2015
Published in TIST Volume 8, Issue 3

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. Tag matrix completion
  2. asymmetric locality sensitive hashing
  3. geo-location information
  4. social image retrieval

Qualifiers

  • Research-article
  • Research
  • Refereed

Funding Sources

  • National Natural Science Foundation of China
  • Basic Research Program of Shenzhen
  • 863 program of China
  • Postdoctoral Science Foundation of China
  • National Basic Research Program of China (973 Program)
  • Key Research Program of Frontier Sciences, CAS

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)9
  • Downloads (Last 6 weeks)0
Reflects downloads up to 28 Feb 2025

Other Metrics

Citations

Cited By

View all
  • (2023)M3PT: A Multi-Modal Model for POI TaggingProceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining10.1145/3580305.3599862(5382-5392)Online publication date: 6-Aug-2023
  • (2021)Performance Assessment of Certain Machine Learning Models for Predicting the Major Depressive Disorder among IT Professionals during Pandemic timesComputational Intelligence and Neuroscience10.1155/2021/99503322021Online publication date: 1-Jan-2021
  • (2021)Analog Circuit Soft Fault Diagnosis Based on Sparse Random Projections and K-Nearest NeighborScientific Programming10.1155/2021/80401402021Online publication date: 1-Jan-2021
  • (2019)A Collaborative Learning Framework to Tag Refinement for Points of InterestProceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining10.1145/3292500.3330698(1752-1761)Online publication date: 25-Jul-2019
  • (2019)Crowdsourcing for search engines: perspectives and challengesInternational Journal of Crowd Science10.1108/IJCS-12-2018-00263:1(49-62)Online publication date: 10-May-2019
  • (2018)Multimodal Semantics and Affective Computing from Multimedia ContentIntelligent Multidimensional Data and Image Processing10.4018/978-1-5225-5246-8.ch014(359-382)Online publication date: 2018
  • (2017)Cross-validation based K nearest neighbor imputation for software quality datasetsJournal of Systems and Software10.1016/j.jss.2017.07.012132:C(226-252)Online publication date: 1-Oct-2017

View Options

Login options

Full Access

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media