A survey of tag-based information retrieval

Lee, Sanghoon; Masoud, Mohamed; Balaji, Janani; Belkasim, Saeid; Sunderraman, Rajshekhar; Moon, Seung-Jin

doi:10.1007/s13735-016-0115-6

A survey of tag-based information retrieval

Trends and Surveys
Published: 09 December 2016

Volume 6, pages 99–113, (2017)
Cite this article

International Journal of Multimedia Information Retrieval Aims and scope Submit manuscript

Sanghoon Lee ORCID: orcid.org/0000-0003-2173-6490¹^nAff2,
Mohamed Masoud¹,
Janani Balaji¹,
Saeid Belkasim¹,
Rajshekhar Sunderraman¹ &
…
Seung-Jin Moon³

828 Accesses
7 Citations
Explore all metrics

Abstract

This paper aims to provide a comprehensive survey of tag-based information retrieval that covers three areas: tag-based document retrieval, tag-based image retrieval, and tag-based music information retrieval. First of all, seven representative graphical models associated with tag contents are reviewed and evaluated in terms of effectiveness in achieving their goals. The models are explored in depth based on appropriate plate notations for the tag-based document retrieval. Second, well-established review criteria for two-way classical methods, tag refinement and tag recommendation, are utilized for tag-based image retrieval. In particular, tag refinement methods are analyzed by means of the experimental results measured on different datasets. Last, popular tagging methods in the area of music information retrieval are reviewed for the tag-based music information retrieval. We introduce five criteria: used models, tagging purpose, tagging right, object type, and used dataset, for evaluating tag-based information retrieval methods as a new categorical framework engaging the graphical models as well as the two-way classical methods.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

References

Cisco (2016) Cisco Visual Networking Index: Forecast and Methodology 2015–2020. http://www.cisco.com/c/en/us/solutions/collateral/service-provider/visual-networking-index-vni/complete-white-paper-c11-481360. Accessed 1 June 2016
Gao Y, Wang M, Luan H, Shen J, Yan S, Tao D (2011) Tag-based social image search with visual-text joint hypergraph learning in Proc. ACM ICM, pp 1517–1520
Gao Y, Wang M, Zha Z-J, Shen J, Li X, Wu X (2013) Visual-textual joint relevance learning for tag-based social image search. IEEE Trans Image Process 22(1):363–376
Article MathSciNet Google Scholar
Zanardi V, Capra L (2008) Social ranking: uncovering relevant content using tag-based recommender systems. In: Proc. ACM Rec Sys, pp 51–58
Nakamoto R, Nakajima S, Miyazaki J, Uemura S (2007) Tag-based contextual collaborative filtering. In: Proc. 18th IEICE data engineering workshop
Carmagnola F, Cena F, Cortassa O, Gena C, Torre I (2007) Towards a tag-based user model: how can user model benefit from tags? User modeling. Springer, Berlin Heidelberg, pp 445–449
Google Scholar
Cai Y, Li Q (2010) Personalized search by tag-based user profile and resource profile in collaborative tagging systems. In: Proc. 19th ACM ICIKM, pp 969–978
Chen L, Xu D, Tsang IW, Luo J (2010) Tag-based web photo retrieval improved by batch mode re-tagging. In: Proc. IEEE CVPR, pp 3440–3446
Mathes A (2004) Folksonomies cooperative classification and communication through shared metadata. UIC Technical Report
Trant J (2009) Studying social tagging and folksonomy: A review and framework J. Digital Information 10-1
Gupta M, Li R, Yin Z, Han J (2010) Survey on social tagging techniques. ACM SIGKDD Explor Newslett 12(1):58–72
Article Google Scholar
Dattolo A, Ferrara F, Tasso C (2010) The role of tags for recommendation: a survey. In: HSI, pp 548–555
Majid A, Khusro S, Rauf A (2011) Semantics in social tagging systems: a review. In: ICCNIT, pp 191–203
Wang M, Ni B, Hua XS, Chua TS (2012) Assistive tagging: a survey of multimedia tagging with human–computer joint exploration. ACM CSUR 44(4):25
Deerwester S, Dumais S, Landauer T, Furnas G, Harshman R (1990) Indexing by latent semantic analysis. J Am Soc Inf Sci 41(6):391407
Article Google Scholar
Hofmann T (1999) Probabilistic latent semantic indexing. In: Proceedings of the 22nd annual international ACM SIGIR, pp 50–57
Blei DM, Andrew YN, Michael IJ (2003) Latent dirichlet allocation. J Mach Learn Res 3:993–1022
MATH Google Scholar
Rosen-Zvi M, Griffiths T, Steyvers M, Smyth P (2012) The author-topic model for authors and documents. In: Proceedings of the 20th conference on uncertainty in artificial intelligence. AUAI Press, pp 487–494
Mimno D, McCallum A (2008) Topic models conditioned on arbitrary features with dirichlet-multinomial regression. In: UAI, pp 411–418
Ramage D, Hall D, Nallapati R, Manning CD (2009) Labeled LDA: a supervised topic model for credit attribution in multi-labeled corpora. In: EMNLP, pp 248–256
Ramage D, Manning CD, Dumais S (2011) Partially labeled topic models for interpretable text mining. In: Proc. ACM SIGKDD, pp 457–465
Li S, Li J, Pan R (2013) Tag-weighted topic model for mining semi-structured documents. In: Proceedings of the 23th international joint conference on artificial intelligence. AAAI Press, pp 2855–2861
Li S, Li J, Huang G, Tan R, Pan R (2015) Tag-weighted topic model for large-scale semi-structured documents (arXiv preprint). arXiv:1507.08396
Zamir AR, Ardeshir S, Shah M (2014) GPS-Tag refinement using random walks with an adaptive damping factor. In: CVPR, pp 4280–4287
Liu Y, Wu F, Zhang Y, Shao J, Zhuang Y (2011) Tag clustering and refinement on semantic unity graph. In: ICDM, pp 417–426
Hou Y, Lin Z (2015) Image tag completion and refinement by subspace clustering and matrix completion (arXiv preprint). arXiv:1506.03475
Cui B, Yao J (2012) Tag refinement strategies for social tagging systems U.S. Patent Application 13/980,573, filed March 31
Qian Z, Zhong P, Wang R (2015) Tag refinement for user-contributed images via graph learning and nonnegative tensor factorization. IEEE Signal Process Lett 22(9):1302–1305
Article Google Scholar
Wang L, Zhou TH, Lee YK, Cheoi KJ, Ryu KH (2015) An efficient refinement algorithm for multi-label image annotation with correlation model. Telecommunication Systems, pp 1–17
Sang J, Liu J, Xu C (2011) Exploiting user information for image tag refinement. In: Proceedings of 19th ACM international conference on multimedia, pp 1129–1132
Hua XS, Liu D, Wang M, Zhang HJ (2010) Image Tag Refinement U.S. Patent Application 12/971,880, filed December 17
Jin Y, Khan L, Wang L, Awad M (2005) Image annotations by combining multiple evidence and wordnet. In: Proceedings of 13th annual ACM international conference on multimedia, pp 706–715
Resnik P (1995) Using information content to evaluate semantic similarity in a taxonomy. In: Proceedings of the 14th international joint conference on artificial intelligence
Jiang J, Conrath D (1997) Semantic similarity based on corpus statistics and lexical taxonomy. In: Proceedings of the international conference on research in computational linguistics
Lin D (1997) Using syntatic dependency as a local context to resolve word sense ambiguity. In: Proceedings of the 35th annual meeting of the association for computational linguistics, pp 64–71
Leacock C (1998) Combining local context and wordnet similarity for word sense identification. In: Fellbaum Christiane (ed) WordNet: a lexical reference system and its application. MIT Press, Cambridge, pp 265–283
Google Scholar
Banerjee S, Pedersen T (2003) Extended gloss overlaps as a measure of semantic relatedness. In: Proceedings of eighteenth international joint conference on artificial intelligence, pp 805–810
Jeon J, Lavrenko V, Manmatha R (2003) Automatic image annotation and retrieval using cross-media relevance models. In: Proceedings of 26th annual international ACM SIGIR conference on research and development in information retrieval, pp 119–126
Wang C, Jing F, Zhang L, Zhang HJ (2006) Image annotation refinement using random walk with restarts. In: Proceedings of 14th annual ACM international conference on multimedia, pp 647–650
Pan JY, Yang HJ, Faloutsos C, Duygulu P (2004) Automatic multimedia cross-modal correlation discovery. In: Proceedings of 10th ACM SIGKDD international conference on knowledge discovery and data mining, pp 653–658
Cheng W, Wang X (2011) Image tag refinement using tag semantic and visual similarity. In: ICCSNT 4, pp 2146–2149
Wu L, Hua XS, Yu NH, Ma WY, Li SP (2008) Flickr distance ACM multimedia, pp 31–40
Wang C, Jing F, Zhang L, Zhang HJ (2007) Content-based image annotation refinement. In: CVPR’071-8
Sang J, Xu C, Liu J (2012) User-aware image tag refinement via ternary semantic analysis. IEEE Trans Multimedia 14(3):883–895
Article Google Scholar
Liu J, Zhang Y, Li Z, Lu H (2013) Correlation consistency constrained probabilistic matrix factorization for social tag refinement. Neurocomputing 119:3–9
Article Google Scholar
Liu D, Hua XS, Yang L, Wang M, Zhang HJ (2009) Tag ranking. In: Proceedings of 18th international conference on WWW, pp 351–360
Zhuang J, Hoi JCH (2011) A two-view learning approach for image tag ranking. In: WSDM
Zhu G, Yan S, Ma Y (2010) Image tag refinement towards low-rank, content-tag prior and error sparsity. In: Proceedings of international conference on multimedia, pp 461–470
Li Z, Liu J, Zhu X, Liu T, Lu H (2010) Image annotation using multi-correlation probabilistic matrix factorization. ACM Multimedia, pp 1187–1190
Parzen E (1962) On estimation of a probability density function and mode. Ann Math Stat 33(3):1065–1076
Liu D, Yan S, Hua XS, Zhang HJ (2011) Image retagging using collaborative tag propagation. IEEE Trans Multimedia 13(4):702–712
Article Google Scholar
Zhang S, Tian Q, Hua G, Huang Q, Li S (2009) Descriptive visual words and visual phrases for image applications. In: Proc. ACM Int. Conf, Multimedia, p 7584
Xu H, Wang J, Hua XS, Li S (2009) Tag refinement by regularized LDA. In: Proceedings of 17th ACM international conference on multimedia, pp 573–576
Fu J, Wang J, Rui Y, Wang XJ, Mei T (2015) Image tag refinement with view-dependent concept representations. IEEE Trans Circuits Syst Video Technol 25(8):1409–1422
Article Google Scholar
Tsai D, Jing Y, Liu Y, Rowley HA, Ioffe S, Rehg JM (2011) Large-scale image annotation using visual synset. In: Proc. IEEE ICCV, pp 611–618
Chen L, Xu D, Tsang IW, Luo J (2012) Tag-based image retrieval improved by augmented features and group-based refinement. IEEE Trans Multimedia 14(4):10571067
Article Google Scholar
Wu L, Jin R, Jain AK (2013) Tag completion for image retrieval. IEEE Trans Pattern Anal Mach Intell 35(3):716727
Google Scholar
Mishne G (2006) AutoTag: a collaborative approach to automated tag assignment for weblog posts. In: Proceedings of 15th international conference on World Wide Web, pp 953–954
Ames M, Naaman M (2007) Why we tag: Motivations for annotation in mobile and online media. In: Proceedings of the SIGCHI conference on human factors in computing systems, pp 971–980
Sigurbj Orsnsson B, van Zwol R (2008) Flickr tag recommendation based on collective knowledge. In: Proceedings of the 17th international conference WWW, pp 327–336
Hotho A, Jaschke R, Schmitz C, Stumme G (2006) Information retrieval in folksonomies: search and ranking. Lect Notes Comput Sci 4011:411–426
Article Google Scholar
Semeraro G, Lops P, Gemmis MD, Musto C, Narducci F (2012) A folksonomy-based recommender system for personalized access to digital artworks. JOCCH 5(3):11
Article Google Scholar
Chen HM, Chang MH, Chang PC, Tien MC, Hsu WH, Wu JL (2008) SheepDog: group and tag recommendation for Flickr photos by automatic search-based learning. In: Proceedings of the 16th ACM international conference on multimedia, pp 737–740
Lee S, Neve WD, Plataniotis KN, Ro YM (2010) MAP-based image tag recommendation using a visual folksonomy. Pattern Recognit Lett 31(9):976–982
Article Google Scholar
Shen Y, Fan J (2010) Leveraging loosely-tagged images and inter-object correlations for tag recommendation. In: Proceedings of international conference on multimedia, pp 5–14
Lindstaedt S, Mrzinger R, Sorschag R, Pammer V, Thallinger G (2009) Automatic image annotation using visual content and folksonomies. Multimedia Tools Appl 42(1):97–113
Article Google Scholar
Silva A, Martins B (2011) Tag recommendation for geo-referenced photos. In: Proceedings of 3rd ACM SIGSPATIAL international workshop on location-based social networks, pp 57–64
Eom W, Lee S, Neve WD, Ro YM (2011) Improving image tag recommendation using favorite image context. In: ICIP, pp 2445–2448
Anderson A, Ranghunathan K, Vogel A (2008) Tagez: Flickr tag recommendation. Association for the Advancement of Artificial Intelligence
Takashita T, Itokawa T, Kitasuka T, Aritsugi M (2010) Tag recommendation for Flickr using web browsing behavior. In: ICCSA, pp 412–421
Xu Z, Fu Y, Mao J, Su D (2006) Towards the semantic web: collaborative tag suggestions. In: Collaborative web tagging workshop at WWW2006
Kim JW, Candan KS, Tatemura J (2010) Organization and tagging of blog and news entries based on content reuse. J Signal Process Syst 58:407–421
Article Google Scholar
Weinberger KQ, Slaney M, Van Zwol R (2008) Resolving tag ambiguity In: MM, pp 111–120
Nguyen QVH, Do ST, Nguyen TT, Aberer K (2015) Tag-based paper retrieval: minimizing user effort with diversity awareness. In: Database systems for advanced applications. Springer International Publishing, pp 510–528
Cantador I, Konstas I, Jose JM (2011) Categorizing social tags to improve folksonomy-based recommendations. Web semantics: science, services and agents on the World Wide Web 9(1):1–15
Article Google Scholar
Wang YS (2014) Image tag recommendation algorithm using tensor factorization. J Multimedia 9(3):416–422
Article Google Scholar
Fang X, Pan R, Cao G, He X, Dai W (2015) personalized tag recommendation through nonlinear tensor factorization using Gaussian kernel. In: AAAI, pp 439–445
McParlane PJ, Moshfeghi J, Jose JM (2013) On contextual photo tag recommendation. In: Proceedings of the 36th international ACM SIGIR conference on research and development in information retrieval, pp 965–968
Turnbull D, Barrington L, Lanckriet G (2008) Five approaches to collecting tags for music. In: ISMIR, vol 8, pp 225–230
Lamere P (2008) Social tagging and music information retrieval. J New Music Res 37(2):101–114
Article Google Scholar
Marlow C, Naaman M, Boyd D, Davis M (2006) Tagging paper, taxonomy, Flickr, academic article, to read. In: Proceedings of ACM hypertext
Ames M, Naaman M (2007) Why we tag: motivations for annotation in mobile and online media. In: CHI 07: proceedings of the SIGCHI conference on Human factors in computing systems, pp 971–980
Nanopoulos A, Rafailidis D, Symeonidis P, Manolopoulos Y (2010) Musicbox: Personalized music recommendation based on cubic analysis of social tags. IEEE Trans Audio Speech Language Process 18(2):407–412
Article Google Scholar
Lathauwer L, Moor B, Vandewalle J (2000) A multilinear singular value decomposition. SIAM J Matrix Anal Appl 21(4):12531278
MathSciNet MATH Google Scholar
Hariri N, Mobasher B, Burke R (2012) Using social tags to infer context in hybrid music recommendation. In: Proceedings of 12th international workshop on web information and data management, pp 41–48
Nanopoulos A, Karydis I (2011) Know thy neighbor: combining audio features and social tags for effective music similarity. In: ICASSP, pp 165–168
Logan B, Salomon A (2001) A music similarity function based on signal analysis. In: ICME
Yang YH, Bogdanov D, Herrera P, Sordo M (2012) Music retagging using label propagation and robust principal component analysis. In: Proceedings of 21st international conference companion on WWW, pp 869–876
Zhou D, Bousquet O, Lal TN, Weston J, Scholkopf B (2004) Learning with local and global consistency. In: NIPS, pp 321–328
Lin Z, Ganesh A, Wright J, Wu L, Chen M, Ma Y (2009) Fast convex optimization algorithms for exact recovery of a corrupted low-rank matrix Technical Report UILU-ENG-09-2214
Song Y, Dixon S, Pearce M, Fazekas G (2013) Using tags to select stimuli in the study of music and emotion. In: Proc. ICME3
Font F, Serra X (2012) Analysis of the folksonomy of freesound. In: Proc. 2nd comp music workshop
Chen L, Wright P, Nejdl W (2009) Improving music genre classification using collaborative tagging data. In: Proceedings of second ACM international conference on web search and data mining, pp 84–93
Font F, Serra J, Serra X (2014) Audio clip classification using social tags and the effect of tag expansion. In: Audio Engineering Society Conference: 53rd international conference: semantic audio. Audio Engineering Society
Levy M, Sandler M (2007) A semantic space for music derived from social tags. In: Proc. ISMIR
Laurier C, Sordo M, Serra J, Herrera P (2009) Music mood representations from social tags. In: Proc. ISMIR, pp 381–386
Saari P, Eerola T (2014) Semantic computing of moods based on tags in social media of music. IEEE KDE 26(10):2548–2560
Google Scholar
Russell JA (1980) A circumplex model of affect. J Person Soc Psychol 39(6):1161–1178
Article Google Scholar
Ekman P (1992) An argument for basic emotions. Cogn Emotion 6:169–200
Article Google Scholar
Ballan L, Bertini M, Uricchio T, Bimbo AD (2013) Social media annotation. In: 11th international workshop on CBMI, pp 229–235
Miller GA, Claudia L, Randee T, Ross TB (1993) A semantic concordance. In: Proceedings of the workshop on human language technology, pp 303–308
Bentivogli L, Pamela F, Bernardo M, Emanuele P (2004) Revising the wordnet domains hierarchy: semantics, coverage and balancing. In: Proceedings of the workshop on multilingual linguistic resources, pp 101–108
Golder SA, Bernardo AH (2006) Usage patterns of col-laborative tagging systems. J Inf Sci 32(2):198–208
Halpin H, Valentin R, Hana S (2007) The complex dynamics of collaborative tagging. In: Proceedings of the 16th international conference on World Wide Web, pp 211–220
You D, Antani S, Demner-Fushman D, Thoma GR. (2014) Does figure-text improve biomedical article retrieval? A pilot study. In: 2014 IEEE 27th international symposium on CBMS, pp 471–472
Simpson M, You D, Rahman MM, Xue Z, Demner-Fushman D, Antani S, Thoma G (2015) Literature-based biomedical image classification and retrieval. Comput Med Imaging Graphics 39:3–13
Article Google Scholar
Wu H, Bowers DM, Huynh TT, Souvenir R (2013) Biomedical video denoising using supervised manifold learning. In: IEEE 10th international symposium on ISBI, pp 1244–1247
Savaris A, Hrder T, Wangenheim A (2014) DCMDSM: a DICOM decomposed storage model. J Am Med Inf Assoc 21(5):917–924
Article Google Scholar
Nemiroff RJ, Bonnell JT (1995) Astronomy picture of the day. Bull Am Astron Soc, 1291. http://antwrp.gsfc.nasa.gov/apod/astropix.html
Kipp M (2011) Tagging of biomedical articles on CiteULike: a comparison of user, author and professional indexing. Knowl Org 38:245–261
Google Scholar
Lee S, Yanjun Z, Mohamed Eid M M, Maria V, Semra K, Saeid B (2015) Domain specific information retrieval and text mining in medical document. In: Proceedings of the 6th ACM BCB, pp 67–76
TAC (2016) Proceedings of text analysis conference NIST. https://tac.nist.gov
Turnbull D, Barrington L, Torres D, Lanckriet G (2007) Towards musical query by semantic description using the CAL500 data set. ACM SIGIR, pp 439–446
Knees P, Pampalk E, Widmer G (2004) Artist classification with web-based data. In: Proceeding of ISMIR, pp 517–524

Download references

Author information

Sanghoon Lee
Present address: Department of Neurology, Emory University School of Medicine, Atlanta, GA, USA

Authors and Affiliations

Department of Computer Science, Georgia State University, Atlanta, GA, USA
Sanghoon Lee, Mohamed Masoud, Janani Balaji, Saeid Belkasim & Rajshekhar Sunderraman
Department of Computer Science, University of Suwon, Hwaseong-si, Gyeonggi-do, South Korea
Seung-Jin Moon

Authors

Sanghoon Lee
View author publications
You can also search for this author in PubMed Google Scholar
Mohamed Masoud
View author publications
You can also search for this author in PubMed Google Scholar
Janani Balaji
View author publications
You can also search for this author in PubMed Google Scholar
Saeid Belkasim
View author publications
You can also search for this author in PubMed Google Scholar
Rajshekhar Sunderraman
View author publications
You can also search for this author in PubMed Google Scholar
Seung-Jin Moon
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Sanghoon Lee.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Lee, S., Masoud, M., Balaji, J. et al. A survey of tag-based information retrieval. Int J Multimed Info Retr 6, 99–113 (2017). https://doi.org/10.1007/s13735-016-0115-6

Download citation

Received: 20 September 2016
Revised: 13 November 2016
Accepted: 24 November 2016
Published: 09 December 2016
Issue Date: June 2017
DOI: https://doi.org/10.1007/s13735-016-0115-6

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

A survey of tag-based information retrieval

Abstract

Access this article

Similar content being viewed by others

Tag-Based Recommendation

Usage Based Tag Enhancement of Images

MPTR: A Metadata-Driven Prospective Tag Recommendation Model Based on Integrative Classification

References

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Keywords

Navigation

A survey of tag-based information retrieval

Abstract

Access this article

Similar content being viewed by others

Tag-Based Recommendation

Usage Based Tag Enhancement of Images

MPTR: A Metadata-Driven Prospective Tag Recommendation Model Based on Integrative Classification

References

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation