research-article

Large-scale music tag recommendation with explicit multiple attributes

Authors:
Zhendong Zhao

National University of Singapore, Singapore, Singapore

National University of Singapore, Singapore, Singapore
View Profile

,
Xinxi Wang

National University of Singapore, Singapore, Singapore

National University of Singapore, Singapore, Singapore
View Profile

,
Qiaoliang Xiang

National University of Singapore, Singapore, Singapore

National University of Singapore, Singapore, Singapore
View Profile

,
Andy M. Sarroff

National University of Singapore, Singapore, Singapore

National University of Singapore, Singapore, Singapore
View Profile

,
Zhonghua Li

National University of Singapore, Singapore, Singapore

National University of Singapore, Singapore, Singapore
View Profile

,
Ye Wang

National University of Singapore, Singapore, Singapore

National University of Singapore, Singapore, Singapore
View Profile

MM '10: Proceedings of the 18th ACM international conference on MultimediaOctober 2010Pages 401–410https://doi.org/10.1145/1873951.1874006

Published:25 October 2010Publication History

MM '10: Proceedings of the 18th ACM international conference on Multimedia

Pages 401–410

ABSTRACT

Social tagging can provide rich semantic information for large-scale retrieval in music discovery. Such collaborative intelligence, however, also generates a high degree of tags unhelpful to discovery, some of which obfuscate critical information. Towards addressing these shortcomings, tag recommendation for more robust music discovery is an emerging topic of significance for researchers. However, current methods do not consider diversity of music attributes, often using simple heuristics such as tag frequency for filtering out irrelevant tags. Music attributes encompass any number of perceived dimensions, for instance vocalness, genre, and instrumentation. Many of these are underrepresented by current tag recommenders. We propose a scheme for tag recommendation using Explicit Multiple Attributes based on tag semantic similarity and music content. In our approach, the attribute space is explicitly constrained at the outset to a set that minimizes semantic loss and tag noise, while ensuring attribute diversity. Once the user uploads or browses a song, the system recommends a list of relevant tags in each attribute independently. To the best of our knowledge, this is the first method to consider Explicit Multiple Attributes for tag recommendation. Our system is designed for large-scale deployment, on the order of millions of objects. For processing large-scale music data sets, we design parallel algorithms based on the MapReduce framework to perform large-scale music content and social tag analysis, train a model, and compute tag similarity. We evaluate our tag recommendation system on CAL-500 and a large-scale data set ($N = 77,448$ songs) generated by crawling Youtube and Last.fm. Our results indicate that our proposed method is both effective for recommending attribute-diverse relevant tags and efficient at scalable processing.

References

A. Andoni and P. Indyk. Near-optimal hashing algorithms for approximate nearest neighbor in high dimensions. Communications of the ACM, 51(1):117--122, 2008. Google ScholarDigital Library
T. Bertin-Mahieux, D. Eck, F. Maillet, and P. Lamere. Autotagger: A model for predicting social tags from acoustic features on large music databases. Journal of New Music Research, 37(2):115--135, 2008.Google ScholarCross Ref
G. Bradski, C.-T. Chu, A. Ng, K. Olukotun, S. K. Kim, Y.-A. Lin, and Y. Yu. Map-reduce for machine learning on multicore. In NIPS, 12/2006 2006.Google Scholar
H.-M. Chen, M.-H. Chang, P.-C. Chang, M.-C. Tien, W. H. Hsu, and J.-L. Wu. Sheepdog: group and tag recommendation for flickr photos by automatic search-based learning. In MM '08: Proceeding of the 16th ACM international conference on Multimedia, pages 737--740, New York, NY, USA, 2008. ACM. Google ScholarDigital Library
R. L. Cilibrasi and P. M. B. Vitanyi. The google similarity distance. IEEE Trans. on Knowl. and Data Eng., 19(3):370--383, 2007. Google ScholarDigital Library
J. Dean and S. Ghemawat. MapReduce: simplified data processing on large clusters. In Usenix SDI, pages 137--150, 2004. Google ScholarDigital Library
C. Fellbaum, editor. WordNet: an electronic lexical database. MIT Press, 1998.Google ScholarCross Ref
M. Hoffman, D. Blei, and P. Cook. Easy as cba: A simple probabilistic model for tagging music. In Proc. International Symposium on Music Information Retrieval, 2009.Google Scholar
J. Li and J. Z. Wang. Real-time computerized annotation of pictures. In MULTIMEDIA '06: Proceedings of the 14th annual ACM international conference on Multimedia, pages 911--920, New York, NY, USA, 2006. ACM. Google ScholarDigital Library
J. Lin. Scalable language processing algorithms for the masses: a case study in computing word co-occurrence matrices with MapReduce. In EMNLP '08: Proceedings of the Conference on Empirical Methods in Natural Language Processing, pages 419--428, Morristown, NJ, USA, 2008. Association for Computational Linguistics. Google ScholarDigital Library
J. Lin. Brute force and indexed approaches to pairwise document similarity comparisons with mapreduce. In SIGIR '09: Proceedings of the 32nd international ACM SIGIR conference on Research and development in information retrieval, pages 155--162, New York, NY, USA, 2009. ACM. Google ScholarDigital Library
D. Liu, X.-S. Hua, L. Yang, M. Wang, and H.-J. Zhang. Tag ranking. In WWW '09: Proceedings of the 18th International conference on World wide web, pages 351--360, New York, NY, USA, 2009. ACM. Google ScholarDigital Library
R. McCreadie, C. Mcdonald, and I. Ounis. Comparing distributed indexing: To mapreduce or not? In Proceedings of the 7th Workshop on Large-Scale Distributed Systems for Information Retrieval(LSDS-IR'09) at SIGIR 2009, July 2009.Google Scholar
F. Monay and D. Gatica-Perez. On image auto-annotation with latent space models. In MULTIMEDIA '03: Proceedings of the eleventh ACM international conference on Multimedia, pages 275--278, New York, NY, USA, 2003. ACM. Google ScholarDigital Library
S. R. Ness, A. Theocharis, G. Tzanetakis, and L. G. Martins. Improving automatic music tag annotation using stacked generalization of probabilistic svm outputs. In MM '09: Proceedings of the seventeen ACM international conference on Multimedia, pages 705--708, New York, NY, USA, 2009. ACM. Google ScholarDigital Library
S. Shalev-Shwartz, Y. Singer, and N. Srebro. Pegasos: Primal estimated sub-gradient solver for svm. In ICML '07: Proceedings of the 24th international conference on Machine learning, pages 807--814, New York, NY, USA, 2007. ACM. Google ScholarDigital Library
R. Shi, C.-H. Lee, and T.-S. Chua. Enhancing image annotation by integrating concept ontology and text-based bayesian learning model. In MULTIMEDIA '07: Proceedings of the 15th international conference on Multimedia, pages 341--344, New York, NY, USA,2007. ACM. Google ScholarDigital Library
B. Sigurbjörnsson and R. van Zwol. Flickr tag recommendation based on collective knowledge. In WWW '08: Proceeding of the 17th international conference on World Wide Web, pages 327--336, New York, NY, USA, 2008. ACM. Google ScholarDigital Library
G. Sychay, E. Chang, and K. Goh. Effective image annotation via active learning. In 2002 IEEE International Conference on Multimedia and Expo, 2002. ICME'02. Proceedings, volume 1, 2002.Google ScholarCross Ref
D. Turnbull, L. Barrington, D. Torres, and G. Lanckriet. Towards musical query-by-semantic-description using the cal500 data set. In SIGIR '07: Proceedings of the 30th annual international ACM SIGIR conference on Research and development in information retrieval, pages 439--446, New York, NY, USA, 2007. ACM. Google ScholarDigital Library
D. Turnbull, L. Barrington, D. Torres, and G. Lanckriet. Semantic annotation and retrieval of music and sound effects. Audio, Speech, and Language Processing, IEEE Transactions on, 16(2):467--476, 2008. Google ScholarDigital Library
G. Tzanetakis and P. Cook. Marsyas: a framework for audio analysis. Org. Sound, 4(3):169--175, 1999. Google ScholarDigital Library
C. Wang, L. Zhang, and H.-J. Zhang. Learning to reduce the semantic gap in web image retrieval and annotation. In SIGIR '08: Proceedings of the 31st annual international ACM SIGIR conference on Research and development in information retrieval, pages 355--362, New York, NY, USA, 2008. ACM. Google ScholarDigital Library
X. J. Wang, L. Zhang, X. Li, and W. Y. Ma. Annotating images by mining image search results. IEEE Transactions on Pattern Analysis and Machine Intelligence, 30(11):1919--1932, 2008. Google ScholarDigital Library
L. Wu, L. Yang, N. Yu, and X. S. Hua. Learning to tag. In WWW '09: Proceedings of the 18th international conference on World wide web, pages 361--370, New York, NY, USA, 2009. ACM. Google ScholarDigital Library
B. Zhang, J. Shen, Q. Xiang, and Y. Wang. Compositemap: a novel framework for music similarity measure. In SIGIR '09: Proceedings of the 32nd international ACM SIGIR conference on Research and development in information retrieval, pages 403--410, New York, NY, USA, 2009. ACM. Google ScholarDigital Library
B. Zhang, Q. Xiang, H. Lu, J. Shen, and Y. Wang. Comprehensive query-dependent fusion using regression-on-folksonomies: a case study of multimodal music search. In MM '09: Proceedings of the seventeen ACM international conference on Multimedia, pages 213--222, New York, NY, USA, 2009. ACM. Google ScholarDigital Library

Index Terms

Large-scale music tag recommendation with explicit multiple attributes
1. Applied computing
  1. Arts and humanities
    1. Sound and music computing
2. Information systems
  1. Information retrieval
    1. Document representation
    2. Specialized information retrieval
      1. Multimedia and multimodal retrieval
        Music retrieval

Recommendations

Tag recommendation for social bookmarking: Probabilistic approaches
Principles and Practice of Multi-Agent Systems

Tagging has become increasingly popular with the explosion of user-created content on the web. A 'tag' can be defined as a group of keywords that makes organizing, browsing and searching for content more efficient. Users apply tags to a variety of web-...
Read More
Flickr tag recommendation based on collective knowledge
WWW '08: Proceedings of the 17th international conference on World Wide Web

Online photo services such as Flickr and Zooomr allow users to share their photos with family, friends, and the online community at large. An important facet of these services is that users manually annotate their photos using so called tags, which ...
Read More
Personalized tag recommendation based on user preference and content
ADMA'10: Proceedings of the 6th international conference on Advanced data mining and applications - Volume Part II

With the widely use of collaborative tagging system nowadays, users could tag their favorite resources with free keywords. Tag recommendation technology is developed to help users in the process of tagging. However, most of the tag recommendation ...
Read More

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

Published in
MM '10: Proceedings of the 18th ACM international conference on Multimedia
October 2010
1836 pages
ISBN:9781605589336
DOI:10.1145/1873951
General Chairs:
Alberto del Bimbo
University of Florence, Italy
,
Shih-Fu Chang
Columbia University, USA
,
Program Chair:
Arnold Smeulders
University of Amsterdam, NL
Copyright © 2010 ACM
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]
Sponsors
In-Cooperation
Publisher
Association for Computing Machinery
New York, NY, United States
Publication History
- Published: 25 October 2010
Permissions
Request permissions about this article.
Request Permissions

Check for updates
Author Tags
explicit multiple attributes
music
search
tag recommendation
Qualifiers
- research-article
Conference

Acceptance Rates
Overall Acceptance Rate995of4,171submissions,24%
Upcoming Conference
MM '24

Sponsor:

sigmm

MM '24: The 32nd ACM International Conference on Multimedia

October 28 - November 1, 2024

Melbourne , VIC , Australia
Funding Sources
Other Metrics
View Article Metrics

Article Metrics
- 8
  Total Citations
  View Citations
- 466
  Total Downloads
- Downloads (Last 12 months)3
- Downloads (Last 6 weeks)0
Other Metrics
View Author Metrics
Cited By
View all

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Large-scale music tag recommendation with explicit multiple attributes

MM '10: Proceedings of the 18th ACM international conference on Multimedia

ABSTRACT

References

Cited By

Index Terms

Recommendations

Tag recommendation for social bookmarking: Probabilistic approaches

Flickr tag recommendation based on collective knowledge

Personalized tag recommendation based on user preference and content