research-article

Exploiting Heterogeneous Artist and Listener Preference Graph for Music Genre Classification

Authors:

Songlin HuAuthors Info & Claims

MM '20: Proceedings of the 28th ACM International Conference on Multimedia

Pages 3532 - 3540

https://doi.org/10.1145/3394171.3414000

Published: 12 October 2020 Publication History

Abstract

Music genres are useful for indexing, organizing, searching, and recommending songs and albums. Therefore, the automatic classification of music genres is an essential part of almost all kinds of music applications. Recent works focus on exploiting text, audio, or multi-modal information for genre classification, without considering the influence of the artists' and listeners' preference. However, intuitively, artists have their composing preferences, and listeners also have their music tastes. Both of them provide helpful hints to the music genre from different views, which are crucial to improve classification performance.

In this paper, we make use of both artist-music and listener-music preference relations to construct a heterogeneous preference graph. Then, we propose a novel graph-based neural network to automatically encode the global preference relations of the heterogeneous graph into artist and listener representations. We construct a graph to capture the correlations among genres and apply a graph convolutional network to learn genre representation from the correlation graph. Finally, we combine artist, listener, and genre representations for multi-label genre classification. Experimental results show that our model significantly outperforms the state-of-the-art methods on two public music genre classification datasets.

Supplementary Material

MP4 File (3394171.3414000.mp4)

This video is a presentation of the paper "Exploiting Heterogeneous Artist and Listener Preference Graph for Music Genre Classification". In this video, we introduce the multi-label music genre classification task and our solution to this task. The duration of the video is 4 minutes and 38 seconds.

Download
9.48 MB

References

[1]

L Rafael Aguiar, MG Yandre Costa, and N Carlos Silla. 2018. Exploring Data Augmentation to Improve Music Genre Classification with ConvNets. In 2018 International Joint Conference on Neural Networks (IJCNN). IEEE, 1--8.

[2]

Keunwoo Choi, George Fazekas, and Mark Sandler. 2016. Automatic tagging using deep convolutional neural networks. In The 17th International Society for Music Information Retrieval Conference (ISMIR 2016).

[3]

Keunwoo Choi, György Fazekas, Mark Sandler, and Kyunghyun Cho. 2017. Convolutional recurrent neural networks for music classification. In 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE, 2392--2396.

Digital Library

[4]

Kahyun Choi, Jin Ha Lee, and J Stephen Downie. 2014. What is this song about anyway?: Automatic classification of subject using user interpretations and lyrics. In Proceedings of the 14th ACM/IEEE-CS Joint Conference on Digital Libraries. IEEE Press, 453--454.

[5]

Djork-Arné Clevert, Thomas Unterthiner, and Sepp Hochreiter. 2015. Fast and accurate deep network learning by exponential linear units (elus). arXiv preprint arXiv:1511.07289 (2015).

[6]

Sander Dieleman, Philémon Brakel, and Benjamin Schrauwen. 2011. Audio-based music classification with a pretrained convolutional network. In 12th International Society for Music Information Retrieval Conference (ISMIR-2011). University of Miami, 669--674.

[7]

Michael Fell and Caroline Sporleder. 2014. Lyrics-based analysis and classification of music. In Proceedings of COLING 2014, the 25th International Conference on Computational Linguistics: Technical Papers. 620--631.

[8]

Xavier Glorot and Yoshua Bengio. 2010. Understanding the difficulty of training deep feedforward neural networks. In Proceedings of the thirteenth international conference on artificial intelligence and statistics. 249--256.

[9]

Siddharth Gopal and Yiming Yang. 2010. Multi-label classification with meta-level features. Proceeding of the 33rd international ACM SIGIR conference on Research and development in information retrieval - SIGIR '10 (2010). https://doi.org/10.1145/1835449.1835503

Digital Library

[10]

Sepp Hochreiter and Jürgen Schmidhuber. 1997. Long short-term memory. Neural computation, Vol. 9, 8 (1997), 1735--1780.

[11]

Xiao Hu, J Stephen Downie, Kris West, and Andreas F Ehmann. 2005. Mining Music Reviews: Promising Preliminary Results. In ISMIR. 536--539.

[12]

Yoon Kim. 2014. Convolutional Neural Networks for Sentence Classification. In Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP). 1746--1751.

[13]

Diederik P Kingma and Jimmy Ba. 2014. Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980 (2014).

[14]

Thomas N Kipf and Max Welling. 2017. Semi-supervised classification with graph convolutional networks. In International Conference on Learning Representations (ICLR).

[15]

Tao Li, Mitsunori Ogihara, and Qi Li. 2003. A comparative study on content-based music genre classification. In Proceedings of the 26th annual international ACM SIGIR conference on Research and development in informaion retrieval. ACM, 282--289.

Digital Library

[16]

Qianwen Ma, Chunyuan Yuan, Wei Zhou, Jizhong Han, and Songlin Hu. 2020. Beyond Statistical Relations: Integrating Knowledge Relations into Style Correlations for Multi-Label Music Style Classification. In The 13th ACM International WSDM Conference.

Digital Library

[17]

Andrew L Maas, Awni Y Hannun, and Andrew Y Ng. 2013. Rectifier nonlinearities improve neural network acoustic models. In Proc. icml, Vol. 30. 3.

[18]

Loris Nanni, Yandre MG Costa, Alessandra Lumini, Moo Young Kim, and Seung Ryul Baek. 2016. Combining visual and acoustic features for music genre classification. Expert Systems with Applications, Vol. 45 (2016), 108--117.

Digital Library

[19]

Sergio Oramas, Francesco Barbieri, Oriol Nieto, and Xavier Serra. 2018. Multimodal deep learning for music genre classification. Transactions of the International Society for Music Information Retrieval. 2018; 1 (1): 4--21. (2018).

[20]

Sergio Oramas, Luis Espinosa-Anke, Aonghus Lawlor, et almbox. 2016. Exploring customer reviews for music genre classification and evolutionary studies. In The 17th International Society for Music Information Retrieval Conference (ISMIR 2016), New York City, United States of America, 7--11 August 2016.

[21]

Sergio Oramas, Oriol Nieto Caballero, Francesco Barbieri, and Xavier Serra. 2017. Multi-label music genre classification from audio, text and images using deep features. In 18th International Society for Music Information Retrieval Conference (ISMIR 2017).

[22]

Aggelos Pikrakis, Sergios Theodoridis, and Dimitris Kamarotos. 2006. Classification of musical patterns using variable duration hidden Markov models. IEEE Transactions on Audio, Speech, and Language Processing, Vol. 14, 5 (2006), 1795--1807.

Digital Library

[23]

Jordi Pons, Thomas Lidy, and Xavier Serra. 2016. Experimenting with musically motivated convolutional neural networks. In 2016 14th International Workshop on Content-Based Multimedia Indexing (CBMI). IEEE, 1--6.

[24]

Jesse Read, Bernhard Pfahringer, Geoff Holmes, and Eibe Frank. 2011. Classifier chains for multi-label classification. Machine learning, Vol. 85, 3 (2011), 333.

[25]

Chris Sanden and John Z. Zhang. 2011. Enhancing multi-label music genre classification through ensemble techniques. Proceedings of the 34th international ACM SIGIR conference on Research and development in Information - SIGIR '11 (2011). https://doi.org/10.1145/2009916.2010011

[26]

Yizhou Sun, Jiawei Han, Xifeng Yan, Philip S Yu, and Tianyi Wu. 2011. Pathsim: Meta path-based top-k similarity search in heterogeneous information networks. Proceedings of the VLDB Endowment, Vol. 4, 11 (2011), 992--1003.

Digital Library

[27]

Alexandros Tsaptsinos. 2017. Lyrics-based music genre classification using a hierarchical attention network. arXiv preprint arXiv:1707.04678 (2017).

[28]

Grigorios Tsoumakas, Ioannis Katakis, and Ioannis Vlahavas. 2009. Mining multi-label data. In Data mining and knowledge discovery handbook. Springer, 667--685.

[29]

Grigorios Tsoumakas, Ioannis Vlahavas, and Ioannis Vlahavas. 2007. Random k-labelsets: An ensemble method for multilabel classification. In European conference on machine learning. Springer, 406--417.

Digital Library

[30]

Ashish Vaswani, Noam Shazeer, Niki Parmar, Jakob Uszkoreit, Llion Jones, Aidan N Gomez, Łukasz Kaiser, and Illia Polosukhin. 2017. Attention is all you need. In Advances in neural information processing systems. 5998--6008.

[31]

Petar Velivc ković, Guillem Cucurull, Arantxa Casanova, Adriana Romero, Pietro Liò, and Yoshua Bengio. 2018. Graph Attention Networks . International Conference on Learning Representations (2018). https://openreview.net/forum?id=rJXMpikCZ

[32]

Fei Wang, Xin Wang, Bo Shao, Tao Li, and Mitsunori Ogihara. 2009. Tag Integrated Multi-Label Music Style Classification with Hypergraph. In ISMIR. 363--368.

[33]

Xiao Wang, Houye Ji, Chuan Shi, Bai Wang, Yanfang Ye, Peng Cui, and Philip S Yu. 2019. Heterogeneous Graph Attention Network. In The World Wide Web Conference. ACM, 2022--2032.

[34]

Changsheng Xu, Namunu C Maddage, Xi Shao, Fang Cao, and Qi Tian. 2003. Musical genre classification using support vector machines. In 2003 IEEE International Conference on Acoustics, Speech, and Signal Processing, 2003. Proceedings.(ICASSP'03)., Vol. 5. IEEE, V--429.

[35]

Min-Ling Zhang and Zhi-Hua Zhou. 2007. ML-KNN: A lazy learning approach to multi-label learning. Pattern recognition, Vol. 40, 7 (2007), 2038--2048.

[36]

Guangxiang Zhao, Jingjing Xu, Qi Zeng, Xuancheng Ren, and Xu Sun. 2019. Review-Driven Multi-Label Music Style Classification by Exploiting Style Correlations. In Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long and Short Papers). 2884--2891.

[37]

Yatong Zhou, Taiyi Zhang, and Jiancheng Sun. 2006. Music style classification with a novel Bayesian model. In International Conference on Advanced Data Mining and Applications. Springer, 150--156.

Digital Library

[38]

Shenghuo Zhu, Xiang Ji, Wei Xu, and Yihong Gong. 2005. Multi-labelled classification using maximum entropy method. Proceedings of the 28th annual international ACM SIGIR conference on Research and development in information retrieval - SIGIR '05 (2005). https://doi.org/10.1145/1076034.1076082

Digital Library

Cited By

Mimenbayeva ABekmagambetova GMuratova GNaizagarayeva AOspanova TKonyrkhanova A(2024)CLASSIFICATION OF KAZAKH MUSIC GENRES USING MACHINE LEARNING TECHNIQUESScientific Journal of Astana IT University10.37943/17NZKG3418(83-94)Online publication date: 20-May-2024
https://doi.org/10.37943/17NZKG3418
Zhao BChen HZhang JZhang WYu N(2024)Dual-verification-based model fingerprints against ambiguity attacksCybersecurity10.1186/s42400-024-00298-67:1Online publication date: 23-Dec-2024
https://doi.org/10.1186/s42400-024-00298-6
Puttegowda KKeoy KDeepak RArmoogum VParameshachari B(2024)Automated Music Classification using Machine Learning for Indian Songs2024 Second International Conference on Networks, Multimedia and Information Technology (NMITCON)10.1109/NMITCON62075.2024.10698871(1-6)Online publication date: 9-Aug-2024
https://doi.org/10.1109/NMITCON62075.2024.10698871
Show More Cited By

Index Terms

Exploiting Heterogeneous Artist and Listener Preference Graph for Music Genre Classification
1. Applied computing
  1. Arts and humanities
    1. Sound and music computing
2. Computing methodologies
  1. Artificial intelligence
    1. Natural language processing
  2. Machine learning
    1. Machine learning approaches
      1. Classification and regression trees
      2. Neural networks

Recommendations

Detecting Musical Genre Borders for Multi-label Genre Classification
ISM '13: Proceedings of the 2013 IEEE International Symposium on Multimedia

In this paper, we propose a novel method to detect music genre borders for the music genre classification. The music genre classification is getting more important because music is influenced by an increasing amount of different musical styles. A ...
Improving Automatic Music Genre Classification Systems by Using Descriptive Statistical Features of Audio Signals
Artificial Intelligence in Music, Sound, Art and Design
Abstract
Automatic music genre classification systems are vital nowadays because the traditional music genre classification process is mostly implemented without following a universal taxonomy and the traditional process for audio indexing is prone to ...
Music genre classification using explicit semantic analysis
MIRUM '11: Proceedings of the 1st international ACM workshop on Music information retrieval with user-centered and multimodal strategies

Music genre classification is the categorization of a piece of music into its corresponding categorical labels created by humans and has been traditionally performed through a manual process. Automatic music genre classification, a fundamental problem ...

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences

MM '20: Proceedings of the 28th ACM International Conference on Multimedia

October 2020

4889 pages

ISBN:9781450379885

DOI:10.1145/3394171

General Chairs:
Chang Wen Chen
Chinese University of Hong Kong, Shenzhen, China
,
Rita Cucchiara
UNIMORE, Italy
,
Xian-Sheng Hua
Alibaba Group, China
,
Program Chairs:
Guo-Jun Qi
Futurewei Technologies, USA
,
Elisa Ricci
UNITN & Fondazione Bruno Kessler, Italy
,
Zhengyou Zhang
Tencent, China
,
Roger Zimmermann
National University of Singapore, Singapore

Copyright © 2020 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

Sponsors

SIGMM: ACM Special Interest Group on Multimedia

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 12 October 2020

Permissions

Request permissions for this article.

Request Permissions

Check for updates

Author Tags

Qualifiers

Research-article

Funding Sources

the Beijing Municipal Science and Technology Project

Conference

MM '20

Sponsor:

SIGMM

MM '20: The 28th ACM International Conference on Multimedia

October 12 - 16, 2020

WA, Seattle, USA

Acceptance Rates

Overall Acceptance Rate 2,145 of 8,556 submissions, 25%

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

5
Total Citations
View Citations
238
Total Downloads

Downloads (Last 12 months)21
Downloads (Last 6 weeks)1

Reflects downloads up to 17 Jan 2025

Other Metrics

View Author Metrics

Citations

Cited By

Mimenbayeva ABekmagambetova GMuratova GNaizagarayeva AOspanova TKonyrkhanova A(2024)CLASSIFICATION OF KAZAKH MUSIC GENRES USING MACHINE LEARNING TECHNIQUESScientific Journal of Astana IT University10.37943/17NZKG3418(83-94)Online publication date: 20-May-2024
https://doi.org/10.37943/17NZKG3418
Zhao BChen HZhang JZhang WYu N(2024)Dual-verification-based model fingerprints against ambiguity attacksCybersecurity10.1186/s42400-024-00298-67:1Online publication date: 23-Dec-2024
https://doi.org/10.1186/s42400-024-00298-6
Puttegowda KKeoy KDeepak RArmoogum VParameshachari B(2024)Automated Music Classification using Machine Learning for Indian Songs2024 Second International Conference on Networks, Multimedia and Information Technology (NMITCON)10.1109/NMITCON62075.2024.10698871(1-6)Online publication date: 9-Aug-2024
https://doi.org/10.1109/NMITCON62075.2024.10698871
Patil SPradeepini GKomati T(2023)Novel mathematical model for the classification of music and rhythmic genre using deep neural networkJournal of Big Data10.1186/s40537-023-00789-210:1Online publication date: 21-Jun-2023
https://doi.org/10.1186/s40537-023-00789-2
Li YZhang ZDing HChang L(2022)Music genre classification based on fusing audio and lyric informationMultimedia Tools and Applications10.1007/s11042-022-14252-682:13(20157-20176)Online publication date: 29-Dec-2022
https://doi.org/10.1007/s11042-022-14252-6

View Options

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

View options

PDF

View or Download as a PDF file.

eReader

View online with eReader.

Media

Figures

Other

Tables

View Table of Contents