Abstract
For human beings, music is generally perceived, categorized, and enjoyed based on its attributes, such as rhythm, pitch, timbre, and harmony. In recent years, due to their high performances, content-based music classification and recommendation systems have attracted much attention from both the music industry and research community. However, on the one hand, deep music classification models are still very rare, and on the other hand, the collaborative filtering approach, which has the cold start problem, still dominates the music recommendation applications. In this paper, we propose Music-CRN (short for music classification and recommendation network), a simple yet effective model that facilitates music classification and recommendation with learning the audio content features of music. Specifically, to extract the content features of music, the audio is converted into spectrogram “images” by Fourier transformation. Music-CRN can be applied on the spectrograms as similar as natural images to effectively extract music content features. Additionally, we collect a new dataset containing nearly 200,000 music spectrogram slices. To the best of our knowledge, this is the first publicly available music spectrogram dataset, which is at https://github.com/YX-Mao/Music-spectrum-image-data. We compare Music-CRN to previous content-based music classification and recommendation models on the collected dataset. Experimental results show that Music-CRN achieves state-of-the-art performance on music classification and recommendation tasks, demonstrating its superiority over previous methods.
Similar content being viewed by others
References
Gardini E, Ferrarotti MJ, Cavalli A, Decherchi S. Using principal paths to walk through music and visual art style spaces induced by convolutional neural networks. Cogn Comput. 2021;13(2):570–82.
Liu C, Feng L, Liu G, Wang H, Liu S. Bottom-up broadcast neural network for music genre classification. Multimed Tools Appl. 2021;80(5):7313–31.
Bergstra J, Casagrande N, Erhan D, Eck D, Kégl B. Aggregate features and AdaBoost for music classification. Mach Learn. 2006;65(2–3):473–84.
Cano P, Koppenberger M, Wack N. Content-based music audio recommendation. In: ACM MM. 2005. p. 211–212.
Wang X, Rosenblum DS, Wang, Y. Context-aware mobile music recommendation for daily activities. In: ACM MM. 2012. p. 99–108.
Tzanetakis G, Cook PR. Musical genre classification of audio signals. IEEE Trans Speech and Audio Processing. 2002;10(5):293–302.
Rich E. User modeling via stereotypes. Cogn Sci. 1979;3(4):329–54.
Li S, Kawale J, Fu Y. Deep collaborative filtering via marginalized denoising auto-encoder. In: CIKM. 2015. p. 811–820.
Herlocker JL, Konstan JA, Borchers A, Riedl J. An algorithmic framework for performing collaborative filtering. SIGIR Forum. 2017;51(2):227–34.
Jongpil L, Jiyoung P, Keunhyoung K, Juhan N. SampleCNN: end-to-end deep convolutional neural networks using very small filters for music classification. Appl Sci. 2018;8(1):150.
Choi K, Fazekas G, Sandler MB, Cho K. Transfer learning for music classification and regression tasks. In: ISMIR. 2017. p. 141–149.
Zhong G, Wang H, Jiao W. MusicCNNs: a new benchmark on content-based music recommendation. In: ICONIP. 2018. p. 394–405.
Kour G, Mehan N, Kour G, Mehan N. Music genre classification using MFCC, SVM and BPNN. Int J Comput Appl. 2015;112(6):12–4.
García-García D, Arenas-García J, Parrado-Hernández E, Diaz-de Maria F. Music genre classification using the temporal structure of songs. In: MLSP. 2010. p. 266–271.
McKinney MF, Breebaart J. Features for audio and music classification. In: ISMIR. 2003.
Dieleman S, Brakel P, Schrauwen B. Audio-based music classification with a pretrained convolutional network. In: ISMIR. 2011. p. 669–674.
Bian W, Wang J, Zhuang B, Yang J, Wang S, Xiao J. Audio-based music classification with DenseNet and data augmentation. In: PRICAI. 2019. p. 56–65.
Dieleman S, Schrauwen B. End-to-end learning for music audio. In: ICASSP. 2014. p. 6964–6968.
Kumar V, Khattar D, Gupta S, Gupta M, Varma V. Deep neural architecture for news recommendation. In: CLEF. 2017.
Porteous I, Asuncion AU, Welling M. Bayesian matrix factorization with side information and dirichlet process mixtures. In: AAAI. 2010.
Xiao H, Chen Y, Shi X, Xu G. Multi-perspective neural architecture for recommendation system. Neural Netw. 2019;118:280–8.
Chen J, Zhang H, He X, Nie L, Liu W, Chua T. Attentive collaborative filtering: multimedia recommendation with item- and component-level attention. In: SIGIR. 2017. p. 335–344.
Sánchez-Moreno D, González ABG, Vicente MDM, Batista VFL, García MNM. A collaborative filtering method for music recommendation using playing coefficients for artists and users. Expert Syst Appl. 2016;66:234–44.
Tang YF, Yun YZ, Jin WW, Xiao MC. Music recommendation with collaborative filtering for mobile services. Appl Mech Mater. 2014;519–520:510–5.
Yan Y, Liu T, Wang Z. A music recommendation algorithm based on hybrid collaborative filtering technique. In: SMP, vol. 568. Springer; 2015. p. 233–240.
Liang Z, Tan Z, Zhuo Z, Zhang X. A hybrid music recommendation system based on scene-state perception model. In: ICSCC, vol. 10699. 2017. p. 19–26.
Wang, L. Design and implementation of hybrid music recommendation system based on music gene. In: CIPAE. 2020. p. 121–124.
Wu, D. Music personalized recommendation system based on hybrid filtration. In: ICITBS. 2019. p. 430–433.
Liang D, Zhan M, Ellis DPW. Content-aware collaborative music recommendation using pre-trained neural networks. In: ISMIR. 2015. p. 295–301.
Su J, Chiu T. An item-based music recommender system using music content similarity. In: ACIIDS, vol. 9622. 2016. p. 179–190.
Lin M, Chen Q, Yan S. Network in network. In: ICLR. 2014.
Murray M. Building a music recommender with deep learning. 2017.
Funding
This work was partially supported by the National Key Research and Development Program of China under Grant No. 2018AAA0100400, the Natural Science Foundation of Shandong Province under Grants No. ZR2020MF131 and No. ZR2021ZD19, and the Science and Technology Program of Qingdao under Grant No. 21-1-4-ny-19-nsh.
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Ethical Approval
This article does not contain any studies with human participants or animals performed by any of the authors.
Informed Consent
Informed consent was obtained from all individual participants included in the study.
Conflict of Interest
The authors declare no competing interests.
Additional information
Publisher’s Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Mao, Y., Zhong, G., Wang, H. et al. Music-CRN: an Efficient Content-Based Music Classification and Recommendation Network. Cogn Comput 14, 2306–2316 (2022). https://doi.org/10.1007/s12559-022-10039-x
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s12559-022-10039-x