skip to main content
10.1145/3459104.3459171acmotherconferencesArticle/Chapter ViewAbstractPublication PagesiseeieConference Proceedingsconference-collections
research-article

Language Representation Models for Music Genre Classification Using Lyrics

Published:20 July 2021Publication History

ABSTRACT

There are various genres of music available in every period and field of human life. Every music genre represents a set of shared conventions. Today people have the opportunity to listen to any genre of music they want using various music platforms. However, with the increasing number of music genres, the management of these platforms becomes difficult. Language representation models such as BERT, DistilBERT have been proven to be useful in learning universal language representations. Such language representation models have achieved amazing results in many language understanding tasks. In this study, we apply language representation models for music genre classification using song lyrics. We examine whether language representation models are better than traditional deep learning models for music genre classification by comparing results and computation times. Experimental results show that BERT outperforms other models on one-label and multi-label classification with accuracy of 77.63% and 71.29% respectively. On the other hand, considering the time taken for one epoch, BERT runs 4 times faster than DistilBERT.

References

  1. Adhikari, Ashutosh, Achyudh Ram, Raphael Tang, and Jimmy Lin. 2019. “Docbert: Bert for Document Classification.” arXiv Preprint arXiv:1904.08398.Google ScholarGoogle Scholar
  2. Araújo Lima, Raul de, Rômulo César Costa de Sousa, Simone Diniz Junqueira Barbosa, and Hélio Cortês Vieira Lopes. 2020. “Brazilian Lyrics-Based Music Genre Classification Using a Blstm Network.” http://arxiv.org/abs/2003.05377.Google ScholarGoogle Scholar
  3. Chang, Wei-Cheng, Hsiang-Fu Yu, Kai Zhong, Yiming Yang, and Inderjit Dhillon. 2019. “X-Bert: EXtreme Multi-Label Text Classification with Using Bidirectional Encoder Representations from Transformers.” arXiv Preprint arXiv:1905.02331.Google ScholarGoogle Scholar
  4. Chia, Yew Ken, Sam Witteveen, and Martin Andrews. 2019. “Transformer to Cnn: Label-Scarce Distillation for Efficient Text Classification.” http://arxiv.org/abs/1909.03508.Google ScholarGoogle Scholar
  5. Devlin, Jacob, Ming-Wei Chang, Kenton Lee, and Kristina Toutanova. 2018. “BERT: Pre-Training of Deep Bidirectional Transformers for Language Understanding.” http://arxiv.org/abs/1810.04805.Google ScholarGoogle Scholar
  6. Howard, Sam, Carlos N. Silla Jr, and Colin G. Johnson. 2011. “Automatic Lyrics-Based Music Genre Classification in a Multilingual Setting.” In Thirteenth Brazilian Symposium on Computer Music. https://kar.kent.ac.uk/33266/.Google ScholarGoogle Scholar
  7. Huang, Derek A, Arianna A Serafini, and Eli J Pugh. n.d. “Music Genre Classification.”Google ScholarGoogle Scholar
  8. “Hugging Face – on a Mission to Solve Nlp, One Commit at a Time.” n.d. Hugging Face – on a Mission to Solve NLP, One Commit at a Time.https://huggingface.co/.Google ScholarGoogle Scholar
  9. Johnson, Rie, and Tong Zhang. 2016. “Supervised and Semi-Supervised Text Categorization Using Lstm for Region Embeddings.” http://arxiv.org/abs/1602.02373.Google ScholarGoogle Scholar
  10. Johnson, Rie, and Tong Zhang. 2017. “Deep Pyramid Convolutional Neural Networks for Text Categorization.” In Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 562–70.Google ScholarGoogle Scholar
  11. Diederik P Kingma and Jimmy Ba. 2014. Adam:Amethodforstochasticoptimization. arXiv preprint arXiv:1412.6980 (2014).Google ScholarGoogle Scholar
  12. Li, Wenting, Shangbing Gao, Hong Zhou, Zihe Huang, Kewen Zhang, and Wei Li. 2019. “The Automatic Text Classification Method Based on Bert and Feature Union.” In2019 Ieee 25th International Conference on Parallel and Distributed Systems (Icpads), 774–77. IEEE.Google ScholarGoogle Scholar
  13. Lin, Zhouhan, Minwei Feng, Cicero Nogueira dos Santos, Mo Yu, Bing Xiang, Bowen Zhou, and Yoshua Bengio. 2017. “A Structured Self-Attentive Sentence Embedding.” arXiv Preprint arXiv:1703.03130.Google ScholarGoogle Scholar
  14. Liyuan Liu, Haoming Jiang, Pengcheng He, Weizhu Chen, Xiaodong Liu, Jianfeng Gao, and Jiawei Han. 2019. On the variance of the adaptive learning rate and beyond. arXiv preprint arXiv:1908.03265 (2019).Google ScholarGoogle Scholar
  15. Liu, Pengfei, Xipeng Qiu, and Xuanjing Huang. 2016. “Recurrent Neural Network for Text Classification with Multi-Task Learning.” arXiv Preprint arXiv:1605.05101.Google ScholarGoogle Scholar
  16. Mayer, Rudolf, Robert Neumayer, and Andreas Rauber. 2008a. “Combination of Audio and Lyrics Features for Genre Classification in Digital Audio Collections.” In Proceedings of the 16th Acm International Conference on Multimedia, 159–68.Google ScholarGoogle Scholar
  17. Mayer, Rudolf, Robert Neumayer, and Andreas Rauber. 2008b. “Rhyme and Style Features for Musical Genre Classification by Song Lyrics.” In Ismir, 337–42.Google ScholarGoogle Scholar
  18. Mayer, Rudolf, and Andreas Rauber. 2011. “Musical Genre Classification by Ensembles of Audio and Lyrics Features.” In Proceedings of International Conference on Music Information Retrieval, 675–80.Google ScholarGoogle Scholar
  19. Munikar, Manish, Sushil Shakya, and Aakash Shrestha. 2019. “Fine-Grained Sentiment Classification Using Bert.” In2019 Artificial Intelligence for Transforming Business and Society (Aitb), 1:1–5. IEEE.Google ScholarGoogle Scholar
  20. Neumayer, Robert, and Andreas Rauber. 2007. “Integration of Text and Audio Features for Genre Classification in Music Information Retrieval.” In European Conference on Information Retrieval, 724–27. Springer.Google ScholarGoogle Scholar
  21. Oramas, Sergio, Francesco Barbieri, Oriol Nieto, and Xavier Serra. 2018. “Multimodal Deep Learning for Music Genre Classification.” Transactions of the International Society for Music Information Retrieval. 2018; 1 (1): 4-21.Google ScholarGoogle Scholar
  22. Oramas, Sergio, Oriol Nieto, Francesco Barbieri, and Xavier Serra. 2017. “Multi-Label Music Genre Classification from Audio, Text, and Images Using Deep Features.” http://arxiv.org/abs/1707.04916.Google ScholarGoogle Scholar
  23. Radford, Alec, Karthik Narasimhan, Tim Salimans, and Ilya Sutskever. 2018. “Improving Language Understanding with Unsupervised Learning.” Technical Report, OpenAI.Google ScholarGoogle Scholar
  24. Sanh, Victor, Lysandre Debut, Julien Chaumond, and Thomas Wolf. 2019. “DistilBERT, a Distilled Version of Bert: Smaller, Faster, Cheaper and Lighter.” http://arxiv.org/abs/1910.01108.Google ScholarGoogle Scholar
  25. Schuster, Mike, and Kuldip Paliwal. 1997a. “Bidirectional Recurrent Neural Networks.” Signal Processing, IEEE Transactions on 45 (December): 2673–81. https://doi.org/10.1109/78.650093.Google ScholarGoogle ScholarDigital LibraryDigital Library
  26. Schuster, Mike, and Kuldip K Paliwal. 1997b. “Bidirectional Recurrent Neural Networks.” IEEE Transactions on Signal Processing 45 (11): 2673–81.Google ScholarGoogle ScholarDigital LibraryDigital Library
  27. Shen, Dinghan, Yizhe Zhang, Ricardo Henao, Qinliang Su, and Lawrence Carin. 2018. “Deconvolutional Latent-Variable Model for Text Sequence Matching.” In Thirty-Second Aaai Conference on Artificial Intelligence.Google ScholarGoogle Scholar
  28. Sun, Chi, Xipeng Qiu, Yige Xu, and Xuanjing Huang. 2019. “How to Fine-Tune Bert for Text Classification?” In China National Conference on Chinese Computational Linguistics, 194–206. Springer.Google ScholarGoogle Scholar
  29. Tsaptsinos, Alexandros. 2017. “Music Genre Classification by Lyrics Using a Hierarchical Attention Network.” In. ICME.Google ScholarGoogle Scholar
  30. Tzanetakis, George, and Perry Cook. 2002. “Musical Genre Classification of Audio Signals.” IEEE Transactions on Speech and Audio Processing 10 (5): 293–302.Google ScholarGoogle ScholarCross RefCross Ref
  31. Vaswani, Ashish, Noam Shazeer, Niki Parmar, Jakob Uszkoreit, Llion Jones, Aidan N Gomez, Łukasz Kaiser, and Illia Polosukhin. 2017. “Attention Is All You Need.” In Advances in Neural Information Processing Systems, 5998–6008.Google ScholarGoogle Scholar
  32. Yang, Zichao, Diyi Yang, Chris Dyer, Xiaodong He, Alex Smola, and Eduard Hovy. 2016. “Hierarchical Attention Networks for Document Classification.” In Proceedings of the 2016 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 1480–9.Google ScholarGoogle ScholarCross RefCross Ref
  33. Ying, Teh Chao, Shyamala Doraisamy, and Lili Nurliyana Abdullah. 2012. “Genre and Mood Classification Using Lyric Features.” In2012 International Conference on Information Retrieval & Knowledge Management, 260–63. IEEE.Google ScholarGoogle Scholar
  34. Yogatama, Dani, Chris Dyer, Wang Ling, and Phil Blunsom. 2017. “Generative and Discriminative Text Classification with Recurrent Neural Networks.” arXiv Preprint arXiv:1703.0189Google ScholarGoogle Scholar

Index Terms

  1. Language Representation Models for Music Genre Classification Using Lyrics
          Index terms have been assigned to the content through auto-classification.

          Recommendations

          Comments

          Login options

          Check if you have access through your login credentials or your institution to get full access on this article.

          Sign in
          • Published in

            cover image ACM Other conferences
            ISEEIE 2021: 2021 International Symposium on Electrical, Electronics and Information Engineering
            February 2021
            644 pages
            ISBN:9781450389839
            DOI:10.1145/3459104

            Copyright © 2021 ACM

            Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

            Publisher

            Association for Computing Machinery

            New York, NY, United States

            Publication History

            • Published: 20 July 2021

            Permissions

            Request permissions about this article.

            Request Permissions

            Check for updates

            Qualifiers

            • research-article
            • Research
            • Refereed limited

          PDF Format

          View or Download as a PDF file.

          PDF

          eReader

          View online with eReader.

          eReader

          HTML Format

          View this article in HTML Format .

          View HTML Format