Skip to main content

Advertisement

Log in

Robust handcrafted features for music genre classification

  • S.I.: Latin American Computational Intelligence
  • Published:
Neural Computing and Applications Aims and scope Submit manuscript

Abstract

When it comes to offering song suggestions to a user, the music genre is one of the most used tags considered by music streaming services. Motivated by the growing number of songs available, automatic music genre classification systems have become a valuable tool for the creation of user-personalised playlists. Considering that feature engineering represents a major task to be addressed when one develops such systems, this work discusses the generation of new handcrafted features over songs, originally exploring high-order features’ moments combined with their derivatives. Additionally, this paper proposes a new wrapper-based selection procedure rigorously based on statistical tests to identify a subset of features that maximise the performance of these systems, irrespective of the classification approach adopted, named Robust Selector of Basis Feature Sets. Based on a synergistic combination of both strategies, a compact subset with 81 features is derived over the GTZAN Dataset. When compared with alternative solutions, this feature set boosted the classification accuracy in datasets containing a wide range of music genres, such as ISMIR2004, BALLROOM, HOMBURG, and FMA Datasets.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11
Fig. 12
Fig. 13
Fig. 14

Similar content being viewed by others

Explore related subjects

Discover the latest articles, news and stories from top researchers in related subjects.

Data availability

The datasets analysed are publicly available, as well the materials used in the implementation.

Code availability

The code can be requested from the authors.

Notes

  1. https://everynoise.com.

  2. https://newsroom.spotify.com/company-info/.

  3. https://scikit-learn.org/stable/modules/generated/sklearn.metrics.roc_auc_score.html.

  4. http://marsyas.info/downloads/datasets.html.

  5. https://www.upf.edu/web/mtg/ismir2004-genre.

  6. http://mtg.upf.edu/ismir2004/contest/tempoContest/node5.html.

  7. https://www-ai.cs.tu-dortmund.de/audio.html.

  8. https://github.com/mdeff/fma.

  9. https://scikit-learn.org.

  10. https://lightgbm.readthedocs.io.

  11. https://github.com/smazzanti/mrmr.

  12. https://scikit-posthocs.readthedocs.io.

  13. https://orange3.readthedocs.io/projects/orange-data-mining-library.

  14. https://matplotlib.org.

  15. https://librosa.org.

  16. https://docs.scipy.org/doc/scipy/index.html.

References

  1. Li S, Karatzoglou A, Gentile C (2016) Collaborative filtering Bandits. In: Proceedings of the 39th international ACM SIGIR conference on research and development in information retrieval. SIGIR ’16, pp 539–548

  2. Tzanetakis G, Cook P (2002) Musical genre classification of audio signals. IEEE Trans Speech Audio Process 10(5):293–302. https://doi.org/10.1109/TSA.2002.800560

    Article  Google Scholar 

  3. Markov K, Matsui T (2014) Music genre and emotion recognition using Gaussian processes. IEEE Access 2:688–697. https://doi.org/10.1109/ACCESS.2014.2333095

    Article  Google Scholar 

  4. Baniya BK, Lee J (2016) Importance of audio feature reduction in automatic music genre classification. Multimedia Tools Appl 75(6):3013–3026. https://doi.org/10.1007/s11042-014-2418-z

    Article  Google Scholar 

  5. Foleis JH, Tavares TF (2020) Texture selection for automatic music genre classification. Appl Soft Comput 89:106127. https://doi.org/10.1016/j.asoc.2020.106127

    Article  Google Scholar 

  6. Singh Y, Biswas A (2022) Robustness of musical features on deep learning models for music genre classification. Expert Syst Appl 199:116879. https://doi.org/10.1016/j.eswa.2022.116879

    Article  Google Scholar 

  7. Medhat F, Chesmore D, Robinson J (2020) Masked conditional neural networks for sound classification. Appl Soft Comput 90:106073. https://doi.org/10.1016/j.asoc.2020.106073

    Article  Google Scholar 

  8. Ashraf M, Geng G, Wang X, Ahmad F, Abid F (2020) A globally regularized joint neural architecture for music classification. IEEE Access 8:220980–220989. https://doi.org/10.1109/ACCESS.2020.3043142

    Article  Google Scholar 

  9. Yi Y, Chen KY, Gu HY (2019) Mixture of CNN experts from multiple acoustic feature domain for music genre classification. In: 2019 Asia-Pacific signal and information processing association annual summit and conference (APSIPA ASC), pp 1250–1255

  10. Kim J, Urbano J, Liem CCS, Hanjalic A (2020) One deep music representation to rule them all? A comparative analysis of different representation learning strategies. Neural Comput Appl 32(4):1067–1093. https://doi.org/10.1007/s00521-019-04076-1

    Article  Google Scholar 

  11. Sousa JM, Pereira ET, Veloso LR (2016) A robust music genre classification approach for global and regional music datasets evaluation. In: 2016 IEEE international conference on digital signal processing (DSP), pp 109–113

  12. Kobayashi T, Kubota A, Suzuki Y (2018) Audio feature extraction based on sub-band signal correlations for music genre classification. In: 2018 IEEE international symposium on multimedia (ISM), pp 180–181

  13. Carbonneau MA, Cheplygina V, Granger E, Gagnon G (2018) Multiple instance learning: a survey of problem characteristics and applications. Pattern Recogn 77:329–353. https://doi.org/10.1016/j.patcog.2017.10.009

    Article  Google Scholar 

  14. Muniz VHS, Souza Filho JBO (2021) Feature vector design for music genre classification. In: 2021 IEEE Latin American conference on computational intelligence (LA-CCI), pp 1–6

  15. Silla Jr CN, Koerich AL, Kaestner CAA (2008) Feature selection in automatic music genre classification. In: 2008 Tenth IEEE international symposium on multimedia, pp 39–44

  16. Serwach M, Stasiak B (2016) GA-based parameterization and feature selection for automatic music genre recognition. In: 2016 17th international conference computational problems of electrical engineering (CPEE), pp 1–5

  17. Pons J, Lidy T, Serra X (2016) Experimenting with musically motivated convolutional neural networks. In: 2016 14th international workshop on content-based multimedia indexing (CBMI), pp 1–6

  18. Liu C, Feng L, Liu G, Wang H, Liu S (2021) Bottom-up broadcast neural network for music genre classification. Multimedia Tools Appl 80(5):7313–7331. https://doi.org/10.1007/s11042-020-09643-6

    Article  Google Scholar 

  19. Raissi T, Tibo A, Bientinesi P (2018) Extended pipeline for content-based feature engineering in music genre recognition. In: 2018 IEEE international conference on acoustics, speech and signal processing (ICASSP), pp 2661–2665

  20. Seo JS, Lee S (2011) Higher-order moments for musical genre classification. Signal Process 91(8):2154–2157. https://doi.org/10.1016/j.sigpro.2011.03.019

    Article  MATH  Google Scholar 

  21. Wu M, Wang Y (2015) A feature selection algorithm of music genre classification based on ReliefF and SFS. In: 2015 IEEE/ACIS 14th international conference on computer and information science (ICIS), pp 539–544

  22. Chae J, Cho SH, Park J, Kim DW, Lee J (2021) Toward a fair evaluation and analysis of feature selection for music tag classification. IEEE Access 9:147717–147731. https://doi.org/10.1109/ACCESS.2021.3123966

    Article  Google Scholar 

  23. Wald R, Khoshgoftaar T, Napolitano A (2013) Comparison of stability for different families of filter-based and wrapper-based feature selection. In: 2013 12th international conference on machine learning and applications, vol 2, pp 457–464

  24. Saeys Y, Abeel T, Van de Peer Y (2008) Robust feature selection using ensemble feature selection techniques. In: Proceedings of the European conference on machine learning and knowledge discovery in databases. Part II, vol 5212, pp 313–325

  25. Laranjeiro N, Agnelo J, Bernardino J (2021) A systematic review on software robustness assessment. ACM Comput Surv. https://doi.org/10.1145/3448977

    Article  Google Scholar 

  26. Zhang WJ, Lin Y (2010) On the principle of design of resilient systems—application to enterprise information systems. Enterprise Inf Syst 4(2):99–110. https://doi.org/10.1080/17517571003763380

    Article  Google Scholar 

  27. Bez CL, Souza Filho JBO, de Vasconcelos LGLBM, Frensch T, da Silva EAB, Netto SL (2021) Multimodal soccer highlight identification using a sparse subset of frames integrating long-term sliding windows. Inf Sci 578:702–724. https://doi.org/10.1016/j.ins.2021.07.066

    Article  MathSciNet  Google Scholar 

  28. Brase CH, Brase CP (2016) Understanding basic statistics, enhanced. Cengage Learning, Boston

    Google Scholar 

  29. Krishnan S, Magimai-Doss M, Seelamantula CS (2013) A Savitzky-Golay filtering perspective of dynamic feature computation. Signal Process Lett IEEE 20:281–284. https://doi.org/10.1109/LSP.2013.2244593

    Article  Google Scholar 

  30. Su X, Liu F (2018) A survey for study of feature selection based on mutual information. In: 2018 9th workshop on hyperspectral image and signal processing: evolution in remote sensing (WHISPERS), pp 1–4

  31. Shi X, Xing F, Guo Z, Su H, Liu F, Yang L (2019) Structured orthogonal matching pursuit for feature selection. Neurocomputing 349:164–172. https://doi.org/10.1016/j.neucom.2018.12.030

    Article  Google Scholar 

  32. Visalakshi S, Radha V (2014) A literature review of feature selection techniques and applications: review of feature selection in data mining. In: 2014 IEEE international conference on computational intelligence and computing research, pp 1–6

  33. You SD, Hung MJ (2020) Reducing dimensionality of spectro-temporal data by independent component analysis. In: 2020 2nd international conference on computer communication and the internet (ICCCI), pp 93–97

  34. Grandini M, Bagli E, Visani G. Metrics for multi-class classification: an overview. Available from: arXiv:2008.05756

  35. Japkowicz N, Shah M (2011) Evaluating learning algorithms: a classification perspective. Cambridge University Press, Cambridge

    Book  MATH  Google Scholar 

  36. Demšar J (2006) Statistical comparisons of classifiers over multiple data sets. J Mach Learn Res 7:1–30

    MathSciNet  MATH  Google Scholar 

  37. Wolpert DH (1996) The lack of a priori distinctions between learning algorithms. Neural Comput 8(7):1341–1390. https://doi.org/10.1162/neco.1996.8.7.1341

    Article  Google Scholar 

  38. Abu-Mostafa YS, Magdon-Ismail M, Lin HT (2012) Learning from data. AMLBook

  39. Murphy KP (2022) Probabilistic machine learning: an introduction. MIT Press, Cambridge

    MATH  Google Scholar 

  40. Mittelhammer R, Judge G, Miller D (2000) Econometric foundation. Cambridge University Press, Cambridge

    MATH  Google Scholar 

  41. Modi S, Lin Y, Cheng L, Yang G, Liu L, Zhang WJ (2011) A socially inspired framework for human state inference using expert opinion integration. IEEE/ASME Trans Mechatron 16(5):874–878. https://doi.org/10.1109/TMECH.2011.2161094

    Article  Google Scholar 

  42. Cano P, Gómez E, Gouyon F, Herrera P, Koppenberger M, Ong B, ISMIR, et al (2004) audio description contest. Tech Rep MTG-TR-2006-02. Universitat Pompeu Fabra 2006, pp 1–20

  43. Homburg H, Mierswa I, Möller B, Morik K, Wurst M (2005) A benchmark dataset for audio classification and clustering. In: Proc. 6th Int. Conf. Music Information Retrieval, pp 528–531

  44. Defferrard M, Benzi K, Vandergheynst P, Bresson X (2017) FMA: a dataset for music analysis. In: 18th International society for music information retrieval conference (ISMIR). Available from: arXiv:1612.01840

  45. Bhat AD, Acharya HR, HRS (2019) A novel solution to the curse of dimensionality in using KNNs for image classification. In: 2019 2nd international conference on intelligent autonomous systems (ICoIAS), pp 32–36

  46. Ng WWY, Zeng W, Wang T (2020) Multi-level local feature coding fusion for music genre recognition. IEEE Access 8:152713–152727. https://doi.org/10.1109/ACCESS.2020.3017661

    Article  Google Scholar 

Download references

Funding

This study was financed in part by the Coordenação de Aperfeiçoamento de Pessoal de Nível Superior - Brasil (CAPES) Finance Code 001, CNPq, and FAPERJ.

Author information

Authors and Affiliations

Authors

Contributions

The authors equally contributed to this work.

Corresponding author

Correspondence to Victor Hugo da Silva Muniz.

Ethics declarations

Competing interests

The authors have no competing interests to declare that are relevant to the content of this article.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Below is the link to the electronic supplementary material.

Supplementary file 1 (pdf 172 KB)

Supplementary file 2 (pdf 193 KB)

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

da Silva Muniz, V.H., de Oliveira e Souza Filho, J.B. Robust handcrafted features for music genre classification. Neural Comput & Applic 35, 9335–9348 (2023). https://doi.org/10.1007/s00521-022-08069-5

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s00521-022-08069-5

Keywords