Skip to main content

Applying Multiple Kernel Learning to Automatic Genre Classification

  • Conference paper
  • First Online:

Abstract

In this paper we demonstrate the advantages of multiple-kernel learning in the application to music genre classification. Multiple-kernel learning provides the possibility to adaptively tune the kernel settings to each group of features independently. Our experiments show the improvement of classification performance in comparison to the conventional support vector machine classifier.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   84.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Notes

  1. 1.

    Available online: http://www.shogun-toolbox.org

  2. 2.

    Available online: http://ismir2004.ismir.net/genre_contest/index.htm

  3. 3.

    see http://www.globalmusic2one.net

  4. 4.

    see http://www.songs2see.net

References

  • Atlas L, Shamma S (2003) Joint acoustic and modulation frequency. EURASIP J Appl Signal Proces 7:668–675

    Article  Google Scholar 

  • Aucouturier JJ, Defreville B, Pachet F (2007) The bag-of-frames approach to audio pattern recognition: A sufficient model for urban soundscapes but not for polyphonic music. J Acoust Soc Am 122(2):881–891

    Article  Google Scholar 

  • Barrington L, Yazdani M, Turnbull D, Lanckriet G (2008) Combining feature kernels for semantic music retrieval. In: Proc. of the 9th Intl. Conf. on Music Information Retrieval (ISMIR), pp 614–619

    Google Scholar 

  • Bello JP, Pickens J (2005) A robust mid-level representation for harmonic content in music signals. In: Proc. of the 6th Int. Conf. on Music Information Retrieval (ISMIR), London, UK, pp 304–311

    Google Scholar 

  • Dittmar C, Bastuck C, Gruhne M (2007) Novel mid-level audio features for music similarity. In: Proc. of the Int. Conf. on Music Communication Science (ICOMCS), Sydney, Australia

    Google Scholar 

  • Essid S (2005) Classification automatique des signaux audio-fréquences: Reconnaissance des instruments de musique. PhD thesis, l’Université Pierre et Marie Curie, Paris, France

    Google Scholar 

  • Gatzsche G, Mehnert M, Gatzsche D, Brandenburg K (2007) A symmetry based approach for musical tonality analysis. In: Proc. of the 8th Int. Conf. on Music Information Retrieval (ISMIR), Vienna, Austria, pp 207–210

    Google Scholar 

  • Gruhne M, Dittmar C (2009) Comparison of harmonic mid-level representations for genre recognition. In: Proc. of the 3rd Workshop on Learning the Semantics of Audio Signals (LSAS), Graz, Austria, pp 91–102

    Google Scholar 

  • Gruhne M, Dittmar C, Gaertner D (2009) Improving rhythmic similarity computation by beat histogram transformations. In: Proc. of the 10th Int. Society for Music Information Retrieval Conf. (ISMIR), Kobe, Japan

    Google Scholar 

  • Lanckriet G, Cristianini N, Bartlett P, Ghaoui LE, Jordan M (2004) Learning the kernel matrix with semidefinite programming. J Mach Learn Res 5:27–72

    MATH  Google Scholar 

  • Lee K (2006) Automatic chord recognition from audio using enhanced pitch class profile. In: Proc. of the Int. Computer Music Conf. (ICMC), New Orleans, USA, pp 306–313

    Google Scholar 

  • Nakajima S, Binder A, Müller C, Wojcikiewicz W, Kloft M, Brefeld U, Müller KR, Kawanabe M (2009) Multiple kernel learning for object classification. Tech. rep., Information-Based Induction Sciences, Fukuoka, Japan

    Google Scholar 

  • Peeters G, Rodet X (2003) Hierarchical gaussian tree with inertia ratio maximization for the classification of large musical instruments databases. In: Proc. of the 6th Intl. Conf. on Digital Audio Effects (DAFx)., London, UK

    Google Scholar 

  • Sonnenburg S, Rätsch G, Schäfer C (2005) Learning interpretable SVMs for biological sequence classification. In: Miyano S, Mesirov J, Kasif S, Istrail S, Pevzner P, Waterman M (eds) Research in Computational Molecular Biology, Lecture Notes in Computer Science, vol 3500, Springer, Berlin/Heidelberg, pp 389–407

    Google Scholar 

  • Sonnenburg S, Rätsch G, Schäfer C, Schölkopf B (2006) Large scale multiple kernel learning. J Mach Learn Res 7:1531–1565

    MathSciNet  MATH  Google Scholar 

  • Tzanetakis G, Cook P (2002) Musical genre classification of audio signals. IEEE Trans Speech Audio Process 10(5):293–302

    Article  Google Scholar 

  • Uhle C, Dittmar C, Sporer T (2003) Extraction of drum tracks from polyphonic music using independent subspace analysis. In: Proc. of the 4th Int. Symposium on Independent Component Analysis (ICA), Nara, Japan, pp 843–848

    Google Scholar 

  • Vapnik V (1998) Statistical learning theory. Wiley, New York

    MATH  Google Scholar 

Download references

Acknowledgements

This work has been partly supported by the German research project GlobalMusic2One Footnote 3 funded by the Federal Ministry of Education and Research (BMBF-FKZ: 01/S08039B). Additionally, the Thuringian Ministry of Economy, Employment and Technology supported this research by granting funds of the European Fund for Regional Development to the project Songs2See Footnote 4, enabling transnational cooperation between Thuringian companies and their partners from other European regions.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Hanna Lukashevich .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2012 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Lukashevich, H. (2012). Applying Multiple Kernel Learning to Automatic Genre Classification. In: Gaul, W., Geyer-Schulz, A., Schmidt-Thieme, L., Kunze, J. (eds) Challenges at the Interface of Data Analysis, Computer Science, and Optimization. Studies in Classification, Data Analysis, and Knowledge Organization. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-24466-7_40

Download citation

Publish with us

Policies and ethics