Abstract
In this paper we demonstrate the advantages of multiple-kernel learning in the application to music genre classification. Multiple-kernel learning provides the possibility to adaptively tune the kernel settings to each group of features independently. Our experiments show the improvement of classification performance in comparison to the conventional support vector machine classifier.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsNotes
- 1.
Available online: http://www.shogun-toolbox.org
- 2.
Available online: http://ismir2004.ismir.net/genre_contest/index.htm
- 3.
- 4.
References
Atlas L, Shamma S (2003) Joint acoustic and modulation frequency. EURASIP J Appl Signal Proces 7:668–675
Aucouturier JJ, Defreville B, Pachet F (2007) The bag-of-frames approach to audio pattern recognition: A sufficient model for urban soundscapes but not for polyphonic music. J Acoust Soc Am 122(2):881–891
Barrington L, Yazdani M, Turnbull D, Lanckriet G (2008) Combining feature kernels for semantic music retrieval. In: Proc. of the 9th Intl. Conf. on Music Information Retrieval (ISMIR), pp 614–619
Bello JP, Pickens J (2005) A robust mid-level representation for harmonic content in music signals. In: Proc. of the 6th Int. Conf. on Music Information Retrieval (ISMIR), London, UK, pp 304–311
Dittmar C, Bastuck C, Gruhne M (2007) Novel mid-level audio features for music similarity. In: Proc. of the Int. Conf. on Music Communication Science (ICOMCS), Sydney, Australia
Essid S (2005) Classification automatique des signaux audio-fréquences: Reconnaissance des instruments de musique. PhD thesis, l’Université Pierre et Marie Curie, Paris, France
Gatzsche G, Mehnert M, Gatzsche D, Brandenburg K (2007) A symmetry based approach for musical tonality analysis. In: Proc. of the 8th Int. Conf. on Music Information Retrieval (ISMIR), Vienna, Austria, pp 207–210
Gruhne M, Dittmar C (2009) Comparison of harmonic mid-level representations for genre recognition. In: Proc. of the 3rd Workshop on Learning the Semantics of Audio Signals (LSAS), Graz, Austria, pp 91–102
Gruhne M, Dittmar C, Gaertner D (2009) Improving rhythmic similarity computation by beat histogram transformations. In: Proc. of the 10th Int. Society for Music Information Retrieval Conf. (ISMIR), Kobe, Japan
Lanckriet G, Cristianini N, Bartlett P, Ghaoui LE, Jordan M (2004) Learning the kernel matrix with semidefinite programming. J Mach Learn Res 5:27–72
Lee K (2006) Automatic chord recognition from audio using enhanced pitch class profile. In: Proc. of the Int. Computer Music Conf. (ICMC), New Orleans, USA, pp 306–313
Nakajima S, Binder A, Müller C, Wojcikiewicz W, Kloft M, Brefeld U, Müller KR, Kawanabe M (2009) Multiple kernel learning for object classification. Tech. rep., Information-Based Induction Sciences, Fukuoka, Japan
Peeters G, Rodet X (2003) Hierarchical gaussian tree with inertia ratio maximization for the classification of large musical instruments databases. In: Proc. of the 6th Intl. Conf. on Digital Audio Effects (DAFx)., London, UK
Sonnenburg S, Rätsch G, Schäfer C (2005) Learning interpretable SVMs for biological sequence classification. In: Miyano S, Mesirov J, Kasif S, Istrail S, Pevzner P, Waterman M (eds) Research in Computational Molecular Biology, Lecture Notes in Computer Science, vol 3500, Springer, Berlin/Heidelberg, pp 389–407
Sonnenburg S, Rätsch G, Schäfer C, Schölkopf B (2006) Large scale multiple kernel learning. J Mach Learn Res 7:1531–1565
Tzanetakis G, Cook P (2002) Musical genre classification of audio signals. IEEE Trans Speech Audio Process 10(5):293–302
Uhle C, Dittmar C, Sporer T (2003) Extraction of drum tracks from polyphonic music using independent subspace analysis. In: Proc. of the 4th Int. Symposium on Independent Component Analysis (ICA), Nara, Japan, pp 843–848
Vapnik V (1998) Statistical learning theory. Wiley, New York
Acknowledgements
This work has been partly supported by the German research project GlobalMusic2One Footnote 3 funded by the Federal Ministry of Education and Research (BMBF-FKZ: 01/S08039B). Additionally, the Thuringian Ministry of Economy, Employment and Technology supported this research by granting funds of the European Fund for Regional Development to the project Songs2See Footnote 4, enabling transnational cooperation between Thuringian companies and their partners from other European regions.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2012 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Lukashevich, H. (2012). Applying Multiple Kernel Learning to Automatic Genre Classification. In: Gaul, W., Geyer-Schulz, A., Schmidt-Thieme, L., Kunze, J. (eds) Challenges at the Interface of Data Analysis, Computer Science, and Optimization. Studies in Classification, Data Analysis, and Knowledge Organization. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-24466-7_40
Download citation
DOI: https://doi.org/10.1007/978-3-642-24466-7_40
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-24465-0
Online ISBN: 978-3-642-24466-7
eBook Packages: Mathematics and StatisticsMathematics and Statistics (R0)