Abstract
A tree-based method for the recognition of the tonal center or key in a musical audio signal is presented. Time-varying key feature vectors of 264 synthesized sounds are extracted from an auditory-based pitch model and converted into character strings using PCA-analysis and classification trees. The results are compared with distance-based methods. The characteristics of the new tonality analysis tool are illustrated on various examples. The potential of this method as a building stone in a music retrieval system is discussed.
References
Breiman L, Friedman J, Olshen R, Stone C (1984) Classification and Regression trees. Wadsworth Int. Group, Belmont, California
CART Decision Tree Software (1998) Salford systems, San Diego, http://www.salford-systems.com
Foote JT (1997) A similarity measure for automatic audio classification. In: Proceedings of the AAAI 1997 spring symposium on intelligent integration and use of text, image, video and audio corpora. Standford
Herrera P, Yeterian A, Gouyon F (2002) Automatic classification of drum sounds: a comparison of feature selection methods and classification techniques. In: Proceedings of ICMAI, pp 69–80
Holtzman SR (1970) A Program for key determination. Interface 6: 29–56
Janata P, Birk JL, Van Horn JD, Leman M, Tillmann B, Bharucha JJ, The cortical topography of tonal structures underlying western music, Science 298: 2167–2170
Jensen K, Arnspang K (1999) Binary decision tree classification of musical sounds. In: Proceedings of the 1999 ICMC
Kohonen T (1995) Self-Organizing Maps. Springer, Berlin Heidelberg New York
Krumhansl C, Kessler E (1982) Tracing the dynamic changes in perceived tonal organization in a spatial representation of musical keys. Psychol Rev 89: 334–368
Krumhansl C (1990) Cognitive foundations of musical pitch. Oxford University Press, New York
Leman M (1995) Music and schema theory. Cognitive foundations of systematic musicology, Springer, Berlin Heidelberg New York
Leman M (2000) An auditory model of the role of short-term memory in probe-tone ratings. Music Percept. 17: 435–464
Leman M, Lesaffre M, Tanghe K (2001) The IPEM toolbox for perception-based music analysis. IPEM, Ghent University, http://www.ipem.rug.ac.be/Toolbox
Leman M (2002) The structure of auditory-based induced musical pitch images is low-dimensional. IPEM, Ghent University, (submitted)
Leman M, Clarisse LP, De Baets B, De Meyer H, Lesaffre M, Martens G, Martens JP, Van Steelant D (2002) Tendencies, perspectives, and opportunities of musical audio-mining. In: Proceedings of the 3rd EAA European Congress on Acoustics
Liu M, Wan C, Wang L (2002) Content-based audio classification and retrieval using a fuzzy logic system: towards multimedia search engines. Soft Comput. 6: 357–364
Longuet-Higgens HC, Steedman MJ (1971) On interpreting bach. Machine intell. 6: 221–241
Martens G, De Meyer H, De Baets B, Leman M, Martens JP, Clarisse L, Lesaffre M (2002) A tonality-oriented symbolic representation of musical audio generated by classiffication trees. In: Proceedings of the eurofuse workshop on information systems, pp 49–54
Moelants D, Van Noorden L (1999) Resonance in the perception of musical tempo. J New Music Research 28: 43–66
Pye D (2000) Content-based methods for the management of digital music. In: proceedings of ICASSP, vol. 4. pp 2437–2440
Shepard R (1964) Circularity in judgements of relative pitch. J Acoust Soc Amer 36: 2346–2353
Shmulevich I, Yli-Harja O (2000) Localized key finding: algorithms and applications. Music Percept. 17: 531–544
Shmulevich I, Yli-Harja O, Coyle E, Povel D-J, Lemström K, (2001) Perceptual issues in music pattern recognition: complexity of rhythm and key finding. Comput Humanit. 35: 23–35
Shmulevich I, Coyle J (1997) The use of recursive median filters for establishing the tonal context in music. In: proceedings of the 1997 IEEE workshop on applications of signal processing to audio and acoustics
Temperley D, Bartlette C (2002) Parallelism as a factor in metrical analysis. Music Percept. 20: 117–149
Tzanetakis G, Cook P, Essl G (2001) Automatic musical genre classification of audio signals. In: proceedings of the international symposium for audio information retrieval, pp 205–210
Vos P, Leman M (2000) Ed Tonal induction. Special issue of Music Percept. 17: 401–544
Vos P, Van Geenen EW (1996) A parallel-processing key-finding music. Music Percept 14: 185–223
Wieczorkowska A (1999) A Classification of musical sounds using decision trees. In: proceedings of the 8th Internat. symposium on sound engineering and mastering, 1999, pp 1933–1941
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Martens, G., De Meyer, H., De Baets, B. et al. Tree-based versus distance-based key recognition in musical audio. Soft Comput 9, 565–574 (2005). https://doi.org/10.1007/s00500-004-0374-7
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00500-004-0374-7