Skip to main content
Log in

Tree-based versus distance-based key recognition in musical audio

  • Published:
Soft Computing Aims and scope Submit manuscript

Abstract

A tree-based method for the recognition of the tonal center or key in a musical audio signal is presented. Time-varying key feature vectors of 264 synthesized sounds are extracted from an auditory-based pitch model and converted into character strings using PCA-analysis and classification trees. The results are compared with distance-based methods. The characteristics of the new tonality analysis tool are illustrated on various examples. The potential of this method as a building stone in a music retrieval system is discussed.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

References

  • Breiman L, Friedman J, Olshen R, Stone C (1984) Classification and Regression trees. Wadsworth Int. Group, Belmont, California

  • CART Decision Tree Software (1998) Salford systems, San Diego, http://www.salford-systems.com

  • Foote JT (1997) A similarity measure for automatic audio classification. In: Proceedings of the AAAI 1997 spring symposium on intelligent integration and use of text, image, video and audio corpora. Standford

  • Herrera P, Yeterian A, Gouyon F (2002) Automatic classification of drum sounds: a comparison of feature selection methods and classification techniques. In: Proceedings of ICMAI, pp 69–80

  • Holtzman SR (1970) A Program for key determination. Interface 6: 29–56

    Google Scholar 

  • Janata P, Birk JL, Van Horn JD, Leman M, Tillmann B, Bharucha JJ, The cortical topography of tonal structures underlying western music, Science 298: 2167–2170

  • Jensen K, Arnspang K (1999) Binary decision tree classification of musical sounds. In: Proceedings of the 1999 ICMC

  • Kohonen T (1995) Self-Organizing Maps. Springer, Berlin Heidelberg New York

  • Krumhansl C, Kessler E (1982) Tracing the dynamic changes in perceived tonal organization in a spatial representation of musical keys. Psychol Rev 89: 334–368

    Google Scholar 

  • Krumhansl C (1990) Cognitive foundations of musical pitch. Oxford University Press, New York

  • Leman M (1995) Music and schema theory. Cognitive foundations of systematic musicology, Springer, Berlin Heidelberg New York

  • Leman M (2000) An auditory model of the role of short-term memory in probe-tone ratings. Music Percept. 17: 435–464

    Google Scholar 

  • Leman M, Lesaffre M, Tanghe K (2001) The IPEM toolbox for perception-based music analysis. IPEM, Ghent University, http://www.ipem.rug.ac.be/Toolbox

  • Leman M (2002) The structure of auditory-based induced musical pitch images is low-dimensional. IPEM, Ghent University, (submitted)

  • Leman M, Clarisse LP, De Baets B, De Meyer H, Lesaffre M, Martens G, Martens JP, Van Steelant D (2002) Tendencies, perspectives, and opportunities of musical audio-mining. In: Proceedings of the 3rd EAA European Congress on Acoustics

  • Liu M, Wan C, Wang L (2002) Content-based audio classification and retrieval using a fuzzy logic system: towards multimedia search engines. Soft Comput. 6: 357–364

    Google Scholar 

  • Longuet-Higgens HC, Steedman MJ (1971) On interpreting bach. Machine intell. 6: 221–241

    Google Scholar 

  • Martens G, De Meyer H, De Baets B, Leman M, Martens JP, Clarisse L, Lesaffre M (2002) A tonality-oriented symbolic representation of musical audio generated by classiffication trees. In: Proceedings of the eurofuse workshop on information systems, pp 49–54

  • Moelants D, Van Noorden L (1999) Resonance in the perception of musical tempo. J New Music Research 28: 43–66

    Google Scholar 

  • Pye D (2000) Content-based methods for the management of digital music. In: proceedings of ICASSP, vol. 4. pp 2437–2440

  • Shepard R (1964) Circularity in judgements of relative pitch. J Acoust Soc Amer 36: 2346–2353

    Google Scholar 

  • Shmulevich I, Yli-Harja O (2000) Localized key finding: algorithms and applications. Music Percept. 17: 531–544

    Google Scholar 

  • Shmulevich I, Yli-Harja O, Coyle E, Povel D-J, Lemström K, (2001) Perceptual issues in music pattern recognition: complexity of rhythm and key finding. Comput Humanit. 35: 23–35

    Google Scholar 

  • Shmulevich I, Coyle J (1997) The use of recursive median filters for establishing the tonal context in music. In: proceedings of the 1997 IEEE workshop on applications of signal processing to audio and acoustics

  • Temperley D, Bartlette C (2002) Parallelism as a factor in metrical analysis. Music Percept. 20: 117–149

    Google Scholar 

  • Tzanetakis G, Cook P, Essl G (2001) Automatic musical genre classification of audio signals. In: proceedings of the international symposium for audio information retrieval, pp 205–210

  • Vos P, Leman M (2000) Ed Tonal induction. Special issue of Music Percept. 17: 401–544

  • Vos P, Van Geenen EW (1996) A parallel-processing key-finding music. Music Percept 14: 185–223

    Google Scholar 

  • Wieczorkowska A (1999) A Classification of musical sounds using decision trees. In: proceedings of the 8th Internat. symposium on sound engineering and mastering, 1999, pp 1933–1941

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to H. De Meyer.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Martens, G., De Meyer, H., De Baets, B. et al. Tree-based versus distance-based key recognition in musical audio. Soft Comput 9, 565–574 (2005). https://doi.org/10.1007/s00500-004-0374-7

Download citation

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s00500-004-0374-7

Keywords

Navigation