Classification in music research

Weihs, Claus; Ligges, Uwe; Mörchen, Fabian; Müllensiefen, Daniel

doi:10.1007/s11634-007-0016-x

Regular Article
Published: 21 November 2007

Advances in Data Analysis and Classification, Volume 1, pages 255–291 (2007)

Claus Weihs¹,
Uwe Ligges¹,
Fabian Mörchen² &
Daniel Müllensiefen³

Abstract

For some years now, classification in music research has been a very broad and rapidly growing field. Adequate classification depends above all on knowing suitable observable or derived features on the basis of which meaningful groups or classes can be distinguished. Unsupervised classification additionally needs an adequate similarity or distance measure on which the grouping is to be based. Evaluation of supervised learning is typically based on the error rates of the classification rules. In this paper we first discuss typical problems and potentially influential features derived from signal analysis, mental mechanisms or concepts, and compositional structure. We then present typical solutions to such tasks in music research, namely the organization of music collections, the transcription of music signals, the cognitive psychology of music, and the analysis of compositional structure.

References

Adak S (1998). Time-dependent spectral analysis of nonstationary time series. J Am Stat Assoc 93: 1488–1501
Article MATH MathSciNet Google Scholar
Ahrendt P (2006) Music genre classification systems—a computational approach. PhD thesis, Technical University of Denmark, DTU
Alonso M, David B, Richard G (2003) A study of tempo tracking algorithms from polyphonic music signals. In: Proceedings of the 4th COST 276 workshop, information and knowledge management for integrated media communication, Bordeaux, France, pp 1–5
Amatriain X, Arumi P, Ramirez M (2002) CLAM, yet another library for audio and music processing? In: Proceedings of the 17th annual acm conference on Object-Oriented Programming, Systems, Languages and Applications, ACM press, Seattle, WA, USA, pp 46–47
von Ameln F (2001) Blind source separation in der Praxis. Diplomarbeit, Fachbereich Statistik, Universität Dortmund, Dortmund, Germany
Arenas-Garca J, Larsen J, Hansen LK, Meng A (2006) Optimal filtering of dynamics in short-time features for music organization. In: Proceedings of the 7th international conference on music information retrieval, Victoria, Canada, pp 290–295
Aucouturier JJ, Pachet F (2002) Finding songs that sound the same. In: Proceedings of the IEEE Benelux workshop on model based processing and coding of audio, Leuven, Belgium, pp 1–8
Aucouturier JJ and Pachet F (2004). Improving timbre similarity: how high is the sky. J Neg Results Speech Audio Sci 1(1): 1–13
Google Scholar
Bainbridge D, Cunningham SJ, Downie JS (2004) Visual collaging of music in a digital library. In: Proceedings of the 5th international conference on music information retrieval, pp 397–402
Baumann S (2003) Music similarity analysis in a P2P environment. In: Proceedings of the 4th European workshop on image analysis for multimedia interactive services, London, UK, pp 314–319
Beran J (2004). Statistics in musicology. Chapman & Hall/CRC, Boca Raton
MATH Google Scholar
Berenzweig A, Ellis D, Lawrence S (2002) Using voice segments to improve artist classification of music. In: Proceedings of the 22nd international AES conference, Espoo, Finland, pp 119–122
Berenzweig A, Ellis D, Lawrence S (2003) Anchor space for classification and similarity measurement of music. In: Proceedings of the IEEE international conference on multimedia and expo, pp I–29–32
Berenzweig A, Logan B, Ellis D and Whitman B (2004). A large-scale evaluation of acoustic and subjective music-similarity measures. Comput Music J 28(2): 63–76
Article Google Scholar
Bloomfield P (2000). Fourier analyis of time series—an introduction, 2nd edn. Wiley, New York
Google Scholar
Brandenburg K, Popp H (2000) An introduction to MPEG Layer 3. EBU Technical review
Breiman L, Friedman J, Olshen R and Stone C (1984). Classification and regression trees. Wadsworth, Belmont
MATH Google Scholar
Brillinger D (1975). Time series: data analysis and theory. Holt, Rinehart & Winston Inc., New York
MATH Google Scholar
Brown H, Butler D and Jones M (1994). Musical and temporal influences on key discovery. Music Percept 11(4): 371–407
Google Scholar
Bruderer M (2003) Automatic recognition of musical instruments. Master thesis, Ecole Polytechnique Fédérale de Lausanne
Cano P, Loscos A, Bonada J (1999) Score-performance matching using HMMs. In: Proceedings of the international computer music conference, Beijing, China, pp 441–444
Cano P, Kaltenbrunner M, Gouyon F, Battle E (2002) On the use of FastMap for audio retrieval and browsing. In: Proceedings of the 3rd international conference on music information retrieval, Paris, France, pp 275–276
Cemgil A and Kappen B (2003). Monte Carlo methods for tempo tracking and rhythm quantization. J Artif Intell Res 18: 45–81
MATH Google Scholar
Cemgil A, Kappen B, Desain P and Honing H (2001). On tempo tracking: tempogram representation and Kalman filtering. J New Music Res 29(4): 259–273
Article Google Scholar
Cemgil T, Desain P and Kappen B (2000). Rhythm quantization for transcription. Comput Music J 24(2): 60–76
Article Google Scholar
Chew E (2000) Towards a mathematical model of tonality. PhD thesis, Department of Operaitons Research, MIT, Cambridge
Chuan CH, Chew E (2005) Audio key finding: considerations in system design, and the selecting and evaluating of solutions. In: International conference on multimedia and expo (ICME), pp 21–24
Costa M, Fine P and Ricci Bitti PE (2004). Interval distribution, mode, and tonal strength of melodies as predictors of perceived emotion. Music Percept 22(1): 1–14
Article Google Scholar
Dahlhaus R (1997). Fitting time series models to nonstationary processes. Ann Stat 25: 1–37
Article MATH MathSciNet Google Scholar
Davies M, Plumbley M (2004) Causal tempo tracking of audio. In: Proceedings of the 5th international conference on music information retrieval, Audiovisual Institute, Universitat Pompeu Fabra, Barcelona, Spain, pp 164–169
Davy M, Godsill S (2002) Bayesian harmonic models for musical pitch estimation and analysis. Technical Report 431, Cambridge University Engineering Department, Cambridge
Dixon S (1996). Multiphonic note identification. Aust Comput Sci Commun 17(1): 318–323
Google Scholar
Dixon S, Goebl W and Cambouropoulos E (2006). Perceptual smoothness of tempo in expressively performed music. Music Percept 23(3): 195–214
Article Google Scholar
Downie JS (1999) Evaluating a simple approach to music information retrieval: Conceiving melodic n-grams as text. PhD thesis, Faculty of Information and Media Studies, University of Western Ontario, London (Ontario), Canada, http://people.lis.uiuc.edu.jdownie/mir_papers/thesis_missing_some_music_figs.pdf
Eerola T, Järvinen T, Louhivuori J and Toiviainen P (2002). Statistical features and perceived similarity of folk melodies. Music Percept 18(3): 275–296
Article Google Scholar
Ellis D, Whitman B, Berenzweig A, Lawrence S (2002) The quest for ground truth in musical artist similarity. In: Proceedings of the 3rd international conference on music information retrieval, pp 170–177
Evangelista G (2001). Flexible wavelets for music signal processing. J New Music Res 30(1): 13–22
Article MathSciNet Google Scholar
Faloutsos C, Lin KI (1995) FastMap: A fast algorithm for indexing, data mining and visualization of traditional and multimedia datasets. In: Carey MJ, Schneider DA (eds) Proceedings of the 1995 ACM SIGMOD international conference on management of data, San Jose, pp 163–174
Flexer A, Pampalk E, Widmer G (2005) Hidden Markov models for spectral similarity of songs. In: Proceedings of the 8th international conference on digital audio effects, Madrid, Spain
Foote J (2002) Audio retrieval by rhythmic similarity. In: Proceedings of the 3rd international conference on music information retrieval
Foote J, Uchihashi S (2001) The beat spectrum: a new approach to rhythm analysis. In: Proceedings of the IEEE international conference on multimedia and expo, Tokyo, Japan, pp 224–228
Friedman J (1989). Regularized discriminant analysis. J Am Stat Assoc 84: 165–175
Article Google Scholar
Fucks W (1962). Mathematical analysis of formal structure of music. IEEE Trans Inform Theory 8(5): 225–228
Article Google Scholar
Fucks W (1963) Mathematische Analyse von Formalstrukturen von Werken der Musik (mit Diskussion). In: Arbeitsgemeinschaft für Forschung des Landes Nordrhein-Westfalen, Westdeutscher Verlag, Köln und Opladen, pp 39–114
Fucks W (1964) Gibt es mathematische Gesetze in Sprache und Musik? In: Frank H (ed) Kybernetik – Brücke zwischen den Wissenschaften, Umschau Verlag, Frankfurt am Main, pp 171–183
Fucks W (1968). Nach allen Regeln der Kunst. DVA, Stuttgart
Google Scholar
Fucks W and Lauter J (1965). Exaktwissenschaftliche Musikanalyse. Westdeutscher Verlag, Köln und Opladen
Google Scholar
Godsill S, Davy M (2003) Bayesian modelling of music audio signals. In: Bulletin of the International Statistical Institute, 54th Session, Berlin, vol LX, book 2, pp 504–506
Godsill S, Davy M (2005) Bayesian computational models for inharmonicity in musical instruments. In: IEEE workshop on applications of signal processing to audio and acoustics, New Paltz, NY, pp 283–286
Gomez E (2004). Tonal description of polyphonic audio for music content processing. INFORMS J Comput Spec Clust Comput Music 18(3): 294–304
Google Scholar
Gomez E (2006) Tonal description of music audio signals: harmonic pitch class profiles, tonality and tonal similarity of polyphonic audio signals. PhD thesis, Departament de Tecnologia, Universitat Pompeu Fabra, Barcelona, Spain
Goto M (2003) A chorus-section detecting method for musical audio signals. In: Proceedings of the IEEE international conference on acoustics, speech, and signal processing, pp 437–440
Goto M (2004) A predominant-F0 estimation method for polyphonic musical audio signals. In: Proceedings of the 18th international congress on acoustics (ICA’04), Acoustical Society of Japan, Kyoto, Japan, pp 1085–1088
Gouyon F (2005) A computational approach to rhythm description: Audio features for the computation of rhythm periodicity functions and their use in tempo induction and music content processing. PhD thesis, Universitat Pompeu Fabra, Departament de Tecnologia, Barcelona, Spain
Gouyon F and Dixon S (2005). A review of automatic rhythm description systems. Comput Music J 29(1): 34–54
Article Google Scholar
Gouyon F, Klapuri A, Dixon S, Alonso M, Tzanetakis G, Uhle C and Cano P (2006). An experimental comparison of audio tempo induction algorithms. IEEE Trans Speech Audio Process 14(5): 1832–1844
Article Google Scholar
Gromko JE (1993). Perceptual differences between expert and novice music listeners at multidimensional scaling analysis. Psychol Music 21: 34–47
Article Google Scholar
Hastie T, Tibshirani R, Friedman J (2001) The elements of statistical learning. Springer, New York, http://www-stat.stanford.edu.tibs/ElemStatLearn/
Herre J, Allamanche E, Ertel C (2003) How similar do songs sound? Towards modeling human perception of musical similarity. In: Proceedings of the IEEE workshop on applications of signal processing to audio and acoustics, pp 83–86
Herrera P, Sandvold V, Gouyon F (2004) Percussion-related semantic descriptors of music audio files. In: Proceedings of the 25th international AES conference, London, United Kingdom
Hyvärinen A, Karhunen J and Oja E (2001). Independent component analysis. Wiley, New York
Google Scholar
Jürgensen F, Knopke I (2004) A comparison of automated methods for the analysis of style in fifteenth-century song intabulations. In: Parncutt R, Kessler A, Zimmer F (eds) Proceedings of the conference on interdisciplinary musicology (CIM04), http://www-gewi.uni-graz.at/staff/parncutt/cim04/CIM04_paper_pdf/JurgensenKnopke.pdf
Kantz H and Schreiber T (1997). Nonlinear time series analysis. Cambridge University Press, Cambridge
MATH Google Scholar
Klapuri A (2001) Multipitch estimation and sound separation by the spectral smoothness principle. In: IEEE international conference on acoustics, speech and signal processing (ICASSP), vol 5, pp 3381–3384
Klapuri A (2004). Automatic music transcription as we know it today. J New Music Res 33(3): 269–282
Article Google Scholar
Klapuri A, Davy M (eds) (2006). Signal processing methods for music transcription. Springer, New York
Google Scholar
Kleber B (2002) Evaluation von Stimmqualität in westlichem, klassischen Gesang. Diploma Thesis, Fachbereich Psychologie, Universität Konstanz, Germany
Knees P, Pampalk E, Widmer G (2004) Artist classification with web-based data. In: Proceedings of the 5th international conference on music information retrieval. Barcelona, Spain, pp 517–524
Knuth D (1984). The T_EXbook. Addison-Wesley, Reading
Google Scholar
Kohonen T (1995). Self-organizing maps. Springer, Berlin
Google Scholar
Kopiez R, Weihs C, Ligges U and Lee JI (2006). Classification of high and low achievers in a music sight-reading task. Psychol Music 34(1): 5–26
Article Google Scholar
Koza JR (1992). Genetic programming: on the programming of computers by means of natural selection. MIT, Cambridge
MATH Google Scholar
Kranenburg Pv, Backer E (2004) Musical style recognition—a quantitative approach. In: Parncutt R, Kessler A, Zimmer F (eds) Proceedings of the conference on interdisciplinary musicology (CIM04), http://www-gewi.uni-graz.at/staff/parncutt/cim04/CIM04_paper_pdf/Kranenburg_Backer_CIM04_proceedings.pdf
Krumhansl CL (1990). Cognitive foundations of musical pitch. Oxford Psychology Series 17. Oxford University Press, Oxford
Google Scholar
Kulesh V, Sethi I, V P (2003) Indexing and retrieval of music via Gaussian mixture models. In: Proceedings of the 3rd international workshop on content based multimedia indexing, Rennes, France, pp 201–205
Kullback S and Leibler RA (1951). On information and sufficiency. Ann Math Stat 22: 79–86
MathSciNet MATH Google Scholar
Kurth F, Gehrmann T, Müller M (2006) The cyclic-beat spectrum: Tempo-related audio features for time-scale invariant audio identification. In: Proceedings of the 7th international conference on music information retrieval, pp 35–40
Lambrou T, Kudumakis P, Speller R, Sandler M, Linney A (1998) Classification of audio signals using statistical features on time and wavelet transform domains. In: Proceedings of the IEEE international conference on acoustics, speech, and signal processing, vol 6, pp 3621–3624
Lamport L (1994). LATEX , a document preparation system, 2nd edn. Addison-Wesley, Reading
Google Scholar
Lehwark P, Risi S, Ultsch A (2007) Visualization and clustering of tagged music data. In: Proceedings GfKl 2007, Freiburg, Germany (to appear)
Lesaffre M, Tanghe K, Martens G, Moelants D, Leman M, De Baets B, De Meyer H, Martens JP (2003) The MAMI query-by-voice experiment: collecting and annotating vocal queries for music information retrieval. In: Proceedings of the 4th international conference on music information retrieval, Baltimore, Maryland, USA and Library of Congress, Washington, DC, USA, pp 65–71
Levy M, Sandler M (2006) Lightweight measures for timbral similarity of musical audio. In: Proceedings of the first ACM workshop on audio and music computing multimedia (AMCMM). ACM, New York, pp 27–36
Li D, Sethi I, Dimitrova N and McGee T (2001). Classification of general audio data for content-based retrieval. Pattern Recogn Lett 22: 533–544
Article MATH Google Scholar
Li T, Ogihara M, Li Q (2003) A comparative study on content-based music genre classification. In: Proceedings of the 26th international ACM SIGIR conference on research and development in information retrieval. ACM, New York, pp 282–289
Lidy T, Rauber A (2005) Evaluation of feature extractors and psycho-acoustic transformations for music genre classification. In: Proceedings of the 6th international conference on music information retrieval, pp 34–41
Ligges U (2006) Transkription monophoner Gesangszeitreihen. Dissertation, Fachbereich Statistik, Universität Dortmund, Dortmund, Germany, http://hdl.handle.net/2003/22521
Logan B (2000) Mel frequency cepstral coefficients for music modeling. In: Proceedings of the first international conference on music information retrieval, pp 23–25
Logan B, Salomon A (2001) A music similarity function based on signal analysis. In: Proceedings of the IEEE international conference on multimedia and expo, pp 745–748
Mandel M, Ellis D (2005) Song-level features and SVMs for music classification. In: Proceedings of the 6th international conference on music information retrieval, pp 594–599
Markuse B and Schneider A (1996). ähnlichkeit, Nähe, Distanz: zur Anwendung multidimensionaler Skalierung in musik-wissenschaftlichen Untersuchungen. Systematische Musikwissenschaft / Systematic Musicology / Musicologie syst[[’e]]matique 4: 53–89
Google Scholar
McEnnis D, McKay C, Fujinaga I, Depalle P (2005) jAudio: a feature extraction library. In: Proceedings of the 6th international conference on music information retrieval, pp 600–603
McKinney M, Breebaart J (2003) Features for audio and music classification. In: Proceedings of the 4th international conference on music information retrieval, pp 151–158
Meng A (2006) Temporal feature integration for music organisation. PhD thesis, Informatics and Mathematical Modelling, Technical University of Denmark, DTU
Meng A, Ahrendt P, Larsen J (2005) Improving music genre classification by short-time feature integration. In: Proceedings of the IEEE international conference on acoustics, speech, and signal processing, vol V, pp 497–500
Meng A, Ahrendt P, Larsen J and Hansen LK (2006). Temporal feature integration for music genre classification. IEEE Trans Signal Process 15: 1654–1664
Google Scholar
Meyer J (1995) Akustik und musikalische Aufführungspraxis. Bochinsky, Frankfurt am Main
Meyer LB (1957). Meaning in music and information theory. J Aesthet Art Criticism 15: 412–424
Article Google Scholar
Microsoft Corporation (1991) Multimedia programming interface and data specification, 1.0. Joint design by IBM Corporation and Microsoft Corporation
MIDI Manufacturers Association (2001) Complete MIDI 1.0 Detailed Specification, 2nd edn, http://www.midi.org
Mierswa I and Morik K (2005). Automatic feature extraction for classifying audio data. Mach Learn J 58: 127–149
Article MATH Google Scholar
Mierswa I, Wurst M, Klinkenberg R, Scholz M, Euler T (2006) YALE: Rapid prototyping for complex data mining tasks. In: Proceedings of the 12th ACM SIGKDD international conference on knowledge discovery and data mining. ACM, New York, NY, USA, pp 935–940
Moles A (1958). Th[[’e]]orie de l’information et perception est[[’e]]tique. Flammarion, Paris
Google Scholar
Moles A (1971). Informationstheorie und ästhetische Wahrnehmung. DuMont Schauberg, Köln
Google Scholar
Moore BCJ and Glasberg BR (1996). A revision of Zwickers loudness model. ACTA Acustica 82: 335–345
Google Scholar
Mörchen F, Ultsch A, Nöcker M, Stamm C (2005a) Databionic visualization of music collections according to perceptual distance. In: Proceedings of the 6th international conference on music information retrieval, pp 396–403
Mörchen F, Ultsch A, Thies M, Löhken I, Nöcker M, Stamm C, Efthymiou N, Kümmerer M (2005b) MusicMiner: visualizing timbre distances of music as topographical maps. Tech. rep., Department of Mathematics and Computer Science, University of Marburg, Germany
Mörchen F, Mierswa I, Ultsch A (2006a) Understandable models of music collections based on exhaustive feature generation with temporal statistics. In: Proceedings of the 12th ACM SIGKDD international conference on knowledge discovery and data mining. ACM, Philadelphia, PA, USA, pp 882–891
Mörchen F, Ultsch A, Thies M and Löhken I (2006b). Modelling timbre distance with temporal statistics from polyphonic music. IEEE Trans Speech Audio Process 14(1): 81–90
Article Google Scholar
Müllensiefen D and Frieler K (2004a). Cognitive adequacy in the measurement of melodic similarity: Algorithmic vs. human judgments. Comput Musicol 13: 147–176
Google Scholar
Müllensiefen D, Frieler K (2004b) Optimizing measures of melodic similarity for the exploration of a large folk song database. In: 5th international conference on music information retrieval, Audiovisual Institute, Universitat Pompeu Fabra, Barcelona, Spain, pp 274–280
Müllensiefen D, Hennig C (2006) Modeling memory for melodies. In: Spiliopoulou M, Kruse R, Borgelt C, Nürnberger A, Gaul W (eds) From data and information analysis to knowledge engineering, Springer, Berlin, pp 732–739
Narmour E (1990) The Analysis and Cognition of Basic Melodic Structures: The Implication-Realization Model. University of Chicago Press, Chicago
Nienhuys HW, Nieuwenhuizen J, et al (2005) GNU LilyPond—the music typesetter. Free Software Foundation, http://www.lilypond.org/,version 2.6.5
Ombao H, Raz J, Malow B and Sachs R (2001). Automatic statistical analysis of bivariate nonstationary time series. J Am Stat Assoc 96(454): 543–560
Article MATH Google Scholar
Oppenheim A, Schafer R and Buck J (1999). Discrete-time signal processing, 2nd edn. Prentice-Hall, New Jersey
Google Scholar
Pachet F, Zils A (2003) Evolving automatically high-level music descriptors from acoustic signals. In: Proceedings of the international symposium on computer music modeling and retrieval, pp 42–53
Pampalk E (2004) A MATLAB toolbox to compute music similarity from audio. In: Proceedings of the 5th international conference on music information retrieval, Barcelona, Spain, pp 254–257
Pampalk E (2006a) Audio-based music similarity and retrieval: Combining a spectral similarity model with information extracted from fluctuation patterns. In: 3rd Annual Music Information Retrieval eXchange (MIREX’06), http://pampalk.at/publications/
Pampalk E (2006b) Computational models of music similarity and their application in music information retrieval. PhD thesis, Computer Science Department, Technical University Vienna, Austria
Pampalk E, Goto M (2006) MusicRainbow: a new user interface to discover artists using audio-rased similarity and web-based labeling. In: Proceedings of the 7th international conference on music information retrieval, pp 367–370
Pampalk E, Rauber A, Merkl D (2002) Content-based organization and visualization of music archives. In: Proceedings of the 10th ACM international conference on multimedia, pp 570–579
Pampalk E, Dixon S, Widmer G (2003a) Exploring music collections by browsing different views. In: Proceedings of the 4th international conference on music information retrieval, pp 201–208
Pampalk E, Dixon S, Widmer G (2003b) On the evaluation of perceptual similarity measures for music. In: Proceedings of the international conference on digital audio effects, pp 6–12
Pampalk E, Flexer A, Widmer G (2005) Hierarchical organization and description of music collections at the artist level. In: Proceedings of the 9th European conference on research and advanced technology for digital libraries, pp 37–48
Pang H and Yoon D (2005). Automatic detection of vibrato in monophonic music. Pattern Recogn 38(7): 1135–1138
Article Google Scholar
Pearce MT and Wiggins GA (2004). Improved methods for statistical modelling of monophonic music. J New Music Res 33(4): 367–385
Article Google Scholar
Pearce MT and Wiggins GA (2006). Expectation in melody: the influence of context and learning. Music Percept 23(5): 377–405
Article Google Scholar
Pierce JR (1992). The science of musical sound, 2nd ed. W.H. Freeman and Co., New York
Google Scholar
Plumbley M (2003). Algorithms for nonnegative independent component analysis. IEEE Trans Neural Netw 14(3): 534–543
Article Google Scholar
Plumbley M (2004) Optimization using Fourier expansion over a geodesic for non-negative ICA. In: Proceedings of the international conference on independent component analysis and blind signal separation (ICA 2004), Granada, Spain, pp 49–56
Plumbley M, Abdallah S, Blumensath T, Jafari M, Nesbit A, Vincent E, Wang B (2006) Musical audio analysis using sparse representations. In: COMPSTAT 2006—proceedings in computational statistics, Physica Verlag, Heidelberg, pp 104–117
Pohle T (2006) Post processing music similarity computations. In: The second annual music information retrieval evaluation eXchange (MIREX 2006), pp 16–18, http://www.music-ir.org/evaluation/MIREX/2006_abstracts/AS_pohle.pdf
Pohle T, Pampalk E, Widmer G (2005) Evaluation of frequently used audio features for classification of music into perceptual categories. In: Proceedings of the 4th international workshop on content-based multimedia indexing (CBMI), Riga, Latvia
Polotti P, Evangelista G (2000) Harmonic-band wavelet coefficient modeling for pseudo-periodic sound processing. In: Proceedings of the COST G-6 conference on digital audio effects (DAFX-00), Verona, Italy, pp 103–108
Polotti P, Evangelista G (2001) Multiresolution sinusoidal/stochastic model for voiced-sounds. In: Proceedings of the COST G-6 conference on digital audio effects (DAFX-01), Limerick, Ireland, pp 120–124
Pressing J, Lawrence P (1993) Transcribe: a comprehensive autotranscription program. In: Proceedings of the international computer music conference, Tokyo, Japan, pp 343–345
Pye D (2000) Content-based methods for managing electronic music. In: Proceedings of the international conference on acoustics, speech, and signal processing, pp 2437–2440
R Development Core Team (2007) R: a language and environment for statistical computing. R Foundation for Statistical Computing, Vienna, Austria, http://www.R-project.org,ISBN3-900051-07-0
Rabiner L and Juang BH (1993). Fundamentals of speech recognition. Prentice-Hall, New York
Google Scholar
Rabiner LR (1989). A tutorial on Hidden Markov Models and selected applications in speech recognition. Proc IEEE 77(2): 257–286
Article Google Scholar
Raphael C (2001). A probabilistic expert system for automatic musical accompaniment. J Comput Graph Stat 10(3): 487–512
Article MathSciNet Google Scholar
Risi S, Mörchen F, Ultsch A, Lewark P (2007) Visual mining in music collections with emergent SOM. In: Proceedings workshop on self-organizing maps (WSOM) (to appear)
Rossignol S, Depalle P, Soumagne J, Rodet X, Collette JL (1999a) Vibrato: detection, estimation, extraction, modification. In: Proceedings of the COST-G6 workshop on digital audio effects (DAFx-99)
Rossignol S, Rodet X, Soumagne J, Collette JL and Depalle P (1999). Automatic characterisation of musical signals: feature extraction and temporal segmentation. J New Music Res 28(4): 281–295
Article Google Scholar
Röver C, Klefenz F, Weihs C (2005) Identification of musical instruments by means of the Hough-transformation. In: Weihs C, Gaul W (eds) Classification—the ubiquitous challenge. Springer, Berlin, pp 608–615
Rubner Y, Tomasi C, Guibas LJ (1998) A metric for distributions with applications to image databases. In: Proceedings of the IEEE international conference on computer vision, Bombay, India, pp 59–66
Salton G and Buckley C (1988). Term-weighting approaches in automatic text retrieval. Inform Process Manage 24(5): 513–523
Article Google Scholar
Sandvold V, Herrera P (2005) Towards a semantic descriptor of subjective intensity in music. In: Proceedings of the international computer music conference
Schedl M, Pohle TP, Knees P, Widmer G (2006) Assigning and visualizing music genres by web-based co-occurance analysis. In: Proceedings of the 7th international conference on music information retrieval, pp 260–265
Scheirer ED (1998). Tempo and beat analysis of acoustic musical signals. J Acoust Soc Am 103(1): 588–601
Article Google Scholar
Schellenberg EG (1997). Simplifying the implication-realization model of melodic expectancy. Music Percept 14: 295–318
Google Scholar
Seidner W and Wendler J (1997). Die Sängerstimme. Henschel, Berlin
Google Scholar
Shao X, Xu C, Kankanhalli MS (2004) Unsupervised classification of music genre using Hidden Markov Model. In: Proceedings of the IEEE international conference on multimedia and expo, pp 2023–2026
Shapiro S (1978). Feature space transforms for curve detection. Pattern Recogn 10: 129–143
Article MATH Google Scholar
Smaragdis P, Brown J (2003) Non-negative matrix factorization for polyphonic music transcription. In: IEEE workshop on applications of signal processing to audio and acoustics, pp 177–180
Steinbeck W (1982). Struktur und ähnlichkeit: Methoden automatisierter Melodieanalyse. Bärenreiter, Kassel
Google Scholar
Stenzel R, Kamps T (2005) Improving content-based similarity measures by training a collaborative model. In: Proceedings of the 6th international conference on music information retrieval, pp 264–271
Stevens S and Volkmann J (1940). The relation of pitch to frequency. Am J Psychol 53(3): 329–353
Article Google Scholar
Streich S, Herrera P (2004) Toward describing perceived complexity of songs: computational methods and implementation. In: Proceedings of the 25th international AES conference
Streich S, Herrera P (2005) Detrended fluctuation analysis of music signals: Danceability estimation and further semantic characterization. In: Proceedings of the 118th AES convention
Temperley D (2001). The cognition of basic musical structures. MIT, Cambridge
Google Scholar
Temperley D (2004). Bayesian models of musical structure and cognition. Music Sci 8(2): 175–205
Google Scholar
Temperley D (2006) A probabilistic model of melody perception. In: Proceeding of the 7th international conference on music information retrieval, pp 276–279, http://ismir2006.ismir.net/PAPERS/ISMIR0630_Paper.pdf
Temperley D (2007). Music and probability. MIT, Cambridge
MATH Google Scholar
Thomassen J (1982). Melodic accent: experiments and a tentative model. J Acoust Soc Am 71: 1596–1605
Article Google Scholar
Torrens M, Hertzog P, Arcos JL (2004) Visualizing and exploring personal music libraries. In: Proceedings of the 5th international conference on music information retrieval, pp 421–424
Tzanetakis G and Cook P (2000). MARSYAS: a framework for audio analysis. Organ Sound 4(30): 169–175
Article Google Scholar
Tzanetakis G and Cook P (2002). Musical genre classification of audio signals. IEEE Trans Speech Audio Process 10(5): 293–302
Article Google Scholar
Tzanetakis G, Ermolinskyi A, Cook P (2002a) Beyond the query-by-example paradigm: New query interfaces for music. In: Proceedings of the international computer music conference, pp 177–183
Tzanetakis G, Ermolinskyi A, Cook P (2002b) Pitch histograms in audio and symbolic music information retrieval. In: Proceedings of the 3rd international conference on music information retrieval, pp 31–38
Tzanetakis G, Essl G, Cook P (2002c) Human perception and computer extraction of beat strength. In: Proceedings of the international conference on digital audio effects (DAFx-02), pp 257–261
Ultsch A (1993) Self-organizing neural networks for visualization and classification. In: Opitz O, Lausen B, Klar R (eds) Information and classification—concepts, methods, and applications, Springer, Berlin, pp 307–313
Ultsch A (1996) Self organizing neural networks perform different from statistical k-means clustering. In: BMBF Statusseminar Künstliche Intelligenz, Neuroinformatik und Intelligente Systeme, München, pp 433–443
Ultsch A, Mörchen F (2005) ESOM-Maps: Tools for clustering, visualization, and classification with emergent SOM. Tech. Rep. 46, Department of Mathematics and Computer Science, University of Marburg, Germany
Van Trees H (2001) Detection, estimation, and modulation theory, Part I, reprint edn. Wiley-Interscience, Melbourne
Vembu S, Baumann S (2005) A self-organizing map based knowledge discovery for music recommendation systems. In: Computer music modeling and retrieval, pp 119–129
Vignoli F, Pauws S (2005) A music retrieval system based on user driven similarity and its evaluation. In: Proceedings of the 6th international conference on music information retrieval, pp 272–279
Vignoli F, van Gulik R, van de Wetering H (2004) Mapping music in the palm of your hand, explore and discover your collection. In: Proceedings of the 5th international conference on music information retrieval, pp 409–414
Viste H, Evangelista G (2001) Sound source separation: Preprocessing for hearing aids and structured audio coding. In: Proceedings of the COST G-6 conference on digital audio effects (DAFX-01), Limerick, pp 67–70
Viste H, Evangelista G (2002) An extension for source separation techniques avoiding beats. In: Proceedings of the 5th international conference on digital audio effects (DAFx-02), Hamburg, Germany, pp 71–75
Wakefield G (1999) Mathematical representation of joint time-chroma distributions. In: Proceedings of the SPIE international symposium on optical science, engineering and instrumentation, Denver, Colorado, pp 637–645
Walmsley P, Godsill S, Rayner P (1999) Polyphonic pitch tracking using joint Bayesian estimation of multiple frame parameters. In: IEEE workshop on applications of signal processing to audio and acoustics, New Paltz, pp 119–122
Wapnick J, Ekholm E (1997) Expert consensus in solo voice performance evaluation. J Voice 11(4): 429–436
Weihs C, Ligges U (2005) From local to global analysis of music time series. In: Morik K, Boulicaut JF, Siebes A (eds) Local pattern detection. Springer, Berlin, Lecture Notes in Artificial Intelligence 3539, pp 217–231
Weihs C, Ligges U (2006) Parameter optimization in automatic transcription of music. In: Spiliopoulou M, Kruse R, Nürnberger A, Borgelt C, Gaul W (eds) From data and information analysis to knowledge engineering. Springer, Berlin, pp 740–747
Weihs C, Berghoff S, Hasse-Becker P, Ligges U (2001) Assessment of purity of intonation in singing presentations by discriminant analysis. In: Kunert J, Trenkler G (eds) Mathematical statistics and biometrical applications. Josef Eul, Bergisch-Gladbach, pp 395–410
Weihs C, Ligges U, Sommer K (2006a) Analysis of music time series. In: Rizzi A, Vichi M (eds) COMPSTAT 2006—proceedings in computational statistics. Physica Verlag, Heidelberg, pp 147–159
Weihs C, Szepannek G, Ligges U, Luebke K, Raabe N (2006b) Local models in register classification by timbre. In: Batagelj V, Bock HH, Ferligoj A, Ziberna A (eds) Data science and classification, Springer, Berlin, pp 315–322
West K, Cox S, Lamere P (2006) Incorporating machine-learning into music similarity estimation. In: Proceedings of the first ACM workshop on Audio and music computing multimedia (AMCMM). ACM, New York, pp 89–96
Whiteley N, Cemgil A, Godsill S (2006) Bayesian modelling of temporal structure in musical audio. In: 7th international conference on music information retrieval, Victoria, Canada, pp 29–34
Whittaker J (1990) Graphical models in applied multivariate statistics. Wiley, New York
Wolfe P, Godsill S, Ng WJ (2004) Bayesian variable selection and regularization for time-frequency surface estimation. J R Stat Soc: Ser B (Stat Methodol) 66(3): 575–589
Zils A, Pachet F (2004) Automatic extraction of music descriptors from acoustic signals using EDS. In: Proceedings of the 116th AES Convention
Zwicker E, Stevens S (1957) Critical bandwidths in loudness summation. J Acoust Soc Am 29(5): 548–557

Author information

Authors and Affiliations

Fachbereich Statistik, Universität Dortmund, 44221, Dortmund, Germany
Claus Weihs & Uwe Ligges
Siemens Corporate Research, 755 College Road East, Princeton, NJ, 08540, USA
Fabian Mörchen
Department of Computing, Goldsmiths College, University of London, London, UK
Daniel Müllensiefen

Authors

Claus Weihs
Uwe Ligges
Fabian Mörchen
Daniel Müllensiefen

Corresponding author

Correspondence to Claus Weihs.

About this article

Cite this article

Weihs, C., Ligges, U., Mörchen, F. et al. Classification in music research. ADAC 1, 255–291 (2007). https://doi.org/10.1007/s11634-007-0016-x

Received: 26 January 2007
Revised: 11 September 2007
Accepted: 03 November 2007
Published: 21 November 2007
Issue Date: December 2007
DOI: https://doi.org/10.1007/s11634-007-0016-x

Mathematics Subject Classification (2000)

62H30
