Abstract
Since a few years, classification in music research is a very broad and quickly growing field. Most important for adequate classification is the knowledge of adequate observable or deduced features on the basis of which meaningful groups or classes can be distinguished. Unsupervised classification additionally needs an adequate similarity or distance measure grouping is to be based upon. Evaluation of supervised learning is typically based on the error rates of the classification rules. In this paper we first discuss typical problems and possible influential features derived from signal analysis, mental mechanisms or concepts, and compositional structure. Then, we present typical solutions of such tasks related to music research, namely for organization of music collections, transcription of music signals, cognitive psychology of music, and compositional structure analysis.
Similar content being viewed by others
References
Adak S (1998). Time-dependent spectral analysis of nonstationary time series. J Am Stat Assoc 93: 1488–1501
Ahrendt P (2006) Music genre classification systems—a computational approach. PhD thesis, Technical University of Denmark, DTU
Alonso M, David B, Richard G (2003) A study of tempo tracking algorithms from polyphonic music signals. In: Proceedings of the 4th COST 276 workshop, information and knowledge management for integrated media communication, Bordeaux, France, pp 1–5
Amatriain X, Arumi P, Ramirez M (2002) CLAM, yet another library for audio and music processing? In: Proceedings of the 17th annual acm conference on Object-Oriented Programming, Systems, Languages and Applications, ACM press, Seattle, WA, USA, pp 46–47
von Ameln F (2001) Blind source separation in der Praxis. Diplomarbeit, Fachbereich Statistik, Universität Dortmund, Dortmund, Germany
Arenas-Garca J, Larsen J, Hansen LK, Meng A (2006) Optimal filtering of dynamics in short-time features for music organization. In: Proceedings of the 7th international conference on music information retrieval, Victoria, Canada, pp 290–295
Aucouturier JJ, Pachet F (2002) Finding songs that sound the same. In: Proceedings of the IEEE Benelux workshop on model based processing and coding of audio, Leuven, Belgium, pp 1–8
Aucouturier JJ and Pachet F (2004). Improving timbre similarity: how high is the sky. J Neg Results Speech Audio Sci 1(1): 1–13
Bainbridge D, Cunningham SJ, Downie JS (2004) Visual collaging of music in a digital library. In: Proceedings of the 5th international conference on music information retrieval, pp 397–402
Baumann S (2003) Music similarity analysis in a P2P environment. In: Proceedings of the 4th European workshop on image analysis for multimedia interactive services, London, UK, pp 314–319
Beran J (2004). Statistics in musicology. Chapman & Hall/CRC, Boca Raton
Berenzweig A, Ellis D, Lawrence S (2002) Using voice segments to improve artist classification of music. In: Proceedings of the 22nd international AES conference, Espoo, Finland, pp 119–122
Berenzweig A, Ellis D, Lawrence S (2003) Anchor space for classification and similarity measurement of music. In: Proceedings of the IEEE international conference on multimedia and expo, pp I–29–32
Berenzweig A, Logan B, Ellis D and Whitman B (2004). A large-scale evaluation of acoustic and subjective music-similarity measures. Comput Music J 28(2): 63–76
Bloomfield P (2000). Fourier analyis of time series—an introduction, 2nd edn. Wiley, New York
Brandenburg K, Popp H (2000) An introduction to MPEG Layer 3. EBU Technical review
Breiman L, Friedman J, Olshen R and Stone C (1984). Classification and regression trees. Wadsworth, Belmont
Brillinger D (1975). Time series: data analysis and theory. Holt, Rinehart & Winston Inc., New York
Brown H, Butler D and Jones M (1994). Musical and temporal influences on key discovery. Music Percept 11(4): 371–407
Bruderer M (2003) Automatic recognition of musical instruments. Master thesis, Ecole Polytechnique Fédérale de Lausanne
Cano P, Loscos A, Bonada J (1999) Score-performance matching using HMMs. In: Proceedings of the international computer music conference, Beijing, China, pp 441–444
Cano P, Kaltenbrunner M, Gouyon F, Battle E (2002) On the use of FastMap for audio retrieval and browsing. In: Proceedings of the 3rd international conference on music information retrieval, Paris, France, pp 275–276
Cemgil A and Kappen B (2003). Monte Carlo methods for tempo tracking and rhythm quantization. J Artif Intell Res 18: 45–81
Cemgil A, Kappen B, Desain P and Honing H (2001). On tempo tracking: tempogram representation and Kalman filtering. J New Music Res 29(4): 259–273
Cemgil T, Desain P and Kappen B (2000). Rhythm quantization for transcription. Comput Music J 24(2): 60–76
Chew E (2000) Towards a mathematical model of tonality. PhD thesis, Department of Operaitons Research, MIT, Cambridge
Chuan CH, Chew E (2005) Audio key finding: considerations in system design, and the selecting and evaluating of solutions. In: International conference on multimedia and expo (ICME), pp 21–24
Costa M, Fine P and Ricci Bitti PE (2004). Interval distribution, mode, and tonal strength of melodies as predictors of perceived emotion. Music Percept 22(1): 1–14
Dahlhaus R (1997). Fitting time series models to nonstationary processes. Ann Stat 25: 1–37
Davies M, Plumbley M (2004) Causal tempo tracking of audio. In: Proceedings of the 5th international conference on music information retrieval, Audiovisual Institute, Universitat Pompeu Fabra, Barcelona, Spain, pp 164–169
Davy M, Godsill S (2002) Bayesian harmonic models for musical pitch estimation and analysis. Technical Report 431, Cambridge University Engineering Department, Cambridge
Dixon S (1996). Multiphonic note identification. Aust Comput Sci Commun 17(1): 318–323
Dixon S, Goebl W and Cambouropoulos E (2006). Perceptual smoothness of tempo in expressively performed music. Music Percept 23(3): 195–214
Downie JS (1999) Evaluating a simple approach to music information retrieval: Conceiving melodic n-grams as text. PhD thesis, Faculty of Information and Media Studies, University of Western Ontario, London (Ontario), Canada, http://people.lis.uiuc.edu.jdownie/mir_papers/thesis_missing_some_music_figs.pdf
Eerola T, Järvinen T, Louhivuori J and Toiviainen P (2002). Statistical features and perceived similarity of folk melodies. Music Percept 18(3): 275–296
Ellis D, Whitman B, Berenzweig A, Lawrence S (2002) The quest for ground truth in musical artist similarity. In: Proceedings of the 3rd international conference on music information retrieval, pp 170–177
Evangelista G (2001). Flexible wavelets for music signal processing. J New Music Res 30(1): 13–22
Faloutsos C, Lin KI (1995) FastMap: A fast algorithm for indexing, data mining and visualization of traditional and multimedia datasets. In: Carey MJ, Schneider DA (eds) Proceedings of the 1995 ACM SIGMOD international conference on management of data, San Jose, pp 163–174
Flexer A, Pampalk E, Widmer G (2005) Hidden Markov models for spectral similarity of songs. In: Proceedings of the 8th international conference on digital audio effects, Madrid, Spain
Foote J (2002) Audio retrieval by rhythmic similarity. In: Proceedings of the 3rd international conference on music information retrieval
Foote J, Uchihashi S (2001) The beat spectrum: a new approach to rhythm analysis. In: Proceedings of the IEEE international conference on multimedia and expo, Tokyo, Japan, pp 224–228
Friedman J (1989). Regularized discriminant analysis. J Am Stat Assoc 84: 165–175
Fucks W (1962). Mathematical analysis of formal structure of music. IEEE Trans Inform Theory 8(5): 225–228
Fucks W (1963) Mathematische Analyse von Formalstrukturen von Werken der Musik (mit Diskussion). In: Arbeitsgemeinschaft für Forschung des Landes Nordrhein-Westfalen, Westdeutscher Verlag, Köln und Opladen, pp 39–114
Fucks W (1964) Gibt es mathematische Gesetze in Sprache und Musik? In: Frank H (ed) Kybernetik – Brücke zwischen den Wissenschaften, Umschau Verlag, Frankfurt am Main, pp 171–183
Fucks W (1968). Nach allen Regeln der Kunst. DVA, Stuttgart
Fucks W and Lauter J (1965). Exaktwissenschaftliche Musikanalyse. Westdeutscher Verlag, Köln und Opladen
Godsill S, Davy M (2003) Bayesian modelling of music audio signals. In: Bulletin of the International Statistical Institute, 54th Session, Berlin, vol LX, book 2, pp 504–506
Godsill S, Davy M (2005) Bayesian computational models for inharmonicity in musical instruments. In: IEEE workshop on applications of signal processing to audio and acoustics, New Paltz, NY, pp 283–286
Gomez E (2004). Tonal description of polyphonic audio for music content processing. INFORMS J Comput Spec Clust Comput Music 18(3): 294–304
Gomez E (2006) Tonal description of music audio signals: harmonic pitch class profiles, tonality and tonal similarity of polyphonic audio signals. PhD thesis, Departament de Tecnologia, Universitat Pompeu Fabra, Barcelona, Spain
Goto M (2003) A chorus-section detecting method for musical audio signals. In: Proceedings of the IEEE international conference on acoustics, speech, and signal processing, pp 437–440
Goto M (2004) A predominant-F0 estimation method for polyphonic musical audio signals. In: Proceedings of the 18th international congress on acoustics (ICA’04), Acoustical Society of Japan, Kyoto, Japan, pp 1085–1088
Gouyon F (2005) A computational approach to rhythm description: Audio features for the computation of rhythm periodicity functions and their use in tempo induction and music content processing. PhD thesis, Universitat Pompeu Fabra, Departament de Tecnologia, Barcelona, Spain
Gouyon F and Dixon S (2005). A review of automatic rhythm description systems. Comput Music J 29(1): 34–54
Gouyon F, Klapuri A, Dixon S, Alonso M, Tzanetakis G, Uhle C and Cano P (2006). An experimental comparison of audio tempo induction algorithms. IEEE Trans Speech Audio Process 14(5): 1832–1844
Gromko JE (1993). Perceptual differences between expert and novice music listeners at multidimensional scaling analysis. Psychol Music 21: 34–47
Hastie T, Tibshirani R, Friedman J (2001) The elements of statistical learning. Springer, New York, http://www-stat.stanford.edu.tibs/ElemStatLearn/
Herre J, Allamanche E, Ertel C (2003) How similar do songs sound? Towards modeling human perception of musical similarity. In: Proceedings of the IEEE workshop on applications of signal processing to audio and acoustics, pp 83–86
Herrera P, Sandvold V, Gouyon F (2004) Percussion-related semantic descriptors of music audio files. In: Proceedings of the 25th international AES conference, London, United Kingdom
Hyvärinen A, Karhunen J and Oja E (2001). Independent component analysis. Wiley, New York
Jürgensen F, Knopke I (2004) A comparison of automated methods for the analysis of style in fifteenth-century song intabulations. In: Parncutt R, Kessler A, Zimmer F (eds) Proceedings of the conference on interdisciplinary musicology (CIM04), http://www-gewi.uni-graz.at/staff/parncutt/cim04/CIM04_paper_pdf/JurgensenKnopke.pdf
Kantz H and Schreiber T (1997). Nonlinear time series analysis. Cambridge University Press, Cambridge
Klapuri A (2001) Multipitch estimation and sound separation by the spectral smoothness principle. In: IEEE international conference on acoustics, speech and signal processing (ICASSP), vol 5, pp 3381–3384
Klapuri A (2004). Automatic music transcription as we know it today. J New Music Res 33(3): 269–282
Klapuri A, Davy M (eds) (2006). Signal processing methods for music transcription. Springer, New York
Kleber B (2002) Evaluation von Stimmqualität in westlichem, klassischen Gesang. Diploma Thesis, Fachbereich Psychologie, Universität Konstanz, Germany
Knees P, Pampalk E, Widmer G (2004) Artist classification with web-based data. In: Proceedings of the 5th international conference on music information retrieval. Barcelona, Spain, pp 517–524
Knuth D (1984). The TEXbook. Addison-Wesley, Reading
Kohonen T (1995). Self-organizing maps. Springer, Berlin
Kopiez R, Weihs C, Ligges U and Lee JI (2006). Classification of high and low achievers in a music sight-reading task. Psychol Music 34(1): 5–26
Koza JR (1992). Genetic programming: on the programming of computers by means of natural selection. MIT, Cambridge
Kranenburg Pv, Backer E (2004) Musical style recognition—a quantitative approach. In: Parncutt R, Kessler A, Zimmer F (eds) Proceedings of the conference on interdisciplinary musicology (CIM04), http://www-gewi.uni-graz.at/staff/parncutt/cim04/CIM04_paper_pdf/Kranenburg_Backer_CIM04_proceedings.pdf
Krumhansl CL (1990). Cognitive foundations of musical pitch. Oxford Psychology Series 17. Oxford University Press, Oxford
Kulesh V, Sethi I, V P (2003) Indexing and retrieval of music via Gaussian mixture models. In: Proceedings of the 3rd international workshop on content based multimedia indexing, Rennes, France, pp 201–205
Kullback S and Leibler RA (1951). On information and sufficiency. Ann Math Stat 22: 79–86
Kurth F, Gehrmann T, Müller M (2006) The cyclic-beat spectrum: Tempo-related audio features for time-scale invariant audio identification. In: Proceedings of the 7th international conference on music information retrieval, pp 35–40
Lambrou T, Kudumakis P, Speller R, Sandler M, Linney A (1998) Classification of audio signals using statistical features on time and wavelet transform domains. In: Proceedings of the IEEE international conference on acoustics, speech, and signal processing, vol 6, pp 3621–3624
Lamport L (1994). LATEX , a document preparation system, 2nd edn. Addison-Wesley, Reading
Lehwark P, Risi S, Ultsch A (2007) Visualization and clustering of tagged music data. In: Proceedings GfKl 2007, Freiburg, Germany (to appear)
Lesaffre M, Tanghe K, Martens G, Moelants D, Leman M, De Baets B, De Meyer H, Martens JP (2003) The MAMI query-by-voice experiment: collecting and annotating vocal queries for music information retrieval. In: Proceedings of the 4th international conference on music information retrieval, Baltimore, Maryland, USA and Library of Congress, Washington, DC, USA, pp 65–71
Levy M, Sandler M (2006) Lightweight measures for timbral similarity of musical audio. In: Proceedings of the first ACM workshop on audio and music computing multimedia (AMCMM). ACM, New York, pp 27–36
Li D, Sethi I, Dimitrova N and McGee T (2001). Classification of general audio data for content-based retrieval. Pattern Recogn Lett 22: 533–544
Li T, Ogihara M, Li Q (2003) A comparative study on content-based music genre classification. In: Proceedings of the 26th international ACM SIGIR conference on research and development in information retrieval. ACM, New York, pp 282–289
Lidy T, Rauber A (2005) Evaluation of feature extractors and psycho-acoustic transformations for music genre classification. In: Proceedings of the 6th international conference on music information retrieval, pp 34–41
Ligges U (2006) Transkription monophoner Gesangszeitreihen. Dissertation, Fachbereich Statistik, Universität Dortmund, Dortmund, Germany, http://hdl.handle.net/2003/22521
Logan B (2000) Mel frequency cepstral coefficients for music modeling. In: Proceedings of the first international conference on music information retrieval, pp 23–25
Logan B, Salomon A (2001) A music similarity function based on signal analysis. In: Proceedings of the IEEE international conference on multimedia and expo, pp 745–748
Mandel M, Ellis D (2005) Song-level features and SVMs for music classification. In: Proceedings of the 6th international conference on music information retrieval, pp 594–599
Markuse B and Schneider A (1996). ähnlichkeit, Nähe, Distanz: zur Anwendung multidimensionaler Skalierung in musik-wissenschaftlichen Untersuchungen. Systematische Musikwissenschaft / Systematic Musicology / Musicologie syst[[’e]]matique 4: 53–89
McEnnis D, McKay C, Fujinaga I, Depalle P (2005) jAudio: a feature extraction library. In: Proceedings of the 6th international conference on music information retrieval, pp 600–603
McKinney M, Breebaart J (2003) Features for audio and music classification. In: Proceedings of the 4th international conference on music information retrieval, pp 151–158
Meng A (2006) Temporal feature integration for music organisation. PhD thesis, Informatics and Mathematical Modelling, Technical University of Denmark, DTU
Meng A, Ahrendt P, Larsen J (2005) Improving music genre classification by short-time feature integration. In: Proceedings of the IEEE international conference on acoustics, speech, and signal processing, vol V, pp 497–500
Meng A, Ahrendt P, Larsen J and Hansen LK (2006). Temporal feature integration for music genre classification. IEEE Trans Signal Process 15: 1654–1664
Meyer J (1995) Akustik und musikalische Aufführungspraxis. Bochinsky, Frankfurt am Main
Meyer LB (1957). Meaning in music and information theory. J Aesthet Art Criticism 15: 412–424
Microsoft Corporation (1991) Multimedia programming interface and data specification, 1.0. Joint design by IBM Corporation and Microsoft Corporation
MIDI Manufacturers Association (2001) Complete MIDI 1.0 Detailed Specification, 2nd edn, http://www.midi.org
Mierswa I and Morik K (2005). Automatic feature extraction for classifying audio data. Mach Learn J 58: 127–149
Mierswa I, Wurst M, Klinkenberg R, Scholz M, Euler T (2006) YALE: Rapid prototyping for complex data mining tasks. In: Proceedings of the 12th ACM SIGKDD international conference on knowledge discovery and data mining. ACM, New York, NY, USA, pp 935–940
Moles A (1958). Th[[’e]]orie de l’information et perception est[[’e]]tique. Flammarion, Paris
Moles A (1971). Informationstheorie und ästhetische Wahrnehmung. DuMont Schauberg, Köln
Moore BCJ and Glasberg BR (1996). A revision of Zwickers loudness model. ACTA Acustica 82: 335–345
Mörchen F, Ultsch A, Nöcker M, Stamm C (2005a) Databionic visualization of music collections according to perceptual distance. In: Proceedings of the 6th international conference on music information retrieval, pp 396–403
Mörchen F, Ultsch A, Thies M, Löhken I, Nöcker M, Stamm C, Efthymiou N, Kümmerer M (2005b) MusicMiner: visualizing timbre distances of music as topographical maps. Tech. rep., Department of Mathematics and Computer Science, University of Marburg, Germany
Mörchen F, Mierswa I, Ultsch A (2006a) Understandable models of music collections based on exhaustive feature generation with temporal statistics. In: Proceedings of the 12th ACM SIGKDD international conference on knowledge discovery and data mining. ACM, Philadelphia, PA, USA, pp 882–891
Mörchen F, Ultsch A, Thies M and Löhken I (2006b). Modelling timbre distance with temporal statistics from polyphonic music. IEEE Trans Speech Audio Process 14(1): 81–90
Müllensiefen D and Frieler K (2004a). Cognitive adequacy in the measurement of melodic similarity: Algorithmic vs. human judgments. Comput Musicol 13: 147–176
Müllensiefen D, Frieler K (2004b) Optimizing measures of melodic similarity for the exploration of a large folk song database. In: 5th international conference on music information retrieval, Audiovisual Institute, Universitat Pompeu Fabra, Barcelona, Spain, pp 274–280
Müllensiefen D, Hennig C (2006) Modeling memory for melodies. In: Spiliopoulou M, Kruse R, Borgelt C, Nürnberger A, Gaul W (eds) From data and information analysis to knowledge engineering, Springer, Berlin, pp 732–739
Narmour E (1990) The Analysis and Cognition of Basic Melodic Structures: The Implication-Realization Model. University of Chicago Press, Chicago
Nienhuys HW, Nieuwenhuizen J, et al (2005) GNU LilyPond—the music typesetter. Free Software Foundation, http://www.lilypond.org/,version 2.6.5
Ombao H, Raz J, Malow B and Sachs R (2001). Automatic statistical analysis of bivariate nonstationary time series. J Am Stat Assoc 96(454): 543–560
Oppenheim A, Schafer R and Buck J (1999). Discrete-time signal processing, 2nd edn. Prentice-Hall, New Jersey
Pachet F, Zils A (2003) Evolving automatically high-level music descriptors from acoustic signals. In: Proceedings of the international symposium on computer music modeling and retrieval, pp 42–53
Pampalk E (2004) A MATLAB toolbox to compute music similarity from audio. In: Proceedings of the 5th international conference on music information retrieval, Barcelona, Spain, pp 254–257
Pampalk E (2006a) Audio-based music similarity and retrieval: Combining a spectral similarity model with information extracted from fluctuation patterns. In: 3rd Annual Music Information Retrieval eXchange (MIREX’06), http://pampalk.at/publications/
Pampalk E (2006b) Computational models of music similarity and their application in music information retrieval. PhD thesis, Computer Science Department, Technical University Vienna, Austria
Pampalk E, Goto M (2006) MusicRainbow: a new user interface to discover artists using audio-rased similarity and web-based labeling. In: Proceedings of the 7th international conference on music information retrieval, pp 367–370
Pampalk E, Rauber A, Merkl D (2002) Content-based organization and visualization of music archives. In: Proceedings of the 10th ACM international conference on multimedia, pp 570–579
Pampalk E, Dixon S, Widmer G (2003a) Exploring music collections by browsing different views. In: Proceedings of the 4th international conference on music information retrieval, pp 201–208
Pampalk E, Dixon S, Widmer G (2003b) On the evaluation of perceptual similarity measures for music. In: Proceedings of the international conference on digital audio effects, pp 6–12
Pampalk E, Flexer A, Widmer G (2005) Hierarchical organization and description of music collections at the artist level. In: Proceedings of the 9th European conference on research and advanced technology for digital libraries, pp 37–48
Pang H and Yoon D (2005). Automatic detection of vibrato in monophonic music. Pattern Recogn 38(7): 1135–1138
Pearce MT and Wiggins GA (2004). Improved methods for statistical modelling of monophonic music. J New Music Res 33(4): 367–385
Pearce MT and Wiggins GA (2006). Expectation in melody: the influence of context and learning. Music Percept 23(5): 377–405
Pierce JR (1992). The science of musical sound, 2nd ed. W.H. Freeman and Co., New York
Plumbley M (2003). Algorithms for nonnegative independent component analysis. IEEE Trans Neural Netw 14(3): 534–543
Plumbley M (2004) Optimization using Fourier expansion over a geodesic for non-negative ICA. In: Proceedings of the international conference on independent component analysis and blind signal separation (ICA 2004), Granada, Spain, pp 49–56
Plumbley M, Abdallah S, Blumensath T, Jafari M, Nesbit A, Vincent E, Wang B (2006) Musical audio analysis using sparse representations. In: COMPSTAT 2006—proceedings in computational statistics, Physica Verlag, Heidelberg, pp 104–117
Pohle T (2006) Post processing music similarity computations. In: The second annual music information retrieval evaluation eXchange (MIREX 2006), pp 16–18, http://www.music-ir.org/evaluation/MIREX/2006_abstracts/AS_pohle.pdf
Pohle T, Pampalk E, Widmer G (2005) Evaluation of frequently used audio features for classification of music into perceptual categories. In: Proceedings of the 4th international workshop on content-based multimedia indexing (CBMI), Riga, Latvia
Polotti P, Evangelista G (2000) Harmonic-band wavelet coefficient modeling for pseudo-periodic sound processing. In: Proceedings of the COST G-6 conference on digital audio effects (DAFX-00), Verona, Italy, pp 103–108
Polotti P, Evangelista G (2001) Multiresolution sinusoidal/stochastic model for voiced-sounds. In: Proceedings of the COST G-6 conference on digital audio effects (DAFX-01), Limerick, Ireland, pp 120–124
Pressing J, Lawrence P (1993) Transcribe: a comprehensive autotranscription program. In: Proceedings of the international computer music conference, Tokyo, Japan, pp 343–345
Pye D (2000) Content-based methods for managing electronic music. In: Proceedings of the international conference on acoustics, speech, and signal processing, pp 2437–2440
R Development Core Team (2007) R: a language and environment for statistical computing. R Foundation for Statistical Computing, Vienna, Austria, http://www.R-project.org,ISBN3-900051-07-0
Rabiner L and Juang BH (1993). Fundamentals of speech recognition. Prentice-Hall, New York
Rabiner LR (1989). A tutorial on Hidden Markov Models and selected applications in speech recognition. Proc IEEE 77(2): 257–286
Raphael C (2001). A probabilistic expert system for automatic musical accompaniment. J Comput Graph Stat 10(3): 487–512
Risi S, Mörchen F, Ultsch A, Lewark P (2007) Visual mining in music collections with emergent SOM. In: Proceedings workshop on self-organizing maps (WSOM) (to appear)
Rossignol S, Depalle P, Soumagne J, Rodet X, Collette JL (1999a) Vibrato: detection, estimation, extraction, modification. In: Proceedings of the COST-G6 workshop on digital audio effects (DAFx-99)
Rossignol S, Rodet X, Soumagne J, Collette JL and Depalle P (1999). Automatic characterisation of musical signals: feature extraction and temporal segmentation. J New Music Res 28(4): 281–295
Röver C, Klefenz F, Weihs C (2005) Identification of musical instruments by means of the Hough-transformation. In: Weihs C, Gaul W (eds) Classification—the ubiquitous challenge. Springer, Berlin, pp 608–615
Rubner Y, Tomasi C, Guibas LJ (1998) A metric for distributions with applications to image databases. In: Proceedings of the IEEE international conference on computer vision, Bombay, India, pp 59–66
Salton G and Buckley C (1988). Term-weighting approaches in automatic text retrieval. Inform Process Manage 24(5): 513–523
Sandvold V, Herrera P (2005) Towards a semantic descriptor of subjective intensity in music. In: Proceedings of the international computer music conference
Schedl M, Pohle TP, Knees P, Widmer G (2006) Assigning and visualizing music genres by web-based co-occurance analysis. In: Proceedings of the 7th international conference on music information retrieval, pp 260–265
Scheirer ED (1998). Tempo and beat analysis of acoustic musical signals. J Acoust Soc Am 103(1): 588–601
Schellenberg EG (1997). Simplifying the implication-realization model of melodic expectancy. Music Percept 14: 295–318
Seidner W and Wendler J (1997). Die Sängerstimme. Henschel, Berlin
Shao X, Xu C, Kankanhalli MS (2004) Unsupervised classification of music genre using Hidden Markov Model. In: Proceedings of the IEEE international conference on multimedia and expo, pp 2023–2026
Shapiro S (1978). Feature space transforms for curve detection. Pattern Recogn 10: 129–143
Smaragdis P, Brown J (2003) Non-negative matrix factorization for polyphonic music transcription. In: IEEE workshop on applications of signal processing to audio and acoustics, pp 177–180
Steinbeck W (1982). Struktur und ähnlichkeit: Methoden automatisierter Melodieanalyse. Bärenreiter, Kassel
Stenzel R, Kamps T (2005) Improving content-based similarity measures by training a collaborative model. In: Proceedings of the 6th international conference on music information retrieval, pp 264–271
Stevens S and Volkmann J (1940). The relation of pitch to frequency. Am J Psychol 53(3): 329–353
Streich S, Herrera P (2004) Toward describing perceived complexity of songs: computational methods and implementation. In: Proceedings of the 25th international AES conference
Streich S, Herrera P (2005) Detrended fluctuation analysis of music signals: Danceability estimation and further semantic characterization. In: Proceedings of the 118th AES convention
Temperley D (2001). The cognition of basic musical structures. MIT, Cambridge
Temperley D (2004). Bayesian models of musical structure and cognition. Music Sci 8(2): 175–205
Temperley D (2006) A probabilistic model of melody perception. In: Proceeding of the 7th international conference on music information retrieval, pp 276–279, http://ismir2006.ismir.net/PAPERS/ISMIR0630_Paper.pdf
Temperley D (2007). Music and probability. MIT, Cambridge
Thomassen J (1982). Melodic accent: experiments and a tentative model. J Acoust Soc Am 71: 1596–1605
Torrens M, Hertzog P, Arcos JL (2004) Visualizing and exploring personal music libraries. In: Proceedings of the 5th international conference on music information retrieval, pp 421–424
Tzanetakis G and Cook P (2000). MARSYAS: a framework for audio analysis. Organ Sound 4(30): 169–175
Tzanetakis G and Cook P (2002). Musical genre classification of audio signals. IEEE Trans Speech Audio Process 10(5): 293–302
Tzanetakis G, Ermolinskyi A, Cook P (2002a) Beyond the query-by-example paradigm: New query interfaces for music. In: Proceedings of the international computer music conference, pp 177–183
Tzanetakis G, Ermolinskyi A, Cook P (2002b) Pitch histograms in audio and symbolic music information retrieval. In: Proceedings of the 3rd international conference on music information retrieval, pp 31–38
Tzanetakis G, Essl G, Cook P (2002c) Human perception and computer extraction of beat strength. In: Proceedings of the international conference on digital audio effects (DAFx-02), pp 257–261
Ultsch A (1993) Self-organizing neural networks for visualization and classification. In: Opitz O, Lausen B, Klar R (eds) Information and classification—concepts, methods, and applications, Springer, Berlin, pp 307–313
Ultsch A (1996) Self organizing neural networks perform different from statistical k-means clustering. In: BMBF Statusseminar Künstliche Intelligenz, Neuroinformatik und Intelligente Systeme, München, pp 433–443
Ultsch A, Mörchen F (2005) ESOM-Maps: Tools for clustering, visualization, and classification with emergent SOM. Tech. Rep. 46, Department of Mathematics and Computer Science, University of Marburg, Germany
Van Trees H (2001) Detection, estimation, and modulation theory, Part I, reprint edn. Wiley-Interscience, Melbourne
Vembu S, Baumann S (2005) A self-organizing map based knowledge discovery for music recommendation systems. In: Computer music modeling and retrieval, pp 119–229
Vignoli F, Pauws S (2005) A music retrieval system based on user driven similarity and its evaluation. In: Proceedings of the 6th international conference on music information retrieval, pp 272–279
Vignoli F, van Gulik R, van de Wetering H (2004) Mapping music in the palm of your hand, explore and discover your collection. In: Proceedings of the 5th International Conference on Music Information Retrieval, pp 409–414
Viste H, Evangelista G (2001) Sounds source separation: Preprocessing for hearing aids and structured audio coding. In: Proceedings of the COST G-6 conference on digital audio effects (DAFX-01), Limerick, pp 67–70
Viste H, Evangelista G (2002) An extension for source separation techniques avoiding beats. In: Proceedings of the 5th international conference on digital audio effects (DAFx-02), Hamburg, Germany, pp 71–75
Wakefield G (1999) Mathematical representation of joint time-chroma distributions. In: Proceedings of the SPIE international symposium on optical science, engineering and instrumentation, Denver, Colorado, pp 637–645
Walmsley P, Godsill S, Rayner P (1999) Polyphonic pitch tracking using joint Bayesian estimation of multiple frame parameters. In: IEEE workshop on applications of signal processing to audio and acoustics, New Paltz, pp 119–122
Wapnick J and Ekholm E (1997). Expert consensus in solo voice performance evaluation. J Voice 11(4): 429–436
Weihs C, Ligges U (2005) From local to global analysis of music time series. In: Morik K, Boulicaut JF, Siebes A (eds) Local pattern detection. Springer, Berlin, Lecture Notes in Artificial Intelligence 3539, pp 217–231
Weihs C, Ligges U (2006) Parameter optimization in automatic transcription of music. In: Spiliopoulou M, Kruse R, Nürnberger A, Borgelt C, Gaul W (eds) From data and information analysis to knowledge engineering. Springer, Berlin, pp 740–747
Weihs C, Berghoff S, Hasse-Becker P, Ligges U (2001) Assessment of purity of intonation in singing presentations by discriminant analysis. In: Kunert J, Trenkler G (eds) Mathematical statistics and biometrical applications. Josef Eul, Bergisch-Gladbach, pp 395–410
Weihs C, Ligges U, Sommer K (2006a) Analysis of music time series. In: Rizzi A, Vichi M (eds) COMPSTAT 2006—proceedings in computational statistics. Physica Verlag, Heidelberg, pp 147–159
Weihs C, Szepannek G, Ligges U, Luebke K, Raabe N (2006b) Local models in register classification by timbre. In: Batagelj V, Bock HH, Ferligoj A, Ziberna A (eds) Data science and classification, Springer, Berlin, pp 315–322
West K, Cox S, Lamere P (2006) Incorporating machine-learning into music similarity estimation. In: Proceedings of the first ACM workshop on Audio and music computing multimedia (AMCMM). ACM, New York, pp 89–96
Whiteley N, Cemgil A, Godsill S (2006) Bayesian modelling of temporal structure in musical audio. In: 7th international conference on music information retrieval, Victoria, Canada, pp 29–34
Whittaker J (1990). Graphical models in applied multivariate statistics. Wiley, New York
Wolfe P, Godsill S and Ng WJ (2004). Bayesian variable selection and regularization for time-frequency surface estimation. J R Stat Soc: Ser B (Stat Methodol) 66(3): 575–589
Zils A, Pachet F (2004) Automatic extraction of music descriptors from acoustic signals using EDS. In: Proceedings of the 116th AES Convention
Zwicker E and Stevens S (1957). Critical bandwidths in loudness summation. J Acoust Soc Am 29(5): 548–557
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Weihs, C., Ligges, U., Mörchen, F. et al. Classification in music research. ADAC 1, 255–291 (2007). https://doi.org/10.1007/s11634-007-0016-x
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11634-007-0016-x
Keywords
- Classification in musicology
- Automatic transcription
- Music psychology
- Organization of music collections
- Compositional structure analysis