ABSTRACT
This paper presents my dissertation framework to extract local keys, chords, and segment popular music from audio signals; all unsupervised. Music signals are denoised using wavelet transform to obtain a smoother approximation for chroma extraction. We extract a bag of local keys from the chromagram using an infinite Gaussian mixture and use the key information to extract a time series of chords. Using chords, we transform the bag of keys into a timed sequence of local keys. The two time series, local keys and chords, are used to construct multi-dimensioned "harmonic rhythm" as segmentation cues. We propose to calculate the strangeness of the cues from the perspective of keys, speed, and dependence as a basis for change detection in the framework of a martingale-based algorithm to find segmentation boundaries. Given the structural information, the chord sequence can be further improved in a refinement loop consisting of keys, chords, and segmentations.
- Basseville, M. and Nikiforov, I.V. 1993. Detection of Abrupt Changes: Theory and Application. Prentice Hall. Google ScholarDigital Library
- Catteau, B., Martens, J., and Leman, M. 2007. A probabilistic framework for audio-based tonal key and chord recognition. Adv. in Data Analysis Proc. 30th Annu. Conf. Gesellschaft Für Klassifikation, pp. 637--644.Google Scholar
- Chen, R and Li, M. 2011. Music structural segmentation by combining harmonic and timbral information. Proc. Int. Conf. Music Inf. Retrieval (ISMIR)Google Scholar
- Foote, J. 1999. Visualizing music and audio using self-similarity. Proc. of ACM Multimedia, pp. 77--80. Google ScholarDigital Library
- Fujishima, T. 1999. Realtime chord recognition of musical sound: a system using Common Lisp Music. Proc. of the International Computer Music Conference (ICMC), pp. 464--467.Google Scholar
- Harte, C., Sandler, M., Abdallah, S., and Gómez, E. 2005. Symbolic representation of musical chords: A proposed syntax for text annotations. Proc. Int. Conf. Music Inf. Retrieval (ISMIR).Google Scholar
- Ho, S-.S. and Wechser, H. 2010. A martingale framework for detecting changes in data streams by testing exchangeability. IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 32, no. 12. Google ScholarDigital Library
- Jensen, K. 2007. Multiple scale music segmentatio nusing rhythm, timbre, and harmony. EURASIP Journal on Advances in Signal Processing. Article ID 73205. Google ScholarDigital Library
- Krumhansl, C. 1990. Cognitive Foundation of Musical Pitch. Oxford University Press.Google Scholar
- Müller, M. and Ewert, S. 2011. Chroma Toolbox: MATLAB Implementations for Extracting Variants of Chroma-Based Audio Features. Proc. Int. Conf. Music Inf. Retrieval (ISMIR).Google Scholar
- Oudre, L., Févotte, C., and Grenier, Y. 2011. "Probabilistic template-based chord recognition," IEEE Transactions on Audio, Speech and Language Processing, 19(8):2249--2259. Google ScholarDigital Library
- Paulus, J., Müller, M. and Klapuri, A. 2010. Audio-based music structure analysis. Proc. Int. Conf. Music Inf. Retrieval (ISMIR).Google Scholar
- Pauwels, J., Martens, J.-P., and Leman, M. 2011. Improving the key extracting performance of a simultaneous local key and chord estimation system. Proc. of the International Conference on Multimedia and Expo (ICME), IEEE, pp. 1--6. Google ScholarDigital Library
- Pollack, A.W. Notes on & Series http://www.icce.rug.nl/~soundscapes/DATABASES/AWP/awp-alphabet.shtml. Retrieved on October 13, 2011.Google Scholar
- Rasmussen, C.E. 2000. The infinite Gaussian mixture model. Advances in Neural Information Processing Systems, pp. 554--560, MIT Press.Google Scholar
- Rocher, T., Robine, M., Hanna, P., and Oudre, L. 2010. Concurrent estimation of chords and keys from audio. Proc. Int. Conf. Music Inf. Retrieval (ISMIR).Google Scholar
- Sheh, A. and Ellis, D. 2003. Chord segmentation and recognition using EM-trained hidden Markov models. In Proc. Int. Conf. Music Inf. Retrieval (ISMIR).Google Scholar
- Stephenson, K. 2002. What to Listen for in Rock: A Stylistic Analysis. Yale University Press.Google Scholar
- Swain, J.P. 2002. Harmonic Rhythm: Analysis and Interpretation. Oxford University Press.Google Scholar
Index Terms
- Toward segmentation of popular music
Recommendations
Computational Analysis of Jazz Music: Estimating Tonality through Chord Progression Distances
CSAE '23: Proceedings of the 7th International Conference on Computer Science and Application EngineeringCurrently, research in music informatics focuses extensively on music theory, particularly on the theoretical systems of Western classical music dating back to the 19th century. However, contemporary popular music genres such as pop, rock, and jazz often ...
A Query-by-Singing System for Retrieving Karaoke Music
This paper investigates the problem of retrieving karaoke music using query-by-singing techniques. Unlike regular CD music, where the stereo sound involves two audio channels that usually sound the same, karaoke music encompasses two distinct channels ...
Automatic singer recognition of popular music recordings via estimation and modeling of solo vocal signals
In this paper, we investigate the problem of automatic singer identification, detection and tracking in popular music recordings with one or multiple singers. This problem reflects an important issue in multimedia applications that require the ...
Comments