skip to main content
10.1145/2461466.2461534acmconferencesArticle/Chapter ViewAbstractPublication PagesicmrConference Proceedingsconference-collections
extended-abstract

Toward segmentation of popular music

Published:16 April 2013Publication History

ABSTRACT

This paper presents my dissertation framework to extract local keys, chords, and segment popular music from audio signals; all unsupervised. Music signals are denoised using wavelet transform to obtain a smoother approximation for chroma extraction. We extract a bag of local keys from the chromagram using an infinite Gaussian mixture and use the key information to extract a time series of chords. Using chords, we transform the bag of keys into a timed sequence of local keys. The two time series, local keys and chords, are used to construct multi-dimensioned "harmonic rhythm" as segmentation cues. We propose to calculate the strangeness of the cues from the perspective of keys, speed, and dependence as a basis for change detection in the framework of a martingale-based algorithm to find segmentation boundaries. Given the structural information, the chord sequence can be further improved in a refinement loop consisting of keys, chords, and segmentations.

References

  1. Basseville, M. and Nikiforov, I.V. 1993. Detection of Abrupt Changes: Theory and Application. Prentice Hall. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. Catteau, B., Martens, J., and Leman, M. 2007. A probabilistic framework for audio-based tonal key and chord recognition. Adv. in Data Analysis Proc. 30th Annu. Conf. Gesellschaft Für Klassifikation, pp. 637--644.Google ScholarGoogle Scholar
  3. Chen, R and Li, M. 2011. Music structural segmentation by combining harmonic and timbral information. Proc. Int. Conf. Music Inf. Retrieval (ISMIR)Google ScholarGoogle Scholar
  4. Foote, J. 1999. Visualizing music and audio using self-similarity. Proc. of ACM Multimedia, pp. 77--80. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. Fujishima, T. 1999. Realtime chord recognition of musical sound: a system using Common Lisp Music. Proc. of the International Computer Music Conference (ICMC), pp. 464--467.Google ScholarGoogle Scholar
  6. Harte, C., Sandler, M., Abdallah, S., and Gómez, E. 2005. Symbolic representation of musical chords: A proposed syntax for text annotations. Proc. Int. Conf. Music Inf. Retrieval (ISMIR).Google ScholarGoogle Scholar
  7. Ho, S-.S. and Wechser, H. 2010. A martingale framework for detecting changes in data streams by testing exchangeability. IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 32, no. 12. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. Jensen, K. 2007. Multiple scale music segmentatio nusing rhythm, timbre, and harmony. EURASIP Journal on Advances in Signal Processing. Article ID 73205. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. Krumhansl, C. 1990. Cognitive Foundation of Musical Pitch. Oxford University Press.Google ScholarGoogle Scholar
  10. Müller, M. and Ewert, S. 2011. Chroma Toolbox: MATLAB Implementations for Extracting Variants of Chroma-Based Audio Features. Proc. Int. Conf. Music Inf. Retrieval (ISMIR).Google ScholarGoogle Scholar
  11. Oudre, L., Févotte, C., and Grenier, Y. 2011. "Probabilistic template-based chord recognition," IEEE Transactions on Audio, Speech and Language Processing, 19(8):2249--2259. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. Paulus, J., Müller, M. and Klapuri, A. 2010. Audio-based music structure analysis. Proc. Int. Conf. Music Inf. Retrieval (ISMIR).Google ScholarGoogle Scholar
  13. Pauwels, J., Martens, J.-P., and Leman, M. 2011. Improving the key extracting performance of a simultaneous local key and chord estimation system. Proc. of the International Conference on Multimedia and Expo (ICME), IEEE, pp. 1--6. Google ScholarGoogle ScholarDigital LibraryDigital Library
  14. Pollack, A.W. Notes on & Series http://www.icce.rug.nl/~soundscapes/DATABASES/AWP/awp-alphabet.shtml. Retrieved on October 13, 2011.Google ScholarGoogle Scholar
  15. Rasmussen, C.E. 2000. The infinite Gaussian mixture model. Advances in Neural Information Processing Systems, pp. 554--560, MIT Press.Google ScholarGoogle Scholar
  16. Rocher, T., Robine, M., Hanna, P., and Oudre, L. 2010. Concurrent estimation of chords and keys from audio. Proc. Int. Conf. Music Inf. Retrieval (ISMIR).Google ScholarGoogle Scholar
  17. Sheh, A. and Ellis, D. 2003. Chord segmentation and recognition using EM-trained hidden Markov models. In Proc. Int. Conf. Music Inf. Retrieval (ISMIR).Google ScholarGoogle Scholar
  18. Stephenson, K. 2002. What to Listen for in Rock: A Stylistic Analysis. Yale University Press.Google ScholarGoogle Scholar
  19. Swain, J.P. 2002. Harmonic Rhythm: Analysis and Interpretation. Oxford University Press.Google ScholarGoogle Scholar

Index Terms

  1. Toward segmentation of popular music

        Recommendations

        Comments

        Login options

        Check if you have access through your login credentials or your institution to get full access on this article.

        Sign in
        • Published in

          cover image ACM Conferences
          ICMR '13: Proceedings of the 3rd ACM conference on International conference on multimedia retrieval
          April 2013
          362 pages
          ISBN:9781450320337
          DOI:10.1145/2461466

          Copyright © 2013 ACM

          Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

          Publisher

          Association for Computing Machinery

          New York, NY, United States

          Publication History

          • Published: 16 April 2013

          Permissions

          Request permissions about this article.

          Request Permissions

          Check for updates

          Qualifiers

          • extended-abstract

          Acceptance Rates

          ICMR '13 Paper Acceptance Rate38of96submissions,40%Overall Acceptance Rate254of830submissions,31%

          Upcoming Conference

          ICMR '24
          International Conference on Multimedia Retrieval
          June 10 - 14, 2024
          Phuket , Thailand
        • Article Metrics

          • Downloads (Last 12 months)3
          • Downloads (Last 6 weeks)0

          Other Metrics

        PDF Format

        View or Download as a PDF file.

        PDF

        eReader

        View online with eReader.

        eReader