Skip to main content

Harmonic and Percussive Sound Separation and Its Application to MIR-Related Tasks

  • Chapter
Book cover Advances in Music Information Retrieval

Abstract

In this chapter, we present a simple and fast method to separate a monaural audio signal into harmonic and percussive components, which leads to a useful pre-processing for MIR-related tasks. Exploiting the anisotropies of the power spectrograms of harmonic and percussive components, we define objective functions based on spectrogram gradients, and, applying to them the auxiliary function approach, we derive simple and fast update equations which guarantee the decrease of the objective function at each iteration. We show experimental results for sound separation on popular and jazz music pieces, and also present the application of the proposed technique to automatic chord recognition and rhythm-pattern extraction.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 129.00
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 169.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 169.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. http://www.music-ir.org/mirex2008/index.php

  2. Uhle, C., Dittmar, C., Sporer, T.: Extraction of drum tracks from polyphonic music using independent subspace analysis. In: Proc. ICA, April 2003, pp. 843–847 (2003)

    Google Scholar 

  3. Helen, M., Virtanen, T.: Separation of drums from polyphonic music using non-negative matrix factorization and support vector machine. In: Proc. EUSIPCO (September 2005)

    Google Scholar 

  4. Itoyama, K., Goto, M., Komatani, K., Ogata, T., Okuno, H.: Integration and Adaptation of Harmonic and Inharmonic Models for Separating Polyphonic Musical Signals. In: Proc. ICASSP, April 2007, pp. 57–60 (2007)

    Google Scholar 

  5. Daudet, L.: A Review on Techniques for the Extraction of Transients in Musical Signals. In: Kronland-Martinet, R., Voinier, T., Ystad, S. (eds.) CMMR 2005. LNCS, vol. 3902, pp. 219–232. Springer, Heidelberg (2006)

    Chapter  Google Scholar 

  6. Csiszár, I.: I-Divergence Geometry of Probability Distributions and Minimization Problems. The Annals of Probability 3(1), 146–158 (1975)

    Article  MATH  Google Scholar 

  7. Lee, D.D., Seung, H.S.: Algorithms for Non-Negative Matrix Factorization. In: Proc. NIPS, pp. 556–562 (2000)

    Google Scholar 

  8. Kameoka, H., Nishimoto, T., Sagayama, S.: A Multipitch Analyzer Based on Harmonic Temporal Structured Clustering. IEEE Trans. ASLP 15(3), 982–994 (2007)

    Google Scholar 

  9. Le Roux, J., Kameoka, H., Ono, N., de Cheveigne, A., Sagayama, S.: Single and Multiple F0 Contour Estimation Through Parametric Spectrogram Modeling of Speech in Noisy Environments. IEEE Trans. ASLP 15(4), 1135–1145 (2007)

    Google Scholar 

  10. Kameoka, H., Ono, N., Sagayama, S.: Auxiliary Function Approach to Parameter Estimation of Constrained Sinusoidal Model. In: Proc. ICASSP, April 2008, pp. 29–32 (2008)

    Google Scholar 

  11. Ono, N., Miyamoto, K., Kameoka, H., Sagayama, S.: A Real-time Equalizer of Harmonic and Percussive Components in Music Signals. In: Proc. ISMIR, Sepember 2008, pp. 139–144 (2008)

    Google Scholar 

  12. Ono, N., Miyamoto, K., Le Roux, J., Kameoka, H., Sagayama, S.: Separation of a Monaural Audio Signal into Harmonic/Percussive Components by Complementary Diffusion on Spectrogram. In: Proc. EUSIPCO (August 2008)

    Google Scholar 

  13. Goto, M., Hashiguchi, H., Nishimura, T., Oka, R.: RWC music database: Popular, classical, and jazz music databases. In: Proc. ISMIR, October 2002, pp. 287–288 (2002)

    Google Scholar 

  14. Kawakami, T., Nakai, M., Shimodaira, H., Sagayama, S.: Harmonization for melody using HMM. In: Proc. JHES, F-61, p. 361 (1999) (in Japanese)

    Google Scholar 

  15. Fujishima, T.: Real-time chord recognition of musical sound: A system using common lisp music. In: Proc. ICMC, pp. 464–467 (1999)

    Google Scholar 

  16. Bartsch, M.A., Wakefield, G.H.: To catch a chorus: Using chroma-based representations for audio thumbnailing. In: Proc. WASPAA, pp. 15–18 (2001)

    Google Scholar 

  17. Sheh, A., Ellis, D.P.W.: Chord segmentation and recognition using EM-trained hidden Markov models. In: Proc. ISMIR, pp. 183–189 (2003)

    Google Scholar 

  18. Bello, J.P., Pickens, J.: A robust mid-level representation for harmonic content in music signal. In: Proc. ISMIR, pp. 304–311 (2005)

    Google Scholar 

  19. Lee, K., Slaney, M.: Acoustic chord transcription and key extraction from audio using key-dependent HMMs trained on synthesized audio. IEEE Trans. on Audio Speech and Language Processing 16(2), 291–301 (2008)

    Article  Google Scholar 

  20. http://www.music-ir.org/mirex/2008/results/MIREX2008_overview_A0.pdf

  21. Uchiyama, Y., Miyamoto, K., Nishimoto, T., Ono, N., Sagayama, S.: Automatic Chord Detection Using Harmonic Sound Emphasized Chroma from Musical Acoustic Signal. In: Proc. ASJ Spring Meeting, March 2008, pp. 901–902 (2008) (in Japanese)

    Google Scholar 

  22. Uchiyama, Y., Nishimoto, T., Ono, N., Sagayama, S.: HMM-based Audio Chord Detection Using Harmonic Emphasizing and Fourier-Transformed Chroma. IEEE Trans. on Audio Speech and Language Processing (submitted)

    Google Scholar 

  23. Goto, M.: An Audio-based Real-time Beat Tracking System for Music With or Without Drum-sounds. Journal of New Music Research 30(2), 159–171 (2001)

    Article  Google Scholar 

  24. Tzanetakis, G., Cook, P.: Musical genre classification of audio signals. IEEE Trans. Speech and Audio Processing 10(5), 293–302 (2002)

    Article  Google Scholar 

  25. Peeters, G.: Rhythm Classification Using Spectral Rhythm Patterns. In: Proc. ISMIR, September 2005, pp. 644–647 (2005)

    Google Scholar 

  26. Paulus, J., Klapuri, A.: Measuring the Similarity of Rhythmic Patterns. In: Proc. ISMIR, pp. 150–156 (2002)

    Google Scholar 

  27. Dixon, S., Guyon, F., Widmer, G.: Towards Characterization of Music via Rhythmic Patterns. In: Proc. ISMIR, pp. 509–516 (2004)

    Google Scholar 

  28. Ney, H.: The Use of a One-stage Dynamic Programming Algorithm for Connected Word Recognition. In: Proc. ICASSP, pp. 263–271 (1984)

    Google Scholar 

  29. Tsunoo, E., Miyamoto, K., Ono, N., Sagayama, S.: Rhythmic Features Extraction from Music Acoustic Signals using Harmonic/Non-Harmonic Sound Separation. In: Proc. ASJ Spring Meeting, March 2008, pp. 905–906 (2008) (in Japanese)

    Google Scholar 

  30. Tsunoo, E., Ono, N., Sagayama, S.: Rhythm Map: Extraction of Unit Rhythmic Patterns and Analysis of Rhythmic Structure from Music Acoustic Signals. In: Proc. ICASSP (April 2009)

    Google Scholar 

  31. Goto, M., Hashiguchi, H., Nishimura, T., Oka, R.: RWC Music Database: Music Genre Database and Musical Instrument Sound Database. In: Proc. ISMIR, October 2003, pp. 229–230 (2003)

    Google Scholar 

  32. Tsunoo, E., Tzanetakis, G., Ono, N., Sagayama, S.: Audio Genre Classification by Clustering Percussive Patterns. In: Proc. ASJ Spring Meeting, March 2009, pp. 877–878 (2009)

    Google Scholar 

  33. Tsunoo, E., Tzanetakis, G., Ono, N., Sagayama, S.: Audio Genre Classification Using Percussive Pattern Clustering Combined with Timbral Features. In: Proc. ICME (June 2009)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2010 Springer-Verlag Berlin Heidelberg

About this chapter

Cite this chapter

Ono, N. et al. (2010). Harmonic and Percussive Sound Separation and Its Application to MIR-Related Tasks. In: RaÅ›, Z.W., Wieczorkowska, A.A. (eds) Advances in Music Information Retrieval. Studies in Computational Intelligence, vol 274. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-11674-2_10

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-11674-2_10

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-11673-5

  • Online ISBN: 978-3-642-11674-2

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics