Harmonic and Percussive Sound Separation and Its Application to MIR-Related Tasks

Ono, Nobutaka; Miyamoto, Kenichi; Kameoka, Hirokazu; Le Roux, Jonathan; Uchiyama, Yuuki; Tsunoo, Emiru; Nishimoto, Takuya; Sagayama, Shigeki

doi:10.1007/978-3-642-11674-2_10


Part of the book series: Studies in Computational Intelligence (SCI, volume 274)


Abstract

In this chapter, we present a simple and fast method for separating a monaural audio signal into harmonic and percussive components, which serves as a useful pre-processing step for MIR-related tasks. Exploiting the anisotropy of the power spectrograms of harmonic and percussive components, we define objective functions based on spectrogram gradients. Applying the auxiliary function approach to these objectives, we derive simple and fast update equations that guarantee that the objective function decreases at each iteration. We show experimental results for sound separation on popular and jazz music pieces, and present applications of the proposed technique to automatic chord recognition and rhythm-pattern extraction.
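
As an illustration of the idea in the abstract, the following is a minimal sketch, not the chapter's algorithm. It assumes a gradient-based cost in which the harmonic part H should be smooth along time and the percussive part P smooth along frequency, under the constraint H + P = W for a non-negative spectrogram W (rows are frequency bins, columns are time frames). The chapter's auxiliary-function update equations are not reproduced in this preview, so a plain projected gradient step is swapped in here. The names `objective`, `separate`, `toy_spectrogram`, and the weight `alpha` are hypothetical.

```python
import numpy as np


def objective(H, P, alpha=0.5):
    """Assumed cost: penalize time-gradients of H and frequency-gradients of P."""
    dH = np.diff(H, axis=1)  # differences between adjacent time frames
    dP = np.diff(P, axis=0)  # differences between adjacent frequency bins
    return 0.5 * alpha * np.sum(dH ** 2) + 0.5 * (1.0 - alpha) * np.sum(dP ** 2)


def separate(W, n_iter=50, alpha=0.5):
    """Split W into H + P by projected gradient descent on `objective`."""
    W = np.asarray(W, dtype=float)
    H = 0.5 * W          # start from an even split
    P = W - H
    for _ in range(n_iter):
        # Second difference of H along time, with replicated edges.
        Hp = np.pad(H, ((0, 0), (1, 1)), mode="edge")
        lap_H = Hp[:, :-2] - 2.0 * H + Hp[:, 2:]
        # Second difference of P along frequency, with replicated edges.
        Pp = np.pad(P, ((1, 1), (0, 0)), mode="edge")
        lap_P = Pp[:-2, :] - 2.0 * P + Pp[2:, :]
        # Negative gradient of the cost w.r.t. H (with P = W - H), step size 1/4.
        delta = (alpha * lap_H - (1.0 - alpha) * lap_P) / 4.0
        # Project back onto 0 <= H <= W so that both parts stay non-negative.
        H = np.clip(H + delta, 0.0, W)
        P = W - H
    return H, P


def toy_spectrogram(n_freq=16, n_time=16):
    """Toy input: one sustained tone (horizontal line) and one click (vertical line)."""
    W = np.zeros((n_freq, n_time))
    W[3, :] += 1.0   # constant over time: harmonic-like
    W[:, 5] += 1.0   # constant over frequency: percussive-like
    return W


if __name__ == "__main__":
    W = toy_spectrogram()
    H, P = separate(W)
    print("cost before:", objective(0.5 * W, 0.5 * W))
    print("cost after: ", objective(H, P))
```

On the toy input, the horizontal line ends up mostly in H and the vertical line mostly in P. The step size of 1/4 is chosen so that each projected step does not increase the assumed cost; the auxiliary function approach named in the abstract is a different way of obtaining such a guarantee.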

Author information

Authors and Affiliations

Department of Information Physics and Computing, Graduate School of Information Science and Technology, The University of Tokyo, 7-3-1 Hongo Bunkyo-ku, Tokyo, 113-8656, Japan
Nobutaka Ono, Kenichi Miyamoto, Hirokazu Kameoka, Jonathan Le Roux, Yuuki Uchiyama, Emiru Tsunoo, Takuya Nishimoto & Shigeki Sagayama

Editor information

Editors and Affiliations

University of North Carolina, Charlotte, NC, USA
Zbigniew W. Raś
Polish-Japanese Institute of IT, Warsaw, Poland
Alicja A. Wieczorkowska

About this chapter

Cite this chapter

Ono, N. et al. (2010). Harmonic and Percussive Sound Separation and Its Application to MIR-Related Tasks. In: Raś, Z.W., Wieczorkowska, A.A. (eds) Advances in Music Information Retrieval. Studies in Computational Intelligence, vol 274. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-11674-2_10

DOI: https://doi.org/10.1007/978-3-642-11674-2_10
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-11673-5
Online ISBN: 978-3-642-11674-2
eBook Packages: Engineering, Engineering (R0)