Abstract
In this chapter, we present a simple and fast method for separating a monaural audio signal into harmonic and percussive components, which serves as a useful pre-processing step for MIR-related tasks. Exploiting the anisotropy of the power spectrograms of harmonic and percussive components (harmonic sounds tend to be smooth along the time axis, percussive sounds along the frequency axis), we define objective functions based on spectrogram gradients. Applying the auxiliary function approach to these objectives, we derive simple and fast update equations that guarantee a decrease of the objective function at each iteration. We show experimental results for sound separation on popular and jazz music pieces, and present applications of the proposed technique to automatic chord recognition and rhythm-pattern extraction.
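To make the idea of gradient-based separation concrete, the following is a minimal sketch in Python with NumPy. It is not the chapter's derived update equations, which the abstract does not state. It assumes a simplified diffusion-style iteration: the harmonic estimate `H` is smoothed along time, the percussive estimate `P` is smoothed along frequency, and the constraint `H + P = W` is kept at every step. The function name `separate_hp` and the parameters `n_iter` and `alpha` are hypothetical choices made for this illustration.

```python
import numpy as np


def separate_hp(W, n_iter=50, alpha=0.5):
    """Split a non-negative spectrogram W (freq x time) into H and P.

    Illustrative assumption, not the chapter's exact update rule:
    H is pushed to be smooth along time (axis 1), P to be smooth
    along frequency (axis 0), while H + P == W holds throughout.
    """
    W = np.asarray(W, dtype=float)
    H = 0.5 * W          # start from an even split
    P = W - H
    for _ in range(n_iter):
        # Second difference of H along time, with replicated edges.
        Hp = np.pad(H, ((0, 0), (1, 1)), mode="edge")
        lap_H = Hp[:, :-2] - 2.0 * H + Hp[:, 2:]
        # Second difference of P along frequency, with replicated edges.
        Pp = np.pad(P, ((1, 1), (0, 0)), mode="edge")
        lap_P = Pp[:-2, :] - 2.0 * P + Pp[2:, :]
        # Move energy toward H where H is temporally smooth,
        # and toward P where P is spectrally smooth.
        delta = alpha * lap_H / 4.0 - (1.0 - alpha) * lap_P / 4.0
        # Keep both components non-negative and summing to W.
        H = np.clip(H + delta, 0.0, W)
        P = W - H
    return H, P
```

On a toy spectrogram containing one horizontal ridge (a sustained tone) and one vertical ridge (a click), this sketch assigns most of the horizontal ridge to `H` and most of the vertical ridge to `P`. The parameter `alpha` balances the two smoothness terms; 0.5 treats them equally.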
Copyright information
© 2010 Springer-Verlag Berlin Heidelberg
About this chapter
Cite this chapter
Ono, N. et al. (2010). Harmonic and Percussive Sound Separation and Its Application to MIR-Related Tasks. In: Raś, Z.W., Wieczorkowska, A.A. (eds) Advances in Music Information Retrieval. Studies in Computational Intelligence, vol 274. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-11674-2_10
DOI: https://doi.org/10.1007/978-3-642-11674-2_10
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-11673-5
Online ISBN: 978-3-642-11674-2
eBook Packages: Engineering, Engineering (R0)