Monaural Music Source Separation: Nonnegativity, Sparseness, and Shift-Invariance

Kim, Minje; Choi, Seungjin

doi:10.1007/11679363_77

Minje Kim²⁰ &
Seungjin Choi²⁰

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 3889))

Included in the following conference series:

International Conference on Independent Component Analysis and Signal Separation

3017 Accesses
15 Citations

Abstract

In this paper we present a method for polyphonic music source separation from their monaural mixture, where the underlying assumption is that the harmonic structure of a musical instrument remains roughly the same even if it is played at various pitches and is recorded in various mixing environments. We incorporate with nonnegativity, shift-invariance, and sparseness to select representative spectral basis vectors that are used to restore music sources from their monaural mixture. Experimental results with monaural instantaneous mixture of voice/cello and monaural convolutive mixture of saxophone/viola, are shown to confirm the validity of our proposed method.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Lee, D.D., Seung, H.S.: Learning the parts of objects by non-negative matrix factorization. Nature 401, 788–791 (1999)
Article Google Scholar
Smaragdis, P.: Non-negative matrix factor deconvolution: Extraction of multiple sound sources from monophonic inputs. In: Proc. Int’l Conf. Independent Component Analysis and Blind Signal Separation, Granada, Spain, pp. 494–499 (2004)
Google Scholar
Plumbley, M.D., Abdallah, S.A., Bello, J.P., Davies, M.E., Monti, G., Sandler, M.B.: Automatic transcription and audio source separation. Cybernetics and Systems, 603–627 (2002)
Google Scholar
Smaragdis, P., Brown, J.C.: Non-negative matrix factorization for polyphonic music transcription. In: Proc. IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, New Paltz, NY, pp. 177–180 (2003)
Google Scholar
Abdallah, S.A., Plumbley, M.D.: Polyphonic music transcription by non-negative sparse coding of power spectra. In: Proc. Int’l Conf. Music Information Retrieval, Barcelona, Spain, pp. 318–325 (2004)
Google Scholar
Helén, M., Virtanin, T.: Separation of drums from polyphonic music using nonnegative matrix factorization and support vector machine. In: Proc. European Signal Processing Conference, Antalaya, Turkey (2005)
Google Scholar
Cho, Y.C., Choi, S.: Nonnegative features of spectro-temporal sounds for classfication. Pattern Recognition Letters 26, 1327–1336 (2005)
Article Google Scholar
Eggert, J., Wersing, H., Körner, E.: Transformation-invariant representation and NMF. In: Proc. Int’l Joint Conf. Neural Networks (2004)
Google Scholar
Kim, M., Choi, S.: On spectral basis selection for single channel polyphonic music separation. In: Duch, W., Kacprzyk, J., Oja, E., Zadrożny, S. (eds.) ICANN 2005. LNCS, vol. 3697, pp. 157–162. Springer, Heidelberg (2005)
Google Scholar
FitzGerald, D., Cranitch, M., Coyle, E.: Generalised prior subspace analysis for polyphonic pitch transcription. In: Proc. Int’l Conf. Digital Audio Effects (2005)
Google Scholar
Hoyer, P.O.: Non-negative matrix factorization with sparseness constraints. Journal of Machine Learning Research 5, 1457–1469 (2004)
MathSciNet Google Scholar
Ru, P., Chi, T., Shamma, S.: NSL Toolbox (1997)
Google Scholar

Download references

Author information

Authors and Affiliations

Department of Computer Science, Pohang University of Science and Technology, San 31 Hyoja-dong, Nam-gu, Pohang, 790-784, Korea
Minje Kim & Seungjin Choi

Authors

Minje Kim
View author publications
You can also search for this author in PubMed Google Scholar
Seungjin Choi
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Siemens Corporate Research, 755 College Road East, 08540, Princeton, NJ, USA
Justinian Rosca
Department of CSEE, Oregon Health and Science University, Portland, Oregon, USA
Deniz Erdogmus
Dep. of Electrical and Computer Engineering, University of Florida, Gainesville, Florida, USA
José C. Príncipe
McMaster University, 1280 Main Street West, L8S 4K1, Hamilton, Ontario, Canada
Simon Haykin

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Kim, M., Choi, S. (2006). Monaural Music Source Separation: Nonnegativity, Sparseness, and Shift-Invariance. In: Rosca, J., Erdogmus, D., Príncipe, J.C., Haykin, S. (eds) Independent Component Analysis and Blind Signal Separation. ICA 2006. Lecture Notes in Computer Science, vol 3889. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11679363_77

Download citation

DOI: https://doi.org/10.1007/11679363_77
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-32630-4
Online ISBN: 978-3-540-32631-1
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics