Estimating the Spatial Position of Spectral Components in Audio

Parry, R. Mitchell; Essa, Irfan

doi:10.1007/11679363_83

R. Mitchell Parry²⁰ &
Irfan Essa²⁰

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 3889))

Included in the following conference series:

International Conference on Independent Component Analysis and Signal Separation

2213 Accesses
17 Citations

Abstract

One way of separating sources from a single mixture recording is by extracting spectral components and then combining them to form estimates of the sources. The grouping process remains a difficult problem. We propose, for instances when multiple mixture signals are available, clustering the components based on their relative contribution to each mixture (i.e., their spatial position). We introduce novel factorizations of magnitude spectrograms from multiple recordings and derive update rules that extend independent subspace analysis and non-negative matrix factorization to concurrently estimate the spectral shape, time envelope and spatial position of each component. We show that estimated component positions are near the position of their corresponding source, and that multichannel non-negative matrix factorization can distinguish three pianos by their position in the mixture.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Hyvärinen, A.: Independent Component Analysis. Wiley, New York (2001)
Book Google Scholar
Casey, M., Westner, W.: Separation of mixed audio sources by independent subspace analysis. In: Proceedings of the International Computer Music Conference, Berlin (2000)
Google Scholar
Smaragdis, P.: Redundancy Reduction for Computational Audition, a Unifying Approach. PhD thesis, MAS Department, Massachusetts Institute of Technology (2001)
Google Scholar
FitzGerald, D., Coyle, E., Laylor, B.: Sub-band independent subspace analysis for drum transcription. In: Proceedings of International Conference on Digital Audio Effects, Hamburg, Germany, pp. 65–69 (2002)
Google Scholar
Brown, J.C., Smaragdis, P.: Independent component analysis for automatic note extraction from musical trills. Journal of the Acoustical Society of America 115(5), 2295–2306 (2004)
Article Google Scholar
Abdallah, S.A., Plumbley, M.D.: Polyphonic transcription by non-negative sparse coding of power spectra. In: Proceedings of the International Conference on Music Information Retrieval, Barcelona, Spain, pp. 318–325 (2004)
Google Scholar
Smaragdis, P., Brown, J.C.: Non-negative matrix factorization for polyphonic music transcription. In: Workshop on Applications of Signal Processing to Audio and Acoustics, New Paltz, NY, pp. 177–180 (2003)
Google Scholar
FitzGerald, D., Cranitch, M., Coyle, E.: Non-negative tensor factorisation for sound source separation. In: Proceedings of Irish Signals and Systems Conference, Dublin, Ireland (2005)
Google Scholar
Lee, D.D., Seung, H.S.: Algorithms for non-negative matrix factorization. In: Advances in Neural Information Processing Systems 13, pp. 556–562. MIT Press, Cambridge (2001)
Google Scholar
Bell, A., Sejnowski, T.J.: An information-maximization approach to blind separation and blind deconvolution. Neural Computation 7, 1129–1159 (1995)
Article Google Scholar
Stone, J.V., Porrill, J.: Undercomplete independent component analysis for signal separation and dimension reduction. Technical report, Department of Psychology, University of Sheffield, Sheffield, England (1997)
Google Scholar
Trefethen, L.N., Bau, D.B.: Numerical Linear Algebra. SIAM, Philadelphia (1997)
Book MATH Google Scholar

Download references

Author information

Authors and Affiliations

College of Computing / GVU Center, Georgia Institute of Technology, 85 5th Street, NW, Atlanta, Georgia, USA
R. Mitchell Parry & Irfan Essa

Authors

R. Mitchell Parry
View author publications
You can also search for this author in PubMed Google Scholar
Irfan Essa
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Siemens Corporate Research, 755 College Road East, 08540, Princeton, NJ, USA
Justinian Rosca
Department of CSEE, Oregon Health and Science University, Portland, Oregon, USA
Deniz Erdogmus
Dep. of Electrical and Computer Engineering, University of Florida, Gainesville, Florida, USA
José C. Príncipe
McMaster University, 1280 Main Street West, L8S 4K1, Hamilton, Ontario, Canada
Simon Haykin

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Parry, R.M., Essa, I. (2006). Estimating the Spatial Position of Spectral Components in Audio. In: Rosca, J., Erdogmus, D., Príncipe, J.C., Haykin, S. (eds) Independent Component Analysis and Blind Signal Separation. ICA 2006. Lecture Notes in Computer Science, vol 3889. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11679363_83

Download citation

DOI: https://doi.org/10.1007/11679363_83
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-32630-4
Online ISBN: 978-3-540-32631-1
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics