Abstract
One way of separating sources from a single mixture recording is by extracting spectral components and then combining them to form estimates of the sources. The grouping process remains a difficult problem. We propose, for instances when multiple mixture signals are available, clustering the components based on their relative contribution to each mixture (i.e., their spatial position). We introduce novel factorizations of magnitude spectrograms from multiple recordings and derive update rules that extend independent subspace analysis and non-negative matrix factorization to concurrently estimate the spectral shape, time envelope and spatial position of each component. We show that estimated component positions are near the position of their corresponding source, and that multichannel non-negative matrix factorization can distinguish three pianos by their position in the mixture.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Hyvärinen, A.: Independent Component Analysis. Wiley, New York (2001)
Casey, M., Westner, W.: Separation of mixed audio sources by independent subspace analysis. In: Proceedings of the International Computer Music Conference, Berlin (2000)
Smaragdis, P.: Redundancy Reduction for Computational Audition, a Unifying Approach. PhD thesis, MAS Department, Massachusetts Institute of Technology (2001)
FitzGerald, D., Coyle, E., Laylor, B.: Sub-band independent subspace analysis for drum transcription. In: Proceedings of International Conference on Digital Audio Effects, Hamburg, Germany, pp. 65–69 (2002)
Brown, J.C., Smaragdis, P.: Independent component analysis for automatic note extraction from musical trills. Journal of the Acoustical Society of America 115(5), 2295–2306 (2004)
Abdallah, S.A., Plumbley, M.D.: Polyphonic transcription by non-negative sparse coding of power spectra. In: Proceedings of the International Conference on Music Information Retrieval, Barcelona, Spain, pp. 318–325 (2004)
Smaragdis, P., Brown, J.C.: Non-negative matrix factorization for polyphonic music transcription. In: Workshop on Applications of Signal Processing to Audio and Acoustics, New Paltz, NY, pp. 177–180 (2003)
FitzGerald, D., Cranitch, M., Coyle, E.: Non-negative tensor factorisation for sound source separation. In: Proceedings of Irish Signals and Systems Conference, Dublin, Ireland (2005)
Lee, D.D., Seung, H.S.: Algorithms for non-negative matrix factorization. In: Advances in Neural Information Processing Systems 13, pp. 556–562. MIT Press, Cambridge (2001)
Bell, A., Sejnowski, T.J.: An information-maximization approach to blind separation and blind deconvolution. Neural Computation 7, 1129–1159 (1995)
Stone, J.V., Porrill, J.: Undercomplete independent component analysis for signal separation and dimension reduction. Technical report, Department of Psychology, University of Sheffield, Sheffield, England (1997)
Trefethen, L.N., Bau, D.B.: Numerical Linear Algebra. SIAM, Philadelphia (1997)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2006 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Parry, R.M., Essa, I. (2006). Estimating the Spatial Position of Spectral Components in Audio. In: Rosca, J., Erdogmus, D., PrÃncipe, J.C., Haykin, S. (eds) Independent Component Analysis and Blind Signal Separation. ICA 2006. Lecture Notes in Computer Science, vol 3889. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11679363_83
Download citation
DOI: https://doi.org/10.1007/11679363_83
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-32630-4
Online ISBN: 978-3-540-32631-1
eBook Packages: Computer ScienceComputer Science (R0)