Skip to main content

Estimating the Spatial Position of Spectral Components in Audio

  • Conference paper
Independent Component Analysis and Blind Signal Separation (ICA 2006)

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 3889))

Abstract

One way of separating sources from a single mixture recording is by extracting spectral components and then combining them to form estimates of the sources. The grouping process remains a difficult problem. We propose, for instances when multiple mixture signals are available, clustering the components based on their relative contribution to each mixture (i.e., their spatial position). We introduce novel factorizations of magnitude spectrograms from multiple recordings and derive update rules that extend independent subspace analysis and non-negative matrix factorization to concurrently estimate the spectral shape, time envelope and spatial position of each component. We show that estimated component positions are near the position of their corresponding source, and that multichannel non-negative matrix factorization can distinguish three pianos by their position in the mixture.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Hyvärinen, A.: Independent Component Analysis. Wiley, New York (2001)

    Book  Google Scholar 

  2. Casey, M., Westner, W.: Separation of mixed audio sources by independent subspace analysis. In: Proceedings of the International Computer Music Conference, Berlin (2000)

    Google Scholar 

  3. Smaragdis, P.: Redundancy Reduction for Computational Audition, a Unifying Approach. PhD thesis, MAS Department, Massachusetts Institute of Technology (2001)

    Google Scholar 

  4. FitzGerald, D., Coyle, E., Laylor, B.: Sub-band independent subspace analysis for drum transcription. In: Proceedings of International Conference on Digital Audio Effects, Hamburg, Germany, pp. 65–69 (2002)

    Google Scholar 

  5. Brown, J.C., Smaragdis, P.: Independent component analysis for automatic note extraction from musical trills. Journal of the Acoustical Society of America 115(5), 2295–2306 (2004)

    Article  Google Scholar 

  6. Abdallah, S.A., Plumbley, M.D.: Polyphonic transcription by non-negative sparse coding of power spectra. In: Proceedings of the International Conference on Music Information Retrieval, Barcelona, Spain, pp. 318–325 (2004)

    Google Scholar 

  7. Smaragdis, P., Brown, J.C.: Non-negative matrix factorization for polyphonic music transcription. In: Workshop on Applications of Signal Processing to Audio and Acoustics, New Paltz, NY, pp. 177–180 (2003)

    Google Scholar 

  8. FitzGerald, D., Cranitch, M., Coyle, E.: Non-negative tensor factorisation for sound source separation. In: Proceedings of Irish Signals and Systems Conference, Dublin, Ireland (2005)

    Google Scholar 

  9. Lee, D.D., Seung, H.S.: Algorithms for non-negative matrix factorization. In: Advances in Neural Information Processing Systems 13, pp. 556–562. MIT Press, Cambridge (2001)

    Google Scholar 

  10. Bell, A., Sejnowski, T.J.: An information-maximization approach to blind separation and blind deconvolution. Neural Computation 7, 1129–1159 (1995)

    Article  Google Scholar 

  11. Stone, J.V., Porrill, J.: Undercomplete independent component analysis for signal separation and dimension reduction. Technical report, Department of Psychology, University of Sheffield, Sheffield, England (1997)

    Google Scholar 

  12. Trefethen, L.N., Bau, D.B.: Numerical Linear Algebra. SIAM, Philadelphia (1997)

    Book  MATH  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2006 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Parry, R.M., Essa, I. (2006). Estimating the Spatial Position of Spectral Components in Audio. In: Rosca, J., Erdogmus, D., Príncipe, J.C., Haykin, S. (eds) Independent Component Analysis and Blind Signal Separation. ICA 2006. Lecture Notes in Computer Science, vol 3889. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11679363_83

Download citation

  • DOI: https://doi.org/10.1007/11679363_83

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-32630-4

  • Online ISBN: 978-3-540-32631-1

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics