Skip to main content

Modeling the Short Time Fourier Transform Ratio and Application to Underdetermined Audio Source Separation

  • Conference paper
Independent Component Analysis and Signal Separation (ICA 2009)

Part of the book series: Lecture Notes in Computer Science ((LNTCS,volume 5441))

Abstract

This paper presents the theoretical background for the Model Based Underdetermined Source Separation presented in [5]. We show that for a given frequency band, in contrast to customary assumption, the observed Short-Time Fourier Transform (STFT) ratio coming from one source is not constant in time, but is a random variable whose distribution we have obtained. Using this distribution and the Time-Frequency (TF) “disjoint” assumption of sources, we are able to obtain promising results in separating four audio sources from two microphones in a real reverberant room.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Balan, R., Rosca, J.: Statistical Properties of STFT Ratios for two Channel Systems and Applications to Blind Source Separation. In: Proc. ICA 2000, Helsinki, Findland, pp. 429–434 (June 2000)

    Google Scholar 

  2. Yilmaz, O., Rickard, S.: Blind separation of speech mixtures via time-frequency masking. IEEE Transactions on Signal Processing 52(7), 1830–1847 (2004)

    Article  MathSciNet  Google Scholar 

  3. Sawada, H., Araki, S., Makino, S.: Measuring Dependence of Bin-wise Separated Signals for Permutation Alignment in Frequency-domain BSS. In: ISCAS, New Orleans, USA (May 2007)

    Google Scholar 

  4. Araki, S., Sawada, H., Makino, S.: K-means Based Underdetermined Blind Speech Separation. In: Makino, S., Lee, T.-W., Sawada, H. (eds.) Blind Speech Separation, pp. 243–270. Springer, New-York (2007)

    Chapter  Google Scholar 

  5. El-Chami, Z., Pham, D.-T., Servière, C., Guérin, A.: A New-Model Based Underdetermined Source Separation. In: IWAENC, Seattle, USA (September 2008)

    Google Scholar 

  6. Signal Separation Evaluation Campaign (2008), http://sisec.wiki.irisa.fr/tiki-index.php

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2009 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Pham, DT., El-Chami, Z., Guérin, A., Servière, C. (2009). Modeling the Short Time Fourier Transform Ratio and Application to Underdetermined Audio Source Separation. In: Adali, T., Jutten, C., Romano, J.M.T., Barros, A.K. (eds) Independent Component Analysis and Signal Separation. ICA 2009. Lecture Notes in Computer Science, vol 5441. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-00599-2_13

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-00599-2_13

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-00598-5

  • Online ISBN: 978-3-642-00599-2

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics