Modeling the Short Time Fourier Transform Ratio and Application to Underdetermined Audio Source Separation

Pham, Dinh-Tuan; El-Chami, Zaher; Guérin, Alexandre; Servière, Christine

doi:10.1007/978-3-642-00599-2_13


Part of the book series: Lecture Notes in Computer Science (LNTCS, volume 5441)

Included in the following conference series:

International Conference on Independent Component Analysis and Signal Separation


Abstract

This paper presents the theoretical background for the Model Based Underdetermined Source Separation method presented in [5]. We show that, for a given frequency band and contrary to the customary assumption, the observed Short-Time Fourier Transform (STFT) ratio coming from a single source is not constant in time but is a random variable, whose distribution we obtain. Using this distribution together with the assumption that the sources are “disjoint” in the Time-Frequency (TF) domain, we obtain promising results in separating four audio sources from two microphones in a real reverberant room.
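The central claim, that the STFT ratio of a single source varies over time, can be illustrated with a small simulation. The sketch below is not the model or the distribution derived in the paper. It assumes a white-noise source, two hypothetical impulse responses made of exponentially decaying noise, a 512-sample Hann window, and an arbitrarily chosen frequency bin. The function names `stft` and `ratio_spread` are illustrative only. It measures how much the ratio of the two microphone STFTs scatters across frames when the filters are much shorter than the window and when they are much longer, as in a reverberant room.

```python
import numpy as np

def stft(x, win_len=512, hop=256):
    """Plain Hann-windowed STFT; returns an array of shape (frames, bins)."""
    window = np.hanning(win_len)
    n_frames = 1 + (len(x) - win_len) // hop
    frames = np.stack([x[i * hop:i * hop + win_len] * window
                       for i in range(n_frames)])
    return np.fft.rfft(frames, axis=1)

def ratio_spread(filter_len, n_samples=32000, bin_index=40, seed=0):
    """Relative scatter over time of the STFT ratio X2/X1 in one frequency bin.

    All signal and filter choices here are assumptions made for illustration.
    """
    rng = np.random.default_rng(seed)
    source = rng.standard_normal(n_samples)  # white-noise stand-in for an audio source
    # Hypothetical impulse responses: noise with an exponential decay.
    decay = np.exp(-np.arange(filter_len) / (filter_len / 4))
    h1 = rng.standard_normal(filter_len) * decay
    h2 = rng.standard_normal(filter_len) * decay
    # Two "microphone" signals, each the source filtered by its own response.
    x1 = np.convolve(source, h1)[:n_samples]
    x2 = np.convolve(source, h2)[:n_samples]
    X1 = stft(x1)[:, bin_index]
    X2 = stft(x2)[:, bin_index]
    # Discard frames where the denominator is nearly zero.
    keep = np.abs(X1) > 0.1 * np.median(np.abs(X1))
    ratio = X2[keep] / X1[keep]
    # Robust scatter: median distance to the median ratio, relative to the
    # typical ratio magnitude.
    center = np.median(ratio.real) + 1j * np.median(ratio.imag)
    return np.median(np.abs(ratio - center)) / np.median(np.abs(ratio))

if __name__ == "__main__":
    print("filters much shorter than the window:", ratio_spread(filter_len=8))
    print("filters much longer than the window: ", ratio_spread(filter_len=4096))
```

With filters far shorter than the analysis window, the ratio should stay close to a single complex value, which is the customary constant-ratio assumption. With filters longer than the window, the scatter should be much larger, which is the regime where a statistical description of the ratio becomes useful.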


References

[1] Balan, R., Rosca, J.: Statistical Properties of STFT Ratios for Two Channel Systems and Applications to Blind Source Separation. In: Proc. ICA 2000, Helsinki, Finland, pp. 429–434 (June 2000)
[2] Yilmaz, O., Rickard, S.: Blind separation of speech mixtures via time-frequency masking. IEEE Transactions on Signal Processing 52(7), 1830–1847 (2004)
[3] Sawada, H., Araki, S., Makino, S.: Measuring Dependence of Bin-wise Separated Signals for Permutation Alignment in Frequency-domain BSS. In: ISCAS, New Orleans, USA (May 2007)
[4] Araki, S., Sawada, H., Makino, S.: K-means Based Underdetermined Blind Speech Separation. In: Makino, S., Lee, T.-W., Sawada, H. (eds.) Blind Speech Separation, pp. 243–270. Springer, New York (2007)
[5] El-Chami, Z., Pham, D.-T., Servière, C., Guérin, A.: A New Model Based Underdetermined Source Separation. In: IWAENC, Seattle, USA (September 2008)
[6] Signal Separation Evaluation Campaign (2008), http://sisec.wiki.irisa.fr/tiki-index.php


Author information

Authors and Affiliations

Laboratory Jean Kuntzmann, CNRS - INPG - UJF Grenoble, France
Dinh-Tuan Pham
Orange Labs, Lannion, France
Zaher El-Chami & Alexandre Guérin
GIPSA-lab, CNRS - INPG Grenoble, France
Christine Servière


Editor information

Editors and Affiliations

Department of Computer Science and Electrical Engineering, ITE 324, University of Maryland, Baltimore County, 1000 Hilltop Circle, MD 21250, Baltimore, USA
Tülay Adali
Domaine Universitaire, GIPSA-lab, BP 46, 38402, Saint Martin d’Hères Cedex, France
Christian Jutten
Departamento de Microonda e Óptica (DMO), FEEC / Unicamp, Avenida Albert Einstein 400, 13083-852, Campinas, Sao Paulo, Brazil
João Marcos Travassos Romano
Centro Tecnológico, Curso de Engenharia Elétrica, Universidade Federal do Maranhão, Avenida dos Portugueses, s/n, Bacanga, 65080-040, São Luís, MA, Brazil
Allan Kardec Barros


About this paper

Cite this paper

Pham, DT., El-Chami, Z., Guérin, A., Servière, C. (2009). Modeling the Short Time Fourier Transform Ratio and Application to Underdetermined Audio Source Separation. In: Adali, T., Jutten, C., Romano, J.M.T., Barros, A.K. (eds) Independent Component Analysis and Signal Separation. ICA 2009. Lecture Notes in Computer Science, vol 5441. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-00599-2_13


DOI: https://doi.org/10.1007/978-3-642-00599-2_13
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-00598-5
Online ISBN: 978-3-642-00599-2
eBook Packages: Computer Science, Computer Science (R0)
