Multichannel Speech Separation Using Adaptive Parameterization of Source PDFs

Kokkinakis, Kostas; Nandi, Asoke K.

doi:10.1007/978-3-540-30110-3_62


Part of the book series: Lecture Notes in Computer Science (LNCS, volume 3195)

Included in the following conference series:

International Conference on Independent Component Analysis and Signal Separation


Abstract

Convolutive and temporally correlated mixtures of speech are tackled with a temporal pre-whitening stage based on linear prediction (LP), combined with the natural gradient algorithm (NGA), which essentially performs spatial separation by maximizing entropy at the output of a nonlinear function. In the past, speech sources have been parameterized by the generalized Gaussian density (GGD) model, in which the exponent parameter directly relates to the exponent of the corresponding optimal nonlinear function. In this paper, we present an adaptive, source-dependent estimation of this parameter, controlled exclusively by the statistics of the output source estimates. Comparative experimental results illustrate the inherent flexibility of the proposed method, as well as an overall increase in convergence speed and separation performance over existing approaches.
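To make the link between the GGD exponent and the nonlinearity concrete, the following is a minimal sketch and not the authors' estimator, which is not given on this page. It assumes a standard moment-matching estimate of the exponent: the ratio (E|x|)^2 / E[x^2] of a zero-mean GGD depends only on the exponent, so it can be inverted by bisection. The function names `ggd_ratio`, `estimate_exponent` and `score`, the search interval, and the `eps` guard are all illustrative assumptions.

```python
import math
import random


def ggd_ratio(beta):
    """Return (E|x|)^2 / E[x^2] for a zero-mean GGD with exponent beta."""
    lg = math.lgamma  # log-gamma keeps the ratio stable for small beta
    return math.exp(2.0 * lg(2.0 / beta) - lg(1.0 / beta) - lg(3.0 / beta))


def estimate_exponent(samples, lo=0.1, hi=10.0, iters=60):
    """Moment-matching estimate of the GGD exponent (assumed estimator)."""
    n = len(samples)
    m1 = sum(abs(x) for x in samples) / n   # first absolute moment
    m2 = sum(x * x for x in samples) / n    # second moment
    target = (m1 * m1) / m2
    # ggd_ratio increases with beta, so bisection on [lo, hi] finds the match.
    for _ in range(iters):
        mid = 0.5 * (lo + hi)
        if ggd_ratio(mid) < target:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)


def score(u, beta, eps=1e-8):
    """GGD score function up to a positive scale: sign(u) * |u|^(beta - 1)."""
    # eps avoids a division by zero at u = 0 when beta < 1.
    return math.copysign((abs(u) + eps) ** (beta - 1.0), u)


if __name__ == "__main__":
    random.seed(0)
    # Laplacian samples (GGD with exponent 1) as a stand-in for a speech-like source.
    laplacian = [random.choice((-1, 1)) * random.expovariate(1.0)
                 for _ in range(20000)]
    beta = estimate_exponent(laplacian)
    print(beta, score(0.5, beta))
```

In an adaptive scheme of the kind the abstract describes, an estimate like this would be recomputed from each output source estimate and the resulting exponent fed into the nonlinearity of the separation update. How the paper actually performs that update is not stated here.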



Author information

Authors and Affiliations

Signal Processing and Communications Group, Department of Electrical Engineering and Electronics, The University of Liverpool, Brownlow Hill, Liverpool, L69 3GJ, UK
Kostas Kokkinakis & Asoke K. Nandi


Editor information

Editors and Affiliations

Dept. of Architecture and Computer Technology, University of Granada, Spain
Carlos G. Puntonet & Alberto Prieto


About this paper

Cite this paper

Kokkinakis, K., Nandi, A.K. (2004). Multichannel Speech Separation Using Adaptive Parameterization of Source PDFs. In: Puntonet, C.G., Prieto, A. (eds) Independent Component Analysis and Blind Signal Separation. ICA 2004. Lecture Notes in Computer Science, vol 3195. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-30110-3_62


DOI: https://doi.org/10.1007/978-3-540-30110-3_62
Published: 27 October 2004
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-23056-4
Online ISBN: 978-3-540-30110-3
eBook Packages: Springer Book Archive
