As a method to tackle blind source separation (BSS) in the frequency domain, we introduce independent vector analysis (IVA), an extension of independent component analysis (ICA) from univariate components to multivariate components. Given a mixture of statistically independent multivariate sources where the mixing is constrained to be component-wise, ICA needs to be followed by an additional algorithmic scheme in order to correct the permutation disorder that occurs after the component-wise separation, whereas IVA utilizes the inner dependency of the multivariate components and separates the fellow source components together. The efficiency of this new formulation in solving the permutation problem has been proven in its application to convolutive mixture of independent speech signals. Maximum likelihood (ML) approaches or information theoretic approaches have been employed where the time–frequency model of speech has been modelled by several multivariate joint densities, and natural gradient or Newton method algorithms have been derived. Here, we present a gentle tutorial on IVA for the separation of speech signals in the frequency domain.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
P. Smaragdis, “Blind separation of convolved mixtures in the frequency domain,” Neurocomputing, vol. 22, pp. 21-34, 1998.
L. Parra and C. Spence, “Convolutive blind separation of non-stationary sources,” IEEE Trans. on Speech and Audio Processing, vol. 8, no. 3, pp. 320-327, 2000.
J. Anemueller and B. Kollmeier, “Amplitude modulation decorrelation for convolutive blind source separation,” Proc. Int. Conf. on Independent Component Analysis and Blind Source Separation, pp. 215-220, 2000.
N. Murata, S. Ikeda, and A. Ziehe, “An approach to blind source separation based on temporal structure of speech signals,” Neurocomputing, vol. 41, pp. 1-24, 2001.
M. Z. Ikram and D. R. Morgan, “A beamforming approach to permutation alignment for multichannel frequency-domain blind speech separation,” Proc. IEEE Int. Conf. on Acoustics, Speech, and Signal Processing, pp. 881-884, 2002.
H. Sawada, R. Mukai , S. Araki, and S. Makino, “A robust and precise method for solving the permutation problem of frequency-domain blind source separation,” Proc. Int. Conf. on Independent Component Analysis and Blind Source Separation, pp. 505-510, 2003.
T. Kim, H. T. Attias, S.-Y. Lee, and T.-W. Lee, “Blind source separation exploiting higher-order frequency dependencies,” IEEE Trans. on Speech and Audio Processing, vol. 15, no. 1, pp. 70-79, 2007.
I. Lee, T. Kim, and T.-W. Lee, “Complex FastIVA: a robust maximum like-lihood approach of MICA for convolutive BSS,” Lecture Notes in Computer Science, vol. 3889, pp. 625-632, 2006.
A. Hiroe, “Solution of permutation problem in frequency domain ICA, using multivariate probability density functions,” Lecture Notes in Computer Science, vol. 3889, pp. 601-608, 2006.
M. Davies, “Audio Source Separation,” Mathematics in Signal Processing 5, Oxford University Press, pp. 57-68, 2002.
I. Lee, T. Kim, and T.-W. Lee, “Fast Fixed-Point Independent Vector Analysis Algorithms for Convolutive Blind Source Separation”, Signal Processing, vol. 87, no. 8, pp. 1859-1871, 2007.
A. J. Bell and T. J. Sejnowski, “An information maximization approach to blind separation and blind deconvolution,” Neural Computation, vol. 7, no. 6, pp. 1129-1159, 1995.
J.-F. Cardoso, “Infomax and maximum likelihood for source separation,” IEEE Signal Processing Letters, vol. 4, no. 4, 1997.
E. Bingham and A. Hyvärinen, “A fast fixed-point algorithm for independent componenet analysis of complex-valued signals,” Int. J. of Neural Systems, vol. 10, no. 1, pp. 1-8, 2000.
J.-F. Cardoso, “Entropic contrasts for source separation: Geometry and stability,” S. Haykin, Ed. Unsupervised Adaptive Filtering, vol. 1, pp. 139-189, John Wiley and Sons, 2000.
J.-F. Cardoso, “Multidimensional independent component analysis,” Proc. IEEE Int. Conf. on Acoustics, Speech, and Signal Processing, pp. 1941-1944, 1998.
A. Hyvärinen and P. O. Hoyer, “Emergence of phase and shift invariant features by decomposition of natural images into independent feature subspaces,” Neural Computation, vol. 12, no. 7, pp. 1705-1720, 2000.
D. H. Brandwood, “A complex gradient operator and its application in adaptive array theory,” IEE Proc. F and H, vol. 130, no. 1, pp. 11-16, 1983.
A. van den Bos, “Complex gradient and Hessian,” IEE Proceedings on Vision, Image and Signal Processing, vol. 141, pp. 380-382, 1994.
H. Brehm and W. Stammler, “Description and generation of spherically invariant speech-model signals,” Signal Processing, no. 12, pp. 119-141, 1987.
H. Buchner, R. Aichner, and W. Kellermann, “Blind source separation for convolutive mixtures: A unified treatment,” Audio Signal Processing for NextGeneration Multimedia Communication Systems, Y. Huang and J. Benesty Ed., Kluwer Academic Publishers, Boston, pp. 255-293, 2004.
I. Lee and T.-W. Lee, “On the Assumption of Spherical Symmetry and Sparseness for the Frequency-Domain Speech Model”, IEEE Trans. on Speech, Audio and Language Processing, vol. 15, no. 5, pp. 1521-1528, 2007.
G.-J. Jang, I. Lee, and T.-W. Lee, “Independent Vector Analysis using NonSpherical Joint Densities for the Separation of Speech Signals”, Proc. IEEE Int. Conf. on Acoustics, Speech, and Signal Processing, vol. 2, pp. 629-632, 2007.
S.-I. Amari, A. Cichocki, and H. H. Yang, “A new learning algorithm for blind signal separation,” Adv. Neural Information Processing Systems, vol. 8, pp. 757-763, 1996.
J.-F. Cardoso, “The invariant approach to source separation,” Proc. International Symposium on Nonlinear Theory and Applications (NOLTA), vol. 1, pp. 55-60, 1995.
A. Hyvärinen and E. Oja, “A Fast fixed-Point algorithm for independent component analysis,” Neural Computation, vol. 9, no. 7, pp. 1483-1492, 1997.
A. Hyvärinen, “Fast and Robust fixed-point algorithms for independent componenet analysis, IEEE Trans. on Neural Networks, vol. 10, no. 3, pp. 626-634, 1999.
E. G. Learned-Miller and J. W. Fisher III, “ICA using spacings estimates of entropy,” J. of Machine Learning Research, vol. 4, pp. 1271-1295, 2003.
M. Z. Ikram and D. R. Morgan, “Exploring permutation inconsistency in blind separation of signals in a reverberant environment,” Proc. IEEE Int. Conf. on Acoustics, Speech, and Signal Processing, pp. 1041-1044, 2000.
K. Matsuoka and S. Nakashima, “Minimal distortion principle for blind source separation,” Proc. Int. Conf. on Independent Component Analysis and Blind Source Separation, pp. 722-727, 2001.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2007 Springer
About this chapter
Cite this chapter
Lee, I., Kim, T., Lee, TW. (2007). Independent Vector Analysis for Convolutive Blind Speech Separation. In: Makino, S., Sawada, H., Lee, TW. (eds) Blind Speech Separation. Signals and Communication Technology. Springer, Dordrecht. https://doi.org/10.1007/978-1-4020-6479-1_6
Download citation
DOI: https://doi.org/10.1007/978-1-4020-6479-1_6
Publisher Name: Springer, Dordrecht
Print ISBN: 978-1-4020-6478-4
Online ISBN: 978-1-4020-6479-1
eBook Packages: EngineeringEngineering (R0)